Stage 2: extraction

uv run python -m examples.medlit.scripts.extract \
  --input-dir pmc_xmls --output-dir extracted \
  --vocab-file vocab/vocab.json --papers PMC12345.xml,...
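
The `--papers` value in the command above is a comma-separated list of file names. If you want to build that list rather than type it by hand, the following is a minimal sketch. It assumes the paper files sit directly in the `--input-dir` directory and are named `PMC*.xml`. The helper `build_papers_arg` is hypothetical and not part of the medlit scripts.

```python
from pathlib import Path


def build_papers_arg(input_dir: str, pattern: str = "PMC*.xml") -> str:
    """Return a comma-separated, sorted list of file names in input_dir.

    Hypothetical helper: the directory layout and the file-name pattern
    are assumptions, not something the extract script documents.
    """
    names = sorted(
        path.name
        for path in Path(input_dir).glob(pattern)
        if path.is_file()  # skip directories that happen to match
    )
    return ",".join(names)


if __name__ == "__main__":
    # Print the value so it can be pasted after --papers.
    print(build_papers_arg("pmc_xmls"))
```

The names are sorted so that repeated runs produce the same argument regardless of the order in which the file system lists the files.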