Deterministic PDF/DOCX parser for RAG/LLMs — Rust core with byte-identical Python/Node/WASM bindings

1 stars 0 forks 1 watchers Rust Apache License 2.0
deterministic document-parsing docx llm nodejs ocr pdf pdf-parser pdfplumber-alternative pymupdf-alternative python rag rust text-extraction wasm
3 Open Issues Need Help Last updated: Jul 3, 2026

Open Issues Need Help

View All on GitHub
good first issue

Deterministic PDF/DOCX parser for RAG/LLMs — Rust core with byte-identical Python/Node/WASM bindings

Rust
#deterministic#document-parsing#docx#llm#nodejs#ocr#pdf#pdf-parser#pdfplumber-alternative#pymupdf-alternative#python#rag#rust#text-extraction#wasm
good first issue

Deterministic PDF/DOCX parser for RAG/LLMs — Rust core with byte-identical Python/Node/WASM bindings

Rust
#deterministic#document-parsing#docx#llm#nodejs#ocr#pdf#pdf-parser#pdfplumber-alternative#pymupdf-alternative#python#rag#rust#text-extraction#wasm

Deterministic PDF/DOCX parser for RAG/LLMs — Rust core with byte-identical Python/Node/WASM bindings

Rust
#deterministic#document-parsing#docx#llm#nodejs#ocr#pdf#pdf-parser#pdfplumber-alternative#pymupdf-alternative#python#rag#rust#text-extraction#wasm