Open Issues Need Help
View All on GitHub Add RTF parser about 2 hours ago
good first issue
Document parsing that never loses provenance: Markdown + JSON output where every block knows its source page, section, and location.
Python
#ai-agents#citations#document-parsing#docx#llm#markdown#pdf#provenance#python#rag
Add Docling as a benchmark baseline about 2 hours ago
good first issue
Document parsing that never loses provenance: Markdown + JSON output where every block knows its source page, section, and location.
Python
#ai-agents#citations#document-parsing#docx#llm#markdown#pdf#provenance#python#rag
Grow the golden corpus with real-world documents about 2 hours ago
good first issue
Document parsing that never loses provenance: Markdown + JSON output where every block knows its source page, section, and location.
Python
#ai-agents#citations#document-parsing#docx#llm#markdown#pdf#provenance#python#rag
Add EPUB parser about 2 hours ago
good first issue
Document parsing that never loses provenance: Markdown + JSON output where every block knows its source page, section, and location.
Python
#ai-agents#citations#document-parsing#docx#llm#markdown#pdf#provenance#python#rag