Open Issues Need Help
View All on GitHubAI Summary: Enhance the EDGAR-CRAWLER tool to better extract financial statements from 10-K reports. Specifically, the tool needs to identify and extract financial statements located outside of the standard Item 8 or Item 16 sections, often appearing after an "INDEX TO * STATEMENTS" header near the end of the document. A new "financial_statement" item should be added to the JSON output to contain this extracted information.
The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice & clean structured JSON files. Presented at WWW 2025 @ Sydney, Australia (https://dl.acm.org/doi/10.1145/3701716.3715289)