Extract structured menu information from images into JSON using a fine-tuned end-to-end (E2E) model or LLM.

donut fine-tuning image-to-text transformer
1 open issue needing help. Last updated: Jun 27, 2025

Open Issues Need Help

AI Summary: The issue asks for more menu image datasets to improve the fine-tuned E2E model or LLM that extracts structured menu information from images. The work involves gathering diverse menu images and, where needed, annotating them for training.
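
As a rough illustration of what annotating a menu image for training could look like, the sketch below flattens a JSON annotation into a tagged target string, loosely modeled on the tagged-sequence format Donut-style models are trained to emit. The schema (`menu`, `name`, `price`) and the function name `json_to_token_sequence` are hypothetical and not taken from this project.

```python
def json_to_token_sequence(obj):
    """Flatten a nested annotation into a tagged string (illustrative only)."""
    if isinstance(obj, dict):
        # Wrap each field value in <s_key>...</s_key> tags, in insertion order.
        return "".join(
            f"<s_{key}>{json_to_token_sequence(value)}</s_{key}>"
            for key, value in obj.items()
        )
    if isinstance(obj, list):
        # Separate repeated items (e.g. menu entries) with a <sep/> marker.
        return "<sep/>".join(json_to_token_sequence(item) for item in obj)
    # Leaf values are emitted as plain text.
    return str(obj)


# Hypothetical annotation for one menu image; the field names are assumptions.
annotation = {
    "menu": [
        {"name": "Latte", "price": "4.50"},
        {"name": "Bagel", "price": "3.00"},
    ]
}

target = json_to_token_sequence(annotation)
print(target)
```

Each annotated image would then be paired with one such target string. A real pipeline would also need to register the tag names as special tokens in the tokenizer, which this sketch leaves out.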

Complexity: 4/5
enhancement help wanted

Python