Create large-scale synthetic training data for model distillation and evaluation

445 stars 32 forks 445 watchers Python Apache License 2.0
agents ai data-science dataset distillation evaluation fine-tuning huggingface huggingface-datasets kaggle machine-learning synthetic-data unsloth
1 Open Issue Need Help Last updated: Sep 15, 2025

Open Issues Need Help

View All on GitHub
Push to Kaggle Support about 2 months ago
good first issue

Create large-scale synthetic training data for model distillation and evaluation

Python
#agents#ai#data-science#dataset#distillation#evaluation#fine-tuning#huggingface#huggingface-datasets#kaggle#machine-learning#synthetic-data#unsloth