Open Issues Need Help
e2e tests code redundancy 18 days ago
AI Summary: This GitHub issue identifies code redundancy within the `tests/e2e_tests` directory, specifically noting duplicated code in scripts generated for various task types. The proposed solution is to refactor these redundant code blocks into a common, shared method to enhance maintainability and reduce repetition.
Complexity: 2/5
good first issue
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
Python
#ai4science #data-generation #data-synthesis #knowledge-graph #llama-factory #llm #llm-training #pretrain #pretraining #qa #question-answering #qwen #sft #sft-data #xtuner
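The refactoring that the e2e tests issue summary describes (moving code duplicated across per-task test scripts into one shared method) might look roughly like the sketch below. It is only an illustration under stated assumptions: `TASK_TYPES`, `run_e2e_case`, `fake_generate`, and the `<task_type>.json` output layout are all hypothetical names and do not come from the GraphGen repository.

```python
import json
import tempfile
from pathlib import Path
from typing import Callable

# Hypothetical task types; the real list lives in the repository.
TASK_TYPES = ["task_a", "task_b"]


def run_e2e_case(task_type: str, generate: Callable[[str, Path], None]) -> list[dict]:
    """Shared body for every e2e test: run one task type and validate its output.

    `generate` stands in for whatever entry point the per-task scripts call.
    In this sketch it is assumed to write a JSON list to
    `<output_dir>/<task_type>.json`.
    """
    with tempfile.TemporaryDirectory() as tmp:
        output_dir = Path(tmp)
        generate(task_type, output_dir)

        # Checks that would otherwise be copy-pasted into each script.
        output_file = output_dir / f"{task_type}.json"
        assert output_file.exists(), f"no output written for {task_type}"

        records = json.loads(output_file.read_text(encoding="utf-8"))
        assert isinstance(records, list) and records, f"empty output for {task_type}"
        return records


def fake_generate(task_type: str, output_dir: Path) -> None:
    """Stand-in generator so the sketch runs without the real pipeline."""
    payload = [{"task": task_type, "question": "q", "answer": "a"}]
    (output_dir / f"{task_type}.json").write_text(json.dumps(payload), encoding="utf-8")


if __name__ == "__main__":
    # Each former per-task script shrinks to a single call to the helper.
    for task in TASK_TYPES:
        run_e2e_case(task, fake_generate)
```

With a helper of this shape, adding a new task type means adding one entry to the list rather than another copy of the script; a test runner's parametrization feature (for example `pytest.mark.parametrize`) could drive the same loop.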
Quiz and Judge. Probs normalization. about 1 month ago
enhancement, good first issue
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
Python
#ai4science #data-generation #data-synthesis #knowledge-graph #llama-factory #llm #llm-training #pretrain #pretraining #qa #question-answering #qwen #sft #sft-data #xtuner