Tim's Ears - Music and sound reasoning data that lets LLMs hear audio via tokens and data files generated by processing any type of file

2 open issues need help. Last updated: Oct 9, 2025

Python

AI Summary: The `neural_audio_tokenizer.py` script needs a full review and refactor to make its code and logic consistent. A key change is improving the optional Encodec sections so they produce more diverse tokens, borrowing `mert`'s more effective strategies rather than relying on simple jitter. The issue also calls for modernizing the command-line arguments to improve ergonomics. Completing these tasks would clear the way for a new release.
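
The summary does not say how token diversity would be judged. As a rough illustration only, the sketch below scores a list of integer token ids by the share of distinct ids and by normalized Shannon entropy. The function name `token_diversity` is hypothetical and is not taken from `neural_audio_tokenizer.py`; the entropy is normalized by the number of distinct ids observed, not by the codebook size, which is an assumption made for simplicity.

```python
import math
from collections import Counter


def token_diversity(tokens):
    """Return (unique_ratio, normalized_entropy) for a sequence of token ids.

    Hypothetical helper for illustration; not part of the project's code.
    """
    if not tokens:
        return 0.0, 0.0

    counts = Counter(tokens)
    total = len(tokens)

    # Share of positions that hold a distinct id (1.0 means no repeats).
    unique_ratio = len(counts) / total

    # A sequence with a single distinct id has zero entropy by definition.
    if len(counts) == 1:
        return unique_ratio, 0.0

    # Shannon entropy in bits, scaled to [0, 1] by the maximum possible
    # entropy for the number of distinct ids actually observed.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())
    return unique_ratio, entropy / math.log2(len(counts))


if __name__ == "__main__":
    # Compare a collapsed stream against a more varied one.
    print(token_diversity([7, 7, 7, 7]))
    print(token_diversity([1, 2, 3, 4]))
```

Comparing these two numbers before and after a change to the Encodec path would be one way to check whether a new strategy actually beats plain jitter.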

Complexity: 4/5
Labels: bug, enhancement, good first issue, help wanted