gptenv/neural-audio-tokenizer

Tim's Ears - Music and sound reasoning data for LLMs to hear audio via tokens and data files generated from processing any type of file

4 stars 1 fork 4 watchers Python MIT License

4 Open Issues Need Help Last updated: Oct 9, 2025

The command-line verbose flag's behaviour needs improvement and adjustment, as do some other items in the program. 9 months ago

bug documentation enhancement good first issue help wanted

A few more major and minor issues to fix and changes to make before we tag and release as v0.1.7 9 months ago

bug enhancement help wanted

See the inline comment in neural_audio_tokenizer.py and implement the requested considerations and potential changes, plus any further changes you determine to be necessary or likely beneficial. Thank you. :) 9 months ago

bug enhancement good first issue help wanted

neural_audio_tokenizer.py needs some fixes and updates to reach the next release tag 9 months ago

AI Summary: The `neural_audio_tokenizer.py` script requires a comprehensive review and refactoring to improve code consistency and logic. A key update involves enhancing the optional Encodec sections to achieve better token diversity, drawing inspiration from `mert`'s more effective strategies beyond simple jitter. Additionally, the issue calls for modernizing command-line arguments for improved user ergonomics, with the successful completion of these tasks paving the way for a new release.

Complexity: 4/5

bug enhancement good first issue help wanted
