Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM.

1 open issue needs help. Last updated: Jun 20, 2025

Open Issues Need Help

AI Summary: The task is to determine how the project extracts audio tokens from audio files, and specifically how the file "data/spks/luna.pt", which contains audio features and tokens, was created. The open question is whether a dedicated audio tokenizer is used or whether the voice is trained directly into the model. The goal is to replicate this process for voice cloning.

Complexity: 4/5
Labels: help wanted, question

Python