Open Issues Need Help
AI Summary: The user is encountering an `IsADirectoryError` during training with the `transformers` `Trainer` when `tqdm` is enabled. The error message is highly unusual, as it shows `tqdm`'s HTML progress bar output being misinterpreted as a file path, suggesting an unexpected interaction between the `Trainer`'s internal file operations (e.g., logging or checkpointing) and `tqdm`'s rich display output, likely in a notebook environment.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
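One narrowing observation: Python raises `IsADirectoryError` only when the string handed to `open()` resolves to an existing directory, which constrains where the `tqdm` output could be entering the `Trainer`'s path handling. A minimal standard-library sketch of the error class (`try_open` is a hypothetical helper for illustration, not `Trainer` code):

```python
import tempfile

def try_open(path):
    """Return the exception type raised when opening `path` for writing, or None."""
    try:
        with open(path, "w"):
            return None
    except OSError as exc:
        return type(exc)

# A directory path triggers the same error class seen in the traceback
# (on POSIX; Windows raises PermissionError for this case instead).
print(try_open(tempfile.mkdtemp()))
```

If the offending string did not name an existing directory, `open()` would raise `FileNotFoundError` instead, so the traceback suggests the garbled path is being created (or matched) as a directory before the write.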
AI Summary: This GitHub issue proposes improving the documentation for the `EncoderDecoderModel` in Hugging Face Transformers, as the current docs are outdated and cause user confusion. The task involves updating the existing `encoder-decoder.mdx` file to include a "How-to-guide" on creating, saving, and fine-tuning the model, along with a warning about correctly setting configuration values.
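Since the proposed guide centers on correctly setting configuration values, a sketch of that step may help. This uses the public `BertConfig` / `EncoderDecoderConfig` API; the tiny layer sizes and the hard-coded token ids (101 for BERT's `[CLS]`, 0 for `[PAD]`) are illustrative — in real code they would come from the tokenizer:

```python
from transformers import BertConfig, EncoderDecoderConfig

# Arbitrarily small configs so no pretrained weights are needed for the sketch.
enc = BertConfig(hidden_size=32, num_hidden_layers=2,
                 num_attention_heads=2, intermediate_size=64)
dec = BertConfig(hidden_size=32, num_hidden_layers=2,
                 num_attention_heads=2, intermediate_size=64)

# This helper marks the decoder config as a decoder with cross-attention.
config = EncoderDecoderConfig.from_encoder_decoder_configs(enc, dec)

# The warning the guide would carry: generation fails unless these are set.
config.decoder_start_token_id = 101  # illustrative; normally tokenizer.cls_token_id
config.pad_token_id = 0              # illustrative; normally tokenizer.pad_token_id
```

A guide built around this would then pass `config` to `EncoderDecoderModel` and fine-tune as usual.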
AI Summary: The user requests the addition of DINOv3 to AutoBackbone, noting that DINOv2 is already included. They suggest DINOv3 could directly inherit from DINOv2 for ease of implementation and user convenience.
AI Summary: The task is to investigate and resolve a performance issue in the Hugging Face Transformers library. The text generation pipeline is significantly slower when a large list of `bad_words_ids` is used. The solution requires profiling the code to identify the bottleneck (inefficient looping, tensor access, or slow regex) and optimizing the relevant section to improve performance.
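A hypothetical sketch of the suspected hot loop (not the library's actual code): a naive check scans every banned sequence at every decoding step, so cost grows as O(steps × num_bad_words). Grouping sequences by their prefix turns each step into a handful of dict lookups instead:

```python
from collections import defaultdict

def naive_banned_tokens(generated, bad_words_ids):
    # Rescan every banned sequence per step: slow for large bad-word lists.
    banned = []
    for seq in bad_words_ids:
        *prefix, last = seq
        if not prefix or generated[-len(prefix):] == prefix:
            banned.append(last)
    return banned

def build_index(bad_words_ids):
    """Map each banned sequence's prefix to the final tokens it forbids."""
    index = defaultdict(list)
    for seq in bad_words_ids:
        index[tuple(seq[:-1])].append(seq[-1])
    return index

def fast_banned_tokens(generated, index, max_prefix_len):
    banned = list(index.get((), []))  # single-token bans always apply
    for k in range(1, min(max_prefix_len, len(generated)) + 1):
        banned.extend(index.get(tuple(generated[-k:]), []))
    return banned

bad_words = [[5], [1, 2, 7], [2, 9]]
idx = build_index(bad_words)
max_len = max(len(s) - 1 for s in bad_words)
assert sorted(naive_banned_tokens([3, 1, 2], bad_words)) == \
       sorted(fast_banned_tokens([3, 1, 2], idx, max_len))
```

Profiling would confirm whether the real bottleneck is this kind of per-step rescan, tensor indexing, or regex work, but the same precompute-then-lookup shape applies in each case.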