linkedin/Liger-Kernel

[transformers] support DeepSeek V3 1 day ago

enhancement, good first issue, help wanted, huggingface, feature

6.5K

Efficient Triton Kernels for LLM Training

Python

#finetuning #gemma2 #hacktoberfest #llama #llama3 #llm-training #llms #mistral #phi3 #triton #triton-kernels

DeepSeek Native Sparse Attention (NSA) Kernel 4 days ago

help wanted, feature, fun

[feat] on-paper form of RoPE 9 days ago

enhancement, good first issue, feature

TiledMLP 4 months ago

help wanted

Transformers v5 compatibility 5 months ago

good first issue, help wanted, huggingface

Qwen2VLConfig and Qwen2_5_VLConfig have no attribute `hidden_size` 5 months ago

good first issue

AttributeError for 'language_model' in transformers v5 7 months ago

good first issue

Add support for the SmolLM3 model. 12 months ago

AI Summary: Add support for the SmolLM3 model to the Liger Kernel library by implementing the patching APIs and kernels needed to optimize its performance, similar to the existing support for LLMs such as LLaMA. This involves adapting the existing framework to SmolLM3's specific architecture and operations.

Complexity: 4/5

good first issue, huggingface

Open Issues Need Help