☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

huggingface inference inference-platform kubernetes llamacpp llm modelscope ollama sglang text-generation-inference vllm
1 Open Issue Need Help Last updated: Jun 19, 2025

Open Issues Need Help

View All on GitHub
AI/ML Inference Platforms

AI Summary: Update the llmaz documentation and example to reflect the changes in Envoy AI Gateway v0.2.0. This involves replacing service references with backend references in the `docs/examples/envoy-ai-gateway` example and updating the relevant documentation to accurately reflect the new features and breaking changes of v0.2.0.

Complexity: 3/5
help wanted cleanup needs-priority needs-triage

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

Go
#huggingface#inference#inference-platform#kubernetes#llamacpp#llm#modelscope#ollama#sglang#text-generation-inference#vllm