YALIS stands for Yet Another LLM Inference System. It is what it is.

2 Open Issues Need Help (last updated: Jul 2, 2025)

Labels: enhancement, help wanted

Python

AI Summary: Optimize the YALIS LLM inference system's initialization time. This involves reducing both model loading time and the overhead of the first inference iteration, which is currently slow due to TorchDynamo/torch.compile warmup and CUDA graph recording. The goal is to improve the system's usability by significantly reducing startup time.

Complexity: 5/5
Labels: enhancement, help wanted

Python
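
Before working on the initialization-time issue summarized above, it may help to measure where startup time actually goes. The following is a minimal sketch, not YALIS code: `profile_startup`, `StartupProfile`, `load_model`, and `run_iteration` are hypothetical names introduced here for illustration, and the stand-in functions in the demo only simulate a slow load and a slow first iteration. It separates model loading, the first iteration (where compile warmup and CUDA graph recording would land), and steady-state iterations.

```python
import time
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class StartupProfile:
    load_s: float         # time spent in load_model()
    first_iter_s: float   # time of the first iteration (includes any warmup)
    steady_iter_s: float  # mean time of the later iterations

    @property
    def warmup_overhead_s(self) -> float:
        # Extra cost of the first iteration compared with a steady-state one.
        return max(0.0, self.first_iter_s - self.steady_iter_s)


def profile_startup(
    load_model: Callable[[], Any],
    run_iteration: Callable[[Any], Any],
    steady_iters: int = 5,
) -> StartupProfile:
    """Time model loading, the first iteration, and steady-state iterations.

    Assumption: run_iteration blocks until its work is finished. With
    asynchronous GPU execution the caller would need to synchronize inside
    run_iteration (for example with torch.cuda.synchronize()) for the
    timings to be meaningful.
    """
    if steady_iters < 1:
        raise ValueError("steady_iters must be at least 1")

    t0 = time.perf_counter()
    model = load_model()
    t1 = time.perf_counter()

    run_iteration(model)  # first call: pays any one-time warmup cost
    t2 = time.perf_counter()

    for _ in range(steady_iters):
        run_iteration(model)
    t3 = time.perf_counter()

    return StartupProfile(
        load_s=t1 - t0,
        first_iter_s=t2 - t1,
        steady_iter_s=(t3 - t2) / steady_iters,
    )


if __name__ == "__main__":
    # Hypothetical stand-ins that only simulate the startup behaviour.
    def fake_load() -> dict:
        time.sleep(0.02)          # pretend to read weights
        return {"warm": False}

    def fake_iteration(model: dict) -> None:
        if not model["warm"]:
            time.sleep(0.05)      # pretend to compile / record graphs once
            model["warm"] = True
        time.sleep(0.001)         # pretend to run one forward pass

    profile = profile_startup(fake_load, fake_iteration)
    print(f"load:            {profile.load_s:.4f} s")
    print(f"first iteration: {profile.first_iter_s:.4f} s")
    print(f"steady state:    {profile.steady_iter_s:.4f} s")
    print(f"warmup overhead: {profile.warmup_overhead_s:.4f} s")
```

Reporting the three phases separately shows whether a change helps loading, warmup, or neither, which a single end-to-end startup number would hide.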