A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

42 stars | 8 forks | 42 watchers | Python | Apache License 2.0
2 open issues need help | Last updated: Sep 12, 2025

Open Issues Need Help

Labels: enhancement, good first issue
