vLLM on AMD RDNA4 (gfx1201 / RX 9070 XT-9070): clone-and-docker-compose-up, with a W4A8-FP8-WMMA MoE kernel

9 stars 0 forks 9 watchers Python Apache License 2.0
2 Open Issues Need Help Last updated: Jun 30, 2026

Open Issues Need Help

View All on GitHub
enhancement help wanted

vLLM on AMD RDNA4 (gfx1201 / RX 9070 XT-9070): clone-and-docker-compose-up, with a W4A8-FP8-WMMA MoE kernel

Python
enhancement help wanted

vLLM on AMD RDNA4 (gfx1201 / RX 9070 XT-9070): clone-and-docker-compose-up, with a W4A8-FP8-WMMA MoE kernel

Python