Multimodal RL training framework for diffusion & omni models

465 stars 69 forks 465 watchers Python Apache License 2.0
diffusion-models flow-matching grpo multimodal qwen reinforcement-learning rlhf vllm
8 Open Issues Need Help Last updated: Jul 1, 2026

Open Issues Need Help

View All on GitHub

Multimodal RL training framework for diffusion & omni models

Python
#diffusion-models#flow-matching#grpo#multimodal#qwen#reinforcement-learning#rlhf#vllm

Multimodal RL training framework for diffusion & omni models

Python
#diffusion-models#flow-matching#grpo#multimodal#qwen#reinforcement-learning#rlhf#vllm
good first issue

Multimodal RL training framework for diffusion & omni models

Python
#diffusion-models#flow-matching#grpo#multimodal#qwen#reinforcement-learning#rlhf#vllm
good first issue

Multimodal RL training framework for diffusion & omni models

Python
#diffusion-models#flow-matching#grpo#multimodal#qwen#reinforcement-learning#rlhf#vllm
help wanted

Multimodal RL training framework for diffusion & omni models

Python
#diffusion-models#flow-matching#grpo#multimodal#qwen#reinforcement-learning#rlhf#vllm

Multimodal RL training framework for diffusion & omni models

Python
#diffusion-models#flow-matching#grpo#multimodal#qwen#reinforcement-learning#rlhf#vllm
good first issue help wanted

Multimodal RL training framework for diffusion & omni models

Python
#diffusion-models#flow-matching#grpo#multimodal#qwen#reinforcement-learning#rlhf#vllm

Multimodal RL training framework for diffusion & omni models

Python
#diffusion-models#flow-matching#grpo#multimodal#qwen#reinforcement-learning#rlhf#vllm