Seed1.5-VL is a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning. It achieves state-of-the-art performance on 38 out of 60 public benchmarks.

1.4K stars · 55 forks · 1.4K watchers · Jupyter Notebook · Apache License 2.0
cookbook large-language-model multimodal-large-language-models vision-language-model
1 open issue needs help · Last updated: Sep 13, 2025

