Alibaba's open-weight vision-language model
Qwen Visual Language model series from Alibaba. Strong at multilingual visual understanding, document parsing, and chart reading. Available as open weights on HuggingFace. Runs via vLLM.
Vision-language models for image understanding, captioning, visual QA, and document parsing
AIchitect's Genome scanner detects Qwen-VL in your project via these signals:
transformersAdd to your GitHub README
[](https://aichitect.dev/tool/qwen-vl)Explore the full AI landscape
See how Qwen-VL fits into the bigger picture — browse all 207 tools and their relationships.