Mistral's multimodal vision model
Mistral's vision-language model available via Mistral API and as open weights. Supports multiple images per prompt, high-resolution understanding, and code extraction from screenshots.
Vision-language models for image understanding, captioning, visual QA, and document parsing
AIchitect's Genome scanner detects Pixtral in your project via these signals:
mistralaiMISTRAL_API_KEYAdd to your GitHub README
[](https://aichitect.dev/tool/pixtral)Explore the full AI landscape
See how Pixtral fits into the bigger picture — browse all 207 tools and their relationships.