MultimodalCommercial

Pixtral

Mistral's multimodal vision model

App Infrastructure

About

Mistral's vision-language model available via Mistral API and as open weights. Supports multiple images per prompt, high-resolution understanding, and code extraction from screenshots.

Choose Pixtral when…

  • You want a commercial vision model with competitive pricing
  • You need multi-image understanding in a single prompt
  • You're already using Mistral's API ecosystem

Builder Slot

How does your AI see and understand images?Optional for most stacks

Vision-language models for image understanding, captioning, visual QA, and document parsing

Dev Tools
Not applicable
App Infra
Optional
Hybrid
Optional

Other tools in this slot:

Stack Genome Detection

AIchitect's Genome scanner detects Pixtral in your project via these signals:

pip packages
mistralai
env vars
MISTRAL_API_KEY

Integrates with (1)

LiteLLMLLM Infrastructure
Compare →

Alternatives to consider (1)

Pricing

Pixtral 12B$0.15/1M tokens
Pixtral Large$2/1M tokens

Badge

Add to your GitHub README

Pixtral on AIchitect[![Pixtral](https://aichitect.dev/badge/tool/pixtral)](https://aichitect.dev/tool/pixtral)

Explore the full AI landscape

See how Pixtral fits into the bigger picture — browse all 207 tools and their relationships.

Explore graph →