TAAFT
Free mode
100% free
Freemium
Free Trial
Create tool

Pixtral 12B

New Text Gen
Released: September 17, 2024

Overview

Pixtral 12B is Mistral’s mid-sized vision–language model. It accepts images plus text prompts and generates grounded answers in text, with solid OCR, chart/diagram reasoning, and screenshot/UI understanding. It supports long-context input, function/tool calling, and JSON outputs, balancing quality and efficiency.

Description

Pixtral 12B combines a strong visual encoder with Mistral’s 12B-parameter language backbone, giving it the ability to “look and reason” while staying small enough for practical serving. It can handle documents, tables, dashboards, and natural images, reading fine-grained text and interpreting layouts or visual structures. Instruction tuning helps it follow prompts reliably, point to relevant details, and format responses in schema-consistent JSON, which makes it easy to slot into retrieval-augmented pipelines, RAG agents, or enterprise copilots.

Compared to Pixtral Large, the 12B variant trades raw accuracy for lower latency and cost, making it better suited to high-throughput or budget-conscious deployments. It handles multi-image prompts, streams tokens for responsive interaction, and supports fine-tuning with LoRA to capture domain-specific needs like invoices, contracts, or technical diagrams. In practice, teams use Pixtral 12B for document automation, multimodal assistants, analytics over charts and dashboards, and developer helpers that reason directly from screenshots—all with faster inference and lower infrastructure overhead than the flagship Large model.

About Mistral AI

Mistral AI is a company that specializes in artificial intelligence and machine learning solutions.

Industry: Technology, Information and Internet
Company Size: 11-50
Location: Paris, FR
Website: mistral.ai
View Company Profile

Related Models

Last updated: September 22, 2025