TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

MathVista

MathVista (testmini)

Mathematical reasoning over visual contexts: figures, charts, diagrams, geometric drawings.

Multimodal Multimodal accuracy Max 100.0% Released Oct 2023
8
Results
8
Models scored
86.8%
Top: o3
71.9%
Median

Best results

Top primary scores; one row per model.
1
86.8%
2
84.3%
4
72.0%
5
71.8%
7
63.8%

Frontier over time

Each dot is one model result; the line traces the running best score.
Best score over time0.0025.050.075.0100.0Oct 2024Jan 2025Apr 2025

All results

Showing one canonical row per model. Show all configurations
# Model Score Conditions Eval date Source Flags
1 o3 86.8% Apr 16, 2025 self reported primary
2 o4 mini 84.3% Apr 16, 2025 self reported primary
3 Llama 4 Maverick 73.7% Apr 5, 2025 self reported primary
4 GPT 4.1 72.0% Apr 14, 2025 self reported primary
5 o1 71.8% Apr 16, 2025 self reported primary
6 Llama 4 Scout 70.7% Apr 5, 2025 self reported primary
7 GPT-4o 63.8% Apr 16, 2025 self reported primary
8 Pixtral 12B 58.3% CoT Oct 10, 2024 self reported primary
0 AIs selected
Clear selection
#
Name
Task