TriviaQA
Open-domain question answering over 650k trivia question/answer pairs. Long-tail world knowledge.
Best results
Frontier over time
All results
| # | Model | Score | Conditions | Eval date | Source | Flags |
|---|---|---|---|---|---|---|
| 1 | Claude 2 | 87.5% | 5-shot · standard | 11 Jul 2023 | Self-reported | |
| 2 | LLaMA 2 | 85.0% | 1-shot | 19 Jul 2023 | Paper | Primary Verified |
| 3 | LLaMA 2 70B | 85.0% | 1-shot | 11 Jul 2023 | Paper | |
| 4 | Mixtral 8x7B | 71.5% | — | 08 Jan 2024 | Self-reported | Primary |
| 5 | Mistral 7B | 69.9% | 5-shot | 10 Oct 2023 | Paper | |
| 6 | Gemma 2 | 59.4% | 5-shot | 25 Feb 2025 | Self-reported | Primary |
MongoDB - Build AI That Scales
