World (Other)
mistral-large-3
Public AI IQ model profile with source-backed benchmark results and derived ranking context.
Open Source
| Metric | Value |
|---|---|
| IQ | 84 |
| EQ | 104 |
| Effective Cost | $0.6636 |
| IQ Rank | #79 |
IQ Dimensions
| Dimension | IQ | Coverage |
|---|---|---|
| mathematical reasoning | 82 | 1/5 |
| scientific reasoning | 107 | 4/4 |
| abstract reasoning | 71 | 0/3 |
| app building | 76 | 1/4 |
| production engineering | 86 | 2/7 |
| computer use | 86 | 2/7 |
| reliability | 80 | 3/3 |
Benchmark Results
| Benchmark | Score | Dimension |
|---|---|---|
| AA Omniscience | -39 | reliability |
| AIME | 42.917 | mathematical reasoning |
| Arena.ai Overall | 1416 | |
| AttuneBench Composite | 51.8 | |
| BullshitBench v2 | 2 | reliability |
| CritPt | 0 | scientific reasoning |
| EQ-Bench 3 Elo | 935.8 | |
| GPQA Diamond | 68 | scientific reasoning |
| Humanity's Last Exam | 4.1 | scientific reasoning |
| IFBench | 36.2 | reliability |
| LiveCodeBench | 55.34 | production engineering |
| SciCode | 36.2 | scientific reasoning |