OpenAI
gpt-5.1
Public AI IQ model profile with source-backed benchmark results and derived ranking context.
Proprietary
| Metric | Value |
|---|---|
| IQ | 106 |
| EQ | 114 |
| Effective Cost | $9.0811 |
| IQ Rank | #35 |
IQ Dimensions
| Dimension | IQ | Coverage |
|---|---|---|
| mathematical reasoning | 118 | 4/5 |
| scientific reasoning | 130 | 4/4 |
| abstract reasoning | 83 | 2/3 |
| app building | 96 | 2/4 |
| production engineering | 105 | 2/7 |
| computer use | 109 | 1/7 |
| reliability | 99 | 2/3 |
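
The Coverage column can be read programmatically. The sketch below assumes each cell means "benchmarks with a score / benchmarks in that dimension", which the profile does not state explicitly. The helper name `coverage_ratio` and the one-half threshold are hypothetical, chosen only for illustration.

```python
from fractions import Fraction

# Coverage cells copied from the IQ Dimensions table.
coverage = {
    "mathematical reasoning": "4/5",
    "scientific reasoning": "4/4",
    "abstract reasoning": "2/3",
    "app building": "2/4",
    "production engineering": "2/7",
    "computer use": "1/7",
    "reliability": "2/3",
}


def coverage_ratio(cell: str) -> Fraction:
    """Parse a 'covered/total' cell into an exact fraction (assumed meaning)."""
    covered, total = cell.split("/")
    return Fraction(int(covered), int(total))


# Dimensions whose coverage falls below one half (hypothetical threshold).
sparse = [
    name
    for name, cell in coverage.items()
    if coverage_ratio(cell) < Fraction(1, 2)
]

for name in sparse:
    print(f"{name}: {coverage[name]} coverage")
```

Under that reading, a dimension score backed by few benchmarks, such as computer use at 1/7, rests on less evidence than one with full coverage, such as scientific reasoning at 4/4.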
Benchmark Results
| Benchmark | Score | Dimension |
|---|---|---|
| AIME | 93.333 | mathematical-reasoning |
| ARC-AGI-1 | 72.8 | abstract-reasoning |
| ARC-AGI-2 | 17.6 | abstract-reasoning |
| Arena.ai Overall | 1439 | |
| BullshitBench v2 | 25 | reliability |
| CritPt | 5 | scientific-reasoning |
| EQ-Bench 3 Elo | 1553.8 | |
| FrontierMath Tier 1-3 | 31 | mathematical-reasoning |
| FrontierMath Tier 4 | 12.5 | mathematical-reasoning |
| GPQA Diamond | 85.6 | scientific-reasoning |
| Humanity's Last Exam | 27 | scientific-reasoning |
| IFBench | 75 | reliability |