OpenAI
gpt-oss-120b
Public AI IQ model profile with source-backed benchmark results and derived ranking context. Open Source
IQ
94
EQ
99
Effective Cost
$0.7857
IQ Rank
#56
IQ Dimensions
| Dimension | IQ | Coverage |
|---|---|---|
| mathematical reasoning | 113 | 2/5 |
| scientific reasoning | 121 | 4/4 |
| abstract reasoning | 74 | 0/3 |
| app building | 79 | 0/4 |
| production engineering | 90 | 3/7 |
| computer use | 99 | 2/7 |
| reliability | 86 | 3/3 |
Benchmark Results
| Benchmark | Score | Dimension |
|---|---|---|
| AA Omniscience | -50 | reliability |
| AIME | 92.598 | mathematical-reasoning |
| Arena.ai Overall | 1353 | |
| BullshitBench v2 | 5 | reliability |
| CritPt | 1.1 | scientific-reasoning |
| EQ-Bench 3 Elo | 1131.5 | |
| GPQA Diamond | 78.2 | scientific-reasoning |
| Humanity's Last Exam | 18.5 | scientific-reasoning |
| IFBench | 69 | reliability |
| LiveCodeBench | 83.23 | production-engineering |
| MathArena | 42.4 | mathematical-reasoning |
| SciCode | 38.9 | scientific-reasoning |