OpenAI

gpt-oss-120b

Public AI IQ model profile with source-backed benchmark results and derived ranking context. Open Source

IQ
94
EQ
99
Effective Cost
$0.7857
IQ Rank
#56

IQ Dimensions

DimensionIQCoverage
mathematical reasoning1132/5
scientific reasoning1214/4
abstract reasoning740/3
app building790/4
production engineering903/7
computer use992/7
reliability863/3

Benchmark Results

BenchmarkScoreDimension
AA Omniscience-50reliability
AIME92.598mathematical-reasoning
Arena.ai Overall1353
BullshitBench v25reliability
CritPt1.1scientific-reasoning
EQ-Bench 3 Elo1131.5
GPQA Diamond78.2scientific-reasoning
Humanity's Last Exam18.5scientific-reasoning
IFBench69reliability
LiveCodeBench83.23production-engineering
MathArena42.4mathematical-reasoning
SciCode38.9scientific-reasoning