US (Other)

mercury-2

Public AI IQ model profile with source-backed benchmark results and derived ranking context.

Proprietary

IQ: 92
EQ: n/a
Effective Cost: n/a
IQ Rank: #70

IQ Dimensions

Dimension               IQ   Coverage
mathematical reasoning  95   0/5
scientific reasoning    120  4/4
abstract reasoning      71   0/3
app building            72   1/4
production engineering  94   0/7
computer use            98   1/7
reliability             91   2/3

Benchmark Results

Benchmark             Score  Dimension
AA Omniscience        -52    reliability
Arena.ai Overall      1346
CritPt                1      scientific-reasoning
GPQA Diamond          77     scientific-reasoning
Humanity's Last Exam  16     scientific-reasoning
IFBench               70     reliability
SciCode               39     scientific-reasoning
Terminal-Bench Hard   27     computer-use
Arena.ai WebDev       1165   app-building