US (Other)
mercury-2
Public AI IQ model profile with source-backed benchmark results and derived ranking context.
Proprietary
IQ: 92
EQ: n/a
Effective Cost: n/a
IQ Rank: #70
IQ Dimensions
| Dimension | IQ | Coverage |
|---|---|---|
| mathematical reasoning | 95 | 0/5 |
| scientific reasoning | 120 | 4/4 |
| abstract reasoning | 71 | 0/3 |
| app building | 72 | 1/4 |
| production engineering | 94 | 0/7 |
| computer use | 98 | 1/7 |
| reliability | 91 | 2/3 |
Benchmark Results
| Benchmark | Score | Dimension |
|---|---|---|
| AA Omniscience | -52 | reliability |
| Arena.ai Overall | 1346 | |
| CritPt | 1 | scientific reasoning |
| GPQA Diamond | 77 | scientific reasoning |
| Humanity's Last Exam | 16 | scientific reasoning |
| IFBench | 70 | reliability |
| SciCode | 39 | scientific reasoning |
| Terminal-Bench Hard | 27 | computer use |
| Arena.ai WebDev | 1165 | app building |
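
The two tables can be cross-checked against each other. The sketch below assumes that a Coverage value of "n/m" means n of the m benchmarks tracked for a dimension have a reported result; the page does not state this, so treat it as a reading of the data rather than the site's definition. The names `coverage`, `benchmarks`, and `tally_by_dimension` are illustrative only.

```python
from collections import Counter

# Coverage column from the IQ Dimensions table, as (covered, total).
# Assumption: "n/m" means n of m tracked benchmarks have a result.
coverage = {
    "mathematical reasoning": (0, 5),
    "scientific reasoning": (4, 4),
    "abstract reasoning": (0, 3),
    "app building": (1, 4),
    "production engineering": (0, 7),
    "computer use": (1, 7),
    "reliability": (2, 3),
}

# Rows of the Benchmark Results table: (benchmark, score, dimension).
# None marks the row whose Dimension cell is empty.
benchmarks = [
    ("AA Omniscience", -52, "reliability"),
    ("Arena.ai Overall", 1346, None),
    ("CritPt", 1, "scientific reasoning"),
    ("GPQA Diamond", 77, "scientific reasoning"),
    ("Humanity's Last Exam", 16, "scientific reasoning"),
    ("IFBench", 70, "reliability"),
    ("SciCode", 39, "scientific reasoning"),
    ("Terminal-Bench Hard", 27, "computer use"),
    ("Arena.ai WebDev", 1165, "app building"),
]


def tally_by_dimension(rows):
    """Count reported benchmarks per dimension, skipping rows with no dimension."""
    return Counter(dim for _, _, dim in rows if dim is not None)


counts = tally_by_dimension(benchmarks)

# Dimensions where the tally disagrees with the Coverage numerator.
mismatches = {
    dim: (counts.get(dim, 0), covered)
    for dim, (covered, _total) in coverage.items()
    if counts.get(dim, 0) != covered
}

print(mismatches)  # an empty dict means the two tables agree
```

Under that reading, the per-dimension counts in Benchmark Results line up with the Coverage numerators, and the three dimensions listed with 0 coverage have no benchmark rows at all.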