Google
gemma-4-31b
Public AI IQ model profile with source-backed benchmark results and derived ranking context. Open Source
IQ
96
EQ
112
Effective Cost
n/a
IQ Rank
#53
IQ Dimensions
| Dimension | IQ | Coverage |
|---|---|---|
| mathematical reasoning | 97 | 0/5 |
| scientific reasoning | 124 | 4/4 |
| abstract reasoning | 73 | 0/3 |
| app building | 87 | 1/4 |
| production engineering | 85 | 0/7 |
| computer use | 106 | 2/7 |
| reliability | 96 | 3/3 |
Benchmark Results
| Benchmark | Score | Dimension |
|---|---|---|
| AA Omniscience | -45 | reliability |
| Arena.ai Overall | 1451 | |
| BullshitBench v2 | 25 | reliability |
| CritPt | 1 | scientific-reasoning |
| EQ-Bench 3 Elo | 1450.6 | |
| GPQA Diamond | 86 | scientific-reasoning |
| Humanity's Last Exam | 23 | scientific-reasoning |
| IFBench | 76 | reliability |
| SciCode | 43 | scientific-reasoning |
| Terminal-Bench 2.1 | 43 | computer-use |
| Terminal-Bench Hard | 36 | computer-use |
| Arena.ai WebDev | 1373 | app-building |