DeepSeek
deepseek-v4-flash
Public AI IQ model profile with source-backed benchmark results and derived ranking context.
Open Source
| Metric | Value |
|---|---|
| IQ | 104 |
| EQ | 112 |
| Effective Cost | $0.8674 |
| IQ Rank | #36 |
IQ Dimensions
| Dimension | IQ | Coverage |
|---|---|---|
| mathematical reasoning | 108 | 1/5 |
| scientific reasoning | 132 | 4/4 |
| abstract reasoning | 79 | 0/3 |
| app building | 93 | 0/4 |
| production engineering | 103 | 1/7 |
| computer use | 117 | 3/7 |
| reliability | 96 | 3/3 |
Benchmark Results
| Benchmark | Score | Dimension |
|---|---|---|
| AA Omniscience | -23 | reliability |
| Arena.ai Overall | 1434 | |
| BrowseComp | 73.2 | computer-use |
| BullshitBench v2 | 14 | reliability |
| CritPt | 7 | scientific-reasoning |
| EQ-Bench 3 Elo | 1499.9 | |
| GPQA Diamond | 89 | scientific-reasoning |
| Humanity's Last Exam | 32 | scientific-reasoning |
| IFBench | 79 | reliability |
| MathArena | 60.6 | mathematical-reasoning |
| SciCode | 45 | scientific-reasoning |
| SWE-Bench Verified | 79 | production-engineering |
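
As a rough cross-check of the Coverage column, the listed results can be tallied per dimension. The sketch below is only an illustration: it assumes the Coverage numerator counts benchmarks with a reported result, which the page does not state, and the names `RESULTS` and `listed_per_dimension` are hypothetical. The rows are copied from the Benchmark Results table.

```python
from collections import Counter

# Rows copied from the Benchmark Results table: (benchmark, score, dimension).
# An empty dimension string means the page lists no dimension for that row.
RESULTS = [
    ("AA Omniscience", -23, "reliability"),
    ("Arena.ai Overall", 1434, ""),
    ("BrowseComp", 73.2, "computer-use"),
    ("BullshitBench v2", 14, "reliability"),
    ("CritPt", 7, "scientific-reasoning"),
    ("EQ-Bench 3 Elo", 1499.9, ""),
    ("GPQA Diamond", 89, "scientific-reasoning"),
    ("Humanity's Last Exam", 32, "scientific-reasoning"),
    ("IFBench", 79, "reliability"),
    ("MathArena", 60.6, "mathematical-reasoning"),
    ("SciCode", 45, "scientific-reasoning"),
    ("SWE-Bench Verified", 79, "production-engineering"),
]


def listed_per_dimension(results):
    """Count listed benchmark rows per dimension, skipping rows with no dimension."""
    return Counter(dim for _, _, dim in results if dim)


counts = listed_per_dimension(RESULTS)
for dim, n in sorted(counts.items()):
    print(f"{dim}: {n}")
```

Under that assumption, the tallies agree with the Coverage numerators for mathematical reasoning, scientific reasoning, production engineering, and reliability. Computer use is listed as 3/7 while only one computer-use row appears here, so the Coverage figure may count results that this table does not show.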