Terminal-Bench 2.1 Benchmark Scores
Each model's Terminal-Bench 2.1 pass@1 score, using Artificial Analysis as canonical and Vals.ai as fallback. Color = provider.
Terminal-Bench 2.1 Benchmark Scores
Each model's Terminal-Bench 2.1 pass@1 score, using Artificial Analysis as canonical and Vals.ai as fallback. Color = provider.
How To Read This Chart
This benchmark chart uses source-backed benchmark rows mapped to public AI IQ model profiles.
Top Models
| Rank | Model | Provider | Score |
|---|---|---|---|
| 1 | fable-5 | Anthropic | 85 |
| 2 | opus-4.8 | Anthropic | 85 |
| 3 | gpt-5.5 | OpenAI | 84 |
| 4 | opus-4.7 | Anthropic | 83 |
| 5 | gemini-3.5-flash | 79 | |
| 6 | glm-5.2 | Z.ai | 78 |
| 7 | qwen3.7-max | Alibaba | 75 |
| 8 | gemini-3.1-pro | 74 | |
| 9 | sonnet-4.6 | Anthropic | 71 |
| 10 | kimi-k2.7-code | Kimi | 67 |
| 11 | mimo-v2.5-pro | Xiaomi | 65 |
| 12 | minimax-m3 | MiniMax | 65 |