GSO-Bench Scores
Each model's GSO Opt@1 score for software optimization tasks. Color = provider.
GSO-Bench Scores
Each model's GSO Opt@1 score for software optimization tasks. Color = provider.
How To Read This Chart
This benchmark chart uses source-backed benchmark rows mapped to public AI IQ model profiles.
Top Models
| Rank | Model | Provider | Score |
|---|---|---|---|
| 1 | opus-4.7 | Anthropic | 44.12 |
| 2 | opus-4.6 | Anthropic | 41.18 |
| 3 | gpt-5.5 | OpenAI | 40.2 |
| 4 | gpt-5.4 | OpenAI | 31.37 |
| 5 | gpt-5.2 | OpenAI | 27.45 |
| 6 | opus-4.5 | Anthropic | 26.47 |
| 7 | gemini-3.1-pro | 22.55 | |
| 8 | gemini-3-pro | 18.63 | |
| 9 | gpt-5.1 | OpenAI | 13.73 |
| 10 | gemini-3-flash | 9.8 | |
| 11 | o3 | OpenAI | 8.82 |
| 12 | gpt-5 | OpenAI | 6.86 |