GSO-Bench Scores

Each model's GSO Opt@1 score for software optimization tasks. Color = provider.

How To Read This Chart

Each row in this chart is a source-backed benchmark result mapped to a public AI IQ model profile.

Top Models

Rank  Model           Provider   Score
1     opus-4.7        Anthropic  44.12
2     opus-4.6        Anthropic  41.18
3     gpt-5.5         OpenAI     40.2
4     gpt-5.4         OpenAI     31.37
5     gpt-5.2         OpenAI     27.45
6     opus-4.5        Anthropic  26.47
7     gemini-3.1-pro  Google     22.55
8     gemini-3-pro    Google     18.63
9     gpt-5.1         OpenAI     13.73
10    gemini-3-flash  Google     9.8
11    o3              OpenAI     8.82
12    gpt-5           OpenAI     6.86