Agents' Last Exam Benchmark Scores

Full / Overall pass rates from Agents' Last Exam. Color = provider.

Agents' Last Exam Benchmark Scores

Full / Overall pass rates from Agents' Last Exam. Color = provider.

How To Read This Chart

This benchmark chart uses source-backed benchmark rows mapped to public AI IQ model profiles.

Top Models

RankModelProviderScore
1gpt-5.5OpenAI24
2gpt-5.4OpenAI20.5
3opus-4.7Anthropic18.4
4gemini-3.1-proGoogle15.8
5opus-4.8Anthropic15.8
6opus-4.6Anthropic14.1
7deepseek-v4-proDeepSeek12.4
8qwen3.7-maxAlibaba11.8
9glm-5.1Z.ai11.5
10kimi-k2.6Kimi9.2
11mimo-v2.5Xiaomi8.6
12qwen3.6-plusAlibaba8.6

Related Charts