Model Rankings

LLM rankings, side by side

Benchmarks, cost vs intelligence, context windows, and head-to-head comparisons across every major large language model.

| Model | Provider | Overall | $/1M |
|---|---|---|---|
| GPT-5.5 (xhigh) | OpenAI | 54.8 | $11.25 |
| GPT-5.5 (medium) | OpenAI | 50.4 | $11.25 |
| Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | 47.2 | $6.00 |
| Gemini 3.1 Pro Preview | Google DeepMind | 46.5 | $4.50 |
| DeepSeek V4 Pro (Reasoning, Max Effort) | | 44.3 | $2.17 |
| Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | 43.7 | $10.00 |
| GPT-5.5 (low) | OpenAI | 43.5 | $11.25 |
| Muse Spark | | 43.1 | $0.00 |
| Claude Opus 4.7 (Non-reasoning, High Effort) | Anthropic | 42.7 | $10.00 |
| MiMo-V2.5-Pro | | 42.2 | $1.35 |
| Claude Opus 4.5 (Reasoning) | Anthropic | 40.8 | $10.00 |
| DeepSeek V4 Flash (Reasoning, Max Effort) | | 40.3 | $0.17 |
| GLM-5.1 (Reasoning) | | 40.2 | $2.15 |
| GPT-5.4 mini (xhigh) | OpenAI | 40.0 | $1.69 |
| Qwen3.6 Max Preview | | 40.0 | $2.92 |
| Gemini 3 Pro Preview (high) | | 39.6 | $4.50 |
| Qwen3.6 Plus | | 39.6 | $1.13 |
| GPT-5.4 nano (xhigh) | OpenAI | 38.2 | $0.46 |
| MiniMax-M2.7 | | 38.1 | $0.53 |
| GLM-5-Turbo | | 38.1 | $0.00 |
| Gemini 3 Flash Preview (Reasoning) | | 37.8 | $1.13 |
| DeepSeek V4 Flash (Reasoning, High Effort) | | 37.4 | $0.17 |
| Qwen3.6 27B (Reasoning) | | 37.1 | $1.35 |
| Grok 4.20 0309 (Reasoning) | xAI | 36.5 | $3.00 |
| MiMo-V2-Omni-0327 | | 36.4 | $0.80 |
| GPT-5.5 (Non-reasoning) | OpenAI | 35.4 | $11.25 |
| KAT Coder Pro V2 | | 35.4 | $0.53 |
| GLM-5.1 (Non-reasoning) | | 35.4 | $2.15 |
| Claude 4.5 Sonnet (Reasoning) | Anthropic | 34.7 | $6.00 |
| KAT-Coder-Pro V1 | | 34.6 | $0.53 |
| Kimi K2.6 (Non-reasoning) | | 34.6 | $1.71 |
| GLM 5V Turbo (Reasoning) | | 34.5 | $0.00 |
| Claude Sonnet 4.6 (Non-reasoning, Low Effort) | Anthropic | 34.3 | $6.00 |
| Qwen3.5 397B A17B (Reasoning) | | 33.7 | $1.35 |
| Hy3-preview (Reasoning) | | 33.6 | $0.20 |
| MiMo-V2-Flash (Feb 2026) | | 33.2 | $0.15 |
| Gemini 3 Pro Preview (low) | | 33.1 | $4.50 |
| Kimi K2 Thinking | | 32.7 | $1.07 |
| o3-pro | OpenAI | 32.5 | $35.00 |
| Qwen3.5 122B A10B (Reasoning) | | 32.3 | $1.10 |
| Qwen3.5 397B A17B (Non-reasoning) | | 32.0 | $1.35 |
| Qwen3 Max Thinking | | 31.7 | $2.40 |
| Qwen3.6 35B A3B (Reasoning) | | 31.6 | $0.56 |
| MiMo-V2-Flash (Reasoning) | | 31.2 | $0.15 |
| DeepSeek V4 Pro (Non-reasoning) | | 31.2 | $0.54 |
| Grok 4.1 Fast (Reasoning) | xAI | 30.6 | $0.28 |
| Qwen3.5 Omni Plus | | 30.6 | $1.50 |
| GPT-5.1 Codex mini (high) | OpenAI | 30.6 | $0.69 |
| o3 | OpenAI | 30.4 | $3.50 |
| GPT-5.4 nano (medium) | OpenAI | 30.2 | $0.46 |
| Mistral Medium 3.5 | | 29.9 | $3.00 |
| GPT-5.4 mini (medium) | OpenAI | 29.8 | $1.69 |
| Claude 4.5 Haiku (Reasoning) | Anthropic | 29.6 | $2.00 |
| Gemma 4 31B (Reasoning) | | 29.4 | $0.20 |
| Qwen3.6 27B (Non-reasoning) | | 29.3 | $1.35 |
| DeepSeek V4 Flash (Non-reasoning) | | 28.7 | $0.17 |
| Qwen3.5 122B A10B (Non-reasoning) | | 28.1 | $1.10 |
| MiMo-V2.5-Pro (Non-reasoning) | | 27.9 | $1.35 |
| Gemini 3 Flash Preview (Non-reasoning) | | 27.4 | $1.13 |
| DeepSeek V3.1 Terminus (Reasoning) | | 26.3 | $1.91 |
| Hy3-preview (Non-reasoning) | | 26.1 | $0.20 |
| Ling-2.6-1T | | 26.1 | $0.85 |
| Doubao Seed Code | | 26.0 | $0.00 |
| Gemma 4 26B A4B (Reasoning) | | 25.7 | $0.20 |
| o4-mini (high) | OpenAI | 25.6 | $1.93 |
| Step 3.5 Flash | | 25.5 | $0.15 |
| NVIDIA Nemotron 3 Super 120B A12B (Reasoning) | | 25.4 | $0.41 |
| DeepSeek V3.2 Exp (Reasoning) | | 25.4 | $0.31 |
| Mercury 2 | | 25.3 | $0.38 |
| GLM-4.6 (Reasoning) | | 25.1 | $0.96 |
| Qwen3.5 9B (Reasoning) | | 25.0 | $0.11 |
| Gemini 3.1 Flash-Lite | | 25.0 | $0.56 |
| Qwen3 Max Thinking (Preview) | | 25.0 | $2.40 |
| Gemma 4 31B (Non-reasoning) | | 24.8 | $0.20 |
| DeepSeek V3.2 (Non-reasoning) | | 24.7 | $0.32 |
| MiMo-V2-Flash (Non-reasoning) | | 24.7 | $0.15 |
| K-EXAONE (Reasoning) | | 24.7 | $0.00 |
| Trinity Large Thinking | | 24.5 | $0.40 |
| Qwen3.6 35B A3B (Non-reasoning) | | 24.2 | $0.84 |
| gpt-oss-120b (high) | OpenAI | 23.8 | $0.26 |
| Gemini 2.5 Flash Preview (Sep '25) (Reasoning) | | 23.8 | $0.00 |
| Claude 4.5 Haiku (Non-reasoning) | Anthropic | 23.7 | $2.00 |
| Kimi K2 0905 | | 23.5 | $1.07 |
| o1 | OpenAI | 23.4 | $26.25 |
| EXAONE 4.5 33B | | 23.0 | $0.00 |
| GLM-4.7-Flash (Reasoning) | | 22.9 | $0.15 |
| Grok 3 mini Reasoning (high) | xAI | 22.5 | $0.35 |
| Nova 2.0 Pro Preview (medium) | | 21.8 | $3.44 |
| Nova 2.0 Pro Preview (low) | | 19.6 | $3.44 |
| Nova 2.0 Lite (high) | | 18.2 | $0.85 |
| Claude Code | Anthropic | 9.5 | |
| ElevenLabs Voice (v3) | ElevenLabs | 9.4 | |
| Midjourney v7 | Midjourney | 9.3 | |
| FLUX.1 Pro | Black Forest Labs | 9.1 | |
| Cursor Composer | Anysphere | 9.0 | |
| Sora | OpenAI | 8.8 | |
| Llama 3.3 Instruct 70B | Meta AI | 8.6 | $0.61 |
| GPT 5.5 Codex | OpenAI | | |
| GPT-5.5 Pro (xhigh) | OpenAI | | $0.00 |
| GPT-3.5 Turbo (0613) | OpenAI | | $0.00 |
| Gemini 3 Deep Think | | | $0.00 |
| Cogito v2.1 (Reasoning) | | | $1.25 |
| GPT-4o Realtime (Dec '24) | OpenAI | | $0.00 |
| EXAONE 4.5 33B (Non-reasoning) | | | $0.00 |
| GPT-4o mini Realtime (Dec '24) | OpenAI | | $0.00 |
| Mi:dm K 2.5 Pro Preview | | | $0.00 |
| Grok 4.3 (Beta) | xAI | | |

Rankings data by Artificial Analysis. CSV imports cover supplementary benchmarks.