Overall Score
57.2
Google's flagship reasoning + research model
Released Feb 2026
Benchmark Scores
Reasoning
9.3
Coding
55.5
Math
—
Creative writing
8.5
Instruction following
9.0
Multimodal
9.5
Standard Benchmarks
Price
$7 / 1M input · $21 / 1M output
Paid
Context
2M tokens
Speed
135.33 t/s
29889ms TTFT
Overview
Google DeepMind's flagship reasoning + research model. The 2M-token context is unmatched for true long-form document and codebase work. Tight integration with Google Search and Workspace.
Strengths
Largest context window (2M) · strong on research + factual grounding · great Search integration · competitive pricing
Known limitations
Slightly weaker on creative writing tasks · Workspace lock-in for some features · less mature agentic tool use