SENTIENT WEEKLY
Signal in the AI noise.
The Sentient
Vox Machina
Model Rankings
News
Companies
Best Practices
Issue 003
Sign in
LLM
Gemini 3 Pro Preview (high)
Overall Score
48.4
☆ Save
Released Nov 2025
Benchmark Scores
Reasoning
—
Coding
46.5
Math
95.7
Creative writing
—
Instruction following
—
Multimodal
—
Standard Benchmarks
aa_mmlu_pro
aa_mmlu_pro
0.9
Measured 2026-05-13
·
source
aa_gpqa
aa_gpqa
0.9
Measured 2026-05-13
·
source
aa_hle
aa_hle
0.4
Measured 2026-05-13
·
source
aa_livecodebench
aa_livecodebench
0.9
Measured 2026-05-13
·
source
aa_scicode
aa_scicode
0.6
Measured 2026-05-13
·
source
aa_aime_25
aa_aime_25
1.0
Measured 2026-05-13
·
source
aa_ifbench
aa_ifbench
0.7
Measured 2026-05-13
·
source
aa_lcr
aa_lcr
0.7
Measured 2026-05-13
·
source
aa_terminalbench_hard
aa_terminalbench_hard
0.4
Measured 2026-05-13
·
source
aa_tau2
aa_tau2
0.9
Measured 2026-05-13
·
source
Price
—
Context
—
Speed
126.75 t/s
24440ms TTFT
Compare this model →
Discussion
0 comments
Sign in
to join the conversation.
Be the first to comment.