SENTIENT WEEKLY
Signal in the AI noise.
The Sentient
Vox Machina
Model Rankings
News
Companies
Best Practices
Issue 003
Sign in
LLM
DeepSeek V4 Pro (Non-reasoning)
Overall Score
39.3
☆ Save
Released Apr 2026
Benchmark Scores
Reasoning
—
Coding
38.4
Math
—
Creative writing
—
Instruction following
—
Multimodal
—
Standard Benchmarks
aa_gpqa
aa_gpqa
0.7
Measured 2026-05-13
·
source
aa_hle
aa_hle
0.1
Measured 2026-05-13
·
source
aa_scicode
aa_scicode
0.4
Measured 2026-05-13
·
source
aa_ifbench
aa_ifbench
0.5
Measured 2026-05-13
·
source
aa_lcr
aa_lcr
0.5
Measured 2026-05-13
·
source
aa_terminalbench_hard
aa_terminalbench_hard
0.4
Measured 2026-05-13
·
source
aa_tau2
aa_tau2
0.9
Measured 2026-05-13
·
source
Price
—
Context
—
Speed
30.44 t/s
1121ms TTFT
Compare this model →
Discussion
0 comments
Sign in
to join the conversation.
Be the first to comment.