SENTIENT WEEKLY
Signal in the AI noise.
The Sentient
Vox Machina
Model Rankings
News
Companies
Best Practices
Issue 003
Sign in
LLM
DeepSeek V4 Flash (Reasoning, High Effort)
Overall Score
44.9
☆ Save
Released Apr 2026
Benchmark Scores
Reasoning
—
Coding
39.8
Math
—
Creative writing
—
Instruction following
—
Multimodal
—
Standard Benchmarks
aa_gpqa
aa_gpqa
0.9
Measured 2026-05-13
·
source
aa_hle
aa_hle
0.3
Measured 2026-05-13
·
source
aa_scicode
aa_scicode
0.4
Measured 2026-05-13
·
source
aa_ifbench
aa_ifbench
0.7
Measured 2026-05-13
·
source
aa_lcr
aa_lcr
0.6
Measured 2026-05-13
·
source
aa_terminalbench_hard
aa_terminalbench_hard
0.4
Measured 2026-05-13
·
source
aa_tau2
aa_tau2
1.0
Measured 2026-05-13
·
source
Price
—
Context
—
Speed
0 t/s
0ms TTFT
Compare this model →
Discussion
0 comments
Sign in
to join the conversation.
Be the first to comment.