LLM

DeepSeek V4 Flash (Reasoning, Max Effort)

Overall Score
40.3
Released Apr 2026
Benchmark Scores
Reasoning
Coding
56.2
Math
Creative writing
Instruction following
Multimodal
Standard Benchmarks
aa_terminalbench_hardaa_terminalbench_hard
0.4
Measured 2026-06-28 · source
aa_terminalbench_v2_1aa_terminalbench_v2_1
0.6
Measured 2026-06-28 · source
aa_tau2aa_tau2
0.9
Measured 2026-06-28 · source
aa_tau_bankingaa_tau_banking
0.2
Measured 2026-06-28 · source
aa_gpqaaa_gpqa
0.9
Measured 2026-06-28 · source
aa_hleaa_hle
0.3
Measured 2026-06-28 · source
aa_scicodeaa_scicode
0.5
Measured 2026-06-28 · source
aa_ifbenchaa_ifbench
0.8
Measured 2026-06-28 · source
aa_lcraa_lcr
0.6
Measured 2026-06-28 · source
Price
Context
Speed
113.8 t/s
990ms TTFT
Compare this model →

Discussion

0 comments

Sign in to join the conversation.

Be the first to comment.