DeepSeekGA

DeepSeek V4

Name: DeepSeek V4
Price: 0.44 USD
Author: DeepSeek

Open-weight frontier-class coding at one-sixth the price — MIT-licensed, 1M context, self-hostable.

Read the full DeepSeek V4 analysis

Context

Max output

384K

Input /1M

$0.44

Output /1M

$0.87

Live pricing via OpenRouter

Best for

Budget coding agents at scale
Open-weight self-hosting and fine-tuning
Cost-anchor for routing decisions

Watch out

Trails the closed frontier (Opus 4.8, GPT-5.5) on agentic SWE, security, and hardest reasoning; CAISI puts it ~8 months behind. Many spec-sheet numbers are vendor-claimed, and the compressed 1M context can degrade exact long-context retrieval.

For creators. Run a capable coding agent on your own hardware (Flash, 284B) without per-token API costs, or use the cheap Pro API for reasoning work.

Benchmarks

swe bench verified	80.6
aa intelligence index	52
livecodebench	93.5
gpqa diamond	90.1
mmlu pro	87.5
aime 2025	87.5
terminal bench 2 0	67.9
humanitys last exam	37.7

Capabilities

Open-weight MIT-licensed frontier-class coding (80.6% SWE-bench Verified)
1M-token context via hybrid CSA+HCA attention (~10% KV cache vs V3.2)
Aggressive cost efficiency (~1/6 the price of Opus-class models)
Dual variants: Pro for reasoning, Flash for high-throughput
Self-hostable; NVFP4 quantization available

Compare DeepSeek V4

DeepSeek V4 vs Claude Opus 4.8

Opus 4.8 is the stronger model outright; DeepSeek V4 is the open-weight value play — frontier-class coding at roughly a third of the price, and you can own the weights.

Qwen3.7-Max vs DeepSeek V4

Qwen3.7-Max has the higher raw intelligence but is closed and API-only; DeepSeek V4 is open-weight MIT, cheaper, and self-hostable. Capability vs control.

Kimi K2.6 vs DeepSeek V4

Kimi K2.6 edges the open-weights intelligence lead; DeepSeek V4 counters with MIT licensing, strong coding, and a deeper ecosystem. Both are self-hostable giants.

Best for

Watch out

Benchmarks

Capabilities

Compare DeepSeek V4

Sources