DeepSeek: DeepSeek V3 0324

Name: DeepSeek: DeepSeek V3 0324
Brand: DeepSeek
SKU: deepseek/deepseek-chat-v3-0324

DeepSeek · Budget · Context 164K

deepseek/deepseek-chat-v3-0324

Data as of: 2026-07-09 04:00:09 UTC

LLM API list prices change frequently (new models and price cuts are common) and vary by tier, region, batch / cache usage and time. These are list prices captured at the time shown; always verify the current price with the provider before relying on it.

Price summary

Input $/1M $0.24

per 1M input tokens

Output $/1M $0.9

per 1M output tokens

Blended $/1M $0.405

0.75×input + 0.25×output (factual)

Cache read $/1M $0.135

per 1M cached-input tokens

Blended $/1M is a published convenience figure: 0.75 × input + 0.25 × output (a stated 3:1 input:output mix). It is descriptive arithmetic, not a value verdict.

Specifications

Model: DeepSeek: DeepSeek V3 0324
Provider: DeepSeek
Input $/1M: $0.24
Output $/1M: $0.9
In+Out $/1M: $1.14
Context: 164K tokens
Max output: 16K tokens
Cache read $/1M: $0.135
Modalities: text → text
Cross-checked: Differs

Capability

Capability score: —
MMLU-PRO: —
GPQA: —

Capability values are the published per-model score from Open LLM Leaderboard (Hugging Face), shown as-is with no edit and no “best” verdict. The leaderboard evaluates open-weight models only and lags the newest releases, so many models (including closed/proprietary APIs) have no value and show “—”. Different benchmarks rank models differently; treat this as one signal among many. As of 2026-05-25. Open LLM Leaderboard (Hugging Face) (Apache-2.0).

Official benchmark (maker-published)

MMLU (official): 88.5% (5-shot EM)
GPQA-Diamond (official): 59.1% (GPQA-Diamond Pass@1)

These are the model maker's own published benchmark scores, reproduced as-is with the publisher source and an as-of date — not a Quanteta score and not a recommendation. They are raw percentages on the named benchmark and are NOT on the same scale as the open-weight leaderboard scores above; do not compare the two directly. The exact evaluation setting (e.g. 5-shot vs 0-shot chain-of-thought) is shown per value because it changes the number; only same-setting values are plotted together. Source: DeepSeek-V3 Technical Report (arXiv:2412.19437): MMLU 88.5 (EM, 5-shot); GPQA-Diamond Pass@1 59.1 (as of 2024-12-27).

Try it / official references

External links open the provider's own pages; list prices and availability there are authoritative.

Estimated cost per use case

Use case	input tokens	output tokens	Cost (per 1,000 requests)
Chat / assistant	1,000	500	$0.69
RAG / Q&A	8,000	800	$2.64
Coding agent	6,000	2,000	$3.24
Summarization	12,000	600	$3.42

Each row is (input_tokens/1M)×input_price + (output_tokens/1M)×output_price, scaled to 1,000 requests. Assumptions are as shown in the table. Not a recommendation.

← Back to all models