DeepSeek has released updated pricing and model details for its API. Two models are available: DeepSeek-V4-Flash and DeepSeek-V4-Pro, both supporting 1M context length and up to 384K output tokens. V4-Flash is priced at $0.14/1M input tokens (cache miss) and $0.28/1M output tokens. V4-Pro is currently discounted 75% to $0.435/1M input (cache miss) and $0.87/1M output, with the discount running until May 31, 2026. Cache hit prices were reduced to 1/10 of launch price as of April 26, 2026. The legacy model names deepseek-chat and deepseek-reasoner will be deprecated, mapping to V4-Flash non-thinking and thinking modes respectively. Both OpenAI and Anthropic API formats are supported.

2m read timeFrom api-docs.deepseek.com
Post cover image
Table of contents
Model Details ​Deduction Rules ​

Sort: