The prices listed below are in units of per 1M tokens. A token, the smallest unit of text that the model recognizes, can be a word, a number, or even a punctuation mark. We will bill based on the total number of input and output tokens by the model.

Hacker News is a community-driven platform for sharing and discussing technology news, startups, and programming-related topics. Through user submissions and comments, Hacker News offers insights into emerging technology trends, industry developments, and entrepreneurial ventures. Readers can participate in discussions, share their insights, and stay informed about the latest advancements in technology and innovation.

Hacker News

DeepSeek has released updated pricing and model details for its API. Two models are available: DeepSeek-V4-Flash and DeepSeek-V4-Pro, both supporting 1M context length and up to 384K output tokens. V4-Flash is priced at $0.14/1M input tokens (cache miss) and $0.28/1M output tokens. V4-Pro is currently discounted 75% to $0.435/1M input (cache miss) and $0.87/1M output, with the discount running until May 31, 2026. Cache hit prices were reduced to 1/10 of launch price as of April 26, 2026. The legacy model names deepseek-chat and deepseek-reasoner will be deprecated, mapping to V4-Flash non-thinking and thinking modes respectively. Both OpenAI and Anthropic API formats are supported.

Models & Pricing