DeepSeek

deepseek-v4-flash

DeepSeek V4 Flash delivers high speed performance and efficiency for modern AI tasks. It excels in reasoning, coding, and tool use with a 1M context window. The model supports both non thinking and thinking modes, providing flexibility for agentic workflows and complex implementations. It includes support for JSON output and tool calling.

🧠Reasoning🔧Tool calling⚡Caching

Pricing per 1M tokens

Input

$0.14

Output

$0.28

Cache write

$0.14

Cache read

$0.03

Specifications

Context window1M tokens

Max output384K tokens

API typechat

AddedApr 24, 2026

Model IDdeepseek/deepseek-v4-flash

Privacy & data

Data retentionYes

Used for trainingUnknown

Provider location🇨🇳 China

Privacy policyDeepSeek Privacy Policy →

Try with Requesty All DeepSeek models