Requesty
DeepSeek

deepseek-v4-flash

DeepSeek V4 Flash delivers high speed performance and efficiency for modern AI tasks. It excels in reasoning, coding, and tool use with a 1M context window. The model supports both non thinking and thinking modes, providing flexibility for agentic workflows and complex implementations. It includes support for JSON output and tool calling.

🧠Reasoning🔧Tool callingCaching

Pricing per 1M tokens

Input
$0.14
Output
$0.28
Cache write
$0.14
Cache read
$0.03

Specifications

Context window1M tokens
Max output384K tokens
API typechat
AddedApr 24, 2026
Model IDdeepseek/deepseek-v4-flash

Privacy & data

Data retentionYes
Used for trainingUnknown
Provider location🇨🇳 China