Google LLC (Gemini API)
gemini-2.5-flash
Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.
👁Vision🧠Reasoning🔧Tool calling⚡Caching
Pricing per 1M tokens
Input
$0.30
Output
$2.50
Cache write
$0.55
Cache read
$0.07
Specifications
Context window1.0M tokens
Max output66K tokens
API typechat
AddedMay 20, 2025
Model IDgoogle/gemini-2.5-flash
