Google LLC (Gemini API)

gemini-2.5-flash

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

👁Vision🧠Reasoning🔧Tool calling⚡Caching

Pricing per 1M tokens

Input

$0.30

Output

$2.50

Cache write

$0.55

Cache read

$0.07

Specifications

Context window1.0M tokens

Max output66K tokens

API typechat

AddedMay 20, 2025

Model IDgoogle/gemini-2.5-flash

Privacy & data

Data retentionYes

Used for trainingUnknown

Provider location🌍 Global

Privacy policyGemini API Terms →

Try with Requesty All Google LLC (Gemini API) models