Requesty
Google LLC (Vertex AI)@europe-north1

gemini-2.5-flash

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

πŸ‘Vision🧠ReasoningπŸ”§Tool calling⚑Caching

Pricing per 1M tokens

Input
$0.30
Output
$2.50
Cache write
$0.55
Cache read
$0.07

Specifications

Context window1.0M tokens
Max output66K tokens
API typechat
AddedMay 20, 2025
Model IDvertex/gemini-2.5-flash@europe-north1

Privacy & data

Data retentionNo
Used for trainingNo
Provider locationπŸ‡ΊπŸ‡Έ US / πŸ‡ͺπŸ‡Ί EU