Google LLC (Vertex AI)

gemini-3.1-flash-lite-preview

Gemini 3.1 Flash Lite Preview is the most cost-efficient model in the Gemini family, optimized for high-volume, low-latency tasks. It delivers fast responses with solid quality for everyday use cases including summarization, classification, and simple reasoning.

👁Vision🔧Tool calling⚡Caching

Pricing per 1M tokens

Input
$0.25
Output
$1.50
Cache write
$0.08
Cache read
$0.02

Specifications

Context window1.0M tokens
Max output66K tokens
API typechat
AddedMar 3, 2026
Model IDvertex/gemini-3.1-flash-lite-preview

Privacy & data

Data retentionNo
Used for trainingNo
Provider locationđŸ‡ș🇾 US / đŸ‡ȘđŸ‡ș EU
gemini-3.1-flash-lite-preview – Google LLC (Vertex AI) | Requesty