Google LLC (Vertex AI)
gemini-3.1-flash-lite-preview
Gemini 3.1 Flash Lite Preview is the most cost-efficient model in the Gemini family, optimized for high-volume, low-latency tasks. It delivers fast responses with solid quality for everyday use cases including summarization, classification, and simple reasoning.
👁Vision🔧Tool calling⚡Caching
Pricing per 1M tokens
Input
$0.25
Output
$1.50
Cache write
$0.08
Cache read
$0.02
Specifications
Context window1.0M tokens
Max output66K tokens
API typechat
AddedMar 3, 2026
Model IDvertex/gemini-3.1-flash-lite-preview
Privacy & data
Data retentionNo
Used for trainingNo
Provider location🇺🇸 US / 🇪🇺 EU
Privacy policyVertex AI Data Governance →