Google LLC (Gemini API)

gemini-3.1-flash-lite-preview

Gemini 3.1 Flash Lite Preview is the most cost-efficient model in the Gemini family, optimized for high-volume, low-latency tasks. It delivers fast responses with solid quality for everyday use cases including summarization, classification, and simple reasoning.

👁 Vision · 🔧 Tool calling · ⚡ Caching

Pricing per 1M tokens

Input: $0.25
Output: $1.50
Cache write: $0.08
Cache read: $0.02
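
As a rough illustration of how these per-1M-token rates turn into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and its parameters are hypothetical and not part of any provider API. It assumes each token category is billed independently at its listed rate; how the provider actually counts cached tokens against regular input tokens is not stated on this page.

```python
# Prices in USD per 1M tokens, copied from the pricing list above.
PRICE_PER_MILLION = {
    "input": 0.25,
    "output": 1.50,
    "cache_write": 0.08,
    "cache_read": 0.02,
}


def estimate_cost(input_tokens=0, output_tokens=0,
                  cache_write_tokens=0, cache_read_tokens=0):
    """Estimate the USD cost of one request.

    Hypothetical helper: assumes every token category is billed
    separately at its listed per-1M-token rate.
    """
    usage = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_write": cache_write_tokens,
        "cache_read": cache_read_tokens,
    }
    total = sum(count * PRICE_PER_MILLION[kind] for kind, count in usage.items())
    return total / 1_000_000


# Example: a 2,000-token prompt with a 500-token reply.
print(f"${estimate_cost(input_tokens=2_000, output_tokens=500):.6f}")  # $0.001250
```

Under these assumptions, output tokens dominate the cost of short exchanges, since the output rate is six times the input rate.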

Specifications

Context window: 1.0M tokens
Max output: 66K tokens
API type: chat
Added: Mar 3, 2026
Model ID: google/gemini-3.1-flash-lite-preview

Privacy & data

Data retention: Yes
Used for training: Unknown
Provider location: 🌍 Global