Google LLC (Vertex AI)

gemini-3.1-flash-lite-preview

Gemini 3.1 Flash Lite Preview is the most cost-efficient model in the Gemini family, optimized for high-volume, low-latency tasks. It delivers fast responses with solid quality for everyday use cases including summarization, classification, and simple reasoning.

👁Vision🔧Tool calling⚡Caching

Pricing per 1M tokens

Input

$0.25

Output

$1.50

Cache write

$0.08

Cache read

$0.02

Specifications

Context window1.0M tokens

Max output66K tokens

API typechat

AddedMar 3, 2026

Model IDvertex/gemini-3.1-flash-lite-preview

Privacy & data

Data retentionNo

Used for trainingNo

Provider location🇺🇸 US / 🇪🇺 EU

Privacy policyVertex AI Data Governance →

Try with Requesty All Google LLC (Vertex AI) models