Google LLC (Vertex AI)
kimi-k2
Kimi K2 Thinking is an open-source model that operates as a "thinking agent," reasoning step-by-step while using tools to achieve state-of-the-art performance on various benchmarks. It is capable of executing up to 200-300 sequential tool calls without human intervention, allowing it to solve complex problems across a wide range of tasks. The model uses Quantization-Aware Training (QAT) to support INT4 inference, which provides a roughly 2x improvement in generation speed.
πVisionπ§ Reasoningπ§Tool callingβ‘Caching
Pricing per 1M tokens
Input
$0.60
Output
$2.50
Cache write
$2.50
Cache read
$0.06
Specifications
Context window262K tokens
Max output262K tokens
API typechat
AddedApr 16, 2026
Model IDvertex/kimi-k2
Privacy & data
Data retentionNo
Used for trainingNo
Provider locationπΊπΈ US / πͺπΊ EU
Privacy policyVertex AI Data Governance β
