Requesty
Google LLC (Vertex AI)

kimi-k2

Kimi K2 Thinking is an open-source model that operates as a "thinking agent," reasoning step-by-step while using tools to achieve state-of-the-art performance on various benchmarks. It is capable of executing up to 200-300 sequential tool calls without human intervention, allowing it to solve complex problems across a wide range of tasks. The model uses Quantization-Aware Training (QAT) to support INT4 inference, which provides a roughly 2x improvement in generation speed.

πŸ‘Vision🧠ReasoningπŸ”§Tool calling⚑Caching

Pricing per 1M tokens

Input
$0.60
Output
$2.50
Cache write
$2.50
Cache read
$0.06

Specifications

Context window262K tokens
Max output262K tokens
API typechat
AddedApr 16, 2026
Model IDvertex/kimi-k2

Privacy & data

Data retentionNo
Used for trainingNo
Provider locationπŸ‡ΊπŸ‡Έ US / πŸ‡ͺπŸ‡Ί EU