Novita AI

zai-org/glm-4.6

GLM-4.6 is Z AI’s latest flagship model, designed to push agentic and coding performance further. It expands the context window from 128K to 200K tokens, improves reasoning and tool-use capabilities, and delivers stronger results in coding benchmarks and real-world development workflows. GLM-4.6 demonstrates refined writing quality, more capable agent behavior, and higher token efficiency (≈15% fewer tokens vs. GLM-4.5). Evaluations show clear gains over GLM-4.5 across reasoning, agents, and coding, reaching near parity with Claude Sonnet 4 in practical tasks while outperforming other open-source baselines. GLM-4.6 is available through the Z.ai API platform, OpenRouter, coding agents (Claude Code, Roo Code, Cline, Kilo Code), and soon as downloadable weights on HuggingFace and ModelScope.

🔧Tool calling

Pricing per 1M tokens

Input
$0.60
Output
$2.20

Specifications

Context window205K tokens
Max output131K tokens
API typechat
AddedJul 30, 2025
Model IDnovita/zai-org/glm-4.6

Privacy & data

Data retentionYes
Used for trainingUnknown
Provider locationđŸ‡ș🇾 US
zai-org/glm-4.6 – Novita AI | Requesty