Z AI

Z AI (formerly Zhipu AI) provides advanced large language models with strong agentic and coding capabilities.

πŸ“ πŸ‡ΈπŸ‡¬ Singaporeβ€’2 models availableβ€’Visit Website β†’
2
Available Models
$0.6
Avg Input Price/M
$0.6
Cheapest Model
zai/GLM-4.5
$0.6
Most Expensive
zai/GLM-4.5

Features Overview

0
Vision Support
2
Advanced Reasoning
0
Caching Support
0
Computer Use

Privacy & Data Policy

Data Retention

No data retention

Location

πŸ‡ΈπŸ‡¬ Singapore
Reasoning
Context Window
131K tokens
Max Output
98K tokens
Input
$0.6/M tokens
Output
$2.20/M tokens

GLM-4.5 and GLM-4.5-Air are Z AI's latest flagship models, purpose-built as foundational models for agent-oriented applications. Both leverage a Mixture-of-Experts (MoE) architecture. GLM-4.5 has a total parameter count of 355B with 32B active parameters per forward pass, while GLM-4.5-Air adopts a more streamlined design with 106B total parameters and 12B active parameters.

Reasoning
Context Window
200K tokens
Max Output
128K tokens
Input
$0.6/M tokens
Output
$2.20/M tokens

GLM-4.6 is Z AI’s latest flagship model, designed to push agentic and coding performance further. It expands the context window from 128K to 200K tokens, improves reasoning and tool-use capabilities, and delivers stronger results in coding benchmarks and real-world development workflows. GLM-4.6 demonstrates refined writing quality, more capable agent behavior, and higher token efficiency (β‰ˆ15% fewer tokens vs. GLM-4.5). Evaluations show clear gains over GLM-4.5 across reasoning, agents, and coding, reaching near parity with Claude Sonnet 4 in practical tasks while outperforming other open-source baselines. GLM-4.6 is available through the Z.ai API platform, OpenRouter, coding agents (Claude Code, Roo Code, Cline, Kilo Code), and soon as downloadable weights on HuggingFace and ModelScope.

Ready to use Z AI models?

Access all Z AI models through Requesty's unified API with intelligent routing, caching, and cost optimization.