Best AI models for coding
The Coding Index blends multiple coding evaluations — contamination-free code generation (LiveCodeBench), research-level scientific coding (SciCode), and agentic terminal tasks (Terminal-Bench). It is a broader, harder-to-game signal than any single coding benchmark.
- 🥇
gpt-5.5OpenAI Inc.·$5.00 / $30.00 per 1M59.159.1 - 🥈
gpt-5.4OpenAI Inc.·$2.50 / $15.00 per 1M57.257.2 - 🥉claude-opus-4-8Anthropic PBC·$5.00 / $25.00 per 1M56.756.7
- 4
gemini-3.1-pro-previewGoogle LLC (Gemini API)·$2.00 / $12.00 per 1M55.555.5 - 5
gpt-5.3-codexOpenAI Responses·$1.75 / $14.00 per 1M53.153.1 - 6claude-opus-4-7Anthropic PBC·$5.00 / $25.00 per 1M52.552.5
- 7
gpt-5.4-miniOpenAI Inc.·$0.75 / $4.50 per 1M51.551.5 - 8claude-sonnet-4-6Anthropic PBC·$3.00 / $15.00 per 1M50.950.9
- 9
qwen3.7-maxAlibaba Cloud·$2.50 / $7.50 per 1M50.150.1 - 10
gpt-5.2-chatOpenAI Inc.·$1.75 / $14.00 per 1M48.748.7 - 11claude-opus-4-6Anthropic PBC·$5.00 / $25.00 per 1M48.148.1
- 12claude-opus-4-5Anthropic PBC·$5.00 / $25.00 per 1M47.847.8
- 13
deepseek-v4-proDeepSeek·$0.43 / $0.87 per 1M47.547.5 - 14
kimi-k2.6Moonshot AI·$0.95 / $4.00 per 1M47.147.1 - 15
gemini-3-pro-previewGoogle LLC (Gemini API)·$2.00 / $12.00 per 1M46.546.5 - 16
XiaomiMiMo/MiMo-V2.5-ProDeepInfra Inc.·$1.00 / $3.00 per 1M45.545.5 - 17
gemini-3.5-flashGoogle LLC (Vertex AI)·$1.50 / $9.00 per 1M45.045.0 - 18
gpt-5.1-chatOpenAI Inc.·$1.25 / $10.00 per 1M44.744.7 - 19
GLM-5Z AI·$1.00 / $3.20 per 1M44.244.2 - 20
gpt-5.4-nanoOpenAI Inc.·$0.20 / $1.25 per 1M43.943.9 - 21
GLM-5.1Z AI·$1.40 / $4.40 per 1M43.443.4 - 22minimax-m3MiniMax·$0.30 / $1.20 per 1M43.443.4
- 23
gpt-5.2-codexOpenAI Responses·$1.75 / $14.00 per 1M43.043.0 - 24
qwen3.6-plusAlibaba Cloud·$0.50 / $3.00 per 1M42.942.9 - 25
gemini-3-flash-previewGoogle LLC (Gemini API)·$0.50 / $3.00 per 1M42.642.6 - 26MiniMax-M2.7MiniMax·$0.30 / $1.20 per 1M41.941.9
- 27
xiaomimimo/mimo-v2-proNovita AI·$2.00 / $6.00 per 1M41.441.4 - 28
qwen/qwen3.5-397b-a17bNovita AI·$0.60 / $3.60 per 1M41.341.3 - 29grok-4.3xAI Corp.·$1.25 / $2.50 per 1M41.041.0
- 30grok-4xAI Corp.·$3.00 / $15.00 per 1M40.540.5
Explore other rankings
How we rank
Scores for Coding Index come from Artificial Analysis, an independent AI benchmarking service. When a model is available through multiple providers (e.g. Anthropic direct, AWS Bedrock, Google Vertex), we show one canonical entry per model family so the ranking isn't polluted by duplicates. Benchmarks measure specific skills — always validate on your own workload before committing.
One API for every model on this list
Requesty is OpenAI-compatible and routes to 400+ models. Switch between any of the models above by changing one parameter in your code.
Get started free