DeepInfra Inc.

Serverless inference for machine learning models.

🇺🇸 US|No data retentionNo training1 vision15 tools1 caching

Model	Context	Max Output	Input/1M	Output/1M	Capabilities
Kimi K2.5	262K	131K	$0.45	$2.25	👁🔧⚡
deepseek-ai/DeepSeek-V3.1	164K	—	$0.30	$1.00	🔧
zai-org/GLM-4.5	131K	4K	$0.60	$2.20	🔧
zai-org/GLM-4.5-Air	131K	4K	$0.20	$1.10	🔧
Qwen/Qwen3-Coder-480B-A35B-Instruct	262K	—	$0.40	$1.60	🔧
phi-4	16K	—	$0.07	$0.14
Qwen/Qwen2.5-72B-Instruct	131K	—	$0.23	$0.40	🔧
Qwen/Qwen3-32B	40K	—	$0.10	$0.30	🔧
Qwen/Qwen2.5-Coder-32B-Instruct	16K	—	$0.07	$0.16	🔧
Qwen/Qwen3-235B-A22B	40K	4K	$0.20	$0.60	🔧
meta-llama/Meta-Llama-3.1-405B-Instruct	131K	—	$0.80	$0.80
meta-llama/Llama-3.3-70B-Instruct-Turbo	131K	—	$0.12	$0.30	🔧
meta-llama/Llama-3.2-90B-Vision-Instruct	131K	4K	$0.35	$0.40
meta-llama/Meta-Llama-3.1-70B-Instruct	131K	—	$0.23	$0.40	🔧
meta-llama/Llama-3.3-70B-Instruct	131K	—	$0.23	$0.40	🔧
deepseek-ai/DeepSeek-R1	64K	8K	$0.85	$2.50	🔧
deepseek-ai/DeepSeek-V3	128K	8K	$0.85	$0.90	🔧
deepseek-ai/DeepSeek-R1-Distill-Llama-70B	64K	8K	$0.23	$0.69
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo	131K	—	$0.02	$0.05	🔧