DeepInfra Inc.
18Serverless inference for machine learning models.
πΊπΈ US|No data retentionNo training14 tools
| Model | Context | Max Output | Input/1M | Output/1M | Capabilities |
|---|---|---|---|---|---|
deepseek-ai/DeepSeek-V3.1 | 164K | β | $0.30 | $1.00 | π§ |
zai-org/GLM-4.5 | 131K | 4K | $0.60 | $2.20 | π§ |
zai-org/GLM-4.5-Air | 131K | 4K | $0.20 | $1.10 | π§ |
Qwen/Qwen3-Coder-480B-A35B-Instruct | 262K | β | $0.40 | $1.60 | π§ |
phi-4 | 16K | β | $0.07 | $0.14 | |
Qwen/Qwen3-235B-A22B | 40K | 4K | $0.20 | $0.60 | π§ |
Qwen/Qwen2.5-72B-Instruct | 131K | β | $0.23 | $0.40 | π§ |
Qwen/Qwen3-32B | 40K | β | $0.10 | $0.30 | π§ |
Qwen/Qwen2.5-Coder-32B-Instruct | 16K | β | $0.07 | $0.16 | π§ |
meta-llama/Meta-Llama-3.1-405B-Instruct | 131K | β | $0.80 | $0.80 | |
meta-llama/Meta-Llama-3.1-70B-Instruct | 131K | β | $0.23 | $0.40 | π§ |
meta-llama/Llama-3.3-70B-Instruct-Turbo | 131K | β | $0.12 | $0.30 | π§ |
meta-llama/Llama-3.3-70B-Instruct | 131K | β | $0.23 | $0.40 | π§ |
meta-llama/Llama-3.2-90B-Vision-Instruct | 131K | 4K | $0.35 | $0.40 | |
deepseek-ai/DeepSeek-V3 | 128K | 8K | $0.85 | $0.90 | π§ |
deepseek-ai/DeepSeek-R1 | 64K | 8K | $0.85 | $2.50 | π§ |
deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 64K | 8K | $0.23 | $0.69 | |
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | 131K | β | $0.02 | $0.05 | π§ |
