Requesty

DeepInfra Inc.

18 models

Serverless inference for machine learning models.

🇺🇸 US | No data retention | No training | 14 tools
| Model | Context | Max Output | Input / 1M tokens | Output / 1M tokens | Capabilities |
| --- | --- | --- | --- | --- | --- |
| deepseek-ai/DeepSeek-V3.1 | 164K | - | $0.30 | $1.00 | 🔧 |
| zai-org/GLM-4.5 | 131K | 4K | $0.60 | $2.20 | 🔧 |
| zai-org/GLM-4.5-Air | 131K | 4K | $0.20 | $1.10 | 🔧 |
| Qwen/Qwen3-Coder-480B-A35B-Instruct | 262K | - | $0.40 | $1.60 | 🔧 |
| phi-4 | 16K | - | $0.07 | $0.14 | |
| Qwen/Qwen3-235B-A22B | 40K | 4K | $0.20 | $0.60 | 🔧 |
| Qwen/Qwen2.5-72B-Instruct | 131K | - | $0.23 | $0.40 | 🔧 |
| Qwen/Qwen3-32B | 40K | - | $0.10 | $0.30 | 🔧 |
| Qwen/Qwen2.5-Coder-32B-Instruct | 16K | - | $0.07 | $0.16 | 🔧 |
| meta-llama/Meta-Llama-3.1-405B-Instruct | 131K | - | $0.80 | $0.80 | |
| meta-llama/Meta-Llama-3.1-70B-Instruct | 131K | - | $0.23 | $0.40 | 🔧 |
| meta-llama/Llama-3.3-70B-Instruct-Turbo | 131K | - | $0.12 | $0.30 | 🔧 |
| meta-llama/Llama-3.3-70B-Instruct | 131K | - | $0.23 | $0.40 | 🔧 |
| meta-llama/Llama-3.2-90B-Vision-Instruct | 131K | 4K | $0.35 | $0.40 | |
| deepseek-ai/DeepSeek-V3 | 128K | 8K | $0.85 | $0.90 | 🔧 |
| deepseek-ai/DeepSeek-R1 | 64K | 8K | $0.85 | $2.50 | 🔧 |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 64K | 8K | $0.23 | $0.69 | |
| meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | 131K | - | $0.02 | $0.05 | 🔧 |
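
The per-million-token prices in this listing translate into a per-request cost by scaling each price by the number of tokens used. The following is a minimal sketch of that arithmetic, not an official billing formula: the names `PRICES` and `estimate_cost` are hypothetical, only three of the listed models are included, and any fees, discounts, or rounding rules not shown in the listing are ignored.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the listing.
# Only a few models are included here for illustration.
PRICES = {
    "deepseek-ai/DeepSeek-V3.1": (Decimal("0.30"), Decimal("1.00")),
    "zai-org/GLM-4.5": (Decimal("0.60"), Decimal("2.20")),
    "meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo": (Decimal("0.02"), Decimal("0.05")),
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost is linear in token counts with no minimum charge.
    """
    input_price, output_price = PRICES[model]
    return (input_price * input_tokens + output_price * output_tokens) / ONE_MILLION


if __name__ == "__main__":
    # 10,000 input tokens and 2,000 output tokens on DeepSeek-V3.1.
    cost = estimate_cost("deepseek-ai/DeepSeek-V3.1", 10_000, 2_000)
    print(f"${cost:.4f}")
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when summed over many requests.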