Requesty
Novita AI logo

Novita AI

AI-powered creative tools and model hosting. Requesty routes to 43 Novita AI models starting at $0.03 per 1M input tokens with context windows up to 1.0M tokens. One API key, OpenAI-compatible SDK, no markup.

Intelligence Index
51.4
Coding Index
43.4
GPQA Diamond
86.8%
Terminal-Bench Hard
43.2%

All Novita AI models

ModelContextMax OutputInput/1MOutput/1MCapabilitiesCoding
zai-org/glm-5.1
205Kβ€”$1.38$4.40
πŸ§ πŸ”§
43
xiaomimimo/mimo-v2-pro
1.0Mβ€”$2.00$6.00
πŸ§ πŸ”§
41
xiaomimimo/mimo-v2-flash
262Kβ€”$0.10$0.30
πŸ§ πŸ”§
32
minimax/minimax-m2.7-highspeed
205Kβ€”$0.60$2.40
πŸ§ πŸ”§
β€”
kwaipilot/kat-coder-pro
256Kβ€”$0.30$1.20
πŸ”§
β€”
inclusionai/ring-2.6-1t
262Kβ€”$0.30$2.50
πŸ§ πŸ”§
33
inclusionai/ling-2.6-flash
262Kβ€”$0.10$0.30
πŸ”§
23
inclusionai/ling-2.6-1t
262Kβ€”$0.30$2.50
πŸ”§
33
deepseek-v4-flash
1.0Mβ€”$0.14$0.28
πŸ§ πŸ”§
39
baidu/ernie-4.5-300b-a47b-paddle
123Kβ€”$0.28$1.10
β€”
baichuan/baichuan-m2-32b
131Kβ€”$0.07$0.07
β€”
gemma-4-26b-a4b-it
262K131K$0.13$0.40
πŸ‘πŸ§ πŸ”§
β€”
GLM-5
203K131K$1.00$3.20
πŸ§ πŸ”§βš‘
44
minimax/minimax-m2.7
200K128K$0.30$1.20
πŸ‘πŸ§ πŸ”§βš‘
42
deepseek-v3.2
164K66K$0.27$0.40
πŸ§ πŸ”§βš‘
37
qwen/qwen3.5-397b-a17b
262K66K$0.60$3.60
πŸ‘πŸ§ πŸ”§
41
zai-org/glm-4.5
131Kβ€”$0.60$2.20
πŸ”§
β€”
zai-org/glm-4.6
205K131K$0.60$2.20
πŸ”§
30
moonshotai/kimi-k2-instruct
131Kβ€”$0.57$2.30
πŸ”§
35
deepseek-v3-0324
128Kβ€”$0.40$1.30
πŸ”§
22
deepseek_v3
64Kβ€”$0.89$0.89
πŸ”§
β€”
deepseek-v3-turbo
128Kβ€”$0.40$1.30
πŸ”§
16
qwen/qwen-2.5-72b-instruct
32Kβ€”$0.38$0.40
πŸ”§
β€”
qwen/qwen2.5-vl-72b-instruct
96Kβ€”$0.80$0.80
β€”
qwen/qwen3-235b-a22b-fp8
128Kβ€”$0.20$0.80
β€”
meta-llama/llama-3-8b-instruct
8Kβ€”$0.04$0.04
β€”
meta-llama/llama-3-70b-instruct
8Kβ€”$0.51$0.74
β€”
wizardlm-2-8x22b
66Kβ€”$0.62$0.62
β€”
meta-llama/llama-3.3-70b-instruct
131Kβ€”$0.39$0.39
πŸ”§
β€”
meta-llama/llama-3.2-3b-instruct
33Kβ€”$0.03$0.05
πŸ”§
β€”
meta-llama/llama-3.1-8b-instruct
16Kβ€”$0.05$0.05
β€”
deepseek-r1
64Kβ€”$4.00$4.00
πŸ”§
24
meta-llama/llama-4-maverick-17b-128e-instruct-fp8
1.0M1.0M$0.20$0.85
β€”
deepseek-r1-distill-qwen-14b
128Kβ€”$0.15$0.15
πŸ”§
β€”
deepseek-prover-v2-671b
160Kβ€”$0.70$2.50
β€”
deepseek-r1-turbo
64Kβ€”$0.70$2.50
πŸ”§
24
deepseek-r1-distill-llama-70b
32Kβ€”$0.80$0.80
11
deepseek-r1-distill-qwen-32b
13Kβ€”$0.30$0.30
πŸ”§
β€”
sao10k/l3-8b-lunaris
8Kβ€”$0.05$0.05
β€”
sao10k/l31-70b-euryale-v2.2
16Kβ€”$1.48$1.48
β€”
Sao10K/L3-8B-Stheno-v3.2
8Kβ€”$0.05$0.05
β€”
mistralai/mistral-nemo
131Kβ€”$0.17$0.17
β€”
gryphe/mythomax-l2-13b
4Kβ€”$0.09$0.09
πŸ”§
β€”

About Novita AI on Requesty

How many Novita AI models are available through Requesty?
Requesty routes to 43 Novita AI models including regional variants, with pricing synced in real time to the upstream provider.
What is the cheapest Novita AI model?
The cheapest Novita AI model starts at $0.03 per million input tokens. See the pricing column in the table below for full per-model rates.
Does Requesty add markup on Novita AI pricing?
No. Requesty passes through exactly what Novita AI charges. You pay the same per-token rates as going direct β€” plus you get smart routing, caching, analytics, and one unified API for 400+ models.
Is my data used to train Novita AI models?
Novita AI's training policy varies by product and tier. See their privacy policy for specifics, and contact Requesty for enterprise-grade data controls.
Where are Novita AI models hosted?
Novita AI models are hosted in πŸ‡ΊπŸ‡Έ US. Some models are available in additional regions through AWS Bedrock, Azure, or Google Vertex AI β€” filter by region on the Novita AI rows in the models explorer.