deepseek-r1-distill-qwen-32b
DeepSeek R1 Distill Qwen 32B is a large language model based on Qwen 2.5 32B and distilled from DeepSeek R1's outputs. It outperforms OpenAI's o1-mini across various benchmarks, setting new state-of-the-art results for dense models. Other benchmark results include:
AIME 2024 pass@1: 72.6
MATH-500 pass@1: 94.3
CodeForces Rating: 1691
Fine-tuning on DeepSeek R1's outputs makes the model competitive with larger frontier models.
Specifications
Benchmarks
Released 2025-01-20
Graduate-level physics, chemistry & biology questions designed to resist Googling.
Artificial Analysis Intelligence Index: a composite of multiple evaluations measuring overall model capability.
Scores are sourced from official model cards, Artificial Analysis, and public leaderboards. Benchmarks measure specific skills and do not capture every aspect of model quality, so always test on your own workload.
Pricing
Requesty charges exactly what the upstream provider charges: no markup, no per-request fees. Prompt caching and smart routing can reduce effective cost by 30-80%.
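To make the cost arithmetic concrete, here is a minimal sketch of how a token-based bill and the quoted 30-80% reduction range might be estimated. The per-million-token prices below are hypothetical placeholders, not the actual rates for this model, and `estimate_cost` is an illustrative helper rather than part of any Requesty API.

```python
def estimate_cost(prompt_tokens, completion_tokens,
                  input_price_per_m, output_price_per_m,
                  reduction=0.0):
    """Estimate the USD cost of a workload billed per million tokens.

    `reduction` is the assumed fractional saving from caching/routing
    (0.30 for 30%, 0.80 for 80%).
    """
    if not 0.0 <= reduction < 1.0:
        raise ValueError("reduction must be in [0, 1)")
    base = (prompt_tokens / 1_000_000) * input_price_per_m \
         + (completion_tokens / 1_000_000) * output_price_per_m
    return base * (1.0 - reduction)


# Hypothetical prices (USD per million tokens); check the pricing table
# for the real upstream rates before relying on any figure.
INPUT_PRICE = 1.00
OUTPUT_PRICE = 2.00

for saving in (0.0, 0.30, 0.80):
    cost = estimate_cost(2_000_000, 500_000, INPUT_PRICE, OUTPUT_PRICE, saving)
    print(f"saving {saving:.0%}: ${cost:.2f}")
```

The actual saving depends on how much of the prompt is cacheable and which providers are routed to, so the range is best treated as a bound to measure against your own traffic.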
Quickstart
Drop-in compatible with the OpenAI SDK. Change the base URL, swap in your Requesty API key, and set the model to novita/deepseek/deepseek-r1-distill-qwen-32b.
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_REQUESTY_API_KEY",
    base_url="https://router.requesty.ai/v1",
)

response = client.chat.completions.create(
    model="novita/deepseek/deepseek-r1-distill-qwen-32b",
    messages=[
        {"role": "user", "content": "Explain quantum computing in one paragraph."},
    ],
)

print(response.choices[0].message.content)
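R1-family models typically emit their reasoning between <think> and </think> tags before the final answer. Whether this endpoint returns that reasoning inline in the message content or in a separate field is not stated here, so the following is a sketch under the assumption that it arrives inline. `split_reasoning` is a hypothetical helper name, and it uses only the Python standard library.

```python
import re

# Matches the first <think>...</think> block, including newlines inside it.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text):
    """Split model output into (reasoning, answer).

    Assumes reasoning, if present, is wrapped in <think>...</think>.
    Returns an empty reasoning string when no such block is found.
    """
    match = _THINK_RE.search(text)
    if match is None:
        return "", text.strip()
    reasoning = match.group(1).strip()
    # Everything outside the matched block is treated as the answer.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


sample = "<think>2 + 2 is 4.</think>\nThe answer is 4."
reasoning, answer = split_reasoning(sample)
print("reasoning:", reasoning)
print("answer:", answer)
```

In the quickstart above, the string passed in would be response.choices[0].message.content. Showing only the answer to end users while logging the reasoning separately is one common design choice.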
Other Novita AI models
Frequently asked questions
How much does deepseek-r1-distill-qwen-32b cost?
What is the context window of deepseek-r1-distill-qwen-32b?
How does deepseek-r1-distill-qwen-32b perform on benchmarks?
What can deepseek-r1-distill-qwen-32b do?
How do I use deepseek-r1-distill-qwen-32b with the OpenAI SDK?
Access deepseek-r1-distill-qwen-32b through Requesty
One API key, 400+ models, OpenAI-compatible. No markup on provider prices, with automatic failover and smart caching built in.

