Requesty

Compare 472+ AI models.

Flagship and open-weight models from OpenAI, Anthropic, Google, AWS Bedrock, Azure, DeepSeek, Meta, xAI, Mistral, Moonshot and more, through one OpenAI-compatible API with zero markup.

Frequently asked questions

Everything you need to know about accessing hundreds of AI models through a single API.

How many AI models can I access through Requesty?
Requesty routes to 472+ models across 24 providers, including OpenAI, Anthropic, Google Gemini, AWS Bedrock, Azure OpenAI, Google Vertex AI, DeepSeek, Meta Llama, xAI Grok, Mistral, Moonshot Kimi, Alibaba Qwen, Zhipu GLM and MiniMax. Use any of them through a single OpenAI-compatible API.
Does Requesty charge markup on top of provider pricing?
No. Requesty passes through exactly what the upstream provider charges. You pay the same per-token rate as going direct to OpenAI, Anthropic or Google — and you get smart routing, automatic failover, prompt caching, analytics, and a single unified API included. Requesty makes money on a small platform fee for enterprise features, not on per-token markup.
Which model is best for coding?
On SWE-Bench Verified — the most realistic coding benchmark, based on real GitHub issues — GPT-5.2 Codex, Claude Opus 4.7 and Claude Sonnet 4.6 currently lead. MiniMax M2.5 is the strongest open-weights option. See the "Best for coding" leaderboard above for live rankings, and each model detail page for full benchmark charts.
Which model is best for reasoning and math?
For graduate-level reasoning (GPQA Diamond), GPT-5.4, Grok 4 and Claude Opus 4.7 lead the pack. For math (AIME, MATH benchmarks), GPT-5.4 and Grok 4 currently top the charts, with DeepSeek R1 offering strong performance at a fraction of the price.
What is the longest context window available?
Several models now support 1M+ token context windows — great for whole-codebase analysis or long document reasoning. Gemini 2.5 Pro and some Claude variants lead on context length. Note that effective quality often degrades past 128K tokens; prompt caching (supported on many models) is usually a better approach for repeated long context.
Are there free AI models I can use?
Yes. A number of models on Requesty have a zero-cost tier, including several Llama variants and DeepSeek models via third-party hosts. They're ideal for prototyping and development. You can filter by "Free" in the model explorer above.
How do I switch between models in my code?
Requesty is OpenAI-SDK compatible. Point base_url to "https://router.requesty.ai/v1", set your API key, and change the "model" parameter to any supported model ID (e.g. "anthropic/claude-opus-4-7", "openai/gpt-5.2", "google/gemini-2.5-pro"). No library changes needed — the same code works across providers.
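Because the router speaks the OpenAI wire format, the same switch also works at the raw HTTP level. Here is a minimal stdlib-only sketch; the /chat/completions path follows the standard OpenAI convention, and the API key placeholder is illustrative:

```python
import json
import urllib.request

REQUESTY_BASE = "https://router.requesty.ai/v1"

def build_chat_request(model_id: str, prompt: str, api_key: str) -> urllib.request.Request:
    """Build an OpenAI-compatible /chat/completions request.

    Only the `model` field changes when switching providers.
    """
    body = json.dumps({
        "model": model_id,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{REQUESTY_BASE}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

# Swapping providers is a one-string change:
req = build_chat_request("anthropic/claude-opus-4-7", "Explain mutexes.", "YOUR_API_KEY")
```

Sending it is then just `urllib.request.urlopen(req)`; with the OpenAI SDK you would instead pass `base_url` and `api_key` to the client constructor and call it normally.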
Is my data private? Is it used for training?
Most major providers (Anthropic, Vertex AI, Azure OpenAI, AWS Bedrock) do not use API data for training by default. OpenAI offers zero-retention deployments via enterprise tiers. Each model detail page shows the specific data retention and training policy for that provider. Requesty itself never uses your data for training.
Can I get regional deployments (EU, US, APAC)?
Yes. Models available through AWS Bedrock, Azure OpenAI, and Google Vertex AI can be pinned to specific regions (eu-west-1, us-east5, etc.) using the @region suffix. Useful for GDPR, HIPAA, and data residency requirements. Filter by Region in the explorer to see all options.
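As a sketch of how that might look in code, assuming the @region suffix attaches directly to the model ID (the exact placement and the model ID shown are illustrative, not confirmed syntax):

```python
def pin_region(model_id: str, region: str) -> str:
    """Append an @region suffix to a model ID, e.g. for EU data residency.

    Both the model ID and the suffix placement here are assumptions
    for illustration; check the explorer for the exact supported form.
    """
    return f"{model_id}@{region}"

model = pin_region("anthropic/claude-opus-4-7", "eu-west-1")
# model == "anthropic/claude-opus-4-7@eu-west-1"
```

The pinned ID then goes in the same "model" parameter as any other model, so region selection stays a one-string change too.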
How are benchmark scores calculated?
Benchmark scores shown on Requesty are sourced from official model cards, Artificial Analysis, and public leaderboards (LiveBench, SWE-Bench, Vellum). Scores measure specific skills and do not capture every aspect of model quality — always test on your own workload. Each model detail page links the canonical benchmark sources.