
Operational metrics per provider, April 2026

Share of requests that completed without an upstream error. Moonshot at 6% is a real reliability outlier. Source: Requesty production gateway, April 2026.

How reliable is each LLM provider in production? In April 2026 the top seven providers on the Requesty gateway (OpenAI, Anthropic, Vertex (Gemini), Bedrock, DeepSeek, Novita, xAI) sat at a 95-99% success rate. Mistral trailed at 86%, Vertex (Claude) at 84%, Azure at 78%, and Moonshot at 6%, a real reliability outlier. Streaming adoption is split as well: Azure 68% and Anthropic 57%, Vertex (Claude) at 28%, everyone else under 10%.

Why it matters

Provider success rate translates directly into user-visible failures unless an application has a managed fallback chain. The 95-99% top tier is comfortably reliable. Vertex (Claude) and Azure fail roughly 1 in 6 and 1 in 5 calls respectively, which demands either a routing policy or active provider switching at the application layer to avoid sustained user pain.
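To make the fallback-chain idea concrete, here is a minimal sketch of application-layer provider switching. It is an illustration under stated assumptions, not the Requesty gateway's API: `call_with_fallback`, `AllProvidersFailed`, and the two stand-in provider functions are hypothetical names, and a real client would catch narrower upstream-error types than `Exception`.

```python
from typing import Callable, Sequence


class AllProvidersFailed(Exception):
    """Raised when every provider in the chain returned an error."""


def call_with_fallback(
    prompt: str,
    providers: Sequence[tuple[str, Callable[[str], str]]],
) -> tuple[str, str]:
    """Try each (name, call) pair in order and return (name, response)
    from the first provider that does not raise."""
    errors: list[str] = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # assumption: any exception counts as an upstream error
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


# Hypothetical stand-ins for real provider clients.
def flaky_primary(prompt: str) -> str:
    raise RuntimeError("upstream 503")


def stable_secondary(prompt: str) -> str:
    return f"echo: {prompt}"


used, reply = call_with_fallback(
    "hello", [("primary", flaky_primary), ("secondary", stable_secondary)]
)
print(used, reply)
```

The chain order is the routing policy here: putting a top-tier provider first keeps most traffic off the fallback path, and the collected error messages make it possible to see which upstream failed when the whole chain is exhausted.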

Period: Apr 2026
Updated: May 9, 2026
ID: ops-metrics-april-2026
§ 01

Key findings

  • 01 Success is bimodal: top tier at 95 to 99%, Mistral 86%, Vertex (Claude) 84%, Azure 78%, Moonshot 6%.
  • 02 Streaming adoption is bimodal: Azure 68% and Anthropic 57%. Vertex (Claude) at 28%. Everyone else <10%.
  • 03 Cache hit rate for the same model family runs from 77% on Anthropic direct to 24% on Vertex (Claude), roughly a 3x spread.
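As a rough illustration of how much a fallback can recover from the success rates in these findings, the sketch below computes the chance that at least one provider in a chain succeeds. It assumes failures are independent across providers, which the data here does not establish (correlated outages would make the real number lower), and `chain_success` is a hypothetical helper name.

```python
from math import prod


def chain_success(rates: list[float]) -> float:
    """Probability that at least one provider in the chain succeeds,
    assuming each provider fails independently of the others."""
    return 1.0 - prod(1.0 - r for r in rates)


# Success rates taken from the April 2026 data (Azure 78.0%, OpenAI 98.0%).
azure_alone = chain_success([0.780])
azure_then_openai = chain_success([0.780, 0.980])
print(f"{azure_alone:.2%} -> {azure_then_openai:.2%}")
```

Under the independence assumption, Azure backed by a single top-tier fallback would land near 99.6% effective success, which is why a managed chain matters more for the lower tier than for providers already at 95 to 99%.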
§ 02

Data

Provider          Success rate   Streaming   Cache hit
xAI               99.30%         1.30%       35.70%
DeepSeek          98.30%         2.80%       48.30%
OpenAI            98.00%         7.20%       36.40%
Novita            97.20%         2.30%       31.90%
Anthropic         96.00%         56.90%      77.50%
Vertex (Gemini)   95.90%         3.70%       9.60%
Bedrock           95.60%         9.70%       56.90%
Mistral           86.30%         8.00%       4.10%
Vertex (Claude)   84.40%         27.60%      23.50%
Azure             78.00%         68.30%      41.00%
Moonshot          6.20%          4.80%       88.20%
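The "1 in N calls fail" framing and the cache-hit spread can be derived directly from the table. The sketch below is one way to do that arithmetic; the dictionary values are copied from the rows above, and `failures_one_in` is a hypothetical helper name.

```python
# Success-rate percentages copied from the April 2026 table.
success_pct = {
    "xAI": 99.3,
    "DeepSeek": 98.3,
    "OpenAI": 98.0,
    "Novita": 97.2,
    "Anthropic": 96.0,
    "Vertex (Gemini)": 95.9,
    "Bedrock": 95.6,
    "Mistral": 86.3,
    "Vertex (Claude)": 84.4,
    "Azure": 78.0,
    "Moonshot": 6.2,
}

# Cache-hit percentages for the two Claude-serving providers.
cache_hit_pct = {"Anthropic": 77.5, "Vertex (Claude)": 23.5}


def failures_one_in(success: float) -> float:
    """Convert a success percentage into 'one failure per N requests'."""
    return 100.0 / (100.0 - success)


# List providers from least to most reliable.
for provider, pct in sorted(success_pct.items(), key=lambda kv: kv[1]):
    print(f"{provider:<16} fails about 1 in {failures_one_in(pct):.1f} requests")

# Ratio between the best and worst cache-hit rate within the Claude family.
spread = cache_hit_pct["Anthropic"] / cache_hit_pct["Vertex (Claude)"]
print(f"Claude cache-hit spread: {spread:.1f}x")
```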