---
id: ops-metrics-april-2026
slug: operational-metrics-by-provider-april-2026
title: "Operational metrics per provider, April 2026"
topic: reliability
period: Apr 2026
updated: 2026-05-09
license: CC BY 4.0
canonical: https://requesty.ai/data/operational-metrics-by-provider-april-2026
---

# Operational metrics per provider, April 2026

> How reliable is each LLM provider in production? In April 2026 the top eight providers on the Requesty gateway (OpenAI, Anthropic, Vertex (Gemini), Bedrock, DeepSeek, Novita, xAI) sat at 95-99% success rate. Azure trailed at 78%, Vertex (Claude) at 84%, Mistral at 86%, and Moonshot at 6%, a real reliability outlier. Streaming adoption is bimodal too: Azure 68%, Anthropic 57%, everyone else under 30%.

*Topic: Reliability and ops. Period: Apr 2026. Last updated 2026-05-09.*

## Why it matters

Provider success rate translates directly into user-visible failures unless an application has a managed fallback chain. The 95-99% top tier is comfortably reliable; Vertex (Claude) and Azure visibly failing roughly 1 in 5 calls demands either a routing policy or active provider switching at the application layer to avoid sustained user pain.

## Questions this answers

- Which LLM provider is most reliable in production?
- What is the success rate of OpenAI vs Anthropic vs Vertex?
- Why do some LLM providers fail more often than others?
- How widely is streaming adopted across LLM providers?

## Key findings

1. Success is bimodal: top tier at 95 to 99%, Vertex (Claude) 84%, Azure 78%, Mistral 86%, Moonshot 6%.
2. Streaming adoption is bimodal: Azure 68% and Anthropic 57%. Vertex (Claude) at 28%. Everyone else <10%.
3. Cache hit rate ranges from Anthropic-direct 77% to Vertex (Claude) 24% (same model family, 3x spread).

## Data

| Provider | Success rate (percent) | Streaming (percent) | Cache hit (percent) |
| --- | --- | --- | --- |
| xAI | 99.30% | 1.30% | 35.70% |
| DeepSeek | 98.30% | 2.80% | 48.30% |
| OpenAI | 98.00% | 7.20% | 36.40% |
| Novita | 97.20% | 2.30% | 31.90% |
| Anthropic | 96.00% | 56.90% | 77.50% |
| Vertex (Gemini) | 95.90% | 3.70% | 9.60% |
| Bedrock | 95.60% | 9.70% | 56.90% |
| Mistral | 86.30% | 8.00% | 4.10% |
| Vertex (Claude) | 84.40% | 27.60% | 23.50% |
| Azure | 78.00% | 68.30% | 41.00% |
| Moonshot | 6.20% | 4.80% | 88.20% |

## Caveats

- Apr 2025 success rates are anomalously low (OpenAI 54%, Anthropic 72%) and are likely under-reported because status_code wasn't being captured then. Mar to Apr 2026 success-rate comparisons are reliable; YoY success-rate deltas should be treated softly.

## Cite as

**APA.** Requesty (2026). Operational metrics per provider, April 2026. Requesty Data. https://requesty.ai/data/operational-metrics-by-provider-april-2026

```bibtex
@misc{requesty_operational_metrics_by_provider_april_2026,
  author       = {{Requesty}},
  title        = {Operational metrics per provider, April 2026},
  year         = {2026},
  howpublished = {\url{https://requesty.ai/data/operational-metrics-by-provider-april-2026}},
  note         = {Requesty Data}
}
```

## Cited in

- [What the gateway saw in April 2026](https://requesty.ai/blog/provider-trends-april-2026-agentic-share-latency)

---

Downloads: [JSON](https://requesty.ai/data/operational-metrics-by-provider-april-2026/data.json) · [CSV](https://requesty.ai/data/operational-metrics-by-provider-april-2026/data.csv) · [Markdown](https://requesty.ai/data/operational-metrics-by-provider-april-2026/data.md)