Requesty: Production-Grade LLM Router

The intelligent LLM router for AI platform teams, MLEs, and Heads of AI. Route requests across 500+ models with automatic failover, cost optimization, and latency-based routing. Drop-in OpenAI SDK replacement.

POST /v1/chat/completions
{
  "model": "the_best_model",
  "messages": [...]
}
Routes to: Claude Sonnet 4.5, Gemini 2.5 Pro, GPT-5, GLM-4.6, Llama 3.3 70B, DeepSeek V3

What is LLM Routing?

LLM routing intelligently distributes AI requests across multiple models and providers based on cost, latency, quality, and availability. Instead of hard-coding a single model, Requesty automatically selects the optimal model for each request—enabling failover, A/B testing, cost optimization, and performance tuning without code changes.
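
As a concrete sketch of what that drop-in behavior looks like with the Python OpenAI SDK (the router endpoint is the one given in the FAQ below; the model identifier and environment variable name are illustrative placeholders, not confirmed Requesty conventions):

import os
from openai import OpenAI

# Point the standard OpenAI SDK at the Requesty router instead of a
# single provider. Only the base URL and API key change.
client = OpenAI(
    base_url="https://router.requesty.ai/v1",
    api_key=os.environ["REQUESTY_API_KEY"],  # placeholder variable name
)

# The request shape is unchanged; Requesty applies its routing policies
# (cost, latency, quality, availability) behind the same interface.
response = client.chat.completions.create(
    model="openai/gpt-4o",  # illustrative identifier; see the model list for exact names
    messages=[{"role": "user", "content": "Summarize this support ticket in one sentence."}],
)
print(response.choices[0].message.content)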

Measurable Impact on Your AI Infrastructure

Real improvements our customers see when switching to Requesty's LLM router

40-60%
Cost Reduction

Automatic routing to cost-effective models for simple queries while reserving premium models for complex tasks

99.9%
Uptime Guarantee

Automatic failover across providers eliminates single points of failure—if OpenAI goes down, instantly switch to Anthropic or Google

30-40%
Faster Responses

Latency-based routing automatically selects the fastest models for your region and workload

5 min
Integration Time

Drop-in OpenAI SDK replacement—change your base URL and API key, no other code changes needed

Smart Model Selection

Automatically routes to the best model based on your task, balancing performance and cost.

Streaming Support

Real-time token streaming for faster responses and better user experience.

Privacy First

Configurable data retention and privacy settings for each provider.

Cost Optimization

Intelligent caching and routing to minimize costs while maintaining performance.

Structured Output

Consistent JSON responses across all models with automatic validation (a JSON-mode sketch follows this feature list).

Advanced Features

Support for vision, tool use, and other model-specific capabilities.
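
For the Structured Output feature above, here is a minimal JSON-mode sketch using the standard OpenAI SDK response_format option; whether a given model honors it, and how Requesty validates the result, is an assumption here rather than documented behavior:

import json
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://router.requesty.ai/v1",
    api_key=os.environ["REQUESTY_API_KEY"],  # placeholder variable name
)

# Standard OpenAI JSON mode; assumed to be passed through to models that support it.
response = client.chat.completions.create(
    model="openai/gpt-4o",  # illustrative identifier
    response_format={"type": "json_object"},
    messages=[
        {"role": "system", "content": "Reply with a JSON object of the form {\"sentiment\": \"...\", \"score\": 0.0}."},
        {"role": "user", "content": "The new dashboard is fantastic."},
    ],
)
data = json.loads(response.choices[0].message.content)
print(data["sentiment"], data["score"])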

Frequently Asked Questions

Is Requesty an LLM router?

Yes. Requesty is a production-grade LLM router that intelligently routes requests across 500+ AI models from providers like OpenAI, Anthropic, Google, and AWS Bedrock.

Does Requesty support automatic failover?

Yes. Requesty automatically fails over to backup models when primary models are unavailable, rate-limited, or slow—ensuring 99.9% uptime for your AI applications.

How is Requesty different from OpenAI's API?

Requesty is a drop-in OpenAI SDK replacement that routes across 500+ models from multiple providers (not just OpenAI). You get automatic failover, load balancing, cost optimization, and latency-based routing—features OpenAI doesn't provide.

What models and providers does Requesty support?

Requesty supports 500+ models from OpenAI (GPT-4, GPT-3.5), Anthropic (Claude), Google (Gemini), AWS Bedrock, Azure OpenAI, Cohere, Meta (Llama), Mistral, and more. Full list at /solution/llm-routing/models.

How do I migrate from direct provider SDKs to Requesty?

Change your base URL to Requesty's endpoint and use your Requesty API key. For OpenAI SDK: client = OpenAI(base_url='https://router.requesty.ai/v1', api_key='your-requesty-key'). That's it—no other code changes needed. You can always implement your own fallback strategies on top of Requesty.
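
A before/after sketch of that migration (environment variable names are illustrative):

import os
from openai import OpenAI

# Before: direct OpenAI
# client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

# After: routed through Requesty - the only change is the constructor.
client = OpenAI(
    base_url="https://router.requesty.ai/v1",
    api_key=os.environ["REQUESTY_API_KEY"],
)
# All existing chat.completions calls stay exactly as they were.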

Does Requesty support streaming responses?

Yes. Requesty fully supports streaming (SSE) for real-time token-by-token responses across all compatible models.
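
A minimal streaming sketch with the standard OpenAI SDK stream flag (the model identifier and environment variable name are illustrative placeholders):

import os
from openai import OpenAI

client = OpenAI(
    base_url="https://router.requesty.ai/v1",
    api_key=os.environ["REQUESTY_API_KEY"],
)

# stream=True yields chunks as tokens are generated (SSE under the hood).
stream = client.chat.completions.create(
    model="openai/gpt-4o",  # illustrative identifier
    messages=[{"role": "user", "content": "Write a haiku about failover."}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)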

Can I use Requesty for regional routing and data residency?

Yes. Requesty supports geographic routing—filter models by region (US, EU, Asia) to meet data residency requirements (GDPR, HIPAA, SOC 2).

Can I implement my own fallback logic with Requesty?

Absolutely. Requesty is just a router—you can always implement your own fallback strategies, retry logic, or error handling on the client side. Use Requesty's routing policies for automatic failover, or build custom logic that fits your specific needs.
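
One way to layer client-side fallback on top of the router, as a sketch (model identifiers are illustrative placeholders; APIError is the OpenAI SDK's base exception, so this retries on any API failure):

import os
from openai import OpenAI, APIError

client = OpenAI(
    base_url="https://router.requesty.ai/v1",
    api_key=os.environ["REQUESTY_API_KEY"],
)

# Preferred model first, then alternatives; identifiers are placeholders.
FALLBACK_CHAIN = ["openai/gpt-4o", "anthropic/claude-sonnet-4-5", "google/gemini-2.5-pro"]

def chat_with_fallback(messages):
    last_error = None
    for model in FALLBACK_CHAIN:
        try:
            return client.chat.completions.create(model=model, messages=messages)
        except APIError as exc:
            last_error = exc  # record the failure and try the next model
    raise last_error

reply = chat_with_fallback([{"role": "user", "content": "Ping?"}])
print(reply.choices[0].message.content)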

Available Models

Access to all major AI models through a single API

Full model list: /solution/llm-routing/models