Requesty

Groq Inc.

Ultra-fast AI inference with specialized hardware. Requesty routes to 2 Groq Inc. models starting at $0.10 per 1M input tokens with context windows up to 131K tokens. One API key, OpenAI-compatible SDK, no markup.

All Groq Inc. models

ModelContextMax OutputInput/1MOutput/1MCapabilitiesSWE-Bench
gpt-oss-20b
131K33K$0.10$0.50
πŸ”§
β€”
gpt-oss-120b
131K33K$0.15$0.75
πŸ”§
β€”

About Groq Inc. on Requesty

How many Groq Inc. models are available through Requesty?
Requesty routes to 2 Groq Inc. models including regional variants, with pricing synced in real time to the upstream provider.
What is the cheapest Groq Inc. model?
The cheapest Groq Inc. model starts at $0.10 per million input tokens. See the pricing column in the table below for full per-model rates.
Does Requesty add markup on Groq Inc. pricing?
No. Requesty passes through exactly what Groq Inc. charges. You pay the same per-token rates as going direct β€” plus you get smart routing, caching, analytics, and one unified API for 400+ models.
Is my data used to train Groq Inc. models?
Groq Inc.'s terms state that API data is not used for training. See their privacy policy for the authoritative statement.
Where are Groq Inc. models hosted?
Groq Inc. models are hosted in πŸ‡ΊπŸ‡Έ US. Some models are available in additional regions through AWS Bedrock, Azure, or Google Vertex AI β€” filter by region on the Groq Inc. rows in the models explorer.