Requesty
DeepInfra Inc.

meta-llama/Meta-Llama-3.1-405B-Instruct

A lightweight and ultra-fast variant of Llama 3.3 70B, for use when quick response times are needed most.

Pricing per 1M tokens

Input
$0.80
Output
$0.80

Specifications

Context window131K tokens
Max outputUnlimited
API typechat
AddedFeb 6, 2025
Model IDdeepinfra/meta-llama/Meta-Llama-3.1-405B-Instruct

Privacy & data

Data retentionNo
Used for trainingNo
Provider locationπŸ‡ΊπŸ‡Έ US