Requesty
DeepInfra Inc.

meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo

A lightweight and ultra-fast variant of Llama 3.3 70B, for use when quick response times are needed most.

πŸ”§Tool calling

Pricing per 1M tokens

Input
$0.02
Output
$0.05

Specifications

Context window131K tokens
Max outputUnlimited
API typechat
AddedMay 14, 2024
Model IDdeepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo

Privacy & data

Data retentionNo
Used for trainingNo
Provider locationπŸ‡ΊπŸ‡Έ US