Requesty
Together AI Inc.

meta-llama/Llama-3.2-3B-Instruct-Turbo

A lightweight and ultra-fast variant of Llama 3.3 70B, for use when quick response times are needed most.

πŸ”§Tool calling

Pricing per 1M tokens

Input
$0.06
Output
$0.06

Specifications

Context window131K tokens
Max outputUnlimited
API typechat
AddedJan 30, 2025
Model IDtogether/meta-llama/Llama-3.2-3B-Instruct-Turbo

Privacy & data

Data retentionNo
Used for trainingNo
Provider locationπŸ‡ΊπŸ‡Έ US