Requesty
Novita AI

meta-llama/llama-4-maverick-17b-128e-instruct-fp8

A lightweight and ultra-fast variant of Llama 3.3 70B, for use when quick response times are needed most.

Pricing per 1M tokens

Input
$0.20
Output
$0.85

Specifications

Context window1.0M tokens
Max output1.0M tokens
API typechat
AddedJan 30, 2025
Model IDnovita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8

Privacy & data

Data retentionYes
Used for trainingUnknown
Provider locationπŸ‡ΊπŸ‡Έ US