Models DeepInfra
meta-llama/Llama-4-Scout-17B-16E-Instruct
8K max output
chat
Pricing
Per 1M tokens
Input
$0.08
Cached input
—
Output
$0.30
Cache write
—
Modalities
Input
text
Output
text
Features
Streaming
Function calling
Vision
Reasoning
JSON mode
Share This Model
Share on X or copy the link
DeepInfra
Llama 4 Scout 17B 16e Instruct
Input
$0.08/M
Output
$0.30/M
Text Generation