Models Google Vertex AI
llama4@llama-4-scout-17b-16e-instruct
33K max output
chat
Pricing
Per 1M tokens
Input
—
Cached input
—
Output
—
Cache write
—
Modalities
Input
text
Output
text
Features
Streaming
Function calling
Vision
Reasoning
JSON mode
Share This Model
Share on X or copy the link
Google Vertex AI
Llama4@llama 4 Scout 17B 16e Instruct
Input
—/M
Output
—/M
Text Generation