llama4@llama-4-scout-17b-16e-instruct

Provider: Google Vertex AI
Max output: 33K tokens
Type: chat

Pricing

Per 1M tokens

Input
Cached input
Output
Cache write

Modalities

Input: text
Output: text

Features

Streaming
Function calling
Vision
Reasoning
JSON mode
