Models Google Vertex AI
llama3-70b-8192
8K max output
chat
Pricing
Per 1M tokens
Input
—
Cached input
—
Output
—
Cache write
—
Modalities
Input
text
Output
text
Features
Streaming
Function calling
Vision
Reasoning
JSON mode
Share This Model
Share on X or copy the link
Google Vertex AI
Llama3 70B
Input
—/M
Output
—/M
Text Generation