Providers & Endpoints Use the API Trending GitHub

meta-llama/Llama-4-Scout-17B-16E-Instruct

8K max output

chat

Pricing

Per 1M tokens

Input

$0.08

Cached input

—

Output

$0.30

Cache write

—

Modalities

Input

text

Output

text

Features

Streaming

Function calling

Vision

Reasoning

JSON mode

Share This Model

Share on X or copy the link

DeepInfra

Llama 4 Scout 17B 16e Instruct

Input

$0.08/M

Output

$0.30/M

Text Generation