Models DeepInfra
nvidia/NVIDIA-Nemotron-Nano-9B-v2
8K max output
chat
Pricing
Per 1M tokens
Input
$0.04
Cached input
—
Output
$0.16
Cache write
—
Modalities
Input
text
Output
text
Features
Streaming
Function calling
Vision
Reasoning
JSON mode
Share This Model
Share on X or copy the link
DeepInfra
Nvidia Nemotron Nano 9B V2
Input
$0.04/M
Output
$0.16/M
Text Generation