Models

Qwen/Qwen3-Embedding-8B

DeepInfra
8K max output
embedding

Pricing

Per 1M tokens

Input
Cached input
Output
Cache write

Modalities

Input
text
Output
embedding

Features

Streaming
Function calling
Vision
Reasoning
JSON mode

Share This Model

Share on X or copy the link

DeepInfra

Qwen3 Embedding 8B

Input
/M
Output
/M
Text Generation
Portkey