Nebius

106 models available, 5 endpoints

Model ID / Name	Input $/M tokens	Output $/M tokens	Cache Read $/M tokens	Cache Write $/M tokens	Features	Updated
deepseek-ai/DeepSeek-V3.2 Deepseek V3.2	$0.30	$0.45	—	—		1mo ago
meta-llama/Llama-Guard-3-8B Llama Guard 3 8B	$0.20	$0.60	—	—		3mo ago
BAAI/bge-multilingual-gemma2 Bge Multilingual Gemma2	$0.01	Free	—	—		3mo ago
google/gemma-3-27b-it Gemma 3 27B It	$0.10	$0.30	—	—		3mo ago
google/gemma-3-27b-it-fast Gemma 3 27B It Fast	$0.20	$0.60	—	—		3mo ago
intfloat/e5-mistral-7b-instruct E5 Mistral 7B Instruct	$0.01	Free	—	—		3mo ago
zai-org/GLM-4.5-Air Glm 4.5 Air	$0.20	$1.20	—	—		3mo ago
deepseek-ai/DeepSeek-R1-0528 Deepseek R1	$0.80	$2.40	—	—		3mo ago
deepseek-ai/DeepSeek-R1-0528-fast Deepseek R1 0528 Fast	$2.00	$6.00	—	—		3mo ago
deepseek-ai/DeepSeek-V3-0324 Deepseek V3	$0.50	$1.50	—	—		3mo ago
deepseek-ai/DeepSeek-V3-0324-fast Deepseek V3 0324 Fast	$0.75	$2.25	—	—		3mo ago
google/gemma-2-2b-it Gemma 2 2B It	$0.02	$0.06	—	—		3mo ago
google/gemma-2-9b-it-fast Gemma 2 9B It Fast	$0.03	$0.09	—	—		3mo ago
meta-llama/Llama-3.3-70B-Instruct Llama 3.3 70B Instruct	$0.13	$0.40	—	—		3mo ago
meta-llama/Llama-3.3-70B-Instruct-fast Llama 3.3 70B Instruct Fast	$0.25	$0.75	—	—		3mo ago
meta-llama/Meta-Llama-3.1-8B-Instruct Meta Llama 3.1 8B Instruct	$0.02	$0.06	—	—		3mo ago
meta-llama/Meta-Llama-3.1-8B-Instruct-fast Meta Llama 3.1 8B Instruct Fast	$0.03	$0.09	—	—		3mo ago
moonshotai/Kimi-K2-Instruct Kimi K2 Instruct	$0.50	$2.40	—	—		3mo ago
NousResearch/Hermes-4-405B Hermes 4 405B	$1.00	$3.00	—	—		3mo ago
NousResearch/Hermes-4-70B Hermes 4 70B	$0.13	$0.40	—	—		3mo ago
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Llama 3 1 Nemotron Ultra 253B V1	$0.60	$1.80	—	—		3mo ago
openai/gpt-oss-120b GPT Oss 120B	$0.15	$0.60	—	—		3mo ago
openai/gpt-oss-20b GPT Oss 20B	$0.05	$0.20	—	—		3mo ago
Qwen/Qwen2.5-Coder-7B-fast Qwen2.5 Coder 7B Fast	$0.03	$0.09	—	—		3mo ago
Qwen/Qwen3-235B-A22B-Instruct-2507 Qwen3 235B A22b Instruct	$0.20	$0.60	—	—		3mo ago
Qwen/Qwen3-235B-A22B-Thinking-2507 Qwen3 235B A22b Thinking	$0.20	$0.80	—	—		3mo ago
Qwen/Qwen3-30B-A3B-Instruct-2507 Qwen3 30B A3b Instruct	$0.10	$0.30	—	—		3mo ago
Qwen/Qwen3-30B-A3B-Thinking-2507 Qwen3 30B A3b Thinking	$0.10	$0.30	—	—		3mo ago
Qwen/Qwen3-32B Qwen3 32B	$0.10	$0.30	—	—		3mo ago
Qwen/Qwen3-32B-fast Qwen3 32B Fast	$0.20	$0.60	—	—		3mo ago
Qwen/Qwen3-Coder-30B-A3B-Instruct Qwen3 Coder 30B A3b Instruct	$0.10	$0.30	—	—		3mo ago
Qwen/Qwen3-Coder-480B-A35B-Instruct Qwen3 Coder 480B A35b Instruct	$0.40	$1.80	—	—		3mo ago
zai-org/GLM-4.5 Glm 4.5	$0.60	$2.20	—	—		3mo ago
black-forest-labs/flux-dev Flux Dev	—	—	—	—		4mo ago
black-forest-labs/flux-schnell Flux Schnell	—	—	—	—		4mo ago
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct Deepseek Coder V2 Lite Instruct	—	—	—	—		4mo ago
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct-fast Deepseek Coder V2 Lite Instruct Fast	—	—	—	—		4mo ago
llava-hf/llava-1.5-13b-hf Llava 1.5 13B Hf	—	—	—	—		4mo ago
llava-hf/llava-1.5-7b-hf Llava 1.5 7B Hf	—	—	—	—		4mo ago
mistralai/Mixtral-8x7B-Instruct-v0.1 Mixtral 8x7b Instruct V0.1	—	—	—	—	Tools	4mo ago
mistralai/Mixtral-8x7B-Instruct-v0.1-fast Mixtral 8x7b Instruct V0.1 Fast	—	—	—	—	Tools	4mo ago
Qwen/Qwen2-VL-72B-Instruct Qwen2 VL 72B Instruct	—	—	—	—		4mo ago
Qwen/Qwen2-VL-7B-Instruct Qwen2 VL 7B Instruct	—	—	—	—		4mo ago
Qwen/Qwen2.5-Coder-32B-Instruct Qwen2.5 Coder 32B Instruct	—	—	—	—		4mo ago
Qwen/Qwen2.5-Coder-32B-Instruct-fast Qwen2.5 Coder 32B Instruct Fast	—	—	—	—		4mo ago
Qwen/Qwen2.5-Coder-7B Qwen2.5 Coder 7B	—	—	—	—		4mo ago
Qwen/Qwen2.5-Coder-7B-Instruct Qwen2.5 Coder 7B Instruct	—	—	—	—		4mo ago
Qwen/Qwen2.5-Coder-7B-Instruct-fast Qwen2.5 Coder 7B Instruct Fast	—	—	—	—		4mo ago
Qwen/Qwen2.5-VL-72B-Instruct Qwen2.5 VL 72B Instruct	—	—	—	—		4mo ago
aaditya/Llama3-OpenBioLLM-70B Llama3 Openbiollm 70B	—	—	—	—		5mo ago
aaditya/Llama3-OpenBioLLM-8B Llama3 Openbiollm 8B	—	—	—	—		5mo ago
allenai/OLMo-7B-Instruct-hf Olmo 7B Instruct Hf	—	—	—	—		5mo ago
cognitivecomputations/dolphin-2.9.2-mixtral-8x22b Dolphin 2.9.2 Mixtral 8x22b	—	—	—	—		5mo ago
deepseek-ai/DeepSeek-V3 Deepseek V3	—	—	—	—		5mo ago
deepseek-ai/DeepSeek-V3.1 Deepseek V3.1	—	—	—	—		5mo ago
google/gemma-2-27b-it Gemma 2 27B It	—	—	—	—		5mo ago
google/gemma-2-27b-it-fast Gemma 2 27B It Fast	—	—	—	—		5mo ago
google/gemma-2-2b-it-fast Gemma 2 2B It Fast	—	—	—	—		5mo ago
google/gemma-2-9b-it Gemma 2 9B It	—	—	—	—		5mo ago
meta-llama/Llama-3.2-1B Llama 3.2 1B	—	—	—	—		5mo ago
meta-llama/Llama-3.2-1B-Instruct Llama 3.2 1B Instruct	—	—	—	—		5mo ago
meta-llama/Llama-3.2-3B Llama 3.2 3B	—	—	—	—		5mo ago
meta-llama/Llama-3.2-3B-Instruct Llama 3.2 3B Instruct	—	—	—	—		5mo ago
meta-llama/Meta-Llama-3.1-405B-Instruct Meta Llama 3.1 405B Instruct	—	—	—	—		5mo ago
meta-llama/Meta-Llama-3.1-70B-Instruct Meta Llama 3.1 70B Instruct	—	—	—	—		5mo ago
meta-llama/Meta-Llama-3.1-70B-Instruct-fast Meta Llama 3.1 70B Instruct Fast	—	—	—	—		5mo ago
microsoft/Phi-3-medium-128k-instruct Phi 3 Medium 128k Instruct	—	—	—	—		5mo ago
microsoft/Phi-3-medium-128k-instruct-fast Phi 3 Medium 128k Instruct Fast	—	—	—	—		5mo ago
microsoft/Phi-3-mini-4k-instruct Phi 3 Mini 4k Instruct	—	—	—	—		5mo ago
microsoft/Phi-3-mini-4k-instruct-fast Phi 3 Mini 4k Instruct Fast	—	—	—	—		5mo ago
microsoft/Phi-3.5-mini-instruct Phi 3.5 Mini Instruct	—	—	—	—		5mo ago
microsoft/Phi-3.5-MoE-instruct Phi 3.5 MOE Instruct	—	—	—	—		5mo ago
MiniMaxAI/MiniMax-M2.1 Minimax M2.1	—	—	—	—		5mo ago
mistralai/Devstral-Small-2505 Devstral Small	—	—	—	—		5mo ago
mistralai/Mistral-Nemo-Instruct-2407 Mistral Nemo Instruct	—	—	—	—		5mo ago
mistralai/Mistral-Nemo-Instruct-2407-fast Mistral Nemo Instruct 2407 Fast	—	—	—	—		5mo ago
mistralai/Mixtral-8x22B-Instruct-v0.1 Mixtral 8x22b Instruct V0.1	—	—	—	—		5mo ago
mistralai/Mixtral-8x22B-Instruct-v0.1-fast Mixtral 8x22b Instruct V0.1 Fast	—	—	—	—		5mo ago
moonshotai/Kimi-K2-Thinking Kimi K2 Thinking	—	—	—	—		5mo ago
NousResearch/Hermes-3-Llama-405B Hermes 3 Llama 405B	—	—	—	—		5mo ago
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Llama 3.1 Nemotron 70B Instruct Hf	—	—	—	—		5mo ago
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF-fast Llama 3.1 Nemotron 70B Instruct Hf Fast	—	—	—	—		5mo ago
nvidia/Nemotron-Nano-V2-12b Nemotron Nano V2 12B	—	—	—	—		5mo ago
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B Nvidia Nemotron 3 Nano 30B A3b	—	—	—	—		5mo ago
PrimeIntellect/INTELLECT-3.1-13B Intellect 3.1 13B	—	—	—	—		5mo ago
Qwen/Qwen2.5-1.5B-Instruct Qwen2.5 1.5b Instruct	—	—	—	—		5mo ago
Qwen/Qwen2.5-32B-Instruct Qwen2.5 32B Instruct	—	—	—	—		5mo ago
Qwen/Qwen2.5-32B-Instruct-fast Qwen2.5 32B Instruct Fast	—	—	—	—		5mo ago
Qwen/Qwen2.5-72B-Instruct Qwen2.5 72B Instruct	—	—	—	—		5mo ago
Qwen/Qwen2.5-72B-Instruct-fast Qwen2.5 72B Instruct Fast	—	—	—	—		5mo ago
Qwen/Qwen3-0.6B Qwen3 0.6b	—	—	—	—		5mo ago
Qwen/Qwen3-0.6B-Base Qwen3 0.6b Base	—	—	—	—		5mo ago
Qwen/Qwen3-1.7B Qwen3 1.7b	—	—	—	—		5mo ago
Qwen/Qwen3-1.7B-Base Qwen3 1.7b Base	—	—	—	—		5mo ago
Qwen/Qwen3-14B Qwen3 14B	—	—	—	—		5mo ago
Qwen/Qwen3-14B-Base Qwen3 14B Base	—	—	—	—		5mo ago
Qwen/Qwen3-30B-A3B Qwen3 30B A3b	—	—	—	—		5mo ago
Qwen/Qwen3-4B Qwen3 4B	—	—	—	—		5mo ago
Qwen/Qwen3-4B-Base Qwen3 4B Base	—	—	—	—		5mo ago
Qwen/Qwen3-8B Qwen3 8B	—	—	—	—		5mo ago
Qwen/Qwen3-8B-Base Qwen3 8B Base	—	—	—	—		5mo ago
Qwen/Qwen3-Embedding-8B Qwen3 Embedding 8B	—	—	—	—		5mo ago
Qwen/Qwen3-Next-80B-A3B-Thinking Qwen3 Next 80B A3b Thinking	—	—	—	—		5mo ago
Qwen/QwQ-32B Qwq 32B	—	—	—	—		5mo ago
Qwen/QwQ-32B-fast Qwq 32B Fast	—	—	—	—		5mo ago
zai-org/GLM-4.7-FP8 Glm 4.7 FP8	—	—	—	—		5mo ago
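
As a rough illustration of how the listed prices turn into a bill, the sketch below estimates the cost of a single workload. It assumes the "$/M" columns mean US dollars per one million tokens, which the listing does not state explicitly. The `PRICES` dictionary and the `estimate_cost` helper are hypothetical names introduced for this example; only the two price pairs are copied from the table.

```python
from decimal import Decimal

# (input, output) prices copied from the table above, read as
# USD per one million tokens (an assumption about the "$/M" headers).
PRICES = {
    "deepseek-ai/DeepSeek-V3.2": (Decimal("0.30"), Decimal("0.45")),
    "openai/gpt-oss-120b": (Decimal("0.15"), Decimal("0.60")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for one workload (hypothetical helper)."""
    input_price, output_price = PRICES[model_id]
    # Each side is billed in proportion to its token count.
    return (input_price * input_tokens + output_price * output_tokens) / MILLION


if __name__ == "__main__":
    # 200k input tokens and 50k output tokens on DeepSeek-V3.2.
    print(estimate_cost("deepseek-ai/DeepSeek-V3.2", 200_000, 50_000))
```

`Decimal` is used instead of floats so that small per-token amounts add up without binary rounding noise. Rows that show "—" have no listed price and would need to be excluded or looked up elsewhere before being added to such a mapping.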