Models: Nebius
106 models available, 5 endpoints
| Model ID | Name | Input $/M | Output $/M | Cache Read $/M | Cache Write $/M | Features | Updated |
|---|---|---|---|---|---|---|---|
| meta-llama/Llama-Guard-3-8B | Llama Guard 3 8B | $0.20 | $0.60 | | | | 3w ago |
| BAAI/bge-multilingual-gemma2 | Bge Multilingual Gemma2 | $0.01 | Free | | | | 3w ago |
| google/gemma-3-27b-it | Gemma 3 27B It | $0.10 | $0.30 | | | | 3w ago |
| google/gemma-3-27b-it-fast | Gemma 3 27B It Fast | $0.20 | $0.60 | | | | 3w ago |
| intfloat/e5-mistral-7b-instruct | E5 Mistral 7B Instruct | $0.01 | Free | | | | 3w ago |
| zai-org/GLM-4.5-Air | Glm 4.5 Air | $0.20 | $1.20 | | | | 3w ago |
| deepseek-ai/DeepSeek-R1-0528 | Deepseek R1 | $0.80 | $2.40 | | | | 3w ago |
| deepseek-ai/DeepSeek-R1-0528-fast | Deepseek R1 0528 Fast | $2.00 | $6.00 | | | | 3w ago |
| deepseek-ai/DeepSeek-V3-0324 | Deepseek V3 | $0.50 | $1.50 | | | | 3w ago |
| deepseek-ai/DeepSeek-V3-0324-fast | Deepseek V3 0324 Fast | $0.75 | $2.25 | | | | 3w ago |
| google/gemma-2-2b-it | Gemma 2 2B It | $0.02 | $0.06 | | | | 3w ago |
| google/gemma-2-9b-it-fast | Gemma 2 9B It Fast | $0.03 | $0.09 | | | | 3w ago |
| meta-llama/Llama-3.3-70B-Instruct | Llama 3.3 70B Instruct | $0.13 | $0.40 | | | | 3w ago |
| meta-llama/Llama-3.3-70B-Instruct-fast | Llama 3.3 70B Instruct Fast | $0.25 | $0.75 | | | | 3w ago |
| meta-llama/Meta-Llama-3.1-8B-Instruct | Meta Llama 3.1 8B Instruct | $0.02 | $0.06 | | | | 3w ago |
| meta-llama/Meta-Llama-3.1-8B-Instruct-fast | Meta Llama 3.1 8B Instruct Fast | $0.03 | $0.09 | | | | 3w ago |
| moonshotai/Kimi-K2-Instruct | Kimi K2 Instruct | $0.50 | $2.40 | | | | 3w ago |
| NousResearch/Hermes-4-405B | Hermes 4 405B | $1.00 | $3.00 | | | | 3w ago |
| NousResearch/Hermes-4-70B | Hermes 4 70B | $0.13 | $0.40 | | | | 3w ago |
| nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 | Llama 3 1 Nemotron Ultra 253B V1 | $0.60 | $1.80 | | | | 3w ago |
| openai/gpt-oss-120b | GPT Oss 120B | $0.15 | $0.60 | | | | 3w ago |
| openai/gpt-oss-20b | GPT Oss 20B | $0.05 | $0.20 | | | | 3w ago |
| Qwen/Qwen2.5-Coder-7B-fast | Qwen2.5 Coder 7B Fast | $0.03 | $0.09 | | | | 3w ago |
| Qwen/Qwen3-235B-A22B-Instruct-2507 | Qwen3 235B A22b Instruct | $0.20 | $0.60 | | | | 3w ago |
| Qwen/Qwen3-235B-A22B-Thinking-2507 | Qwen3 235B A22b Thinking | $0.20 | $0.80 | | | | 3w ago |
| Qwen/Qwen3-30B-A3B-Instruct-2507 | Qwen3 30B A3b Instruct | $0.10 | $0.30 | | | | 3w ago |
| Qwen/Qwen3-30B-A3B-Thinking-2507 | Qwen3 30B A3b Thinking | $0.10 | $0.30 | | | | 3w ago |
| Qwen/Qwen3-32B | Qwen3 32B | $0.10 | $0.30 | | | | 3w ago |
| Qwen/Qwen3-32B-fast | Qwen3 32B Fast | $0.20 | $0.60 | | | | 3w ago |
| Qwen/Qwen3-Coder-30B-A3B-Instruct | Qwen3 Coder 30B A3b Instruct | $0.10 | $0.30 | | | | 3w ago |
| Qwen/Qwen3-Coder-480B-A35B-Instruct | Qwen3 Coder 480B A35b Instruct | $0.40 | $1.80 | | | | 3w ago |
| zai-org/GLM-4.5 | Glm 4.5 | $0.60 | $2.20 | | | | 3w ago |
| black-forest-labs/flux-dev | Flux Dev | | | | | | 2mo ago |
| black-forest-labs/flux-schnell | Flux Schnell | | | | | | 2mo ago |
| deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct | Deepseek Coder V2 Lite Instruct | | | | | | 2mo ago |
| deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct-fast | Deepseek Coder V2 Lite Instruct Fast | | | | | | 2mo ago |
| llava-hf/llava-1.5-13b-hf | Llava 1.5 13B Hf | | | | | | 2mo ago |
| llava-hf/llava-1.5-7b-hf | Llava 1.5 7B Hf | | | | | | 2mo ago |
| mistralai/Mixtral-8x7B-Instruct-v0.1 | Mixtral 8x7b Instruct V0.1 | | | | | Tools | 2mo ago |
| mistralai/Mixtral-8x7B-Instruct-v0.1-fast | Mixtral 8x7b Instruct V0.1 Fast | | | | | Tools | 2mo ago |
| Qwen/Qwen2-VL-72B-Instruct | Qwen2 VL 72B Instruct | | | | | | 2mo ago |
| Qwen/Qwen2-VL-7B-Instruct | Qwen2 VL 7B Instruct | | | | | | 2mo ago |
| Qwen/Qwen2.5-Coder-32B-Instruct | Qwen2.5 Coder 32B Instruct | | | | | | 2mo ago |
| Qwen/Qwen2.5-Coder-32B-Instruct-fast | Qwen2.5 Coder 32B Instruct Fast | | | | | | 2mo ago |
| Qwen/Qwen2.5-Coder-7B | Qwen2.5 Coder 7B | | | | | | 2mo ago |
| Qwen/Qwen2.5-Coder-7B-Instruct | Qwen2.5 Coder 7B Instruct | | | | | | 2mo ago |
| Qwen/Qwen2.5-Coder-7B-Instruct-fast | Qwen2.5 Coder 7B Instruct Fast | | | | | | 2mo ago |
| Qwen/Qwen2.5-VL-72B-Instruct | Qwen2.5 VL 72B Instruct | | | | | | 2mo ago |
| aaditya/Llama3-OpenBioLLM-70B | Llama3 Openbiollm 70B | | | | | | 2mo ago |
| aaditya/Llama3-OpenBioLLM-8B | Llama3 Openbiollm 8B | | | | | | 2mo ago |
| allenai/OLMo-7B-Instruct-hf | Olmo 7B Instruct Hf | | | | | | 2mo ago |
| cognitivecomputations/dolphin-2.9.2-mixtral-8x22b | Dolphin 2.9.2 Mixtral 8x22b | | | | | | 2mo ago |
| deepseek-ai/DeepSeek-V3 | Deepseek V3 | | | | | | 2mo ago |
| deepseek-ai/DeepSeek-V3.1 | Deepseek V3.1 | | | | | | 2mo ago |
| deepseek-ai/DeepSeek-V3.2 | Deepseek V3.2 | | | | | | 2mo ago |
| google/gemma-2-27b-it | Gemma 2 27B It | | | | | | 2mo ago |
| google/gemma-2-27b-it-fast | Gemma 2 27B It Fast | | | | | | 2mo ago |
| google/gemma-2-2b-it-fast | Gemma 2 2B It Fast | | | | | | 2mo ago |
| google/gemma-2-9b-it | Gemma 2 9B It | | | | | | 2mo ago |
| meta-llama/Llama-3.2-1B | Llama 3.2 1B | | | | | | 2mo ago |
| meta-llama/Llama-3.2-1B-Instruct | Llama 3.2 1B Instruct | | | | | | 2mo ago |
| meta-llama/Llama-3.2-3B | Llama 3.2 3B | | | | | | 2mo ago |
| meta-llama/Llama-3.2-3B-Instruct | Llama 3.2 3B Instruct | | | | | | 2mo ago |
| meta-llama/Meta-Llama-3.1-405B-Instruct | Meta Llama 3.1 405B Instruct | | | | | | 2mo ago |
| meta-llama/Meta-Llama-3.1-70B-Instruct | Meta Llama 3.1 70B Instruct | | | | | | 2mo ago |
| meta-llama/Meta-Llama-3.1-70B-Instruct-fast | Meta Llama 3.1 70B Instruct Fast | | | | | | 2mo ago |
| microsoft/Phi-3-medium-128k-instruct | Phi 3 Medium 128k Instruct | | | | | | 2mo ago |
| microsoft/Phi-3-medium-128k-instruct-fast | Phi 3 Medium 128k Instruct Fast | | | | | | 2mo ago |
| microsoft/Phi-3-mini-4k-instruct | Phi 3 Mini 4k Instruct | | | | | | 2mo ago |
| microsoft/Phi-3-mini-4k-instruct-fast | Phi 3 Mini 4k Instruct Fast | | | | | | 2mo ago |
| microsoft/Phi-3.5-mini-instruct | Phi 3.5 Mini Instruct | | | | | | 2mo ago |
| microsoft/Phi-3.5-MoE-instruct | Phi 3.5 MOE Instruct | | | | | | 2mo ago |
| MiniMaxAI/MiniMax-M2.1 | Minimax M2.1 | | | | | | 2mo ago |
| mistralai/Devstral-Small-2505 | Devstral Small | | | | | | 2mo ago |
| mistralai/Mistral-Nemo-Instruct-2407 | Mistral Nemo Instruct | | | | | | 2mo ago |
| mistralai/Mistral-Nemo-Instruct-2407-fast | Mistral Nemo Instruct 2407 Fast | | | | | | 2mo ago |
| mistralai/Mixtral-8x22B-Instruct-v0.1 | Mixtral 8x22b Instruct V0.1 | | | | | | 2mo ago |
| mistralai/Mixtral-8x22B-Instruct-v0.1-fast | Mixtral 8x22b Instruct V0.1 Fast | | | | | | 2mo ago |
| moonshotai/Kimi-K2-Thinking | Kimi K2 Thinking | | | | | | 2mo ago |
| NousResearch/Hermes-3-Llama-405B | Hermes 3 Llama 405B | | | | | | 2mo ago |
| nvidia/Llama-3.1-Nemotron-70B-Instruct-HF | Llama 3.1 Nemotron 70B Instruct Hf | | | | | | 2mo ago |
| nvidia/Llama-3.1-Nemotron-70B-Instruct-HF-fast | Llama 3.1 Nemotron 70B Instruct Hf Fast | | | | | | 2mo ago |
| nvidia/Nemotron-Nano-V2-12b | Nemotron Nano V2 12B | | | | | | 2mo ago |
| nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B | Nvidia Nemotron 3 Nano 30B A3b | | | | | | 2mo ago |
| PrimeIntellect/INTELLECT-3.1-13B | Intellect 3.1 13B | | | | | | 2mo ago |
| Qwen/Qwen2.5-1.5B-Instruct | Qwen2.5 1.5b Instruct | | | | | | 2mo ago |
| Qwen/Qwen2.5-32B-Instruct | Qwen2.5 32B Instruct | | | | | | 2mo ago |
| Qwen/Qwen2.5-32B-Instruct-fast | Qwen2.5 32B Instruct Fast | | | | | | 2mo ago |
| Qwen/Qwen2.5-72B-Instruct | Qwen2.5 72B Instruct | | | | | | 2mo ago |
| Qwen/Qwen2.5-72B-Instruct-fast | Qwen2.5 72B Instruct Fast | | | | | | 2mo ago |
| Qwen/Qwen3-0.6B | Qwen3 0.6b | | | | | | 2mo ago |
| Qwen/Qwen3-0.6B-Base | Qwen3 0.6b Base | | | | | | 2mo ago |
| Qwen/Qwen3-1.7B | Qwen3 1.7b | | | | | | 2mo ago |
| Qwen/Qwen3-1.7B-Base | Qwen3 1.7b Base | | | | | | 2mo ago |
| Qwen/Qwen3-14B | Qwen3 14B | | | | | | 2mo ago |
| Qwen/Qwen3-14B-Base | Qwen3 14B Base | | | | | | 2mo ago |
| Qwen/Qwen3-30B-A3B | Qwen3 30B A3b | | | | | | 2mo ago |
| Qwen/Qwen3-4B | Qwen3 4B | | | | | | 2mo ago |
| Qwen/Qwen3-4B-Base | Qwen3 4B Base | | | | | | 2mo ago |
| Qwen/Qwen3-8B | Qwen3 8B | | | | | | 2mo ago |
| Qwen/Qwen3-8B-Base | Qwen3 8B Base | | | | | | 2mo ago |
| Qwen/Qwen3-Embedding-8B | Qwen3 Embedding 8B | | | | | | 2mo ago |
| Qwen/Qwen3-Next-80B-A3B-Thinking | Qwen3 Next 80B A3b Thinking | | | | | | 2mo ago |
| Qwen/QwQ-32B | Qwq 32B | | | | | | 2mo ago |
| Qwen/QwQ-32B-fast | Qwq 32B Fast | | | | | | 2mo ago |
| zai-org/GLM-4.7-FP8 | Glm 4.7 FP8 | | | | | | 2mo ago |
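
The per-token prices in the listing can be turned into a rough request cost with a little arithmetic. The sketch below assumes that "$/M" means US dollars per one million tokens, which the listing does not state explicitly. The helper name `estimate_cost` and the token counts are hypothetical and only for illustration. The two prices are copied from the listing, and "Free" is treated as zero.

```python
from decimal import Decimal

# (input price, output price) in USD per one million tokens.
# Assumption: "$/M" in the listing means "per million tokens".
PRICES = {
    "meta-llama/Llama-3.3-70B-Instruct": (Decimal("0.13"), Decimal("0.40")),
    # Output is listed as "Free", modeled here as a price of zero.
    "BAAI/bge-multilingual-gemma2": (Decimal("0.01"), Decimal("0")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for one workload (hypothetical helper)."""
    input_price, output_price = PRICES[model_id]
    total = input_price * input_tokens + output_price * output_tokens
    return total / MILLION


# Example workload: 2,000,000 input tokens and 500,000 output tokens.
cost = estimate_cost("meta-llama/Llama-3.3-70B-Instruct", 2_000_000, 500_000)
print(f"Estimated cost: ${cost}")
```

`Decimal` is used instead of `float` so that small per-token prices add up without binary rounding noise. Models with no listed price are left out of `PRICES` on purpose, so looking them up raises a `KeyError` rather than silently returning zero.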