ModelsNebius
106 models available5 endpoints
106 models
| Model ID | Input $/M | Output $/M | Cache Read $/M | Cache Write $/M | Features | Updated |
|---|---|---|---|---|---|---|
| black-forest-labs/flux-dev Flux Dev | — | — | — | — | 1mo ago | |
| black-forest-labs/flux-schnell Flux Schnell | — | — | — | — | 1mo ago | |
| deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct Deepseek Coder V2 Lite Instruct | — | — | — | — | 1mo ago | |
| deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct-fast Deepseek Coder V2 Lite Instruct Fast | — | — | — | — | 1mo ago | |
| google/gemma-3-27b-it Gemma 3 27B It | — | — | — | — | 1mo ago | |
| google/gemma-3-27b-it-fast Gemma 3 27B It Fast | — | — | — | — | 1mo ago | |
| llava-hf/llava-1.5-13b-hf Llava 1.5 13B Hf | — | — | — | — | 1mo ago | |
| llava-hf/llava-1.5-7b-hf Llava 1.5 7B Hf | — | — | — | — | 1mo ago | |
| mistralai/Mixtral-8x7B-Instruct-v0.1 Mixtral 8x7b Instruct V0.1 | — | — | — | — | Tools | 1mo ago |
| mistralai/Mixtral-8x7B-Instruct-v0.1-fast Mixtral 8x7b Instruct V0.1 Fast | — | — | — | — | Tools | 1mo ago |
| Qwen/Qwen2-VL-72B-Instruct Qwen2 VL 72B Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen2-VL-7B-Instruct Qwen2 VL 7B Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-Coder-32B-Instruct Qwen2.5 Coder 32B Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-Coder-32B-Instruct-fast Qwen2.5 Coder 32B Instruct Fast | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-Coder-7B Qwen2.5 Coder 7B | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-Coder-7B-fast Qwen2.5 Coder 7B Fast | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-Coder-7B-Instruct Qwen2.5 Coder 7B Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-Coder-7B-Instruct-fast Qwen2.5 Coder 7B Instruct Fast | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-VL-72B-Instruct Qwen2.5 VL 72B Instruct | — | — | — | — | 1mo ago | |
| aaditya/Llama3-OpenBioLLM-70B Llama3 Openbiollm 70B | — | — | — | — | 1mo ago | |
| aaditya/Llama3-OpenBioLLM-8B Llama3 Openbiollm 8B | — | — | — | — | 1mo ago | |
| allenai/OLMo-7B-Instruct-hf Olmo 7B Instruct Hf | — | — | — | — | 1mo ago | |
| BAAI/bge-multilingual-gemma2 Bge Multilingual Gemma2 | — | — | — | — | 1mo ago | |
| cognitivecomputations/dolphin-2.9.2-mixtral-8x22b Dolphin 2.9.2 Mixtral 8x22b | — | — | — | — | 1mo ago | |
| deepseek-ai/DeepSeek-R1-0528 Deepseek R1 | $0.80 | $2.40 | — | — | 1mo ago | |
| deepseek-ai/DeepSeek-R1-0528-fast Deepseek R1 0528 Fast | — | — | — | — | 1mo ago | |
| deepseek-ai/DeepSeek-V3 Deepseek V3 | — | — | — | — | 1mo ago | |
| deepseek-ai/DeepSeek-V3-0324 Deepseek V3 | — | — | — | — | 1mo ago | |
| deepseek-ai/DeepSeek-V3-0324-fast Deepseek V3 0324 Fast | — | — | — | — | 1mo ago | |
| deepseek-ai/DeepSeek-V3.1 Deepseek V3.1 | — | — | — | — | 1mo ago | |
| deepseek-ai/DeepSeek-V3.2 Deepseek V3.2 | — | — | — | — | 1mo ago | |
| google/gemma-2-27b-it Gemma 2 27B It | — | — | — | — | 1mo ago | |
| google/gemma-2-27b-it-fast Gemma 2 27B It Fast | — | — | — | — | 1mo ago | |
| google/gemma-2-2b-it Gemma 2 2B It | — | — | — | — | 1mo ago | |
| google/gemma-2-2b-it-fast Gemma 2 2B It Fast | — | — | — | — | 1mo ago | |
| google/gemma-2-9b-it Gemma 2 9B It | — | — | — | — | 1mo ago | |
| google/gemma-2-9b-it-fast Gemma 2 9B It Fast | — | — | — | — | 1mo ago | |
| intfloat/e5-mistral-7b-instruct E5 Mistral 7B Instruct | — | — | — | — | 1mo ago | |
| meta-llama/Llama-3.2-1B Llama 3.2 1B | — | — | — | — | 1mo ago | |
| meta-llama/Llama-3.2-1B-Instruct Llama 3.2 1B Instruct | — | — | — | — | 1mo ago | |
| meta-llama/Llama-3.2-3B Llama 3.2 3B | — | — | — | — | 1mo ago | |
| meta-llama/Llama-3.2-3B-Instruct Llama 3.2 3B Instruct | — | — | — | — | 1mo ago | |
| meta-llama/Llama-3.3-70B-Instruct Llama 3.3 70B Instruct | — | — | — | — | 1mo ago | |
| meta-llama/Llama-3.3-70B-Instruct-fast Llama 3.3 70B Instruct Fast | — | — | — | — | 1mo ago | |
| meta-llama/Llama-Guard-3-8B Llama Guard 3 8B | — | — | — | — | 1mo ago | |
| meta-llama/Meta-Llama-3.1-405B-Instruct Meta Llama 3.1 405B Instruct | — | — | — | — | 1mo ago | |
| meta-llama/Meta-Llama-3.1-70B-Instruct Meta Llama 3.1 70B Instruct | — | — | — | — | 1mo ago | |
| meta-llama/Meta-Llama-3.1-70B-Instruct-fast Meta Llama 3.1 70B Instruct Fast | — | — | — | — | 1mo ago | |
| meta-llama/Meta-Llama-3.1-8B-Instruct Meta Llama 3.1 8B Instruct | — | — | — | — | 1mo ago | |
| meta-llama/Meta-Llama-3.1-8B-Instruct-fast Meta Llama 3.1 8B Instruct Fast | — | — | — | — | 1mo ago | |
| microsoft/Phi-3-medium-128k-instruct Phi 3 Medium 128k Instruct | — | — | — | — | 1mo ago | |
| microsoft/Phi-3-medium-128k-instruct-fast Phi 3 Medium 128k Instruct Fast | — | — | — | — | 1mo ago | |
| microsoft/Phi-3-mini-4k-instruct Phi 3 Mini 4k Instruct | — | — | — | — | 1mo ago | |
| microsoft/Phi-3-mini-4k-instruct-fast Phi 3 Mini 4k Instruct Fast | — | — | — | — | 1mo ago | |
| microsoft/Phi-3.5-mini-instruct Phi 3.5 Mini Instruct | — | — | — | — | 1mo ago | |
| microsoft/Phi-3.5-MoE-instruct Phi 3.5 MOE Instruct | — | — | — | — | 1mo ago | |
| MiniMaxAI/MiniMax-M2.1 Minimax M2.1 | — | — | — | — | 1mo ago | |
| mistralai/Devstral-Small-2505 Devstral Small | — | — | — | — | 1mo ago | |
| mistralai/Mistral-Nemo-Instruct-2407 Mistral Nemo Instruct | — | — | — | — | 1mo ago | |
| mistralai/Mistral-Nemo-Instruct-2407-fast Mistral Nemo Instruct 2407 Fast | — | — | — | — | 1mo ago | |
| mistralai/Mixtral-8x22B-Instruct-v0.1 Mixtral 8x22b Instruct V0.1 | — | — | — | — | 1mo ago | |
| mistralai/Mixtral-8x22B-Instruct-v0.1-fast Mixtral 8x22b Instruct V0.1 Fast | — | — | — | — | 1mo ago | |
| moonshotai/Kimi-K2-Instruct Kimi K2 Instruct | — | — | — | — | 1mo ago | |
| moonshotai/Kimi-K2-Thinking Kimi K2 Thinking | — | — | — | — | 1mo ago | |
| NousResearch/Hermes-3-Llama-405B Hermes 3 Llama 405B | — | — | — | — | 1mo ago | |
| NousResearch/Hermes-4-405B Hermes 4 405B | — | — | — | — | 1mo ago | |
| NousResearch/Hermes-4-70B Hermes 4 70B | — | — | — | — | 1mo ago | |
| nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Llama 3 1 Nemotron Ultra 253B V1 | — | — | — | — | 1mo ago | |
| nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Llama 3.1 Nemotron 70B Instruct Hf | — | — | — | — | 1mo ago | |
| nvidia/Llama-3.1-Nemotron-70B-Instruct-HF-fast Llama 3.1 Nemotron 70B Instruct Hf Fast | — | — | — | — | 1mo ago | |
| nvidia/Nemotron-Nano-V2-12b Nemotron Nano V2 12B | — | — | — | — | 1mo ago | |
| nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B Nvidia Nemotron 3 Nano 30B A3b | — | — | — | — | 1mo ago | |
| openai/gpt-oss-120b GPT Oss 120B | — | — | — | — | 1mo ago | |
| openai/gpt-oss-20b GPT Oss 20B | — | — | — | — | 1mo ago | |
| PrimeIntellect/INTELLECT-3.1-13B Intellect 3.1 13B | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-1.5B-Instruct Qwen2.5 1.5b Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-32B-Instruct Qwen2.5 32B Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-32B-Instruct-fast Qwen2.5 32B Instruct Fast | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-72B-Instruct Qwen2.5 72B Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen2.5-72B-Instruct-fast Qwen2.5 72B Instruct Fast | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-0.6B Qwen3 0.6b | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-0.6B-Base Qwen3 0.6b Base | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-1.7B Qwen3 1.7b | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-1.7B-Base Qwen3 1.7b Base | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-14B Qwen3 14B | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-14B-Base Qwen3 14B Base | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-235B-A22B-Instruct-2507 Qwen3 235B A22b Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-235B-A22B-Thinking-2507 Qwen3 235B A22b Thinking | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-30B-A3B Qwen3 30B A3b | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-30B-A3B-Instruct-2507 Qwen3 30B A3b Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-30B-A3B-Thinking-2507 Qwen3 30B A3b Thinking | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-32B Qwen3 32B | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-32B-fast Qwen3 32B Fast | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-4B Qwen3 4B | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-4B-Base Qwen3 4B Base | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-8B Qwen3 8B | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-8B-Base Qwen3 8B Base | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-Coder-30B-A3B-Instruct Qwen3 Coder 30B A3b Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-Coder-480B-A35B-Instruct Qwen3 Coder 480B A35b Instruct | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-Embedding-8B Qwen3 Embedding 8B | — | — | — | — | 1mo ago | |
| Qwen/Qwen3-Next-80B-A3B-Thinking Qwen3 Next 80B A3b Thinking | — | — | — | — | 1mo ago | |
| Qwen/QwQ-32B Qwq 32B | — | — | — | — | 1mo ago | |
| Qwen/QwQ-32B-fast Qwq 32B Fast | — | — | — | — | 1mo ago | |
| zai-org/GLM-4.5 Glm 4.5 | — | — | — | — | 1mo ago | |
| zai-org/GLM-4.5-Air Glm 4.5 Air | — | — | — | — | 1mo ago | |
| zai-org/GLM-4.7-FP8 Glm 4.7 FP8 | — | — | — | — | 1mo ago |