Models: Novita AI
64 models available, 4 endpoints
| Model ID and name | Input ($/M tokens) | Output ($/M tokens) | Cache Read ($/M tokens) | Cache Write ($/M tokens) | Updated | Features |
|---|---|---|---|---|---|---|
| xiaomi/mimo-v2-flash Mimo V2 Flash | — | — | — | — | 1mo ago | |
| baichuan/baichuan-m2-32b Baichuan M2 32B | — | — | — | — | 1mo ago | |
| baidu/ernie-4.5-21B-a3b-thinking Ernie 4.5 21B A3b Thinking | — | — | — | — | 1mo ago | |
| baidu/ernie-4.5-300b-a47b-paddle Ernie 4.5 300B A47b Paddle | — | — | — | — | 1mo ago | |
| baidu/ernie-4.5-vl-28b-a3b-thinking Ernie 4.5 VL 28B A3b Thinking | — | — | — | — | 1mo ago | |
| deepseek/deepseek-ocr-2 Deepseek OCR 2 | — | — | — | — | 1mo ago | |
| deepseek/deepseek-prover-v2-671b Deepseek Prover V2 671B | — | — | — | — | 1mo ago | |
| deepseek/deepseek-r1-distill-qwen-14b Deepseek R1 Distill Qwen 14B | — | — | — | — | 1mo ago | |
| deepseek/deepseek-r1-distill-qwen-32b Deepseek R1 Distill Qwen 32B | — | — | — | — | 1mo ago | |
| deepseek/deepseek-r1-turbo Deepseek R1 Turbo | — | — | — | — | 1mo ago | |
| deepseek/deepseek-v3-0324 Deepseek V3 | — | — | — | — | 1mo ago | |
| deepseek/deepseek-v3-turbo Deepseek V3 Turbo | — | — | — | — | 1mo ago | |
| deepseek/deepseek-v3.1 Deepseek V3.1 | — | — | — | — | 1mo ago | |
| deepseek/deepseek-v3.1-terminus Deepseek V3.1 Terminus | — | — | — | — | 1mo ago | |
| deepseek/deepseek-v3.2 Deepseek V3.2 | — | — | — | — | 1mo ago | |
| deepseek/deepseek-v3.2-exp Deepseek V3.2 Exp | — | — | — | — | 1mo ago | |
| google/gemma-3-12b-instruct Gemma 3 12B Instruct | — | — | — | — | 1mo ago | |
| google/gemma-3-27b-it Gemma 3 27B It | — | — | — | — | 1mo ago | |
| gryphe/mythomax-l2-13b Mythomax L2 13B | $0.19 | $0.19 | — | — | 1mo ago | |
| kwaipilot/kat-coder-pro Kat Coder Pro | — | — | — | — | 1mo ago | |
| meta-llama/llama-3-70b-instruct Llama 3 70B Instruct | $0.80 | $0.80 | — | — | 1mo ago | |
| meta-llama/llama-3.2-3b-instruct Llama 3.2 3B Instruct | — | — | — | — | 1mo ago | |
| meta-llama/llama-3.3-70b-instruct Llama 3.3 70B Instruct | — | — | — | — | 1mo ago | |
| meta-llama/llama-4-maverick-17b-128e-instruct-fp8 Llama 4 Maverick 17B 128e Instruct FP8 | — | — | — | — | 1mo ago | |
| meta-llama/llama-4-scout-17b-16e-instruct Llama 4 Scout 17B 16e Instruct | — | — | — | — | 1mo ago | |
| minimax/minimax-m2.1 Minimax M2.1 | — | — | — | — | 1mo ago | |
| minimaxai/minimax-m1-80k Minimax M1 80k | — | — | — | — | 1mo ago | |
| mistralai/mistral-nemo-7b-instruct Mistral Nemo 7B Instruct | — | — | — | — | 1mo ago | |
| moonshotai/kimi-k2-0905 Kimi K2 | — | — | — | — | 1mo ago | |
| moonshotai/kimi-k2-instruct Kimi K2 Instruct | — | — | — | — | 1mo ago | |
| moonshotai/kimi-k2-thinking Kimi K2 Thinking | — | — | — | — | 1mo ago | |
| moonshotai/kimi-k2.5 Kimi K2.5 | — | — | — | — | 1mo ago | |
| nousresearch/hermes-2-pro-llama-3-8b Hermes 2 Pro Llama 3 8B | — | — | — | — | 1mo ago | |
| openai/gpt-oss-120b GPT Oss 120B | — | — | — | — | 1mo ago | |
| qwen/qwen-2.5-72b-instruct Qwen 2.5 72B Instruct | — | — | — | — | 1mo ago | |
| qwen/qwen2.5-7b-instruct Qwen2.5 7B Instruct | — | — | — | — | 1mo ago | |
| qwen/qwen2.5-vl-72b-instruct Qwen2.5 VL 72B Instruct | — | — | — | — | 1mo ago | |
| qwen/qwen3-235b-a22b-fp8 Qwen3 235B A22b FP8 | — | — | — | — | 1mo ago | |
| qwen/qwen3-235b-a22b-instruct-2507 Qwen3 235B A22b Instruct | — | — | — | — | 1mo ago | |
| qwen/qwen3-235b-a22b-thinking-2507 Qwen3 235B A22b Thinking | — | — | — | — | 1mo ago | |
| qwen/qwen3-30b-a3b-fp8 Qwen3 30B A3b FP8 | — | — | — | — | 1mo ago | |
| qwen/qwen3-32b-fp8 Qwen3 32B FP8 | — | — | — | — | 1mo ago | |
| qwen/qwen3-4b-fp8 Qwen3 4B FP8 | — | — | — | — | 1mo ago | |
| qwen/qwen3-8b-fp8 Qwen3 8B FP8 | — | — | — | — | 1mo ago | |
| qwen/qwen3-coder-30b-a3b-instruct Qwen3 Coder 30B A3b Instruct | — | — | — | — | 1mo ago | |
| qwen/qwen3-coder-480b-a35b-instruct Qwen3 Coder 480B A35b Instruct | — | — | — | — | 1mo ago | |
| qwen/qwen3-coder-next Qwen3 Coder Next | — | — | — | — | 1mo ago | |
| qwen/qwen3-max Qwen3 Max | — | — | — | — | 1mo ago | |
| qwen/qwen3-vl-235b-a22b-instruct Qwen3 VL 235B A22b Instruct | — | — | — | — | 1mo ago | |
| qwen/qwen3-vl-235b-a22b-thinking Qwen3 VL 235B A22b Thinking | — | — | — | — | 1mo ago | |
| sao10k/l3-1-70b-euryale-v2.2 L3 1 70B Euryale V2.2 | — | — | — | — | 1mo ago | |
| sao10k/l3-70b-euryale-v2.1 L3 70B Euryale V2.1 | — | — | — | — | 1mo ago | |
| sao10k/l3-8b-lunaris L3 8B Lunaris | — | — | — | — | 1mo ago | |
| sao10k/l3-8b-stheno-v3.2 L3 8B Stheno V3.2 | — | — | — | — | 1mo ago | |
| teknium/openhermes-2.5-mistral-7b Openhermes 2.5 Mistral 7B | $0.17 | $0.17 | — | — | 1mo ago | |
| zai-org/autoglm-phone-9b-multilingual Autoglm Phone 9B Multilingual | — | — | — | — | 1mo ago | |
| zai-org/glm-4.6-vision Glm 4.6 Vision | — | — | — | — | 1mo ago | |
| zai-org/glm-4.6-vision-air Glm 4.6 Vision Air | — | — | — | — | 1mo ago | |
| zai-org/glm-4.7 Glm 4.7 | — | — | — | — | 1mo ago | |
| microsoft/wizardlm-2-8x22b Wizardlm 2 8x22b | $0.90 | $0.90 | — | — | 1mo ago | |
| lzlv_70b Lzlv 70B | $0.70 | $0.80 | — | — | 3mo ago | |
| meta-llama/llama-3-8b-instruct Llama 3 8B Instruct | $0.10 | $0.10 | — | — | 3mo ago | |
| Nous-Hermes-2-Mixtral-8x7B-DPO Nous Hermes 2 Mixtral 8x7b Dpo | $0.27 | $0.27 | — | — | 3mo ago | |
| nousresearch/nous-hermes-llama2-13b Nous Hermes Llama2 13B | $0.26 | $0.26 | — | — | 3mo ago | |