ModelsNovita AI
67 models available4 endpoints
67 models
| Model ID | Input $/M | Output $/M | Cache Read $/M | Cache Write $/M | Features | Updated |
|---|---|---|---|---|---|---|
| qwen/qwen3-vl-30b-a3b-instruct Qwen3 VL 30B A3b Instruct | $0.20 | $0.70 | — | — | 3w ago | |
| qwen/qwen3-vl-8b-instruct Qwen3 VL 8B Instruct | $0.08 | $0.50 | — | — | 3w ago | |
| xiaomimimo/mimo-v2-flash Mimo V2 Flash | $0.10 | $0.30 | $0.02 | — | 3w ago | |
| baidu/ernie-4.5-21B-a3b-thinking Ernie 4.5 21B A3b Thinking | $0.07 | $0.28 | — | — | 3w ago | |
| baidu/ernie-4.5-300b-a47b-paddle Ernie 4.5 300B A47b Paddle | $0.28 | $1.10 | — | — | 3w ago | |
| baidu/ernie-4.5-vl-28b-a3b-thinking Ernie 4.5 VL 28B A3b Thinking | $0.39 | $0.39 | — | — | 3w ago | |
| deepseek/deepseek-ocr-2 Deepseek OCR 2 | $0.03 | $0.03 | — | — | 3w ago | |
| deepseek/deepseek-prover-v2-671b Deepseek Prover V2 671B | $0.70 | $2.50 | — | — | 3w ago | |
| deepseek/deepseek-r1-turbo Deepseek R1 Turbo | $0.70 | $2.50 | — | — | 3w ago | |
| deepseek/deepseek-v3-0324 Deepseek V3 | $0.27 | $1.12 | $0.14 | — | 3w ago | |
| deepseek/deepseek-v3-turbo Deepseek V3 Turbo | $0.40 | $1.30 | — | — | 3w ago | |
| deepseek/deepseek-v3.1 Deepseek V3.1 | $0.27 | $1.00 | $0.14 | — | 3w ago | |
| deepseek/deepseek-v3.1-terminus Deepseek V3.1 Terminus | $0.27 | $1.00 | $0.14 | — | 3w ago | |
| deepseek/deepseek-v3.2 Deepseek V3.2 | $0.27 | $0.40 | $0.13 | — | 3w ago | |
| deepseek/deepseek-v3.2-exp Deepseek V3.2 Exp | $0.27 | $0.41 | — | — | 3w ago | |
| google/gemma-3-27b-it Gemma 3 27B It | $0.12 | $0.20 | — | — | 3w ago | |
| kwaipilot/kat-coder-pro Kat Coder Pro | $0.30 | $1.20 | $0.06 | — | 3w ago | |
| meta-llama/llama-3-70b-instruct Llama 3 70B Instruct | $0.51 | $0.74 | — | — | 3w ago | |
| meta-llama/llama-3-8b-instruct Llama 3 8B Instruct | $0.04 | $0.04 | — | — | 3w ago | |
| meta-llama/llama-3.3-70b-instruct Llama 3.3 70B Instruct | $0.14 | $0.40 | — | — | 3w ago | |
| meta-llama/llama-4-maverick-17b-128e-instruct-fp8 Llama 4 Maverick 17B 128e Instruct FP8 | $0.27 | $0.85 | — | — | 3w ago | |
| meta-llama/llama-4-scout-17b-16e-instruct Llama 4 Scout 17B 16e Instruct | $0.18 | $0.59 | — | — | 3w ago | |
| microsoft/wizardlm-2-8x22b Wizardlm 2 8x22b | $0.62 | $0.62 | — | — | 3w ago | |
| minimax/minimax-m2.1 Minimax M2.1 | $0.30 | $1.20 | $0.03 | — | 3w ago | |
| minimaxai/minimax-m1-80k Minimax M1 80k | $0.55 | $2.20 | — | — | 3w ago | |
| moonshotai/kimi-k2-0905 Kimi K2 | $0.60 | $2.50 | — | — | 3w ago | |
| moonshotai/kimi-k2-instruct Kimi K2 Instruct | $0.57 | $2.30 | — | — | 3w ago | |
| moonshotai/kimi-k2-thinking Kimi K2 Thinking | $0.60 | $2.50 | $0.15 | — | 3w ago | |
| moonshotai/kimi-k2.5 Kimi K2.5 | $0.60 | $3.00 | $0.10 | — | 3w ago | |
| Nous-Hermes-2-Mixtral-8x7B-DPO Nous Hermes 2 Mixtral 8x7b Dpo | $0.27 | $0.27 | — | — | 3w ago | |
| nousresearch/hermes-2-pro-llama-3-8b Hermes 2 Pro Llama 3 8B | $0.14 | $0.14 | — | — | 3w ago | |
| nousresearch/nous-hermes-llama2-13b Nous Hermes Llama2 13B | $0.26 | $0.26 | — | — | 3w ago | |
| openai/gpt-oss-120b GPT Oss 120B | $0.05 | $0.25 | — | — | 3w ago | |
| qwen/qwen-2.5-72b-instruct Qwen 2.5 72B Instruct | $0.38 | $0.40 | — | — | 3w ago | |
| qwen/qwen2.5-7b-instruct Qwen2.5 7B Instruct | $0.07 | $0.07 | — | — | 3w ago | |
| qwen/qwen2.5-vl-72b-instruct Qwen2.5 VL 72B Instruct | $0.80 | $0.80 | — | — | 3w ago | |
| qwen/qwen3-235b-a22b-fp8 Qwen3 235B A22b FP8 | $0.20 | $0.80 | — | — | 3w ago | |
| qwen/qwen3-235b-a22b-instruct-2507 Qwen3 235B A22b Instruct | $0.09 | $0.58 | — | — | 3w ago | |
| qwen/qwen3-235b-a22b-thinking-2507 Qwen3 235B A22b Thinking | $0.30 | $3.00 | — | — | 3w ago | |
| qwen/qwen3-30b-a3b-fp8 Qwen3 30B A3b FP8 | $0.09 | $0.45 | — | — | 3w ago | |
| qwen/qwen3-32b-fp8 Qwen3 32B FP8 | $0.10 | $0.45 | — | — | 3w ago | |
| qwen/qwen3-4b-fp8 Qwen3 4B FP8 | $0.03 | $0.03 | — | — | 3w ago | |
| qwen/qwen3-coder-30b-a3b-instruct Qwen3 Coder 30B A3b Instruct | $0.07 | $0.27 | — | — | 3w ago | |
| qwen/qwen3-coder-480b-a35b-instruct Qwen3 Coder 480B A35b Instruct | $0.30 | $1.30 | — | — | 3w ago | |
| qwen/qwen3-coder-next Qwen3 Coder Next | $0.20 | $1.50 | — | — | 3w ago | |
| qwen/qwen3-max Qwen3 Max | $2.11 | $8.45 | — | — | 3w ago | |
| qwen/qwen3-vl-235b-a22b-instruct Qwen3 VL 235B A22b Instruct | $0.30 | $1.50 | — | — | 3w ago | |
| qwen/qwen3-vl-235b-a22b-thinking Qwen3 VL 235B A22b Thinking | $0.98 | $3.95 | — | — | 3w ago | |
| sao10k/l3-70b-euryale-v2.1 L3 70B Euryale V2.1 | $1.48 | $1.48 | — | — | 3w ago | |
| sao10k/l3-8b-lunaris L3 8B Lunaris | $0.05 | $0.05 | — | — | 3w ago | |
| zai-org/autoglm-phone-9b-multilingual Autoglm Phone 9B Multilingual | $0.04 | $0.14 | — | — | 3w ago | |
| zai-org/glm-4.7 Glm 4.7 | $0.60 | $2.20 | $0.11 | — | 3w ago | |
| xiaomi/mimo-v2-flash Mimo V2 Flash | — | — | — | — | 2mo ago | |
| baichuan/baichuan-m2-32b Baichuan M2 32B | — | — | — | — | 2mo ago | |
| deepseek/deepseek-r1-distill-qwen-14b Deepseek R1 Distill Qwen 14B | — | — | — | — | 2mo ago | |
| deepseek/deepseek-r1-distill-qwen-32b Deepseek R1 Distill Qwen 32B | — | — | — | — | 2mo ago | |
| google/gemma-3-12b-instruct Gemma 3 12B Instruct | — | — | — | — | 2mo ago | |
| gryphe/mythomax-l2-13b Mythomax L2 13B | $0.19 | $0.19 | — | — | 2mo ago | |
| meta-llama/llama-3.2-3b-instruct Llama 3.2 3B Instruct | — | — | — | — | 2mo ago | |
| mistralai/mistral-nemo-7b-instruct Mistral Nemo 7B Instruct | — | — | — | — | 2mo ago | |
| qwen/qwen3-8b-fp8 Qwen3 8B FP8 | — | — | — | — | 2mo ago | |
| sao10k/l3-1-70b-euryale-v2.2 L3 1 70B Euryale V2.2 | — | — | — | — | 2mo ago | |
| sao10k/l3-8b-stheno-v3.2 L3 8B Stheno V3.2 | — | — | — | — | 2mo ago | |
| teknium/openhermes-2.5-mistral-7b Openhermes 2.5 Mistral 7B | $0.17 | $0.17 | — | — | 2mo ago | |
| zai-org/glm-4.6-vision Glm 4.6 Vision | — | — | — | — | 2mo ago | |
| zai-org/glm-4.6-vision-air Glm 4.6 Vision Air | — | — | — | — | 2mo ago | |
| lzlv_70b Lzlv 70B | $0.70 | $0.80 | — | — | 4mo ago |