Live Pricing
3,169 models across all major providers
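The Input and Output columns in the table that follows are quoted in dollars per 1M tokens, so the cost of a single request is each token count multiplied by its price and divided by one million. A minimal sketch of that arithmetic, assuming prices are read straight from the table cells; the helper names `parse_price` and `request_cost` and the token counts are hypothetical, chosen only for illustration:

```python
from decimal import Decimal
from typing import Optional

TOKENS_PER_MILLION = Decimal(1_000_000)
MISSING = "\u2014"  # the dash placeholder the table uses when no price is listed


def parse_price(cell: str) -> Optional[Decimal]:
    """Turn a table cell such as "$0.40" into a Decimal; None if no price is listed."""
    cell = cell.strip()
    if cell in ("", MISSING):
        return None
    return Decimal(cell.lstrip("$"))


def request_cost(
    input_price: Optional[Decimal],
    output_price: Optional[Decimal],
    input_tokens: int,
    output_tokens: int,
) -> Optional[Decimal]:
    """Dollar cost of one request; None when either price is not listed."""
    if input_price is None or output_price is None:
        return None
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MILLION


# Illustrative request: 250,000 input tokens and 50,000 output tokens (assumed numbers),
# priced with the openai/gpt-oss-120b row ($0.04 input, $0.40 output).
cost = request_cost(parse_price("$0.04"), parse_price("$0.40"), 250_000, 50_000)
print(cost)

# A row with a missing input price (for example kimi-k2.5 on Ollama Cloud) cannot be costed.
print(request_cost(parse_price(MISSING), parse_price("$3.00"), 250_000, 50_000))
```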
| Model | Provider | Input $/1M tokens | Output $/1M tokens | Context | Max Output |
|---|---|---|---|---|---|
| deepseek-ai/deepseek-v3.1 | Nvidia | — | — | — | — |
| deepseek-r1 | Nvidia | — | — | — | — |
| deepseek-v3.1 | Nvidia | — | — | — | — |
| flux_1-dev | Nvidia | — | — | — | — |
| gemma-3-27b-it | Nvidia | — | — | — | — |
| google/gemma-2-27b-it | Nvidia | $0.65 | $0.65 | — | — |
| google/gemma-3-27b-it | Nvidia | — | — | — | — |
| llama-3.1-nemotron-ultra-253b-v1 | Nvidia | — | — | — | — |
| llama-3.3-nemotron-super-49b-v1.5 | Nvidia | — | — | — | — |
| microsoft/phi-3-medium-128k-instruct | Nvidia | $1.00 | $1.00 | — | — |
| microsoft/phi-4-mini-instruct | Nvidia | — | — | — | — |
| mistral-small-3.1-24b-instruct-2503 | Nvidia | — | — | — | — |
| moonshotai/kimi-k2-0905-preview | Nvidia | — | — | — | — |
| moonshotai/kimi-k2.5 | Nvidia | $0.45 | $2.20 | — | — |
| moonshotai/kimi-k2-instruct | Nvidia | — | — | — | — |
| moonshotai/kimi-k2-instruct-0905 | Nvidia | — | — | — | — |
| moonshotai/kimi-k2-thinking | Nvidia | $0.47 | $2.00 | — | — |
| nemoretriever-ocr-v1 | Nvidia | — | — | — | — |
| nvidia/cosmos-nemotron-34b | Nvidia | — | — | — | — |
| nvidia/llama-3.1-nemotron-70b-instruct | Nvidia | $1.20 | $1.20 | — | — |
| nvidia/llama-3.1-nemotron-ultra-253b-v1 | Nvidia | $0.60 | $1.80 | — | — |
| nvidia/llama-3.3-nemotron-super-49b-v1 | Nvidia | $0.10 | $0.40 | — | — |
| nvidia/llama-3.3-nemotron-super-49b-v1.5 | Nvidia | $0.10 | $0.40 | — | — |
| nvidia/nemoretriever-ocr-v1 | Nvidia | — | — | — | — |
| nvidia/parakeet-tdt-0.6b-v2 | Nvidia | — | — | — | — |
| openai/gpt-oss-120b | Nvidia | $0.04 | $0.40 | — | — |
| openai/whisper-large-v3 | Nvidia | — | — | — | — |
| parakeet-tdt-0.6b-v2 | Nvidia | — | — | — | — |
| phi-4-multimodal-instruct | Nvidia | — | — | — | — |
| qwen3-235b-a22b | Nvidia | — | — | — | — |
| qwen3-coder-480b-a35b-instruct | Nvidia | — | — | — | — |
| qwen/qwen2.5-coder-7b-instruct | Nvidia | $0.03 | $0.09 | — | — |
| qwen/qwen3-235b-a22b | Nvidia | $0.11 | $0.60 | — | — |
| qwen/qwen3-coder-480b-a35b-instruct | Nvidia | — | — | — | — |
| qwen/qwen3-next-80b-a3b-instruct | Nvidia | $0.09 | $1.10 | — | — |
| qwen/qwen3-next-80b-a3b-thinking | Nvidia | $0.15 | $1.20 | — | — |
| qwen/qwq-32b | Nvidia | $0.15 | $0.40 | — | — |
| whisper-large-v3 | Nvidia | — | — | — | — |
| deepseek-v3.2 | Ollama Cloud | $0.40 | $1.20 | — | — |
| gemini-3-flash-preview | Ollama Cloud | $0.50 | $3.00 | — | — |
| gemini-3-pro-preview | Ollama Cloud | $2.00 | $12.00 | — | — |
| glm-4.6 | Ollama Cloud | $0.30 | $0.90 | — | — |
| glm-4.7 | Ollama Cloud | $0.06 | $0.40 | — | — |
| glm-5 | Ollama Cloud | $0.30 | $2.55 | — | — |
| kimi-k2.5 | Ollama Cloud | — | $3.00 | — | — |
| kimi-k2-thinking | Ollama Cloud | $0.47 | $2.00 | — | — |
| minimax-m2 | Ollama Cloud | $0.29 | $1.20 | — | — |
| minimax-m2.1 | Ollama Cloud | $0.27 | $0.95 | — | — |
| minimax-m2.5 | Ollama Cloud | $0.29 | $1.20 | — | — |
| qwen3-coder-next | Ollama Cloud | $0.12 | $0.75 | — | — |