# TokenMonopoly — AI coding deals leaderboard

> Live pricing and SWE-bench scores for AI APIs used in coding harnesses. Prices refreshed daily.

- Last updated: 2026-06-09T07:07:57.115Z
- Providers tracked: 100
- Models listed: 39
- Total host offers: 926
- Canonical HTML view: https://tokenmonopoly.com/

## Methodology

- **Input / Output $/MTok** — pay-per-token is direct; subscription offers are normalized to an effective rate assuming 5M input + 1M output tokens/day.
- **SWE-bench** — accuracy scores scraped from vals.ai's public SWE-bench leaderboard and fuzzy-matched to each model's canonical name.
- **Price/perf score** — `SWE-bench ÷ effective blended $/MTok`. Higher is better. Models with no SWE-bench score are ranked last.
- **Hosts** — most models are served by multiple hosts (e.g. Anthropic, Bedrock, Vertex; or DeepInfra, Together, Groq for open models). The main table shows the cheapest host per model and the host count; per-host breakdowns follow.

## Ranked models

| # | Model | Cheapest Host | Hosts | SWE-bench | Input $/MTok | Output $/MTok | Save | Type |
|---|-------|---------------|------:|----------:|-------------:|--------------:|-----:|------|
| 1 | MiniMax M2.1 | AtlasCloud | 6 | 74.8% | $0.29 | $0.95 | −26% | closed |
| 2 | GPT-5.4 Nano | Azure | 2 | 69.8% | $0.20 | $1.25 | — | closed |
| 3 | MiniMax M2.7 | DekaLLM | 10 | 73.8% | $0.26 | $1.20 | −54% | closed |
| 4 | MiniMax M3 | SiliconFlow | 2 | 75.0% | $0.30 | $1.20 | — | closed |
| 5 | DeepSeek V4 Pro | DeepSeek | 15 | 77.4% | $0.43 | $0.87 | −89% | open |
| 6 | Gemini 3.1 Flash Lite Preview | Google AI Studio | 2 | 62.8% | $0.25 | $1.50 | — | closed |
| 7 | Qwen3.6 Plus | Alibaba | 1 | 73.4% | $0.33 | $1.95 | — | open |
| 8 | Qwen3.6 27B | Chutes | 9 | 70.0% | $0.29 | $2.00 | −47% | open |
| 9 | GLM 4.7 | DekaLLM | 15 | 69.4% | $0.38 | $1.74 | −74% | closed |
| 10 | GPT-5 Mini | OpenAI | 2 | 60.8% | $0.25 | $2.00 | — | closed |
| 11 | Kimi K2.5 | ModelRun | 18 | 70.0% | $0.40 | $1.90 | −38% | open |
| 12 | Devstral 2 2512 | Mistral | 1 | 62.8% | $0.40 | $2.00 | — | open |
| 13 | GLM 5 | GMICloud | 19 | 71.4% | $0.60 | $1.92 | −48% | closed |
| 14 | Nemotron 3 Ultra | DeepInfra | 1 | 69.0% | $0.50 | $2.50 | — | open |
| 15 | Gemini 3 Flash Preview | Google AI Studio | 2 | 75.0% | $0.50 | $3.00 | — | closed |
| 16 | Kimi K2 0711 | Novita | 2 | 60.2% | $0.57 | $2.30 | −6% | open |
| 17 | Kimi K2.6 | Io Net | 22 | 76.2% | $0.68 | $3.40 | −35% | open |
| 18 | GLM 5.1 | GMICloud | 21 | 76.4% | $0.98 | $3.08 | −44% | closed |
| 19 | GPT-5.4 Mini | OpenAI | 2 | 73.0% | $0.75 | $4.50 | — | closed |
| 20 | Grok 4.3 | Xai | 1 | 71.4% | $1.25 | $2.50 | — | closed |
| 21 | Qwen3.7 Max | Alibaba | 1 | 68.8% | $1.25 | $3.75 | — | open |
| 22 | Claude Haiku 4.5 | Amazon Bedrock | 3 | 66.6% | $1.00 | $5.00 | — | closed |
| 23 | Qwen3.6 Max Preview | Alibaba | 1 | 72.8% | $1.04 | $6.24 | — | open |
| 24 | Gemini 3.5 Flash | Google AI Studio | 2 | 78.8% | $1.50 | $9.00 | — | closed |
| 25 | GPT-5.1 | Azure | 2 | 69.8% | $1.25 | $10.00 | — | closed |
| 26 | GPT-5 | Azure | 2 | 69.0% | $1.25 | $10.00 | — | closed |
| 27 | Gemini 3.1 Pro Preview | Google AI Studio | 2 | 78.8% | $2.00 | $12.00 | — | closed |
| 28 | GPT-5.3-Codex | OpenAI | 2 | 78.0% | $1.75 | $14.00 | — | closed |
| 29 | Gemini 2.5 Pro | Google AI Studio | 2 | 54.4% | $1.25 | $10.00 | — | closed |
| 30 | GPT-5.2 | OpenAI | 2 | 75.8% | $1.75 | $14.00 | — | closed |
| 31 | GPT-5.2-Codex | OpenAI | 2 | 72.4% | $1.75 | $14.00 | — | closed |
| 32 | GPT-5.4 | OpenAI | 2 | 78.2% | $2.50 | $15.00 | — | closed |
| 33 | Claude Sonnet 4.6 | Amazon Bedrock | 4 | 77.4% | $3.00 | $15.00 | — | closed |
| 34 | Claude Sonnet 4.5 | Amazon Bedrock | 3 | 70.0% | $3.00 | $15.00 | — | closed |
| 35 | Claude Opus 4.8 | Google AI Studio | 3 | 88.6% | $5.00 | $25.00 | — | closed |
| 36 | Claude Opus 4.7 | Google AI Studio | 3 | 82.0% | $5.00 | $25.00 | — | closed |
| 37 | Claude Opus 4.6 | Google AI Studio | 4 | 78.2% | $5.00 | $25.00 | — | closed |
| 38 | Claude Opus 4.5 | Amazon Bedrock | 3 | 76.4% | $5.00 | $25.00 | — | closed |
| 39 | GPT-5.5 | OpenAI | 2 | 82.6% | $5.00 | $30.00 | — | closed |

## Host breakdown

### MiniMax M2.1

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| AtlasCloud | $0.29 | $0.95 | fp8 | 98.2% |
| Nebius | $0.30 | $1.20 | fp8 | 100.0% |
| Fireworks AI | $0.30 | $1.20 | — | — |
| Novita | $0.30 | $1.20 | fp8 | 97.8% |
| Minimax | $0.30 | $1.20 | fp8 | 99.4% |
| Venice | $0.35 | $1.50 | — | — |

### GPT-5.4 Nano

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Azure | $0.20 | $1.25 | — | 100.0% |
| OpenAI | $0.20 | $1.25 | — | 93.5% |

### MiniMax M2.7

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| DekaLLM | $0.26 | $1.20 | fp4 | 100.0% |
| Morph | $0.28 | $1.20 | — | 95.1% |
| AtlasCloud | $0.30 | $1.20 | fp8 | 99.3% |
| Fireworks AI | $0.30 | $1.20 | — | 100.0% |
| Together | $0.30 | $1.20 | fp4 | 99.3% |
| Minimax | $0.30 | $1.20 | fp8 | 97.8% |
| Novita | $0.30 | $1.20 | fp8 | 100.0% |
| DeepInfra | $0.30 | $1.20 | fp8 | 100.0% |
| Mara | $0.30 | $1.20 | — | 98.3% |
| SambaNova | $0.60 | $2.40 | — | 100.0% |

### MiniMax M3

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| SiliconFlow | $0.30 | $1.20 | fp8 | 99.4% |
| Minimax | $0.30 | $1.20 | fp8 | 100.0% |

### DeepSeek V4 Pro

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| DeepSeek | $0.43 | $0.87 | — | 100.0% |
| Baidu | $0.76 | $1.52 | fp8 | 99.2% |
| StreamLake | $0.87 | $1.74 | — | 99.5% |
| DeepInfra | $1.30 | $2.60 | fp4 | 99.2% |
| GMICloud | $1.39 | $2.78 | fp8 | 99.4% |
| DigitalOcean | $1.48 | $2.96 | — | 97.8% |
| SiliconFlow | $1.60 | $3.13 | fp8 | 98.7% |
| Novita | $1.60 | $3.20 | fp8 | 99.9% |
| Alibaba | $1.61 | $3.22 | — | 99.4% |
| AtlasCloud | $1.68 | $3.38 | fp8 | 99.4% |
| Parasail | $1.74 | $3.48 | fp8 | 90.6% |
| Fireworks AI | $1.74 | $3.48 | — | 94.9% |
| Venice | $1.73 | $3.80 | — | 94.7% |
| Together | $2.10 | $4.40 | — | 97.5% |
| Io Net | $4.45 | $5.50 | — | — |

### Gemini 3.1 Flash Lite Preview

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $0.25 | $1.50 | — | 98.4% |
| Google AI Studio | $0.25 | $1.50 | — | 98.9% |

### Qwen3.6 27B

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Chutes | $0.30 | $2.00 | fp8 | — |
| Morph | $0.29 | $2.40 | — | 97.5% |
| Alibaba | $0.45 | $2.70 | — | 100.0% |
| Io Net | $0.29 | $3.20 | fp8 | 91.3% |
| SiliconFlow | $0.30 | $3.20 | fp8 | — |
| DeepInfra | $0.32 | $3.20 | fp8 | 96.9% |
| Ambient | $0.32 | $3.20 | — | 98.2% |
| Venice | $0.33 | $3.25 | fp8 | — |
| WandB | $0.60 | $3.60 | fp8 | 100.0% |

### GLM 4.7

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| DekaLLM | $0.38 | $1.74 | fp4 | — |
| Chutes | $0.39 | $1.75 | bf16 | 100.0% |
| DeepInfra | $0.40 | $1.75 | fp4 | 100.0% |
| StreamLake | $0.48 | $1.76 | — | 95.8% |
| AtlasCloud | $0.52 | $1.85 | fp8 | 100.0% |
| Nebius | $0.40 | $2.00 | fp8 | 93.9% |
| Novita | $0.54 | $1.98 | fp8 | 97.6% |
| Parasail | $0.45 | $2.10 | fp8 | 99.4% |
| SiliconFlow | $0.45 | $2.20 | fp8 | 100.0% |
| Fireworks AI | $0.60 | $2.20 | — | — |
| Z Ai | $0.60 | $2.20 | — | 85.7% |
| Google AI Studio | $0.60 | $2.20 | — | 99.9% |
| Venice | $0.55 | $2.65 | fp4 | 99.4% |
| Phala | $0.85 | $3.30 | — | — |
| Cerebras | $2.25 | $2.75 | fp16 | 100.0% |

### GPT-5 Mini

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| OpenAI | $0.25 | $2.00 | — | 100.0% |
| Azure | $0.25 | $2.00 | — | — |

### Kimi K2.5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| ModelRun | $0.40 | $1.90 | fp4 | 100.0% |
| Ambient | $0.40 | $1.98 | — | 99.1% |
| Chutes | $0.44 | $2.00 | int4 | 99.7% |
| Io Net | $0.45 | $2.00 | int4 | 99.5% |
| Inceptron | $0.44 | $2.20 | int4 | 99.8% |
| DeepInfra | $0.45 | $2.25 | fp4 | 100.0% |
| SiliconFlow | $0.45 | $2.25 | int4 | 100.0% |
| AtlasCloud | $0.49 | $2.50 | int4 | 99.9% |
| StreamLake | $0.54 | $2.70 | — | 100.0% |
| Together | $0.50 | $2.80 | — | 98.7% |
| Parasail | $0.60 | $2.80 | int4 | 99.8% |
| Novita | $0.57 | $2.85 | — | 99.9% |
| BaseTen | $0.60 | $3.00 | fp4 | — |
| Cloudflare | $0.60 | $3.00 | — | 99.8% |
| Phala | $0.60 | $3.00 | — | 99.5% |
| Moonshot AI | $0.60 | $3.00 | int4 | 100.0% |
| Fireworks AI | $0.60 | $3.00 | — | — |
| Venice | $0.56 | $3.50 | — | 98.6% |

### GLM 5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| GMICloud | $0.60 | $1.92 | fp8 | 99.9% |
| DeepInfra | $0.60 | $2.08 | fp4 | 100.0% |
| StreamLake | $0.65 | $2.08 | — | 100.0% |
| Baidu | $0.70 | $2.24 | fp8 | 100.0% |
| Ambient | $0.72 | $2.30 | fp8 | 99.3% |
| SiliconFlow | $0.95 | $2.55 | fp8 | 100.0% |
| Chutes | $0.95 | $2.55 | fp8 | — |
| BaseTen | $0.95 | $3.15 | fp4 | — |
| AtlasCloud | $0.95 | $3.15 | fp8 | 100.0% |
| Nebius | $1.00 | $3.20 | fp4 | 99.1% |
| Fireworks AI | $1.00 | $3.20 | — | — |
| Amazon Bedrock | $1.00 | $3.20 | — | — |
| Friendli | $1.00 | $3.20 | — | 100.0% |
| Novita | $1.00 | $3.20 | fp8 | 100.0% |
| Z Ai | $1.00 | $3.20 | — | 99.9% |
| Parasail | $1.00 | $3.20 | fp8 | 100.0% |
| Together | $1.00 | $3.20 | — | — |
| Venice | $1.00 | $3.20 | fp8 | — |
| Phala | $1.20 | $3.50 | — | — |

### Gemini 3 Flash Preview

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $0.50 | $3.00 | — | 99.4% |
| Google AI Studio | $0.50 | $3.00 | — | 96.6% |

### Kimi K2 0711

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Novita | $0.57 | $2.30 | fp8 | 100.0% |
| Moonshot AI | $0.60 | $2.50 | fp8 | 68.6% |

### Kimi K2.6

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Io Net | $0.68 | $3.41 | int4 | 100.0% |
| Baidu | $0.68 | $3.42 | fp4 | 99.8% |
| Novita | $0.80 | $3.40 | — | 100.0% |
| DigitalOcean | $0.81 | $3.40 | — | 99.4% |
| Inceptron | $0.73 | $3.50 | int4 | 98.5% |
| Chutes | $0.74 | $3.50 | int4 | 99.6% |
| Cloudflare | $0.74 | $3.50 | — | 99.4% |
| Parasail | $0.75 | $3.50 | int4 | 99.7% |
| DeepInfra | $0.75 | $3.50 | fp4 | 100.0% |
| StreamLake | $0.85 | $3.60 | — | 99.7% |
| SiliconFlow | $0.77 | $4.00 | fp8 | 100.0% |
| BaseTen | $0.95 | $4.00 | fp4 | — |
| Ambient | $0.95 | $4.00 | — | 98.4% |
| Nebius | $0.95 | $4.00 | int4 | 99.8% |
| Moonshot AI | $0.95 | $4.00 | int4 | 100.0% |
| WandB | $0.95 | $4.00 | fp4 | 100.0% |
| AtlasCloud | $0.95 | $4.00 | int4 | 99.9% |
| AkashML | $0.95 | $4.00 | int4 | — |
| Fireworks AI | $0.95 | $4.00 | — | — |
| Venice | $0.85 | $4.66 | int4 | 100.0% |
| Phala | $1.09 | $4.60 | — | — |
| Together | $1.20 | $4.50 | — | 95.1% |

### GLM 5.1

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| GMICloud | $0.98 | $3.08 | fp8 | 98.6% |
| Baidu | $0.98 | $3.08 | fp8 | 99.8% |
| DeepInfra | $1.05 | $3.50 | fp4 | 100.0% |
| Ionstream | $1.11 | $3.52 | fp8 | — |
| StreamLake | $1.19 | $3.74 | — | 99.7% |
| Chutes | $1.20 | $4.00 | fp8 | 90.7% |
| AtlasCloud | $1.26 | $3.96 | fp8 | 99.7% |
| Phala | $1.21 | $4.20 | — | 96.0% |
| Io Net | $1.30 | $4.20 | fp8 | 99.6% |
| Morph | $1.40 | $4.20 | — | — |
| BaseTen | $1.30 | $4.30 | fp4 | 74.7% |
| Novita | $1.38 | $4.40 | fp8 | 99.6% |
| Inceptron | $1.40 | $4.40 | fp8 | 99.4% |
| Together | $1.40 | $4.40 | — | 91.9% |
| Parasail | $1.40 | $4.40 | fp8 | 99.5% |
| Fireworks AI | $1.40 | $4.40 | — | 99.4% |
| Z Ai | $1.40 | $4.40 | — | 98.8% |
| SiliconFlow | $1.40 | $4.40 | fp8 | 100.0% |
| Ambient | $1.40 | $4.40 | fp8 | 98.2% |
| Friendli | $1.40 | $4.40 | — | 99.9% |
| Venice | $1.75 | $5.50 | fp8 | — |

### GPT-5.4 Mini

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| OpenAI | $0.75 | $4.50 | — | 45.7% |
| Azure | $0.75 | $4.50 | — | 99.8% |

### Claude Haiku 4.5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Amazon Bedrock | $1.00 | $5.00 | — | — |
| Google AI Studio | $1.00 | $5.00 | — | — |
| Anthropic | $1.00 | $5.00 | — | 100.0% |

### Gemini 3.5 Flash

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $1.50 | $9.00 | — | 99.8% |
| Google AI Studio | $1.50 | $9.00 | — | 99.5% |

### GPT-5.1

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Azure | $1.25 | $10.00 | — | — |
| OpenAI | $1.25 | $10.00 | — | 99.9% |

### GPT-5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Azure | $1.25 | $10.00 | — | 100.0% |
| OpenAI | $1.25 | $10.00 | — | 97.3% |

### Gemini 3.1 Pro Preview

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $2.00 | $12.00 | — | 99.5% |
| Google AI Studio | $2.00 | $12.00 | — | 98.9% |

### GPT-5.3-Codex

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| OpenAI | $1.75 | $14.00 | — | 98.7% |
| Azure | $1.75 | $14.00 | — | 100.0% |

### Gemini 2.5 Pro

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $1.25 | $10.00 | — | 98.7% |
| Google AI Studio | $1.25 | $10.00 | — | 97.9% |

### GPT-5.2

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| OpenAI | $1.75 | $14.00 | — | 99.6% |
| Azure | $1.75 | $14.00 | — | 100.0% |

### GPT-5.2-Codex

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| OpenAI | $1.75 | $14.00 | — | 99.4% |
| Azure | $1.75 | $14.00 | — | — |

### GPT-5.4

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| OpenAI | $2.50 | $15.00 | — | 99.2% |
| Azure | $2.50 | $15.00 | — | 100.0% |

### Claude Sonnet 4.6

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Amazon Bedrock | $3.00 | $15.00 | — | — |
| Anthropic | $3.00 | $15.00 | — | 99.3% |
| Google AI Studio | $3.00 | $15.00 | — | — |
| Azure | $3.00 | $15.00 | — | — |

### Claude Sonnet 4.5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Amazon Bedrock | $3.00 | $15.00 | — | — |
| Google AI Studio | $3.00 | $15.00 | — | — |
| Anthropic | $3.00 | $15.00 | — | 100.0% |

### Claude Opus 4.8

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $5.00 | $25.00 | — | 99.9% |
| Anthropic | $5.00 | $25.00 | — | 100.0% |
| Amazon Bedrock | $5.00 | $25.00 | — | — |

### Claude Opus 4.7

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $5.00 | $25.00 | — | — |
| Amazon Bedrock | $5.00 | $25.00 | — | — |
| Anthropic | $5.00 | $25.00 | — | 99.2% |

### Claude Opus 4.6

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $5.00 | $25.00 | — | — |
| Amazon Bedrock | $5.00 | $25.00 | — | 99.9% |
| Azure | $5.00 | $25.00 | — | — |
| Anthropic | $5.00 | $25.00 | — | 99.4% |

### Claude Opus 4.5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Amazon Bedrock | $5.00 | $25.00 | — | — |
| Anthropic | $5.00 | $25.00 | — | 99.4% |
| Google AI Studio | $5.00 | $25.00 | — | 100.0% |

### GPT-5.5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| OpenAI | $5.00 | $30.00 | — | 99.4% |
| Azure | $5.00 | $30.00 | — | 100.0% |


## Subscription deals

_None currently tracked._
