Not just the cheapestโthe best value. We rank 324 AI models by quality-per-dollar using live pricing data from 91 providers.
Historical snapshot
This page is a dated monthly snapshot. For the live version that is better aligned to current rankings and search intent, use Best AI Models (Live) or jump to Best Open Source LLM.
Value Score = Quality Index รท Price per Million Tokens
Value Score
2870.0
DeepSeek via CoreWeave
Value Score
340.0
Multiverse Computing via CompactifAI
Value Score
335.0
Alibaba via DeepInfra (FP8)
| Rank | Model | Value | Quality | Price/M | Speed | Provider |
|---|---|---|---|---|---|---|
| 1 | DeepSeek V4 Flash (Non-reasoning) DeepSeek | 2870.0 | 28.7 | $0.010 | 73 tok/s | CoreWeave |
| 2 | HyperNova 60B 2605 Multiverse Computing | 340.0 | 22.1 | $0.065 | 351 tok/s | CompactifAI |
| 3 | Qwen3.5 4B (Reasoning) Alibaba | 335.0 | 20.1 | $0.060 | 26 tok/s | DeepInfra (FP8) |
| 4 | gpt-oss-120b (high) OpenAI | 310.1 | 23.8 | $0.077 | 40 tok/s | DeepInfra |
| 5 | Hy3-preview (Reasoning) Tencent | 293.4 | 33.6 | $0.115 | 132 tok/s | SiliconFlow |
| 6 | DeepSeek V4 Flash (Reasoning, Max Effort) DeepSeek | 293.1 | 40.3 | $0.138 | 113 tok/s | GMI |
| 7 | DeepSeek V4 Flash (Reasoning, High Effort) DeepSeek | 272.0 | 37.4 | $0.138 | 112 tok/s | GMI |
| 8 | Llama 3.1 Instruct 8B Meta | 271.1 | 6.1 | $0.022 | 68 tok/s | DeepInfra (Turbo, FP8) |
| 9 | Qwen3.5 4B (Non-reasoning) Alibaba | 266.7 | 16 | $0.060 | 25 tok/s | DeepInfra FP8 |
| 10 | gpt-oss-20B (high) OpenAI | 259.1 | 14.9 | $0.058 | 29 tok/s | DeepInfra |
Top models for high-volume, cost-sensitive workloads.
Self-hostable models with best API value when you can't self-host.
DeepSeek V4 Flash (Non-reasoning)
DeepSeek via CoreWeave
$0.010
Value: 2870.0
HyperNova 60B 2605
Multiverse Computing via CompactifAI
$0.065
Value: 340.0
Qwen3.5 4B (Reasoning)
Alibaba via DeepInfra (FP8)
$0.060
Value: 335.0
gpt-oss-120b (high)
OpenAI via DeepInfra
$0.077
Value: 310.1
Hy3-preview (Reasoning)
Tencent via SiliconFlow
$0.115
Value: 293.4
Use our interactive explorer to compare pricing across all 91 providers. Filter by quality, speed, and price to find your perfect model.
As of January 2026, DeepSeek V4 Flash (Non-reasoning) offers the best value under $1/M at $0.010/M with a quality index of 28.7. For absolute lowest cost, open source models like DeepSeek and Qwen can be self-hosted for near-zero marginal cost after infrastructure.
DeepSeek V4 Flash (Non-reasoning) currently offers the best quality-per-dollar with a value score of 2870.0(Quality Index 28.7 at $0.010/M). This means you get more "intelligence" per dollar spent than any other model.
For most production workloads, no. Models like DeepSeek V3.2 and Gemini Flash deliver 85-95% of GPT-5's quality at 1/10th to 1/20th the cost. Use GPT-5 for: (1) complex multi-step reasoning, (2) tasks where error cost is high, (3) when you need specific OpenAI features. Use budget models for: high-volume chat, content generation, and routine tasks.
Use API if: you have <1M tokens/day, need instant scaling, or lack GPU infrastructure.Self-host if: you have >10M tokens/day (break-even point), need data privacy, or want to fine-tune. At current GPU prices, self-hosting DeepSeek V3 becomes cheaper than API around 5-10M tokens/day depending on your setup.
Self-hostable models
๐ปLiveCodeBench leaders
๐คTool use & agents
๐Expert picks
Data sources: Pricing from Artificial Analysis (live API data). Quality Index from AA Intelligence Index. Updated daily via automated pipeline.See methodology โ