Live model intelligence map

Explore LLMs by the tradeoffs that decide production choices

Compare frontier and open-weight models by intelligence, agentic capability, cost per task, token price, latency, throughput, context, and provider availability.

Live · 337 models · 93 providers · Artificial Analysis Intelligence Index · Updated June 2026

Decision cockpit

Choose by task economics, not leaderboard position alone.

Explore now brings Artificial Analysis v4.1 task metrics into the main model map: agentic strength, per-task cost, response time, context, provider coverage, and token price in one view.

How to use this LLM ranking page

Start with capability, then force the economics

Frontier models often cluster tightly on raw intelligence. Cost per task, response time, provider coverage, and context length usually create the real shortlist.

Move from discovery to a defensible final pick

Use Explore to find the shape of the market, then move into Compare or use the Agentic Fit Finder when task-level reliability and cost matter more than chat quality alone.

Frequently Asked Questions

What is the best LLM in 2026?

Claude Opus 4.8 leads the overall quality ranking right now. The best model for you depends on your use case: coding, cost, speed, and context length all shift the answer.

How do I compare LLMs?

Sort by Quality Index for overall strength, then filter by price, speed, or context window to match your constraints. Move to the Compare page to put 2–4 finalists head to head.
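The sort-then-filter workflow above can be sketched in a few lines of Python. This is only an illustration: the `ModelRow` fields, the `shortlist` helper, and every number in the sample rows are hypothetical placeholders, not values or field names taken from this leaderboard.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelRow:
    # Hypothetical record shape; the real page may expose different fields.
    name: str
    quality_index: float      # higher is better
    price_per_mtok: float     # USD per million tokens (placeholder values)
    tokens_per_second: float  # output throughput
    context_window: int       # maximum context in tokens


def shortlist(rows, max_price: Optional[float] = None,
              min_speed: Optional[float] = None,
              min_context: Optional[int] = None,
              top_n: int = 4):
    """Rank by quality, drop rows that break a constraint, keep top_n."""
    ranked = sorted(rows, key=lambda r: r.quality_index, reverse=True)
    kept = [
        r for r in ranked
        if (max_price is None or r.price_per_mtok <= max_price)
        and (min_speed is None or r.tokens_per_second >= min_speed)
        and (min_context is None or r.context_window >= min_context)
    ]
    return kept[:top_n]


# Made-up rows purely for demonstration.
rows = [
    ModelRow("model-a", 70.0, 30.0, 60.0, 200_000),
    ModelRow("model-b", 65.0, 3.0, 150.0, 128_000),
    ModelRow("model-c", 60.0, 0.5, 220.0, 32_000),
    ModelRow("model-d", 55.0, 0.3, 300.0, 128_000),
]

finalists = shortlist(rows, max_price=5.0, min_context=100_000)
print([r.name for r in finalists])  # ['model-b', 'model-d']
```

The default `top_n=4` mirrors the 2–4 finalists suggested for the Compare page; the highest-quality row drops out here only because it exceeds the price cap.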

Which AI model has the best quality-to-price ratio?

Open-weight models like DeepSeek, Qwen, and Llama often lead on quality-to-price. For agentic workflows, cost per task is usually a better filter than token price alone.
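To see why cost per task can rank models differently from token price, here is a rough sketch. It assumes cost per task is simply tokens consumed times token price, multiplied by the number of attempts needed; that formula, the `cost_per_task` helper, and all numbers are illustrative assumptions, not the method Artificial Analysis uses.

```python
def cost_per_task(input_tokens: int, output_tokens: int,
                  input_price_per_mtok: float, output_price_per_mtok: float,
                  attempts: float = 1.0) -> float:
    """Estimated USD per completed task (assumed formula, see lead-in)."""
    per_attempt = (
        input_tokens * input_price_per_mtok
        + output_tokens * output_price_per_mtok
    ) / 1_000_000
    return per_attempt * attempts


# Hypothetical cheap model: low token price, but it reads more context
# and needs three attempts to finish the task.
cheap = cost_per_task(400_000, 50_000, 1.0, 2.0, attempts=3)

# Hypothetical premium model: higher token price, finishes in one attempt
# with fewer tokens.
premium = cost_per_task(100_000, 20_000, 3.0, 15.0, attempts=1)

print(f"cheap model:   ${cheap:.2f} per task")    # $1.50
print(f"premium model: ${premium:.2f} per task")  # $0.60
```

In this made-up case the model with the higher token price is cheaper per finished task, which is the situation the cost-per-task filter is meant to surface.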

How often is this leaderboard updated?

Data is pulled from Artificial Analysis and refreshed automatically. New models appear as soon as they have benchmark scores and provider endpoints.

Model rankings

Current live rankings

Browse the latest ranking pages for overall models, coding, open source, Ollama, long context, and agentic workflows.

Coding

Best LLM for Coding

Current coding leaderboard using LiveCodeBench, Terminal-Bench, and SciCode.

Open Source

Best Open Source LLM

Top open-weight models for self-hosting, Ollama, and low-cost API use.

Self-Hosted

Best Local LLM

Best local AI models by hardware tier for self-hosting on Macs, RTX GPUs, and workstations.

Ollama

Best Ollama Models

Ollama-first picks for coding, chat, reasoning, and low-friction local inference.

Long Context

Largest Context Window LLM

Best long-context models for large documents, codebases, and retrieval-heavy workflows.

Agents

Best Agentic Models

Rankings for tool use, multi-step execution, and autonomous agent workflows.
