Best Open Source LLMs February 2026
GLM-4.7 holds #1, Kimi K2.5 debuts at #2 with 96% AIME. Full rankings, Ollama picks by VRAM tier, and self-hosting guide.
Read articleBenchmark analysis, model rankings, and practical guides for builders.
GLM-4.7 holds #1, Kimi K2.5 debuts at #2 with 96% AIME. Full rankings, Ollama picks by VRAM tier, and self-hosting guide.
Read articleA data-driven DeepSeek hub that connects benchmarks to provider reality: pricing, speed, and time to first token across hosts.
A MiniMax hub for builders. Understand M1 vs M2 vs M2.1 and compare providers by price, speed, and time to first token.
Claude Opus 4.5 for reasoning, GLM-4.7 for open source dominance, and Gemini 3 Pro for speed.
The narrative around AI has long revolved around compute power. As we enter 2026, a quieter shift is underway. Memory, not chips, is becoming the constraint.
We compared 94 model endpoints. The gap between open source and proprietary has shrunk to 5-7 quality index points.
Twelve months ago, we were debating whether AI could reason. Now we're debating who owns the reasoning.
DeepSeek V3.2 hit 96% on AIME 2025. Xiaomi dropped a frontier model. GLM-4.7 claimed the coding crown.
OpenAI declared "Code Red" in December 2025. China's 15 open-weight models, the efficiency revolution, and custom silicon converged.
Analysis of 114 models reveals a market in transition: benchmarks saturating, open-weight matching proprietary at 10x lower cost.
The AI arms race explodes in November 2025 with three frontier releases in 12 days. Benchmarks, pricing, and where each dominates.
Moonshot AI's open-weight MoE takes on OpenAI's proprietary GPT-5.1. Architecture, benchmarks, pricing.
Moonshot AI's K2 Thinking lands a 67 on Artificial Analysis, sets agentic records, proves open weights compete on reasoning.
A 94-model deep dive covering quality, price, and speed deltas across the 2025 LLM landscape.
DeepSeek V3.1, Qwen3-235B, GLM-4.6 - performance benchmarks, pricing, and deployment insights.
Lessons from Andrej Karpathy: model collapse, low-entropy outputs, and how to push past the slop.
Z.ai's GLM-4.5 hybrid reasoning vs Moonshot AI's Kimi-K2 1T parameter architecture.
Z.ai's GLM-4.5 vs Alibaba's Qwen3-235B with massive parameter count and FP8 optimization.
Compare 100+ LLMs by price, speed, and benchmarks.