Blog

Benchmark analysis, model rankings, and practical guides for builders.

Rankings-Feb 9, 2026-12 min read

Best Open Source LLMs February 2026

GLM-4.7 holds #1, Kimi K2.5 debuts at #2 with 96% AIME. Full rankings, Ollama picks by VRAM tier, and self-hosting guide.

Read article

Model Rankings

View all

All Articles

DeepSeek models: what to use, what to skip, and where to run them

A data-driven DeepSeek hub that connects benchmarks to provider reality: pricing, speed, and time to first token across hosts.

GuideJan 15, 2026

MiniMax models: what to use, what to skip, and where to run them

A MiniMax hub for builders. Understand M1 vs M2 vs M2.1 and compare providers by price, speed, and time to first token.

GuideJan 15, 2026

Top 3 AI Models January 2026: Our Expert Picks

Claude Opus 4.5 for reasoning, GLM-4.7 for open source dominance, and Gemini 3 Pro for speed.

OpinionJan 9, 2026

The unspoken bottleneck reshaping artificial intelligence

The narrative around AI has long revolved around compute power. As we enter 2026, a quieter shift is underway. Memory, not chips, is becoming the constraint.

Deep DiveJan 8, 2026

January 2026: Open source vs proprietary LLMs compared

We compared 94 model endpoints. The gap between open source and proprietary has shrunk to 5-7 quality index points.

AnalysisJan 2, 2026

2025 AI Year in Review: The Year Intelligence Became Infrastructure

Twelve months ago, we were debating whether AI could reason. Now we're debating who owns the reasoning.

Year in ReviewDec 26, 2025

The Open Source Revolution: How December 2025 Changed Everything

DeepSeek V3.2 hit 96% on AIME 2025. Xiaomi dropped a frontier model. GLM-4.7 claimed the coding crown.

Deep DiveDec 26, 2025

Three Forces That Broke OpenAI's Moat

OpenAI declared "Code Red" in December 2025. China's 15 open-weight models, the efficiency revolution, and custom silicon converged.

IndustryDec 3, 2025

The State of LLMs: December 2025

Analysis of 114 models reveals a market in transition: benchmarks saturating, open-weight matching proprietary at 10x lower cost.

AnalysisDec 3, 2025

Gemini 3 Pro vs GPT 5.1 vs Claude Opus 4.5

The AI arms race explodes in November 2025 with three frontier releases in 12 days. Benchmarks, pricing, and where each dominates.

ComparisonNov 25, 2025

Kimi K2 Thinking vs ChatGPT 5.1: Reasoning Showdown

Moonshot AI's open-weight MoE takes on OpenAI's proprietary GPT-5.1. Architecture, benchmarks, pricing.

ComparisonNov 18, 2025

Kimi K2 Thinking: How Open Weights Are Catching GPT-5

Moonshot AI's K2 Thinking lands a 67 on Artificial Analysis, sets agentic records, proves open weights compete on reasoning.

AnalysisNov 2025

Open Source vs Proprietary LLMs: 2025 Benchmark Analysis

A 94-model deep dive covering quality, price, and speed deltas across the 2025 LLM landscape.

AnalysisOct 2025

Top Open Source LLMs October 2025: Complete Guide

DeepSeek V3.1, Qwen3-235B, GLM-4.6 - performance benchmarks, pricing, and deployment insights.

GuideOct 2025

Why AI Outputs Are Turning Into Repetitive Slop

Lessons from Andrej Karpathy: model collapse, low-entropy outputs, and how to push past the slop.

OpinionOct 2025

GLM-4.5 vs Kimi-K2: Battle of the Agentic AI Giants

Z.ai's GLM-4.5 hybrid reasoning vs Moonshot AI's Kimi-K2 1T parameter architecture.

ComparisonAug 12, 2025

GLM-4.5 vs Qwen3-235B: The Ultimate Comparison

Z.ai's GLM-4.5 vs Alibaba's Qwen3-235B with massive parameter count and FP8 optimization.

ComparisonAug 12, 2025

Find the right model for your use case

Compare 100+ LLMs by price, speed, and benchmarks.