#scaling law

24 related articles

O3 vs Gemini 2.5 Pro vs Claude 3.7: Re…

2026年5月30日·3 min

O3 vs Gemini 2.5 Pro vs Claude 3.7: Real-World AI Coding Ability Comparison

Real-world comparison of O3, Gemini 2.5 Pro, and Claude 3.7 coding abilities through snake battles, RL training, solar system simulation, and soccer game tasks.

Meta Muse Spark Technical Deep Dive: How Three-Dimensional Scaling Achieves 10x Compute Reduction

Research

2026年5月28日·2 min

Meta Muse Spark Technical Deep Dive: How Three-Dimensional Scaling Achieves 10x Compute Reduction

Meta reveals Muse Spark technical details: three-dimensional scaling across pre-training, RL, and test-time inference achieves over 10x compute reduction versus Llama 4 Maverick.

GLM5 Architecture Leaked: 745B Parameters, DeepSeek V4 May Launch Quantized Smaller Model First

Tech Frontiers

2026年5月27日·2 min

GLM5 Architecture Leaked: 745B Parameters, DeepSeek V4 May Launch Quantized Smaller Model First

GLM5 code leak reveals 745B-parameter MoE architecture replicating DeepSeek V3. DeepSeek V4 may launch a 200B quantized model first, with flagship exceeding 1T parameters.

Claude Code Sub-Agents and Cursor BugBot Launch: AI Programming Tools Get Major Upgrades

Tech Frontiers

2026年5月27日·3 min

Claude Code Sub-Agents and Cursor BugBot Launch: AI Programming Tools Get Major Upgrades

Anthropic adds custom sub-agents to Claude Code, Cursor launches code review Agent BugBot, Qwen releases 92-language translation model, and Google unveils three experimental AI products.