The Five-Layer Evolution of Scaling Law: How Physical AI Opens the Next Growth Curve
The Five-Layer Evolution of Scaling La…
A five-layer Scaling Law framework reveals how Physical AI opens AI's next growth frontier.
This article dissects the five-layer evolution of Scaling Law—from Pre-Training and Post-Training to Test-Time, Agent, and Multi-Agent Scaling—and explores how Physical AI is driving a paradigm shift from virtual to physical worlds. It examines World Models, edge-side inference, model compression, and emotional interaction as key pillars of the next growth curve.
Introduction
Over the past year and more, discussions around Scaling Law in the AI industry have never ceased. From the three technical roadmaps Jensen Huang initially proposed at GTC, to the five-layer Scaling framework that has since evolved, to the rapid rise of Physical AI, the entire industry is undergoing a paradigm shift from the virtual world to the physical world.
This article is based on a talk given in Hangzhou's Yunqi Town by an AI practitioner who left Huawei to join DJI. It outlines the complete evolution of Scaling Law and explores the technological trajectory of the Physical AI era.
The Five-Layer Evolution of Scaling Law
Layer 1: Pre-Training Scaling
Pre-Training Scaling is the most classic and widely recognized approach. The core logic is straightforward: by increasing model parameter scale, training data volume, and compute, the capabilities of foundation models continuously improve.
From GPT-1 to GPT-4, from LLaMA 1 to LLaMA 4, from DeepSeek V1 to V4, all major foundation model iterations validate the same principle—more data, larger models, and more compute yield better results. While this approach is simple and brute-force, it remains the cornerstone of AI capability improvement to this day.
Layer 2: Post-Training Scaling
Post-Training Scaling builds upon pre-training by using instruction fine-tuning, RLHF, DPO, GRPO, and other reinforcement learning techniques to enable model self-learning and improve performance on specific tasks. Representative work includes GPT-4's alignment training, Claude's safety training, and DeepSeek V3's reinforcement learning phase.
The core value of post-training lies in transforming models from "highly capable" to "highly capable and controllable"—which is critical for productization.
Layer 3: Test-Time Scaling
Test-Time Scaling is the most groundbreaking direction of the past two years. Conventional wisdom held that model inference simply produces answers, but practice has shown that investing more compute and time during inference—allowing models to think more deeply and verify their reasoning—yields significantly better results.
OpenAI's O-series models introduced Chain of Thought, and DeepSeek's reasoning models along with multi-path reasoning techniques continue to emerge, proving that "letting models think longer" is itself an effective form of scaling.
Layer 4: Agent Scaling
With large models at the core, Agent Scaling expands peripheral capabilities such as tool calling, Memory, and Planning to raise the ceiling of what a single agent can accomplish on complex tasks. Representative technologies include:
- ReAct framework
- Planning/Soft methods
- Long-term memory mechanisms
- MCP protocol
- Function Call
Agent Scaling marks AI's transition from passive response to proactive execution, opening an entirely new application paradigm.
Layer 5: Multi-Agent Scaling
Once the capability boundaries of individual agents are broken through, multi-agent collaboration becomes the new scaling frontier. Through increasing agent numbers, role specialization, and communication protocols, systems can accomplish complex work far beyond any single agent's capabilities.
Interestingly, the explosion of Multi-Agent has far exceeded expectations. Representative products like Manus, OpenCore, and the emergence of Agent-to-Agent protocols mean that today's programmers are no longer writing code—they're monitoring dozens of agents working in coordination.
The Paradigm Shift from Agentic AI to Physical AI
The Ceiling of the Virtual World
The five-layer Scaling framework has built a fairly complete technology stack within the virtual world. But a core question emerges: will Scaling stop evolving after these five layers?
The answer is no. AI is rapidly moving from Agentic AI in the virtual world to Physical AI in the physical world. Although Physical AI doesn't yet have a clearly defined killer application, the trend is becoming increasingly clear.
Reconstructing Scaling Law for the Physical AI Era
Pre-Training Level: Fundamental Transformation of Model Architecture
Model architectures are evolving from LLM (Large Language Model) to VLM (Vision-Language Model) to VLA (Vision-Language-Action Model), and moving toward WM (World Model). World Models can understand and simulate the operating principles of the physical world, providing the decision-making foundation for physical agents such as drones, autonomous vehicles, and robots.
Post-Training Level: Model Compression and Specialization
Physical AI introduces entirely new requirements for post-training. When deploying in vertical industries, large models need to be compressed, distilled, and reinforced into smaller, more precise Small LLM/VLM/WM to accommodate the compute constraints of embedded devices. Streamlining and specializing models around Physical AI Scaling Law is becoming a core focus area for leading companies.
Test-Time Scaling Level: From Cloud to Edge
In the physical world, the form of test-time scaling undergoes a fundamental change. Wearable devices, AI glasses, and other hardware need to understand their surroundings in real time, meaning inference no longer happens in the cloud but is completed instantly on small chips carried by the user. AI Glasses are the most promising product category in this direction.
Emotion and Memory: Core Differentiators of Physical AI
Once AI hardware possesses Agent capabilities, long-term Memory and Emotion understanding will become key differentiators. AI hardware must not only execute tasks but also understand and accompany users—signifying a qualitative leap from "tool" to "companion."
Numerous startups are already positioning themselves in this direction, attempting to build physical agents with emotional interaction capabilities.
Where Is the Next Growth Curve?
Based on current technological evolution patterns, each layer of Scaling breaks through the "boundary" of the previous layer:
| Layer | Boundary Broken |
|---|---|
| Pre-Training | Boundary of model capability |
| Post-Training | Boundary of model controllability |
| Test-Time Scaling | Boundary of reasoning quality |
| Agent Scaling | Boundary of single interaction |
| Multi-Agent Scaling | Boundary of individual capability |
Physical AI is breaking through the boundary of the digital world, allowing AI to truly enter physical space. And when AI in the physical world becomes sufficiently dense, Multi-Agent collaboration in the physical world—the self-organizing coordination of massive numbers of intelligent hardware devices—may well be the next inflection point.
Conclusion
Scaling Law is far from reaching its end. It's not a straight line but a technology tree that continuously branches and grows. From virtual to physical, from individual to collective, from tool to companion, AI's path of expansion remains full of imaginative possibilities.
For practitioners, understanding the internal logic of these five (and potentially more) layers of Scaling is essential to accurately predicting where the next technological wave will surge. Whether you choose to go deep on Multi-Agent collaboration or commit to World Model R&D for Physical AI, the key is to see clearly the direction of Scaling Law's evolution—every breakthrough is a leap beyond existing boundaries.
Related articles

AI Aggregator Platforms Tested: A Complete Guide to Using GPT 5.5 and Other Top Models for Free
A hands-on guide to using GPT 5.5, Gemini 3.1 Pro, and Grok 4.2 for free via AI aggregator platforms, covering cross-model context memory, account pool mechanisms, and key security risks.

Vibe Coding in Practice: A Junior Student Uses Cursor to Build a Multi-Agent System with 51 AI Officials Based on the Three Departments and Six Ministries Framework
A junior student uses Cursor and Vibe Coding to build a multi-agent system with 51 AI officials modeled on China's Three Departments and Six Ministries, featuring task distribution, approval workflows, and Token cost visualization.

How to Connect Codex to DeepSeek Models: Free Switching via CC Switch
Learn how to connect OpenAI Codex to DeepSeek models via CC Switch, enabling free switching between DeepSeek and GPT with complete setup and routing guide.