285 articles
When AI Gets a Virtual Body: A Deep Di…
Deep dive into how Bilibili's Lumen project gives AI a virtual body, enabling environmental perception, collaborative puzzle-solving, and emotional interaction — exploring the leap from conversational to embodied AI.
Agent Memory: Giving AI Coding Agents …
Agent Memory is an open-source local memory layer providing persistent, cross-session, cross-tool long-term memory for AI coding agents like Claude Code, Cursor, and Codex.
Cursor Composer 2.5 Hands-On: An AI Co…
Hands-on review of Cursor Composer 2.5's Agent view, Plan mode, and right panel features. Its coding ability matches top Claude and GPT models at up to 10x lower cost, and it runs significantly faster.
Anthropic cuts Claude Opus 4.8 Fast mode pricing. Learn how the dual-mode workflow (Fast for interactive coding, Normal for async tasks) reshapes AI programming.
Deep Dive into Qwen3.7 Max: One-Tenth …
Alibaba's Qwen3.7 Max targets AI agents with coding tasks at just $1.30 (one-tenth of GPT-5), supporting 35 hours of continuous execution. Deep analysis of its cost advantages, front-end capabilities, and three key limitations.
Codex Computer Use Hands-On Review: Se…
Hands-on review of OpenAI Codex Computer Use: a step-by-step setup guide, a WeChat demo, the screenshot-based core principles, an analysis of three major security risks (including privacy leakage), and current limitations.
Claude Code's Frontend UI Too Ugly? Th…
Do AI coding tools produce frontend pages that lack design sense? UI-UX-Pro-Max-Skill is an open-source design-enhancement Skill with 67 UI styles and 161 color systems that supports Claude Code and Cursor.
Vibe Coding in Action: Building a Mac …
A Bilibili creator used Vibe Coding to build a full-featured Mac screen recorder entirely through AI conversation, without writing any code. A deep dive into its features and Vibe Coding's potential.
Claude Opus 4.8 Hands-On Review: A Com…
In-depth hands-on review of Claude Opus 4.8 across 2D tower defense, 3D game dev, UI reproduction, and tool generation, with scoring and comparison to Opus 4.7.
Claude Opus 4.8 Real-World Testing: Wh…
In-depth testing of Claude Opus 4.8 across game dev, UI reproduction, 3D scenes, and tool building. $50 in tokens reveals its true capabilities and limits.
Claude Code vs Codex Deep Dive: A Prac…
A comprehensive comparison of Claude Code and OpenAI Codex covering architecture, use cases, and benchmarks to help you choose the right AI coding tool.
Zion No-Code Platform Hands-On: Buildi…
Hands-on review of Zion no-code platform: building a production-ready e-commerce mini program from scratch with visual design, built-in database, workflow orchestration, AI Agent integration, and one-click multi-platform publishing.
Cursor 3.0 Deep Dive: The AI Agent Com…
Cursor 3.0 abandons VS Code entirely, rewritten from scratch in Rust as an AI agent management platform. Deep dive into its three evolutions, Composer 2 controversy, parallel agent orchestration, and the paradigm shift from assisted to autonomous coding.
O3 vs Gemini 2.5 Pro vs Claude 3.7: Re…
Real-world comparison of O3, Gemini 2.5 Pro, and Claude 3.7 coding abilities through snake battles, RL training, solar system simulation, and soccer game tasks.
Deep Comparison of o1, o1 pro, and o3-…
Deep Research comparison of OpenAI o1, o1 pro, and o3-mini-high coding capabilities, covering code quality, optimization, error rates, and debugging with benchmarks and real-world cases.
Llama 3.3 70B In-Depth Review: Testing…
Meta releases Llama 3.3 70B, an open-source model that rivals 405B-level performance with just 70B parameters. Tested on 13 logic, math, and coding questions, it passed 12, reshaping the open-source model landscape.
Real-World Coding Test of 13 Top AI Mo…
Benchmark of 13 top AI models including GPT-4.1, Claude 3.7 Sonnet, and Gemini 2.5 Pro on coding ability, scored across 8 dimensions using the same high-difficulty algorithm problem.
API Aggregation Proxy Platforms Tested…
Hands-on testing of an API aggregation proxy platform's model calling capabilities, including GPT-Image2 image generation, cost analysis, and coverage of 100+ models like Claude and Gemini.
Augment Remote Agent Hands-On: Running…
Hands-on review of Augment Remote Agent: 10 cloud AI Agents coding in parallel, covering bug fixes, PR generation, documentation, and more with detailed workflows and real-world examples.
AI Tool Rankings for Solo Businesses: …
A complete AI tool matrix for solo businesses across 7 categories—Text, Image, Video, Audio, Digital Avatars, Coding & Agents—with top picks, alternatives, and open-source options.