683 related articles
Tech Frontiers
SGLang team hosts an Agent Loops Office Hour exploring inference optimization for agentic loops, covering KV Cache reuse, low-latency multi-turn dialogue, and tool calling techniques.
O3 vs Gemini 2.5 Pro vs Claude 3.7: Re…
Real-world comparison of O3, Gemini 2.5 Pro, and Claude 3.7 coding abilities through snake battles, RL training, solar system simulation, and soccer game tasks.
Real-World Coding Test of 13 Top AI Mo…
Benchmark of 13 top AI models including GPT-4.1, Claude 3.7 Sonnet, and Gemini 2.5 Pro on coding ability, scored across 8 dimensions using the same high-difficulty algorithm problem.
API Aggregation Proxy Platforms Tested…
Hands-on testing of an API aggregation proxy platform's model calling capabilities, including GPT-Image2 image generation, cost analysis, and coverage of 100+ models like Claude and Gemini.
Orchestrating AI Agents as State Machi…
Explore the next evolution of AI coding: applying CI/CD engineering practices to orchestrate Agents as state machines with YAML templates, Gates, and Dashboards for autonomous multi-Agent progression.
Six Foundational Upgrades to Claude Co…
Anthropic's largest-ever foundational upgrade to Claude Code fixes six critical issues at once—terminal flickering, thinking freezes, cryptic errors, context deadlocks, unstable connections, and session crashes—shifting AI coding competition to the infrastructure layer.
BMad-Method: Building an AI Agile Deve…
Deep dive into BMad-Method, an open-source multi-agent framework simulating a full agile team—from business analysis to QA—supporting Claude Code, Cursor, and more.
Claude Code Source Code Study Guide: E…
Learn AI Agent development from Claude Code's 510K lines of source code, covering Agent Loop, context compression, multi-Agent orchestration, and two efficient study methods.
Major Claude Code Update: A Complete G…
Deep dive into Claude Code's new Agent View and Goal system, covering multi-agent parallel management, background sessions, and result-oriented autonomous execution.
Spring AI Agent Utils: A Java Agent To…
Deep dive into the Spring AI Agent Utils toolkit covering Skill modules, AskUserQuestion, TodoWrite, Auto Memory, and multi-Agent orchestration, empowering Java developers to build powerful AI Agents.
Unified Management Tool for Claude Cod…
A deep dive into the unified management client for Claude Code and Codex, solving pain points like tedious configuration, high switching costs, and fragmented management with one-click setup and usage monitoring.
Harness Engineering Deep Dive: Multi-L…
Deep dive into Harness Engineering: deconstructing Claude Code's multi-level memory, defense-in-depth, Hermes Agent autonomous evolution, and multi-Agent collaboration for industrial-grade AI development.
Claude Opus 4.8 Hands-On: What Can You…
Hands-on testing of Claude Opus 4.8's coding and creative abilities, including Mario game and Slay the Spire-style card game development, quota consumption, and real-world bug frequency.
Claude Code Desktop Installation & Con…
Step-by-step guide to install Claude Code Desktop, use it without an account via Developer Mode, integrate DeepSeek models through CSwitch, add Chinese localization, and configure custom Skills.
Getting Started with Claude Code: 5 Co…
Deep dive into the core differences between Claude Code and regular AI chat tools across 5 dimensions: interaction, context understanding, execution, memory, and tool invocation.
AI + Jupyter Notebook: A Practical Met…
The hardest part of STEM is the gap between theory and practice. Learn how to use Jupyter Notebook with AI Coding Agents to auto-generate interactive tutorials for math, physics, statistics, and more.
Cursor + MCP in Practice: A Complete G…
A detailed guide on integrating Playwright MCP Server with Cursor, covering Node.js setup with NVM, NPM mirror configuration, and building a browser automation agent step by step.
Zen MCP: An Open-Source Tool That Lets…
Deep dive into Zen MCP, an open-source project that lets Claude Code orchestrate Gemini, O3, and other AI models via MCP protocol, with cost-reduction proxy setup guide.
Product Reviews
Cursor announces Claude Opus 4.8 is live. CursorBench shows significant gains in coding efficiency and task persistence. Analysis of key improvements and market impact.
Bolt.DIY + Claude 3.7 Sonnet: Building…
Learn how to use open-source Bolt.DIY with Claude 3.7 Sonnet to build full-stack web apps with zero code. Includes local deployment tutorial, hands-on demo, and cost analysis—an AI course platform built in 13 minutes for $3.