154 related articles
35 Lines of Prompts Let Codex Auto-Opt…
An OpenAI employee used just 35 lines of prompts to have Codex analyze 30 days of work history, identify repetitive tasks, and generate reusable automated Skills. Combined with screen reading and long-term memory, Codex is becoming a proactive workflow optimization agent.
Industry InsightsA simple tweet sparks wide discussion: What do you most want AI to solve? From healthcare to education equity and scientific research, exploring the shift from technology-driven to demand-driven AI.
Tech FrontiersSGLang team hosts an Agent Loops Office Hour exploring inference optimization for agentic loops, covering KV Cache reuse, low-latency multi-turn dialogue, and tool calling techniques.
Llama 3.3 70B In-Depth Review: Testing…
Meta releases Llama 3.3 70B open-source model with just 70B parameters rivaling 405B performance. Tested on 13 logic, math, and coding questions, it passed 12 — reshaping the open-source model landscape.
BMad-Method: Building an AI Agile Deve…
Deep dive into BMad-Method, an open-source multi-agent framework simulating a full agile team—from business analysis to QA—supporting Claude Code, Cursor, and more.
Spring AI Agent Utils: A Java Agent To…
Deep dive into Spring AI Agent Utils toolkit covering Skill modules, Ask a User Question, To Do Write, Auto Memory, and multi-Agent orchestration — empowering Java developers to build powerful AI Agents.
Context Engineering Replaces Prompt En…
Learn how Context Engineering replaces Prompt Engineering to boost Claude Code efficiency. Build complex multi-Agent projects with zero coding using structured context files.
Claude Code with MiniMax M2: Testing a…
Real-world testing of MiniMax M2 as Claude Code's backend model across three projects: framework migration, iOS development, and full-stack MVP — at just 8% of Claude's price.
Gemini 2.5 Pro 0605 Hands-On Compariso…
Hands-on testing of Gemini 2.5 Pro 0605 across coding, reasoning, creative writing, and app development, compared head-to-head with OpenAI o3 and Claude Opus 4.
memU Memory Framework Explained: Unify…
Deep dive into the memU open-source memory framework: how it organizes Agent memory as a file system with three-layer semantic abstraction, dual-loop collaboration, and two retrieval modes.
WenzAgent Open-Source Framework: A Pra…
A detailed guide on deploying WenzAgent, an open-source multi-Agent management framework under Apache License, supporting LAN-based multi-device AI agent collaboration with Server-Client architecture.
Building an Agent Framework from Scrat…
Learn how to split AI Agent capabilities into four modules—Tool Registry, Message Store, Agent Runtime, and Built-in Tools—and build a reusable, extensible Agent framework using Python decorators.
Hermes Self-Evolution Framework: An Op…
Deep dive into NousResearch's open-source Hermes Agent self-evolution framework, using DSPy and GEPA for automated prompt optimization with five-layer safety mechanisms.
Generic Agent: A Self-Evolving AI Agen…
Generic Agent builds a self-evolving AI agent with just 3,000 lines of code, 9 atomic tools, and a five-layer memory architecture — using only one-sixth the tokens of competitors.
AI Agent Learning Roadmap: From Beginn…
A detailed three-month AI Agent learning roadmap covering LLM basics, ReAct paradigm, LangChain, memory mechanisms, tool calling, and multi-agent collaboration with practical project suggestions.
Claude Code Installation & Agent Hands…
Step-by-step Claude Code installation guide with Volcengine GLM5.1 Chinese LLM. Hands-on Agent demos for Bilibili data scraping and ComfyUI setup. No coding required.
Industry InsightsWarp deeply integrates GPT-5.5 to build cross-environment AI coding agents spanning local terminals, cloud deployment, and open-source collaboration. Explore its architecture, open-source strategy, and differentiation from GitHub Copilot.
Deep DivesDeep analysis of MCP Apps: how Anthropic and OpenAI's official MCP extension enables AI tools to return interactive UIs, solving the context gap with Human-in-the-Loop collaboration.
Codex Team Reveals a New AI Programmin…
OpenAI's Codex team shows AI programming now prioritizes organizational skills over coding. Learn the four paradigm shift signals, efficient workflows, and how developer roles are being reshaped.
GPT 5.5 vs Claude Code vs DeepSeek V4:…
Hands-on comparison of GPT 5.5, Opus 4.7 (Claude Code), and DeepSeek V4 Pro through a 3D flight simulator and WebGPU shader test — covering coding ability, pricing, and real-world performance.