75 related articles
Product ReviewsDeep dive into OpenHands, an open-source AI coding agent platform covering architecture design, sandboxed code execution, and multi-tool orchestration, compared with Copilot, Devin, and more.
TutorialsLearn how to redirect Claude Agent SDK API requests to local LLMs via LiteLLM Proxy, achieving zero-cost inference while retaining full agent framework capabilities.
Deep DivesA deep dive into AI Agent development methodology, from the ReAct theoretical framework to a four-layer enterprise tech stack covering model services, Agent types, LangChain, and production deployment.
TutorialsDeep dive into npcpy's four-layer architecture, multi-agent collaboration, knowledge graph lifecycle management, and deployment strategies for building stable, controllable AI Agent systems.
Running Qwen3.6-27B Locally on Mac: 4 …
Benchmarking 4 solutions for running Qwen3.6-27B locally on Mac: GGUF, MLX Diflash, and MTP-LX. MTP-LX 4bit leads at 43.6 tok/s with solid coding, writing, and reasoning quality.
Local Deployment of Qwen 3.6 27B on 4×…
Real-world test of Qwen 3.6 27B FP8 deployed on 4×3080Ti 16GB modded GPUs with OpenCode for system tool development. Covers hardware setup, inference speed, context management, and productivity gains.
Decoding LLM Naming Conventions: Param…
Decode LLM naming conventions, understand 32B parameters & AWQ/GGUF quantization formats, with 4-bit VRAM estimation formulas, MOE model pitfalls, and model selection by GPU tier.
Frontend to AI Full-Stack: Complete Sk…
A complete skill tree for frontend developers transitioning to AI full-stack engineers, covering TypeScript, NestJS, LangChain JS, RAG, vector databases, and Tauri 2 with a clear learning roadmap.
The Truth Behind Windsurf's "Unlimited…
Deep analysis of how Windsurf's "unlimited credits" actually works—third-party plugins rotate shared account pools, not an official bug. Covers mechanisms, security risks, and safer alternatives.
Complete Guide to Local LLM Deployment…
Complete guide to deploying open-source LLMs locally with Ollama. Covers installation, model selection, VRAM requirements, and performance comparison of Llama 3 and Qwen models. Free, offline-capable AI.
TutorialsA deep dive into AI-driven research methodology: LLM selection, Python automation, Zotero reference management, Overleaf writing, local LLM deployment, and N8N workflow automation.
Deep DivesDeep dive into AI's four-stage evolution from Chat to Agentic AI, covering multi-Agent architectures, ReAct framework, and MCP protocol for developers.
TutorialsComplete guide to building AI Agents on Dify with zero code, covering tool integration, ESA search configuration, time awareness solutions, and Agent design best practices.
Product ReviewsDeep dive into ChuanhuChatGPT, a 15K-star open-source project with multi-model access, Agent support, RAG file Q&A, GPT fine-tuning, and web search.
Product ReviewsAn in-depth look at LangBot, an open-source production-grade AI bot platform supporting WeChat, DingTalk, Discord & more, with ChatGPT, DeepSeek, Agent, RAG & plugin capabilities.