151 related articles
Local Deployment of Qwen 3.6 27B on 4×…
Real-world test of Qwen 3.6 27B FP8 deployed on 4×3080Ti 16GB modded GPUs with OpenCode for system tool development. Covers hardware setup, inference speed, context management, and productivity gains.
Decoding LLM Naming Conventions: Param…
Decode LLM naming conventions and understand what 32B parameters and AWQ/GGUF quantization formats mean, with 4-bit VRAM estimation formulas, MoE model pitfalls, and model selection by GPU tier.
AI Coding Appliance vs Cloud LLMs: Can…
A deep cost comparison between AI coding appliances and cloud LLM APIs. A 20-person team spending ¥480K/year on tokens can deploy 4 local OnePanel units at ¥99K each, breaking even in 2.5 months.
Frontend to AI Full-Stack: Complete Sk…
A complete skill tree for frontend developers transitioning to AI full-stack engineers, covering TypeScript, NestJS, LangChain JS, RAG, vector databases, and Tauri 2 with a clear learning roadmap.
Qwen + VS Code in Practice: The Full P…
A blogger uses VS Code and Alibaba Cloud Qwen 3.6 to completely revamp a six-year-old webmaster networking feature. This article details the tech stack, prompt engineering, and practical insights from AI-powered feature renovation.
Tutorials: Deep dive into the OpenAI Codex plugin system architecture (Skills, Apps, MCP Server), four installation methods, and a macOS app development case study showing how plugins boost AI coding efficiency.
Complete Guide to Local LLM Deployment…
Complete guide to deploying open-source LLMs locally with Ollama. Covers installation, model selection, VRAM requirements, and performance comparison of Llama 3 and Qwen models. Free, offline-capable AI.
Hermes Agent Practical Guide: From Ins…
Guide to Hermes Agent's architecture, installation, Skill extensions, Feishu integration, news bots, Python SDK integration, and automated workflows.
Deep Dives: Deep dive into Agentic RAG vs traditional RAG, covering tool calling, multi-step iteration, and query rewriting, with LangChain and LangGraph code examples for building intelligent retrieval systems.
Tutorials: Compare traditional RAG vs Agentic RAG architectures and explore planning, tool use, and multi-step iteration capabilities, with full LangChain/LangGraph ReAct Agent code and ChatBoss project examples.
Product Reviews: An in-depth look at LangBot, an open-source production-grade AI bot platform supporting WeChat, DingTalk, Discord & more, with ChatGPT, DeepSeek, Agent, RAG & plugin capabilities.