85 related articles
Product Reviews: Hands-on testing of Google Gemma 4 open-source models running offline on three phones, with Dense vs MoE architecture explained and a complete Ollama + Claude Code deployment tutorial.
Tutorials: A systematic guide to core LLM engineer skills, covering RAG, agent app development, and SFT and RLHF fine-tuning, with clear learning paths for different backgrounds.
Tech Frontiers: Windsurf rebrands as Devin Desktop with Agent Command Center for multi-agent management, open-source ACP protocol, and a Rust-rewritten local Agent. Full breakdown of the upgrade and platform strategy.
Tutorials: Step-by-step tutorial on connecting Claude Code to DeepSeek using ccswitch. No overseas account or credit card needed; 10 RMB is enough to start using an AI coding assistant.
Tutorials: Guide to enabling MTP (multi-token prediction) acceleration in llama.cpp, covering CUDA setup, desktop configuration, model selection, and benchmarks showing ~60 tokens/s with Qwen3 27B.
Tutorials: Complete guide to connecting Codex with DeepSeek V4 via CC Switch relay, including API Key setup, channel configuration, and plugin unlock steps for cost-effective AI programming.
Tutorials: Complete guide to integrating Doubao and DeepSeek AI models with Zotero 9 for paper summarization, full-text translation, and intelligent Q&A.
Tutorials: Step-by-step tutorial on building a low-cost AI programming assistant using the DeepSeek-V3 API with VSCode's Continue plugin. Covers setup, API Key configuration, a code completion demo, and Ollama local deployment.
Tutorials: Complete Hermes Agent deployment tutorial for Windows, covering environment setup, model configuration, WeChat channel connection, and troubleshooting. Uses fewer tokens than CrawlAI and supports direct WeChat chat.
Tutorials: Testing Hermes agent as it coordinates DeepSeek V4 and MiniMax 2.7 for collaborative coding, with a PDF export completed in 9 minutes and an RSS service built from scratch in Nim.
Product Reviews: In-depth analysis of AI aggregation platforms claiming free unlimited DeepSeek R1 full version access, revealing data security risks and sustainability concerns, with reliable alternatives.
Windsurf Wave 3 Deep Dive: MCP Support…
Deep dive into Windsurf Wave 2 & Wave 3 updates: MCP protocol support, Turbo auto mode, DeepSeek integration, Tab to Jump, pricing comparison with Cursor.
Product Reviews: Deep dive into the OpenHuman open-source AI agent, covering its context-first architecture, Rust+React hybrid, Memory Tree system, Token Juice compression, and multi-model routing.
Tutorials: Using oMLX with MTP and Qwen3.6 35B on Apple Silicon Mac to achieve 86.7 tokens/s local coding speed, building a full-stack app in under 5 minutes.
Tutorials: Guide to OpenRouter's 28 free AI models with API setup, covering GPT-OSS 120B, DeepSeek V4 Flash, and leaderboard insights into the AI model market landscape.
AI Weekly: Claude Code Review, Gemma 4…
Weekly AI roundup: Anthropic launches Claude Code review, Google Gemma 4 leaks with MoE architecture, DeepSeek V4 delayed again, Microsoft Copilot Cowork reshapes collaboration, and OpenAI acquires PromptFool.
Tech Frontiers: Details of the SGLang v0.5.12.post1 stability patch, with 12 critical fixes covering DeepSeek V4 garbled text and crashes, NIXL PD disaggregated inference logic, Blackwell B300 adaptation, and cold start optimization.
Tech Frontiers: Deep dive into StepFun AI's Step 3.7 Flash, a 198B sparse MoE vision-language model with 256K context and 3-level reasoning, excelling in multimodal understanding, AI coding, and Agent tool orchestration.
Tech Frontiers: Liquid AI releases LFM2.5-8B-A1B, a MoE model with 8B total params but only 1.5B active, matching 6B-class models in tool calling. Supports 128K context, local deployment, and multilingual use, with SGLang Day-0 support.
Industry Insights: AMD Instinct MI355X achieves 5% lower TCO than NVIDIA B200 on DeepSeek-R1 disaggregated inference via SGLang+MoRI full-stack optimization with 1.25x per-GPU throughput.