90 related articles
Tech FrontiersGPT-5.6 internal testing launches UltraFast mode, Codex goal-driven mode revolutionizes AI programming, MiniMax cuts costs 360x, Anthropic vs OpenAI valuation war, Cerebras IPO raises $5.55B, Figure robot validates 8-hour autonomous ops, Google Vio 3.1 leads AI video.
Product ReviewsA Bilibili video claims DeepSeek V4Pro is free and unlimited, but no such version officially exists. This article analyzes third-party AI aggregation platforms, their risks, and safer alternatives.
TutorialsIn-depth analysis of Google's Gemma 4 open-source models: 31B, 26B MOE, and 14B/12B benchmarks, deployment guides for all platforms, and MS-Swift fine-tuning tutorial for building local Agent workflows.
Product ReviewsHands-on testing of Google Gemma 4 open-source models running offline on three phones, with Dense vs MOE architecture explained and a complete Ollama + Claude Code deployment tutorial.
TutorialsA hands-on guide to using Qwen3 for free via OpenRouter API and Ollama local deployment, paired with Cline coding agent for full-stack development tasks.
TutorialsA systematic guide to LLM engineer core skills covering RAG, Agent app development and SFT, RLHF fine-tuning, with clear learning paths for different backgrounds.
TutorialsStep-by-step tutorial on connecting Claude Code to DeepSeek using ccswitch. No overseas account or credit card needed — just 10 RMB to start using an AI coding assistant.
TutorialsGuide to enabling MTP multi-Token prediction acceleration in llama.cpp, covering CUDA setup, desktop configuration, model selection, and benchmarks showing ~60 Token/s with Qwen3 27B.
TutorialsComplete guide to connecting Codex with DeepSeek V4 via CC Switch relay, including API Key setup, channel configuration, and plugin unlock steps for cost-effective AI programming.
TutorialsComplete guide to integrating Doubao and DeepSeek AI models with Zotero 9 for paper summarization, full-text translation, and intelligent Q&A.
TutorialsStep-by-step tutorial: Build a low-cost AI programming assistant using DeepSeek-V3 API with VSCode's Continue plugin. Covers setup, API Key configuration, code completion demo, and Ollama local deployment.
TutorialsComplete Hermes Agent deployment tutorial for Windows: environment setup, model configuration, WeChat channel connection, and troubleshooting. Uses fewer tokens than CrawlAI with direct WeChat chat support.
TutorialsTesting Hermes agent coordinating DeepSeek V4 and MiniMax 2.7 for collaborative coding: PDF export in 9 minutes, RSS service built from scratch in Nim language.
Product ReviewsIn-depth analysis of AI aggregation platforms claiming free unlimited DeepSeek R1 full version access, revealing data security risks and sustainability concerns, with reliable alternatives.
Windsurf Wave 3 Deep Dive: MCP Support…
Deep dive into Windsurf Wave 2 & Wave 3 updates: MCP protocol support, Turbo auto mode, DeepSeek integration, Tab to Jump, pricing comparison with Cursor.
Product ReviewsDeep dive into OpenHuman open-source AI Agent: context-first architecture, Rust+React hybrid, Memory Tree system, Token Juice compression, and multi-model routing.
TutorialsUsing oMLX with MTP and Qwen3.6 35B on Apple Silicon Mac to achieve 86.7 tokens/s local coding speed, building a full-stack app in under 5 minutes.
AI Weekly: Claude Code Review, Gemma 4…
Weekly AI roundup: Anthropic launches Claude Code review, Google Gemma 4 leaks with MoE architecture, DeepSeek V4 delayed again, Microsoft Copilot Cowork reshapes collaboration, and OpenAI acquires PromptFool.
Tech FrontiersSGLang v0.5.12.post1 stability patch details: 12 critical fixes covering DeepSeek V4 garbled text and crashes, NIXL PD disaggregated inference logic, Blackwell B300 adaptation, and cold start optimization.
Tech FrontiersDeep dive into StepFun AI's Step 3.7 Flash, a 198B sparse MoE vision-language model with 256K context and 3-level reasoning, excelling in multimodal understanding, AI coding, and Agent tool orchestration.