90 related articles
Tech FrontiersLiquid AI releases LFM2.5-8B-A1B, a MoE model with 8B total params but only 1.5B active, matching 6B-class models in tool calling. Supports 128K context, local deployment, multilingual, with SGLang Day-0 support.
Industry InsightsAMD Instinct MI355X achieves 5% lower TCO than NVIDIA B200 on DeepSeek-R1 disaggregated inference via SGLang+MoRI full-stack optimization with 1.25x per-GPU throughput.
Tech FrontiersCloudflare contributes decode KV cache offload and Mooncake recovery fixes to SGLang, resolving garbled output under high concurrency for Kimi K2.6 and enabling automatic fault recovery in distributed inference.
AI Fully Automated Orchestration in Pr…
Deep analysis of AI fully automated software orchestration: from Claude Code workflows to parallel orchestration strategies, exploring how models like MiniMax M1 drive software production costs toward zero.
Deep Dive into Three Major LLM Career …
Deep analysis of three core LLM roles—Application Engineer, Development Engineer, and Algorithm Engineer—covering technical requirements, salary thresholds, and career prospects including RAG, fine-tuning, and inference deployment.
DeepSeek V3 + bolt.html: A Practical G…
Learn how DeepSeek V3-0324 and open-source tool bolt.html combine to generate beautiful HTML pages with zero code using prompt engineering techniques.
Why Qwen3 Is the Best Open-Source Mode…
Analysis of Qwen3's advantages for MCP agent development, comparing DeepSeek R1's lack of Function Calling, covering MoE architecture and thinking mode switching.
June AI Showdown: Mythos, Sonnet 4.8, …
June 2025 becomes AI's densest release month: Anthropic Mythos nears launch, Claude Sonnet/Opus 4.8 skip-level upgrades, GPT-5.6 rapid iteration, DeepSeek V4 Pro permanent 75% price cut.
Complete Guide to Connecting Codex wit…
Step-by-step guide to deploying OpenAI Codex CLI in China using WSL + MoBridge relay + DeepSeek API. No VPN needed, stable and cost-effective setup in minutes.
Tech FrontiersDeepSeek announces permanent discount pricing for its V4-Pro model. Learn how this impacts developers, V4-Pro's competitive edge, and the latest LLM price war trends.
Product ReviewsHands-on review of Xiaomi MIMO 2.5's free 200M Token offer. Covers the application process, coding performance vs Copilot and DeepSeek V4, usage limitations, and who should try this free AI coding tool.
Product ReviewsCompare top AI coding plans including Cursor Max, ChatGPT Pro, GLM Coding Plan, and DeepSeek API — with pricing, performance, and use-case recommendations to help you choose.
Product ReviewsCursor restricted in China? This article reviews three AI coding alternatives: Augment Code for smart prompt optimization, Trae for best value, and Amazon Kilo for process-driven development.
Deep DivesA deep dive into AI Agent development methodology, from the ReAct theoretical framework to a four-layer enterprise tech stack covering model services, Agent types, LangChain, and production deployment.
Tech FrontiersWeekly AI roundup: Kimi K2.6 tops open-source rankings, Anthropic launches Opus 4.7 and Claude Design, Alibaba rolls out Qwen 3.6 series, Google releases emotion-controllable TTS model.
Tech FrontiersGLM5 code leak reveals 745B-parameter MoE architecture replicating DeepSeek V3. DeepSeek V4 may launch a 200B quantized model first, with flagship exceeding 1T parameters.
Tech FrontiersAlibaba's Qwen APP launches 400+ features integrating Alipay and Taobao, Baidu releases ERNIE 5.0, Meituan unveils deep reasoning model, StepFun tops global speech AI rankings, and Anthropic's share nears Google's.
Tech FrontiersOpenAI releases GPT-5.2 with a 390x efficiency gain on ARC-AGI, beating Claude Opus 4.5. Deep analysis of the efficiency leap, user experience paradox, Disney's $1B deal, and the AI content quality crisis.
Qwen 3.6 vs Gemma 4: In-Depth Comparis…
Real-world comparison of Qwen 3.6 and Gemma 4 local AI models building a Markdown editor with Tauri, testing planning ability, code generation, and development efficiency.
Product ReviewsDeep analysis of Moonshot AI's open-source Kimi K2.6 Agent orchestration: 300 sub-Agents executing 4000-step tasks, outperforming GPT-5.4 in coding benchmarks, LoRA fine-tuning on 2x RTX 4090s.