188 related articles
Tech FrontiersSGLang v0.5.12.post1 stability patch details: 12 critical fixes covering DeepSeek V4 garbled text and crashes, NIXL PD disaggregated inference logic, Blackwell B300 adaptation, and cold start optimization.
Tech FrontiersDeep dive into StepFun AI's Step 3.7 Flash, a 198B sparse MoE vision-language model with 256K context and 3-level reasoning, excelling in multimodal understanding, AI coding, and Agent tool orchestration.
Tech FrontiersLiquid AI releases LFM2.5-8B-A1B, a MoE model with 8B total params but only 1.5B active, matching 6B-class models in tool calling. Supports 128K context, local deployment, multilingual, with SGLang Day-0 support.
Industry InsightsAMD Instinct MI355X achieves 5% lower TCO than NVIDIA B200 on DeepSeek-R1 disaggregated inference via SGLang+MoRI full-stack optimization with 1.25x per-GPU throughput.
Tech FrontiersCloudflare contributes decode KV cache offload and Mooncake recovery fixes to SGLang, resolving garbled output under high concurrency for Kimi K2.6 and enabling automatic fault recovery in distributed inference.
Llama 3.3 70B In-Depth Review: Testing…
Meta releases Llama 3.3 70B open-source model with just 70B parameters rivaling 405B performance. Tested on 13 logic, math, and coding questions, it passed 12 — reshaping the open-source model landscape.
Deep Dive into Three Major LLM Career …
Deep analysis of three core LLM roles—Application Engineer, Development Engineer, and Algorithm Engineer—covering technical requirements, salary thresholds, and career prospects including RAG, fine-tuning, and inference deployment.
How to Choose an AI Coding IDE: A Comp…
A detailed comparison of mainstream AI coding IDEs including Cursor, Trae, and Windsurf, covering Auto mode, Codex integration, and more to help developers at all levels find the best AI coding tool.
TutorialsA beginner's guide to learning AI large language models — covering learning paths, hardware requirements, Python essentials, and cloud services for learners at every level.
AI Agent Learning Roadmap: From Beginn…
A detailed three-month AI Agent learning roadmap covering LLM basics, ReAct paradigm, LangChain, memory mechanisms, tool calling, and multi-agent collaboration with practical project suggestions.
Tech FrontiersGoogle launches Gemini Omni video editing in India, letting users upload and edit videos with AI. Explore the feature details, India market strategy, and the multimodal AI shift from understanding to creation.
Harness Engineering: A Practical Guide…
Explore the three stages of AI programming evolution: from Prompt Engineering to Context Engineering to Harness Engineering. Master enterprise-grade AI coding with Cloud Code + VS Code.
Claude Code: 19 Essential Commands Exp…
A deep dive into 19 essential Claude Code commands across conversation management, settings, utilities, permissions, and plugins, plus a golden workflow to double your dev productivity.
TutorialsComplete Cursor beginner's guide covering installation, model selection, three conversation modes, billing rules, Rules setup, and practical AI programming tips to help you master Cursor quickly.
Product ReviewsIn-depth analysis of Devin 2.0: dropped from $500 to $20/month, 12x efficiency in code migration, but only 15% completion on complex tasks. Real test data on use cases and limitations.
Product ReviewsDeep comparison of Qoder, Cursor, Windsurf, and Devin across autonomy, reliability, and context capabilities to help developers choose the right AI coding assistant.
Product ReviewsGoogle's AI coding assistant Jules exits Beta with environment snapshots, Critic Agent reinforcement learning code review, interactive planning, web preview, web search, and more.
Tech FrontiersOct 3, 2025 AI Daily: IBM releases Granite 4.0 hybrid architecture open-source models, Google launches Jules CLI and Gemini 2.5 Flash Image GA, Ant Group open-sources Ming UniVision, OpenAI hits $500B valuation.
TutorialsDeep dive into Claude Code Sub-Agent mechanism with a practical blog writing + Git commit case study, showing how multi-agent collaboration solves instruction loss and context bloat issues.
TutorialsDeep dive into Andrew Ng and Harrison Chase's LangChain course, covering the five core components—Models, Prompts, Indexes, Chains, and Agents—to help developers master LLM app development.