644 related articles
Tech Frontiers: Deep dive into StepFun AI's Step 3.7 Flash, a 198B sparse MoE vision-language model with 256K context and 3-level reasoning, excelling in multimodal understanding, AI coding, and Agent tool orchestration.
Tech Frontiers: Liquid AI releases LFM2.5-8B-A1B, a MoE model with 8B total params but only 1.5B active, matching 6B-class models in tool calling. Supports 128K context, local deployment, multilingual, with SGLang Day-0 support.
Industry Insights: SGLang co-hosts a finance AI inference event with Crusoe AI and Cloudflare, exploring LLM inference deployment in trading, risk management, and compliance, signaling Wall Street's shift to production-grade AI infrastructure.
Industry Insights: AMD Instinct MI355X achieves 5% lower TCO than NVIDIA B200 on DeepSeek-R1 disaggregated inference via SGLang+MoRI full-stack optimization with 1.25x per-GPU throughput.
Tech Frontiers: Cloudflare contributes decode KV cache offload and Mooncake recovery fixes to SGLang, resolving garbled output under high concurrency for Kimi K2.6 and enabling automatic fault recovery in distributed inference.
Tech Frontiers: SGLang team hosts an Agent Loops Office Hour exploring inference optimization for agentic loops, covering KV Cache reuse, low-latency multi-turn dialogue, and tool calling techniques.
Deep Comparison of o1, o1 pro, and o3-…
Deep Research comparison of OpenAI o1, o1 pro, and o3-mini-high coding capabilities, covering code quality, optimization, error rates, and debugging with benchmarks and real-world cases.
Llama 3.3 70B In-Depth Review: Testing…
Meta releases the open-source Llama 3.3 70B model, which rivals 405B-class performance with just 70B parameters. Tested on 13 logic, math, and coding questions, it passed 12, reshaping the open-source model landscape.
Claude Code Source Code Study Guide: E…
Learn AI Agent development from Claude Code's 510K lines of source code, covering Agent Loop, context compression, multi-Agent orchestration, and two efficient study methods.
Spring AI Agent Utils: A Java Agent To…
Deep dive into Spring AI Agent Utils toolkit covering Skill modules, Ask a User Question, To Do Write, Auto Memory, and multi-Agent orchestration — empowering Java developers to build powerful AI Agents.
Indie Game AI in Practice: Building a …
A detailed breakdown of building a complete slime combat AI for indie games, covering FSM architecture, multi-attack modules, group AI pursuit mechanics, and animation synchronization.
Claude Code /loop Command Explained: U…
Deep dive into Claude Code's /loop command: how it works, usage methods, key limitations, and a side-by-side comparison with Scheduled Tasks and GitHub Actions.
How OpenAI Helps a Top Racing Team Win…
OpenAI partners with IndyCar powerhouse Chip Ganassi Racing, using AI data analysis, pit stop optimization, and real-time strategy to find crucial fractions of a second on the track.
Deep Dive into Cursor's Pay-Per-Use Re…
Deep analysis of Cursor's pay-per-use refill plugin: account rotation mechanism, tiered discounts, full model support, and objective assessment of compliance risks and data security concerns.
AI Programming Spec Sheets: 30 Lines o…
Replace vague prompts with spec sheets: 30 lines of config get AI coding right the first time. Covers the six-element framework, three-tier boundaries, and three iron rules to eliminate rework.
OpenAI Codex Complete Guide: Four Tool…
Deep dive into OpenAI Codex's four core tools: IDE extension, CLI, Cloud service, and code review bot. Learn how they work together to build an efficient AI programming workflow from local coding to cloud automation.
Claude Code Desktop Installation & Con…
Step-by-step guide to install Claude Code Desktop, use it without an account via Developer Mode, integrate DeepSeek models through CSwitch, add Chinese localization, and configure custom Skills.
AI + Jupyter Notebook: A Practical Met…
The hardest part of STEM is the gap between theory and practice. Learn how to use Jupyter Notebook with AI Coding Agents to auto-generate interactive tutorials for math, physics, statistics, and more.
Dify 1.8.0 Hands-On Tutorial: Complete…
A detailed guide to Dify 1.8.0 Docker deployment, environment setup, and AI app building. Covers five app types, comparisons with Coze, workflow creation, and more for this open-source AI platform.
Zen MCP: An Open-Source Tool That Lets…
Deep dive into Zen MCP, an open-source project that lets Claude Code orchestrate Gemini, O3, and other AI models via MCP protocol, with cost-reduction proxy setup guide.