#Unity

644 related articles

May 30, 2026 · 2 min

Step 3.7 Flash: Deep Dive into the 198B Sparse MoE Multimodal Model

Deep dive into StepFun AI's Step 3.7 Flash, a 198B sparse MoE vision-language model with 256K context and 3-level reasoning, excelling in multimodal understanding, AI coding, and Agent tool orchestration.

Tech Frontiers

May 30, 2026 · 2 min

LFM2.5-8B-A1B: A MoE Model with 1.5B Active Parameters Delivering 4x Its Weight Class Performance

Liquid AI releases LFM2.5-8B-A1B, a MoE model with 8B total params but only 1.5B active, matching 6B-class models in tool calling. Supports 128K context, local deployment, multilingual, with SGLang Day-0 support.

Industry Insights

May 30, 2026 · 2 min

SGLang Enters Finance: How AI Inference Infrastructure Is Reshaping Wall Street

SGLang co-hosts a finance AI inference event with Crusoe AI and Cloudflare, exploring LLM inference deployment in trading, risk management, and compliance — signaling Wall Street's shift to production-grade AI infrastructure.

Industry Insights

May 30, 2026 · 2 min

AMD MI355X Beats B200: Full-Stack Optimization Breakdown for 5% Lower TCO on DeepSeek-R1 Inference

AMD Instinct MI355X achieves 5% lower TCO than NVIDIA B200 on DeepSeek-R1 disaggregated inference via SGLang+MoRI full-stack optimization with 1.25x per-GPU throughput.

Tech Frontiers

May 30, 2026 · 1 min

Cloudflare Contributes Critical KV Cache and Mooncake Fixes to SGLang

Cloudflare contributes decode KV cache offload and Mooncake recovery fixes to SGLang, resolving garbled output under high concurrency for Kimi K2.6 and enabling automatic fault recovery in distributed inference.

Tech Frontiers

May 30, 2026 · 1 min

SGLang Hosts Agent Loops Office Hour, Focusing on Agentic Loop Architecture Optimization

SGLang team hosts an Agent Loops Office Hour exploring inference optimization for agentic loops, covering KV Cache reuse, low-latency multi-turn dialogue, and tool calling techniques.

Product Reviews

May 30, 2026 · 3 min

Deep Comparison of o1, o1 pro, and o3-mini-high Coding Capabilities: A Deep Research Analysis

Deep Research comparison of OpenAI o1, o1 pro, and o3-mini-high coding capabilities, covering code quality, optimization, error rates, and debugging with benchmarks and real-world cases.

Product Reviews

May 30, 2026 · 3 min

Llama 3.3 70B In-Depth Review: Testing the Strongest Open-Source LLM with 13 Questions

Meta releases Llama 3.3 70B open-source model with just 70B parameters rivaling 405B performance. Tested on 13 logic, math, and coding questions, it passed 12 — reshaping the open-source model landscape.

Tutorials

May 30, 2026 · 3 min

Claude Code Source Code Study Guide: Efficiently Mastering Core AI Agent Development Architecture

Learn AI Agent development from Claude Code's 510K lines of source code, covering Agent Loop, context compression, multi-Agent orchestration, and two efficient study methods.

Tutorials

May 30, 2026 · 3 min

Spring AI Agent Utils: A Java Agent Toolkit Reverse-Engineered from Claude Code's Core Features

Deep dive into the Spring AI Agent Utils toolkit, covering Skill modules, AskUserQuestion, TodoWrite, Auto Memory, and multi-Agent orchestration, to help Java developers build AI Agents.

Tutorials

May 30, 2026 · 3 min

Indie Game AI in Practice: Building a Slime Combat System with Soul

A detailed breakdown of building a complete slime combat AI for indie games, covering FSM architecture, multi-attack modules, group AI pursuit mechanics, and animation synchronization.

Tutorials

May 30, 2026 · 2 min

Claude Code /loop Command Explained: Usage, Limitations & Comparison of Three Scheduling Solutions

Deep dive into Claude Code's /loop command: how it works, usage methods, key limitations, and a side-by-side comparison with Scheduled Tasks and GitHub Actions.

Industry Insights

May 29, 2026 · 1 min

How OpenAI Helps a Top Racing Team Win Races

OpenAI partners with IndyCar powerhouse Chip Ganassi Racing, using AI data analysis, pit stop optimization, and real-time strategy to find crucial fractions of a second on the track.

Product Reviews

May 29, 2026 · 3 min

Deep Dive into Cursor's Pay-Per-Use Refill Plan: Is Using Official Pro Accounts at 65% Off Reliable?

Deep analysis of Cursor's pay-per-use refill plugin: account rotation mechanism, tiered discounts, full model support, and objective assessment of compliance risks and data security concerns.

Tutorials

May 29, 2026 · 3 min

AI Programming Spec Sheets: 30 Lines of Configuration Saves Five Rounds of Rework

Replace vague prompts with spec sheets—30 lines of config gets AI coding right the first time. Covers the six-element framework, three-tier boundaries, and three iron rules to eliminate rework.

Tutorials

May 29, 2026 · 2 min

OpenAI Codex Complete Guide: Four Tools for Building an AI Programming Workflow

Deep dive into OpenAI Codex's four core tools: IDE extension, CLI, Cloud service, and code review bot. Learn how they work together to build an efficient AI programming workflow from local coding to cloud automation.

Tutorials

May 29, 2026 · 3 min

Claude Code Desktop Installation & Configuration Guide: No Account Required + DeepSeek Integration + Chinese Localization

Step-by-step guide to install Claude Code Desktop, use it without an account via Developer Mode, integrate DeepSeek models through CSwitch, add Chinese localization, and configure custom Skills.

Tutorials

May 29, 2026 · 3 min

AI + Jupyter Notebook: A Practical Method for Quickly Getting Started in Any STEM Subject

The hardest part of STEM is the gap between theory and practice. Learn how to use Jupyter Notebook with AI Coding Agents to auto-generate interactive tutorials for math, physics, statistics, and more.

Tutorials

May 29, 2026 · 2 min

Dify 1.8.0 Hands-On Tutorial: Complete Guide from Deployment to Building AI Applications

A detailed guide to Dify 1.8.0 Docker deployment, environment setup, and AI app building. Covers five app types, comparisons with Coze, workflow creation, and more for this open-source AI platform.

Tutorials

May 29, 2026 · 2 min

Zen MCP: An Open-Source Tool That Lets Claude Orchestrate Multiple AI Models in Collaboration

Deep dive into Zen MCP, an open-source project that lets Claude Code orchestrate Gemini, o3, and other AI models via MCP, with a setup guide for a cost-reducing proxy.