#large language model

985 related articles

2026年5月30日·1 min

Cloudflare Contributes Critical KV Cache and Mooncake Fixes to SGLang

Cloudflare contributes decode KV cache offload and Mooncake recovery fixes to SGLang, resolving garbled output under high concurrency for Kimi K2.6 and enabling automatic fault recovery in distributed inference.

SGLang Hosts Agent Loops Office Hour, Focusing on Agentic Loop Architecture Optimization

Tech Frontiers

2026年5月30日·1 min

SGLang Hosts Agent Loops Office Hour, Focusing on Agentic Loop Architecture Optimization

SGLang team hosts an Agent Loops Office Hour exploring inference optimization for agentic loops, covering KV Cache reuse, low-latency multi-turn dialogue, and tool calling techniques.

Product Reviews

Llama 3.3 70B In-Depth Review: Testing…

2026年5月30日·3 min

Llama 3.3 70B In-Depth Review: Testing the Strongest Open-Source LLM with 13 Questions

Meta releases Llama 3.3 70B open-source model with just 70B parameters rivaling 405B performance. Tested on 13 logic, math, and coding questions, it passed 12 — reshaping the open-source model landscape.

Product Reviews

API Aggregation Proxy Platforms Tested…

2026年5月30日·2 min

API Aggregation Proxy Platforms Tested: One Interface to Call 100+ AI Models

Hands-on testing of an API aggregation proxy platform's model calling capabilities, including GPT-Image2 image generation, cost analysis, and coverage of 100+ models like Claude and Gemini.

Industry Insights

Six Foundational Upgrades to Claude Co…

2026年5月30日·3 min

Six Foundational Upgrades to Claude Code: AI Programming Moves from Lab to Industrial Scale

Anthropic's largest-ever foundational upgrade to Claude Code fixes six critical issues at once—terminal flickering, thinking freezes, cryptic errors, context deadlocks, unstable connections, and session crashes—shifting AI coding competition to the infrastructure layer.

Tutorials

BMad-Method: Building an AI Agile Deve…

2026年5月30日·3 min

BMad-Method: Building an AI Agile Development Team with a Multi-Agent Framework

Deep dive into BMad-Method, an open-source multi-agent framework simulating a full agile team—from business analysis to QA—supporting Claude Code, Cursor, and more.

Tutorials

Claude Code Source Code Study Guide: E…

2026年5月30日·3 min

Claude Code Source Code Study Guide: Efficiently Mastering Core AI Agent Development Architecture

Learn AI Agent development from Claude Code's 510K lines of source code, covering Agent Loop, context compression, multi-Agent orchestration, and two efficient study methods.

Tutorials

Claude Code Monitor Tool Explained: Ev…

2026年5月30日·2 min

Claude Code Monitor Tool Explained: Event-Driven Replaces Polling, Saving Tokens More Efficiently

Deep dive into Claude Code's new built-in Monitor tool. Learn how event-driven monitoring replaces polling via Stream Filter and Poll and Diff modes, dramatically reducing token consumption.

Product Reviews

Major Claude Code Update: A Complete G…

2026年5月30日·2 min

Major Claude Code Update: A Complete Guide to Agent View and the Goal System

Deep dive into Claude Code's new Agent View and Goal system, covering multi-agent parallel management, background sessions, and result-oriented autonomous execution.

Product Reviews

ABCoder in Practice: A Demonstration o…

2026年5月30日·3 min

ABCoder in Practice: A Demonstration of Solving AI Code Hallucination

A practical comparison using Hertz framework SSE services shows how ABCoder uses MCP protocol to let AI models consult real source code, solving LLM code hallucination problems.

OpenAI Launches Rosalind Biodefense Program: How AI Is Reshaping Public Health Security

Tech Frontiers

2026年5月29日·2 min

OpenAI Launches Rosalind Biodefense Program: How AI Is Reshaping Public Health Security

OpenAI launches Rosalind Biodefense, offering GPT-Rosalind to government agencies to accelerate pathogen surveillance, vaccine R&D, and pandemic preparedness using AI.

MixupMP: How Data Augmentation Fixes the Uncertainty Quantification Flaws of Deep Ensembles

Research

2026年5月29日·3 min

MixupMP: How Data Augmentation Fixes the Uncertainty Quantification Flaws of Deep Ensembles

Deep dive into AISTATS 2024 paper MixupMP: revealing Deep Ensembles' fundamental UQ flaws and fixing them via Mixup augmentation and Martingale Posterior framework for better calibration and OOD detection.

Product Reviews

Deep Dive into Cursor's Pay-Per-Use Re…

2026年5月29日·3 min

Deep Dive into Cursor's Pay-Per-Use Refill Plan: Is Using Official Pro Accounts at 65% Off Reliable?

Deep analysis of Cursor's pay-per-use refill plugin: account rotation mechanism, tiered discounts, full model support, and objective assessment of compliance risks and data security concerns.

Tutorials

AI Programming Spec Sheets: 30 Lines o…

2026年5月29日·3 min

AI Programming Spec Sheets: 30 Lines of Configuration Saves Five Rounds of Rework

Replace vague prompts with spec sheets—30 lines of config gets AI coding right the first time. Covers the six-element framework, three-tier boundaries, and three iron rules to eliminate rework.

Product Reviews

Claude Opus 4.8 Hands-On: What Can You…

2026年5月29日·2 min

Claude Opus 4.8 Hands-On: What Can You Build in One Hour?

Hands-on testing of Claude Opus 4.8's coding and creative abilities, including Mario game and Slay the Spire-style card game development, quota consumption, and real-world bug frequency.

Tutorials

Claude Code Desktop Installation & Con…

2026年5月29日·3 min

Claude Code Desktop Installation & Configuration Guide: No Account Required + DeepSeek Integration + Chinese Localization

Step-by-step guide to install Claude Code Desktop, use it without an account via Developer Mode, integrate DeepSeek models through CSwitch, add Chinese localization, and configure custom Skills.

Tutorials

Getting Started with Claude Code: 5 Co…

2026年5月29日·2 min

Getting Started with Claude Code: 5 Core Advantages Over Regular AI Coding Tools

Deep dive into the core differences between Claude Code and regular AI chat tools across 5 dimensions: interaction, context understanding, execution, memory, and tool invocation.

Tutorials

AI + Jupyter Notebook: A Practical Met…

2026年5月29日·3 min

AI + Jupyter Notebook: A Practical Method for Quickly Getting Started in Any STEM Subject

The hardest part of STEM is the gap between theory and practice. Learn how to use Jupyter Notebook with AI Coding Agents to auto-generate interactive tutorials for math, physics, statistics, and more.

Industry Insights

Deep Dive into Three Major LLM Career …

2026年5月29日·3 min

Deep Dive into Three Major LLM Career Paths: Requirements, Tech Stacks, and Career Prospects

Deep analysis of three core LLM roles—Application Engineer, Development Engineer, and Algorithm Engineer—covering technical requirements, salary thresholds, and career prospects including RAG, fine-tuning, and inference deployment.

Tutorials

Cursor + MCP in Practice: A Complete G…

2026年5月29日·3 min

Cursor + MCP in Practice: A Complete Guide to Building a Browser Automation Agent

A detailed guide on integrating Playwright MCP Server with Cursor, covering Node.js setup with NVM, NPM mirror configuration, and building a browser automation agent step by step.