#LLM推理

2 related articles

2026年5月30日·2 min

Tutorial: Deploying a PD-Disaggregated SGLang Multi-Node Inference Cluster on AMD GPUs

Learn how to deploy a PD-disaggregated SGLang inference cluster on AMD GPUs using a single config file, boosting LLM throughput and latency performance.

NVIDIA Blackwell Sets New STAC-AI Records for Financial LLM Inference

Industry Insights

2026年5月27日·2 min

NVIDIA Blackwell Sets New STAC-AI Records for Financial LLM Inference

NVIDIA Blackwell GPU sets new LLM inference records in STAC-AI financial benchmark. Explore Blackwell architecture advantages, TensorRT-LLM co-optimization, and LLM applications in trading and risk management.