3 related articles
Tech FrontiersDeep dive into StepFun AI's Step 3.7 Flash, a 198B sparse MoE vision-language model with 256K context and 3-level reasoning, excelling in multimodal understanding, AI coding, and Agent tool orchestration.
Tech FrontiersLiquid AI releases LFM2.5-8B-A1B, a MoE model with 8B total params but only 1.5B active, matching 6B-class models in tool calling. Supports 128K context, local deployment, multilingual, with SGLang Day-0 support.
Tech FrontiersDeep dive into Moonshot AI's fully open-sourced Kimi K2.5: 1T parameter MoE architecture, Vision-to-Code capabilities, and 100-Agent parallel cluster system topping open-source benchmarks.