29 related articles
AI Fully Automated Orchestration in Pr…
Deep analysis of AI fully automated software orchestration: from Claude Code workflows to parallel orchestration strategies, exploring how models like MiniMax M1 drive software production costs toward zero.
Deep Dive into Three Major LLM Career …
Deep analysis of three core LLM roles—Application Engineer, Development Engineer, and Algorithm Engineer—covering technical requirements, salary thresholds, and career prospects including RAG, fine-tuning, and inference deployment.
DeepSeek V3 + bolt.html: A Practical G…
Learn how DeepSeek V3-0324 and open-source tool bolt.html combine to generate beautiful HTML pages with zero code using prompt engineering techniques.
Why Qwen3 Is the Best Open-Source Mode…
Analysis of Qwen3's advantages for MCP agent development, comparing DeepSeek R1's lack of Function Calling, covering MoE architecture and thinking mode switching.
Tech FrontiersGLM5 code leak reveals 745B-parameter MoE architecture replicating DeepSeek V3. DeepSeek V4 may launch a 200B quantized model first, with flagship exceeding 1T parameters.
Qwen 3.6 vs Gemma 4: In-Depth Comparis…
Real-world comparison of Qwen 3.6 and Gemma 4 local AI models building a Markdown editor with Tauri, testing planning ability, code generation, and development efficiency.
Product ReviewsDeep analysis of Moonshot AI's open-source Kimi K2.6 Agent orchestration: 300 sub-Agents executing 4000-step tasks, outperforming GPT-5.4 in coding benchmarks, LoRA fine-tuning on 2x RTX 4090s.
Running Qwen3.6-27B Locally on Mac: 4 …
Benchmarking 4 solutions for running Qwen3.6-27B locally on Mac: GGUF, MLX Diflash, and MTP-LX. MTP-LX 4bit leads at 43.6 tok/s with solid coding, writing, and reasoning quality.
Decoding LLM Naming Conventions: Param…
Decode LLM naming conventions, understand 32B parameters & AWQ/GGUF quantization formats, with 4-bit VRAM estimation formulas, MOE model pitfalls, and model selection by GPU tier.