#MLX framework

4 related articles

2026年6月3日·2 min

Gemma 4 Complete Guide: The Apache 2.0 Open-Source Agent Powerhouse

In-depth analysis of Google's Gemma 4 open-source models: 31B, 26B MOE, and 14B/12B benchmarks, deployment guides for all platforms, and MS-Swift fine-tuning tutorial for building local Agent workflows.

oMLX + MTP + Qwen3.6: Local AI Coding Speed Breaks New Records

Tutorials

2026年6月1日·3 min

oMLX + MTP + Qwen3.6: Local AI Coding Speed Breaks New Records

Using oMLX with MTP and Qwen3.6 35B on Apple Silicon Mac to achieve 86.7 tokens/s local coding speed, building a full-stack app in under 5 minutes.

Tutorials

DeepSeek V4 Flash MTP Speculative Deco…

2026年5月29日·3 min

DeepSeek V4 Flash MTP Speculative Decoding Real-World Test: A Guide to 20% Faster Local Inference

Real-world testing of DeepSeek V4 Flash with MTP speculative decoding: ~20% speedup for code generation, minimal gains for text. Covers memory overhead, accuracy differences, Q4 vs Q3 quantization, and full deployment tutorial.

Deep Dive into OpenAI Codex Plugin System: Architecture, Installation, and Hands-On Development

Tutorials

2026年5月27日·2 min

Deep Dive into OpenAI Codex Plugin System: Architecture, Installation, and Hands-On Development

Deep dive into OpenAI Codex plugin system architecture (Skills, Apps, MCP Server), four installation methods, and a macOS app development case study showing how plugins boost AI coding efficiency.