# dense model

GPT-5.5 vs DeepSeek-V4: Who Wins in a Four-Round Head-to-Head Test?

GPT-5.5 vs DeepSeek-V4 in four comprehensive rounds covering world knowledge, context memory, logical reasoning, and coding — a detailed comparison of real performance differences.

Manus Hands-On Review: How Does This AI Agent Perform on the DeepSeek Tech Stack?

Product Reviews

2026年6月3日·3 min

Manus Hands-On Review: How Does This AI Agent Perform on the DeepSeek Tech Stack?

Hands-on review of Manus AI Agent on the DeepSeek tech stack, analyzing task execution, Chinese reasoning capabilities, strengths, limitations, and the potential of domestic LLMs in Agent applications.

DeepSeek-V3.2 Released: Coding and Math Capabilities Join the Global Top Tier

DeepSeek-V3.2 Released: Coding and Math Capabilities Join the Global Top Tier

DeepSeek-V3.2 released with coding, math, and Agent capabilities matching Gemini 3.0 Pro, setting new open-source SOTA. Detailed analysis of performance gains, use cases, and deployment tips.

Ollama + Gemma 4 Local Codex Setup: Complete Guide to Zero-Cost AI Programming

2026年6月3日·3 min

Ollama + Gemma 4 Local Codex Setup: Complete Guide to Zero-Cost AI Programming

Learn how to run Codex locally with Ollama and Gemma 4 for zero-cost AI programming. Covers installation, model selection, and real demos as an alternative to $20-200/month paid plans.

Complete Guide to Connecting DeepSeek V4 with Claude Code: CC Switch Configuration Tutorial

Complete Guide to Connecting DeepSeek V4 with Claude Code: CC Switch Configuration Tutorial

Learn how to connect DeepSeek V4 Pro and V4 Flash to Claude Code using CC Switch, with complete steps for download, model mapping, and API Key configuration in 5 minutes.

Google Gemma 4 Hands-On Review: Offline on Smartphones + Ollama Deployment Tutorial

Product Reviews

2026年6月3日·3 min

Google Gemma 4 Hands-On Review: Offline on Smartphones + Ollama Deployment Tutorial

Hands-on testing of Google Gemma 4 open-source models running offline on three phones, with Dense vs MOE architecture explained and a complete Ollama + Claude Code deployment tutorial.

Qwen3 Free Coding in Practice: Building Full-Stack Apps with Cline

Qwen3 Free Coding in Practice: Building Full-Stack Apps with Cline

A hands-on guide to using Qwen3 for free via OpenRouter API and Ollama local deployment, paired with Cline coding agent for full-stack development tasks.

Connect Claude Code to DeepSeek: Zero-Barrier Four-Step Configuration Tutorial

2026年6月2日·2 min

Connect Claude Code to DeepSeek: Zero-Barrier Four-Step Configuration Tutorial

Step-by-step tutorial on connecting Claude Code to DeepSeek using ccswitch. No overseas account or credit card needed — just 10 RMB to start using an AI coding assistant.

llama.cpp MTP Acceleration Deployment Guide: Configuration Steps & Real-World Benchmarks

2026年6月2日·3 min

llama.cpp MTP Acceleration Deployment Guide: Configuration Steps & Real-World Benchmarks

Guide to enabling MTP multi-Token prediction acceleration in llama.cpp, covering CUDA setup, desktop configuration, model selection, and benchmarks showing ~60 Token/s with Qwen3 27B.

Tutorial: Building a Low-Cost AI Code Editor with DeepSeek-V3 + VSCode

2026年6月2日·2 min

Tutorial: Building a Low-Cost AI Code Editor with DeepSeek-V3 + VSCode

Step-by-step tutorial: Build a low-cost AI programming assistant using DeepSeek-V3 API with VSCode's Continue plugin. Covers setup, API Key configuration, code completion demo, and Ollama local deployment.

Hermes Agent Deployment Tutorial: An AI Assistant That Uses Fewer Tokens Than CrawlAI

2026年6月2日·3 min

Hermes Agent Deployment Tutorial: An AI Assistant That Uses Fewer Tokens Than CrawlAI

Complete Hermes Agent deployment tutorial for Windows: environment setup, model configuration, WeChat channel connection, and troubleshooting. Uses fewer tokens than CrawlAI with direct WeChat chat support.

oMLX + MTP + Qwen3.6: Local AI Coding Speed Breaks New Records

2026年6月1日·3 min

oMLX + MTP + Qwen3.6: Local AI Coding Speed Breaks New Records

Using oMLX with MTP and Qwen3.6 35B on Apple Silicon Mac to achieve 86.7 tokens/s local coding speed, building a full-stack app in under 5 minutes.

AI Weekly: Claude Code Review, Gemma 4…

2026年6月1日·3 min

AI Weekly: Claude Code Review, Gemma 4 Leak & DeepSeek V4 Delayed

Weekly AI roundup: Anthropic launches Claude Code review, Google Gemma 4 leaks with MoE architecture, DeepSeek V4 delayed again, Microsoft Copilot Cowork reshapes collaboration, and OpenAI acquires PromptFool.

Step 3.7 Flash: Deep Dive into the 198B Sparse MoE Multimodal Model

2026年5月30日·2 min

Step 3.7 Flash: Deep Dive into the 198B Sparse MoE Multimodal Model

Deep dive into StepFun AI's Step 3.7 Flash, a 198B sparse MoE vision-language model with 256K context and 3-level reasoning, excelling in multimodal understanding, AI coding, and Agent tool orchestration.

LFM2.5-8B-A1B: A MoE Model with 1.5B Active Parameters Delivering 4x Its Weight Class Performance

2026年5月30日·2 min

LFM2.5-8B-A1B: A MoE Model with 1.5B Active Parameters Delivering 4x Its Weight Class Performance

Liquid AI releases LFM2.5-8B-A1B, a MoE model with 8B total params but only 1.5B active, matching 6B-class models in tool calling. Supports 128K context, local deployment, multilingual, with SGLang Day-0 support.

Cloudflare Contributes Critical KV Cache and Mooncake Fixes to SGLang