#Local Deployment

30 related articles

June 2, 2026 · 4 min

Complete Guide to Building a Local AI Knowledge Base with Qwen3.5 + RAGFlow + Ollama

Complete guide to building a local AI knowledge base with Qwen3.5, RAGFlow, and Ollama, covering Docker deployment, Embedding model configuration, knowledge base creation, and RAG system setup.

Tutorials

June 2, 2026 · 3 min

Stable Diffusion Local Deployment Guide: Run AI Image Generation Free with 8GB RAM

Complete guide to deploying Stable Diffusion locally. Covers hardware requirements, one-click installation, and model setup. Run AI image generation free with 8GB RAM.

Product Reviews

June 2, 2026 · 3 min

Hertzman: A Free, No-Install Local LLM Deployment Tool Review

Detailed review of Hertzman local inference engine covering one-click deployment, smart hardware recommendations, OpenAI-compatible API, and performance comparison with LM Studio.

Tutorials

June 2, 2026 · 3 min

Stable Diffusion Local Deployment Guide: Free and Unlimited AI Image Generation

Complete guide to deploying Stable Diffusion locally, covering hardware requirements, one-click installation, and model management. Free, unlimited, fully offline AI image generation for creators and privacy-conscious users.

Tutorials

June 2, 2026 · 2 min

Complete Guide to Configuring Local DeepSeek Model in PyCharm for AI-Assisted Programming

Learn how to configure a local DeepSeek model in PyCharm via Ollama for free, privacy-safe AI-assisted programming. Includes installation steps, plugin setup, usage tips, and hardware recommendations.

Tech Frontiers

May 30, 2026 · 2 min

LFM2.5-8B-A1B: A MoE Model with 1.5B Active Parameters Delivering 4x Its Weight Class Performance

Liquid AI releases LFM2.5-8B-A1B, a MoE model with 8B total params but only 1.5B active, matching 6B-class models in tool calling. Supports 128K context, local deployment, multilingual, with SGLang Day-0 support.

Tutorials

May 29, 2026 · 3 min

DeepSeek V4 Flash MTP Speculative Decoding Real-World Test: A Guide to 20% Faster Local Inference

Real-world testing of DeepSeek V4 Flash with MTP speculative decoding: ~20% speedup for code generation, minimal gains for text. Covers memory overhead, accuracy differences, Q4 vs Q3 quantization, and full deployment tutorial.

Tutorials

May 28, 2026 · 3 min

Claude Agent SDK + LiteLLM + Local LLMs: Building a Zero-Cost AI Agent Platform

Learn how to redirect Claude Agent SDK API requests to local LLMs via LiteLLM Proxy, achieving zero-cost inference while retaining full agent framework capabilities.

Product Reviews

May 27, 2026 · 3 min

Local Deployment of Qwen 3.6 27B on 4×3080Ti: Real-World Coding Test with OpenCode

Real-world test of Qwen 3.6 27B FP8 deployed on 4×3080Ti 16GB modded GPUs with OpenCode for system tool development. Covers hardware setup, inference speed, context management, and productivity gains.

Tutorials

May 27, 2026 · 2 min

Complete Guide to Local LLM Deployment with Ollama: AI That Works Offline

Complete guide to deploying open-source LLMs locally with Ollama. Covers installation, model selection, VRAM requirements, and performance comparison of Llama 3 and Qwen models. Free, offline-capable AI.