#Ollama

75 related articles

May 28, 2026 · 3 min

OpenHands Deep Dive: How an Open-Source AI Coding Agent is Redefining Software Development

Deep dive into OpenHands, an open-source AI coding agent platform covering architecture design, sandboxed code execution, and multi-tool orchestration, compared with Copilot, Devin, and more.

Tutorials

May 28, 2026 · 3 min

Claude Agent SDK + LiteLLM + Local LLMs: Building a Zero-Cost AI Agent Platform

Learn how to redirect Claude Agent SDK API requests to local LLMs via LiteLLM Proxy, achieving zero-cost inference while retaining full agent framework capabilities.

Deep Dives

May 28, 2026 · 2 min

AI Agent Development Methodology: A Complete Guide from ReAct to Enterprise-Grade Tech Stack

A deep dive into AI Agent development methodology, from the ReAct theoretical framework to a four-layer enterprise tech stack covering model services, Agent types, LangChain, and production deployment.

Tutorials

May 27, 2026 · 2 min

npcpy: An Open-Source Framework That Rethinks AI Agent Development with Software Engineering Principles

Deep dive into npcpy's four-layer architecture, multi-agent collaboration, knowledge graph lifecycle management, and deployment strategies for building stable, controllable AI Agent systems.

Product Reviews

May 27, 2026 · 3 min

Running Qwen3.6-27B Locally on Mac: 4 Solutions Benchmarked

Benchmarking 4 solutions for running Qwen3.6-27B locally on Mac: GGUF, MLX Diflash, and MTP-LX. MTP-LX 4bit leads at 43.6 tok/s with solid coding, writing, and reasoning quality.

Product Reviews

May 27, 2026 · 3 min

Local Deployment of Qwen 3.6 27B on 4×3080Ti: Real-World Coding Test with OpenCode

Real-world test of Qwen 3.6 27B FP8 deployed on 4×3080Ti 16GB modded GPUs with OpenCode for system tool development. Covers hardware setup, inference speed, context management, and productivity gains.

Tutorials

May 27, 2026 · 3 min

Decoding LLM Naming Conventions: Parameter Counts, Quantization Formats & VRAM Requirements Quick Reference

Decode LLM naming conventions and understand parameter counts (such as 32B) and AWQ/GGUF quantization formats, with 4-bit VRAM estimation formulas, MoE model pitfalls, and model selection by GPU tier.

Tutorials

May 27, 2026 · 2 min

Frontend to AI Full-Stack: Complete Skill Tree & Learning Roadmap

A complete skill tree for frontend developers transitioning to AI full-stack engineers, covering TypeScript, NestJS, LangChain JS, RAG, vector databases, and Tauri 2 with a clear learning roadmap.

Product Reviews

May 27, 2026 · 2 min

The Truth Behind Windsurf's "Unlimited Credits" Hack: Shared Account Pool Rotation Mechanism and Risk Analysis

Deep analysis of how Windsurf's "unlimited credits" actually works—third-party plugins rotate shared account pools, not an official bug. Covers mechanisms, security risks, and safer alternatives.

Tutorials

May 27, 2026 · 2 min

Complete Guide to Local LLM Deployment with Ollama: AI That Works Offline

Complete guide to deploying open-source LLMs locally with Ollama. Covers installation, model selection, VRAM requirements, and performance comparison of Llama 3 and Qwen models. Free, offline-capable AI.

Tutorials

May 14, 2026 · 4 min

AI-Powered Research in Practice: From LLM Selection to Building Automated Workflows with N8N

A deep dive into AI-driven research methodology: LLM selection, Python automation, Zotero reference management, Overleaf writing, local LLM deployment, and N8N workflow automation.

Deep Dives

May 14, 2026 · 4 min

From Copilot to Agentic AI: A Complete Guide to Multi-Agent Collaboration Architectures

Deep dive into AI's four-stage evolution from Chat to Agentic AI, covering multi-Agent architectures, ReAct framework, and MCP protocol for developers.

Tutorials

May 13, 2026 · 4 min

Dify AI Agent Tutorial: Tool Integration & ESA Search Configuration in Practice

Complete guide to building AI Agents on Dify with zero code, covering tool integration, ESA search configuration, time awareness solutions, and Agent design best practices.

Product Reviews

May 13, 2026 · 3 min

ChuanhuChatGPT: A Comprehensive Analysis of the 15K-Star Open-Source Multi-Model Chat Interface

Deep dive into ChuanhuChatGPT, a 15K-star open-source project with multi-model access, Agent support, RAG file Q&A, GPT fine-tuning, and web search.

Product Reviews

May 13, 2026 · 2 min

LangBot: A Deep Dive into the 16K-Star Multi-Platform AI Bot Development Framework

An in-depth look at LangBot, an open-source production-grade AI bot platform supporting WeChat, DingTalk, Discord & more, with ChatGPT, DeepSeek, Agent, RAG & plugin capabilities.