46 related articles
TutorialsStep-by-step guide to building a local RAG knowledge base using RAGFlow, Ollama, and LM Studio with Docker, covering Embedding model deployment and network troubleshooting for private AI Q&A.
Deep DivesIn-depth analysis of three core reasons Python dominates AI development: simple syntax for quick onboarding, powerful ecosystem, and industry-wide network effects.
TutorialsPractical guide to batch AI image generation on Mac using Draw Things, covering prompt iteration strategies, negative prompt pitfalls, performance tips, and the decision to switch to Replicate cloud platform.
TutorialsA systematic AI Agent learning roadmap covering Python setup, Prompt Engineering, RAG, LangChain, multi-Agent collaboration, with enterprise medical consultation system case study and phased learning plan.
Product ReviewsDeep dive into Tencent Marvis system-level AI assistant, analyzing its local knowledge base, semantic search, privacy mode, and how Agents evolve from tools to OS integration.
Deep DivesDeep dive into Transformer architecture covering self-attention QKV mechanics, Encoder-Decoder structure, Flash Attention memory optimization, RoPE positional encoding, and GQA inference acceleration.
TutorialsComplete guide to deploying Stable Diffusion locally. Covers hardware requirements, one-click installation, and model setup. Run AI image generation free with 8GB RAM.
TutorialsComplete guide to deploying Stable Diffusion locally, covering hardware requirements, one-click installation, and model management. Free, unlimited, fully offline AI image generation for creators and privacy-conscious users.
TutorialsLearn how to configure a local DeepSeek model in PyCharm via Ollama for free, privacy-safe AI-assisted programming. Includes installation steps, plugin setup, usage tips, and hardware recommendations.
ResearchDeep dive into how the Humanize framework transforms LLM tokens into engineering productivity via Agent Loops. Covers KDA winning CUDA kernel contests, virtual hardware optimization, and 50% research cost reduction.
TutorialsLearn how to deploy a PD-disaggregated SGLang inference cluster on AMD GPUs using a single config file, boosting LLM throughput and latency performance.
Tech FrontiersSGLang v0.5.12.post1 stability patch details: 12 critical fixes covering DeepSeek V4 garbled text and crashes, NIXL PD disaggregated inference logic, Blackwell B300 adaptation, and cold start optimization.
Industry InsightsAMD Instinct MI355X achieves 5% lower TCO than NVIDIA B200 on DeepSeek-R1 disaggregated inference via SGLang+MoRI full-stack optimization with 1.25x per-GPU throughput.
Deep Dive into Three Major LLM Career …
Deep analysis of three core LLM roles—Application Engineer, Development Engineer, and Algorithm Engineer—covering technical requirements, salary thresholds, and career prospects including RAG, fine-tuning, and inference deployment.
Optimize Anything: One API to Unify Op…
UC Berkeley and Stanford propose Optimize Anything, a universal text optimization framework that unifies optimization of CUDA kernels, agent architectures, and prompts through one declarative API.
Claude Code Installation & Agent Hands…
Step-by-step Claude Code installation guide with Volcengine GLM5.1 Chinese LLM. Hands-on Agent demos for Bilibili data scraping and ComfyUI setup. No coding required.
Product ReviewsDeep dive into AIStarter and PanelAI architecture upgrades covering project market, model management, AI assistant features, and pricing strategy for this all-in-one AI toolbox.
Product ReviewsIndie developer releases AI IDE WaLiCode v0.2.0 with multi-project chat, task decomposition mode, and Ollama local model support, addressing pain points in mainstream AI IDEs.
Industry InsightsDeep dive into how NVIDIA Dynamo Snapshot reduces LLM inference cold start time from minutes to seconds via GPU state snapshot and recovery, covering Kubernetes integration and elastic inference.
Running AI Models on a P106 Mining GPU…
Build a local AI workstation with a P106 mining GPU for under $10. Run Live Portrait and other AI models locally with full privacy, zero marginal cost, and incredible value.