187 related articles
Product Reviews: MemPalace is an open-source local memory tool that builds long-term memory for AI Agents via verbatim storage, semantic retrieval, and MCP protocol, solving the pain of starting from scratch every session.
Tech Frontiers: Multiple core leaders depart Alibaba's Qwen team amid metric disputes. Same day: MiniMax Music 2.5+, OpenAI GPT 5.3 Instant, Google Gemini 3.1 Flashlight, and Seedance 2.0 pricing announced.
Tech Frontiers: GLM5 code leak reveals 745B-parameter MoE architecture replicating DeepSeek V3. DeepSeek V4 may launch a 200B quantized model first, with flagship exceeding 1T parameters.
Tech Frontiers: DeepSeek releases OCR2 replacing CLIP with an LLM as visual encoder; Moonshot AI launches Kimi K2.5 with 100+ sub-agent cluster mode; Microsoft deploys 3nm Maia 200 chip; Alibaba releases Qwen3 Max Thinking.
Tech Frontiers: Deep dive into Google Gemini Omni's video style transfer: transform videos into watercolor, cyberpunk, or Ghibli styles using natural language. Explore its technology, workflow, and competitive landscape.
Tutorials: A systematic breakdown of the Complete Guide to Claude Code course, covering context engineering, the MCP protocol, CLAUDE.md configuration, multi-Agent architecture, and three progressive projects.
Product Reviews: Deep analysis of Moonshot AI's open-source Kimi K2.6 Agent orchestration: 300 sub-Agents executing 4000-step tasks, outperforming GPT-5.4 in coding benchmarks, LoRA fine-tuning on 2x RTX 4090s.
Tutorials: How can frontend engineers advance into AI Agent development? This guide covers LangGraph.js core architecture (state, nodes, edges), LangChain comparison, and workflow agent design with practical examples.
Tech Frontiers: Anthropic adds custom sub-agents to Claude Code, Cursor launches code review Agent BugBot, Qwen releases 92-language translation model, and Google unveils three experimental AI products.
Tutorials: A systematic guide to the relationships between AI, machine learning, deep learning, and large language models, helping developers build a clear knowledge framework and find an efficient learning path.
Product Reviews: In-depth review of Kimi K2.6's coding, Agent collaboration, and visual development capabilities. #1 open-source on SWE-Bench Pro, 300 parallel sub-agents, API priced at 1/3 of competitors.
Tutorials: A complete beginner's guide to LLM application development: learn the three key directions (API calling, RAG, Agent), master frameworks like LangChain, and follow a step-by-step learning path to become an AI application developer.
Tutorials: How to start LLM application development from scratch? A complete roadmap covering Python basics, RAG knowledge bases, and Agent development with LangChain.
Local Deployment of Qwen 3.6 27B on 4×…
Real-world test of Qwen 3.6 27B FP8 deployed on 4×3080Ti 16GB modded GPUs with OpenCode for system tool development. Covers hardware setup, inference speed, context management, and productivity gains.
Decoding LLM Naming Conventions: Param…
Decode LLM naming conventions, understand 32B parameters and AWQ/GGUF quantization formats, with 4-bit VRAM estimation formulas, MoE model pitfalls, and model selection by GPU tier.
Industry Insights: NVIDIA Blackwell GPU sets new LLM inference records in STAC-AI financial benchmark. Explore Blackwell architecture advantages, TensorRT-LLM co-optimization, and LLM applications in trading and risk management.
Getting Started with RAG: A Complete G…
A deep dive into RAG (Retrieval-Augmented Generation) technology, covering the LLM problems it addresses (hallucinations, data staleness, and limited domain expertise), plus RAG workflows, core components, and LangChain learning paths.
Efficient PyTorch Learning: A Source C…
A proven PyTorch learning method: spend 2-3 days on basics, then advance rapidly by reading U-Net and ViT source code line by line. Master PyTorch through source code-driven learning.
LLM Learning Roadmap: A Complete Guide…
A systematic breakdown of seven core LLM learning modules covering environment setup, Prompt Engineering, RAG, Agents, dev frameworks, fine-tuning, and hands-on projects for developers.
Tech Frontiers: Detailed guide to Google Gemini Omni's multimodal video generation: mix text, images, and video inputs to synthesize coherent 10-second videos with one click.