#Transformer

187 related articles

2026年5月27日·2 min

MemPalace: An Open-Source Tool That Gives AI Agents Local Long-Term Memory

MemPalace is an open-source local memory tool that builds long-term memory for AI Agents via verbatim storage, semantic retrieval, and MCP protocol, solving the pain of starting from scratch every session.

Qwen Core Team Turmoil, OpenAI and Google Release New Models in Rapid Succession | AI Daily

Tech Frontiers

2026年5月27日·2 min

Qwen Core Team Turmoil, OpenAI and Google Release New Models in Rapid Succession | AI Daily

Multiple core leaders depart Alibaba's Qwen team amid metric disputes. Same day: MiniMax Music 2.5+, OpenAI GPT 5.3 Instant, Google Gemini 3.1 Flashlight, and Seedance 2.0 pricing announced.

GLM5 Architecture Leaked: 745B Parameters, DeepSeek V4 May Launch Quantized Smaller Model First

Tech Frontiers

2026年5月27日·2 min

GLM5 Architecture Leaked: 745B Parameters, DeepSeek V4 May Launch Quantized Smaller Model First

GLM5 code leak reveals 745B-parameter MoE architecture replicating DeepSeek V3. DeepSeek V4 may launch a 200B quantized model first, with flagship exceeding 1T parameters.

DeepSeek OCR2, Kimi K2.5, and Microsoft Maia 200 All Launched on the Same Day

Tech Frontiers

2026年5月27日·2 min

DeepSeek OCR2, Kimi K2.5, and Microsoft Maia 200 All Launched on the Same Day

DeepSeek releases OCR2 replacing CLIP with an LLM as visual encoder; Moonshot AI launches Kimi K2.5 with 100+ sub-agent cluster mode; Microsoft deploys 3nm Maia 200 chip; Alibaba releases Qwen3 Max Thinking.

Gemini Omni Video Style Transfer: Change Video Visual Styles with Natural Language

Tech Frontiers

2026年5月27日·2 min

Gemini Omni Video Style Transfer: Change Video Visual Styles with Natural Language

Deep dive into Google Gemini Omni's video style transfer: transform videos into watercolor, cyberpunk, or Ghibli styles using natural language. Explore its technology, workflow, and competitive landscape.

The Complete Guide to Claude Code: From Personal Assistant to AI Agent Development

Tutorials

2026年5月27日·3 min

The Complete Guide to Claude Code: From Personal Assistant to AI Agent Development

A systematic breakdown of the Complete Guide to Claude Code course, covering context engineering, MCP protocol, claude.md configuration, multi-Agent architecture, and three progressive projects.

Kimi K2.6 Open-Source Hands-On: How Strong Is Its Orchestration of 300 Concurrent Agents?

Product Reviews

2026年5月27日·2 min

Kimi K2.6 Open-Source Hands-On: How Strong Is Its Orchestration of 300 Concurrent Agents?

Deep analysis of Moonshot AI's open-source Kimi K2.6 Agent orchestration: 300 sub-Agents executing 4000-step tasks, outperforming GPT-5.4 in coding benchmarks, LoRA fine-tuning on 2x RTX 4090s.

Frontend Engineers Leveling Up to AI Agents: LangGraph.js Architecture Design & Practical Guide

Tutorials

2026年5月27日·3 min

Frontend Engineers Leveling Up to AI Agents: LangGraph.js Architecture Design & Practical Guide

How can frontend engineers advance into AI Agent development? This guide covers LangGraph.js core architecture (state, nodes, edges), LangChain comparison, and workflow agent design with practical examples.

Claude Code Sub-Agents and Cursor BugBot Launch: AI Programming Tools Get Major Upgrades

Tech Frontiers

2026年5月27日·3 min

Claude Code Sub-Agents and Cursor BugBot Launch: AI Programming Tools Get Major Upgrades

Anthropic adds custom sub-agents to Claude Code, Cursor launches code review Agent BugBot, Qwen releases 92-language translation model, and Google unveils three experimental AI products.

Getting Started with AI Full-Stack Development: A Knowledge Framework from Machine Learning to Large Language Models

Tutorials

2026年5月27日·3 min

Getting Started with AI Full-Stack Development: A Knowledge Framework from Machine Learning to Large Language Models

A systematic guide to the relationships between AI, machine learning, deep learning, and large language models, helping developers build a clear knowledge framework and find an efficient learning path.

Kimi K2.6 In-Depth Review: A Complete Breakdown of Its Coding and Agent Capabilities

Product Reviews

2026年5月27日·3 min

Kimi K2.6 In-Depth Review: A Complete Breakdown of Its Coding and Agent Capabilities

In-depth review of Kimi K2.6's coding, Agent collaboration, and visual development capabilities. #1 open-source on SWE-Bench Pro, 300 parallel sub-agents, API priced at 1/3 of competitors.

Getting Started with LLM Application Development from Scratch: A Complete Guide to Learning Paths and Career Directions

Tutorials

2026年5月27日·3 min

Getting Started with LLM Application Development from Scratch: A Complete Guide to Learning Paths and Career Directions

A complete beginner's guide to LLM application development: learn the three key directions (API calling, RAG, Agent), master frameworks like LangChain, and follow a step-by-step learning path to become an AI application developer.

Learning LLM Application Development from Scratch: A Complete Roadmap from RAG to Agent

Tutorials

2026年5月27日·3 min

Learning LLM Application Development from Scratch: A Complete Roadmap from RAG to Agent

How to start LLM application development from scratch? A complete roadmap covering Python basics, RAG knowledge bases, and Agent development with LangChain.

Product Reviews

Local Deployment of Qwen 3.6 27B on 4×…

2026年5月27日·3 min

Local Deployment of Qwen 3.6 27B on 4×3080Ti: Real-World Coding Test with OpenCode

Real-world test of Qwen 3.6 27B FP8 deployed on 4×3080Ti 16GB modded GPUs with OpenCode for system tool development. Covers hardware setup, inference speed, context management, and productivity gains.

Tutorials

Decoding LLM Naming Conventions: Param…

2026年5月27日·3 min

Decoding LLM Naming Conventions: Parameter Counts, Quantization Formats & VRAM Requirements Quick Reference

Decode LLM naming conventions, understand 32B parameters & AWQ/GGUF quantization formats, with 4-bit VRAM estimation formulas, MOE model pitfalls, and model selection by GPU tier.

NVIDIA Blackwell Sets New STAC-AI Records for Financial LLM Inference

Industry Insights

2026年5月27日·2 min

NVIDIA Blackwell Sets New STAC-AI Records for Financial LLM Inference

NVIDIA Blackwell GPU sets new LLM inference records in STAC-AI financial benchmark. Explore Blackwell architecture advantages, TensorRT-LLM co-optimization, and LLM applications in trading and risk management.

Deep Dives

Getting Started with RAG: A Complete G…

2026年5月27日·3 min

Getting Started with RAG: A Complete Guide from LLM Hallucinations to Retrieval-Augmented Generation

A deep dive into RAG (Retrieval-Augmented Generation) technology, covering LLM hallucinations, data staleness, and limited expertise, plus RAG workflows, core components, and LangChain learning paths.

Tutorials

Efficient PyTorch Learning: A Source C…

2026年5月27日·3 min

Efficient PyTorch Learning: A Source Code-Driven Methodology

A proven PyTorch learning method: spend 2-3 days on basics, then advance rapidly by reading U-Net and ViT source code line by line. Master PyTorch through source code-driven learning.

Tutorials

LLM Learning Roadmap: A Complete Guide…

2026年5月27日·3 min

LLM Learning Roadmap: A Complete Guide from Beginner to Project Implementation Across Seven Core Modules

A systematic breakdown of seven core LLM learning modules covering environment setup, Prompt Engineering, RAG, Agents, dev frameworks, fine-tuning, and hands-on projects for developers.

Gemini Omni Video Generation: One-Click Synthesis from Mixed Text, Image, and Video Inputs

Tech Frontiers

2026年5月27日·2 min

Gemini Omni Video Generation: One-Click Synthesis from Mixed Text, Image, and Video Inputs

Detailed guide to Google Gemini Omni's multimodal video generation: mix text, images, and video inputs to synthesize coherent 10-second videos with one click.