71 related articles

Step-by-step guide to configuring CreateNow agents with DeepSeek, Kimi, and Xiaomi LLMs. Covers one-click setup, custom model integration, and API Key acquisition for building AI digital employees.
Stable Diffusion Poxian Edition Bundle…
Stable Diffusion Poxian Edition bundle: install in 3 steps with 337 built-in workflows, Chinese-annotated models, GTX 1060+ support, and completely free local AI image/video generation.
The Five-Tier Pyramid of IT Careers in…
AI is reshaping IT careers into a five-tier pyramid from tool usage to self-developed models. Learn where you fit and how to maximize your career potential.
Guo Yu on the AI Agent Era: The End of…
Former ByteDance engineer Guo Yu analyzes the AI Agent revolution: how Claude Code's Skill feature signals the end of traditional software, the SaaS collapse, and the future of knowledge workers.
Google Agent Skills Library Hands-On: …
Hands-on review of Google's open-source Agent Skills library: how 12 Skills improve AI-generated UI quality, configuration pitfalls, and the strategic significance of standardizing Agent Skills.
Six Major AI Events in One Day: OpenAI…
Six major AI events decoded: OpenAI bug falsely bans Pro users, Anthropic calls for frontier model pause, DeepSeek quality drops, Grok tops image arena, ChatGPT hits 1B MAU, WeChat tests AI payments.
StepFun STEP3.7 Flash Tops AA Benchmar…
StepFun STEP3.7 Flash tops Artificial Analysis benchmark in speed, cost-efficiency, and multimodal. AI safety leaders call for legislation, embodied AI gets 300K-home training ground, Huawei Cloud unveils Agentic Infra.

Runway upgrades real-time video Characters with tool calling, enabling AI video agents to execute queries, tasks, and operations—marking a shift from content generation to intelligent agent platform.

Deep dive into Runway Agent's AI video generation capabilities: how one product photo and a creative brief can automatically produce a complete ad video in a single session.

Runway announces its 4th AI Film Festival top 10 finalists with NYC and LA screenings in June. A look at standout works, AI video tech evolution, and creator ecosystem impact.

Runway launches AI creative agent Runway Agent, supporting conversational interaction for end-to-end video production from ideation to sound design and editing, marking AI video tools entering the Agent era.

Deep dive into Google Gemini Omni's core capabilities: multimodal input support for images, video, and audio, enabling interactive video generation and editing—a full-modal AI transforming content creation.

From "otter using WiFi on a plane" to multi-character complex narratives, AI video generation achieved exponential leaps in two years. Analyzing how diffusion models and Transformers drive breakthroughs.

Google's Gemini Omni generated a movie trailer for the Roman epic The Aeneid from a single prompt, and showed off video editing—fixing errors directly without regenerating.

At Google I/O, AI video tool Flow integrates deeply with Gemini Omni, bringing batch editing, character consistency improvements, and cinematic output upgrades.

Aleph 2.0 introduces single-frame edit propagation: modify one frame and automatically apply changes across the entire video. Deep dive into Edit Studio, temporal consistency breakthroughs, and industry impact.

OpenAI officially returns to robotics, hiring full-stack hardware and ML engineers at scale. Led by DALL·E creator Aditya Ramesh, the team evolved from world simulation research to build general-purpose robots.

Deep dive into Replit Canvas: multimodal AI generation for images, video, and audio, with sketch-to-image, WYSIWYG editing, and real-time collaboration.
Tech FrontiersGitHub Universe unveils Agent HQ platform for unified coding agent management, Copilot upgrades with multi-model support. OpenAI completes restructuring, Anthropic tests new model, NVIDIA open-sources AI models.
Product ReviewsIn-depth review of 11 AI Agent tools including ChatGPT Agent, Manus, and Claude Code, covering office work, academic writing, coding, and video creation scenarios.