Skip to main content

Meituan Unveils LongCat-Video Model for Advanced AI-Generated Content

Meituan Introduces Revolutionary Long Video Generation AI

Meituan's research division has taken a significant leap in artificial intelligence with the release of LongCat-Video, a cutting-edge video generation model that promises to transform content creation workflows. This development marks a major milestone in the company's exploration of "world models" - AI systems designed to understand and simulate real-world dynamics.

Image

Technical Architecture and Capabilities

The model is built on an advanced Diffusion Transformer (DiT) framework, integrating three core functionalities:

  • Text-to-video generation at 720p resolution and 30fps
  • Precise image-to-video conversion preserving original attributes
  • Seamless video continuation extending clips coherently

What sets LongCat-Video apart is its innovative use of "conditional frame count" parameters that enable the system to intelligently distinguish between different input tasks while maintaining consistent output quality.

Breakthrough in Long-Form Content Creation

The most remarkable achievement is the model's ability to generate stable, coherent videos lasting up to 5 minutes - a significant advancement over previous systems limited to short clips. This capability addresses persistent challenges in AI video generation:

  • Eliminates color drift across frames
  • Prevents quality degradation over time
  • Maintains consistent character actions and environments

The technological breakthrough holds particular promise for applications requiring extended simulations, such as autonomous driving systems and embodied AI platforms.

Performance Optimization

The development team implemented several innovations to enhance efficiency:

  1. Two-stage coarse-to-fine generation pipeline
  2. Block-sparse attention (BSA) mechanisms
  3. Advanced model distillation techniques These optimizations resulted in a 10.1x improvement in inference speed without compromising output quality.

Benchmark Results and Availability

Rigorous testing demonstrates that LongCat-Video achieves state-of-the-art (SOTA) performance across multiple metrics:

  • Text-to-video alignment accuracy
  • Visual fidelity scores
  • Motion naturalness evaluations

The model has been made publicly available through GitHub and Hugging Face repositories, lowering barriers for both individual creators and enterprise users.

Key Points:

  • First commercial-grade AI capable of generating stable 5-minute videos
  • Combines three generation modes under unified architecture
  • Sets new benchmarks for open-source video generation quality
  • Potential applications span entertainment, education, and industrial simulation

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

Meituan Unveils LongCat-Video Model for 5-Minute AI-Generated Content
News

Meituan Unveils LongCat-Video Model for 5-Minute AI-Generated Content

Meituan has launched LongCat-Video, a groundbreaking AI model capable of generating high-quality, continuous 5-minute videos. Built on Diffusion Transformer architecture, it supports text-to-video, image-to-video, and video continuation tasks without additional adaptation. The model maintains temporal consistency and avoids quality degradation in long-form content.

October 27, 2025
AI-video-generationDiffusionTransformerMeituan-tech
Lightricks Unveils Open-Source AI That Creates Videos With Sound in Seconds
News

Lightricks Unveils Open-Source AI That Creates Videos With Sound in Seconds

Israeli tech firm Lightricks has released LTX-2, an innovative AI system that generates 20-second HD videos with perfectly synced audio from text prompts. Unlike traditional methods, it processes visuals and sound simultaneously using a unique dual-stream architecture. The open-source model outperforms competitors with blazing speed - creating 720p content in just over a second per step.

January 12, 2026
AI-video-generationopen-source-AILightricks
Moonlight AI's Kiwi-do Model Stuns With Visual Physics Prowess
News

Moonlight AI's Kiwi-do Model Stuns With Visual Physics Prowess

Moonshot AI's mysterious new 'Kiwi-do' model has emerged as a potential game-changer in multimodal AI. Showing remarkable capabilities in visual physics comprehension, this freshly spotted model appears ahead of Moonshot's planned K2 series release. Early tests suggest Kiwi-do could revolutionize how AI interprets complex visual data.

January 5, 2026
multimodal-AIcomputer-visionMoonshot-AI
Alibaba's Z-Image Turbocharges AI Art with Surprising Efficiency
News

Alibaba's Z-Image Turbocharges AI Art with Surprising Efficiency

Alibaba's Tongyi Lab has unveiled Z-Image-Turbo, a breakthrough AI image generator that punches above its weight. With just 6 billion parameters - far fewer than competitors - it delivers stunning results in seconds on consumer-grade GPUs. The model handles complex Chinese prompts naturally and produces print-quality images with minimal processing steps. Already climbing human preference rankings, this open-source challenger could reshape the AI art landscape.

November 27, 2025
AI-artgenerative-modelscomputer-vision
News

LTX-2 AI Model Revolutionizes Video Generation with 4K Output

Lightricks unveils LTX-2, a groundbreaking AI video generation model capable of producing 20-second 4K narrative videos with synchronized audio-visual output. The open-source solution runs locally on consumer GPUs and offers unprecedented creative control.

October 31, 2025
AI-video-generationLTX-24K-content
ByteDance, HK Universities Open-Source DreamOmni2 AI Image Editor
News

ByteDance, HK Universities Open-Source DreamOmni2 AI Image Editor

ByteDance and Hong Kong universities have open-sourced DreamOmni2, a breakthrough AI image editing system that understands abstract concepts through multimodal instructions. The technology outperforms existing open-source models and approaches commercial solutions.

October 27, 2025
AI-image-editingmultimodal-AIopen-source-AI