hailuo-02

Hailuo 02

Hailuo 02 is a cutting-edge video generation model developed by MiniMax, utilizing a proprietary Noise-aware Compute Redistribution (NCR) architecture.

Hailuo 02

Major Upgrades

Powered by the new NCR architecture, Hailuo 02 achieves a breakthrough in simulating complex physical interactions (fluids, cloth, collisions) with significantly reduced "hallucinations" compared to its predecessor.

Internal benchmarks report a 30% improvement in Fréchet Video Distance (FVD) scores, translating to sharper, more coherent visuals that closely mimic real-world footage.

The introduction of the S2V-01 model allows for robust subject referencing, ensuring characters maintain their identity and appearance across different shots and angles.

Model Details

PublisherMiniMax
Open StatusClosed Source
Model ParameterNot Disclosed
MultimodalT2V, I2V, S2V
Including ModelsHailuo 02 Standard, Hailuo 02 Pro
Output Aspect Ratio16:9, 9:16, 1:1
Output Resolution720p, 1080p
Output Duration6s, 10s
Output Frame Rate24fps, 30fps

Summary

Hailuo 02 has rapidly ascended the ranks to become a formidable competitor to Sora 2, particularly noted for its "best-in-class" motion smoothness and subject consistency. Its unique NCR architecture delivers exceptional physical realism, making it a favorite for scenes requiring complex dynamics. While its clip duration is currently shorter than some rivals, the quality of those seconds is often regarded as superior, offering a "cinematic" feel that few others can match.

Key Features

Advanced prompt understanding allows users to execute specific camera movements (pan, tilt, zoom) with precision, mimicking professional cinematography.

An integrated tool that automatically refines user prompts to better align with the model's capabilities, ensuring higher quality outputs for less experienced users.

Optimized to deliver high-definition results at a lower computational cost than many competitors, making it an attractive option for high-volume production.

The ability to upload a reference image for a character and have the model faithfully animate that specific subject in new scenarios.

Video Showcases

animal
unusual activity

Dogs are the players at The World Series Of Poker and they are drinking big bowls of water very sloppily and splashing water on the cards and on the felt of the poker table, one dog poker player is tilting their head sideways in confusion.

camera motion
human - activity

A low-angle shot of a dancer leaping gracefully into the air, making their movement appear even more dynamic and powerful.

unusual subject
high motion level

A giant humanoid, made of fluffy blue cotton candy, stomping on the ground, and roaring to the sky, clear blue sky behind them.

scene
camera motion

A drone camera circles around a beautiful historic church built on a rocky outcropping along the Amalfi Coast, the view showcases historic and magnificent architectural details and tiered pathways and patios, waves are seen crashing against the rocks below as the view overlooks the horizon of the coastal waters and hilly landscapes of the Amalfi Coast Italy, several distant people are seen walking and enjoying vistas on patios of the dramatic ocean views, the warm glow of the afternoon sun creates a magical and romantic feeling to the scene, the view is stunning captured with beautiful photography.

Performance Metrics

Hailuo 02 Model Capability Assessment (Dec 20, 2025)

Hailuo 02
Radar chart showing model performance metrics20406080100SubjectConsistencyTemporalConsistencyAesthetic &Image QualityDynamics &MotionFidelityVisualQualitySemanticAlignment

Hailuo 02 Metrics Bar Charts by Dimension

Visual Quality

Visual Quality Metrics

PSNR
57.3
SSIM
63.0
LPIPS
41.4
FVD
49.5
Inception Score (IS)
49.5
020406080100

Score (normalized)

Temporal Consistency

Temporal Consistency Metrics

Temporal Warping Error
48.0
Optical Flow Consistency
63.0
Temporal Flicker Score
54.0
Long-term Consistency Tracking
51.1
Motion Smoothness
63.0
020406080100

Score (normalized)

Semantic Alignment

Semantic Alignment Metrics

CLIP Score
42.0
Tag2Text / UMT / GRiT
49.5
Semantic Accuracy
51.8
020406080100

Score (normalized)

Subject Consistency

Subject Consistency Metrics

DINO Feature Similarity
47.3
Object Identity Tracking
48.0
Multiple Object Consistency
60.8
020406080100

Score (normalized)

Aesthetic & Image Quality

Aesthetic & Image Quality Metrics

LAION Aesthetic Predictor
48.0
MUSIQ Score
57.0
Color/Texture Consistency
69.0
Human-Opinion MOS
63.0
020406080100

Score (normalized)

Dynamics & Motion

Dynamics & Motion Metrics

Action Recognition Accuracy
48.0
Dynamics Controllability
48.0
Motion Diversity Score
72.6
Physical Realism Score
63.0
020406080100

Score (normalized)

Service Providers

H

HailuoAI.com

The official web interface for MiniMax's video models, offering a streamlined experience for creators.

API Providers

M

MiniMax API

Direct API access from the developer, offering the most up-to-date model versions and features.

People Also Ask

Hailuo AI (by MiniMax) operates on a freemium model. New users typically receive free daily credits or an initial trial allowance (e.g., 200 credits or a few daily generations) to test the service without payment. However, once these free credits are exhausted, you must purchase a subscription or credit pack to continue generating videos. The free tier is often slower (lower priority queue) and includes watermarks on the output.

Hailuo AI is generally considered safe for standard consumer use. The platform enforces content moderation policies that filter out illegal, explicit (NSFW), or violent content to comply with safety regulations. Regarding data privacy, Hailuo states that it encrypts user data and complies with major privacy laws (like GDPR/CCPA), though as with any cloud AI service, users should avoid uploading highly sensitive or confidential personal information.

Hailuo AI uses an advanced Video-to-Video and Text-to-Video diffusion model (specifically the MiniMax Video-01/02 architecture). It processes your text prompt or input image, "imagines" the motion based on its training on millions of video clips, and generates new frames that maintain temporal consistency (smooth motion) and subject identity. It excels at following complex camera instructions (like "zoom in" or "pan right") and simulating realistic physics.

Hailuo AI offers several pricing tiers: 1.Free Plan: $0/month (limited daily credits, watermarked, slower). 2.Standard Plan: Approximately $9.99/month. This typically includes around 1,000 credits, faster generation speeds, and watermark-free downloads. 3.Unlimited Plans: Higher tiers range from $35 to $100+ per month, offering more credits (e.g., 4,500+), concurrent generation (running multiple tasks at once), and highest priority processing.

Currently, Hailuo AI generates short clips (typically 4-6 seconds) by default. To make them longer: 1.Image-to-Video Extension: Take the last frame of your generated video (using a screenshot or video editor) and use it as the "Start Image" input for a new generation prompt. 2.Video Editor Stitching: Repeat this process multiple times and then stitch the clips together in a video editor (like CapCut or Premiere) to create a seamless longer sequence. 3.Platform Tools: Some third-party platforms integrating Hailuo (or future updates to the official site) may offer a native "Extend" button that automates this "end-frame-to-start-frame" process.

References