Tencent-Hunyuan/HunyuanVideo
Python
Captured source
source ↗Tencent-Hunyuan/HunyuanVideo
Description: HunyuanVideo: A Systematic Framework For Large Video Generation Model
Language: Python
License: NOASSERTION
Stars: 12190
Forks: 1248
Open issues: 175
Created: 2024-11-28T08:38:31Z
Pushed: 2025-11-21T06:36:57Z
Default branch: main
Fork: no
Archived: no
README:
[中文阅读](./README_zh.md)
HunyuanVideo: A Systematic Framework For Large Video Generation Model
👋 Join our WeChat and Discord
-----
This repo contains PyTorch model definitions, pre-trained weights and inference/sampling code for our paper exploring HunyuanVideo. You can find more visualizations on our project page.
> **HunyuanVideo: A Systematic Framework For Large Video Generation Model**
🔥🔥🔥 News!!
- Nov 21, 2025: 🎉 We release the HunyuanVideo-1.5, a highly efficient and powerful new foundation model.
- May 28, 2025: 💃 We release the HunyuanVideo-Avatar, an audio-driven human animation model based on HunyuanVideo.
- May 09, 2025: 🙆 We release the HunyuanCustom, a multimodal-driven architecture for customized video generation based on HunyuanVideo.
- Mar 06, 2025: 🌅 We release the HunyuanVideo-I2V, an image-to-video model based on HunyuanVideo.
- Jan 13, 2025: 📈 We release the Penguin Video Benchmark.
- Dec 18, 2024: 🏃♂️ We release the FP8 model weights of HunyuanVideo to save more GPU memory.
- Dec 17, 2024: 🤗 HunyuanVideo has been integrated into Diffusers.
- Dec 7, 2024: 🚀 We release the parallel inference code for HunyuanVideo powered by xDiT.
- Dec 3, 2024: 👋 We release the inference code and model weights of HunyuanVideo. Download.
🎥 Demo
🧩 Community Contributions
If you develop/use HunyuanVideo in your projects, welcome to let us know.
- ComfyUI-Kijai (FP8 Inference, V2V and IP2V Generation): ComfyUI-HunyuanVideoWrapper by Kijai
- ComfyUI-Native (Native Support): ComfyUI-HunyuanVideo by ComfyUI Official
- FastVideo (Consistency Distilled Model and Sliding Tile Attention): FastVideo and Sliding Tile Attention by Hao AI Lab
- HunyuanVideo-gguf (GGUF Version and Quantization): HunyuanVideo-gguf by city96
- Enhance-A-Video (Better Generated Video for Free): Enhance-A-Video by NUS-HPC-AI-Lab
- TeaCache (Cache-based Accelerate): TeaCache by Feng Liu
- HunyuanVideoGP (GPU Poor version): HunyuanVideoGP by DeepBeepMeep
- RIFLEx (Video Length Extrapolation): RIFLEx by Tsinghua University
- HunyuanVideo Keyframe Control Lora: hunyuan-video-keyframe-control-lora by dashtoon
- Sparse-VideoGen (Accelerate Video Generation with High Pixel-level Fidelity): Sparse-VideoGen by University of California, Berkeley
- FramePack (Packing Input Frame Context in Next-Frame Prediction Models for Video Generation): FramePack by Lvmin Zhang
- Jenga (Training-Free Efficient Video Generation via Dynamic Token Carving): Jenga by DV Lab
- DCM (Dual-Expert Consistency Model for Efficient and High-Quality Video Generation): DCM by Vchitect
📑 Open-source Plan
- HunyuanVideo (Text-to-Video Model)
- [x] Inference
- [x] Checkpoints
- [x] Multi-gpus Sequence Parallel inference (Faster inference speed on more gpus)
- [x] Web Demo (Gradio)
- [x] Diffusers
- [x] FP8 Quantified weight
- [x] Penguin Video Benchmark
- [x] ComfyUI
- HunyuanVideo (Image-to-Video Model)
- [X] Inference
- [X] Checkpoints
Contents
- [HunyuanVideo: A Systematic Framework For Large Video Generation Model](#hunyuanvideo-a-systematic-framework-for-large-video-generation-model)
- [🎥 Demo](#-demo)
- [🔥🔥🔥 News!!](#-news)
- [🧩 Community Contributions](#-community-contributions)
- [📑 Open-source Plan](#-open-source-plan)
- [Contents](#contents)
- [Abstract](#abstract)
- [HunyuanVideo Overall Architecture](#hunyuanvideo-overall-architecture)
- [🎉 HunyuanVideo Key Features](#-hunyuanvideo-key-features)
- [Unified Image and Video Generative Architecture](#unified-image-and-video-generative-architecture)
- [MLLM Text Encoder](#mllm-text-encoder)
- [3D VAE](#3d-vae)
- [Prompt Rewrite](#prompt-rewrite)
- [📈 Comparisons](#-comparisons)
- [📜 Requirements](#-requirements)
- [🛠️ Dependencies and Installation](#️-dependencies-and-installation)
- [Installation Guide for Linux](#installation-guide-for-linux)
- [🧱 Download Pretrained Models](#-download-pretrained-models)
- [🔑 Single-gpu Inference](#-single-gpu-inference)
- [Using Command Line](#using-command-line)
- [Run a Gradio Server](#run-a-gradio-server)
- [More Configurations](#more-configurations)
- [🚀 Parallel Inference on Multiple GPUs by xDiT](#-parallel-inference-on-multiple-gpus-by-xdit)
- [Using Command Line](#using-command-line-1)
- [🚀 FP8…
Excerpt shown — open the source for the full document.
Notability
notability 7.0/10New video model with high community traction.