Repo · Tencent Hunyuan · published Nov 28, 2024

Tencent-Hunyuan/HunyuanVideo

Description: HunyuanVideo: A Systematic Framework For Large Video Generation Model

Language: Python

License: NOASSERTION

Stars: 12190

Forks: 1248

Open issues: 175

Created: 2024-11-28T08:38:31Z

Pushed: 2025-11-21T06:36:57Z

Default branch: main

Fork: no

Archived: no

README:

[Read in Chinese](./README_zh.md)

HunyuanVideo: A Systematic Framework For Large Video Generation Model

👋 Join our WeChat and Discord

-----

This repo contains PyTorch model definitions, pre-trained weights and inference/sampling code for our paper exploring HunyuanVideo. You can find more visualizations on our project page.

> **HunyuanVideo: A Systematic Framework For Large Video Generation Model**

🔥🔥🔥 News!!

  • Nov 21, 2025: 🎉 We released HunyuanVideo-1.5, a highly efficient and powerful new foundation model.
  • May 28, 2025: 💃 We released HunyuanVideo-Avatar, an audio-driven human animation model based on HunyuanVideo.
  • May 09, 2025: 🙆 We released HunyuanCustom, a multimodal-driven architecture for customized video generation based on HunyuanVideo.
  • Mar 06, 2025: 🌅 We released HunyuanVideo-I2V, an image-to-video model based on HunyuanVideo.
  • Jan 13, 2025: 📈 We released the Penguin Video Benchmark.
  • Dec 18, 2024: 🏃‍♂️ We released FP8 model weights for HunyuanVideo to reduce GPU memory usage.
  • Dec 17, 2024: 🤗 HunyuanVideo was integrated into Diffusers.
  • Dec 07, 2024: 🚀 We released the parallel inference code for HunyuanVideo, powered by xDiT.
  • Dec 03, 2024: 👋 We released the inference code and model weights of HunyuanVideo. Download.

🎥 Demo

🧩 Community Contributions

If you develop or use HunyuanVideo in your projects, please let us know.

📑 Open-source Plan

  • HunyuanVideo (Text-to-Video Model)
    • [x] Inference
    • [x] Checkpoints
    • [x] Multi-GPU sequence-parallel inference (faster inference on more GPUs)
    • [x] Web Demo (Gradio)
    • [x] Diffusers
    • [x] FP8 quantized weights
    • [x] Penguin Video Benchmark
    • [x] ComfyUI
  • HunyuanVideo (Image-to-Video Model)
    • [x] Inference
    • [x] Checkpoints

Contents

  • [HunyuanVideo: A Systematic Framework For Large Video Generation Model](#hunyuanvideo-a-systematic-framework-for-large-video-generation-model)
  • [🎥 Demo](#-demo)
  • [🔥🔥🔥 News!!](#-news)
  • [🧩 Community Contributions](#-community-contributions)
  • [📑 Open-source Plan](#-open-source-plan)
  • [Contents](#contents)
  • [Abstract](#abstract)
  • [HunyuanVideo Overall Architecture](#hunyuanvideo-overall-architecture)
  • [🎉 HunyuanVideo Key Features](#-hunyuanvideo-key-features)
  • [Unified Image and Video Generative Architecture](#unified-image-and-video-generative-architecture)
  • [MLLM Text Encoder](#mllm-text-encoder)
  • [3D VAE](#3d-vae)
  • [Prompt Rewrite](#prompt-rewrite)
  • [📈 Comparisons](#-comparisons)
  • [📜 Requirements](#-requirements)
  • [🛠️ Dependencies and Installation](#️-dependencies-and-installation)
  • [Installation Guide for Linux](#installation-guide-for-linux)
  • [🧱 Download Pretrained Models](#-download-pretrained-models)
  • [🔑 Single-gpu Inference](#-single-gpu-inference)
  • [Using Command Line](#using-command-line)
  • [Run a Gradio Server](#run-a-gradio-server)
  • [More Configurations](#more-configurations)
  • [🚀 Parallel Inference on Multiple GPUs by xDiT](#-parallel-inference-on-multiple-gpus-by-xdit)
  • [Using Command Line](#using-command-line-1)
  • [🚀 FP8…


Notability

notability 7.0/10

New video model with high community traction.