GitHub - gulucaptain/videoassembler: Official project of paper "MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing"

MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing

Haoyu Zhao · Tianyi Lu · Jiaxi Gu · Xing Zhang · Qingping Zheng · Zuxuan Wu · Hang Xu · Yu-Gang Jiang

Fudan University | Huawei Noah's Ark Lab

📢 News

[2024.12.22] Release inference code. We are working to improve MagDiff, stay tuned!
[2024.07.04] Our paper has been accepted by the 18th European Conference on Computer Vision (ECCV) 2024.
[2023.11.29] Release first paper version on Arxiv.

🏃‍♂️ Getting Started

Download the pretrained base models for StableDiffusion V2.1.

Download our MagDiff checkpoints.

Please follow the huggingface download instructions to download the above models and checkpoints.

Below is an example structure of these model files.

assets/
├── MagDiff.pth
└── stable-diffusion-2-1-base/
    ├── scheduler/...
    ├── text_encoder/...
    ├── tokenizer/...
    ├── unet/...
    ├── vae/...
    ├── ...
    └── README.md

⚒️ Installation

prerequisites: python>=3.10, CUDA>=11.8.

Install with pip:

pip3 install -r requirements.txt

💃 Inference

Run inference on single GPU:

bash inference.sh

🎓 Citation

If you find this codebase useful for your research, please use the following entry.

@inproceedings{zhao2024magdiff,
    author    = {Zhao, Haoyu and Lu, Tianyi and Gu, Jiaxi and Zhang, Xing and Zheng, Qingping and Wu, Zuxuan and Xu, Hang and Jiang Yu-Gang},
    title     = {MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing},
    booktitle = {European Conference on Computer Vision},
    year      = {2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets/teaser		assets/teaser
callbacks		callbacks
configs		configs
data		data
model		model
scripts		scripts
utils		utils
README.md		README.md
inference.sh		inference.sh
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing

📢 News

🏃‍♂️ Getting Started

⚒️ Installation

💃 Inference

🎓 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

gulucaptain/videoassembler

Folders and files

Latest commit

History

Repository files navigation

MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing

📢 News

🏃‍♂️ Getting Started

⚒️ Installation

💃 Inference

🎓 Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages