A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation
This is the repository that contains the source code for the GPT4Video project page.
If you use GPT4Video in your project, please kindly cite:
@articles{wang2023gpt4video,
title={GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation},
author={Zhanyu Wang, Longyue Wang, Minghao Wu, Zhen Zhao, Chenyang Lyu, Huayang Li, Deng Cai, Luping Zhou, Shuming Shi, Zhaopeng Tu},
journal = {CoRR},
year={2023}
}
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
