This is the code repository for my bachelor's graduation thesis. You can view my thesis via this link.
This project was once trained on The University of Sydney's HPC Artemis, which had a very old version of PyTorch and other frameworks. If you are under similar circumstances, please follow these steps:
Run under NVCC 10.0.130.
$ conda create -n AFI pip python=3.7
$ conda activate AFI
$ conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit=10.0 -c pytorch
$ conda install scipy opencv
$ pip install einops easydict
$ pip install torch-scatter -f https://data.pyg.org/whl/torch-1.4.0+cu100.html
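Once the environment resolves, a quick import check (nothing repo-specific) confirms the pinned stack:
# Sanity check for the legacy stack: PyTorch 1.4.0 + CUDA 10.0 + torch-scatter.
import torch
import torch_scatter

print(torch.__version__)          # expected: 1.4.0
print(torch.version.cuda)         # expected: 10.0
print(torch.cuda.is_available())  # True if the NVIDIA driver is compatible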
skimage.measure.compare_psnr() and skimage.measure.compare_ssim() were replaced by skimage.metrics.peak_signal_noise_ratio() and skimage.metrics.structural_similarity() (ref.: scikit-image 0.16.1 official docs). Install a lower version to avoid modifying the code:
$ conda install scikit-image==0.14.3 -c conda-forge
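If you would rather modify the code than pin scikit-image, a small compatibility shim (my sketch, not something shipped in this repo) covers both API generations:
# Hypothetical compatibility shim for the scikit-image 0.16 metric renames.
# Old code can keep calling compare_psnr()/compare_ssim() unchanged.
try:
    from skimage.metrics import peak_signal_noise_ratio as compare_psnr
    from skimage.metrics import structural_similarity as compare_ssim
except ImportError:
    # scikit-image < 0.16 still ships the old names
    from skimage.measure import compare_psnr, compare_ssim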
If you get:
ImportError: cannot import name 'PILLOW_VERSION' from 'PIL'
(torchvision 0.5.0 imports PILLOW_VERSION, which Pillow 7.0 removed), pin an older Pillow:
$ conda install pillow=6.1
Otherwise, just follow these steps:
$ conda create -n AFI pip python=3.10
$ conda install pytorch==1.12.1 cudatoolkit=11.3 torchvision==0.13.1 -c pytorch
$ conda install scipy scikit-image
$ conda install pytorch-scatter -c pyg
$ pip install einops easydict
A very recent bug: installing OpenCV through conda silently downgrades PyTorch to the CPU-only build, so please use pip instead:
$ pip install opencv-python
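After installing OpenCV, it is worth verifying that the CUDA build of PyTorch survived:
# Confirm OpenCV installed and PyTorch is still the CUDA build.
import cv2
import torch

print(cv2.__version__)
print(torch.__version__)          # should not carry a "+cpu" suffix
print(torch.cuda.is_available())  # expected: True on a CUDA machine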
Click the links below to download the corresponding datasets:
- QVI960
- ATD-12K
- ATD-12K Training Set SGM (The link is temporarily unavailable.)
You can also generate the SGM flows yourself. The training SGM flows are extracted to datasets/atd-12k, so the final hierarchy will look like this:
datasets
+--- QVI-960
+--- atd-12k
+--- train_10k_pre_calc_sgm_flows
+--- test_2k_540p
+--- ...
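After extraction, a tiny helper (hypothetical; the paths simply mirror the tree above) can confirm the layout before training:
# Hypothetical layout check; paths mirror the expected hierarchy above.
from pathlib import Path

root = Path("datasets")
for sub in ["QVI-960",
            "atd-12k/train_10k_pre_calc_sgm_flows",
            "atd-12k/test_2k_540p"]:
    p = root / sub
    print(f"{p}: {'ok' if p.is_dir() else 'MISSING'}")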
The qualitative and quantitative results in the paper were based on a model that had not been fully trained. I have since trained the model to completion on a single RTX 3090. All the weights are provided in the checkpoints folder:
- animeinterp+gma.pth: the initial weights of the model, combining pretrained weights from AnimeInterp and GMA. This is the starting point of the training.
- QVI960/199.pth: the synthesis module of the model is fine-tuned on the QVI-960 dataset for 200 epochs.
- ATD12K/49.pth: starting from QVI960/199.pth, the whole model is trained with L1 loss for 50 epochs.
- Style/49.pth: starting from QVI960/199.pth, the whole model is trained directly with the Style loss for 50 epochs.
- ATD12K-Style: starting from ATD12K/49.pth, the whole model is trained with the Style loss for another 50 epochs.
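The .pth files are presumably ordinary torch.save artifacts; a minimal sketch for peeking inside one (the internal key layout is an assumption, since it depends on what the training scripts saved):
# Minimal sketch: inspect a provided checkpoint on CPU. The dict layout is an
# assumption -- it depends on what the training scripts passed to torch.save().
import torch

state = torch.load("checkpoints/QVI960/199.pth", map_location="cpu")
if isinstance(state, dict):
    for key in list(state)[:5]:  # peek at the first few entries
        print(key)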
Run the command below to start training on the QVI-960 dataset:
$ python QVI960_train.py
Run the command below to start training on the ATD-12K dataset using L1 loss:
$ python ATD12K_train.py
Run the command below to start training on the ATD-12K dataset using Style loss:
$ python ATD12K_train_style.py
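These runs take a long time on a single GPU; a common pattern (plain shell, nothing repo-specific) is to background them and log to a file:
$ nohup python QVI960_train.py > qvi960_train.log 2>&1 &
$ tail -f qvi960_train.log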
Modify configs/config_test_w_sgm.py to specify the configuration of the evaluation process, then run:
$ python test_anime_sequence_one_by_one.py configs/config_test_w_sgm.py
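The config is a plain Python module; the exact fields depend on the shipped file, so the names below are illustrative assumptions only:
# Hypothetical excerpt of configs/config_test_w_sgm.py -- the field names are
# illustrative assumptions; check the shipped config for the real keys.
testset_root = 'datasets/atd-12k/test_2k_540p'
checkpoint = 'checkpoints/QVI960/199.pth'
store_path = 'outputs/'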
References:
- Deep Animation Video Interpolation in the Wild (AnimeInterp)
- Learning to Estimate Hidden Motions with Global Motion Aggregation (GMA)