11785 Project: Action recognition to improve sound event classification
This project consists of two tasks.
The first task is to train the two-tower model. The corresponding code is in the 'two_tower_training' branch.
The second task is the downstream task: ESC-50 sound event classification.
We have provided Colab notebooks in both branches so that you can run the two tasks.
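
For the downstream task, it may help to know that ESC-50 clips are named '{fold}-{clip_id}-{take}-{target}.wav', so the predefined fold and the class label can be read straight from the file name. The sketch below is only an illustration of that naming scheme: the helper names (Esc50Clip, parse_esc50_filename, split_by_fold) are hypothetical and are not taken from this repository, and holding out one fold for evaluation is an assumption about how the data might be split, not a description of what the notebooks do.

```python
from typing import Iterable, List, NamedTuple, Tuple


class Esc50Clip(NamedTuple):
    """Fields encoded in an ESC-50 file name (hypothetical helper type)."""
    fold: int      # predefined cross-validation fold, 1 to 5
    clip_id: str   # ID of the original source clip
    take: str      # letter distinguishing fragments of the same source clip
    target: int    # class label, 0 to 49


def parse_esc50_filename(filename: str) -> Esc50Clip:
    """Parse '{fold}-{clip_id}-{take}-{target}.wav' into its four fields."""
    name = filename.rsplit("/", 1)[-1]  # drop any leading directory
    if not name.endswith(".wav"):
        raise ValueError(f"not a .wav file: {filename}")
    fold, clip_id, take, target = name[:-len(".wav")].split("-")
    return Esc50Clip(int(fold), clip_id, take, int(target))


def split_by_fold(filenames: Iterable[str], test_fold: int) -> Tuple[List[str], List[str]]:
    """Hold out one fold for evaluation and keep the rest for training (assumed protocol)."""
    train, test = [], []
    for name in filenames:
        if parse_esc50_filename(name).fold == test_fold:
            test.append(name)
        else:
            train.append(name)
    return train, test


if __name__ == "__main__":
    files = ["1-100032-A-0.wav", "2-100786-A-1.wav", "5-9032-A-0.wav"]
    train_files, test_files = split_by_fold(files, test_fold=1)
    print("train:", train_files)
    print("test:", test_files)
```

Splitting on the predefined fold, rather than shuffling clips at random, keeps fragments cut from the same source recording on the same side of the split.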