OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Updated Dec 11, 2024
Python

HuaizhengZhang / Awsome-Deep-Learning-for-Video-Analysis

Papers, code and datasets about deep learning and multi-modal learning for video analysis

machine-learning deep-learning paper video-classification video-analysis multimodal-learning video-dataset

Updated Oct 10, 2021

OpenDriveLab / DriveAGI

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System

autonomous-driving large-dataset general-artificial-intelligence video-generation world-models video-dataset embodied-ai policy-learning foundation-model

Updated Nov 9, 2024
Python

RaivoKoot / Video-Dataset-Loading-Pytorch

Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.

machine-learning deep-learning pytorch videos dataloader action-recognition video-dataset

Updated Jan 18, 2023
Python

ttengwang / Awesome_Long_Form_Video_Understanding

Awesome papers & datasets specifically focused on long-term videos.

video-representation-learning video-dataset dense-video-captioning video-grounding temporal-action-detection temporal-action-localization temporal-sentence-grounding audio-visual-event-localization long-term-video video-large-language-models video-llms

Updated Nov 15, 2024

yuanxiaosc / Multimodal-short-video-dataset-and-baseline-classification-model

A dataset of 500,000 multimodal short videos with baseline models (TensorFlow 2.0).

tensorflow-models classification-model multimodal-datasets video-dataset

Updated Jul 23, 2019
Jupyter Notebook

jssprz / video_captioning_datasets

Summary of Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*

review video-captioning state-of-the-art vision-and-language charades video-to-text msvd video-dataset video-description activitynet-captions trecvid tgif-dataset msr-vtt vatex

Updated Oct 27, 2023
Jupyter Notebook

ascuet / SoccerAct10

SoccerAct10 is a dataset of 10 different soccer actions, built from YouTube videos.

action-recognition video-classification video-action-recognition fine-grained-classification video-dataset sports-analysis football-dataset action-recognition-dataset sports-classification sports-recognition-dataset soccer-video-classification sports-video-classification soccer-activity-classification keras-action-recognition pytorch-action-recognition football-action-classification football-action-recognition

Updated Apr 21, 2023

AlexanderMelde / SPHAR-Dataset

Surveillance Perspective Human Action Recognition Dataset: 7759 videos from 14 action classes, aggregated from multiple sources, all cropped spatio-temporally and filmed from a surveillance-camera-like position.

machine-learning cctv surveillance-systems artificial-intelligence dataset human-activity-recognition action-recognition video-classification human-action-recognition video-prediction video-dataset cctv-detection

Updated Sep 28, 2020
Python

YuxinZhaozyx / pytorch-VideoDataset

Tools for loading video datasets and applying video transforms in PyTorch. Video files can be loaded directly, without preprocessing.

video pytorch dataset preprocessing transforms video-dataset

Updated Jul 5, 2022
Python

epic-kitchens / epic-kitchens-55-starter-kit-action-recognition

🌱 Starter kit for working with the EPIC-KITCHENS-55 dataset for action recognition or anticipation

deep-learning jupyter notebook pandas action-recogntion epic-kitchens gulpio video-dataset

Updated Jun 22, 2020
Jupyter Notebook

17Skye17 / VideoLT

Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)

video-classification video-dataset long-tailed-recognition

Updated Apr 9, 2022
Python

LIUTIGHE / BSCV-Dataset

Official repository for the paper "Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method", accepted to the NeurIPS 2023 Datasets and Benchmarks Track

computer-vision image-processing video-processing video-inpainting low-level-vision video-dataset video-restoration

Updated Jul 19, 2024
Python

innat / VideoSwin

Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling

tensorflow keras torch video-classification video-dataset

Updated Apr 3, 2024
Jupyter Notebook

eric-ai-lab / MMWorld

Official repo of the paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"

evaluation video-understanding video-dataset multi-disciplinary multimodal-large-language-models world-model

Updated Sep 21, 2024
Python

pritamqu / AVCAffe

[AAAI 2023] AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work

dataset audiovisual emotion-recognition video-dataset audio-dataset cognitive-load audiovisual-dataset

Updated Aug 23, 2023
Python

innat / VideoMAE

[NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

tensorflow keras torch video-classification jax video-dataset videomae

Updated Jan 19, 2024
Jupyter Notebook

danielchyeh / this-is-my

Official This-Is-My Dataset published in CVPR 2023

personalization clip vision-and-language video-dataset video-retrieval

Updated Jul 18, 2024
Python

richardtml / DIViTA

Improving Transfer Learning with a Dual Image and Video Transformer for Multi-label Movie Trailer Genre Classification

transformer convolutional-neural-networks transfer-learning video-understanding movie-trailers video-dataset

Updated Mar 29, 2023
Python

AlexanderMelde / S-SPHAR-Dataset

Synthetically Generated Surveillance Perspective Human Action Recognition Dataset: 6901 videos from 10 action classes, generated by a 3D simulation, all cropped spatio-temporally and filmed from a surveillance-camera-like position.

machine-learning cctv surveillance-systems artificial-intelligence dataset human-activity-recognition action-recognition video-classification human-action-recognition video-prediction video-dataset cctv-detection surveillance-perspective

Updated Sep 28, 2020

video-dataset

Here are 36 public repositories matching this topic...
