vidAIo

vidAIo is video search engine made by combining natural language processing/computer vision to analyze the audio/frames.

I wanted to combine as many AI technologies as I could think of. Videos offer a powerful opportunity as they offer up both audio, which can be transcribed to text, and analyzed and indexed by natural language processing (NLP) algorithms, and frames, which often contain a myriad of objects, and can be analyzed and indexed by computer vision (CV) algorithms. On the NLP side, there are algorithms for keyword extraction (graphical model), named entity extraction (nltk chunking), topic modeling (LDA with gensim), and summarization (lexrank). On the CV side, there is a neural network (built on Torch7) trained on the Cifar10 dataset, achieving 57% accuracy (not bad for about an hour of training on a laptop), and a facial recognition system (PCA+SVM, built on scikit-learn). The NLP systems tend to work much better than the CV systems. The data is stored in MongoDB and there's a simple web interface built with Flask.

Note: this is hackathon level code with a lot of dependencies. Enjoy!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
app		app
.gitignore		.gitignore
README.md		README.md
config.py		config.py
config.pyc		config.pyc
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

vidAIo

About

Releases

Packages

Languages

benglard/vidAIo

Folders and files

Latest commit

History

Repository files navigation

vidAIo

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages