Skip to content

Latest commit

 

History

History
35 lines (25 loc) · 1.4 KB

README.md

File metadata and controls

35 lines (25 loc) · 1.4 KB

tweetcoder

  • Alexander Biggs
  • Francisco Canas
  • Sasa Milics

What be this?

A very work-in-progress twitter trends visualization device using autoencoders.

What does this need?

What are these files?

src/jtotext.py -- Script used to produce plain text, line-return separated list of tweet text from a json file dumped by the db we are using.

src/preprocessory.py -- Script used to extract feature vectors from a file containing the body text from tweets. Will use two separate and mutually exclusive methods: "Bag of Words" and "per Character" feature extraction.

src/autoencoder/autoencoder.py -- The main code file for the autoencoder. This file will contain the code that trains RBMs, links them together, and encodes input vectors into tinier input vectors.

src/autoencoder/deeplearning/rbm.py -- The Restricted Boltzmann Machine class that is a building block for the autoencoder.