  • Install
  • Insert data
  • Run Website
  • Annotation
  • Patchnotes
  • TODO
  • After manual classification

Install

Jupyter Notebook is required to transform the .tsv files and load them into the database. sqlite3 is part of the Python standard library and needs no separate installation.

pip install requests
pip install bottle
  1. run python create_database.py
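Because sqlite3 ships with Python, it can be checked without installing anything. The sketch below also creates a small table to illustrate what a database setup script might do. The table name `accounts` and its columns are hypothetical and are not taken from create_database.py.

```python
import sqlite3


def create_example_database(path=":memory:"):
    """Create a hypothetical accounts table and return the open connection."""
    conn = sqlite3.connect(path)
    # Assumed schema for illustration only; the real one is defined
    # in create_database.py and may look different.
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS accounts (
            user_id TEXT PRIMARY KEY,
            fetched INTEGER NOT NULL DEFAULT 0
        )
        """
    )
    conn.commit()
    return conn


if __name__ == "__main__":
    conn = create_example_database()
    print("SQLite library version:", sqlite3.sqlite_version)
    conn.close()
```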

Insert data using reddit.com

  1. [optional] using .tsv files

    1. put your .tsv files in /tsv
    2. open the Jupyter notebook transform.ipynb and set the name of your .tsv file
    3. adjust the user.replaces() calls if your data needs it
    4. run the Jupyter notebook transform.ipynb
  2. run fetch_json_reddit_from_database.py (saves the JSON for every account that already exists in your database)

  3. run update_fetched.py (updates the fetch status in your database for the JSON files in /json; not needed if you have already run fetch_json_reddit_from_database.py)
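The status update in step 3 can be pictured as a scan of the JSON directory followed by a database update. This is a minimal sketch, not the code in update_fetched.py: the table `accounts`, the columns `user_id` and `fetched`, and the one-file-per-account naming `<user_id>.json` are all assumptions.

```python
import sqlite3
from pathlib import Path


def mark_fetched(conn, json_dir):
    """Set fetched = 1 for every account that has a JSON file in json_dir.

    Returns the number of database rows that were updated.
    """
    # Assumption: each account is stored as <user_id>.json.
    user_ids = [p.stem for p in Path(json_dir).glob("*.json")]
    updated = 0
    for user_id in user_ids:
        cur = conn.execute(
            "UPDATE accounts SET fetched = 1 WHERE user_id = ?", (user_id,)
        )
        updated += cur.rowcount  # 0 if the file has no matching account
    conn.commit()
    return updated
```

JSON files without a matching row are ignored here, so stray files in the directory do not create new accounts.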

Run website

  1. run python webserver.py
  2. open localhost in your browser

Annotation

  1. click annotate in the menu
  2. click fetch now (yellow button); after the fetch it turns grey (refetching is possible)
  3. type your name in Annotator (the name is pre-filled when you use next entry / add new entry after saving, but it is not saved automatically for later annotations)
    1. if the JSON is empty: the account does not exist
    2. if the JSON contains only the username and all other keys are null: the account is banned
    3. otherwise the account has data
  4. if is duplicate is active, the user_id (account name) appeared more than once in the .tsv file
  • click save
  • after saving you can go to the next entry or add a new entry
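The three JSON cases listed under step 3 can be expressed as a small check on the fetched data. This is a sketch under assumptions: the function name `classify_account`, the returned labels, and the key `name` for the username are hypothetical, and the real JSON layout may differ.

```python
import json


def classify_account(data, username_key="name"):
    """Return 'missing', 'banned', or 'has_data' for a parsed account JSON."""
    if not data:
        return "missing"  # empty JSON: the account does not exist
    # Collect every value except the username (key name is an assumption).
    others = [v for k, v in data.items() if k != username_key]
    if data.get(username_key) is not None and all(v is None for v in others):
        return "banned"  # only the username is set, everything else is null
    return "has_data"


if __name__ == "__main__":
    sample = json.loads('{"name": "example", "created": null}')
    print(classify_account(sample))
```

A check like this could only suggest a label to the annotator; the final classification is still made by hand.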

After manual classification

Your JSON files are in /json and your sqlite3 database file is in /database.