Skip to content

raphiniert/AnarchismAndGender

Repository files navigation

Anarchism and gender

This project is a tool to analyse short text corpora, namely historical Austrian newspaper articles. Newspapers are, amongst other file formats, available as text files via the ANNO tool (see http://anno.onb.ac.at) provided by the Austrian National Library. The project is part of an undergraduate seminar (see https://ufind.univie.ac.at/de/course.html?lv=070137&semester=2020W) at the University of Vienna. The seminar deals with anarchism and gender.

prerequisites

Open a terminal and check your current Python 3 version by running:

python3 -V
Python 3.7.3  # 3.8.X is also fine

project setup

download project

Open a terminal, clone the project, enter the project folder and create the log directory:

git clone https://github.com/raphiniert/AnarchismAndGender.git
cd AnarchismAndGender

Copy the previously downloaded file chromedrive into the bin folder.

setup virtual environment and install dependencies

Enter the project folder an execute following steps:

python3 -m venv venv
. venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
python -m spacy download de_core_news_lg

Verify if everything worked by running:

python crawl.py -h

If you see the following output, you're good to go.

usage: crawl.py [-h] [--verbose] [--no-headless] [--skip-issue-crawling] [--update] [--skip-db-issues]

optional arguments:
  -h, --help            show this help message and exit
  --verbose             increase output verbosity
  --no-headless         don't run chrome headless
  --skip-issue-crawling
                        don't crawl issues and use link list instead
  --update              update db entry if exists
  --skip-db-issues      skip issue crawling if issue is already in db

TODO

troubleshooting

Below you find common errors and possible solutions to fix them.

SyntaxError: invalid syntax or ModuleNotFoundError: No module named 'selenium'

Make sure your virtual environment is enabled. You can enable it by running:

. venv/bin/activate

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages