This repository was developed for the code and data behind the story: Una constelaciΓ³n de escritoras latinoamericanas (nacidas en el siglo XX).
The analysis uses web scrapping of Wikipedia entries for Latin American women writers and network graph visualization in order to create a web application.
βββ app.py # Streamlit app file
βββ assets # Resources for the project
βΒ Β βββ datacritica
βΒ Β βββ imgs
βΒ Β βββ imgs_processed
βΒ Β βββ mosaics
βΒ Β βββ targets
βΒ Β βββ targets_processed
βββ data # Categorized data
βΒ Β βββ processed # Cleaned data
βΒ Β βΒ Β βββ escritoras_wiki.csv
βΒ Β βΒ Β βββ escritores_destacados.csv
βΒ Β βββ raw # Original data
βΒ Β βββ escritoras.csv
βββ Dockerfile # Commands to build a docker image
βββ docs # Explanatory materials
βΒ Β βββ data-dictionary.md # Information about the data
βΒ Β βββ references # Papers, manuals, articles, etc.
βββ escritoras_latinas # Python package
βΒ Β βββ data # Functions to manipulate data
βΒ Β βΒ Β βββ analyze.py # Module to analyze data
βΒ Β βΒ Β βββ export.py # Module to save exports
βΒ Β βΒ Β βββ load.py # Module to load data and paths
βΒ Β βΒ Β βββ process.py # Module to process data
βΒ Β βββ utils # Functions to make common patterns
βΒ Β βββ paths.py # Module to generate relative paths
βββ LICENSE # Project license
βββ notebooks # Jupyter notebooks
βΒ Β βββ 0.0-scrapping-text.ipynb
βΒ Β βββ 0.1-scrapping-text.ipynb
βΒ Β βββ 0.2-scrapping-images.ipynb
βΒ Β βββ 1.0-annotate-data.ipynb
βΒ Β βββ 1.1-process-images.ipynb
βΒ Β βββ 2.0-visualize-network.ipynb
βΒ Β βββ 2.1-visualize-network.ipynb
βΒ Β βββ 2.2-visualize-donut-chart.ipynb
βββ outputs # Exports generated by notebooks
βΒ Β βββ figures # Generated graphics, maps, etc.
βΒ Β βΒ Β βββ index.html
| βββ networks # Generated graph network
βΒ Β βΒ Β βββ index.html
βΒ Β βββ tables # Generated pivot tables
βΒ Β βββ LICENSE
βΒ Β βββ photomosaics
βΒ Β βΒ Β βββ photomosaics.py
βΒ Β βΒ Β βββ run.py
βΒ Β βΒ Β βββ scrape.py
βΒ Β βββ README.md
βΒ Β βββ requirements.txt
βββ Pipfile # Project dependencies
βββ Pipfile.lock # Specific versions of packages on Pipfile
βββ README.md # Top-level README for this project
βββ README-ES.md # README in Spanish
βββ requirements.txt # Project dependencies
βββ setup.py # Import project as a python module
βββ style.css # Styles for streamlit app
This project is released under MIT License.
This repository was generated with cookiecutter using a data-journalism template for python.