Automagically export and upconvert data from Transkribus collections into TEI/XML using page2tei from @dariok and acdh-transkribus-pyutils.
- create a virtual environment
python -m venv venv
- update pip to latest version and install needed python packages
pip install -U pip && pip install -r requirements.txt
- copy/rename
dummy.env
tosecret.env
and add your Transkribus credentials
- add the Transkribus collection IDs to
./col_ids.txt
(each ID on a new line) run./export_env_variables.sh
to set your Transkribus credentials as environment variables. - run
python dump_data.py
- Create GitHub secrets called
TR_USER
andTR_PW
and add your Transkribus credentials - Go to GitHub Actions and start the workflow -> the exported METS and TEI/XML files will be checked into your repo
created with transkribus-export-cookiecutter