Open Corpus Workbench with TEITOK Docker compose file
-
Updated
May 30, 2019 - Dockerfile
Open Corpus Workbench with TEITOK Docker compose file
evenki-corpus
Corpus linguistics final project for the course COMM 313: Computational Text Analysis at the University of Pennsylvania. Aims to determine how the anti-vaccination movement has evolved on social media before and during the COVID-19 pandemic.
Easy Text Annotator
The data and code located in this repository introduce an international preparatory class learner corpus and its complexity analyses.
(Ongoing module in development) Getting Wikipedia articles parsed content. Created for getting text corpuses data fast and easy. But can be freely used for other purpuses too
Tools and resources for the computational processing of the Nheengatu language
Kurdish Textbooks Corpus
The recordings of marwari speech by Bharti, the speaker of it. It Includes setences of all kinds using translation method and narrations of health care and lifecycle.
Corpus for linguistic study of natural gas pipeline debates.
Treebanks modified from PROIEL and Perseus.
A module to quickly create Corpus objects containing TTR, tokenized sentences, lexical density, class frequencies and more.
A tool for determinating distances between multimodal annotations.
2019 project - french wikipedia corpus data analysis
Paper that Lena Baunaz and I are working on as part of my SNSF-funded 'Focus in diachrony' research project at the University of Cambridge, UK.
All scripts needed to exploit French corpus and create the associated database for the CODIM Project.
Heuristics and cognitive biases in public discourse on climate changes - lingustic data analysis
Add a description, image, and links to the corpus-linguistics topic page so that developers can more easily learn about it.
To associate your repository with the corpus-linguistics topic, visit your repo's landing page and select "manage topics."