Skip to content
Change the repository type filter

All

    Repositories list

    • Plugin that runs the analyses described in Vine, Boyd, & Pennebaker (2020). See also: Vocabulate (https://github.com/ryanboyd/Vocabulate)
      C#
      MIT License
      0000Updated Apr 12, 2022Apr 12, 2022
    • Plugin to evaluate the statistical properties of a text analysis dictionary. Gives Mean and Standard Deviations for each word, plus internal consistency metrics for each category.
      C#
      MIT License
      0010Updated Mar 2, 2022Mar 2, 2022
    • Frequency-based content coding, like that found in LIWC.
      C#
      MIT License
      0100Updated Jul 14, 2021Jul 14, 2021
    • StopList

      Public
      Plugin that contains stop lists and will remove tokens from a text before carrying forward
      C#
      MIT License
      0000Updated Jul 9, 2021Jul 9, 2021
    • Plugin to lemmatize based on pre-defined lists. At the time of writing this description, the lists used are primarily from https://github.com/michmech/lemmatization-lists
      C#
      MIT License
      0000Updated Jul 6, 2021Jul 6, 2021
    • Plugin to calculate Narrative Arc scores. See https://www.arcofnarrative.com/ and the research paper at https://www.doi.org/10.1126/sciadv.aba2196
      C#
      MIT License
      0400Updated Jun 30, 2021Jun 30, 2021
    • Plugin for analyzing texts via a weighted dictionary.
      C#
      MIT License
      0000Updated Jun 18, 2021Jun 18, 2021
    • VADER

      Public
      Plugin that wraps around the VADER sentiment analysis method (specifically, the VADER Sharp implementation).
      C#
      MIT License
      0000Updated Jun 1, 2021Jun 1, 2021
    • Plugin to do sentiment analysis on a sentence-by-sentence basis. Built around CoreNLP.NET (https://sergey-tihon.github.io/Stanford.NLP.NET/StanfordCoreNLP.html)
      C#
      MIT License
      0000Updated Jun 1, 2021Jun 1, 2021
    • Plugin that does Part Of Speech Tagging, built around CoreNLP.NET (https://sergey-tihon.github.io/Stanford.NLP.NET/StanfordCoreNLP.html)
      C#
      MIT License
      0100Updated Apr 10, 2021Apr 10, 2021
    • Plugin to write strings out into separate .txt files.
      C#
      MIT License
      0000Updated Mar 24, 2021Mar 24, 2021
    • Plugin to read in texts from .docx files
      C#
      MIT License
      0000Updated Mar 16, 2021Mar 16, 2021
    • PlugIndex

      Public
      Indexer for the plugins. Not the smartest support tool, but gets the job done.
      C#
      MIT License
      0000Updated Mar 7, 2021Mar 7, 2021
    • Software to update BUTTER plugins
      C#
      MIT License
      0000Updated Mar 7, 2021Mar 7, 2021
    • Statistically compare the word frequencies from 2 or more BUTTER frequency lists.
      C#
      MIT License
      0000Updated Mar 5, 2021Mar 5, 2021
    • Syrup

      Public
      Hashing/indexing/etc. Just a part of the toolchain for package distribution.
      C#
      MIT License
      0000Updated Mar 2, 2021Mar 2, 2021
    • This plugin will use a BUTTER frequency list to replace individual words with phrases. This is useful for taking n-grams and joining them into single tokens using collocation metrics. A useful preprocessing step for something like word2vec.
      C#
      MIT License
      0000Updated Feb 27, 2021Feb 27, 2021
    • Word2Vec

      Public
      Plugin to train a word2vec model.
      C#
      MIT License
      0100Updated Dec 9, 2020Dec 9, 2020
    • Plugin for tokenizing texts via whitespace.
      C#
      MIT License
      0100Updated Dec 9, 2020Dec 9, 2020
    • Pluging that is essentially a C# port of the NLTK Twitter-Aware Tokenizer (i.e., the "casual" tokenizer: https://github.com/nltk/nltk/blob/develop/nltk/tokenize/casual.py)
      C#
      Apache License 2.0
      0100Updated Dec 9, 2020Dec 9, 2020
    • Plugin that takes tokens and concatenates them back into a single string. Useful for when you want to tokenize text, do a bunch of preprocessing, then output the preprocessed string.
      C#
      MIT License
      0000Updated Dec 9, 2020Dec 9, 2020
    • Plugin to segments texts in various ways.
      C#
      MIT License
      0200Updated Dec 4, 2020Dec 4, 2020
    • Plugin to perform user-defined regex replacements in texts.
      C#
      MIT License
      0100Updated Dec 4, 2020Dec 4, 2020
    • Plugin to interface with the Receptiviti API (see https://www.receptiviti.com/ and https://receptiviti.github.io/api-docs/). Requires that you have your Receptiviti API keys available.
      C#
      MIT License
      0000Updated Dec 3, 2020Dec 3, 2020
    • Plugin to calculate several different indicies of "readability" (e.g., SMOG, Flesch-Kincaid, etc.)
      C#
      MIT License
      0100Updated Dec 3, 2020Dec 3, 2020
    • Plugin to write output to a CSV file. Used by most plugin chains to write your output.
      C#
      MIT License
      0000Updated Dec 3, 2020Dec 3, 2020
    • Plugin to write a corpus of texts into a single .txt file (with each text separated by a newline).
      C#
      MIT License
      0000Updated Dec 3, 2020Dec 3, 2020
    • Plugin to drop observations that fall below a user-specified number of tokens.
      C#
      MIT License
      0000Updated Dec 3, 2020Dec 3, 2020
    • Plugin to calculate lexical diversity/richness scores. Includes things like type-token ratio, etc.
      C#
      MIT License
      0000Updated Dec 3, 2020Dec 3, 2020
    • Plugin that wraps around the LemmaGen lemmatizer (http://lemmatise.ijs.si/Software/Version3)
      C#
      MIT License
      0000Updated Dec 3, 2020Dec 3, 2020