Language detection model

This model uses an LSTM for character-level language detection. Given a sentence of text, each character is fed into the model and the final output determines which of five languages the sentence was written in.
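The character-level input and the final decision step can be sketched in plain Julia. This is only an illustration under stated assumptions, not the code in model.jl: the alphabet, the placeholder character `'_'`, the language codes, and the helper names `onehot`, `encode`, and `pick_language` are all hypothetical, and the LSTM itself is left out.

```julia
# Hypothetical alphabet: lowercase letters, space, and '_' as a stand-in
# for any character outside the alphabet. The real alphabet may differ.
alphabet = vcat(collect('a':'z'), [' ', '_'])

# Hypothetical language codes; the README only says there are five languages.
langs = ["en", "it", "fr", "es", "da"]

# One-hot encode a single character; unknown characters map to `unknown`.
function onehot(c::Char, alphabet::Vector{Char}, unknown::Char)
    v = zeros(Float32, length(alphabet))
    idx = findfirst(==(c), alphabet)
    if idx === nothing
        idx = findfirst(==(unknown), alphabet)
    end
    v[idx] = 1f0
    return v
end

# Turn a sentence into a sequence of one-hot vectors, one per character.
# A recurrent model would consume these one at a time.
encode(sentence::AbstractString, alphabet, unknown) =
    [onehot(c, alphabet, unknown) for c in lowercase(sentence)]

# Given the scores produced after the last character, pick the language
# with the highest score.
pick_language(scores::AbstractVector, langs) = langs[argmax(scores)]
```

In this sketch, `encode("Hi!", alphabet, '_')` yields three vectors of length 28, and `pick_language` would be applied to whatever score vector the recurrent model emits after the last character.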

First, run scrape.jl to download a Wikipedia dataset. model.jl contains the model definition and the training code.

Training

cd text/lang-detection
julia scrape.jl
julia --project model.jl
