A simple implementation of Email Spam Classification using Naive Bayes Classifier.
DATA SET
Raw data comes from the SpamAssassin public corpus, available for free download at http://spamassassin.apache.org/publiccorpus/.
Data categorized into three categories: “spam,” “easy ham,” and “hard ham.”
Results
References