Data Mining Course - Articles Classification Project

This project uses classifiers from the scikit-learn library to classify articles by their content. It also uses the wordcloud library to generate one image per article category, again based on the articles' content.

To install all the dependencies in your virtual environment, run:

  • pip install -r requirements.txt

To run all the classifiers and the WordCloud script with Python 3, run:

  • python main.py

If you are running on Windows with the standard Python distribution, you will need Microsoft Visual C++ 14.0. For more information, click here.
Because that dependency takes many GBs to install, you can skip it and run the project without the WordCloud: the project automatically detects the missing dependency and skips the WordCloud step.
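
The skip-if-missing behaviour described above can be sketched as follows. This is only an illustration of one common way to detect an optional dependency, not the project's actual code: the helper names `optional_module_available` and `run_wordcloud_step` are hypothetical, and the check relies on the standard library's `importlib.util.find_spec`.

```python
import importlib.util


def optional_module_available(module_name: str = "wordcloud") -> bool:
    """Return True if the named top-level module can be found for import."""
    # find_spec returns None when a top-level module is not installed.
    return importlib.util.find_spec(module_name) is not None


def run_wordcloud_step(generate, module_name: str = "wordcloud") -> bool:
    """Call `generate` only when the optional module is installed.

    `generate` is a hypothetical zero-argument callable that would build
    the word cloud images. Returns True if it ran, False if it was skipped.
    """
    if not optional_module_available(module_name):
        print(f"{module_name} is not installed; skipping the word cloud step.")
        return False
    generate()
    return True
```

With a guard like this, the classifiers can still run on a machine where wordcloud could not be built, and only the image generation is skipped.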