Commit 88637b8

Add README
1 parent 7e0126e commit 88637b8

1 file changed: +13 -0 lines changed

README.md

Lines changed: 13 additions & 0 deletions
@@ -0,0 +1,13 @@
# Text Categorization

> This repository contains the source code and other helper files for my undergraduate thesis titled "Graph Convolutional Neural Networks for Text Categorization" under the supervision of [Prof. Xavier Bresson](http://www.ntu.edu.sg/home/xbresson/) at Nanyang Technological University, Singapore.

There are a total of three benchmark models and three deep learning models implemented in this repository for text classification:

1. `baseline.py`: Linear SVC & Multinomial Naive Bayes
2. `mlp.py`: Multilayer Perceptron
3. `cnn_fchollet.py`: F. Chollet CNN (based on this [2016 blog post](https://blog.keras.io/using-pre-trained-word-embeddings-in-a-keras-model.html))
4. `cnn_ykim.py`: Y. Kim CNN (based on [Y. Kim, 2014](https://arxiv.org/abs/1408.5882))
5. `graph_cnn.py`: Graph CNN (based on [M. Defferrard et al., 2017](https://arxiv.org/abs/1606.09375))

The above models were tested on three datasets: [Rotten Tomatoes Sentence Polarity Dataset](http://www.cs.cornell.edu/people/pabo/movie-review-data/), [20 Newsgroups](http://qwone.com/~jason/20Newsgroups/) and [RCV1](http://www.jmlr.org/papers/volume5/lewis04a/lewis04a.pdf). The code used to preprocess the datasets can be found [here](data.py) and the performance of the models on these datasets can be found [here](results.csv).
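
For orientation, the Multinomial Naive Bayes baseline named in the list can be sketched in plain Python. This is an illustrative sketch only and is not taken from `baseline.py`: the function names `train_multinomial_nb` and `predict`, the whitespace tokenization, the Laplace smoothing value `alpha=1.0`, and the four toy sentences are all assumptions made for the example, and none of the data comes from the three datasets above.

```python
import math
from collections import Counter, defaultdict


def train_multinomial_nb(docs, labels, alpha=1.0):
    """Fit a multinomial Naive Bayes model on tokenized documents.

    Hypothetical helper for illustration; alpha is the Laplace smoothing term.
    """
    class_doc_counts = Counter(labels)
    word_counts = defaultdict(Counter)  # class -> word -> count
    vocab = set()
    for tokens, label in zip(docs, labels):
        word_counts[label].update(tokens)
        vocab.update(tokens)

    n_docs = len(docs)
    # Log prior: fraction of training documents in each class.
    log_prior = {c: math.log(n / n_docs) for c, n in class_doc_counts.items()}

    # Log likelihood: smoothed per-class word frequencies.
    log_likelihood = {}
    for c in class_doc_counts:
        total = sum(word_counts[c].values()) + alpha * len(vocab)
        log_likelihood[c] = {
            w: math.log((word_counts[c][w] + alpha) / total) for w in vocab
        }
    return log_prior, log_likelihood


def predict(model, tokens):
    """Return the class with the highest posterior log score."""
    log_prior, log_likelihood = model
    scores = {}
    for c in log_prior:
        # Words outside the training vocabulary are skipped.
        scores[c] = log_prior[c] + sum(
            log_likelihood[c][w] for w in tokens if w in log_likelihood[c]
        )
    return max(scores, key=scores.get)


# Toy training data, invented for this example.
train_docs = [
    "a moving and funny film".split(),
    "funny warm and moving".split(),
    "a dull and tedious film".split(),
    "tedious dull plot".split(),
]
train_labels = ["pos", "pos", "neg", "neg"]
model = train_multinomial_nb(train_docs, train_labels)
```

Calling `predict(model, "funny moving".split())` on this toy model scores each class by summing log probabilities, which avoids the numeric underflow that multiplying many small probabilities would cause on longer documents.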
