This project is temporarily used for our group coursework and contains two relation extraction models: a BiLSTM and a BERT-based model.
Junfan Cheng, Tong Shen, Qiujie Xu, Yuxuan Zhang
The dataset used in this project is only the "NYT" portion of the UniRel model's data, which can also be downloaded from the shared drive link.
The dataset was originally obtained from TPLinker; refer to the official TPLinker repository.
Please ensure you download the related dataset files and place them under the correct path:
- Download and unzip the dataset from this shared link.
- Copy all the files from the "nyt" folders within the unzipped folder to the dataset path: "./nyt_dataset".
The trained models for this assignment have been uploaded at this link. Please ensure all models are downloaded and placed in the "trained_model" folder before running the code.
Each model notebook provides a section named "Predictor" where users can modify the input sentence.
- BiLSTM: Execute all section cells except for "Model Training and Validation", "Training and Validation Analysis", "Model Testing", and "Drawing the Heatmap of the Confusion Matrix". Users can then change the input sentence via the 'new_sentence' argument of the 'predict_new_sentence' method in the last cell of "Predictor" (see the sketch after this list).
- BERT: First, execute all cells of "Necessary (requirements & data preprocessing)". Then run the cells under the "Predictor" section (1, 2, 3, 4.1.1, 4.1.2) sequentially. The predicted result is shown as the output of the last cell in section 4.1.2; users can change the input sentence in the cell of section 4.1.1.
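For reference, here is a minimal sketch of the last BiLSTM "Predictor" cell, assuming `predict_new_sentence` is a method of the trained model object that returns the predicted relation label (the object name and exact signature in the notebook may differ):

```python
# Hypothetical usage; "model" stands for whatever the notebook calls the
# loaded BiLSTM predictor object.
new_sentence = "Steve Jobs was the co-founder of Apple."
relation = model.predict_new_sentence(new_sentence=new_sentence)
print(relation)  # e.g. an NYT relation label such as "/business/company/founders"
```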
Tips:
- Before testing or predicting, ensure all model files are downloaded and placed in the directory (./trained_model).
- For the BERT predictor, section 4.2 does not need to be run if you only want to test a custom input sentence.
To enhance the original BiLSTM version, we integrated an additional embedding and attention layer, improving the model's ability to focus on relevant labels and avoid concentrating on meaningless ones. The results displayed below show that this is a promising approach to improving the performance of BiLSTM.
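A minimal PyTorch sketch of the BiLSTM-with-attention idea, assuming per-token attention over the BiLSTM outputs; the layer sizes and the exact attention formulation used in our notebook may differ:

```python
import torch
import torch.nn as nn

class BiLSTMAttention(nn.Module):
    """Illustrative BiLSTM + attention relation classifier (not the exact notebook architecture)."""

    def __init__(self, vocab_size, embed_dim, hidden_dim, num_relations):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.bilstm = nn.LSTM(embed_dim, hidden_dim,
                              batch_first=True, bidirectional=True)
        # One attention score per time step, computed from the BiLSTM output.
        self.attn = nn.Linear(2 * hidden_dim, 1)
        self.classifier = nn.Linear(2 * hidden_dim, num_relations)

    def forward(self, token_ids):                      # (batch, seq_len)
        h, _ = self.bilstm(self.embedding(token_ids))  # (batch, seq_len, 2*hidden)
        weights = torch.softmax(self.attn(h), dim=1)   # attention over time steps
        context = (weights * h).sum(dim=1)             # weighted sum of states
        return self.classifier(context)                # relation logits
```

The softmax over the time dimension lets the classifier weight informative tokens more heavily instead of treating every position equally.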
To improve BERT's performance and mitigate overfitting, two additional models are used: a Named Entity Recognition (NER) model from the transformers library and a word2vec model. The approach expands the dataset with entities identified by the NER model and selects the most semantically similar words by cosine distance from the word2vec model. The outputs are then integrated: each model's score is weighted by its effectiveness, the weighted contributions are summed into a final score, and the relation label with the highest final score is chosen.
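A hedged sketch of the weighted-score combination step, assuming each model produces per-label scores in [0, 1]; the weights shown are placeholders, not the values tuned in our notebook:

```python
# Placeholder effectiveness weights for BERT, NER, and word2vec (assumptions).
W_BERT, W_NER, W_W2V = 0.6, 0.2, 0.2

def pick_label(bert_scores, ner_scores, w2v_scores):
    """Each argument maps a relation label to a score in [0, 1]."""
    labels = set(bert_scores) | set(ner_scores) | set(w2v_scores)
    final = {
        label: (W_BERT * bert_scores.get(label, 0.0)
                + W_NER * ner_scores.get(label, 0.0)
                + W_W2V * w2v_scores.get(label, 0.0))
        for label in labels
    }
    # Return the relation label with the highest weighted sum.
    return max(final, key=final.get)

# Example: all three models favour the same label here.
print(pick_label(
    {"/people/person/place_of_birth": 0.7, "None": 0.3},
    {"/people/person/place_of_birth": 0.9},
    {"/people/person/place_of_birth": 0.5, "None": 0.4},
))
```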
The following figures show how the different relation extraction models perform, evaluated with the F1 score. F1 balances how many predicted relations are correct (precision) against how many true relations the model recovers (recall), making it a good single measure for comparing models.
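Concretely, F1 is the harmonic mean of precision and recall:

$$
F_1 = \frac{2 \cdot \mathrm{precision} \cdot \mathrm{recall}}{\mathrm{precision} + \mathrm{recall}}
$$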
BiLSTM: The results of BiLSTM with attention (left) and without attention (right) on the test dataset.
BERT: The results of BERT with NER & word2vec (left) and without NER & word2vec (right) on the test dataset.
- Valette, M. (2019). Simple Relation Extraction with a Bi-LSTM Model. Medium. Available online: https://medium.com/southpigalle/simple-relation-extraction-with-a-bi-lstm-model-part-1-682b670d5e11
- Nayak, T., & Ng, H. T. (2019). Effective attention modeling for neural relation extraction. arXiv preprint arXiv:1912.03832.
- Tang, W., Xu, B., Zhao, Y., Mao, Z., Liu, Y., Liao, Y., & Xie, H. (2022). UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction. arXiv preprint arXiv:2211.09039.



