-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathfyp.tex
169 lines (133 loc) · 6.45 KB
/
fyp.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
\documentclass[12pt, a4paper]{report}
\usepackage{fyp}
%%these packages are not really necessary if you dont need the code and proofs environments
%%so if you like you can delete from here till the next comment
%%note that there are some examples below which obviously won't work once you remove this part
% \usepackage[square,numbers]{natbib}
\usepackage{natbib}
\usepackage[hyphens,spaces]{url}
\usepackage{verbatim}
\usepackage{amsfonts}
\usepackage{amsmath}
\usepackage{amssymb}
\usepackage{amsthm}
\usepackage{acronym}
\usepackage{csquotes}
\usepackage{multirow}
\usepackage{rotating}
\usepackage[ruled,vlined,linesnumbered,noresetcount]{algorithm2e}
\usepackage[table,xcdraw]{xcolor}
\usepackage[normalem]{ulem}
\usepackage[colorlinks,citecolor=blue,urlcolor=blue,bookmarks=false,hypertexnames=true,linktoc=both]{hyperref}
\useunder{\uline}{\ul}{}
\newcommand{\unk}{\textit{\textless{UNK}\textgreater} }
%%this environment is useful if you have code snippets
\newenvironment{code}
{\footnotesize\verbatim}{\endverbatim\normalfont}
%%the following environments are useful to present proofs in your thesis
\theoremstyle{definition}
\newtheorem{definition}{Definition}[section]
\theoremstyle{definition}%plain}
\newtheorem{example}{Example}[section]
\theoremstyle{definition}%remark}
\newtheorem{proposition}{Proposition}[section]
\theoremstyle{definition}%remark}
\newtheorem{lemma}{Lemma}[section]
\theoremstyle{definition}%remark}
\newtheorem{corollary}{Corollary}[section]
\theoremstyle{definition}%remark}
\newtheorem{theorem}{Theorem}[section]
%%you can delete till here if you dont need the code and proofs environments
\graphicspath{ {figures/}{../figures/} }
\setlength{\headheight}{15pt}
%\overfullrule=15pt
%% TEMPORARY FIX FOR MORE READABLE CITATIONS
% \usepackage{natbib}
% \bibpunct{(}{)}{,}{a}{}{;}
% \bibliographystyle{abbrvnat}
% \renewcommand{\cite}[1]{[\citealp{#1}]}
%% END OF TEMPORARY FIX FOR MORE READABLE CITATIONS
% \bibliographystyle{abbrv} %% UNCOMMENT
\bibliographystyle{ACM-Reference-Format}
\setcitestyle{authoryear,open={(},close={)}}
\makeatletter
\newcommand{\algorithmfootnote}[2][\footnotesize]{%
\let\old@algocf@finish\@algocf@finish% Store algorithm finish macro
\def\@algocf@finish{\old@algocf@finish% Update finish macro to insert "footnote"
\leavevmode\rlap{\begin{minipage}{\linewidth}
#1#2
\end{minipage}}%
}%
}
% \let\Hy@linktoc\Hy@linktoc@none
\makeatother
\usepackage{subfiles}
\begin{document}
%%make sure to enter this information
\title{TSAPLAY: A framework to aid reproducibility in and exploration of the field of Targeted Sentiment Analysis}
\author{Sean Bugeja}
\date{August 2020}
\supervisor{Mr. Mike Rosner}
\department{Faculty of ICT}
\universitycrestpath{crest}
\submitdate{August 2020}
\frontmatter
\begin{acknowledgements}
I would like to thank the University, both for the masters programme itself, as well as their understanding throughout my reading for it.
I am deeply grateful to my supervisor, Mike Rosner, whose level of patience and understanding are unparalleled. His guidance and support were crucial for me too see this project to its completion and cannot be understated. To Professor Matthew Montebello for providing his guidance when I needed it most. To my family and friends, for getting me through the toughest periods of this journey.
\end{acknowledgements}
\begin{abstract}
The proliferation of platforms online which enable users from around to world to voice their opinions on any subject over time has led to the emergence of the largest publically accessible textual representation of public opinion which has not gone unnoticed by the NLP community, using this data in conjunction with sophisticated models for various tasks. Targeted Sentiment Analysis (TSA), whereby the sentiment polarity towards a particular target is identified, presents itself as one of the most popular of such tasks garnering a wide range of different approaches over the years.
In this work, we attempt to recreate some of the most seminal approaches to this task in an effort to evaluate the current state of reproducibility of this field, and whether it has been neglected by the fields' rapid growth. Towards this end, we develop a framework which facilitates TSA model research by decomposing the various parts of the Machine Learning (ML) pipeline into separate modules, abstracting lower-level complexities from the end-user while exposing entry points for feature-extendibility and, providing an ideal environment where models can be evaluated and compared more efficiently.
Using this framework to investigate reproducibility, we identify three issues, namely, lack of specificity, the importance of multiple experimentation runs and, the use of misleading metrics which do not account for class distribution in datasets. Finally, we evaluate different out-of-vocabulary clustering approaches and find downstream effects which merit further investigation.
\end{abstract}
{\hypersetup{linkcolor=black}
\tableofcontents
\listoffigures
\listoftables
}
\newpage
\section*{Acronyms}
\begin{acronym}
\acro{NLP}{Natural Language Processing}
\acro{LSTM}{Long Short Term Memory}
\acro{OOV}{Out of Vocabulary}
\acro{IV}{In Vocabulary}
\acro{SVM}{Support Vector Machine}
\acro{ASR}{Automatic Speech Recognition}
\acro{RC}{Reading Comprehension}
\acro{LDA}{Latent Dirichlet Allocation}
\acro{LSA}{Latent Semantic Analysis}
\acro{CLI}{Command Line Interface}
\end{acronym}
\mainmatter
%%you can organize your chapters into parts but this is not always necessary
%%\part{Part1} - Available but generally not used
\chapter{Introduction} \label{chap:introduction}
\subfile{chapters/1-Introduction/main}
\chapter{Background and Literature Review} \label{chap:background}
\subfile{chapters/2-LiteratureReview/main}
\chapter{Design} \label{chap:design}
\subfile{chapters/3-Design/main}
\chapter{Implementation} \label{chap:implementation}
\subfile{chapters/4-Implementation/main}
% \chapter{Evaluation Methods} \label{chap:evaluation_methods}
% \subfile{chapters/5-EvaluationMethods/main}
\chapter{Results and Observations} \label{chap:results_observations}
\subfile{chapters/6-Results/main}
\chapter{Conclusions and Future Work} \label{chap:conclusion}
\subfile{chapters/7-Conclusion/main}
\appendix
% \begin{proof}
% this is a proof
% \end{proof}
% \chapter{Appendix A}
% \section{These are some details}
% %%example of the code environment
% \begin{code}
% this is some code;
% Make sure to use this template.
% \end{code}
\bibliomatter
\bibliography{references}
\end{document}