-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathmrp.html
186 lines (148 loc) · 8.18 KB
/
mrp.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<meta content="width=device-width, initial-scale=1.0" name="viewport">
<title>Henry Wu - Project Details</title>
<meta content="" name="description">
<meta content="" name="keywords">
<!-- Global site tag (gtag.js) - Google Analytics -->
<script async src="https://www.googletagmanager.com/gtag/js?id=UA-109307947-1"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag(){dataLayer.push(arguments);}
gtag('js', new Date());
gtag('config', 'UA-109307947-1');
</script>
<!-- Favicons -->
<link href="assets/img/computer.svg" rel="icon">
<!-- Google Fonts -->
<link href="https://fonts.googleapis.com/css?family=Open+Sans:300,300i,400,400i,600,600i,700,700i|Raleway:300,300i,400,400i,500,500i,600,600i,700,700i|Poppins:300,300i,400,400i,500,500i,600,600i,700,700i" rel="stylesheet">
<!-- Vendor CSS Files -->
<link href="assets/vendor/bootstrap/css/bootstrap.min.css" rel="stylesheet">
<link href="assets/vendor/icofont/icofont.min.css" rel="stylesheet">
<link href="assets/vendor/boxicons/css/boxicons.min.css" rel="stylesheet">
<link href="assets/vendor/venobox/venobox.css" rel="stylesheet">
<link href="assets/vendor/owl.carousel/assets/owl.carousel.min.css" rel="stylesheet">
<link href="assets/vendor/aos/aos.css" rel="stylesheet">
<!-- Template Main CSS File -->
<link href="assets/css/style.css" rel="stylesheet">
<!-- =======================================================
* Template Name: iPortfolio - v1.4.1
* Template URL: https://bootstrapmade.com/iportfolio-bootstrap-portfolio-websites-template/
* Author: BootstrapMade.com
* License: https://bootstrapmade.com/license/
======================================================== -->
</head>
<body>
<!-- ======= Mobile nav toggle button ======= -->
<button type="button" class="mobile-nav-toggle d-xl-none"><i class="icofont-navigation-menu"></i></button>
<!-- ======= Header ======= -->
<header id="header">
<div class="d-flex flex-column">
<div class="profile">
<img src="assets/img/selfie.jpg" alt="" class="img-fluid rounded-circle">
<h1 class="text-light"><a href="index.html">Henry Wu</a></h1>
<div class="social-links mt-3 text-center">
<a href="https://www.linkedin.com/in/henry-wu-225544145/" target="_blank" class="linkedin"><i class="bx bxl-linkedin"></i></a>
<a href="https://github.com/Tusk98" target="_blank" class="github"><i class="bx bxl-github"></i></a>
</div>
</div>
<nav class="nav-menu">
<ul>
<li class="active"><a href="index.html#about"><i class="bx bx-home"></i> <span>Home</span></a></li>
<li><a href="index.html#resume"><i class="bx bx-file-blank"></i> <span>Resume</span></a></li>
<li><a href="index.html#projects"><i class="bx bx-server"></i> Projects</a></li>
<li><a href="index.html#contact"><i class="bx bx-envelope"></i> Contact</a></li>
</ul>
</nav><!-- .nav-menu -->
<button type="button" class="mobile-nav-toggle d-xl-none"><i class="icofont-navigation-menu"></i></button>
</div>
</header><!-- End Header -->
<main id="main">
<!-- ======= Breadcrumbs ======= -->
<section id="breadcrumbs" class="breadcrumbs">
<div class="container">
<div class="d-flex justify-content-between align-items-center">
<h2>Project Details</h2>
<ol>
<li><a href="index.html">Home</a></li>
<li>Project Details</li>
</ol>
</div>
</div>
</section><!-- End Breadcrumbs -->
<!-- ======= Portfolio Details Section ======= -->
<section id="portfolio-details" class="portfolio-details">
<div class="container">
<div class="portfolio-details-container">
<div class="owl-carousel portfolio-details-carousel">
<img src="assets/img/mrp-screenshot1.PNG" class="img-fluid" alt="">
<img src="assets/img/mrp-screenshot2.PNG" class="img-fluid" alt="">
<img src="assets/img/mrp-screenshot3.PNG" class="img-fluid" alt="">
</div>
<div class="portfolio-info">
<ul>
<li><strong>Project date</strong>: August, 2023</li>
<li><strong>Github link</strong>: <a target="_blank" href="https://github.com/Tusk98/Handwriting">MRP</a></li>
</ul>
</div>
</div>
<div class="portfolio-description">
<h2>Major Research Project (MRP): Sentiment Analysis of Online Reviews from Yelp Open Dataset</h2>
<strong style="font-size:120%">Background:</strong>
<p>An applied research project (MRP) is required to be conducted and presented in partial fulfillment
of the requirements for the Master of Science: Data Science and Analytics from Toronto Metropolitan university.
<br> I wanted to do a project on NLP and chose to do sentiment analysis on the Yelp Open Dataset.
</p>
<strong style="font-size:120%">MRP Abstract:</strong>
<p>In the digital age, online reviews have become a central component in driving consumer choices.
This study focuses on sentiment analysis of Yelp reviews, juxtaposing traditional machine learning (ML)
algorithms (Naïve Bayes, Logistic Regression, Random Forest, Support Vector Machines) against the
contemporary BERT model. Drawing from a vast dataset of over 6 million reviews, a balanced training set
was derived by undersampling prevalent 5-star reviews. Our key objectives encompass both categorizing
reviews into positive or negative sentiments, but also predicting precise star ratings. Remarkably, while
conventional ML models demonstrated a range of accuracy levels, BERT stood out with its proficiency,
particularly in positive/negative sentiment classification, reaching a flawless accuracy rate. These
findings underscore BERT’s potential in complex sentiment tasks, even as traditional models showcase
notable abilities. The performance of each model is evaluated based on classification reports and a
confusion matrix.
</p>
<strong style="font-size:120%">Challenges:</strong>
<p> There was only one significant challenge that I encountered while writing this paper:
I lacked the necessary computing power to run a deep learning BERT model on 500'000 validation samples. <br>
I utilized a pre-trained BERT model fine tuned on an undersampled training dataset from the Yelp Open Dataset. Unfortunately, my personal computer lacked the capability to run
the fine-tuning or the validation. I ended up having to purchase a subscription to Google Colab for a strong runtime environment.
Even then, I had to reduce the number of validation samples from 500'000 to 250'000 as the Google Colab environment was exhausting its ram, leading to
crashes before my code could complete its execution.
</p>
</div>
</div>
</section><!-- End Portfolio Details Section -->
</main><!-- End #main -->
<!-- ======= Footer ======= -->
<footer id="footer">
<div class="container">
<div class="credits">
Email: <a href="mailto:[email protected]" title="Henry's email">[email protected]<br></a>
Phone: +1 (647) 631-3201
</div>
</div>
</footer><!-- End Footer -->
<a href="#" class="back-to-top"><i class="icofont-simple-up"></i></a>
<!-- Vendor JS Files -->
<script src="assets/vendor/jquery/jquery.min.js"></script>
<script src="assets/vendor/bootstrap/js/bootstrap.bundle.min.js"></script>
<script src="assets/vendor/jquery.easing/jquery.easing.min.js"></script>
<script src="assets/vendor/php-email-form/validate.js"></script>
<script src="assets/vendor/waypoints/jquery.waypoints.min.js"></script>
<script src="assets/vendor/counterup/counterup.min.js"></script>
<script src="assets/vendor/isotope-layout/isotope.pkgd.min.js"></script>
<script src="assets/vendor/venobox/venobox.min.js"></script>
<script src="assets/vendor/owl.carousel/owl.carousel.min.js"></script>
<script src="assets/vendor/typed.js/typed.min.js"></script>
<script src="assets/vendor/aos/aos.js"></script>
<!-- Template Main JS File -->
<script src="assets/js/main.js"></script>
</body>
</html>