Multi-class Sentiment Analysis of COVID-19 Tweets by Machine Learning and Deep Learning Approaches

Authors

  • Moustafa Maaskri, University of Tiaret
  • Sid Ahmed Mokhtar Mostefaoui, University of Tiaret
  • Madani Hadj Meghazi, University of Tiaret
  • Mohamed Goismi, Dr. Tahar Moulay University

DOI:

https://doi.org/10.13053/cys-28-2-4568

Keywords:

Ensemble machine learning, Deep learning, Voting, Bagging, Stacking, BERT

Abstract

COVID-19 is a disease that has spread rapidly across the globe, with repercussions well beyond public health. Twitter is one platform where people post reactions to events during the outbreak. User-generated content such as tweets presents particular challenges for sentiment analysis. With that in mind, this work compares four methods for sentiment analysis of Twitter data: three ensemble machine learning models (voting, bagging, and stacking) built on the TF-IDF vector space representation, and BERT (Bidirectional Encoder Representations from Transformers). In the experiments, BERT outperformed the three ensemble techniques, reaching an F1-score, precision, and recall of 74% each when classifying tweets into five sentiment classes on data from a Kaggle competition (Coronavirus tweets NLP-Text Classification).
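
To make the TF-IDF plus ensemble idea concrete, the following is a minimal sketch of one of the three ensemble styles named above (bagging), written with the Python standard library only. It is not the authors' implementation: the nearest-centroid base learner, the smoothed IDF formula, the number of estimators, the three toy labels, and the example tweets are all assumptions chosen for illustration, and the study itself uses five sentiment classes.

```python
import math
import random
from collections import Counter, defaultdict


def fit_idf(token_docs):
    """Learn smoothed inverse document frequencies from tokenized documents."""
    n_docs = len(token_docs)
    doc_freq = Counter(tok for doc in token_docs for tok in set(doc))
    return {tok: math.log((1 + n_docs) / (1 + df)) + 1.0
            for tok, df in doc_freq.items()}


def tfidf(doc, idf):
    """Map one tokenized document to an L2-normalized sparse TF-IDF vector."""
    counts = Counter(tok for tok in doc if tok in idf)
    vec = {tok: tf * idf[tok] for tok, tf in counts.items()}
    norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
    return {tok: w / norm for tok, w in vec.items()}


def fit_centroids(vectors, labels):
    """Assumed base learner: one L2-normalized centroid per class."""
    sums = defaultdict(lambda: defaultdict(float))
    for vec, label in zip(vectors, labels):
        for tok, w in vec.items():
            sums[label][tok] += w
    centroids = {}
    for label, total in sums.items():
        norm = math.sqrt(sum(w * w for w in total.values())) or 1.0
        centroids[label] = {tok: w / norm for tok, w in total.items()}
    return centroids


def predict_one(centroids, vec):
    """Return the class whose centroid has the highest cosine similarity."""
    def score(label):
        centroid = centroids[label]
        return sum(w * centroid.get(tok, 0.0) for tok, w in vec.items())
    return max(centroids, key=score)


def fit_bagging(vectors, labels, n_estimators=25, seed=0):
    """Train each base learner on a bootstrap sample (drawn with replacement)."""
    rng = random.Random(seed)
    n = len(vectors)
    models = []
    for _ in range(n_estimators):
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(fit_centroids([vectors[i] for i in idx],
                                    [labels[i] for i in idx]))
    return models


def predict_bagging(models, vec):
    """Aggregate the base learners' predictions by majority vote."""
    votes = Counter(predict_one(m, vec) for m in models)
    return votes.most_common(1)[0][0]


# Hypothetical toy tweets; not taken from the Kaggle dataset.
train = [
    ("good news the shops are stocked again", "positive"),
    ("good to see neighbours helping each other", "positive"),
    ("good job by the delivery staff", "positive"),
    ("bad panic buying emptied the shelves", "negative"),
    ("bad prices and long queues today", "negative"),
    ("bad shortage of masks", "negative"),
    ("store hours change on monday", "neutral"),
    ("supermarket opens at nine", "neutral"),
    ("new store hours posted online", "neutral"),
]
docs = [text.split() for text, _ in train]
labels = [label for _, label in train]

idf = fit_idf(docs)
vectors = [tfidf(doc, idf) for doc in docs]
models = fit_bagging(vectors, labels)

prediction = predict_bagging(models, tfidf("good".split(), idf))
print(prediction)
```

Voting and stacking differ mainly in the aggregation step: voting combines heterogeneous base models trained on the same data, while stacking trains a second-level model on the base models' outputs instead of taking a majority vote.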

Author Biographies

Moustafa Maaskri, University of Tiaret

LRIAS Laboratory, Computer Science department

Sid Ahmed Mokhtar Mostefaoui, University of Tiaret

LRIAS Laboratory, Computer Science department

Madani Hadj Meghazi, University of Tiaret

LRIAS Laboratory, Computer Science department

Mohamed Goismi, Dr. Tahar Moulay University

GeCoDe Laboratory, Computer Science department

Published

2024-06-12

Section

Articles