Introduction

AUTHOR=Elmitwalli Sherif , Mehegan John 

TITLE=Sentiment analysis of COP9-related tweets: a comparative study of pre-trained models and traditional techniques

JOURNAL=Frontiers in Big Data

VOLUME=Volume 7 - 2024

YEAR=2024

URL=https://www.frontiersin.org/journals/big-data/articles/10.3389/fdata.2024.1357926

DOI=10.3389/fdata.2024.1357926

ISSN=2624-909X

ABSTRACT=<sec><title>Introduction</title><p>Sentiment analysis has become a crucial area of research in natural language processing in recent years. The study aims to compare the performance of various sentiment analysis techniques, including lexicon-based, machine learning, Bi-LSTM, BERT, and GPT-3 approaches, using two commonly used datasets, IMDB reviews and Sentiment140. The objective is to identify the best-performing technique for an exemplar dataset, tweets associated with the WHO Framework Convention on Tobacco Control Ninth Conference of the Parties in 2021 (COP9).</p></sec><sec><title>Methods</title><p>A two-stage evaluation was conducted. In the first stage, various techniques were compared on standard sentiment analysis datasets using standard evaluation metrics such as accuracy, F1-score, and precision. In the second stage, the best-performing techniques from the first stage were applied to partially annotated COP9 conference-related tweets.</p></sec><sec><title>Results</title><p>In the first stage, BERT achieved the highest F1-scores (0.9380 for IMDB and 0.8114 for Sentiment 140), followed by GPT-3 (0.9119 and 0.7913) and Bi-LSTM (0.8971 and 0.7778). In the second stage, GPT-3 performed the best for sentiment analysis on partially annotated COP9 conference-related tweets, with an F1-score of 0.8812.</p></sec><sec><title>Discussion</title><p>The study demonstrates the effectiveness of pre-trained models like BERT and GPT-3 for sentiment analysis tasks, outperforming traditional techniques on standard datasets. Moreover, the better performance of GPT-3 on the partially annotated COP9 tweets highlights its ability to generalize well to domain-specific data with limited annotations. This provides researchers and practitioners with a viable option of using pre-trained models for sentiment analysis in scenarios with limited or no annotated data across different domains.</p></sec>