Sentiment Analysis in Twitter Data Using Machine Learning-Based Approach

Kazi Abdullah Al Arafat, Mahmudur Rahman Roni, Sumaya Siddique, Mohammad Abu Yousuf, Mohammad Ali Moni

Research output: Book chapter/Published conference paperConference paperpeer-review

Abstract

There are now many more people sharing their views, ideas, and opinions because of the proliferation of user-generated material on the Internet and the development of information technology. People frequently use social media platforms like Facebook, Instagram, WhatsApp, and Twitter to express their thoughts and feelings. Opinion analysis, which is the act of identifying emotional undertones from this enormous volume of Internet data, has become more important as a way of understanding public opinion, trends, and brand impressions. However, because of the distinctive features of the network, such as informal language, character constraints, and the quick influx of data, conducting sentiment analysis on Twitter data poses unique obstacles. By considering the above difficulties, we have proposed a machine learning-based approach to classify the emotions as either positive, negative, or neutral. This paper conducts a thorough analysis of sentiment on Twitter data using the Support Vector Machine (SVM), Maximum Entropy (Max Ent), and Naive Bayes Multinomial (NBM) machine learning algorithms. To do this experiment, we deal with the large volume of data collected from Kaggle. To train the machine learning model, this wide range of tweet datasets from different subjects and domains is preprocessed using the Natural Language Tool Kit (NLTK). The results from these models were tested using various testing matrices like precision, recall, and F1-score. We achieved a maximum accuracy of 97% for SVM compared to all three classifiers.

Original languageEnglish
Title of host publicationProceedings of Trends in Electronics and Health Informatics - TEHI 2023
EditorsMufti Mahmud, M. Shamim Kaiser, Anirban Bandyopadhyay, Kanad Ray, Shamim Al Mamun
PublisherSpringer Science and Business Media Deutschland GmbH
Pages169-184
Number of pages16
ISBN (Print)9789819739363
DOIs
Publication statusPublished - 2025
Event3rd International Conference on Trends in Electronics and Health Informatics, TEHI 2023 - Dhaka, Bangladesh
Duration: 20 Dec 202321 Dec 2023

Publication series

NameLecture Notes in Networks and Systems
Volume1034 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

Conference3rd International Conference on Trends in Electronics and Health Informatics, TEHI 2023
Country/TerritoryBangladesh
CityDhaka
Period20/12/2321/12/23

Fingerprint

Dive into the research topics of 'Sentiment Analysis in Twitter Data Using Machine Learning-Based Approach'. Together they form a unique fingerprint.

Cite this