TY - CHAP
T1 - SATLabel
T2 - A framework for sentiment and aspect terms based automatic topic labelling
AU - Shahriar, Khandaker Tayef
AU - Moni, Mohammad Ali
AU - Hoque, Mohammed Moshiul
AU - Islam, Muhammad Nazrul
AU - Sarker, Iqbal H.
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2022
Y1 - 2022
N2 - In this paper, we present a framework that automatically labels latent Dirichlet allocation (LDA) generated topics using sentiment and aspect terms from COVID-19 tweets to help the end-users by minimizing the cognitive overhead of identifying key topics labels. Social media platforms, especially Twitter, are considered as one of the most influential sources of information for providing public opinion related to a critical situation like the COVID-19 pandemic. LDA is a popular topic modelling algorithm that extracts hidden themes of documents without assigning a specific label. Thus, automatic labelling of LDA-generated topics from COVID-19 tweets is a great challenge instead of following the manual labelling approach to get an overview of wider public opinion. To overcome this problem, in this paper, we propose a framework named SATLabel that effectively identifies significant topic labels using top unigrams features of sentiment terms and aspect terms clusters from LDA-generated topics of COVID-19-related tweets to uncover various issues related to the COVID-19 pandemic. The experimental results show that our methodology is more effective, simpler, and traces better topic labels compare to the manual topic labelling approach.
AB - In this paper, we present a framework that automatically labels latent Dirichlet allocation (LDA) generated topics using sentiment and aspect terms from COVID-19 tweets to help the end-users by minimizing the cognitive overhead of identifying key topics labels. Social media platforms, especially Twitter, are considered as one of the most influential sources of information for providing public opinion related to a critical situation like the COVID-19 pandemic. LDA is a popular topic modelling algorithm that extracts hidden themes of documents without assigning a specific label. Thus, automatic labelling of LDA-generated topics from COVID-19 tweets is a great challenge instead of following the manual labelling approach to get an overview of wider public opinion. To overcome this problem, in this paper, we propose a framework named SATLabel that effectively identifies significant topic labels using top unigrams features of sentiment terms and aspect terms clusters from LDA-generated topics of COVID-19-related tweets to uncover various issues related to the COVID-19 pandemic. The experimental results show that our methodology is more effective, simpler, and traces better topic labels compare to the manual topic labelling approach.
KW - Aspect terms
KW - Automatic labelling
KW - Data-driven framework
KW - LDA
KW - Sentiment terms
KW - Soft cosine similarity
KW - Topic
KW - Unigrams
UR - http://www.scopus.com/inward/record.url?scp=85135517750&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85135517750&partnerID=8YFLogxK
U2 - 10.1007/978-981-19-2347-0_6
DO - 10.1007/978-981-19-2347-0_6
M3 - Chapter (peer-reviewed)
AN - SCOPUS:85135517750
SN - 9789811923463
T3 - Lecture Notes on Data Engineering and Communications Technologies
SP - 63
EP - 75
BT - Machine intelligence and data science applications
A2 - , Vaclav Skala
A2 - , T. P. Singh
A2 - , Tanupriya Choudhury
A2 - , Ravi Tomar
A2 - , Md. Abul Bashar
PB - Springer
CY - Singapore
ER -