Feature selection approach for Twitter sentiment analysis and text classification based on Chi-Square and Naïve Bayes

S. Paudel, P. W.C. Prasad, Abeer Alsadoon, Md Rafiqul Islam, Amr Elchouemi

Research output: Book chapter/Published conference paper (Conference paper)

Abstract

With the rapid growth of web and mobile technology, social networking services such as Twitter are widely used, and large amounts of data are generated daily on social networking sites. Efficient sentiment analysis of such data is very important for a range of applications, and improving the accuracy of sentiment detection is the main aim of this research. This report examines the combination of a Chi-Squared feature selection algorithm, k-means clustering and TF-IDF attribute weighting, based on Naïve Bayes, for classification of text and sentiment in communications generated on Twitter. This approach is compared with other approaches based on Naïve Bayes to give an account of their relative strengths and weaknesses. When running experiments on multi-domain Twitter datasets, results indicate that the proposed method shows superior performance across a range of. The main aim of this research is to enhance the performance of the Naïve Bayes classifier using a feature selection technique.
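
The pipeline named in the abstract (Chi-Squared feature selection, TF-IDF weighting, Naïve Bayes) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy tweets, the function names (`chi2_scores`, `select_top_k`, `train_nb`, `predict`), the IDF formula, and the Laplace smoothing are all assumptions made for illustration, and the k-means clustering step is omitted because the abstract does not say how it is used.

```python
import math
from collections import Counter, defaultdict

def chi2_scores(docs, labels):
    """Chi-squared score per term (max over classes), from document presence."""
    n = len(docs)
    class_count = Counter(labels)
    df = Counter()                # documents containing the term
    df_cls = defaultdict(Counter) # same, split by class
    for tokens, y in zip(docs, labels):
        for t in set(tokens):
            df[t] += 1
            df_cls[y][t] += 1
    scores = {}
    for t in df:
        best = 0.0
        for cls in class_count:
            a = df_cls[cls][t]          # in class, term present
            b = df[t] - a               # other classes, term present
            c = class_count[cls] - a    # in class, term absent
            d = n - a - b - c           # other classes, term absent
            denom = (a + c) * (b + d) * (a + b) * (c + d)
            if denom:
                best = max(best, n * (a * d - b * c) ** 2 / denom)
        scores[t] = best
    return scores

def select_top_k(docs, labels, k):
    """Keep the k terms with the highest chi-squared score (ties broken by name)."""
    scores = chi2_scores(docs, labels)
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]

def train_nb(docs, labels, vocab):
    """Multinomial Naive Bayes over TF-IDF weights (assumed: idf = ln(N/df) + 1)."""
    n = len(docs)
    df = Counter(t for tokens in docs for t in set(tokens) if t in vocab)
    idf = {t: math.log(n / df[t]) + 1.0 for t in vocab}
    weights = {cls: Counter() for cls in set(labels)}
    for tokens, y in zip(docs, labels):
        for t, tf in Counter(tokens).items():
            if t in vocab:
                weights[y][t] += tf * idf[t]
    priors = {cls: math.log(cnt / n) for cls, cnt in Counter(labels).items()}
    loglik = {}
    for cls, w in weights.items():
        total = sum(w.values()) + len(vocab)  # Laplace (add-one) smoothing
        loglik[cls] = {t: math.log((w[t] + 1.0) / total) for t in vocab}
    return priors, loglik

def predict(tokens, model):
    """Return the class with the highest log-posterior; unselected terms are ignored."""
    priors, loglik = model
    def score(cls):
        return priors[cls] + sum(loglik[cls][t] for t in tokens if t in loglik[cls])
    return max(sorted(priors), key=score)

# Hypothetical toy data, not taken from the paper's datasets.
docs = [s.split() for s in (
    "i love this phone",
    "great battery love it",
    "awful screen hate it",
    "i hate this terrible phone",
)]
labels = ["pos", "pos", "neg", "neg"]

selected = select_top_k(docs, labels, k=2)
model = train_nb(docs, labels, set(selected))
print(selected, predict("love love it".split(), model))
```

In this sketch the feature selection step discards terms such as "phone" or "it" that occur equally in both classes, so the classifier is trained only on the terms most strongly associated with one class.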
Original language: English
Title of host publication: International Conference on Applications and Techniques in Cyber Security and Intelligence ATCI 2018 - Applications and Techniques in Cyber Security and Intelligence
Editors: Mohammed Atiquzzaman, Zheng Xu, Jemal Abawajy, Kim-Kwang Raymond Choo, Rafiqul Islam
Publisher: Springer-Verlag London Ltd.
Pages: 281-298
Number of pages: 18
ISBN (Print): 9783319987750
DOIs: https://doi.org/10.1007/978-3-319-98776-7_30
Publication status: Published - 2019
Event: International Conference of Applications and Techniques in Cyber Intelligence, ATCI 2018 - Shanghai University, Shanghai, China
Duration: 11 Jul 2018 to 13 Jul 2018
https://web.archive.org/web/20180909204616/http://www.atci2018.com/index.html
https://researchoutput.csu.edu.au/admin/files/37226362/37225002_Published_Paper.pdf (proceedings preface and index)

Publication series

Name: Advances in Intelligent Systems and Computing
Volume: 842
ISSN (Print): 2194-5357

Conference

Conference: International Conference of Applications and Techniques in Cyber Intelligence, ATCI 2018
Country: China
City: Shanghai
Period: 11/07/18 to 13/07/18

  • Cite this

    Paudel, S., Prasad, P. W. C., Alsadoon, A., Islam, M. R., & Elchouemi, A. (2019). Feature selection approach for Twitter sentiment analysis and text classification based on Chi-Square and Naïve Bayes. In M. Atiquzzaman, Z. Xu, J. Abawajy, K-K. R. Choo, & R. Islam (Eds.), International Conference on Applications and Techniques in Cyber Security and Intelligence ATCI 2018 - Applications and Techniques in Cyber Security and Intelligence (pp. 281-298). (Advances in Intelligent Systems and Computing; Vol. 842). Springer-Verlag London Ltd. https://doi.org/10.1007/978-3-319-98776-7_30