Performance evaluation of keyword extraction methods and visualization for student online comments

Feng Liu, Xiaodi Huang, Weidong Huang, Sophia Duan

Research output: Contribution to journalArticlepeer-review

10 Citations (Scopus)
56 Downloads (Pure)

Abstract

Topic keyword extraction (as a typical task in information retrieval) refers to extracting the core keywords from document topics. In an online environment, students often post comments in subject forums. The automatic and accurate extraction of keywords from these comments are beneficial to lecturers (particular when it comes to repeatedly delivered subjects). In this paper, we compare the performance of traditional machine learning algorithms and two deep learning methods in extracting topic keywords from student comments posted in subject forums. For this purpose, we collected student comment data from a period of two years, manually tagging part of the raw data for our experiments. Based on this dataset, we comprehensively compared the five typical algorithms of naïve Bayes, logistic regression, support vector machine, convolutional neural networks, and Long Short-Term Memory with Attention (Att-LSTM). The performances were measured by the four evaluation metrics. We further examined the keywords by visualization. From the results of our experiment and visualization, we conclude that the Att-LSTM method is the best approach for topic keyword extraction from student comments. Further, the results from the algorithms and visualization are symmetry, to some degree. In particular, the extracted topics from the comments posted at the same stages of different teaching sessions are, almost, reflection symmetry.
Original languageEnglish
Article number1923
Pages (from-to)1-20
Number of pages20
JournalSymmetry
Volume12
Issue number11
DOIs
Publication statusPublished - 22 Nov 2020

Fingerprint

Dive into the research topics of 'Performance evaluation of keyword extraction methods and visualization for student online comments'. Together they form a unique fingerprint.

Cite this