Today information in the world wide web is overwhelmed by unprecedented quantity of data on versatile topics with varied quality. However, the quality of information disseminated in the field of medicine has been questioned as the negative health consequences of health misinformation can be life-threatening.There is currently no generic automated tool for evaluating the quality of online health information spanned over broad range. To address this gap, in this paper, we applied data mining approach to automatically assess the quality of online health articles based on 10 quality criteria. We have prepared a labelled dataset with 53012 features and applied different feature selection methods to identify the best feature subset with which our trained classifier achieved an accuracy of 84%-90% varied over 10 criteria. Our semantic analysis of features shows the underpinning associations between the selected features and assessment criteria and further rationalize our assessment approach. Our findings will help in identifying high quality health articles and thus aiding users in shaping their opinion to make right choice while picking health related help from online.
Original languageEnglish
Pages (from-to)591-601
Number of pages10
JournalIEEE Journal of Biomedical and Health Informatics
Issue number2
Early online date20 Oct 2020
Publication statusPublished - Feb 2021


Dive into the research topics of 'Automatically assessing quality of online health articles'. Together they form a unique fingerprint.
  • University of Maryland

    Kabir, A. (Visiting researcher)

    11 Nov 201915 Nov 2019

    Activity: Visiting an external institutionVisiting an external organisationAcademic

Cite this