Differences in health news from reliable and unreliable media

Sameer Dhoju, Md Main Uddin Rony, Ashad Kabir, Naeemul Hassan

Research output: Book chapter/Published conference paperConference paperpeer-review

4 Citations (Scopus)


The spread of ‘fake’ health news is a big problem with even bigger consequences. In this study, we examine a collection of health-related news articles published by reliable and unreliable media outlets. Our analysis shows that there are structural, topical, and semantic patterns which are different in contents from reliable and unreliable media outlets. Using machine learning, we leverage these patterns and build classification models to identify the source (reliable or unreliable) of a health-related news article. Our model can predict the source of an article with an F-measure of 96%. We argue that the findings from this study will be useful for combating the health disinformation problem.
Original languageEnglish
Title of host publicationThe Web Conference 2019
Subtitle of host publicationCompanion of The World Wide Web Conference WWW 2019
EditorsSihem Amer-Yahia, Mohammad Madian, Ashish Goel, Geert-Jan Houben, Kristina Lerman, Julian McAuley, Ricardo Baeza-Yates, Leila Zia
Place of PublicationUSA
PublisherAssociation for Computing Machinery (ACM)
Number of pages7
ISBN (Print)9781450366755
Publication statusPublished - May 2019
EventThe Web Conference 2019 - The Hyatt Regency, San Francisco, United States
Duration: 13 May 201917 May 2019


ConferenceThe Web Conference 2019
CountryUnited States
CitySan Francisco
Internet address

Fingerprint Dive into the research topics of 'Differences in health news from reliable and unreliable media'. Together they form a unique fingerprint.

Cite this