scispace - formally typeset
Journal ArticleDOI

Detecting health misinformation in online health communities: Incorporating behavioral features into machine learning based approaches

Yuehua Zhao, +2 more
- 01 Jan 2021 - 
- Vol. 58, Iss: 1, pp 102390
Reads0
Chats0
TLDR
A novel health misinformation detection model was proposed which incorporated the central- level features and the peripheral-level features (including linguistic features, sentiment features, and user behavioral features) and correctly detected about 85% of the health misinformation.
Abstract
Curbing the diffusion of health misinformation on social media has long been a public concern since the spread of such misinformation can have adverse effects on public health. Previous studies mainly relied on linguistic features and textual features to detect online health-related misinformation. Based on the Elaboration Likelihood Model (ELM), this study proposed that the features of online health misinformation can be classified into two levels: central-level and peripheral-level. In this study, a novel health misinformation detection model was proposed which incorporated the central-level features (including topic features) and the peripheral-level features (including linguistic features, sentiment features, and user behavioral features). In addition, the following behavioral features were introduced to reflect the interaction characteristics of users: Discussion initiation, Interaction engagement, Influential scope, Relational mediation, and Informational independence. Due to the lack of a labeled dataset, we collected the dataset from a real online health community in order to provide a real scenario for data analysis. Four types of misinformation were identified through the coding analysis. The proposed model and its individual features were validated on the real-world dataset. The model correctly detected about 85% of the health misinformation. The results also suggested that behavioral features were more informative than linguistic features in detecting misinformation. The findings not only demonstrated the efficacy of behavioral features in health misinformation detection but also offered both methodological and theoretical contributions to misinformation detection from the perspective of integrating the features of messages as well as the features of message creators.

read more

Citations
More filters
Journal ArticleDOI

Characterizing the dissemination of misinformation on social media in health emergencies: An empirical study based on COVID-19

TL;DR: The empirical results show that health caution and advice, help seeking misinformation, and emotional support significantly increase the dissemination of misinformation, indicating both dark and bright misinformation ambiguity and richness.
Journal ArticleDOI

Temporally evolving graph neural network for fake news detection

TL;DR: Wang et al. as discussed by the authors introduced a novel temporal propagation-based fake news detection framework, which could fuse structure, content semantics, and temporal information, and model temporal evolution patterns of real-world news as the graph evolving under the setting of dynamic diffusion networks.
Journal ArticleDOI

Linguistic characteristics and the dissemination of misinformation in social media: The moderating effect of information richness

TL;DR: A misinformation dissemination model that includes the direct effects of four novel linguistic characteristics on dissemination and the moderating effect of information richness is proposed, indicating that the four linguistic characteristics proposed by this study are also suitable for the dissemination of misinformation in English.
Journal ArticleDOI

Health Misinformation Detection in the Social Web: An Overview and a Data Science Approach

TL;DR: This paper considers several publicly available datasets that have only recently been generated for the assessment of health misinformation under different perspectives and identifies distinct groups of features and machine learning techniques that can be effective to assess misinformation in online health-related content.
Journal ArticleDOI

A rumor reversal model of online health information during the Covid-19 epidemic.

Abstract: The development of the Internet and social media has expanded the speed and scope of information dissemination, but not all widely disseminated information is true. Especially during the public health emergencies, the endogenous health information demand generated by the lack of scientific knowledge of health information among online users stimulates the dissemination of health information by mass media while providing opportunities for rumor mongers to publish and spread online rumors. Invalid scientific knowledge and rumors will have a serious negative impact and disrupt social order during epidemic outbreaks such as COVID-19. Therefore, it is extremely important to construct an effective online rumor reversal model. The purpose of this study is to build an online rumor reversal model to control the spread of online rumors and reduce their negative impact. From the perspective of internal and external factors, based on the SIR model, this study constructed a G-SCNDR online rumor reversal model by adopting scientific knowledge level theory and an external online rumor control strategy. In this study, the G-SCNDR model is simulated, and a sensitivity analysis of the important parameters of the model is performed. The reversal efficiency of the G-SCNDR model can be improved by properly adopting the isolation-conversion strategy as the external control approach to online rumors with improving the popularization rate of the level of users' scientific knowledge and accelerating the transformation efficiency of official nodes. This study can help provide a better understanding of the process of online rumor spreading and reversing, as well as offering ceritain guidance and countermeasures for online rumor control during public health emergencies.
References
More filters
Journal ArticleDOI

Random Forests

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Journal Article

Scikit-learn: Machine Learning in Python

TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
Journal ArticleDOI

Support-Vector Networks

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.
Book

Content analysis: an introduction to its methodology

TL;DR: History Conceptual Foundations Uses and Kinds of Inference The Logic of Content Analysis Designs Unitizing Sampling Recording Data Languages Constructs for Inference Analytical Techniques The Use of Computers Reliability Validity A Practical Guide
Related Papers (5)