Journal ArticleDOI

Fake News Detection on Social Media: A Data Mining Perspective

01 Sep 2017-SIGKDD Explorations (ACM)-Vol. 19, Iss: 1, pp 22-36
TL;DR: A comprehensive review of detecting fake news on social media, covering fake news characterizations based on psychology and social theories, existing algorithms from a data mining perspective, evaluation metrics, and representative datasets.
Abstract: Social media for news consumption is a double-edged sword. On the one hand, its low cost, easy access, and rapid dissemination of information lead people to seek out and consume news from social media. On the other hand, it enables the wide spread of "fake news", i.e., low-quality news with intentionally false information. The extensive spread of fake news has the potential for extremely negative impacts on individuals and society. Therefore, fake news detection on social media has recently become an emerging research topic that is attracting tremendous attention. Fake news detection on social media presents unique characteristics and challenges that make existing detection algorithms from traditional news media ineffective or not applicable. First, fake news is intentionally written to mislead readers into believing false information, which makes it difficult and nontrivial to detect based on news content alone; therefore, we need to include auxiliary information, such as user social engagements on social media, to help make a determination. Second, exploiting this auxiliary information is challenging in and of itself, as users' social engagements with fake news produce data that is big, incomplete, unstructured, and noisy. Because the issue of fake news detection on social media is both challenging and relevant, we conducted this survey to further facilitate research on the problem. In this survey, we present a comprehensive review of detecting fake news on social media, including fake news characterizations based on psychology and social theories, existing algorithms from a data mining perspective, evaluation metrics, and representative datasets. We also discuss related research areas, open problems, and future research directions for fake news detection on social media.
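The survey frames detection as a binary classification task evaluated with standard metrics such as precision, recall, F1, and accuracy. A minimal sketch of computing these, assuming scikit-learn and invented toy labels rather than any dataset from the paper:

```python
# Hedged sketch: fake news detection scored as binary classification,
# with the fake class treated as the positive label. Toy data only.
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0, 0, 1, 0]  # 1 = fake, 0 = real (hypothetical ground truth)
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]  # hypothetical classifier output

print(f"Accuracy:  {accuracy_score(y_true, y_pred):.2f}")
print(f"Precision: {precision_score(y_true, y_pred):.2f}")
print(f"Recall:    {recall_score(y_true, y_pred):.2f}")
print(f"F1:        {f1_score(y_true, y_pred):.2f}")
```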
Citations
Proceedings ArticleDOI
13 May 2020
TL;DR: A model for detecting fake news that examines the accuracy of a report and predicts its authenticity; by extracting features and forming credibility scores from textual information, it builds an ensemble network that learns representations of news reports, authors, and titles simultaneously.
Abstract: Owing to the easy access, rapid growth, and proliferation of information available through regular news media and social media, it has become easy for people to find and consume news. On the other hand, it has become a daunting task to differentiate false information from true information, leading to the widespread circulation of fake news. Fake news can be defined as a type of deceptive journalism: statements crafted to trick and mislead people. The credibility of the social media platforms on which such news is mostly shared is also at stake. Forged news of this kind can have serious negative societal impacts, and its detection has therefore become an emerging area attracting research attention. In this paper, we propose a model for detecting fake news by examining the accuracy of a report and predicting its authenticity. By extracting features and forming credibility scores from the textual information, the model builds an ensemble network that learns representations of news reports, authors, and titles simultaneously. Several machine learning algorithms, including SVM, CNN, LSTM, KNN, and Naive Bayes, were compared, and LSTM achieved the best accuracy at 97%. The performance and effectiveness of the classifiers were evaluated in terms of precision, recall, and F1-score.
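The paper's exact feature extraction and credibility scoring are not spelled out here, so the following is only an illustrative sketch of the classifier-comparison step: TF-IDF features stand in for the unspecified features, the deep models (CNN, LSTM) are omitted to keep the example self-contained, and the headlines are invented.

```python
# Hedged sketch of comparing several classifiers on text features.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import LinearSVC
from sklearn.metrics import f1_score

texts = [  # hypothetical headlines, not from the paper's dataset
    "miracle cure doctors do not want you to know", "aliens endorse candidate",
    "celebrity secretly replaced by clone", "moon landing filmed in basement",
    "city council approves new transit budget", "rain expected this weekend",
    "university opens new research lab", "local team wins regional final",
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]  # 1 = fake, 0 = real

X = TfidfVectorizer().fit_transform(texts)
X_tr, X_te, y_tr, y_te = train_test_split(X, labels, test_size=0.25,
                                          stratify=labels, random_state=0)

for name, clf in [("SVM", LinearSVC()),
                  ("KNN", KNeighborsClassifier(n_neighbors=3)),
                  ("Naive Bayes", MultinomialNB())]:
    clf.fit(X_tr, y_tr)
    print(name, "F1:", f1_score(y_te, clf.predict(X_te)))
```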

32 citations

Proceedings ArticleDOI
01 Nov 2019
TL;DR: This article reviews and analyses many research and survey articles to give readers a concise overview of what fake news is, its different flavours across the news spectrum, its characteristics, and the basics of its identification.
Abstract: One can easily say that in today's world, information, or news to some, is more precious than money itself. This news needs to be authentic, yet it is usually found in adulterated form, leaving a dire need to distinguish real news from any possible fake news. News, being a form of information, can be judged for authenticity against proofs and sources. A human can usually identify real news from fake news through an innate capacity to deduce logic and to spot an outlandish source of an information piece, provided a few trusted sources are available for checking facts and myths. On a real-time basis, however, there is a dire need for software that can nip such 'false news' in the bud, which makes this one of the most actively researched areas today. Primarily a part of Information Retrieval, the area is attracting attention from researchers worldwide who seek a real-time solution to the problem. In this article we review and analyse many research articles, along with many survey articles, to provide readers with a concise idea of what fake news is, its different flavours across the news spectrum, its characteristics, and the basics of its identification. We also cover the methods used by prior researchers in the same field, using a few studies as examples to explain the basics of those methods. Future prospects are included in this article as well, along with the challenges one faces while doing research in this very field.

31 citations

Journal ArticleDOI
TL;DR: A manually assembled and verified dataset of 900 news articles, 500 annotated as real and 400 as fake, enabling the investigation of automated fake news detection approaches in Urdu, along with a baseline classification and its evaluation.
Abstract: The paper presents a new corpus for fake news detection in the Urdu language, along with a baseline classification and its evaluation. With the escalating use of the Internet worldwide and the substantially increasing impact of readily available ambiguous information, the challenge of quickly identifying fake news in digital media across languages becomes more acute. We provide a manually assembled and verified dataset containing 900 news articles, 500 annotated as real and 400 as fake, allowing the investigation of automated fake news detection approaches in Urdu. The news articles in the truthful subset come from legitimate news sources, and their validity has been manually verified. For the fake subset, the known difficulty of finding fake news was solved by hiring professional journalists, native speakers of Urdu, who were instructed to intentionally write deceptive news articles. The dataset covers 5 topics: (i) Business, (ii) Health, (iii) Showbiz, (iv) Sports, and (v) Technology. To establish our Urdu dataset as a benchmark, we performed a baseline classification. We crafted a variety of text-representation feature sets, including word n-grams, character n-grams, functional word n-grams, and their combinations. After applying a variety of feature weighting schemes, we ran a series of classifiers on the train-test split. The results show sizable performance gains by the AdaBoost classifier, with an F1 of 0.87 on the fake class and 0.90 on the real class. We provide results evaluated against different metrics for convenient comparison in future research. The dataset is publicly available for research purposes.
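A rough sketch of the baseline setup the abstract describes: word and character n-gram features combined and fed to an AdaBoost classifier, with per-class F1 reported. The English placeholder texts are hypothetical stand-ins for the Urdu corpus, and the n-gram ranges are assumptions, not the paper's exact configuration.

```python
# Hedged sketch: n-gram feature union + AdaBoost with per-class F1.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import FeatureUnion, make_pipeline
from sklearn.ensemble import AdaBoostClassifier
from sklearn.metrics import f1_score

features = FeatureUnion([
    ("word_ngrams", TfidfVectorizer(analyzer="word", ngram_range=(1, 2))),
    ("char_ngrams", TfidfVectorizer(analyzer="char", ngram_range=(2, 4))),
])
model = make_pipeline(features, AdaBoostClassifier(random_state=0))

train_texts = [  # hypothetical placeholders; the real corpus is Urdu news text
    "reported miracle device cures all illness overnight",
    "secret memo proves celebrity scandal fabricated entirely",
    "ministry announces revised school examination schedule",
    "national team names squad for upcoming series",
]
train_labels = [1, 1, 0, 0]  # 1 = fake, 0 = real
model.fit(train_texts, train_labels)

test_texts = ["leaked letter reveals impossible overnight cure",
              "city hospital inaugurates new cardiology wing"]
test_labels = [1, 0]
pred = model.predict(test_texts)
print("F1 (fake):", f1_score(test_labels, pred, pos_label=1))
print("F1 (real):", f1_score(test_labels, pred, pos_label=0))
```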

31 citations

Proceedings ArticleDOI
06 May 2021
TL;DR: This paper investigates the downstream consequences of social corrections on users' subsequent sharing of other content, finding causal evidence that being publicly corrected by another user shifts one's attention away from accuracy, which presents an important challenge for social correction approaches.
Abstract: A prominent approach to combating online misinformation is to debunk false content. Here we investigate downstream consequences of social corrections on users’ subsequent sharing of other content. Being corrected might make users more attentive to accuracy, thus improving their subsequent sharing. Alternatively, corrections might not improve subsequent sharing - or even backfire - by making users feel defensive, or by shifting their attention away from accuracy (e.g., towards various social factors). We identified N=2,000 users who shared false political news on Twitter, and replied to their false tweets with links to fact-checking websites. We find causal evidence that being corrected decreases the quality, and increases the partisan slant and language toxicity, of the users’ subsequent retweets (but has no significant effect on primary tweets). This suggests that being publicly corrected by another user shifts one's attention away from accuracy - presenting an important challenge for social correction approaches.
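As a loose illustration of the before/after logic, one could compare the mean source-quality score of a user's retweets pre- and post-correction with a paired test. The numbers below are invented, and the study's actual causal analysis is considerably more careful than this sketch.

```python
# Hedged sketch: paired comparison of retweet quality around a correction.
from scipy import stats

pre_quality  = [0.62, 0.55, 0.71, 0.48, 0.66]  # hypothetical source-quality scores
post_quality = [0.51, 0.44, 0.58, 0.39, 0.47]  # same (hypothetical) users, after

t, p = stats.ttest_rel(pre_quality, post_quality)  # paired t-test
print(f"mean pre={sum(pre_quality)/5:.2f}, post={sum(post_quality)/5:.2f}, "
      f"t={t:.2f}, p={p:.3f}")
```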

31 citations

Journal ArticleDOI
TL;DR: In this paper, the authors used the ongoing COVID-19 pandemic as a case study to systematically investigate factors associated with the spread of multi-topic misinformation related to one event on social media based on the heuristic-systematic model.
Abstract: The spread of misinformation on social media has become a major societal issue in recent years. In this work, we used the ongoing COVID-19 pandemic as a case study to systematically investigate factors associated with the spread of multi-topic misinformation related to one event on social media, based on the heuristic-systematic model. Among factors related to systematic processing of information, we discovered that the topic of a misinformation story matters, with conspiracy theories being the most likely to be retweeted. As for factors related to heuristic processing of information, such as when citizens look up to their leaders during such a crisis, our results demonstrated that the behavior of a political leader, former US President Donald J. Trump, may have nudged people's sharing of COVID-19 misinformation. The outcomes of this study help social media platforms and users better understand and prevent the spread of misinformation on social media.
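One hypothetical way to operationalize the topic analysis is a logistic regression of retweet outcomes on topic indicator features, where a positive coefficient on the conspiracy indicator would mirror the finding above. The feature names and data here are invented for illustration, not taken from the study.

```python
# Hedged sketch: retweet likelihood regressed on topic indicators.
import numpy as np
from sklearn.linear_model import LogisticRegression

# columns: [is_conspiracy, is_fake_cure, is_fake_policy] (hypothetical topics)
X = np.array([[1, 0, 0], [1, 0, 0], [0, 1, 0], [0, 1, 0], [0, 0, 1], [0, 0, 1]])
y = np.array([1, 1, 1, 0, 0, 0])  # 1 = the misinformation tweet was retweeted

model = LogisticRegression().fit(X, y)
for name, coef in zip(["conspiracy", "fake_cure", "fake_policy"], model.coef_[0]):
    print(f"{name}: {coef:+.2f}")  # sign indicates association with retweeting
```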

31 citations

References
Journal ArticleDOI
28 May 2015-Nature
TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
Abstract: Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
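As a minimal sketch of the core idea, backpropagation adjusting the internal parameters of a multi-layer model, here is a tiny two-layer network trained on XOR in plain NumPy. It bears no relation in scale to the systems the paper surveys.

```python
# Hedged sketch: two-layer network, parameters updated by backpropagation.
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)  # toy XOR inputs
y = np.array([[0], [1], [1], [0]], dtype=float)

W1, W2 = rng.normal(size=(2, 8)), rng.normal(size=(8, 1))
for _ in range(5000):
    h = np.tanh(X @ W1)                      # layer 1: hidden representation
    out = 1 / (1 + np.exp(-(h @ W2)))        # layer 2: sigmoid output
    grad_out = out - y                       # dLoss/d(pre-sigmoid), cross-entropy
    grad_h = (grad_out @ W2.T) * (1 - h**2)  # backprop through tanh layer
    W2 -= 0.1 * h.T @ grad_out               # gradient step on layer 2 weights
    W1 -= 0.1 * X.T @ grad_h                 # gradient step on layer 1 weights

print(np.round(out.ravel(), 2))  # should approach [0, 1, 1, 0]
```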

46,982 citations

Book ChapterDOI
TL;DR: In this paper, the authors present a critique of expected utility theory as a descriptive model of decision making under risk, and develop an alternative model, called prospect theory, in which value is assigned to gains and losses rather than to final assets and in which probabilities are replaced by decision weights.
Abstract: This paper presents a critique of expected utility theory as a descriptive model of decision making under risk, and develops an alternative model, called prospect theory. Choices among risky prospects exhibit several pervasive effects that are inconsistent with the basic tenets of utility theory. In particular, people underweight outcomes that are merely probable in comparison with outcomes that are obtained with certainty. This tendency, called the certainty effect, contributes to risk aversion in choices involving sure gains and to risk seeking in choices involving sure losses. In addition, people generally discard components that are shared by all prospects under consideration. This tendency, called the isolation effect, leads to inconsistent preferences when the same choice is presented in different forms. An alternative theory of choice is developed, in which value is assigned to gains and losses rather than to final assets and in which probabilities are replaced by decision weights. The value function is normally concave for gains, commonly convex for losses, and is generally steeper for losses than for gains. Decision weights are generally lower than the corresponding probabilities, except in the range of low probabilities. Overweighting of low probabilities may contribute to the attractiveness of both insurance and gambling. Expected utility theory has dominated the analysis of decision making under risk. It has been generally accepted as a normative model of rational choice (24), and widely applied as a descriptive model of economic behavior, e.g. (15, 4). Thus, it is assumed that all reasonable people would wish to obey the axioms of the theory (47, 36), and that most people actually do, most of the time. The present paper describes several classes of choice problems in which preferences systematically violate the axioms of expected utility theory. In the light of these observations we argue that utility theory, as it is commonly interpreted and applied, is not an adequate descriptive model, and we propose an alternative account of choice under risk.
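The value function's qualitative shape described above (concave for gains, convex for losses, steeper for losses) can be sketched with a power-form parametrization. The specific functional form and the coefficients below come from the authors' later cumulative prospect theory work and are used here only as an illustration.

```python
# Hedged sketch of a prospect-theory value function.
# alpha/beta/lam values are from Tversky & Kahneman (1992), not this paper.
def value(x, alpha=0.88, beta=0.88, lam=2.25):
    """Value of a gain/loss x relative to the reference point (x = 0)."""
    return x ** alpha if x >= 0 else -lam * ((-x) ** beta)

print(value(100))   # subjective value of a 100-unit gain (~57.5)
print(value(-100))  # the matching loss looms larger (~-129.4, about 2.25x)
```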

35,067 citations

Book ChapterDOI
09 Jan 2004
TL;DR: An outline of a theory of intergroup conflict and some preliminary data relating to the theory are presented in this chapter. The analysis, however, is limited to cases where the salient dimensions of intergroup differentiation involve scarce resources.
Abstract: This chapter presents an outline of a theory of intergroup conflict and some preliminary data relating to the theory. Much of the work on the social psychology of intergroup relations has focused on patterns of individual prejudices and discrimination and on the motivational sequences of interpersonal interaction. The intensity of explicit intergroup conflicts of interests is closely related in human cultures to the degree of opprobrium attached to the notion of "renegade" or "traitor." The basic and highly reliable finding is that the trivial, ad hoc intergroup categorization leads to in-group favoritism and discrimination against the out-group. Many orthodox definitions of "social groups" are unduly restrictive when applied to the context of intergroup relations. The equation of social competition and intergroup conflict rests on the assumptions concerning an "ideal type" of social stratification in which the salient dimensions of intergroup differentiation are those involving scarce resources.

14,812 citations

Journal ArticleDOI
TL;DR: Cumulative prospect theory applies to uncertain as well as to risky prospects with any number of outcomes and allows different weighting functions for gains and for losses; two principles, diminishing sensitivity and loss aversion, are invoked to explain the characteristic curvature of the value function and the weighting functions.
Abstract: We develop a new version of prospect theory that employs cumulative rather than separable decision weights and extends the theory in several respects. This version, called cumulative prospect theory, applies to uncertain as well as to risky prospects with any number of outcomes, and it allows different weighting functions for gains and for losses. Two principles, diminishing sensitivity and loss aversion, are invoked to explain the characteristic curvature of the value function and the weighting functions. A review of the experimental evidence and the results of a new experiment confirm a distinctive fourfold pattern of risk attitudes: risk aversion for gains and risk seeking for losses of high probability; risk seeking for gains and risk aversion for losses of low probability. Expected utility theory reigned for several decades as the dominant normative and descriptive model of decision making under uncertainty, but it has come under serious question in recent years. There is now general agreement that the theory does not provide an adequate description of individual choice: a substantial body of evidence shows that decision makers systematically violate its basic tenets. Many alternative models have been proposed in response to this empirical challenge (for reviews, see Camerer, 1989; Fishburn, 1988; Machina, 1987). Some time ago we presented a model of choice, called prospect theory, which explained the major violations of expected utility theory in choices between risky prospects with a small number of outcomes (Kahneman and Tversky, 1979; Tversky and Kahneman, 1986). The key elements of this theory are 1) a value function that is concave for gains, convex for losses, and steeper for losses than for gains, and 2) a nonlinear transformation of the probability scale, which overweights small probabilities and underweights moderate and high probabilities.
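The nonlinear probability weighting can be sketched with the inverse-S-shaped one-parameter form used in the paper, w(p) = p^γ / (p^γ + (1-p)^γ)^(1/γ); γ = 0.61 is the paper's estimate for gains. Small probabilities come out overweighted and large ones underweighted, matching the fourfold pattern described above.

```python
# Hedged sketch of the cumulative-prospect-theory weighting function.
def weight(p, gamma=0.61):
    """Decision weight for probability p (gamma = 0.61 for gains per the paper)."""
    return p**gamma / (p**gamma + (1 - p)**gamma) ** (1 / gamma)

for p in (0.01, 0.1, 0.5, 0.9, 0.99):
    print(f"w({p}) = {weight(p):.3f}")  # small p overweighted, large p underweighted
```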

13,433 citations
