Separating Facts from Fiction: Linguistic Models to Classify Suspicious and Trusted News Posts on Twitter

doi:10.18653/V1/P17-2102

Open AccessProceedings ArticleDOI

Separating Facts from Fiction: Linguistic Models to Classify Suspicious and Trusted News Posts on Twitter

- Vol. 2, pp 647-653

TLDR

This work builds predictive models to classify 130 thousand news posts as suspicious or verified, and predict four sub-types of suspicious news – satire, hoaxes, clickbait and propaganda, and shows that neural network models trained on tweet content and social network interactions outperform lexical models.

Abstract:

Pew research polls report 62 percent of U.S. adults get news on social media (Gottfried and Shearer, 2016). In a December poll, 64 percent of U.S. adults said that “made-up news” has caused a “great deal of confusion” about the facts of current events (Barthel et al., 2016). Fabricated stories in social media, ranging from deliberate propaganda to hoaxes and satire, contributes to this confusion in addition to having serious effects on global stability. In this work we build predictive models to classify 130 thousand news posts as suspicious or verified, and predict four sub-types of suspicious news – satire, hoaxes, clickbait and propaganda. We show that neural network models trained on tweet content and social network interactions outperform lexical models. Unlike previous work on deception detection, we find that adding syntax and grammar features to our models does not improve performance. Incorporating linguistic features improves classification results, however, social interaction features are most informative for finer-grained separation between four types of suspicious news posts.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Big Data and quality data for fake news and misinformation detection

Fatemeh Torabi Asr, +1 more

- 23 May 2019 -

Big Data & Society

TL;DR: The full text of the news articles is made available, together with veracity labels previously assigned based on manual assessment of the articles’ truth content, for building a system to automatically detect misinformation in news.

...read moreread less

Journal ArticleDOI

False News On Social Media: A Data-Driven Survey

Francesco Pierri, +1 more

TL;DR: The aim of this survey is to offer a comprehensive study on the recent advances in terms of detection, characterization and mitigation of false news that propagate on social media, as well as the challenges and the open questions that await future research on the field.

...read moreread less

Journal ArticleDOI

The Web of False Information: Rumors, Fake News, Hoaxes, Clickbait, and Various Other Shenanigans

Savvas Zannettou, +3 more

- 10 Apr 2018 -

arXiv: Social and Information Networks

TL;DR: A typology of the Web’s false-information ecosystem, composed of various types of false- information, actors, and their motives is provided, which pays particular attention to political false information as it can have dire consequences to the community and previous work shows that this type of false information propagates faster and further when compared to other types offalse information.

...read moreread less

Posted Content

Learning Hierarchical Discourse-level Structure for Fake News Detection

Hamid Karimi, +1 more

- 27 Feb 2019 -

arXiv: Computation and Language

TL;DR: HDSF learns and constructs a discourse-level structure for fake/real news articles in an automated and data-driven manner and identifies insightful structure-related properties, which can explain the discovered structures and boost the understating of fake news.

...read moreread less

Proceedings ArticleDOI

Explainable Machine Learning for Fake News Detection

Julio Cesar dos Reis, +4 more

TL;DR: A highly exploratory investigation that produced hundreds of thousands of models from a large and diverse set of features found a strong link between features and model predictions, showing that some features are clearly tailored for detecting certain types of fake news, thus evidencing that different combinations of features cover a specific region of the fake news space.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings ArticleDOI

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.

...read moreread less

Large-scale Video Classiﬁcation with Convolutional Neural Networks

Andrej Karpathy, +5 more

Proceedings ArticleDOI

Large-Scale Video Classification with Convolutional Neural Networks

Andrej Karpathy, +5 more

TL;DR: This work studies multiple approaches for extending the connectivity of a CNN in time domain to take advantage of local spatio-temporal information and suggests a multiresolution, foveated architecture as a promising way of speeding up the training.

...read moreread less

Journal ArticleDOI

Liberals and Conservatives Rely on Different Sets of Moral Foundations

Jesse Graham, +2 more

- 01 May 2009 -

Journal of Personality and Social Psycho...

TL;DR: Across 4 studies using multiple methods, liberals consistently showed greater endorsement and use of the Harm/care and Fairness/reciprocity foundations compared to the other 3 foundations, whereas conservatives endorsed and used the 5 foundations more equally.

...read moreread less