Detection of Multiple Identity Manipulation in Collaborative Projects

doi:10.1145/2872518.2890586

Proceedings Article

- pp 955-960

TLDR

This article proposes a set of features, building on previous literature, for automatically detecting sockpuppet accounts created on EnWiki. It compares several machine learning algorithms and shows that the new features and training data make it possible to detect 99% of fake accounts, improving on previous results from the literature.

Abstract:

Various techniques are used to manipulate users in online social network (OSN) environments, such as social spam, identity theft, spear phishing, and Sybil attacks. In this article, we analyze the behavior of multiple fake accounts that try to bypass OSN regulation. Within social media manipulation detection, we focus on the special case of multiple-identity accounts (sockpuppets) created on English Wikipedia (EnWiki). We set up a complete methodology spanning data extraction from EnWiki through the training and testing of several machine learning algorithms on the selected data. As part of this methodology, we propose a set of features that builds on previous literature for the automatic detection of sockpuppet accounts created on EnWiki. We apply them to a database of 10,000 user accounts. The results compare several machine learning algorithms and show that our new features and training data make it possible to detect 99% of fake accounts, improving on previous results from the literature.
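The train-and-test step described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the three per-account features, the synthetic data, and the choice of a plain k-nearest-neighbor classifier are hypothetical and are not the paper's feature set, dataset, or results. The sketch only shows how a detection rate (the share of fake accounts that are caught) could be computed on a held-out test split.

```python
import math
import random

# Hypothetical per-account features (NOT the paper's feature set):
# (edits_per_day, fraction_of_edits_reverted, days_from_signup_to_first_edit)

def make_accounts(n, sockpuppet, rng):
    """Generate n synthetic labeled accounts for one class (illustrative only)."""
    rows = []
    for _ in range(n):
        if sockpuppet:
            feats = (rng.gauss(12.0, 2.0), rng.gauss(0.6, 0.1), rng.gauss(0.5, 0.3))
        else:
            feats = (rng.gauss(3.0, 2.0), rng.gauss(0.1, 0.1), rng.gauss(5.0, 2.0))
        rows.append((feats, 1 if sockpuppet else 0))
    return rows

def knn_predict(train, x, k=5):
    """Label x by majority vote among its k nearest training accounts."""
    nearest = sorted(train, key=lambda row: math.dist(row[0], x))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if votes * 2 > k else 0

def evaluate(train, test, k=5):
    """Count confusion-matrix cells and derive detection rate and accuracy."""
    tp = fp = fn = tn = 0
    for x, y in test:
        pred = knn_predict(train, x, k)
        if y == 1 and pred == 1:
            tp += 1
        elif y == 1 and pred == 0:
            fn += 1
        elif y == 0 and pred == 1:
            fp += 1
        else:
            tn += 1
    return {
        "tp": tp, "fp": fp, "fn": fn, "tn": tn,
        # Detection rate = recall on the sockpuppet class.
        "detection_rate": tp / (tp + fn) if tp + fn else 0.0,
        "accuracy": (tp + tn) / len(test),
    }

rng = random.Random(0)  # fixed seed so the run is repeatable
data = make_accounts(200, True, rng) + make_accounts(200, False, rng)
rng.shuffle(data)
split = int(0.8 * len(data))  # 80% train, 20% test
train, test = data[:split], data[split:]
metrics = evaluate(train, test)
print(metrics)
```

To compare several algorithms as the abstract describes, the same `evaluate` loop could be run with a different prediction function for each classifier while keeping the train/test split fixed.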

Citations

Proceedings Article

An Army of Me: Sockpuppets in Online Discussion Communities

Srijan Kumar, +3 more

TL;DR: This article presents a data-driven view of deception in online discussion communities and a taxonomy of deceptive behavior in which sockpuppets vary in their deceptiveness (whether they pretend to be different users) and in their supportiveness, and it applies this analysis to the automatic detection of sockpuppets.

Journal Article

Machine Learning: A Review on Binary Classification

Roshan Kumari, +1 more

- 15 Feb 2017 -

International Journal of Computer Applic...

TL;DR: This review surveys various approaches to binary classification and notes that sockpuppet detection is itself a binary classification problem.

Proceedings Article

Privacy, Anonymity, and Perceived Risk in Open Collaboration: A Study of Tor Users and Wikipedians

Andrea Forte, +2 more

TL;DR: This qualitative study examines privacy practices and concerns among contributors to open collaboration projects, drawing on interviews with users of the anonymity network Tor who also contribute to online projects and with Wikipedia editors who are concerned about their privacy.

Proceedings Article

An Army of Me: Sockpuppets in Online Discussion Communities

Srijan Kumar, +3 more

- 21 Mar 2017 -

arXiv: Social and Information Networks

TL;DR: Sockpuppets differ from ordinary users in terms of their posting behavior, linguistic traits, as well as social network structure, and this analysis suggests a taxonomy of deceptive behavior in discussion communities.

Proceedings Article

Antisocial Behavior on the Web: Characterization and Detection

Srijan Kumar, +2 more

TL;DR: This tutorial presents state-of-the-art research on two aspects of antisocial behavior: characterizing its behavioral properties and developing algorithms to identify and predict it.

References

Journal Article

Random Forests

Leo Breiman

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.

Journal Article

Support-Vector Networks

Corinna Cortes, +1 more

- 15 Sep 1995 -

Machine Learning

TL;DR: The high generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated, and the performance of the support-vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

Journal Article

A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting

Yoav Freund, +1 more

TL;DR: The model studied can be interpreted as a broad, abstract extension of the well-studied on-line prediction model to a general decision-theoretic setting, and it is shown that the multiplicative weight-update Littlestone-Warmuth rule can be adapted to this model, yielding bounds that are slightly weaker in some cases, but applicable to a considerably more general class of learning problems.

Book Chapter

The Sybil Attack

John R. Douceur

TL;DR: It is shown that, without a logically centralized authority, Sybil attacks are always possible except under extreme and unrealistic assumptions of resource parity and coordination among entities.

Journal Article

An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression

Naomi Altman

- 01 Aug 1992 -

The American Statistician

TL;DR: Kernel and nearest-neighbor regression estimators are local versions of univariate location estimators, and so they can readily be introduced to beginning students and consulting clients who are familiar with such summaries as the sample mean and median.
