scispace - formally typeset
Open Access

Rule-Based On-the-fly Web Spambot Detection Using Action Strings

Reads0
Chats0
TLDR
In this article, a rule-based web usage behavior action string that can be analyzed using Trie data structures to detect web spambots is proposed to eliminate spam in web 2.0 applications.
Abstract
spambots are a new type of internet robot that spread spam content through Web 2.0 applications like online discussion boards, blogs, wikis, social networking platforms etc. These robots are intelligently designed to act like humans in order to fool safeguards and other users. Such spam content not only wastes valuable resources and time but also may mislead users with unsolicited content. Spam content typically intends to misinform users (scams), generate traffic, make sales (marketing/advertising), and occasionally compromise parties, people or systems by spreading spyware or malwares. Current countermeasures do not effectively identify and prevent web spambots. Proactive measures to deter spambots from entering a site are limited to question / response scenarios. The remaining efforts then focus on spam content identification as a passive activity. Spammers have evolved their techniques to bypass existing anti-spam filters. In this paper, we describe a rule-based web usage behaviour action string that can be analysed using Trie data structures to detect web spambots. Our experimental results show the proposed system is successful for on-the-fly classification of web spambots hence eliminating spam in web 2.0 applications.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

An integrated method for real time and offline web robot detection

TL;DR: This paper presents a novel detection approach that relies on the differences in the resource request patterns of web robots and humans, and rationalizes why differences in resourcerequest patterns are expected to remain intrinsic to robots and human despite the continuous evolution of their traffic.

Overview of Web Spammer Detection

Mo Qian, +1 more
TL;DR: An overview of Web spammer detection is presented, along with a comparison over the difference between traditional and burgeoning spammer Detection approaches, and the prospects for future development and suggestions for possible extensions are emphasized.
Journal ArticleDOI

A design of a proxy inspired from human immune system to detect SQL Injection and Cross-Site Scripting

Erwin Adi
- 01 Jan 2012 - 
TL;DR: Wines is proposed, named after Web Immune Systems, a design for a proxy that learns variations of the attack strings from behaviors of malicious users for the purpose of detecting those mutated attack strings.
Dissertation

Personal Email Spam Filtering with Minimal User Interaction

Mona Mojdeh
TL;DR: This work describes new approaches to solve the problem of building a personal spam filter that requires minimal user feedback, and shows that learning filters with no user input can substantially improve the results of open-source and industry-leading commercial filters that employ no user-specific training.
References
More filters
Proceedings ArticleDOI

Identifying web spam with user behavior analysis

TL;DR: A novel spam detection framework is proposed that can detect unknown spam types and newly-appeared spam with the help of user behavior analysis and preliminary experiments show the effectiveness of the proposed features and detection framework.
Proceedings Article

Is Britney Spears Spam

TL;DR: The aim is to redefine spam and the role of the spam filter in the context of Social Networking Services (SNS) and develop a research prototype that categorizes senders into broader categories than spam/not spam using features unique to SNS.
Proceedings ArticleDOI

Toward spam 2.0: An evaluation of Web 2.0 anti-spam methods

TL;DR: Analysis of current anti-spam methods in Web 2.0 for spam detection and prevention against the proposed evaluation framework shows that the need for more robust methods which are prevention based, unsupervised and do not increase user and system interaction complexity is highly demanded.
Book ChapterDOI

HoneySpam 2.0: Profiling Web Spambot Behaviour

TL;DR: By profiling web spambots, this paper provides the foundation for identifying such bots and preventing and filtering web spam content and describes the design of HoneySpam 2.0.
Book ChapterDOI

Web Spam Identification with User Browsing Graph

TL;DR: A user browsing graph is constructed based on users' Web access log and link analysis algorithms on this graph are adopted to identify Web spam pages and shows that algorithms performed on the constructed graph outperforms those on the original graph.
Related Papers (5)