Bot or Not? A Case Study on Bot Recognition from Web Session Logs

doi:10.1007/978-3-319-95095-2_19

Book ChapterDOI

Bot or Not? A Case Study on Bot Recognition from Web Session Logs

- pp 197-206

TLDR

A study of web usage logs is reported on to verify whether it is possible to achieve good recognition rates in the task of distinguishing between human users and automated bots using computational intelligence techniques.

Abstract:

This work reports on a study of web usage logs to verify whether it is possible to achieve good recognition rates in the task of distinguishing between human users and automated bots using computational intelligence techniques. Two problem statements are given, offline (for completed sessions) and on-line (for sequences of individual HTTP requests). The former is solved with several standard computational intelligence tools. For the second, a learning version of Wald’s sequential probability ratio test is used.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Sequential Tests of Statistical Hypotheses.

Rasul A. Khan, +1 more

- 01 Mar 1977 -

Journal of the American Statistical Asso...

Journal ArticleDOI

Bot recognition in a Web store: An approach based on unsupervised learning

Stefano Rovetta, +2 more

- 01 May 2020 -

Journal of Network and Computer Applicat...

TL;DR: Results demonstrate that the classification based on unsupervised learning is very efficient, achieving a similar performance level as the fully supervised classification, and is an experimental indication that the bot recognition problem can be successfully dealt with using methods that are less sensitive to mislabelled data or missing labels.

...read moreread less

Journal ArticleDOI

Identifying legitimate Web users and bots with different traffic profiles — an Information Bottleneck approach

Grażyna Suchacka, +1 more

- 07 Jun 2020 -

Knowledge Based Systems

TL;DR: This paper proposes a novel approach to identify various profiles of bots and humans which combines feature selection and unsupervised learning of HTTP-level traffic patterns to develop a user session classification model.

...read moreread less

Proceedings ArticleDOI

Online Web Bot Detection Using a Sequential Classification Approach

Alberto Cabri, +3 more

TL;DR: The present approach uses deep neural networks combined with Wald's Sequential Probability Ratio Test to express the relationship between subsequent HTTP requests in an ongoing session and to assess the likelihood of each session being generated by a bot or human before it ends.

...read moreread less

Proceedings ArticleDOI

Towards a framework for detecting advanced Web bots

Christos Iliou, +5 more

TL;DR: The proposed framework has significant ability to detect Web bots that do not try to hide their bot identity using HTTP Web logs, and balanced accuracy in a false-positive intolerant server > 95%).

...read moreread less

References

PDF

Open Access

More filters

Book

Deep Learning

Ian Goodfellow, +2 more

TL;DR: Deep learning as mentioned in this paper is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts, and it is used in many applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames.

...read moreread less

Journal ArticleDOI

Support-Vector Networks

Corinna Cortes, +1 more

- 15 Sep 1995 -

Machine Learning

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

...read moreread less

Some methods for classification and analysis of multivariate observations

James B. MacQueen

TL;DR: The k-means algorithm as mentioned in this paper partitions an N-dimensional population into k sets on the basis of a sample, which is a generalization of the ordinary sample mean, and it is shown to give partitions which are reasonably efficient in the sense of within-class variance.

...read moreread less

Journal ArticleDOI

A possibilistic approach to clustering

Raghu Krishnapuram, +1 more

- 01 May 1993 -

IEEE Transactions on Fuzzy Systems

TL;DR: An appropriate objective function whose minimum will characterize a good possibilistic partition of the data is constructed, and the membership and prototype update equations are derived from necessary conditions for minimization of the criterion function.

...read moreread less

Book ChapterDOI

Sequential Tests of Statistical Hypotheses

Abraham Wald

- 01 Jun 1945 -

Annals of Mathematical Statistics

TL;DR: A sequential test of a statistical hypothesis is defined as any statistical test procedure which gives a specific rule, at any stage of the experiment (at the n-th trial for each integral value of n), for making one of the following three decisions: (1) to accept the hypothesis being tested (null hypothesis), (2) to reject the null hypothesis, (3) to continue the experiment by making an additional observation.

...read moreread less