Open Access | Proceedings Article

Toward Generating a New Intrusion Detection Dataset and Intrusion Traffic Characterization

TLDR
A reliable dataset is produced that contains benign traffic and seven common attack network flows, meets real-world criteria, and is publicly available; the performance of a comprehensive set of network traffic features and machine learning algorithms is then evaluated to indicate the best set of features for detecting certain attack categories.
Abstract
With the exponential growth in the size of computer networks and developed applications, the potential damage that attacks can cause is increasing significantly. Meanwhile, Intrusion Detection Systems (IDSs) and Intrusion Prevention Systems (IPSs) are among the most important defense tools against sophisticated and ever-growing network attacks. Because adequate datasets are lacking, anomaly-based approaches in intrusion detection systems are difficult to deploy, analyze, and evaluate accurately. A number of datasets, such as DARPA98, KDD99, ISC2012, and ADFA13, have been used by researchers to evaluate the performance of their proposed intrusion detection and intrusion prevention approaches. Based on our study of eleven datasets available since 1998, many are out of date and unreliable. Some of these datasets lack traffic diversity and volume, some do not cover a variety of attacks, and others anonymize packet information and payload, which prevents them from reflecting current trends, or lack a feature set and metadata. This paper produces a reliable, publicly available dataset that contains benign traffic and seven common attack network flows and meets real-world criteria. The paper then evaluates the performance of a comprehensive set of network traffic features and machine learning algorithms to indicate the best set of features for detecting certain attack categories.
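The idea of indicating the best features per attack category can be illustrated with a minimal sketch. The abstract does not say which selection method the paper uses, so the sketch swaps in a simple Fisher-style separation score (squared gap between class means divided by the summed variances) as a stand-in. The feature names (`flow_duration`, `fwd_packets`, `mean_packet_len`), the labels, and every value in `FLOWS` are hypothetical and invented for illustration; none of them come from the dataset.

```python
from statistics import mean, pvariance

# Hypothetical toy flows: (features, label). All values are invented.
FLOWS = [
    ({"flow_duration": 1.0, "fwd_packets": 10, "mean_packet_len": 400}, "benign"),
    ({"flow_duration": 2.0, "fwd_packets": 12, "mean_packet_len": 600}, "benign"),
    ({"flow_duration": 1.5, "fwd_packets": 8, "mean_packet_len": 500}, "benign"),
    ({"flow_duration": 60.0, "fwd_packets": 11, "mean_packet_len": 450}, "dos"),
    ({"flow_duration": 62.0, "fwd_packets": 9, "mean_packet_len": 550}, "dos"),
    ({"flow_duration": 1.0, "fwd_packets": 1, "mean_packet_len": 460}, "portscan"),
    ({"flow_duration": 2.0, "fwd_packets": 2, "mean_packet_len": 540}, "portscan"),
]


def fisher_score(attack_values, benign_values):
    """Squared gap between class means over summed variances (higher = more separable)."""
    gap = mean(attack_values) - mean(benign_values)
    spread = pvariance(attack_values) + pvariance(benign_values)
    if spread == 0:
        # Both classes are constant: perfectly separable if the means differ.
        return float("inf") if gap != 0 else 0.0
    return gap * gap / spread


def rank_features(flows, attack_label, benign_label="benign"):
    """Return feature names ordered from most to least discriminative for one attack."""
    names = sorted(flows[0][0])
    scores = {}
    for name in names:
        attack = [feats[name] for feats, label in flows if label == attack_label]
        benign = [feats[name] for feats, label in flows if label == benign_label]
        scores[name] = fisher_score(attack, benign)
    return sorted(names, key=lambda n: scores[n], reverse=True)


if __name__ == "__main__":
    for attack in ("dos", "portscan"):
        print(attack, rank_features(FLOWS, attack))
```

Scoring each attack category against benign traffic separately, rather than ranking features once for all attacks, reflects the point that different attack types may be best detected by different features. A real evaluation would use the labeled flows and a proper learning algorithm instead of this toy score.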

