Features Dimensionality Reduction Approaches for Machine Learning Based Network Intrusion Detection

doi:10.3390/ELECTRONICS8030322

Open Access, Journal Article

Razan Abdulhammed, +4 more

14 Mar 2019

Electronics

Vol. 8, Iss. 3, p. 322

TLDR

The paper proposes a Multi-Class Combined performance metric that compares multi-class and binary classification systems by incorporating FAR, DR, Accuracy, and class distribution parameters. It also develops a uniform-distribution-based balancing approach to handle the imbalanced distribution of minority-class instances in the CICIDS2017 network intrusion dataset.

Abstract:

The security of networked systems has become a critical, universal issue that affects individuals, enterprises, and governments. The rate of attacks against networked systems has increased dramatically, and attackers' tactics continue to evolve. Intrusion detection is one of the defenses against these attacks, and Machine Learning is a common and effective approach for designing Intrusion Detection Systems (IDS). The performance of an IDS improves significantly when its features are more discriminative and representative. This study uses two feature dimensionality reduction approaches: (i) Auto-Encoder (AE), an instance of deep learning, and (ii) Principal Component Analysis (PCA). The low-dimensional features produced by both techniques are then used to build several classifiers for an IDS: Random Forest (RF), Bayesian Network, Linear Discriminant Analysis (LDA), and Quadratic Discriminant Analysis (QDA). The experimental findings with low-dimensional features in binary and multi-class classification show better performance in terms of Detection Rate (DR), F-Measure, False Alarm Rate (FAR), and Accuracy. This work reduces the CICIDS2017 dataset's feature dimensions from 81 to 10 while maintaining a high accuracy of 99.6% in multi-class and binary classification. Furthermore, we propose a Multi-Class Combined performance metric, $Combined_{Mc}$, defined with respect to class distribution, to compare multi-class and binary classification systems by incorporating FAR, DR, Accuracy, and class distribution parameters. In addition, we developed a uniform-distribution-based balancing approach to handle the imbalanced distribution of minority-class instances in the CICIDS2017 network intrusion dataset.
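The evaluation measures named in the abstract can be illustrated with a minimal sketch. It assumes the standard confusion-matrix definitions (DR as recall on the attack class, FAR as the false positive rate), since the abstract does not give the paper's own formulas. The function names `one_vs_rest_counts` and `ids_metrics` and the toy labels are hypothetical, and the proposed $Combined_{Mc}$ metric is not reproduced because its definition does not appear in this text.

```python
def one_vs_rest_counts(y_true, y_pred, positive):
    """Count (tp, fp, tn, fn), treating `positive` as the class of interest."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def safe_div(numerator, denominator):
    """Return 0.0 instead of raising when a class has no instances."""
    return numerator / denominator if denominator else 0.0


def ids_metrics(y_true, y_pred, positive):
    """Standard IDS measures for one class (assumed definitions, see lead-in)."""
    tp, fp, tn, fn = one_vs_rest_counts(y_true, y_pred, positive)
    dr = safe_div(tp, tp + fn)          # Detection Rate: attacks correctly flagged
    far = safe_div(fp, fp + tn)         # False Alarm Rate: benign flagged as attack
    accuracy = safe_div(tp + tn, tp + fp + tn + fn)
    precision = safe_div(tp, tp + fp)
    f_measure = safe_div(2 * precision * dr, precision + dr)
    return {"DR": dr, "FAR": far, "Accuracy": accuracy, "F-Measure": f_measure}


# Toy binary example (hypothetical labels, not CICIDS2017 data).
y_true = ["benign"] * 6 + ["attack"] * 4
y_pred = ["benign"] * 5 + ["attack"] + ["attack"] * 3 + ["benign"]
print(ids_metrics(y_true, y_pred, positive="attack"))
```

For multi-class results, the same function can be called once per attack class, which gives one-vs-rest values that a class-distribution-aware metric could then weight.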

Citations

Network Intrusion Detection System: A systematic study of Machine Learning and Deep Learning approaches

An effective convolutional neural network based on SMOTE and Gaussian mixture model for intrusion detection in imbalanced dataset

Effective Attack Detection in Internet of Medical Things Smart Environment Using a Deep Belief Neural Network

An effective intrusion detection approach using SVM with naïve Bayes feature embedding

Hybrid Deep Learning for Botnet Attack Detection in the Internet-of-Things Networks

References

Auto-Encoding Variational Bayes

Stochastic Backpropagation and Approximate Inference in Deep Generative Models

Random forest in remote sensing: A review of applications and future directions

Dimensionality Reduction: A Comparative Review

Related Papers (5)

Toward Generating a New Intrusion Detection Dataset and Intrusion Traffic Characterization

A detailed analysis of the KDD CUP 99 data set

UNSW-NB15: a comprehensive data set for network intrusion detection systems (UNSW-NB15 network data set)

A Deep Learning Approach for Intrusion Detection Using Recurrent Neural Networks

A Deep Learning Approach to Network Intrusion Detection