Home
/
Authors
/
I. Sumaiya Thaseen

Author

I. Sumaiya Thaseen

Bio: I. Sumaiya Thaseen is an academic researcher from VIT University. The author has contributed to research in topics: Feature selection & Intrusion detection system. The author has an hindex of 8, co-authored 23 publications receiving 190 citations.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Intrusion detection model using fusion of PCA and optimized SVM

[...]

I. Sumaiya Thaseen¹, Ch. Aswani Kumar¹•Institutions (1)

VIT University¹

01 Nov 2014

TL;DR: A novel method of integrating principal component analysis (PCA) and support vector machine (SVM) by optimizing the kernel parameters using automatic parameter selection technique is proposed, which reduces the training and testing time to identify intrusions thereby improving the accuracy.

...read moreread less

Abstract: Intrusion detection systems (IDS) play a major role in detecting the attacks that occur in the computer or networks. Anomaly intrusion detection models detect new attacks by observing the deviation from profile. However there are many problems in the traditional IDS such as high false alarm rate, low detection capability against new network attacks and insufficient analysis capacity. The use of machine learning for intrusion models automatically increases the performance with an improved experience. This paper proposes a novel method of integrating principal component analysis (PCA) and support vector machine (SVM) by optimizing the kernel parameters using automatic parameter selection technique. This technique reduces the training and testing time to identify intrusions thereby improving the accuracy. The proposed method was tested on KDD data set. The datasets were carefully divided into training and testing considering the minority attacks such as U2R and R2L to be present in the testing set to identify the occurrence of unknown attack. The results indicate that the proposed method is successful in identifying intrusions. The experimental results show that the classification accuracy of the proposed method outperforms other classification techniques using SVM as the classifier and other dimensionality reduction or feature selection techniques. Minimum resources are consumed as the classifier input requires reduced feature set and thereby minimizing training and testing overhead time.

...read moreread less

57 citations

Journal Article•DOI•

Integrated Intrusion Detection Model Using Chi-Square Feature Selection and Ensemble of Classifiers

[...]

I. Sumaiya Thaseen¹, Ch. Aswani Kumar¹, Amir Ahmad²•Institutions (2)

VIT University¹, College of Information Technology²

01 Apr 2019-Arabian Journal for Science and Engineering

TL;DR: The aim of this paper is to identify the critical features required in the construction of intrusion detection model, thereby achieving the maximum accuracy and to utilize an ensemble approach of classifiers with minimum complexity to overcome the issues in the existing ensemble-based intrusion detection models.

...read moreread less

Abstract: Intrusion detection system is a device or software application that monitors a network of systems to identify any malicious activity or policy violations. In order to identify intrusions or normal activity, IDS would consider different network-related features such as source address, protocol and flag. The major challenge for any intrusion detection model is to achieve maximum accuracy with minimal false alarms. The aim of this paper is to identify the critical features required in the construction of intrusion detection model, thereby achieving the maximum accuracy. The model utilizes an ensemble approach of classifiers with minimum complexity to overcome the issues in the existing ensemble-based intrusion detection models. In this paper, Chi-square feature selection and the ensemble of classifiers such as support vector machine (SVM), modified Naive Bayes (MNB) and LPBoost are utilized to develop an intrusion detection model. The motivation for selecting Chi-square feature selection is that they rank the features based on the statistical significance test and consider only those features that are dependent on the class label. Supervised classifiers are highly consistent and produce precise results as the use of training data improves the ability to distinguish between classes with similar features. Experimental results indicate high accuracy in comparison with base classifiers by the ensemble of LPBoost. As there is a huge class imbalance present in the network traffic, the prediction of the class label by a majority voting of SVM, MNB and LPBoost is an optimal solution in preference to reliance on a single classifier.

...read moreread less

56 citations

Journal Article•DOI•

An integrated intrusion detection system using correlation‐based attribute selection and artificial neural network

[...]

I. Sumaiya Thaseen¹, J. Saira Banu¹, K. Lavanya¹, Muhammad Rukunuddin Ghalib¹, Kumar Abhishek², Kumar Abhishek¹ - Show less +2 more•Institutions (2)

VIT University¹, National Institute of Technology, Patna²

01 Feb 2021

TL;DR: A correlation‐based feature selection integrated with neural network for identifying anomalies and the results show that the proposed model is superior in terms of accuracy, sensitivity, and specificity in comparison with some of the state‐of‐the‐art techniques.

...read moreread less

Abstract: Serious concerns regarding vulnerability and security have been raised as a result of the constant growth of computer networks. Intrusion detection systems (IDS) have been adopted by netwo...

...read moreread less

38 citations

Proceedings Article•DOI•

A hybrid anomaly detection model using G-LDA

[...]

Bhavesh Kasliwal¹, Shraey Bhatia¹, Shubham Saini¹, I. Sumaiya Thaseen¹, C. Aswani Kumar¹ - Show less +1 more•Institutions (1)

VIT University¹

27 Mar 2014

TL;DR: A hybrid technique integrating Latent Dirichlet Allocation and genetic algorithm namely the G-LDA process, which has a better accuracy for detecting known and unknown attacks and a low false positive rate is proposed.

...read moreread less

Abstract: Anomaly detection is one of the important challenges of network security associated today. We present a novel hybrid technique called G-LDA to identify the anomalies in network traffic. We propose a hybrid technique integrating Latent Dirichlet Allocation and genetic algorithm namely the G-LDA process. Furthermore, feature selection plays an important role in identifying the subset of attributes for determining the anomaly packets. The proposed method is evaluated by carrying out experiments on KDDCUP'99 dataset. The experimental results reveal that the hybrid technique has a better accuracy for detecting known and unknown attacks and a low false positive rate.

...read moreread less

27 citations

Journal Article•DOI•

Terrain Mapping of LandSat8 Images using MNF and Classifying Soil Properties using Ensemble Modelling

[...]

K. Lavanya¹, Ahmed J. Obaid², I. Sumaiya Thaseen¹, Kumar Abhishek³, Khushboo Saboo¹, Rucha Paturkar¹ - Show less +2 more•Institutions (3)

VIT University¹, University of Kufa², National Institute of Technology, Patna³

01 Apr 2020-International Journal of Nonlinear Analysis and Applications

TL;DR: A major aim of this paper is to design a robust technique for extracting, transforming Landsat images to numerical data and pre-processing the data for classifying the soil property.

...read moreread less

Abstract: Traditional technique for determining the soil texture and other soil properties is performed in laboratory which is a time consuming task. In this paper, machine learning algorithms are deployed to classify the soil texture and its properties without any intervention of laboratory equipment using the satellite images recorded by Landsat 8. These images are used to extract the terrain properties of the region which is integrated with weather data for the specific region and the vegetation index which are the major factors affecting the soil condition. A major aim of this paper is to design a robust technique for extracting, transforming Landsat images to numerical data and pre-processing the data for classifying the soil property. Minimum Noise Fraction (MNF) is utilized to segregate and remove noise from the Landsat images for subsequent processing. A significant amount of noise is present in the raw data which affects the accuracy of the analysis. Terrain features are extracted after noise removal from the MNF transformed images and merged with the weather data, and vegetation index for a period of time and then classified using voting classifier of the ensemble modeling or analysis of the soil texture of the region. The voting is performed by integrating the results of logistic regression, support vector machine and decision tree. With this study, the consolidated dependence of the soil texture on the environmental factors is analyzed and a cross validation accuracy of 94.44% is obtained.

...read moreread less

23 citations

1
2
3
4
…
5

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Fraud detection system

[...]

Aisha Abdallah¹, Mohd Aizaini Maarof¹, Anazida Zainal¹•Institutions (1)

Universiti Teknologi Malaysia¹

01 Jun 2016-Journal of Network and Computer Applications

TL;DR: There are issues and challenges that hinder the performance of FDSs, such as concept drift, supports real time detection, skewed distribution, large amount of data etc, which are provided in this survey paper.

...read moreread less

403 citations

Journal Article•DOI•

Intrusion detection model using fusion of chi-square feature selection and multi class SVM

[...]

Ikram Sumaiya Thaseen¹, Cherukuri Aswani Kumar¹•Institutions (1)

VIT University¹

01 Oct 2017-Journal of King Saud University - Computer and Information Sciences

TL;DR: The main idea behind this model is to construct a multi class SVM which has not been adopted for IDS so far to decrease the training and testing time and increase the individual classification accuracy of the network attacks.

...read moreread less

321 citations

Journal Article•DOI•

Unsupervised Machine Learning for Networking: Techniques, Applications and Research Challenges

[...]

Muhammad Usama¹, Junaid Qadir¹, Aunn Raza², Hunain Arif², Kok-Lim Alvin Yau³, Yehia Elkhatib⁴, Amir Hussain⁵, Ala Al-Fuqaha⁶ - Show less +4 more•Institutions (6)

Information Technology University¹, National University of Science and Technology², Sunway University³, Lancaster University⁴, Edinburgh Napier University⁵, Khalifa University⁶

14 May 2019-IEEE Access

TL;DR: In this article, the authors provide an overview of unsupervised learning in the domain of networking, and provide a comprehensive review of the current state of the art in this area, by synthesizing insights from previous survey papers.

...read moreread less

Abstract: While machine learning and artificial intelligence have long been applied in networking research, the bulk of such works has focused on supervised learning. Recently, there has been a rising trend of employing unsupervised machine learning using unstructured raw network data to improve network performance and provide services, such as traffic engineering, anomaly detection, Internet traffic classification, and quality of service optimization. The growing interest in applying unsupervised learning techniques in networking stems from their great success in other fields, such as computer vision, natural language processing, speech recognition, and optimal control (e.g., for developing autonomous self-driving cars). In addition, unsupervised learning can unconstrain us from the need for labeled data and manual handcrafted feature engineering, thereby facilitating flexible, general, and automated methods of machine learning. The focus of this survey paper is to provide an overview of applications of unsupervised learning in the domain of networking. We provide a comprehensive survey highlighting recent advancements in unsupervised learning techniques, and describe their applications in various learning tasks, in the context of networking. We also provide a discussion on future directions and open research issues, while identifying potential pitfalls. While a few survey papers focusing on applications of machine learning in networking have previously been published, a survey of similar scope and breadth is missing in the literature. Through this timely review, we aim to advance the current state of knowledge, by carefully synthesizing insights from previous survey papers, while providing contemporary coverage of the recent advances and innovations.

...read moreread less

182 citations

Journal Article•DOI•

Network Intrusion Detection Combined Hybrid Sampling With Deep Hierarchical Network

[...]

Kaiyuan Jiang¹, Wang Wenya¹, Aili Wang¹, Haibin Wu¹•Institutions (1)

Harbin University of Science and Technology¹

13 Feb 2020-IEEE Access

TL;DR: A network intrusion detection algorithm combined hybrid sampling with deep hierarchical network is proposed, which uses convolution neural network to extract spatial features and Bi-directional long short-term memory to extract temporal features, which forms aDeep hierarchical network model.

...read moreread less

Abstract: Intrusion detection system (IDS) plays an important role in network security by discovering and preventing malicious activities. Due to the complex and time-varying network environment, the network intrusion samples are submerged into a large number of normal samples, which leads to insufficient samples for model training and detection results with a high false detection rate. According to the problem of data imbalance, we propose a network intrusion detection algorithm combined hybrid sampling with deep hierarchical network. Firstly, we use the one-side selection (OSS) to reduce the noise samples in majority category, and then increase the minority samples by Synthetic Minority Over-sampling Technique (SMOTE). In this way, a balanced dataset can be established to make the model fully learn the features of minority samples and greatly reduce the model training time. Secondly, we use convolution neural network (CNN) to extract spatial features and Bi-directional long short-term memory (BiLSTM) to extract temporal features, which forms a deep hierarchical network model. The proposed network intrusion detection algorithm was verified by experiments on the NSL-KDD and UNSW-NB15 dataset, and the classification accuracy can achieve 83.58% and 77.16%, respectively.

...read moreread less

173 citations

Journal Article•DOI•

Performance Analysis of Intrusion Detection Systems Using a Feature Selection Method on the UNSW-NB15 Dataset

[...]

Sydney Mambwe Kasongo¹, Yanxia Sun¹•Institutions (1)

University of Johannesburg¹

01 Dec 2020-Journal of Big Data

TL;DR: An analysis of the UNSW-NB15 intrusion detection dataset is presented and a filter-based feature reduction technique using the XGBoost algorithm is applied that allows for methods such as the DT to increase its test accuracy from 88.13 to 90.85% for the binary classification scheme.

...read moreread less

Abstract: Computer networks intrusion detection systems (IDSs) and intrusion prevention systems (IPSs) are critical aspects that contribute to the success of an organization. Over the past years, IDSs and IPSs using different approaches have been developed and implemented to ensure that computer networks within enterprises are secure, reliable and available. In this paper, we focus on IDSs that are built using machine learning (ML) techniques. IDSs based on ML methods are effective and accurate in detecting networks attacks. However, the performance of these systems decreases for high dimensional data spaces. Therefore, it is crucial to implement an appropriate feature extraction method that can prune some of the features that do not possess a great impact in the classification process. Moreover, many of the ML based IDSs suffer from an increase in false positive rate and a low detection accuracy when the models are trained on highly imbalanced datasets. In this paper, we present an analysis the UNSW-NB15 intrusion detection dataset that will be used for training and testing our models. Moreover, we apply a filter-based feature reduction technique using the XGBoost algorithm. We then implement the following ML approaches using the reduced feature space: Support Vector Machine (SVM), k-Nearest-Neighbour (kNN), Logistic Regression (LR), Artificial Neural Network (ANN) and Decision Tree (DT). In our experiments, we considered both the binary and multiclass classification configurations. The results demonstrated that the XGBoost-based feature selection method allows for methods such as the DT to increase its test accuracy from 88.13 to 90.85% for the binary classification scheme.

...read moreread less

159 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61

Collapse