scispace - formally typeset
Open AccessJournal ArticleDOI

Effective and Efficient Hybrid Android Malware Classification Using Pseudo-Label Stacked Auto-Encoder

TLDR
In this article, a semi-supervised learning technique, namely pseudo-label stacked auto-encoder (PLSAE), was used to detect Android malware, which involves training using a set of labeled and unlabeled instances.
Abstract
Android has become the target of attackers because of its popularity. The detection of Android mobile malware has become increasingly important due to its significant threat. Supervised machine learning, which has been used to detect Android malware is far from perfect because it requires a significant amount of labeled data. Since labeled data is expensive and difficult to get while unlabeled data is abundant and cheap in this context, we resort to a semi-supervised learning technique, namely pseudo-label stacked auto-encoder (PLSAE), which involves training using a set of labeled and unlabeled instances. We use a hybrid approach of dynamic analysis and static analysis to craft feature vectors. We evaluate our proposed model on CICMalDroid2020, which includes 17,341 most recent samples of five different Android apps categories. After that, we compare the results with state-of-the-art techniques in terms of accuracy and efficiency. Experimental results show that our proposed framework outperforms other semi-supervised approaches and common machine learning algorithms.

read more

Citations
More filters
Journal ArticleDOI

Cyber-Threat Detection System Using a Hybrid Approach of Transfer Learning and Multi-Model Image Representation

TL;DR: A malware detection system based on word2vec-based transfer learning and multi-model image representation that combines the textual and texture features of network traffic to leverage the advantages of both types.
Journal ArticleDOI

Explainable Malware Detection System Using Transformers-Based Transfer Learning and Multi-Model Visual Representation

TL;DR: An explainable malware detection system was proposed using transfer learning and malware visual features for effective malware detection and an interpretable artificial intelligence (AI) experiment was conducted.
Journal ArticleDOI

An Enhanced Deep Learning Neural Network for the Detection and Identification of Android Malware

TL;DR: AMDI-Droid as mentioned in this paper proposes a new architecture based on a deep neural network, where the predictive outputs obtained from all hidden layers are blended to produce a final prediction, and a loss function is formulated by combining the predictive loss of each base classifier connected to the corresponding hidden layer.
Journal ArticleDOI

Explainable artificial intelligence for cybersecurity: a literature survey

TL;DR: In this paper , the authors conduct an extensive literature review on the intersection between explainable AI and cybersecurity, and investigate the existing literature from two perspectives: the applications of XAI to cybersecurity (e.g., intrusion detection, malware classification), and the security of X-Atea (i.e., attacks on XAI pipelines, potential countermeasures).
Journal ArticleDOI

Android malware category detection using a novel feature vector-based machine learning model

TL;DR: In this paper , a novel Huffman encoding-based feature vector generation technique is proposed to improve the efficiency of the detection model and the proposed model was evaluated using machine learning and deep learning methods.
References
More filters
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Journal Article

Scikit-learn: Machine Learning in Python

TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
Journal ArticleDOI

Deep learning

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
Proceedings ArticleDOI

Combining labeled and unlabeled data with co-training

TL;DR: A PAC-style analysis is provided for a problem setting motivated by the task of learning to classify web pages, in which the description of each example can be partitioned into two distinct views, to allow inexpensive unlabeled data to augment, a much smaller set of labeled examples.
Proceedings ArticleDOI

A unified architecture for natural language processing: deep neural networks with multitask learning

TL;DR: This work describes a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity tags, semantic roles, semantically similar words and the likelihood that the sentence makes sense using a language model.
Related Papers (5)