Effective and Efficient Hybrid Android Malware Classification Using Pseudo-Label Stacked Auto-Encoder
TLDR
In this article, a semi-supervised learning technique, namely pseudo-label stacked auto-encoder (PLSAE), was used to detect Android malware, which involves training using a set of labeled and unlabeled instances.Abstract:
Android has become the target of attackers because of its popularity. The detection of Android mobile malware has become increasingly important due to its significant threat. Supervised machine learning, which has been used to detect Android malware is far from perfect because it requires a significant amount of labeled data. Since labeled data is expensive and difficult to get while unlabeled data is abundant and cheap in this context, we resort to a semi-supervised learning technique, namely pseudo-label stacked auto-encoder (PLSAE), which involves training using a set of labeled and unlabeled instances. We use a hybrid approach of dynamic analysis and static analysis to craft feature vectors. We evaluate our proposed model on CICMalDroid2020, which includes 17,341 most recent samples of five different Android apps categories. After that, we compare the results with state-of-the-art techniques in terms of accuracy and efficiency. Experimental results show that our proposed framework outperforms other semi-supervised approaches and common machine learning algorithms.read more
Citations
More filters
Journal ArticleDOI
Cyber-Threat Detection System Using a Hybrid Approach of Transfer Learning and Multi-Model Image Representation
TL;DR: A malware detection system based on word2vec-based transfer learning and multi-model image representation that combines the textual and texture features of network traffic to leverage the advantages of both types.
Journal ArticleDOI
Explainable Malware Detection System Using Transformers-Based Transfer Learning and Multi-Model Visual Representation
Farhan Ullah,Amjad Alsirhani,Mohammed Alshahrani,Abdullah Alomari,Hamad Naeem,Syed Aziz Shah +5 more
TL;DR: An explainable malware detection system was proposed using transfer learning and malware visual features for effective malware detection and an interpretable artificial intelligence (AI) experiment was conducted.
Journal ArticleDOI
An Enhanced Deep Learning Neural Network for the Detection and Identification of Android Malware
TL;DR: AMDI-Droid as mentioned in this paper proposes a new architecture based on a deep neural network, where the predictive outputs obtained from all hidden layers are blended to produce a final prediction, and a loss function is formulated by combining the predictive loss of each base classifier connected to the corresponding hidden layer.
Journal ArticleDOI
Explainable artificial intelligence for cybersecurity: a literature survey
Fabien Charmet,Harry Chandra Tanuwidjaja,Solayman Ayoubi,Pierre-François Gimenez,Yufei Han,Houda Jmila,Gregory Blanc,Takeshi Takahashi,Zhonghua Zhang +8 more
TL;DR: In this paper , the authors conduct an extensive literature review on the intersection between explainable AI and cybersecurity, and investigate the existing literature from two perspectives: the applications of XAI to cybersecurity (e.g., intrusion detection, malware classification), and the security of X-Atea (i.e., attacks on XAI pipelines, potential countermeasures).
Journal ArticleDOI
Android malware category detection using a novel feature vector-based machine learning model
TL;DR: In this paper , a novel Huffman encoding-based feature vector generation technique is proposed to improve the efficiency of the detection model and the proposed model was evaluated using machine learning and deep learning methods.
References
More filters
Proceedings Article
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan,Andrew Zisserman +1 more
TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Journal Article
Scikit-learn: Machine Learning in Python
Fabian Pedregosa,Gaël Varoquaux,Alexandre Gramfort,Vincent Michel,Bertrand Thirion,Olivier Grisel,Mathieu Blondel,Peter Prettenhofer,Ron Weiss,Vincent Dubourg,Jake Vanderplas,Alexandre Passos,David Cournapeau,Matthieu Brucher,Matthieu Perrot,Edouard Duchesnay +15 more
TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
Journal ArticleDOI
Deep learning
TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
Proceedings ArticleDOI
Combining labeled and unlabeled data with co-training
Avrim Blum,Tom M. Mitchell +1 more
TL;DR: A PAC-style analysis is provided for a problem setting motivated by the task of learning to classify web pages, in which the description of each example can be partitioned into two distinct views, to allow inexpensive unlabeled data to augment, a much smaller set of labeled examples.
Proceedings ArticleDOI
A unified architecture for natural language processing: deep neural networks with multitask learning
Ronan Collobert,Jason Weston +1 more
TL;DR: This work describes a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity tags, semantic roles, semantically similar words and the likelihood that the sentence makes sense using a language model.