Applying Wide & Deep Learning Model for Android Malware Classification |
| |
Authors: | Le Duc Thuan Pham Van Huong Hoang Van Hiep Nguyen Kim Khanh |
| |
Affiliation: | 1 Institute of Computer Sciences and Information Technology (ICS/IT), The University of Agriculture, Peshawar, 25130, Khyber Pakhtunkhwa, Pakistan2 Department of Convergence Engineering for Intelligent Drone, Sejong University, Seoul, 05006, Korea3 Collaborative Robotics and Intelligent Systems (CoRIS) Institute, Oregon State University, OR, USA4 Department of Software, Sejong University, Seoul, 05006, Korea |
| |
Abstract: | Android malware has exploded in popularity in recent years, due to the platform’s dominance of the mobile market. With the advancement of deep learning technology, numerous deep learning-based works have been proposed for the classification of Android malware. Deep learning technology is designed to handle a large amount of raw and continuous data, such as image content data. However, it is incompatible with discrete features, i.e., features gathered from multiple sources. Furthermore, if the feature set is already well-extracted and sparsely distributed, this technology is less effective than traditional machine learning. On the other hand, a wide learning model can expand the feature set to enhance the classification accuracy. To maximize the benefits of both methods, this study proposes combining the components of deep learning based on multi-branch CNNs (Convolutional Network Neural) with wide learning method. The feature set is evaluated and dynamically partitioned according to its meaning and generalizability to subsets when used as input to the model’s wide or deep component. The proposed model, partition, and feature set quality are all evaluated using the K-fold cross validation method on a composite dataset with three types of features: API, permission, and raw image. The accuracy with Wide and Deep CNN (WDCNN) model is 98.64%, improved by 1.38% compared to RNN (Recurrent Neural Network) model. |
| |
Keywords: | Wide and deep (W&D) learning convolutional neural network image feature raw features generalized features |
|
| 点击此处可从《计算机系统科学与工程》浏览原始摘要信息 |
|
点击此处可从《计算机系统科学与工程》下载全文 |
|