LongCGDroid: Android malware detection through longitudinal study for machine learning and deep learning

Date

2023

Publisher

Scientific Research Support Fund of Jordan

Abstract

This study compares the longitudinal performance of machine-learning and deep-learning classifiers for Android malware detection across different levels of feature abstraction. Using a dataset of 200k Android apps labeled by date over a 10-year span (2013-2022), we propose LongCGDroid, an effective image-based approach to Android malware detection. We use a semantic API call graph representation, derived from the Control Flow Graph and Data Flow Graph, to extract abstracted API calls, and we evaluate the longitudinal performance of LongCGDroid against API changes. Two families of models are used: machine-learning models (LR, RF, KNN, SVM) and deep-learning models (CNN, RNN). Empirical experiments show a progressive decline in performance for all classifiers when they are evaluated on samples from later periods. However, the deep-learning CNN model under the class abstraction remains comparatively stable over time. Compared with eight state-of-the-art approaches, LongCGDroid achieves higher accuracy.
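
The abstract does not spell out how abstracted API calls become an image. The keywords mention an adjacency matrix, so one plausible reading is sketched below. It is a minimal illustration and not the authors' implementation: the function names `abstract_api` and `adjacency_matrix`, the fixed vocabulary, and the binary edge encoding are all assumptions, as is the simplification that the last dot-separated component of a name is the method and the one before it is the class.

```python
def abstract_api(method_name: str, level: str = "class") -> str:
    """Reduce a fully qualified method name to a coarser abstraction level.

    Assumption: names look like 'package.path.Class.method', with no nested
    classes or overload signatures.
    """
    parts = method_name.split(".")
    if level == "method":
        return method_name
    if level == "class":
        return ".".join(parts[:-1])   # drop the method name
    if level == "package":
        return ".".join(parts[:-2])   # drop the class and method names
    raise ValueError(f"unknown abstraction level: {level}")


def adjacency_matrix(call_edges, vocabulary, level="class"):
    """Build a binary adjacency matrix over abstracted call-graph nodes.

    call_edges: iterable of (caller, callee) fully qualified method names.
    vocabulary: ordered list of abstracted names; fixes the matrix size so
                every app maps to an image of the same shape.
    """
    index = {name: i for i, name in enumerate(vocabulary)}
    size = len(vocabulary)
    matrix = [[0] * size for _ in range(size)]
    for caller, callee in call_edges:
        src = index.get(abstract_api(caller, level))
        dst = index.get(abstract_api(callee, level))
        # Edges touching names outside the vocabulary are ignored.
        if src is not None and dst is not None:
            matrix[src][dst] = 1
    return matrix


# Hypothetical call edges for illustration only.
edges = [
    ("android.app.Activity.onCreate", "android.telephony.SmsManager.getDefault"),
    ("android.telephony.SmsManager.getDefault", "android.telephony.SmsManager.sendTextMessage"),
]
vocab = ["android.app.Activity", "android.telephony.SmsManager"]
matrix = adjacency_matrix(edges, vocab, level="class")
```

Under this reading, coarser abstraction levels would make the matrix less sensitive to method-level API changes between Android releases, which is consistent with the stability the abstract reports for the class abstraction. The resulting matrix could then be fed to a CNN as a single-channel image.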

Keywords

Adjacency matrix, Android security, Longitudinal evaluation, Machine learning, Malware detection
