Enhancing the Sustainability of Machine Learning-Based Malware Detection Techniques for Android Applications | Zendy

Seyeon Park | Zendy; Hojun Lee | Zendy; Daeun Kim | Zendy; Hyeun Jun Moon | Zendy; Seongje Cho | Zendy; Youngsup Hwang | Zendy; Hyoil Han | Zendy; Kyoungwon Suh | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Enhancing the Sustainability of Machine Learning-Based Malware Detection Techniques for Android Applications

Author(s) -

Seyeon Park,

Hojun Lee,

Daeun Kim,

Hyeun Jun Moon,

Seongje Cho,

Youngsup Hwang,

Hyoil Han,

Kyoungwon Suh

Publication year - 2025

Publication title -

ieee access

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 0.587

H-Index - 127

eISSN - 2169-3536

DOI - 10.1109/access.2025.3576733

Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation

The rapid increase in smartphone usage has led to a corresponding rise in malicious Android applications, making it important to develop efficient and sustainable malware detection methods with high accuracy. This paper presents a two-stage machine learning approach aimed at improving both detection accuracy and sustainability in Android malware classification. The first stage estimates the release year of an app using its SDK version information, while the second stage classifies apps as benign or malicious through a weighted voting mechanism applied to year-specific malware detection models. This method balances the high accuracy of retraining with reduced computational overhead, delivering robust and scalable malware detection. Using a dataset spanning 2014 to 2023, we evaluate the performance of the proposed method in comparison to retraining and incremental learning approaches. Experimental results demonstrate that while the retraining method achieves the highest accuracy and F1 score, it incurs a significant increase in training time. Conversely, the incremental learning method offers lower accuracy but reduced training time. Our two-stage classification method effectively balances these trade-offs, providing accuracy comparable to retraining while maintaining stable training times and moderate model sizes, making it a viable option for sustainable malware detection in real-world environments. Future research will explore non-machine-learning-based release year prediction methods to further optimize training efficiency and improve adaptability to the rapidly evolving malware detection landscape.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research