Imbalanced malware classification: an approach based on dynamic classifier selection

  • Jose Vinicius S. Souza
  • , Camila Barbosa Vieira
  • , George D.C. Cavalcanti
  • , Rafael M.O. Cruz

Research output: Contribution to Book/Report typesContribution to conference proceedingspeer-review

Abstract

In recent years, the rise of cyber threats has emphasized the need for robust malware detection systems, especially on mobile devices. Malware, which targets vulnerabilities in devices and user data, represents a substantial security risk. A significant challenge in malware detection is the imbalance in datasets, where most applications are benign, with only a small fraction posing a threat. This study addresses the often-overlooked issue of class imbalance in malware detection by evaluating various machine learning strategies for detecting malware in Android applications. We assess monolithic classifiers and ensemble methods, focusing on dynamic selection algorithms, which have shown superior performance compared to traditional approaches. In contrast to balancing strategies performed on the whole dataset, we propose a balancing procedure that works individually for each classifier in the pool. Our empirical analysis demonstrates that the KNOP algorithm obtained the best results using a pool of Random Forest. Additionally, an instance hardness assessment revealed that balancing reduces the difficulty of the minority class and enhances the detection of the minority class (malware). The code used for the experiments is available at https://github.com/jvss2/Machine-Learning-Empirical-Evaluation.

Original languageEnglish
Title of host publication2025 IEEE Symposium on Computational Intelligence in Security, Defence and Biometrics Companion, CISDB Companion 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331508470
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event2025 IEEE Symposium on Computational Intelligence in Security, Defence and Biometrics Companion, CISDB Companion 2025 - Trondheim, Norway
Duration: 17 Mar 202520 Mar 2025

Publication series

Name2025 IEEE Symposium on Computational Intelligence in Security, Defence and Biometrics Companion, CISDB Companion 2025

Conference

Conference2025 IEEE Symposium on Computational Intelligence in Security, Defence and Biometrics Companion, CISDB Companion 2025
Country/TerritoryNorway
CityTrondheim
Period17/03/2520/03/25

!!!Keywords

  • Android security
  • Data Balance
  • Embedding
  • Machine Learning
  • Multiple Classifier Systems

Fingerprint

Dive into the research topics of 'Imbalanced malware classification: an approach based on dynamic classifier selection'. These topics are generated from the title and abstract of the publication. Together, they form a unique fingerprint.

Cite this