RMIT University
Browse

Mobile Malware Detection with Imbalanced Data using a Novel Synthetic Oversampling Strategy and Deep Learning

conference contribution
posted on 2024-11-03, 14:42 authored by Mahbub Khoda, Joarder Kamruzzaman, Iqbal GondalIqbal Gondal, Tasadduq Imam, Ashfaqur Rahman
Mobile malware detection is inherently an imbalanced data problem since the number of benign applications in the market is far greater than the number of malicious applications. Existing methods to handle imbalanced data, such as synthetic minority over-sampling, do not translate well into this domain since mobile malware detection generally deals with binary features and these methods are designed for continuous features. Also, methods adapted for categorical features cannot be applied here since random modifications of features can result in invalid sample generation. In this work, we propose a novel technique for generating synthetic samples for mobile malware detection with imbalanced data. Our proposed method adds new data points in the sample space by generating synthetic malware samples which also preserves the original functionality of the malicious apps. Experiments show that the proposed approach outperforms existing techniques in terms of precision, recall, F1score, and AUC. This study will be useful in building deep neural network-based systems to handle imbalanced data for mobile malware detection.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1109/WiMob50308.2020.9253433
  2. 2.
    ISBN - Is published in 9781728197227 (urn:isbn:9781728197227)

Start page

1

End page

6

Total pages

6

Outlet

Proceedings 16th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob 2020)

Name of conference

WiMob 2020

Publisher

IEEE

Place published

United States

Start date

2020-10-12

End date

2020-10-14

Language

English

Copyright

© 2020 by IEEE

Former Identifier

2006109838

Esploro creation date

2021-09-30

Usage metrics

    Scholarly Works

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC