Authors :
Anthony Obogo Otiko; Gabriel Akibi Inyang; Etim Esu Oyo-Ita; Utoda Reuben Agim
Volume/Issue :
Volume 10 - 2025, Issue 5 - May
Google Scholar :
https://tinyurl.com/yrjtefbf
DOI :
https://doi.org/10.38124/ijisrt/25may1232
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
The proliferation of Android devices has resulted in a rise in complex malware specifically designed for these
platforms, requiring higher detection techniques beyond conventional static and dynamic analyses. In this study, the
Artificial Bee Colony (ABC) algorithm for feature selection is integrated with the eXtreme Gradient Boosting (XGBoost)
and Random Forest (RF) classifiers to provide a novel method for Android malware detection. The ABC algorithm, which
draws inspiration from honeybee foraging behavior, improves the performance of classifiers by balancing exploration and
exploitation within feature subsets. Evaluation of the suggested approach on the Debrin Android malware dataset showed
significant enhancements in detection accuracy and decreased false positives. The experimental findings demonstrated that
both RF and XGBoost classifiers showed excellent performance, with RF slightly surpassing XGBoost in accuracy, precision,
recall, and ROC-AUC metrics. The results highlight the efficacy of integrating metaheuristic feature selection with strong
classifiers to enhance Android malware detection and tackle the difficulties presented by progressing threats.
Keywords :
Android Malware, Feature Selection, Machine Learning, Extreme Gradient Boosting, Random Forest.
References :
- Sharma, Y., & Arora, A. (2024). A comprehensive review on permissions-based Android malware detection. International Journal of Information Security, 23(3), 1877–1912. https://doi.org/10.1007/s10207-024-00822-2:contentReference[oaicite:3]{index=3}
- Redhu, A., Choudhary, P., Srinivasan, K., & Das, T. K. (2024). Deep learning-powered malware detection in cyberspace: A contemporary review. Frontiers in Physics, 12, 1349463. https://doi.org/10.3389/fphy.2024.1349463:contentReference[oaicite:7]{index=7}
- Long, D. A. (2024). Countering Russian cybergang ransomware attacks against critical infrastructure in the United States. Doctoral dissertation, National American University.
- Malik, M. S. (2024). IoT malware: A comprehensive survey of threats, vulnerabilities, and mitigation strategies. International Journal for Electronic Crime Investigation, 8(1), 57–66. https://doi.org/10.54692/ijeci.2024.0801187:contentReference[oaicite:15]{index=15}
- Jhanjhi, N. Z., & Shah, I. A. (2024). Navigating cyber threats and cybersecurity in the logistics industry. IGI Global.
- Jimmy, F. N. U. (2024). Cybersecurity vulnerabilities and remediation through cloud security tools. Journal of Artificial Intelligence General Science (JAIGS), 2(1), 196–233. https://doi.org/10.60087/jaigs.vol03.issue01.p233:contentReference[oaicite:23]{index=23}
- Geng, J., Wang, J., Fang, Z., Zhou, Y., Wu, D., & Ge, W. (2024). A survey of strategy-driven evasion methods for PE malware: Transformation, concealment, and attack. Computers & Security, 137, 103595. https://doi.org/10.1016/j.cose.2023.103595:contentReference[oaicite:27]{index=27}
- Yunmar, R. A., Kusumawardani, S. S., & Mohsen, F. (2024). Hybrid Android malware detection: A review of heuristic-based approach. IEEE Access, 12, 41255–41286.
- Hamid, R. A. I., & Riad, K. (2024). Advancing malware artifact detection and analysis through memory forensics: A comprehensive literature review. Journal of Theoretical and Applied Information Technology, 102(4).
- Chenet, C. P., Savino, A., & Di Carlo, S. (2024). A survey on hardware-based malware detection approaches. IEEE Access.
- Ispahany, J., Islam, M. R., Islam, M. Z., & Khan, M. A. (2024). Ransomware detection using machine learning: A review, research limitations, and future directions. IEEE Access.
- Ullah, S., Li, J., Ullah, F., Chen, J., Ali, I., Khan, S., & Leung, V. C. (2024). The revolution and vision of explainable AI for Android malware detection and protection. Internet of Things, 101320.
- Kiliçarslan, S., & Dönmez, E. (2024). Improved multi-layer hybrid adaptive particle swarm optimization based artificial bee colony for optimizing feature selection and classification of microarray data. Multimedia Tools and Applications, 83(26), 67259–67281.
- Idris, N. F., Ismail, M. A., Jaya, M. I. M., Ibrahim, A. O., Abulfaraj, A. W., & Binzagr, F. (2024). Stacking with recursive feature elimination-isolation forest for classification of diabetes mellitus. PLOS One, 19(5), e0302595.
- Arık, O. A., Köse, E., & Canbulut, G. (2024). Metaheuristic algorithms to forecast future carbon dioxide emissions of Turkey. Turkish Journal of Forecasting, 8(1), 23–39. https://doi.org/10.34110/forecasting.1388906:contentReference[oaicite:59]{index=59}
- Widians, J. A., Wardoyo, R., & Hartati, S. (2024). Feature selection based on chi-square and ant colony optimization for multi-label classification. International Journal of Electrical & Computer Engineering, 14(3).
- Li, X., Zhang, S., & Shao, P. (2024). Discrete artificial bee colony algorithm with fixed neighborhood search for traveling salesman problem. Engineering Applications of Artificial Intelligence, 131, 107816.
- AlSobeh, A. M., Gaber, K., Hammad, M. M., Nuser, M., & Shatnawi, A. (2024). Android malware detection using time-aware machine learning approach. Cluster Computing, 1–22.
- Kirubavathi, G., & Anne, W. R. (2024). Behavioral based detection of android ransomware using machine learning techniques. International Journal of System Assurance Engineering and Management, 1–22.
- Kumar, K. R., & Gurrala, J. (2024). Enhancing android malware detection through filter-based feature selection and machine learning classification. Journal of Electrical Systems, 20(3), 2801–2809.
- Ma, R., Yin, S., Feng, X., Zhu, H., & Sheng, V. S. (2024). A lightweight deep learning-based android malware detection framework. Expert Systems with Applications, 255, 124633.
- Yerima, S. (2024). Android malware dataset for machine learning 2. figshare. https://doi.org/10.6084/m9.figshare.5854653.v1:contentReference[oaicite:87]{index=87}
- Andrew, I. W., Isah, C. S., Umar, M. M., & Bolaji, L. A. (2024). Methodological approaches used for the Google machine reassignment problem. Aligarh Journal of Statistics, 44, 25–50.
- Efendi, R., Yusa, M., & Hallatu, S. T. (2024). Classification of land suitability for soybean crops using the CART method and feature selection using an algorithm ABC. IT Journal Research and Development, 9(1), 1–13.
- Sun, Z., Wang, G., Li, P., Wang, H., Zhang, M., & Liang, X. (2024). An improved random forest based on the classification accuracy and correlation measurement of decision trees. Expert Systems with Applications, 237, 121549.
- Lee, G. T., Lim, H. G., Wang, T., Zhang, G., & Jeong, M. K. (2024). Double bagging trees with weighted sampling for predictive maintenance and management of etching equipment. Journal of Process Control, 135, 103175.
- Sekhar, J. C., Roy, T. L., Sridharan, K., Saravanan, K. A., & Taloba, A. I. (2024). Explainable artificial intelligence method for identifying cardiovascular disease with a combination CNN-XG-Boost framework. International Journal of Advanced Computer Science & Applications, 15(5).
The proliferation of Android devices has resulted in a rise in complex malware specifically designed for these
platforms, requiring higher detection techniques beyond conventional static and dynamic analyses. In this study, the
Artificial Bee Colony (ABC) algorithm for feature selection is integrated with the eXtreme Gradient Boosting (XGBoost)
and Random Forest (RF) classifiers to provide a novel method for Android malware detection. The ABC algorithm, which
draws inspiration from honeybee foraging behavior, improves the performance of classifiers by balancing exploration and
exploitation within feature subsets. Evaluation of the suggested approach on the Debrin Android malware dataset showed
significant enhancements in detection accuracy and decreased false positives. The experimental findings demonstrated that
both RF and XGBoost classifiers showed excellent performance, with RF slightly surpassing XGBoost in accuracy, precision,
recall, and ROC-AUC metrics. The results highlight the efficacy of integrating metaheuristic feature selection with strong
classifiers to enhance Android malware detection and tackle the difficulties presented by progressing threats.
Keywords :
Android Malware, Feature Selection, Machine Learning, Extreme Gradient Boosting, Random Forest.