Authors :
Aishabanu Multani; Santosh Saha
Volume/Issue :
Volume 11 - 2026, Issue 5 - May
Google Scholar :
https://tinyurl.com/44p8cfru
Scribd :
https://tinyurl.com/j8tz43yp
DOI :
https://doi.org/10.38124/ijisrt/26May787
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
The dynamic change in the nature of various cyber threats, especially the threat of phishing, has identified the
limitations of traditional security solutions such as rule-based systems, signature-based systems, etc. Cyber Threat
Intelligence has emerged as an effective security practice that provides contextual information on threat actors, techniques,
etc., whereas Machine Learning has also emerged as an effective security practice that offers solutions to automated threat
detection using data analysis patterns. Although both of these security practices have immense potential, the integration of
both has not yet been explored, as seen in the existing literature on the integration of both security practices to offer effective
security solutions, especially against the threat of phishing. This paper has been designed to offer an extensive review of the
existing literature on the integration of Cyber Threat Intelligence, Machine Learning, and security solutions, especially
against the threat of phishing, as seen in the literature from 2016 to 2025.On the other hand, the comparative analysis of the
challenges identifies the need to address the issue of unstructured sources of intelligence, the problem of limited
interoperability, the issue of scalability, the problem of lack of explainability, and the problem of insufficient validation of
the solutions in the real world. Moreover, the current models of phishing detection, despite their high benchmark accuracy,
have limitations related to their adaptability, multilinguality, and adversarial robustness. With the identified research gaps,
this review highlights the importance of developing semantically enriched CTI solutions, knowledge graph-based solutions,
Large Language Model-based solutions, and adaptive learning-based solutions to facilitate the development of explainable
and real-time solutions to the problem of cybersecurity.
Keywords :
Cyber Security, Cyber Threat Intelligence(CTI), Machine Learning, NLP/LLMs, Phishing detection, threat detection
References :
- A. Trivedi, R. Gupta, and K. Jangal, "Cyber Threat Intelligence Research Paper," Arabian Agricultural Services Company (ARASCO), Tech. Rep., Aug. 2024.
- A. Aljuhami, “Cyber Threat Intelligence in Risk Management: A Comprehensive Survey,” Computers & Security, vol. 105, pp. 1–12, 2021
- R. Samtani, R. Chinn, and H. Chen, “AZSecure Hacker Assets Portal: A Cyber Threat Intelligence and Malware Analysis Platform,” IEEE Intelligence and Security Informatics (ISI), pp. 1–6, 2016.
- R. Rastogi and R. Dutta, “MALOnt: An Ontology for Malware Threat Intelligence Representation,” Journal of Information Warfare, vol. 19, no. 4, pp. 45–58, 2020.
- M. Peng Gao, “Enabling Efficient Threat Hunting with CTI (THREATRAPTOR) Using NLP and TBQL Queries,” IEEE Access, vol. 9, pp. 108732–108745, 2021.
- D. Ejaz, S. Siddiqui, and F. Irshad, “Visualizing Interesting Patterns in Cyber Threat Intelligence Reports Using Machine Learning,” Procedia Computer Science, vol. 207, pp. 121–130, 2022.
- H. Li, T. Zhang, and X. Liu, “Automatic Construction of Technique Knowledge Graphs from Cyber Threat Intelligence Reports,” Expert Systems with Applications, vol. 204, 2022.
- N. Irshad and S. Siddiqui, “Attack2Vec: Threat Actor Profiling through NLP-Based CTI Analysis,” Future Generation Computer Systems, vol. 142, pp. 213–224, 2023.
- I. Connolly, “Dark Web Malware Marketplaces: A CTI-Based Web Scraping Study,” Journal of Digital Forensics, Security and Law, vol. 18, no. 4, pp. 45–60, 2023.
- M. Mavroeidis and S. Bromander, “Cyber Threat Intelligence Ontologies and Interoperability Standards: A Critical Evaluation,” Computers \& Security, vol. 132, 2023.
- M. Lawall and A. Beenken, “Threat-Led Approach to Mitigating Ransomware Using OSINT and SOCMINT,” Journal of Cybersecurity and Digital Trust, vol. 11, no. 1, pp. 23–36, 2024.
- H. Rosa, “An Integrated Framework for Monitoring Phishing Campaigns Using CTI and Machine Learning,” Computers \& Security, vol. 143, 2025.
- M. Naik and P. Deshpande, “Machine Learning-Based SQL Injection Detection Using XGBoost,” Journal of Information Security and Applications, vol. 71, 2022.
- A. Apruzzese, M. Andreolini, L. Ferretti, and M. Marchetti, “Machine Learning for Cybersecurity: A Review and Meta-Analysis,” Computers \& Security, vol. 121, 2022.
- E. Dunsina, J. Obafemi, and P. Ezimadu, “Reinforcement Learning for Post-Incident Malware Investigation,” Forensic Science International: Digital Investigation, vol. 48, pp. 1–10, 2025.
- S. Alshaikh, L. Alqahtani, and A. Watson, “Understanding the Communication and Adoption of Machine Learning in Cybersecurity,” Computers & Security, vol. 139, 2025.
- I. Qiqieh, “An intelligent cyber threat detection: A swarm-optimized machine learning approach,” Alexandria Engineering Journal, 2025.
- S.-A. Sadegh-Zadeh, “An unsupervised machine learning approach for cyber threat detection using geographic profiling and Domain Name System data,” Decision Analytics Journal, 2025.
- S. B. Shah, “Machine Learning for Cyber Threat Detection and Prevention in Critical Infrastructure,” Journal of Global Research in Electronics and Communication, 2025.
- K. I. Nnaka, “AI-powered threat detection: Opportunities and limitations in modern cyber defense,” World Journal of Advanced Research and Reviews, 2025.
- A. Dincy R. Arikkat and N. Joseph, “SECTIS: Secure Cyber Threat Intelligence Sharing Using Swarm Learning and Blockchain,” IEEE Transactions on Network and Service Management, vol. 19, no. 5, pp. 5232–5245, 2025.
- F. Fieblinger, M. Reichenbach, and A. R. Sadiq, “Integrating Knowledge Graphs and Large Language Models for Cyber Threat Intelligence Automation,” Applied Intelligence, vol. 54, no. 2, pp. 1120–1135, 2024.
- R. Rahman, T. Nasir, and H. Chowdhury, “Temporal Attack Pattern Mining Using LLMs and ML Classifiers,” IEEE Access, vol. 13, pp. 20125–20138, 2025.
- M. Arazzi, F. Moretti, and C. Piras, “NLP-Based Cyber Threat Intelligence and Relation Extraction Using BERT and GCN,” IEEE Access, vol. 13, pp. 47210–47225, 2025.
- M. Alsaedi, F. A. Ghaleb, F. Saeed, J. Ahmad, and M. Alasli, "Cyber Threat Intelligence-Based Malicious URL Detection Model Using Ensemble Learning," Sensors, vol. 22, no. 9, p. 3373, Apr. 2022, doi: 10.3390/s22093373.
- I. Naseer, "Machine Learning Applications in Cyber Threat Intelligence: A Comprehensive Review," Asian Bulletin of Big Data Management, vol. 3, no. 2, pp. 190–200, Jan. 2024, doi: 10.62019/abbdm.v3i2.85.
- A. Aslam, M. Khan, and N. Hussain, “AntiPhishStack: Phishing URL Detection Using LSTM and XGBoost,” International Journal of Information Security Science, vol. 12, no. 3, pp. 145–158, 2023.
- V. van Geest, T. de Koning, and M. van Vliet, “Hybrid Framework for Automated Phishing Detection Using Stacking Models,” Expert Systems with Applications, vol. 237, 2024.
- N. Malarvizhi, C. S. Krishna, J. K. Kumar, and P. V. S. Kumar, "Cyber Threat Detection in URLs using Machine Learning," Grenze International Journal of Engineering and Technology, June Issue.
- G. S. Nayak, B. Muniyal, and M. C. Belavagi, "Enhancing Phishing Detection: A Machine Learning Approach With Feature Selection and Deep Learning Models," IEEE Access, vol. 13, 2025, doi: 10.1109/ACCESS.2025.3543738.
- M. M. Rahman, K. Dhakal, N. Gony, M. K. Shuvra, and M. Rahman, "AI integration in cybersecurity software: Threat detection and response," International Journal of Innovative Research and Scientific Studies, vol. 8, no. 3, pp. 3907–3921, 2025.
- A. An, S. Alghamdi, and R. Aljohani, “Multilingual Email Phishing Detection Using Ensemble Machine.
- A. An, S. Alghamdi, and R. Aljohani, “Multilingual Email Phishing Detection Using Ensemble Machine Learning,” IEEE Access, vol. 13, pp. 67230–67242, 2025.
- K. Lim, A. Rajendran, and L. Tan, “EXPLICATE: Explainable AI for Phishing Detection Using TF-IDF, LIME, and LLM,” Information Sciences, vol. 657, pp. 212–224, 2025.
- P. Li, X. Zhang, and Y. Chen, “Reference-Based Phishing Detection for Enterprise Web Systems,” IEEE Transactions on Dependable and Secure Computing, vol. 22, 2025.
- D. Saxena, S. Degadwala, and M. Joshi, "Phishing URL Detection Using Machine Learning," International Journal of Scientific Research in Science and Technology, vol. 13, no. 1, pp. 19–25, Jan. 2026, doi: 10.32628/IJSRST2613101.
The dynamic change in the nature of various cyber threats, especially the threat of phishing, has identified the
limitations of traditional security solutions such as rule-based systems, signature-based systems, etc. Cyber Threat
Intelligence has emerged as an effective security practice that provides contextual information on threat actors, techniques,
etc., whereas Machine Learning has also emerged as an effective security practice that offers solutions to automated threat
detection using data analysis patterns. Although both of these security practices have immense potential, the integration of
both has not yet been explored, as seen in the existing literature on the integration of both security practices to offer effective
security solutions, especially against the threat of phishing. This paper has been designed to offer an extensive review of the
existing literature on the integration of Cyber Threat Intelligence, Machine Learning, and security solutions, especially
against the threat of phishing, as seen in the literature from 2016 to 2025.On the other hand, the comparative analysis of the
challenges identifies the need to address the issue of unstructured sources of intelligence, the problem of limited
interoperability, the issue of scalability, the problem of lack of explainability, and the problem of insufficient validation of
the solutions in the real world. Moreover, the current models of phishing detection, despite their high benchmark accuracy,
have limitations related to their adaptability, multilinguality, and adversarial robustness. With the identified research gaps,
this review highlights the importance of developing semantically enriched CTI solutions, knowledge graph-based solutions,
Large Language Model-based solutions, and adaptive learning-based solutions to facilitate the development of explainable
and real-time solutions to the problem of cybersecurity.
Keywords :
Cyber Security, Cyber Threat Intelligence(CTI), Machine Learning, NLP/LLMs, Phishing detection, threat detection