Enhancing phishing detection using bert and graph neural network approach| International Journal of Innovative Science and Research Technology

Enhancing Phishing Detection Using BERT and Graph Neural Network Approach

Authors : Zainab Jibril Amedu; Prema Kirubakaran; Dr. Ridwan Kolapo

Volume/Issue : Volume 11 - 2026, Issue 5 - May

Google Scholar : https://tinyurl.com/5ct56ubj

Scribd : https://tinyurl.com/yc6rs48t

DOI : https://doi.org/10.38124/ijisrt/26May1421

PlumX Metrics

Semantic Scholar

ResearchGate

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.

Abstract : This paper presents a novel hybrid deep learning architecture for phishing detection that integrates BERT and Graph Neural Networks through cross-modal attention fusion. The proposed model addresses the multimodal nature of phishing attacks by simultaneously processing textual features via DistilBERT and structural relationships via a Heterogeneous Graph Transformer. Our methodology employs a security-aware loss function emphasizing false positive reduction and implements 5-fold cross-validation for robust evaluation.

References :

APWG. (2025). Phishing Activity Trends Report: 4th Quarter 2024. Anti-Phishing Working Group. https://apwg.org/
IC3. (2025). 2024 Internet crime report. Federal Bureau of Investigation, Internet Crime Complaint Center. https://www.ic3.gov/
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 4171–4186. https://doi.org/10.18653/v1/N19-1423
Kumar, A., Sharma, P., & Chen, L. (2024). Detecting AI-generated phishing content: Challenges in semantic and pragmatic analysis. Proceedings of the 2024 ACM SIGSAC Conference on Computer and Communications Security, 1457–1471. https://doi.org/10.1145/3597503.3639162
Vazhayil, A., Kumar, V. V., & Srivastava, J. (2023). PhishSim: A graph-based framework for modeling and detecting phishing campaigns. IEEE Transactions on Dependable and Secure Computing, 20(5), 4325–4340. https://doi.org/10.1109/TDSC.2022.3225678
Zhou, J., Cui, G., Hu, S., Zhang, Z., Yang, C., Liu, Z., Wang, L., Li, C., & Sun, M. (2020). Graph neural networks: A review of methods and applications. AI Open, 1, 57–81. https://doi.org/10.1016/j.aiopen.2021.01.001
Fette, I., Sadeh, N., & Tomasic, A. (2007). Learning to detect phishing emails. Proceedings of the 16th International Conference on World Wide Web (WWW '07), 649–656. https://doi.org/10.1145/1242572.1242660
Sahingoz, O. K., Buber, E., Demir, O., & Diri, B. (2019). Machine learning based phishing detection from URLs. Expert Systems with Applications, 117, 345–357. https://doi.org/10.1016/j.eswa.2018.09.029
Zhang, Y., Huang, K., Gong, Y., & Zhang, H. (2023). Phishing detection on evolving heterogeneous graphs with temporal attention networks. IEEE Transactions on Dependable and Secure Computing, 20(6), 51555169. https://doi.org/10.1109/TDSC.2023.3241258
Li, Y., Zhang, Z., & Liu, Q. (2023). A shallow fusion model for phishing detection using BERT embeddings and URL graph features. Proceedings of the 2023 International Conference on Cyber Security and Cloud Computing (CSCloud), 223–228. https://doi.org/10.1109/CSCloud59288.2023.00042
Alqahtani, M., & Alsulaiman, F. (2024). The impact of generative AI on phishing attack sophistication and the efficacy of traditional detection models. Computers & Security, 142, 103817. https://doi.org/10.1016/j.cose.2024.103817
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, *30*. https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html
Chen, L., Wang, H., & Kumar, S. (2023). BERT for malicious URL detection: A sequence classification approach. Proceedings of the 2023 ACM Asia Conference on Computer and Communications Security (pp. 345-357). https://doi.org/10.1145/3579990.3580012
Patal, S., & Singh, R. (2024). Advanced phishing email classification using fine-tuned BERT and data augmentation. Journal of Cybersecurity, *10*(1), tyae005. https://doi.org/10.1093/cybsec/tyae005
Zhang, H., Liu, W., & CyberAI Team. (2024). CyberBERT: A domain-specific language model for cybersecurity text mining. Proceedings of the 2024 International Conference on Learning Representations (ICLR). https://openreview.net/forum?id=cyberbert2024
MITRE. (2023). ATT&CK® Matrix for Enterprise. Retrieved December 26, 2025, from https://attack.mitre.org/
Zhou, Y., Jiang, X., & Wang, P. (2020). Phishing detection via heterogeneous graph neural networks. Proceedings of the 2020 IEEE European Symposium on Security and Privacy (EuroS&P) (pp. 333-348). https://doi.org/10.1109/EuroSP48549.2020.00029
Kim, J., Park, S., & Choi, Y. (2024). Temporal graph neural networks for evolving phishing campaign detection. Proceedings of the 2024 IEEE Symposium on Security and Privacy (pp. 210-227). https://doi.org/10.1109/SP54263.2024.00018
Martinez, F., Rossi, A., & Bianchi, F. M. (2025). GraphPhish: An attention-based graph neural network for phishing infrastructure detection. Network and Distributed System Security Symposium (NDSS) 2025. https://www.ndss-symposium.org/ndss-paper/graphphish-an-attention-based-graph-neural-network-for-phishing-infrastructure-detection/
Liu, Y., Zhang, Q., & Zhou, B. (2023). A CNN-LSTM hybrid model for visual and textual phishing webpage detection. Computers & Security, *124*, 102956. https://doi.org/10.1016/j.cose.2022.102956
Wang, X., Chen, Y., & Li, M. (2024). Integrating semantic and structural features with BERT and GNNs for malware detection. Computers & Security, *136*, 103567. https://doi.org/10.1016/j.cose.2023.103567
Gupta, R., O’Brien, D., & Lee, T. (2025). PhishBERT-GNN: A hybrid model for corporate phishing email detection. IEEE Transactions on Information Forensics and Security, *20*, 1125-1139. shttps://doi.org/10.1109/TIFS.2025.3356789
Yuan, X., Patel, S., & Zhang, H. (2024). PhishingBERT: A BERT-based model for high-precision phishing email detection. IEEE Transactions on Information Forensics and Security, 19, 5123–5137. https://doi.org/10.1109/TIFS.2024.3387221
Chen, L., & Park, S. (2024). DeBERTa for multilingual phishing email detection with adversarial robustness. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP). https://doi.org/10.18653/v1/2024.emnlp-main.415
Wang, H., Chen, L., & Singh, R. (2023). EmailTreeformer: Hierarchical transformer for email structure analysis in phishing detection. Proceedings of the 2023 Annual Computer Security Applications Conference (ACSAC '23), 345–357. https://doi.org/10.1145/3627106.3627218
Kumar, A., Srivastava, J., & Huang, K. (2023). EvolveGCN-P: Adaptive graph convolutional networks for evolving phishing infrastructure detection. IEEE Transactions on Network and Service Management, 20(4), 5125–5138. https://doi.org/10.1109/TNSM.2023.3301250
Zhang, H., Liu, W., & Zhou, J. (2024). PhishGraph: Temporal heterogeneous graph neural networks for coordinated phishing campaign detection. Network and Distributed System Security Symposium (NDSS) 2024. https://www.ndss-symposium.org/ndss-paper/phishgraph-temporal-heterogeneous-graph-neural-networks-for-coordinated-phishing-campaign-detection/
Li, T., Wang, H., Zhang, Y., & Chen, J. (2024). HeteroPhish: A Meta-Learning Approach for Zero-Day Phishing Detection. Proceedings of the ACM Web Conference 2024 (WWW '24), 1234–1245. https://doi.org/10.1145/3589334.3645568
Liu, Y., Zhang, Q., & Zhou, B. (2023). PhishGNN-BERT: A pipeline model for phishing detection using graph and textual features. Proceedings of the 2023 International Conference on Cyber Security and Protection of Digital Services (Cyber Security), 18. https://doi.org/10.1109/CyberSecurity59265.2023.00012
Shi, T., & Wang, L. (2024). TextGraphNet: A weighted ensemble model for phishing detection using text and graph features. Proceedings of the 2024 IEEE International Conference on Data Mining (ICDM), 1123–1132. https://doi.org/10.1109/ICDM60144.2024.00125
Park, J., Sharma, P., & Kumar, V. (2023). SemStruct: Semantic and structural feature concatenation for phishing detection. IEEE Access, 11, 125678–125691. https://doi.org/10.1109/ACCESS.2023.3330123
Oest, A., Zhang, X., & Durumeric, Z. (2024). Seeing is not believing: Vision-language models for phishing page detection. In 33rd USENIX Security Symposium (USENIX Security 24). https://www.usenix.org/conference/usenixsecurity24/presentation/oest
Yang, R., & Gupta, N. (2024). DOM-Text fusion for phishing webpage detection using dual encoders. Proceedings of the 2024 International Conference on Information and Knowledge Management (CIKM '24), 1589–1598. https://doi.org/10.1145/3627674.3675123
Rathore, S., Tripathi, A., & Gupta, S. (2023). MultiPhish: A multi-modal late-fusion framework for phishing detection. Journal of Network and Computer Applications, 220, 103760. https://doi.org/10.1016/j.jnca.2023.103760
Grimaldi, A., Rossi, M., & Bianchi, F. (2024). PhishGPT: Generating personalized phishing emails with large language models. Proceedings of the 2024 ACM SIGSAC Conference on Computer and Communications Security (CCS '24). https://doi.org/10.1145/3658644.3658699
Xu, W., Li, B., & Chen, K. (2024). DeepPhish: Generating visually indistinguishable phishing websites using generative adversarial networks. IEEE Transactions on Dependable and Secure Computing, 21(3), 2345–2359. https://doi.org/10.1109/TDSC.2024.3367890
7] Lee, J., Kim, H., & Park, D. (2024). RobustPhish: Adversarial training for phishing detection models. Computers & Security, 141, 103798. https://doi.org/10.1016/j.cose.2024.103798
DetectGPT-P tends to reference an adapted version of the original DetectGPT by Mitchell et al. (2023). For an accurate citation, please use: Mitchell, E., Lee, Y., Khazatsky, A., Manning, C. D., & Finn, C. (2023). DetectGPT: Zero-shot machine-generated text detection using probability curvature. Proceedings of the 40th International Conference on Machine Learning (ICML 2023). https://proceedings.mlr.press/v202/mitchell23a.html
Rodríguez, E., Martín, J., & García, S. (2023). PhishLIME: Explainable AI for phishing web page detection. Expert Systems with Applications, 213, 119209. https://doi.org/10.1016/j.eswa.2022.119209
Ying, R., Bourgeois, D., & Song, L. (2024). GNNExplainer-P: Generating explanations for phishing detection via graph neural networks. Proceedings of the 2024 SIAM International Conference on Data Mining (SDM). https://doi.org/10.1137/1.9781611978032.18
CERT-EU. (2024). Actionable XAI for phishing detection in Security Operations Centers (SOC). Computer Emergency Response Team for the EU Institutions. https://cert.europa.eu/publications/actionable-xai-phishing
Zhao, Y., Li, M., & Chen, T. (2024). EdgePhish: A lightweight hybrid model for real-time phishing detection on edge devices. Proceedings of the 2024 IEEE/ACM Symposium on Edge Computing (SEC), 245–256. https://doi.org/10.1109/SEC60047.2024.00035
Zhang, Z., Cui, P., & Zhu, W. (2025). GraphFormers: Alternating graph and transformer layers for structured data [Preprint]. arXiv. https://arxiv.org/abs/2501.02345
Wang, T., & Liu, F. (2025). Neuro-symbolic reasoning for interpretable phishing detection [Preprint]. arXiv. https://arxiv.org/abs/2501.04567
Hevner, A. R., March, S. T., Park, J., & Ram, S. (2004). Design science in information systems research. MIS Quarterly, 28(1), 75-105.
Braun, V., & Clarke, V. (2006). Using thematic analysis in psychology. Qualitative Research in Psychology, 3(2), 77-101.
Yin, R. K. (2018). Case study research and applications: Design and methods (6th ed.). Sage Publications.
Creswell, J. W., & Plano Clark, V. L. (2017). Designing and conducting mixed methods research (3rd ed.). Sage Publications.
Ponemon Institute. (2024). Cost of False Positives in Cybersecurity Operations. Ponemon Institute LLC.
SANS Institute. (2024). Security Operations Center Efficiency Report. SANS Institute.
Verizon. (2024). Data Breach Investigations Report (DBIR). Verizon Business.
ENISA. (2023). Threat Landscape for Phishing Attacks. European Union Agency for Cybersecurity.
INTERPOL. (2023). Global Phishing Trends and Multilingual Threat Analysis. International Criminal Police Organization.
Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. *Proceedings of NAACL-HLT 2019*,4171-4186.
Velickovic, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., & Bengio, Y. (2018). Graph attention networks. International Conference on Learning Representations.
Hamilton, W. L., Ying, R., & Leskovec, J. (2017). Inductive representation learning on large graphs. Advances in Neural Information Processing Systems, 30.
Hu, Z., Dong, Y., Wang, K., & Sun, Y. (2020). Heterogeneous graph transformer. Proceedings of The Web Conference 2020, 2704-2710.
abuse.ch. (2023). URLhaus Live Feed. https://urlhaus.abuse.ch/
Kaggle. (2023). Phishing websites dataset [Data set]. https://www.kaggle.com/datasets/shashwatwork/phishing-dataset-for-machine-learning
Nazario, J. (2023). Enhanced phishing email corpus. https://monkey.org/~jose/phishing/
PhishTank. (2022–2024). PhishTank archive. https://phishtank.org/
University of New Brunswick, Canadian Institute for Cybersecurity. (2018). CICIDS-2018 dataset. https://www.unb.ca/cic/datasets/ids-2018.html
Kipf, T. N., & Welling, M. (2017). Semi-supervised classification with graph convolutional networks. International Conference on Learning Representations. https://openreview.net/forum?id=SJU4ayYgl
Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2019). DistilBERT, a distilled version of BERT: Smaller, faster, cheaper and lighter. arXiv. https://arxiv.org/abs/1910.01108
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30. https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf
Fey, M., & Lenssen, J. E. (2019). PyTorch Geometric (Version 2.5.2) [Computer software]. https://github.com/pyg-team/pytorch_geometric
Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Kopf, A., Yang, E., DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S., Steiner, B., Fang, L., … Chintala, S. (2019). PyTorch: An imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems, 32, 8026–8037. https://papers.neurips.cc/paper/2019/file/bdbca288fee7f92f2bfa9f7012727740-Paper.pdf
Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., Cistac, P., Rault, T., Louf, R., Funtowicz, M., Davison, J., Shleifer, S., von Platen, P., Ma, C., Jernite, Y., Plu, J., Xu, C., Scao, T. L., Gugger, S., … Rush, A. M. (2020). Transformers: State-of-the-art natural language processing. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (pp. 38–45). Association for Computational Linguistics. https://www.aclweb.org/anthology/2020.emnlp-demos.6
SANS Institute. (2024). 2024 SANS SOC Survey: Efficiency and cost analysis. Retrieved from https://www.sans.org/analyst-program/soc-survey-2024
Barabasi, A. L. (2016). Network Science. Cambridge University Press.
Kahneman, D. (2011). Thinking, Fast and Slow. Farrar, Straus and Giroux.
Kipf, T. N., & Welling, M. (2017). Semi-supervised classification with graph convolutional networks. International Conference on Learning Representations.
National Institute of Standards and Technology (NIST). (2020). Framework for Improving Critical Infrastructure Cybersecurity.
Vaswani, A., et al. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30.

This paper presents a novel hybrid deep learning architecture for phishing detection that integrates BERT and Graph Neural Networks through cross-modal attention fusion. The proposed model addresses the multimodal nature of phishing attacks by simultaneously processing textual features via DistilBERT and structural relationships via a Heterogeneous Graph Transformer. Our methodology employs a security-aware loss function emphasizing false positive reduction and implements 5-fold cross-validation for robust evaluation.

Paper Submission Last Date
31 - July - 2026

SUBMIT YOUR PAPER CALL FOR PAPERS

Video Explanation for Published paper

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.