Development of a Logistic Regression Model for Fake News Detection in Nigerian Social Media | International Journal of Innovative Science and Research Technology

Development of a Logistic Regression Model for Fake News Detection in Nigerian Social Media

Authors : Zainab Muhammad Nadada; Prema Kirubakaran; Muhammad Suleiman

Volume/Issue : Volume 11 - 2026, Issue 3 - March

Google Scholar : https://tinyurl.com/sbf3rtw8

Scribd : https://tinyurl.com/36hdpa7x

DOI : https://doi.org/10.38124/ijisrt/26mar1713


Abstract : The rapid growth of social media use in Nigeria poses significant social, political, and economic threats. Online communication tends to be informal, especially on platforms such as Instagram, which complicates the verification of information. Against this background, this study developed and evaluated a machine learning–based framework for fake news detection on the Instagram social media platform within the Nigerian context. The objectives of the study were to: (i) design a machine learning–based framework for fake news detection on Instagram; (ii) implement the designed framework using Term Frequency–Inverse Document Frequency (TF-IDF) feature extraction and Logistic Regression classification techniques; and (iii) evaluate the performance of the developed model using accuracy, precision, recall, and F1-score metrics. This work adopted an experimental research design. Publicly available benchmark datasets were obtained from the FakeNewsNet repository, which contains political news (PolitiFact) and entertainment news (GossipCop) labeled as fake or real. A Nigerian Pidgin English dataset was incorporated into the training process to improve contextual relevance and demonstrate transfer learning. Data preprocessing comprised text cleaning, label normalization, and stratified data splitting. Features were extracted with TF-IDF, and a Logistic Regression model with class weight balancing was trained and evaluated using an 80:20 train-test split. A chat model for news verification was implemented to simulate Instagram message input and return instant fake-or-real predictions with confidence scores. The results showed that the model achieved an overall accuracy of approximately 82%, with satisfactory precision, recall, and F1-score values, indicating effective classification performance. Pidgin English inputs were successfully classified, indicating that the model can adapt to local linguistic patterns. The study concluded that machine learning techniques, when combined with appropriate feature extraction and contextual data, can effectively support Nigerian fake news detection on social media platforms. The study recommends the use of larger Nigerian-language datasets, the exploration of advanced deep learning models, and full integration with social media APIs to enhance real-time deployment and improve detection accuracy.
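
The pipeline described in the abstract (stratified 80:20 split, TF-IDF features, class-weighted Logistic Regression, and a verification function that returns a label with a confidence score) can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the authors' implementation: the toy messages, the labels, the `verify` helper, and all parameter values are assumptions made for the example, and the actual study used FakeNewsNet and a Nigerian Pidgin English dataset.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, precision_recall_fscore_support
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline

# Hypothetical toy corpus (invented for illustration only; not from the paper).
texts = [
    "Dem say government go share free money to everybody tomorrow, forward am now",
    "Breaking: drinking salt water cures every sickness, doctors are hiding it",
    "Celebrity secretly replaced by a clone, insider reveals shocking truth",
    "Abeg share quick quick, bank go close all account wey no forward this message",
    "Miracle leaf makes you rich overnight, click the link before it is deleted",
    "The electoral commission published the official timetable for the election",
    "Central bank announces new interest rate after its monthly meeting",
    "Actor confirms role in upcoming film during a press conference",
    "Ministry of health don open new clinic for the community this week",
    "State government releases budget report for the current fiscal year",
]
labels = ["fake"] * 5 + ["real"] * 5

# Stratified 80:20 split keeps the fake/real ratio the same in both parts.
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.2, stratify=labels, random_state=42
)

# TF-IDF feature extraction followed by class-weighted Logistic Regression.
model = make_pipeline(
    TfidfVectorizer(lowercase=True),
    LogisticRegression(class_weight="balanced", max_iter=1000),
)
model.fit(X_train, y_train)

# Evaluate with the four metrics named in the abstract.
y_pred = model.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
precision, recall, f1, _ = precision_recall_fscore_support(
    y_test, y_pred, average="weighted", zero_division=0
)


def verify(message):
    """Return (predicted label, confidence) for one message (hypothetical helper)."""
    probs = model.predict_proba([message])[0]
    best = probs.argmax()
    return str(model.classes_[best]), float(probs[best])


label, confidence = verify("Dem say make everybody forward this message now now")
print(f"accuracy={accuracy:.2f} precision={precision:.2f} "
      f"recall={recall:.2f} f1={f1:.2f}")
print(f"prediction={label} confidence={confidence:.2f}")
```

With only ten invented messages the printed scores are meaningless; the sketch is meant to show how the steps fit together, with `class_weight="balanced"` reweighting classes inversely to their frequency and `predict_proba` supplying the confidence score returned to the user.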

Keywords : Term Frequency–Inverse Document Frequency (TF-IDF).


