Leveraging Machine Learning for Lung Cancer Risk Assessment Based on Survey Insights


Authors : Sumit Mhaske; Mandar Dakhorkar; Vanshika Khiyani; Rudrani Patil; Ganesh Shelke

Volume/Issue : Volume 10 - 2025, Issue 4 - April


Google Scholar : https://tinyurl.com/4jukp4xx

Scribd : https://tinyurl.com/yf9z3uxa

DOI : https://doi.org/10.38124/ijisrt/25apr1335

Google Scholar

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.

Note : Google Scholar may take 15 to 20 days to display the article.


Abstract : Lung cancer is still among the top cancers to cause cancer death in humans around the world. It has a lot to do with lifestyle and smoking- individual factors that contribute to lung cancer development. This research study seeks to analyze the viability of the machines through algorithms for the likely risk prediction of lung cancer through survey data- that is, symptoms, behavioral traits, and demographic data. The dataset consists of information such as smoking habits along with anxiety levels, fatigue, and other symptoms employed. Various machine learning models were trained and evaluated on Logistic Regression, Decision Tree, Random Forest, Naive Bayes, and Support Vector Machines (SVM) algorithms. Among those, Random Forest proved to be the best predictor giving about 96.7% accuracy and strong precision and recall values, indicating its effectiveness in identifying high-risk subjects. This research indicates that machine learning can be applied to healthcare for early diagnosis and screening.

Keywords : Lung Cancer; Machine Learning; Prediction Model; Survey Data; Random Forest; Health Informatics; Risk Assessment.

References :

  1. Narvekar, S., Shirodkar, M., Raut, T., Vaingankar, P., Kumar, K. M. C., & Aswale, S. (2022). A survey on detection of lung cancer using different image processing techniques. 2022 3rd International Conference on Intelligent Engineering and Management (ICIEM), 13–18. https://doi.org/10.1109/iciem54221.2022.9853190
  2. J. Clerk Maxwell, A Treatise on Electricity and Magnetism, 3rd ed., vol. 2. Oxford: Clarendon, 1892, pp.68-73. Nadkarni, N. S., & Borkar, S. (2019). Detection of Lung Cancer in CT Images using Image Processing. In Goa College of Engineering, Proceedings of the Third International Conference on Trends in Electronics and Informatics (ICOEI 2019) (p. 863).
  3. Shafiq, S., Asghar, M. A., Amjad, M. E., & Ibrahim, J. (2022). An Effective Early Stage Detection of Lung Cancer Using Fuzzy Local Information cMean and GoogLeNet. New, 1–6.
  4. Vikruthi, S., Thippagudisa, K. B., Basha, P. H., Vamsi, P., Chandra, G. M., & Kiran, V. S. (2023).  Performance Analysis of Gene Expression Profiles of Lung Cancer Prediction using JR Algorithm. New, 1731–1736.
  5. B, K. S. (2020). Prediction of lung cancer using Convolutional Neural Network (CNN). International Journal of Advanced Trends in Computer Science and Engineering, 9(3), 3361–3365.
  6. Bhargav, A. L., & Ashokkumar, C. (2024). AI-Driven Insights: A Survey on Innovative Approach for Lung Cancer Prediction Utilizing Machine Learning and Deep Learning Methods : Lung Cancer Prediction Utilizing Machine Learning and Deep Learning Methods. New, 1152–1158.
  7. Baker, Q. B., Gharaibeh, M., & Al-Harahsheh, Y. (2021). Predicting Lung Cancer Survival Time Using Deep Learning Techniques. Ew, 177–181.
  8. K, A., T, R., B, S., & M, K. (2024). Real-Time lung cancer detection using GAN-Enhanced LDCT imaging and CNN-Based classification for clinical accuracy. 2021 International Conference on System, Computation, Automation and Networking (ICSCAN), 1–6.
  9. S. H. Begum, M. I. Baig, M. A. Hussain and M. A. Muqeet, "A Lightweight Deep Learning Model for Automatic Diagnosis of Lung Cancer," 2022 IEEE 2nd International Conference on Mobile Networks and Wireless Communications (ICMNWC), Tumkur, Karnataka, India, 2022
  10. A. Jawaid, J. Hafeez, S. Khan and A. Ur Rehman, "Lung Cancer Detection using Artificial Neural Network on Android," 2023 Global Conference on Wireless and Optical Technologies (GCWOT), Malaga, Spain, 2023, pp. 1-8, doi: 10.1109/GCWOT57803.2023.10064658.
  11. A. A. Sagaya Priscilla and R. Balamanigandan, "Hybrid Artificial Neural Networks for Lung Cancer Detection," 2024 International Conference on Sustainable Communication Networks and Application (ICSCNA), Theni, India, 2024, pp. 757-761, doi: 10.1109/ICSCNA63714.2024.10864149.

Lung cancer is still among the top cancers to cause cancer death in humans around the world. It has a lot to do with lifestyle and smoking- individual factors that contribute to lung cancer development. This research study seeks to analyze the viability of the machines through algorithms for the likely risk prediction of lung cancer through survey data- that is, symptoms, behavioral traits, and demographic data. The dataset consists of information such as smoking habits along with anxiety levels, fatigue, and other symptoms employed. Various machine learning models were trained and evaluated on Logistic Regression, Decision Tree, Random Forest, Naive Bayes, and Support Vector Machines (SVM) algorithms. Among those, Random Forest proved to be the best predictor giving about 96.7% accuracy and strong precision and recall values, indicating its effectiveness in identifying high-risk subjects. This research indicates that machine learning can be applied to healthcare for early diagnosis and screening.

Keywords : Lung Cancer; Machine Learning; Prediction Model; Survey Data; Random Forest; Health Informatics; Risk Assessment.

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.
Subscribe
OR

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.
Subscribe