Authors :
N. Bhavana; Billu Harathi
Volume/Issue :
Volume 10 - 2025, Issue 5 - May
Google Scholar :
https://tinyurl.com/37x74zzk
DOI :
https://doi.org/10.38124/ijisrt/25may375
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
Fraudulent insurance claims represent a significant threat to the financial stability of insurance companies and
contribute to increased premiums for policyholders. With the growing volume of data in the insurance industry, the need
for efficient and accurate fraud detection mechanisms has become more pressing. Machine learning (ML) offers powerful
tools to detect anomalous patterns and identify potentially fraudulent claims. This study investigates the application of
various machine learning algorithms to detect and analyze fraudulent insurance claims. Techniques such as Decision
Trees, Random Forest, Support Vector Machines, and Gradient Boosting are explored to evaluate their effectiveness in
identifying irregularities. The proposed system utilizes a structured dataset comprising historical claim information,
including both legitimate and fraudulent cases. Feature selection methods are applied to enhance model accuracy and
interpretability. Performance metrics such as accuracy, precision, recall, and F1-score are used to evaluate the models.
The results demonstrate that machine learning can significantly enhance the detection of fraudulent claims, leading to cost
savings and increased operational efficiency. Furthermore, the analysis provides insights into the most influential features
contributing to fraudulent behavior, offering valuable support to insurers in decision-making processes. The integration of
ML in fraud detection not only automates the identification process but also continuously improves with the influx of new
data. This research supports the implementation of intelligent systems in the insurance sector to combat fraud effectively.
Keywords :
Fraudulent insurance, Machine learning (ML), Gradient Boosting.
References :
- Baesens, B., Van Vlasselaer, V., & Verbeke, W. (2015). Fraud analytics using descriptive, predictive, and social network techniques: A guide to data science for fraud detection. John Wiley & Sons.
- Buda, A., & Jarynowski, A. (2019). Classification of insurance fraud using machine learning techniques. In Proceedings of the Federated Conference on Computer Science and Information Systems (pp. 345-349).
- Ngai, E. W., Hu, Y., Wong, Y. H., Chen, Y., & Sun, X. (2011). The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature. Decision Support Systems, 50(3), 559-569.
- Phua, C., Lee, V., Smith, K., & Gayler, R. (2005). A comprehensive survey of data mining-based fraud detection research. arXiv preprint arXiv:1009.6119.
- Viaene, S., Dedene, G., & Derrig, R. A. (2002). Auto claim fraud detection using Bayesian learning neural networks. Expert Systems with Applications, 29(3), 653-666.
- Bolton, R. J., & Hand, D. J. (2002). Statistical fraud detection: A review. Statistical Science, 17(3), 235-255.
- Kou, Y., Lu, C. T., Sirwongwattana, S., & Huang, Y. P. (2004). Survey of fraud detection techniques. In IEEE International Conference on Networking, Sensing and Control (Vol. 2, pp. 749-754).
- Whitrow, C., Hand, D. J., Juszczak, P., Weston, D., & Adams, N. M. (2009). Transaction aggregation as a strategy for credit card fraud detection. Data Mining and Knowledge Discovery, 18(1), 30-55.
- Sahin, Y., & Duman, E. (2011). Detecting credit card fraud by ANN and logistic regression. Expert Systems with Applications, 38(10), 13305-13310.
- Bhattacharyya, S., Jha, S., Tharakunnel, K., & Westland, J. C. (2011). Data mining for credit card fraud: A comparative study. Decision Support Systems, 50(3), 602-613.
Fraudulent insurance claims represent a significant threat to the financial stability of insurance companies and
contribute to increased premiums for policyholders. With the growing volume of data in the insurance industry, the need
for efficient and accurate fraud detection mechanisms has become more pressing. Machine learning (ML) offers powerful
tools to detect anomalous patterns and identify potentially fraudulent claims. This study investigates the application of
various machine learning algorithms to detect and analyze fraudulent insurance claims. Techniques such as Decision
Trees, Random Forest, Support Vector Machines, and Gradient Boosting are explored to evaluate their effectiveness in
identifying irregularities. The proposed system utilizes a structured dataset comprising historical claim information,
including both legitimate and fraudulent cases. Feature selection methods are applied to enhance model accuracy and
interpretability. Performance metrics such as accuracy, precision, recall, and F1-score are used to evaluate the models.
The results demonstrate that machine learning can significantly enhance the detection of fraudulent claims, leading to cost
savings and increased operational efficiency. Furthermore, the analysis provides insights into the most influential features
contributing to fraudulent behavior, offering valuable support to insurers in decision-making processes. The integration of
ML in fraud detection not only automates the identification process but also continuously improves with the influx of new
data. This research supports the implementation of intelligent systems in the insurance sector to combat fraud effectively.
Keywords :
Fraudulent insurance, Machine learning (ML), Gradient Boosting.