Intelligent Customer Segmentation in Digital Commerce Using K-Means Clustering


Authors : Damla Demir; Gökçe Karahan Adalı

Volume/Issue : Volume 10 - 2025, Issue 12 - December


Google Scholar : https://tinyurl.com/576nam3b

Scribd : https://tinyurl.com/y6ktuc8a

DOI : https://doi.org/10.38124/ijisrt/25dec513

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.

Note : Google Scholar may take 30 to 40 days to display the article.


Abstract : The rapid rise in e-commerce has forced companies to have good knowledge of customer behavior and tailor the marketing strategies accordingly. This study discusses the appropriateness of the K-Means algorithm for customer segmentation from behavioral and demographic data obtained by a systematic Likert-scale survey. Clusters with high interpretability were obtained and validated through silhouette analysis with values up to 0.75, indicating high internal consistency. Key findings show that female interviewees prefer shopping by mobile to a far greater extent than male interviewees, while male interviewees are more responsive to promotional emails and SMS. Younger and middle-aged users are similarly more susceptible to social media advertising, with older segments having more neutral or selective orientations. These results illustrate the complexity of customers' behavior and that demographic and behavioral data should be combined in segmentation studies. By its demonstration of the value of clustering techniques in providing insightful customer profiles, this study contributes to practical and methodological applications to data-driven decision- making in e-commerce. Future research is encouraged to expand the dataset size and incorporate more advanced methods such as predictive modeling and sentiment analysis to further improve segmentation precision.

Keywords : K-Means Clustering, Customer Segmentation, E-commerce Analytics.

References :

  1.   Aggarwal, D. (2023). Exploring the Role of AI for Online Grocery Shopping through Enhancing Personalized Recommendations and Customer Segmentation. Technoarete Transactions On Advances In Computer Applications (TTACA), 2(3).
  2. Alzahrani, R., Habib, M., & Khan, M. A. (2022). Student Satisfaction Clustering using K-Means with Orange Data Mining Tool. International Journal of Advanced Computer Science and Applications, 13(2), 34-40.
  3. Analytics Vidhya (2025). What is k-means clustering? https://www.analyticsvidhya.com/blog/2019/08/comprehensive-guide-k-means-clustering/ Accessed on  02.03.2025)
  4. Cooil, B., Aksoy, L., & Keiningham, T. L. (2008). Approaches to Customer Segmentation. Journal of Relationship Marketing, 6(3–4), 9–39. https://doi.org/10.1300/J366v06n03_02
  5. Daruvuri, S., Raj, A., & Tripathi, M. (2025). Improving customer segmentation through PCA-enhanced K-Means clustering in online retail datasets. Journal of Big Data, 12(1), 48. https://journalofbigdata.springeropen.com/articles/10.1186/s40537-025-01111-y
  6. García-Murillo, M., & Annabi, H. (2002). Customer Knowledge Management. Journal of the Operational Research Society, 53(8), 875–884.
  7. Google Developers. (2024). What is clustering? Accessed on 19.03.2025, https://developers.google.com/machine-learning/clustering/overview
  8. IBM, (2025, March 02) K-means clustering. https://www.ibm.com/think/topics/k-means-clustering#:~:text=K-means%20is%20an%20iterative,the%20characteristics%20of%20the%20data
  9. John, I., Shobayo, O., & Ogunleye, O. (2024). Clustering customer segments in UK retail using RFM   and machine learning algorithms. ArXiv preprint. https://arxiv.org/abs/2402.04103
  10. Kashwan, K. R. & C M, Velu. (2013). Customer Segmentation Using Clustering and Data Mining Techniques. International Journal of Computer Theory and Engineering. 5. 856-861. 10.7763/IJCTE.2013.V5.811.
  11. Kirubakaran, Vignesh Saravanan & G, Sakthi. (2025). Customer Segmentation using Clustering   Techniques: Data-Driven Approach to Enhance Marketing Strategy. 10.1109/ICSCNA63714.2024.10864053.
  12. Kuhn, Max & Johnson, Kjell. (2013). Applied Predictive Modeling. 10.1007/978-1-4614-6849-3.
  13. Lau, M. M., Cheung, R., Lam, A. Y., & Chu, Y. T. (2019). Segmenting online consumers by motivations for using e-commerce platforms. Journal of Retailing and Consumer Services, 47, 70-78.
  14. Nimbalkar, A. A., & Berad, A. T. (2021). The increasing importance of AI applications in e-commerce. Vidyabharati International Interdisciplinary Research Journal, 13(1), 388-391.
  15. Nugroho, M. A., Darmawan, M. A., & Yudhistira, A. (2024). Customer segmentation using optimized K-Means clustering in retail data. Journal of Intelligent Data Science and Systems, 4(2), 112–120. https://www.idss.iocspublisher.org/index.php/jidss/article/view/236
  16. Rahm, E., & Do, H. H. (2000). Data cleaning: Problems and current approaches. IEEE Data Eng. Bull., 23(4), 3-13.
  17. Rimakka, H., Öztürk, A., & Yilmaz, S. (2023). User segmentation based on purchasing habits and preferences on the Amazon platform using K-Means clustering. International Journal of Data Science Research, 6(3), 201–210. https://www.researchgate.net/publication/377972483
  18. Semoglou, V., Papakostas, G. A., & Likas, A. (2025). K-Sil: A silhouette-based improvement of K-Means clustering for better customer segmentation. Expert Systems with Applications, 233, 120900. https://doi.org/10.1016/j.eswa.2025.120900
  19. Syakur, M. A., Khotimah, B. K., Rochman, E. M. S., & Satoto, B. D. (2018). Integration of K-Means clustering method and elbow method for identification of the best customer profile cluster. IOP Conference Series: Materials Science and Engineering, 336(1), 012017. https://doi.org/10.1088/1757-899X/336/1/012017
  20. Tuma, M. N., Decker, R., & Scholz, S. W. (2011). A survey-based customer segmentation using clustering of latent class probabilities. European Journal of Operational Research, 213(3), 564-572.
  21. Wang, T. (2025). Reinforcement learning-based hybrid clustering for customer behavior analysis. Applied Soft Computing, 145, 110401. https://doi.org/10.1016/j.asoc.2025.110401
  22. Wirth, R., & Hipp, J. (2000, April). CRISP-DM: Towards a standard process model for data mining. In Proceedings of the 4th international conference on the practical applications of knowledge discovery and data mining (Vol. 1, pp. 29-39).
  23. Wu, R. S., & Chou, P. H. (2011). Customer segmentation of multiple category data in e-commerce using a soft-clustering approach. Electronic Commerce Research and Applications, 10(3), 331-341. https://doi.org/10.1016/j.elerap.2010.11.002
  24. Zhou, Y., Cheng, L., & Xu, H. (2020). Personalized Recommendation System Based on Customer Segmentation Using K-means Clustering and PCA. Procedia Computer Science, 174, 838-843  

The rapid rise in e-commerce has forced companies to have good knowledge of customer behavior and tailor the marketing strategies accordingly. This study discusses the appropriateness of the K-Means algorithm for customer segmentation from behavioral and demographic data obtained by a systematic Likert-scale survey. Clusters with high interpretability were obtained and validated through silhouette analysis with values up to 0.75, indicating high internal consistency. Key findings show that female interviewees prefer shopping by mobile to a far greater extent than male interviewees, while male interviewees are more responsive to promotional emails and SMS. Younger and middle-aged users are similarly more susceptible to social media advertising, with older segments having more neutral or selective orientations. These results illustrate the complexity of customers' behavior and that demographic and behavioral data should be combined in segmentation studies. By its demonstration of the value of clustering techniques in providing insightful customer profiles, this study contributes to practical and methodological applications to data-driven decision- making in e-commerce. Future research is encouraged to expand the dataset size and incorporate more advanced methods such as predictive modeling and sentiment analysis to further improve segmentation precision.

Keywords : K-Means Clustering, Customer Segmentation, E-commerce Analytics.

CALL FOR PAPERS


Paper Submission Last Date
31 - December - 2025

Video Explanation for Published paper

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.
Subscribe
OR

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.
Subscribe