Enhanced fpgrowth framework and apriori algorithm utilizing tda for big data analysis| International Journal of Innovative Science and Research Technology

Enhanced FP-Growth Framework and Apriori Algorithm Utilizing TDA for Big Data Analysis

Authors : Abdulkader Mohammed Abdulla Al-Badani

Volume/Issue : Volume 10 - 2025, Issue 12 - December

Google Scholar : https://tinyurl.com/4kxxekfa

Scribd : https://tinyurl.com/2urztbnp

DOI : https://doi.org/10.38124/ijisrt/25dec579

PlumX Metrics

Semantic Scholar

ResearchGate

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.

Abstract : To efficiently analyze huge datasets, mining big data requires advanced computational techniques and algorithms. Apriori and FP-Growth are two of the most well-known algorithms in data mining. They help businesses make decisions based on customer trends and behaviors by finding patterns and correlations. Machine learning has made these algorithms even better by making them more accurate and efficient. The association rule approach does have some problems, though. For example, it needs a lot of memory, it has to search through all the data sets to find the frequency of an item set, and it sometimes makes rules that aren't the best. This study conducts a comparative analysis of the FP-Growth, Apriori, and TDA algorithms, demonstrating notable performance differences. The FP-Growth algorithm was much better at working with large datasets than the Apriori method, which had problems with scalability and took longer to process larger datasets, even though it was easier to build. This study suggests changes to the FP-Growth algorithm to fix these problems. It uses the TDA matrix to make a very compact FP-tree. This method tries to cut down on the time it takes to mine and the number of items that are created, which will make memory use more efficient and speed up processing for large datasets. In short, the proposed method is a promising way to make data mining processes more efficient and scalable, especially when it comes to big data analytics.

Keywords : FP-Growth Algorithm, Aprioiri Algorithm, FP-tree, Support Count, TDA.

References :

Riadi, I., Herman, H., Fitriah, F., Suprihatin, S., Muis, A., & Yunus, M. 2023. Implementation of association rule using apriori algorithm and frequent pattern growth for inventory control. Jurnal Infotel, 15(4),pp. 369-378.‏
Dunham M, Naughton J, Chen W D, et al. 2010..Proceedings of the 2000 ACM SIGMOD international conference on Management of data[J]. Water International, 26(4),pp. 607-609.
Singh R, Bhala A, Salunkhe J, et al.2015. Optimized Apriori Algorithm Using Matrix Data Structure[J]. International Journal of Research in Engineering and AppSciences,9(5),pp. 2249-3905.
Yu, C., Liang, Y., & Zhang, X. 2023. Research on Apriori algorithm based on compression processing and hash table. In Third International Conference on Machine Learning and Computer Application (ICMLCA 2022) (Vol. 12636, pp. 606-611). SPIE.‏
A. S. Hoong Lee, L. S. Yap, H. N. Chua, Y. C. Low, and M. A. Ismail. 2021. “A data mining approach to analyse crash injury severity level,” J. Eng. Sci. Technol., vol. 16, pp. 1–14.
S. Wang, J. Cao, and P. S. Yu. 2019 . “Deep learning for spatiotemporal data mining: A survey,” IEEE Transactions on Knowledge and Data Engineering, vol. 34, no. 8, pp. 1–21.
Elisa, E. 2018. Market Basket Analysis Pada Mini Market Ayu Dengan Algoritma Apriori. Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), 2(2),pp. 472-478.‏
Salam, A., Zeniarja, J., Wicaksono, W., & Kharisma, L. 2018. Pencarian Pola Asosiasi Untuk Penataan Barang Dengan Menggunakan Perbandingan Algoritma Apriori Dan Fp-Growth (Study Kasus Distro Epo Store Pemalang). Dinamik, 23(2),pp. 57-65.‏
M. D. Febrianto and A. Supriyanto. , 2022. “Implementasi algoritma apriori untuk menentukan pola pembelian produk,” Jurikom, vol. 9, no. 6, pp. 2010–2020.
M. M. Hasan and S. Z. Mishu..2018. “An adaptive method for mining frequent itemsets based on apriori and fp growth algorithm,” in 2018 International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2). IEEE, pp. 1–4.
A. Almira, S. Suendri, and A. Ikhwan, 2021.“Implementasi data mining menggunakan algoritma fp-growth pada analisis pola pencurian daya listrik,” Jurnal Informatika Universitas Pamulang, pp. 442–448.
J. Han, J. Pei, and Y. Yin. .2000. “Mining frequent patterns without candidate generation,” ACM sigmod record, no. 2, pp. 1– 12.
F. Wei and L. Xiang. 2015. “Improved frequent pattern mining algorithm based on fp-tree,” in Proceedings of The Fourth International Conference on Information Science and Cloud Computing (ISCC2015), pp. 18–19.
R. Krupali, D. Garg, and K. Kotecha. 2017. “An improved approach of fp-growth tree for frequent itemset mining using partition projection and parallel projection techniques,” International Recent and Innovation Trends in Computing and Communication, pp. 929–934.
AGRAWAL, Rakesh, et al. 1994. Fast algorithms for mining association rules. In: Proc. 20th int. conf. very large data bases, VLDB. pp. 487-499.‏
HAN, Jiawei; PEI, Jian; YIN, Yiwen. 2000. Mining frequent patterns without candidate generation. ACM sigmod record, 29.2: 1-12.
SHRIDHAR, M.; PARMAR. 2017. Mahesh. Survey on association rule mining and its approaches. Int J Comput Sci Eng, 5.3: 129-135.‏
KHANALI, Hoda; VAZIRI, Babak. 2017. A survey on improved algorithms for mining association rules. Int. J. Comput. A,pp. 165: 8887.
Sohrabi, M. K., & HASANNEJAD, M. H. 2016. Association rule mining using new FP-linked list algorithm.‏
Huaman Llanos, A. A., Huatangari, L. Q., Yalta Meza, J. R., Monteza, A. H., Adrianzen Guerrero, O. D., & Rodriguez Estacio, J. S. 2024. Toward Enhanced Customer Transaction Insights: An Apriori Algorithm-based Analysis of Sales Patterns at University Industrial Corporation. International Journal of Advanced Computer Science & Applications, 15(2).‏
BALA, Alhassan, et al. 2016. Performance analysis of apriori and fp-growth algorithms (association rule mining). Int. J. Computer Technology &Applications, 7.2,pp. 279-293.‏
Al-Maolegi, M., & Arkok, B. 2014. An improved Apriori algorithm for association rules. arXiv preprint arXiv:1403.3948.‏‏
Yuan, X. 2017. An improved Apriori algorithm for mining association rules. In AIP conference proceedings (Vol. 1820, No. 1). AIP Publishing.‏
Han J, Pei J, Yin Y. 2000. Mining frequent patterns without candidate generation (Acm Sigmod Record, 29(2)), pp.1-12.
Gruca, A. 2014. Improvement of FP-Growth algorithm for mining description-oriented rules. In Man-Machine Interactions, Part of Advances in Intelligent Systems and Computing, (AISC), Springer, vol. 242, pp. 183-192.
Sohrabi, M. K., and Marzooni, H. H. 2016. Association rule mining using new FP-Linked list algorithm. Journal of Advances in Computer Research (JACR), 7(1), pp. 23-34.‏
B. Zhang. 2021 .“Optimization of fp-growth algorithm based on cloud computing and computer big data,” International Journal of System Assurance Engineering and Management, pp. 853–863.
Blake, C. L., and Merz., M. J, UCI Repository of Machine Learning Databases [http://www. ics. uci. edu/~ mlearn/ MLRepository. html]. Irvine, CA: University of California‖, Department of Information and Computer Science.

To efficiently analyze huge datasets, mining big data requires advanced computational techniques and algorithms. Apriori and FP-Growth are two of the most well-known algorithms in data mining. They help businesses make decisions based on customer trends and behaviors by finding patterns and correlations. Machine learning has made these algorithms even better by making them more accurate and efficient. The association rule approach does have some problems, though. For example, it needs a lot of memory, it has to search through all the data sets to find the frequency of an item set, and it sometimes makes rules that aren't the best. This study conducts a comparative analysis of the FP-Growth, Apriori, and TDA algorithms, demonstrating notable performance differences. The FP-Growth algorithm was much better at working with large datasets than the Apriori method, which had problems with scalability and took longer to process larger datasets, even though it was easier to build. This study suggests changes to the FP-Growth algorithm to fix these problems. It uses the TDA matrix to make a very compact FP-tree. This method tries to cut down on the time it takes to mine and the number of items that are created, which will make memory use more efficient and speed up processing for large datasets. In short, the proposed method is a promising way to make data mining processes more efficient and scalable, especially when it comes to big data analytics.

Keywords : FP-Growth Algorithm, Aprioiri Algorithm, FP-tree, Support Count, TDA.

Paper Submission Last Date
31 - July - 2026

SUBMIT YOUR PAPER CALL FOR PAPERS

Video Explanation for Published paper

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.