An enhancement of deep feature synthesis algorithm using mean median and mode imputation| International Journal of Innovative Science and Research Technology

An Enhancement of Deep Feature Synthesis Algorithm Using Mean, Median, and Mode Imputation

Authors : Josefa Ysabelle J. Maliwat; Princess A. Ylade; Richard C. Regala; Dan Michael A. Cortez; Antolin J. Alipio; Khatalyn E. Mata; Mark Christopher R. Blanco

Volume/Issue : Volume 7 - 2022, Issue 4 - April

Google Scholar : https://bit.ly/3IIfn9N

Scribd : https://bit.ly/39xIsIT

DOI : https://doi.org/10.5281/zenodo.6558729

Abstract : The Deep Feature Synthesis (DFS) algorithm automates feature engineering and is capable of extracting and applying complicated featuresto a variety of processes. Due to the novelty of DFS as a method for feature engineering, critical ways for dealing with missing values and unwanted data in a dataset have yet to be established. This paper discusses the usage of mean, median, and mode imputation to preprocess data before analyzing it.However, it is only limited to displaying the differences between nonimputed and imputed datasets. This strategy enables users to obtain more precise results by eliminating biased estimations. This study demonstrates that there is a distinct difference between the two datasets. This paper is concluded by proving that imputing datasets will cause distinctness in the results compared to the results of the datasets with missing and unwanted values.

Keywords : Deep Feature Synthesis, Auto Feature Engineering, Imputation

The Deep Feature Synthesis (DFS) algorithm automates feature engineering and is capable of extracting and applying complicated featuresto a variety of processes. Due to the novelty of DFS as a method for feature engineering, critical ways for dealing with missing values and unwanted data in a dataset have yet to be established. This paper discusses the usage of mean, median, and mode imputation to preprocess data before analyzing it.However, it is only limited to displaying the differences between nonimputed and imputed datasets. This strategy enables users to obtain more precise results by eliminating biased estimations. This study demonstrates that there is a distinct difference between the two datasets. This paper is concluded by proving that imputing datasets will cause distinctness in the results compared to the results of the datasets with missing and unwanted values.

Keywords : Deep Feature Synthesis, Auto Feature Engineering, Imputation