An Enhancement of Deep Feature Synthesis Algorithm Using Mean, Median, and Mode Imputation


Authors : Josefa Ysabelle J. Maliwat; Princess A. Ylade; Richard C. Regala; Dan Michael A. Cortez; Antolin J. Alipio; Khatalyn E. Mata; Mark Christopher R. Blanco

Volume/Issue : Volume 7 - 2022, Issue 4 - April

Google Scholar : https://bit.ly/3IIfn9N

Scribd : https://bit.ly/39xIsIT

DOI : https://doi.org/10.5281/zenodo.6558729

Abstract : The Deep Feature Synthesis (DFS) algorithm automates feature engineering and is capable of extracting and applying complicated featuresto a variety of processes. Due to the novelty of DFS as a method for feature engineering, critical ways for dealing with missing values and unwanted data in a dataset have yet to be established. This paper discusses the usage of mean, median, and mode imputation to preprocess data before analyzing it.However, it is only limited to displaying the differences between nonimputed and imputed datasets. This strategy enables users to obtain more precise results by eliminating biased estimations. This study demonstrates that there is a distinct difference between the two datasets. This paper is concluded by proving that imputing datasets will cause distinctness in the results compared to the results of the datasets with missing and unwanted values.

Keywords : Deep Feature Synthesis, Auto Feature Engineering, Imputation

The Deep Feature Synthesis (DFS) algorithm automates feature engineering and is capable of extracting and applying complicated featuresto a variety of processes. Due to the novelty of DFS as a method for feature engineering, critical ways for dealing with missing values and unwanted data in a dataset have yet to be established. This paper discusses the usage of mean, median, and mode imputation to preprocess data before analyzing it.However, it is only limited to displaying the differences between nonimputed and imputed datasets. This strategy enables users to obtain more precise results by eliminating biased estimations. This study demonstrates that there is a distinct difference between the two datasets. This paper is concluded by proving that imputing datasets will cause distinctness in the results compared to the results of the datasets with missing and unwanted values.

Keywords : Deep Feature Synthesis, Auto Feature Engineering, Imputation

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.
Subscribe
OR

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.
Subscribe