Evaluation of Three Simple Imputation Methods for Enhancing Preprocessing of Data with Missing Values | Zendy

R. Somasundaram | Zendy; R. Nedunchezhian | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Evaluation of Three Simple Imputation Methods for Enhancing Preprocessing of Data with Missing Values

Author(s) -

R. Somasundaram,

R. Nedunchezhian

Publication year - 2011

Publication title -

international journal of computer applications

Language(s) - English

Resource type - Journals

ISSN - 0975-8887

DOI - 10.5120/2619-3544

Subject(s) - computer science , imputation (statistics) , missing data , preprocessor , data pre processing , simple (philosophy) , data mining , information retrieval , artificial intelligence , machine learning , philosophy , epistemology

of the important stages of data mining is preprocessing, where the data is prepared for different mining tasks. Often, the real-world data tends to be incomplete, noisy, and inconsistent. It is very common that the data are not obtainable for every observation of every variable. So the presence of missing variables is obvious in the data set. A most important task when preprocessing the data is, to fill in missing values, smooth out noise and correct inconsistencies. This paper presents the missing value problem in data mining and evaluates some of the methods generally used for missing value imputation. In this work, three simple missing value imputation methods are implemented namely (1) Constant substitution, (2) Mean attribute value substitution and (3) Random attribute value substitution. The performance of the three missing value imputation algorithms were measured with respect to different rate or different percentage of missing values in the data set by using some known clustering methods. To evaluate the performance, the standard WDBC data set has been used.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research