Open Access

Improving accuracy of missing data imputation in data mining

Author(s) - Nzar A. Ali, Zhyan M. Omer

Publication year - 2017

Publication title - Kurdistan Journal of Applied Research

Language(s) - English

Resource type - Journals

eISSN - 2411-7706

pISSN - 2411-7684

DOI - 10.24017/science.2017.3.30

Subject(s) - missing data, imputation (statistics), data mining, raw data, computer science, machine learning, programming language

Raw data in the real world is dirty. Every large data repository contains various types of anomalous values that influence the results of analysis. In data mining, good models usually need good data, yet real-world databases are not always clean: they include noise, incomplete data, duplicate records, inconsistent data, and missing values. Missing data is a common drawback in many real-world data sets. In this paper, we propose an algorithm that improves on the imputation step of the MIGEC algorithm for handling missing values. We apply grey relational analysis (GRA) to attribute values instead of instance values. Missing values are initially imputed by mean imputation and then estimated by our proposed algorithm (PA); each estimated value is then treated as a complete value when imputing the next missing value. We compare our proposed algorithm with several other algorithms, such as MMS, HDI, KNNMI, FCMOCS, CRI, CMI, NIIA, and MIGEC, under different missingness mechanisms. Experimental results demonstrate that the proposed algorithm yields lower RMSE values than the other algorithms under all missingness mechanisms.
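The abstract names three building blocks without giving formulas: mean imputation as the initial fill, grey relational analysis between attributes, and RMSE as the evaluation metric. The sketch below illustrates each one in isolation. It is not the authors' proposed algorithm or MIGEC; the function names, the use of None to mark missing cells, and the distinguishing coefficient zeta = 0.5 are assumptions made for illustration, and the grey relational grade follows the common textbook (Deng) formulation, assuming the series are already on comparable scales.

```python
import math


def mean_impute(rows):
    """Replace None entries with the mean of the observed values in that column.

    Assumes every column has at least one observed value.
    """
    n_cols = len(rows[0])
    means = []
    for j in range(n_cols):
        observed = [r[j] for r in rows if r[j] is not None]
        means.append(sum(observed) / len(observed))
    return [[means[j] if r[j] is None else r[j] for j in range(n_cols)]
            for r in rows]


def rmse_on_missing(truth, masked, imputed):
    """Root mean squared error over only the cells that are None in `masked`."""
    squared_errors = [
        (truth[i][j] - imputed[i][j]) ** 2
        for i, row in enumerate(masked)
        for j, value in enumerate(row)
        if value is None
    ]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def grey_relational_grades(reference, candidates, zeta=0.5):
    """Grey relational grade of each candidate series against a reference series.

    Textbook formulation: the coefficient at position k is
    (d_min + zeta * d_max) / (d_k + zeta * d_max), where d_k is the absolute
    difference at k and d_min, d_max are taken over all candidates and
    positions. The grade is the mean coefficient of a series.
    """
    deltas = [[abs(r - c) for r, c in zip(reference, cand)]
              for cand in candidates]
    flat = [d for row in deltas for d in row]
    d_min, d_max = min(flat), max(flat)
    if d_max == 0:
        # Every candidate is identical to the reference.
        return [1.0 for _ in candidates]
    return [
        sum((d_min + zeta * d_max) / (d + zeta * d_max) for d in row) / len(row)
        for row in deltas
    ]
```

To compare attributes rather than instances, one would pass columns of the (initially mean-imputed) data as the reference and candidate series; a higher grade indicates an attribute whose values track the reference attribute more closely. How the paper turns those grades into an estimate is not stated in the abstract, so that step is left out here.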
