z-logo
open-access-imgOpen Access
Advanced Data Imputation Techniques for Predicting Type 2 Diabetes using Machine Learning
Author(s) -
Sofia Goel,
Sudhansh Sharma
Publication year - 2019
Publication title -
international journal of innovative technology and exploring engineering
Language(s) - English
Resource type - Journals
ISSN - 2278-3075
DOI - 10.35940/ijitee.b7466.129219
Subject(s) - imputation (statistics) , computer science , missing data , machine learning , outlier , artificial intelligence , scalability , data mining , database
Type 2 Diabetes mellitus is a serious metabolic disorder that is prevailing worldwide at an alarming rate. Medical dataset often suffers from the problem of missing data and outliers. However, handling of missing data with traditional mean based imputing may lead towards a bias model and return unpredictable outcome. Making complex models by combining multiple classifiers as well as some other methods could increase the accuracy which again is a time-consuming approach and requires heavy computation capability which significantly increases the deployment cost. The proposed research is to design a model to classify the data using class wise imputation technique and outlier handling. Performance of the proposed model is evaluated on nine machine learning classifiers and compared with traditional approaches like simple mean, median, and linear regression. Experimental results show the superiority of the proposed model in terms of classification accuracy and model complexity. The accuracy achieved by the proposed approach is 88.01%, which is highest as compared to the previous studies. The proposed research work is presented to improve accuracy, scalability and overall performance of the classification in the medical dataset, which ultimately proves to be a lifesaver if the diagnosis is achieved efficiently at an early stage.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here