Automatic missing value imputation for cleaning phase of diabetic’s readmission prediction model | Zendy

Jesmeen Mohd Zebaral Hoque | Zendy; J. Hossen | Zendy; Shohel Sayeed | Zendy; Chy. Mohammed Tawsif K. | Zendy; Jaya Ganesan | Zendy; J. Emerson Raja | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Automatic missing value imputation for cleaning phase of diabetic’s readmission prediction model

Author(s) -

Jesmeen Mohd Zebaral Hoque,

J. Hossen,

Shohel Sayeed,

Chy. Mohammed Tawsif K.,

Jaya Ganesan,

J. Emerson Raja

Publication year - 2022

Publication title -

international journal of power electronics and drive systems/international journal of electrical and computer engineering

Language(s) - English

Resource type - Journals

eISSN - 2722-2578

pISSN - 2722-256X

DOI - 10.11591/ijece.v12i2.pp2001-2013

Subject(s) - imputation (statistics) , computer science , missing data , data mining , support vector machine , adaboost , data set , feature selection , random forest , cross validation , artificial intelligence , logistic regression , machine learning

Recently, the industry of healthcare started generating a large volume of datasets. If hospitals can employ the data, they could easily predict the outcomes and provide better treatments at early stages with low cost. Here, data analytics (DA) was used to make correct decisions through proper analysis and prediction. However, inappropriate data may lead to flawed analysis and thus yield unacceptable conclusions. Hence, transforming the improper data from the entire data set into useful data is essential. Machine learning (ML) technique was used to overcome the issues due to incomplete data. A new architecture, automatic missing value imputation (AMVI) was developed to predict missing values in the dataset, including data sampling and feature selection. Four prediction models (i.e., logistic regression, support vector machine (SVM), AdaBoost, and random forest algorithms) were selected from the well-known classification. The complete AMVI architecture performance was evaluated using a structured data set obtained from the UCI repository. Accuracy of around 90% was achieved. It was also confirmed from cross-validation that the trained ML model is suitable and not over-fitted. This trained model is developed based on the dataset, which is not dependent on a specific environment. It will train and obtain the outperformed model depending on the data available.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore