z-logo
open-access-imgOpen Access
Evaluation of a Multiple Regression Model for Noisy and Missing Data
Author(s) -
Chanintorn Jittawiriyanukoon
Publication year - 2018
Publication title -
international journal of electrical and computer engineering
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.277
H-Index - 22
ISSN - 2088-8708
DOI - 10.11591/ijece.v8i4.pp2220-2229
Subject(s) - missing data , estimator , computer science , mean squared error , consistency (knowledge bases) , data mining , big data , regression , statistics , data collection , artificial intelligence , machine learning , mathematics
The standard data collection problems may involve noiseless data while on the other hand large organizations commonly experience noisy and missing data, probably concerning data collected from individuals. As noisy and missing data will be significantly worrisome for occasions of the vast data collection then the investigation of different filtering techniques for big data environment would be remarkable. A multiple regression model where big data is employed for experimenting will be presented. Approximation for datasets with noisy and missing data is also proposed. The statistical root mean squared error (RMSE) associated with correlation coefficient (COEF) will be analyzed to prove the accuracy of estimators. Finally, results predicted by massive online analysis (MOA) will be compared to those real data collected from the following different time. These theoretical predictions with noisy and missing data estimation by simulation, revealing consistency with the real data are illustrated. Deletion mechanism (DEL) outperforms with the lowest average percentage of error.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here