Misclassification in Big Data Soft Set Environment | Zendy

Jyoti Arora | Zendy; Kamaljit Kaur | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Misclassification in Big Data Soft Set Environment

Author(s) -

Jyoti Arora,

Kamaljit Kaur

Publication year - 2017

Publication title -

international journal of computer applications

Language(s) - English

Resource type - Journals

ISSN - 0975-8887

DOI - 10.5120/ijca2017914298

Subject(s) - computer science , set (abstract data type) , big data , data set , data science , information retrieval , data mining , artificial intelligence , programming language

In order to deal with classification for large data, data filtering and data cleansing are used as preprocessing methods. Generally it remove noisy data, misclassified data, errors and inconsistent data and results unreliable classification. Because sometimes cleaned data can also affect the prediction accuracy or other testing. In this paper, we performed analysis of misclassified data and identify how much data has been wrong classified. For future aspect, This misclassified data is need to be rectified to get valuable information. To demonstrate this concept, we have used Air Traffic dataset from Statistical Computing Statistical Graphics (SCSG) to examine misclassified content in data set. Five supervised classifiers are used: Support vector Machine, decision procedure, k-nearest neighbor, random forest and logistic regression. The results shows that out of these classifiers, SVM classify 86% of the data correctly and only 14% of data has misclassification.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research