z-logo
open-access-imgOpen Access
Data Cleaning in Cloud Platform
Author(s) -
V. Ramya,
S. Jayasimha
Publication year - 2020
Publication title -
international journal of recent technology and engineering
Language(s) - English
Resource type - Journals
ISSN - 2277-3878
DOI - 10.35940/ijrte.a3088.059120
Subject(s) - computer science , raw data , data quality , task (project management) , data mining , process (computing) , object (grammar) , data warehouse , quality (philosophy) , polishing , data element , cloud computing , association rule learning , data mapping , database , metadata , artificial intelligence , engineering , systems engineering , mechanical engineering , metric (unit) , philosophy , operations management , epistemology , programming language , operating system
Data is very valuable and it is generated in large volumes. The Use of high-quality data for making quality decisions has become a huge task which helps people to make better decisions, analysis, predictions. We are surrounded by data with errors, Data cleaning is a delayed, complicated task and considered costly. Data polishing is important since it is necessary to remove errors from the data before transferring to the data warehouse since poor quality data is eliminated to get the desired results. The Error-free data will produce precise and accurate results when queried. Hence consistent and proper data is required for the decision making. The characteristics of data polishing is data repairing and data association. Identifying the homogeneous object and linking it to the most associated object is defined as Association. The process of making the database reliable by repairing and finding the faults is defined as repairing. In the case of big data applications, we do not use all the existing data, we use only subsets of appropriate data. Association is the process of converting extensive amounts of raw data to subsets of appropriate data that are useful. Once we get the appropriate data, the available data is analyzed and it leads to knowledge [14]. Multiple approaches are used to associate the given data and to achieve meaningful and useful knowledge to fix or repair [12]. Maintaining polished quality of data is referred to as data polishing. Usually the objectives of data polishing are not properly defined. This paper will discuss the goals of data cleaning and different approaches for data cleaning platforms.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here