z-logo
open-access-imgOpen Access
Data Cleaning Model for XML Datasets using Conditional Dependencies
Author(s) -
Mohammed Ragheb Hakawati,
Yasmin Mohd Yacob,
Rafikha Aliana A. Raof,
Mustafa M.Khalifa Jabiry,
Eiad Syaf Alhudiani
Publication year - 2020
Publication title -
european journal of electrical engineering and computer science
Language(s) - English
Resource type - Journals
ISSN - 2736-5751
DOI - 10.24018/ejece.2020.4.1.163
Subject(s) - computer science , data mining , xml , set (abstract data type) , relational database , xml database , data set , database , information retrieval , artificial intelligence , programming language , operating system
Data Cleaning as an essential phase to enhance the overall quality used for decades with different data models, the majority handled a relational dataset as the most dominant data model. However, the XML data model, besides the relational data model considered the most data model commonly used for storing, retrieving, and querying valuable data. In this paper, we introduce a model for detecting and repairing XML data inconsistencies using a set of conditional dependencies. Detecting inconsistencies will be done by joining the existed data source with a set of patterns tableaus as conditional dependencies and then update these values to match the proper patterns using a set of SQL statements. This research considered the final phase for a cleaning model introduced for XML datasets by firstly mapping the XML document to a set of related tables then discovering a set of conditional dependencies (Functional and Inclusions) and finally then applying the following algorithms as a closing step of quality enhancement.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here