Semantic Integrity Constraint Rule Discovery and Outlier Detection in Relational Data as a Data Quality Mining Technique | Zendy

R. VasanthKumarMehta | Zendy; S Rajalakshmi | Zendy

Open Access

Semantic Integrity Constraint Rule Discovery and Outlier Detection in Relational Data as a Data Quality Mining Technique

Author(s) -

R. VasanthKumarMehta,

S Rajalakshmi

Publication year - 2014

Publication title -

international journal of computer applications

Language(s) - English

Resource type - Journals

ISSN - 0975-8887

DOI - 10.5120/15357-3819

Subject(s) - computer science , anomaly detection , outlier , data mining , constraint (computer aided design) , data quality , quality (philosophy) , data integrity , information retrieval , data science , artificial intelligence , database , mechanical engineering , metric (unit) , philosophy , operations management , epistemology , engineering , economics

Data Quality is critical to the quality of patterns and analysis obtained from data. One of the important factors plaguing data is violation of Semantic Integrity, leading to inconsistency, in turn resulting in generation of bad patterns or reports when data mining or warehousing techniques are applied on such data. In this paper, a data quality mining technique is proposed to automatically generate Semantic Integrity Constraint Rules from the data. Further, this process leads to identification of Outliers, which are then to be classified as either violations or genuine cases of exception. The results of applying the proposed technique on a real-life data set are discussed. Some other data quality-related observations made in the process are listed. General Terms Data Quality Mining, Semantic Integrity Constraints, Outlier Detection

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research