Learning from inconsistent and noisy data: The AQ18 approach
Author(s) -
Kenneth A. Kaufman,
Ryszard S. Michalski
Publication year - 1999
Publication title -
lecture notes in computer science
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.249
H-Index - 400
eISSN - 1611-3349
pISSN - 0302-9743
ISBN - 3-540-65965-X
DOI - 10.1007/bfb0095128
Subject(s) - completeness (order theory) , computer science , consistency (knowledge bases) , flexibility (engineering) , data quality , data consistency , data mining , consistency model , weak consistency , artificial intelligence , machine learning , measure (data warehouse) , noisy data , quality (philosophy) , strong consistency , mathematics , statistics , database , philosophy , epistemology , estimator , mathematical analysis , metric (unit) , operations management , economics
In concept learning or data mining tasks, the learner is typically faced with a choice of many possible hypotheses characterizing the data. If one can assume that the training data are nois e-free, then the generated hypothesis should be complete and consistent with regard to the data. In real -world problems, however, data are often noisy, and an insistence on full completeness and consistency is no longer valid. The proble m then is to determine a hypothesis that represents the "best" trade -off between completeness and consistency. This paper presents an approach to this problem in which a learner seeks rules optimizing a description quality criterion that combines completeness and consistency gain , a measure based on consistency that reflects the rule 's benefit . The method has been implemented in the AQ18 learning and data mining system and compared to several other methods. Experiments have indicated the flexibility and power of the proposed method.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom