z-logo
open-access-imgOpen Access
Making Use of Functional Dependencies Based on Data to Find Better Classification Trees
Author(s) -
Hyontai Sug
Publication year - 2021
Publication title -
international journal of circuits, systems and signal processing
Language(s) - English
Resource type - Journals
ISSN - 1998-4464
DOI - 10.46300/9106.2021.15.160
Subject(s) - categorical variable , dependency (uml) , conditional independence , machine learning , computer science , decision tree , precondition , artificial intelligence , data mining , programming language
For the classification task of machine learning algorithms independency between conditional attributes is a precondition for success of data mining. On the other hand, decision trees are one of the mostly used machine learning algorithms because of their good understandability. So, because dependency between conditional attributes can cause more complex trees, supplying conditional attributes independent each other is very important, the requirement of conditional attributes for decision trees as well as other machine learning algorithms is that they are independent each other and dependent on decisional attributes only. Statistical method to check independence between attributes is Chi-square test, but the test can be effective for categorical attributes only. So, the applicability of Chi-square test is limited, because most datasets for data mining have mixed attributes of categorical and numerical. In order to overcome the problem, and as a way to test dependency between conditional attributes, a novel method based on functional dependency based on data that can be applied to any datasets irrespective of data type of attributes is suggested. After removing highly dependent attributes between conditional attributes, we can generate better decision trees. Experiments were performed to show that the method is effective, and the experiments showed very good results.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here