z-logo
Premium
Knowledge discovery in data streams with regression tree methods
Author(s) -
Alberg Dima,
Last Mark,
Kandel Abraham
Publication year - 2011
Publication title -
wiley interdisciplinary reviews: data mining and knowledge discovery
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.506
H-Index - 47
eISSN - 1942-4795
pISSN - 1942-4787
DOI - 10.1002/widm.51
Subject(s) - interpretability , computer science , data mining , data stream mining , tree (set theory) , machine learning , regression analysis , decision tree , regression , artificial intelligence , cluster analysis , mathematics , statistics , mathematical analysis
This paper presents an advanced review of regression tree methods for mining data streams. Batch regression tree methods are known for their simplicity, interpretability, accuracy, and efficiency. They use fast divide‐and‐conquer greedy algorithms that recursively partition the given training data into smaller subsets. The result is a tree‐shaped model with splitting rules in the internal nodes and predictions in the leaves. Most batch regression tree methods take a complete dataset and build a model using that data. Generally, this tree model cannot be modified if new data is acquired later. Their successors, the incremental model and interval trees algorithms, are able to build and retrain a model on a step‐by‐step basis by incorporating new numerical training instances into the model as they become available. Moreover, these algorithms produce even more compact and accurate models than batch regression tree algorithms because they use intervals or functional models with a change detection mechanism, which makes them a more suitable choice for regression analysis of data streams. Finally, this review summarizes the performance results of the reviewed methods and crystallizes 10 requirements for successful implementation of a regression tree algorithm in data stream mining area. © 2011 Wiley Periodicals, Inc. This article is categorized under: Algorithmic Development > Spatial and Temporal Data Mining Technologies > Machine Learning Technologies > Prediction

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here