Tree aggregation for random forest class probability estimation | Zendy

Sage Andrew J. | Zendy; Genschel Ulrike | Zendy; Nettleton Dan | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Tree aggregation for random forest class probability estimation

Author(s) -

Sage Andrew J.,

Genschel Ulrike,

Nettleton Dan

Publication year - 2020

Publication title -

statistical analysis and data mining: the asa data science journal

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.381

H-Index - 33

eISSN - 1932-1872

pISSN - 1932-1864

DOI - 10.1002/sam.11446

Subject(s) - random forest , computer science , decision tree , tree (set theory) , regression , class (philosophy) , calibration , machine learning , artificial intelligence , data mining , statistics , mathematics , mathematical analysis

In random forest methodology, an overall prediction or estimate is made by aggregating predictions made by individual decision trees. Popular implementations of random forests rely on different methods for aggregating predictions. In this study, we provide an empirical analysis of the performance of aggregation approaches available for classification and regression problems. We show that while the choice of aggregation scheme usually has little impact in regression, it can have a profound effect on probability estimation in classification problems. Our study illustrates the causes of calibration issues that arise from two popular aggregation approaches and highlights the important role that terminal nodesize plays in the aggregation of tree predictions. We show that optimal choices for random forest tuning parameters depend heavily on the manner in which tree predictions are aggregated.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research