Prequential analysis of complex data with adaptive model reselection

Author(s) - Clarke Jennifer, Clarke Bertrand

Publication year - 2009

Publication title - Statistical Analysis and Data Mining: The ASA Data Science Journal

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.381

H-Index - 33

eISSN - 1932-1872

pISSN - 1932-1864

DOI - 10.1002/sam.10052

Subject(s) - computer science, inference, data mining, matching (statistics), data set, set (abstract data type), feature (linguistics), algorithm, artificial intelligence, mathematics, statistics, linguistics, philosophy, programming language

In Prequential analysis, an inference method is viewed as a forecasting system, and the quality of the inference method is judged by the quality of its predictions. This approach is an alternative to more traditional statistical methods, which focus on inferring the parameters of the data generating distribution. In this paper, we introduce adaptive combined average predictors (ACAPs) for the Prequential analysis of complex data. That is, we use convex combinations of two different model averages to form a predictor at each time step in a sequence. A novel feature of our strategy is that the models in each average are re-chosen adaptively at each time step. To assess the complexity of a given data set, we introduce measures of data complexity for continuous response data. We validate our measures in several simulated contexts prior to using them in real data examples. The performance of ACAPs is compared with the performance of predictors based on stacking or likelihood weighted averaging in several model classes and in both simulated and real data sets. Our results suggest that ACAPs achieve a better trade-off between model list bias and model list variability in cases where the data are very complex. This implies that the choices of model class and averaging method should be guided by a concept of complexity matching, i.e., the analysis of a complex data set may require a more complex model class and averaging strategy than the analysis of a simpler data set. We propose that complexity matching is akin to a bias-variance trade-off in statistical modeling. Copyright © 2009 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 2: 000-000, 2009
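The prequential idea in the abstract (forecast first, then score the forecast against the observed value) and the use of a convex combination of two predictors can be sketched in a few lines. This is not the authors' ACAP procedure. The two component predictors (`running_mean`, `last_value`), the weight grid, the squared-error loss, and the rule for choosing the mixing weight are all illustrative assumptions; the components merely stand in for the two model averages, and no adaptive reselection of models inside each average is attempted.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical type alias: a predictor maps the observed history to a forecast.
Predictor = Callable[[Sequence[float]], float]


def running_mean(history: Sequence[float]) -> float:
    """Stand-in for the first model average: mean of all past observations."""
    return sum(history) / len(history) if history else 0.0


def last_value(history: Sequence[float]) -> float:
    """Stand-in for the second model average: repeat the latest observation."""
    return history[-1] if history else 0.0


def convex_combination(p1: float, p2: float, lam: float) -> float:
    """Return lam * p1 + (1 - lam) * p2, with lam expected in [0, 1]."""
    return lam * p1 + (1.0 - lam) * p2


def prequential_run(
    ys: Sequence[float],
    pred_a: Predictor,
    pred_b: Predictor,
    grid: Sequence[float],
) -> Tuple[List[float], float]:
    """Predict each y from the past only, score it, then reveal y.

    The mixing weight is re-chosen at every step (an assumed rule, not the
    paper's): pick the grid value with the smallest squared error on the
    forecasts made so far.
    """
    history: List[float] = []
    past_a: List[float] = []  # forecasts pred_a made at earlier steps
    past_b: List[float] = []  # forecasts pred_b made at earlier steps
    forecasts: List[float] = []
    cumulative_loss = 0.0

    for y in ys:
        fa, fb = pred_a(history), pred_b(history)
        if history:
            lam = min(
                grid,
                key=lambda l: sum(
                    (convex_combination(a, b, l) - t) ** 2
                    for a, b, t in zip(past_a, past_b, history)
                ),
            )
        else:
            lam = 0.5  # no past errors yet, so weight both equally
        forecast = convex_combination(fa, fb, lam)
        forecasts.append(forecast)
        cumulative_loss += (forecast - y) ** 2  # score before updating
        past_a.append(fa)
        past_b.append(fb)
        history.append(y)

    return forecasts, cumulative_loss


if __name__ == "__main__":
    series = [1.0, 2.0, 3.0, 4.0, 5.0]
    weights = [0.0, 0.25, 0.5, 0.75, 1.0]
    preds, loss = prequential_run(series, running_mean, last_value, weights)
    print(preds, loss)
```

The cumulative loss returned by `prequential_run` is the quantity on which competing predictors would be compared in this sketch; a different loss or weighting rule can be swapped in without changing the predict-then-update loop.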
