z-logo
Premium
Efficient interaction selection for clustered data via stagewise generalized estimating equations
Author(s) -
Vaughan Gregory,
Aseltine Robert,
Chen Kun,
Yan Jun
Publication year - 2020
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/sim.8574
Subject(s) - hierarchy , lasso (programming language) , selection (genetic algorithm) , computer science , set (abstract data type) , estimation , regression , variable (mathematics) , feature selection , mathematical optimization , multilevel model , regression analysis , machine learning , mathematics , statistics , mathematical analysis , management , world wide web , economics , market economy , programming language
Model selection in the presence of interaction terms is challenging as the final model must maintain a hierarchy between main effects and interaction terms. This work presents two stagewise estimation approaches to appropriately select models with interaction terms that can utilize generalized estimating equations to model clustered data. The first proposed technique is a hierarchical lasso stagewise estimating equations approach, which is shown to directly correspond to the hierarchical lasso penalized regression. The second is a stagewise active set approach, which enforces the variable hierarchy by conforming the selection to a properly growing active set in each stagewise estimation step. The effectiveness in interaction selection and the superior computational efficiency of the proposed techniques are assessed in simulation studies. The new methods are applied to a study of hospitalization rates attributed to suicide attempts among 15 to 19 year old at the school district level in Connecticut.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here