Premium
Variable Selection for Semiparametric Mixed Models in Longitudinal Studies
Author(s) -
Ni Xiao,
Zhang Daowen,
Zhang Hao Helen
Publication year - 2010
Publication title -
biometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.298
H-Index - 130
eISSN - 1541-0420
pISSN - 0006-341X
DOI - 10.1111/j.1541-0420.2009.01240.x
Subject(s) - semiparametric regression , econometrics , semiparametric model , selection (genetic algorithm) , mixed model , statistics , longitudinal data , computer science , feature selection , variable (mathematics) , mathematics , nonparametric statistics , artificial intelligence , data mining , mathematical analysis
Summary We propose a double‐penalized likelihood approach for simultaneous model selection and estimation in semiparametric mixed models for longitudinal data. Two types of penalties are jointly imposed on the ordinary log‐likelihood: the roughness penalty on the nonparametric baseline function and a nonconcave shrinkage penalty on linear coefficients to achieve model sparsity. Compared to existing estimation equation based approaches, our procedure provides valid inference for data with missing at random, and will be more efficient if the specified model is correct. Another advantage of the new procedure is its easy computation for both regression components and variance parameters. We show that the double‐penalized problem can be conveniently reformulated into a linear mixed model framework, so that existing software can be directly used to implement our method. For the purpose of model inference, we derive both frequentist and Bayesian variance estimation for estimated parametric and nonparametric components. Simulation is used to evaluate and compare the performance of our method to the existing ones. We then apply the new method to a real data set from a lactation study.