z-logo
open-access-imgOpen Access
Addressing the Identification Problem in Age-period-cohort Analysis
Author(s) -
YuKang Tu,
Nicole Krämer,
Wen–Chung Lee
Publication year - 2012
Publication title -
epidemiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.901
H-Index - 173
eISSN - 1531-5487
pISSN - 1044-3983
DOI - 10.1097/ede.0b013e31824d57a9
Subject(s) - partial least squares regression , regression analysis , statistics , collinearity , regression , total least squares , regression diagnostic , mathematics , segmented regression , estimator , linear regression , cohort , principal component regression , generalized least squares , econometrics , polynomial regression
In the analysis of trends in health outcomes, an ongoing issue is how to separate and estimate the effects of age, period, and cohort. As these 3 variables are perfectly collinear by definition, regression coefficients in a general linear model are not unique. In this tutorial, we review why identification is a problem, and how this problem may be tackled using partial least squares and principal components regression analyses. Both methods produce regression coefficients that fulfill the same collinearity constraint as the variables age, period, and cohort. We show that, because the constraint imposed by partial least squares and principal components regression is inherent in the mathematical relation among the 3 variables, this leads to more interpretable results. We use one dataset from a Taiwanese health-screening program to illustrate how to use partial least squares regression to analyze the trends in body heights with 3 continuous variables for age, period, and cohort. We then use another dataset of hepatocellular carcinoma mortality rates for Taiwanese men to illustrate how to use partial least squares regression to analyze tables with aggregated data. We use the second dataset to show the relation between the intrinsic estimator, a recently proposed method for the age-period-cohort analysis, and partial least squares regression. We also show that the inclusion of all indicator variables provides a more consistent approach. R code for our analyses is provided in the eAppendix.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here