Methods for Stratified Cluster Sampling with Informative Stratification
Author(s) -
Alastair Scott,
C. Wild
Publication year - 2007
Publication title -
journal of applied mathematics and decision sciences
Language(s) - English
Resource type - Journals
eISSN - 1532-7612
pISSN - 1173-9126
DOI - 10.1155/2007/56372
Subject(s) - covariate , statistics , mathematics , econometrics , cluster sampling , cluster (spacecraft) , stratified sampling , stratification (seeds) , population , sampling (signal processing) , computer science , demography , seed dormancy , botany , germination , filter (signal processing) , dormancy , sociology , computer vision , biology , programming language
We look at fitting regression models using data from stratified cluster samples when the strata may depend in some way on the observed responses within clusters. One important subclass of examples is that of family studies in genetic epidemiology, where the probability of selecting a family into the study depends on the incidence of disease within the family. We develop the survey-weighted estimating equation approach for this problem, with particular emphasis on the estimation of superpopulation parameters. Full maximum likelihood for this class of problems involves modelling the population distribution of the covariates which is simply not feasible when there are a large number of potential covariates. We discuss efficient semiparametric maximum likelihood methods in which the covariate distribution is left completely unspecified. We further discuss the relative efficiencies of these two approaches.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom