z-logo
Premium
Logistic regression with a continuous exposure measured in pools and subject to errors
Author(s) -
Van Domelen Dane R.,
Mitchell Emily M.,
Perkins Neil J.,
Schisterman Enrique F.,
Manatunga Amita K.,
Huang Yijian,
Lyles Robert H.
Publication year - 2018
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/sim.7891
Subject(s) - statistics , logistic regression , pooling , covariate , nominal level , regression , inference , mathematics , econometrics , computer science , confidence interval , artificial intelligence
In a multivariable logistic regression setting where measuring a continuous exposure requires an expensive assay, a design in which the biomarker is measured in pooled samples from multiple subjects can be very cost effective. A logistic regression model for poolwise data is available, but validity requires that the assay yields the precise mean exposure for members of each pool. To account for errors, we assume the assay returns the true mean exposure plus a measurement error (ME) and/or a processing error (PE). We pursue likelihood‐based inference for a binary health‐related outcome modeled by logistic regression coupled with a normal linear model relating individual‐level exposure to covariates and assuming that the ME and PE components are independent and normally distributed regardless of pool size. We compare this approach with a discriminant function‐based alternative, and we demonstrate the potential value of incorporating replicates into the study design. Applied to a reproductive health dataset with pools of size 2 along with individual samples and replicates, the model fit with both ME and PE had a lower AIC than a model accounting for ME only. Relative to ignoring errors, this model suggested a somewhat higher (though still nonsignificant) adjusted log‐odds ratio associating the cytokine MCP‐1 with risk of spontaneous abortion. Simulations modeled after these data confirm validity of the methods, demonstrate how ME and particularly PE can reduce the efficiency advantage of a pooling design, and highlight the value of replicates in improving stability when both errors are present.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here