Regression for skewed biomarker outcomes subject to pooling

Author(s) - Mitchell Emily M., Lyles Robert H., Manatunga Amita K., Danaher Michelle, Perkins Neil J., Schisterman Enrique F.

Publication year - 2014

Publication title - Biometrics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 2.298

H-Index - 130

eISSN - 1541-0420

pISSN - 0006-341X

DOI - 10.1111/biom.12134

Subject(s) - pooling, covariate, statistics, logistic regression, computer science, regression, linear regression, econometrics, regression analysis, standard error, outcome (game theory), data mining, mathematics, artificial intelligence, mathematical economics

Summary - Epidemiological studies involving biomarkers are often hindered by prohibitively expensive laboratory tests. Strategically pooling specimens prior to performing these lab assays has been shown to effectively reduce cost with minimal information loss in a logistic regression setting. When the goal is to perform regression with a continuous biomarker as the outcome, regression analysis of pooled specimens may not be straightforward, particularly if the outcome is right‐skewed. In such cases, we demonstrate that a slight modification of a standard multiple linear regression model for poolwise data can provide valid and precise coefficient estimates when pools are formed by combining biospecimens from subjects with identical covariate values. When these x‐homogeneous pools cannot be formed, we propose a Monte Carlo expectation maximization (MCEM) algorithm to compute maximum likelihood estimates (MLEs). Simulation studies demonstrate that these analytical methods provide essentially unbiased estimates of coefficient parameters as well as their standard errors when appropriate assumptions are met. Furthermore, we show how one can utilize the fully observed covariate data to inform the pooling strategy, yielding a high level of statistical efficiency at a fraction of the total lab cost.
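The case of pools formed from subjects with identical covariate values can be illustrated with a small simulation. The sketch below is not the authors' estimator. It assumes a log-normal outcome, log Y = beta0 + beta1 * x + error, equal pool sizes, and an assay that returns the arithmetic mean of the specimens in a pool. Under those assumptions the log of a pooled measurement equals beta0 + beta1 * x plus a term whose distribution depends only on the pool size and the error variance, so ordinary least squares on the log pooled values recovers the slope while the intercept is shifted. The function name and all parameter values are hypothetical.

```python
import numpy as np


def simulate_pooled_fit(n_per_level=2000, k=4, beta0=1.0, beta1=0.5,
                        sigma=0.8, seed=0):
    """Fit log(pooled outcome) on x using pools of subjects sharing the same x.

    Assumptions (illustrative only): log-normal individual outcomes,
    equal pool size k, and an assay that measures the pool's arithmetic mean.
    n_per_level must be divisible by k.
    """
    rng = np.random.default_rng(seed)
    levels = np.array([0.0, 1.0, 2.0, 3.0])  # hypothetical covariate values
    x_pool, y_pool = [], []
    for x in levels:
        # Right-skewed individual outcomes for subjects with covariate value x.
        y = np.exp(beta0 + beta1 * x + sigma * rng.standard_normal(n_per_level))
        # Combine k specimens per pool; only the pool mean is "measured".
        pool_means = y.reshape(-1, k).mean(axis=1)
        x_pool.append(np.full(pool_means.shape, x))
        y_pool.append(pool_means)
    x_pool = np.concatenate(x_pool)
    y_pool = np.concatenate(y_pool)

    # Ordinary least squares of log(pool mean) on an intercept and x.
    design = np.column_stack([np.ones_like(x_pool), x_pool])
    coef, *_ = np.linalg.lstsq(design, np.log(y_pool), rcond=None)
    return coef  # [intercept, slope]


intercept, slope = simulate_pooled_fit()
print(f"slope estimate: {slope:.3f} (true 0.5), intercept estimate: {intercept:.3f}")
```

In this setup the lab runs one assay per pool, so the number of assays is the number of subjects divided by k. The intercept estimate is not an estimate of beta0 itself; recovering it, or handling unequal pool sizes, requires the kind of model adjustment the summary refers to.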
