Modeling continuous covariates with a “spike” at zero: Bivariate approaches
Author(s) -
Jenkner Carolin,
Lorenz Eva,
Becher Heiko,
Sauerbrei Willi
Publication year - 2016
Publication title -
Biometrical Journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.108
H-Index - 63
eISSN - 1521-4036
pISSN - 0323-3847
DOI - 10.1002/bimj.201400112
Subject(s) - bivariate analysis , covariate , univariate , mathematics , statistics , bivariate data , variable (mathematics) , binary number , zero (linguistics) , spike (software development) , econometrics , multivariate statistics , computer science , mathematical analysis , arithmetic , linguistics , philosophy , software engineering
In epidemiology and clinical research, predictors often take the value zero for a large proportion of observations, while the distribution of the remaining observations is continuous. Such predictors are called variables with a spike at zero (SAZ). Examples include smoking and alcohol consumption. Recently, an extension of the fractional polynomial (FP) procedure, a technique for modeling nonlinear relationships, was proposed to deal with such situations. To indicate whether or not a value is zero, a binary variable is added to the model. In a two‐stage procedure, called FP‐spike, the necessity of the binary variable and/or the continuous FP function for the positive part is assessed to obtain a suitable fit. In univariate analyses, the FP‐spike procedure usually leads to functional relationships that are easy to interpret. This paper introduces four approaches for dealing with two SAZ variables. The methods depend on the bivariate distribution of zero and nonzero values. Bi‐Sep is the simplest of the four bivariate approaches. It applies the univariate FP‐spike procedure separately to each of the two SAZ variables. In Bi‐D3, Bi‐D1, and Bi‐Sub, the proportions of zeros in both variables are considered simultaneously in the binary indicators. Therefore, these strategies can account for correlated variables. The methods can be used for arbitrary distributions of the covariates. For illustration and comparison of results, data from a case‐control study on laryngeal cancer, with smoking and alcohol intake as the two SAZ variables, are considered. In addition, a possible extension to three or more SAZ variables is outlined. Log‐linear models for analyzing the correlation are proposed for use in combination with the bivariate approaches.
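The coding idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the choice to code the indicator as 1 for a zero observation, the FP power, and the data values are all assumptions made for illustration. The sketch builds the two model columns for one SAZ variable (a binary indicator and a first‐degree FP term for the positive part) and tabulates the joint zero/nonzero pattern of two SAZ variables, which is the bivariate distribution on which the four approaches depend.

```python
import math
from collections import Counter


def fp1_transform(x, power):
    """First-degree FP term for a positive value (log when power is 0)."""
    if x <= 0:
        raise ValueError("FP terms are defined for positive values only")
    return math.log(x) if power == 0 else x ** power


def spike_columns(values, power):
    """Return (indicator, fp_term) pairs for one spike-at-zero variable.

    Assumption: the indicator is 1 for a zero observation, and the FP term
    is set to 0.0 there, so only the indicator carries the effect of zeros.
    """
    rows = []
    for x in values:
        if x == 0:
            rows.append((1, 0.0))
        else:
            rows.append((0, fp1_transform(x, power)))
    return rows


def zero_pattern_table(x1, x2):
    """Count the four joint (is_zero_1, is_zero_2) patterns of two variables."""
    return Counter((a == 0, b == 0) for a, b in zip(x1, x2))


# Made-up values, not data from the laryngeal cancer study.
cigarettes = [0, 0, 10, 20, 0, 40]
alcohol = [0, 15, 0, 30, 0, 60]

print(spike_columns(cigarettes, power=0.5))
print(zero_pattern_table(cigarettes, alcohol))
```

In a regression model, both columns for each variable would enter as covariates. Selecting the FP power and testing whether the indicator is needed are the two stages of FP‐spike and are not shown here.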
