Premium
Identifying and interpreting subgroups in health care utilization data with count mixture regression models
Author(s) -
Kurz Christoph F.,
Hatfield Laura A.
Publication year - 2019
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/sim.8307
Subject(s) - overdispersion , covariate , negative binomial distribution , statistics , count data , econometrics , poisson regression , bayesian probability , skewness , mixture model , regression analysis , medicine , poisson distribution , mathematics , environmental health , population
Inpatient care is a large share of total health care spending, making analysis of inpatient utilization patterns an important part of understanding what drives health care spending growth. Common features of inpatient utilization measures such as length of stay and spending include zero inflation, overdispersion, and skewness, all of which complicate statistical modeling. Moreover, latent subgroups of patients may have distinct patterns of utilization and relationships between that utilization and observed covariates. In this work, we apply and compare likelihood‐based and parametric Bayesian mixtures of negative binomial and zero‐inflated negative binomial regression models. In a simulation, we find that the Bayesian approach finds the true number of mixture components more accurately than using information criteria to select among likelihood‐based finite mixture models. When we apply the models to data on hospital lengths of stay for patients with lung cancer, we find distinct subgroups of patients with different means and variances of hospital days, health and treatment covariates, and relationships between covariates and length of stay.