A product-multinomial framework for categorical data analysis with missing responses
Author(s) -
Frederico Z. Poleto,
Júlio M. Singer,
Carlos Daniel Paulino
Publication year - 2014
Publication title -
brazilian journal of probability and statistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.441
H-Index - 18
eISSN - 2317-6199
pISSN - 0103-0752
DOI - 10.1214/12-bjps198
Subject(s) - categorical variable , multinomial distribution , missing data , mathematics , estimator , notation , generalized linear model , linear model , set (abstract data type) , product (mathematics) , design matrix , data set , statistics , econometrics , computer science , arithmetic , programming language , geometry
We extend the multinomial modeling scenario for the analysis of categorical data with missing responses described by Paulino (1991, Brazilian Journal of Probability and Statistics, 5, 1-42) to the product-multinomial setup so that the inclusion of explanatory variables is allowed. Assuming an ignorable missing data mechanism, linear and log-linear models may be fitted via maximum likelihood. Weighted least squares methodology may as well be used to fit more general functional linear models, if a missing completely at random mechanism is assumed. We also consider a hybrid approach, where any missingness process is fitted by maximum likelihood in a first step, and the estimated marginal probabilities of categorization and their covariance matrix are used in a second stage to fit the model via weighted least squares, in the spirit of functional asymptotic regression methodology. Goodness-of-fit tests are present, and the methodology is illustrated via two data sets. All the methods were computationally implemented via subroutines written in R.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom