On performance of parametric and distribution-free models for zero-inflated and over-dispersed count responses

Stat Med. 2015 Oct 30;34(24):3235-45. doi: 10.1002/sim.6560. Epub 2015 Jun 15.

Abstract

Zero-inflated Poisson (ZIP) and negative binomial (ZINB) models are widely used to model zero-inflated count responses. These models extend the Poisson and negative binomial (NB) to address excessive zeros in the count response. By adding a degenerate distribution centered at 0 and interpreting it as describing a non-risk group in the population, the ZIP (ZINB) models a two-component population mixture. As in applications of Poisson and NB, the key difference between ZIP and ZINB is the allowance for overdispersion by the ZINB in its NB component in modeling the count response for the at-risk group. Overdispersion arising in practice too often does not follow the NB, and applications of ZINB to such data yield invalid inference. If sources of overdispersion are known, other parametric models may be used to directly model the overdispersion. Such models too are subject to assumed distributions. Further, this approach may not be applicable if information about the sources of overdispersion is unavailable. In this paper, we propose a distribution-free alternative and compare its performance with these popular parametric models as well as a moment-based approach proposed by Yu et al. [Statistics in Medicine 2013; 32: 2390-2405]. Like the generalized estimating equations, the proposed approach requires no elaborate distribution assumptions. Compared with the approach of Yu et al., it is more robust to overdispersed zero-inflated responses. We illustrate our approach with both simulated and real study data.

Keywords: functional response models; generalized estimating equations; population mixtures; zero-inflated Poisson; zero-inflated Poisson with random effects; zero-inflated negative binomial.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Binomial Distribution*
  • Biometry / methods
  • Computer Simulation
  • HIV Infections / psychology
  • Humans
  • Likelihood Functions
  • Male
  • Models, Statistical*
  • Poisson Distribution*
  • Randomized Controlled Trials as Topic