Nonstandard conditionally specified models for nonignorable missing data

Proc Natl Acad Sci U S A. 2020 Aug 11;117(32):19045-19053. doi: 10.1073/pnas.1815563117. Epub 2020 Jul 28.

Abstract

Data analyses typically rely upon assumptions about the missingness mechanisms that lead to observed versus missing data, assumptions that are typically unassessable. We explore an approach where the joint distribution of observed data and missing data are specified in a nonstandard way. In this formulation, which traces back to a representation of the joint distribution of the data and missingness mechanism, apparently first proposed by J. W. Tukey, the modeling assumptions about the distributions are either assessable or are designed to allow relatively easy incorporation of substantive knowledge about the problem at hand, thereby offering a possibly realistic portrayal of the data, both observed and missing. We develop Tukey's representation for exponential-family models, propose a computationally tractable approach to inference in this class of models, and offer some general theoretical comments. We then illustrate the utility of this approach with an example in systems biology.

Keywords: Bayesian analysis; Tukey’s representation; exponential tilting; missing not at random; nonignorable missingness mechanism.

Associated data

  • Dryad/10.5061/dryad.d644f