Introduction to Double Robust Methods for Incomplete Data

Shaun R Seaman; Stijn Vansteelandt

doi:10.1214/18-STS647

Introduction to Double Robust Methods for Incomplete Data

Stat Sci. 2018;33(2):184-197. doi: 10.1214/18-STS647.

Authors

Shaun R Seaman¹, Stijn Vansteelandt^{2

3}

Affiliations

¹ Medical Research Council Biostatistics Unit, University of Cambridge, Cambridge, UK.
² Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Gent, Belgium.
³ Department of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK.

Abstract

Most methods for handling incomplete data can be broadly classified as inverse probability weighting (IPW) strategies or imputation strategies. The former model the occurrence of incomplete data; the latter, the distribution of the missing variables given observed variables in each missingness pattern. Imputation strategies are typically more efficient, but they can involve extrapolation, which is difficult to diagnose and can lead to large bias. Double robust (DR) methods combine the two approaches. They are typically more efficient than IPW and more robust to model misspecification than imputation. We give a formal introduction to DR estimation of the mean of a partially observed variable, before moving to more general incomplete-data scenarios. We review strategies to improve the performance of DR estimators under model misspecification, reveal connections between DR estimators for incomplete data and 'design-consistent' estimators used in sample surveys, and explain the value of double robustness when using flexible data-adaptive methods for IPW or imputation.

Keywords: augmented inverse probability weighting; calibration estimators; data-adaptive methods; doubly robust; empirical likelihood; imputation; inverse probability weighting; missing data; primary – 62 Statistics; secondary – 62A01 Foundations and philosophical topics; semiparametric methods.

Abstract

Grants and funding