Quantitative genetic modeling and inference in the presence of nonignorable missing data

Evolution. 2014 Jun;68(6):1735-47. doi: 10.1111/evo.12380. Epub 2014 Mar 20.

Abstract

Natural selection is typically exerted at some specific life stages. If natural selection takes place before a trait can be measured, using conventional models can cause wrong inference about population parameters. When the missing data process relates to the trait of interest, a valid inference requires explicit modeling of the missing process. We propose a joint modeling approach, a shared parameter model, to account for nonrandom missing data. It consists of an animal model for the phenotypic data and a logistic model for the missing process, linked by the additive genetic effects. A Bayesian approach is taken and inference is made using integrated nested Laplace approximations. From a simulation study we find that wrongly assuming that missing data are missing at random can result in severely biased estimates of additive genetic variance. Using real data from a wild population of Swiss barn owls Tyto alba, our model indicates that the missing individuals would display large black spots; and we conclude that genes affecting this trait are already under selection before it is expressed. Our model is a tool to correctly estimate the magnitude of both natural selection and additive genetic variance.

Keywords: Animal model; Tyto alba; missing not at random; sex-linked inheritance; shared parameter model.

MeSH terms

  • Animals
  • Genetic Variation
  • Genetics, Population / methods*
  • Models, Genetic*
  • Models, Statistical*
  • Phenotype
  • Population / genetics
  • Quantitative Trait, Heritable*
  • Sample Size
  • Selection, Genetic
  • Strigiformes* / genetics