Logistic analysis of epidemiologic studies with augmentation sampling involving re-stratification and population expansion

Biostatistics. 2015 Jan;16(1):169-78. doi: 10.1093/biostatistics/kxu024. Epub 2014 Jun 6.

Abstract

Epidemiologic cross-sectional, case-cohort, or case-control studies often select augmentation samples to supplement an existing (baseline) sample, primarily for two reasons: (1) to increase the sample sizes from certain subdomains of interest that were not originally considered in the design of the baseline study and (2) to obtain samples from an extension of the target population. To address these two objectives, two-stage stratified sample designs are considered, in which the second-stage stratification, based on the expanded population, is not nested in the first-stage strata. Sample weighting and Taylor linearization variance estimation are provided for these two-stage stratified sample designs, which involve re-stratification and population expansion, for estimating population totals and logistic regression coefficients. Results from limited simulation studies and a logistic regression analysis of a study of human papillomavirus serology are provided.

Keywords: Pseudo-likelihood function; Sample weighting; Taylor linearization variance estimation; Two-stage stratified sampling.
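
As a rough illustration of the general idea behind the keywords above, the following sketch fits a weighted (pseudo-likelihood) logistic regression and computes a Taylor linearization variance estimate. It is not the paper's two-stage estimator: it assumes a simplified single-stage stratified design, treats sampling within strata as with replacement (no finite population correction), and takes the sample weights as given. The function names `weighted_logistic` and `linearization_variance` are hypothetical and chosen only for this example.

```python
import numpy as np


def weighted_logistic(X, y, w, n_iter=50, tol=1e-10):
    """Solve the weighted pseudo-score equations
    sum_i w_i * x_i * (y_i - p_i) = 0 by Newton-Raphson."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ beta)))
        score = X.T @ (w * (y - p))
        # Weighted information matrix: sum_i w_i p_i (1 - p_i) x_i x_i^T
        info = (X * (w * p * (1.0 - p))[:, None]).T @ X
        step = np.linalg.solve(info, score)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


def linearization_variance(X, y, w, strata, beta):
    """Sandwich (Taylor linearization) covariance of beta.

    Assumption: single-stage stratified sample, with-replacement
    approximation within strata. This is a simplification, not the
    two-stage re-stratified design described in the abstract."""
    p = 1.0 / (1.0 + np.exp(-(X @ beta)))
    info = (X * (w * p * (1.0 - p))[:, None]).T @ X
    # Weighted score contribution of each sampled unit
    z = X * (w * (y - p))[:, None]
    k = X.shape[1]
    meat = np.zeros((k, k))
    for h in np.unique(strata):
        zh = z[strata == h]
        nh = zh.shape[0]
        if nh < 2:
            continue  # a single-unit stratum contributes no variance estimate
        d = zh - zh.mean(axis=0)
        meat += nh / (nh - 1.0) * (d.T @ d)
    info_inv = np.linalg.inv(info)
    return info_inv @ meat @ info_inv
```

In use, `X` would be the design matrix (including an intercept column), `y` the binary outcome, `w` the sample weights, and `strata` an array of stratum labels; standard errors are the square roots of the diagonal of the returned matrix. Under the paper's designs, both the weights and the variance terms would need to reflect the second-stage re-stratification, which this sketch does not attempt.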

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Adult
  • Epidemiologic Research Design*
  • Female
  • Humans
  • Logistic Models*
  • Patient Selection*
  • Sample Size
  • Seroepidemiologic Studies