Statistical methods for the time-to-event analysis of individual participant data from multiple epidemiological studies

Int J Epidemiol. 2010 Oct;39(5):1345-59. doi: 10.1093/ije/dyq063. Epub 2010 May 3.

Abstract

Background: Meta-analysis of individual participant time-to-event data from multiple prospective epidemiological studies enables detailed investigation of exposure-risk relationships, but involves a number of analytical challenges.

Methods: This article describes statistical approaches adopted in the Emerging Risk Factors Collaboration, in which primary data from more than 1 million participants in more than 100 prospective studies have been collated to enable detailed analyses of various risk markers in relation to incident cardiovascular disease outcomes.

Results: Analyses have been principally based on Cox proportional hazards regression models stratified by sex, undertaken in each study separately. Estimates of exposure-risk relationships, initially unadjusted and then adjusted for several confounders, have been combined over studies using meta-analysis. Methods for assessing the shape of exposure-risk associations and the proportional hazards assumption have been developed. Estimates of interactions have also been combined using meta-analysis, keeping separate within- and between-study information. Regression dilution bias caused by measurement error and within-person variation in exposures and confounders has been addressed through the analysis of repeat measurements to estimate corrected regression coefficients. These methods are exemplified by analysis of plasma fibrinogen and risk of coronary heart disease, and Stata code is made available.

Conclusion: Increasing numbers of meta-analyses of individual participant data from observational data are being conducted to enhance the statistical power and detail of epidemiological studies. The statistical methods developed here can be used to address the needs of such analyses.

Publication types

  • Meta-Analysis
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Age Factors
  • Coronary Disease / blood*
  • Coronary Disease / epidemiology*
  • Data Interpretation, Statistical*
  • Epidemiologic Methods*
  • Fibrinogen / analysis*
  • Humans
  • Proportional Hazards Models
  • Risk Factors
  • Sex Factors
  • Smoking / epidemiology
  • Time Factors

Substances

  • Fibrinogen