Empirical evaluation of prediction intervals for cancer incidence

BMC Med Res Methodol. 2005 Jun 10:5:21. doi: 10.1186/1471-2288-5-21.

Abstract

Background: Prediction intervals can be calculated for predicting cancer incidence on the basis of a statistical model. These intervals include the uncertainty of the parameter estimates and variations in future rates but do not include the uncertainty of assumptions, such as continuation of current trends. In this study we evaluated whether prediction intervals are useful in practice.

Methods: Rates for the period 1993-97 were predicted from cancer incidence rates in the five Nordic countries for the period 1958-87. In a Poisson regression model, 95% prediction intervals were constructed for 200 combinations of 20 cancer types for males and females in the five countries. The coverage level was calculated as the proportion of the prediction intervals that covered the observed number of cases in 1993-97.

Results: Overall, 52% (104/200) of the prediction intervals covered the observed numbers. When the prediction intervals were divided into quartiles according to the number of cases in the last observed period, the coverage level was inversely proportional to the frequency (84%, 52%, 46% and 26%). The coverage level varied widely among the five countries, but the difference declined after adjustment for the number of cases in each country.

Conclusion: The coverage level of prediction intervals strongly depended on the number of cases on which the predictions were based. As the sample size increased, uncertainty about the adequacy of the model dominated, and the coverage level fell far below 95%. Prediction intervals for cancer incidence must therefore be interpreted with caution.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Age Distribution
  • Aged
  • Aged, 80 and over
  • Child
  • Child, Preschool
  • Cohort Studies
  • Empirical Research*
  • Female
  • Forecasting / methods*
  • Humans
  • Incidence
  • Infant
  • Infant, Newborn
  • Male
  • Middle Aged
  • Models, Statistical
  • Neoplasms / classification
  • Neoplasms / epidemiology*
  • Poisson Distribution*
  • Registries
  • Scandinavian and Nordic Countries / epidemiology
  • Uncertainty