Variance of age-specific log incidence decomposition (VALID): a unifying model of measured and unmeasured genetic and non-genetic risks

Int J Epidemiol. 2023 Oct 5;52(5):1557-1568. doi: 10.1093/ije/dyad086.

Abstract

Background: The extent to which known and unknown factors explain how much people of the same age differ in disease risk is fundamental to epidemiology. Risk factors can be correlated in relatives, so familial aspects of risk (genetic and non-genetic) must be considered.

Development: We present a unifying model (VALID) for variance in risk, with risk defined as log(incidence) or logit(cumulative incidence). Consider a normally distributed risk score with incidence increasing exponentially as the risk increases. VALID's building block is variance in risk, Δ2, where Δ = log(OPERA) is the difference in mean between cases and controls and OPERA is the odds ratio per standard deviation. A risk score correlated r between a pair of relatives generates a familial odds ratio of exp(rΔ2). Familial risk ratios, therefore, can be converted into variance components of risk, extending Fisher's classic decomposition of familial variation to binary traits. Under VALID, there is a natural upper limit to variance in risk caused by genetic factors, determined by the familial odds ratio for genetically identical twin pairs, but not to variation caused by non-genetic factors.

Application: For female breast cancer, VALID quantified how much variance in risk is explained-at different ages-by known and unknown major genes and polygenes, non-genomic risk factors correlated in relatives, and known individual-specific factors.

Conclusion: VALID has shown that, while substantial genetic risk factors have been discovered, much is unknown about genetic and familial aspects of breast cancer risk especially for young women, and little is known about individual-specific variance in risk.

Keywords: Breast cancer; familial cause; familial odds ratio; familial risk ratio; genetic cause; genomic cause; major gene; non-familial cause; polygenic risk score; variance components.

Publication types

  • Research Support, Non-U.S. Gov't
  • Twin Study

MeSH terms

  • Age Factors
  • Breast Neoplasms* / epidemiology
  • Breast Neoplasms* / genetics
  • Female
  • Genetic Predisposition to Disease*
  • Humans
  • Incidence
  • Risk Factors