Development and quality appraisal of a new English breast screening linked data set as part of the age, test threshold, and frequency of mammography screening (ATHENA-M) study

Br J Radiol. 2024 Jan 23;97(1153):98-112. doi: 10.1093/bjr/tqad023.

Abstract

Objectives: To build a data set capturing the whole breast cancer screening journey from individual breast cancer screening records to outcomes and assess data quality.

Methods: Routine screening records (invitation, attendance, test results) from all 79 English NHS breast screening centres between January 1, 1988 and March 31, 2018 were linked to cancer registry (cancer characteristics and treatment) and national mortality data. Data quality was assessed using comparability, validity, timeliness, and completeness.

Results: Screening records were extracted from 76/79 English breast screening centres, 3/79 were not possible due to software issues. Data linkage was successful from 1997 after introduction of a universal identifier for women (NHS number). Prior to 1997 outcome data are incomplete due to linkage issues, reducing validity. Between January 1, 1997 and March 31, 2018, a total of 11 262 730 women were offered screening of whom 9 371 973 attended at least one appointment, with 139 million person-years of follow-up (a median of 12.4 person years for each woman included) with 73 810 breast cancer deaths and 1 111 139 any-cause deaths. Comparability to reference data sets and internal validity were demonstrated. Data completeness was high for core screening variables (>99%) and main cancer outcomes (>95%).

Conclusions: The ATHENA-M project has created a large high-quality and representative data set of individual women's screening trajectories and outcomes in England from 1997 to 2018, data before 1997 are lower quality.

Advances in knowledge: This is the most complete data set of English breast screening records and outcomes constructed to date, which can be used to evaluate and optimize screening.

Keywords: breast neoplasm; data accuracy; mammography; routinely collected health data.

MeSH terms

  • Breast
  • Breast Neoplasms*
  • Female
  • Humans
  • Mammography
  • Semantic Web*
  • State Medicine