Pancreatic cancer symptom trajectories from Danish registry data and free text in electronic health records

Elife. 2023 Nov 21:12:e84919. doi: 10.7554/eLife.84919.

Abstract

Pancreatic cancer is one of the deadliest cancer types with poor treatment options. Better detection of early symptoms and relevant disease correlations could improve pancreatic cancer prognosis. In this retrospective study, we used symptom and disease codes (ICD-10) from the Danish National Patient Registry (NPR) encompassing 6.9 million patients from 1994 to 2018, of whom 23,592 were diagnosed with pancreatic cancer. The Danish Cancer Registry included 18,523 of these patients. To complement and compare the registry diagnosis codes with deeper clinical data, we used a text mining approach to extract symptoms from free text clinical notes in electronic health records (3078 pancreatic cancer patients and 30,780 controls). We used both data sources to generate and compare symptom disease trajectories to uncover temporal patterns of symptoms prior to pancreatic cancer diagnosis for the same patients. We show that the text mining of the clinical notes was able to complement the registry-based symptoms by capturing more symptoms prior to pancreatic cancer diagnosis. For example, 'Blood pressure reading without diagnosis', 'Abnormalities of heartbeat', and 'Intestinal obstruction' were not found in the registry-based analysis. Chaining symptoms together in trajectories identified two groups of patients with lower median survival (<90 days) following the trajectories 'Cough→Jaundice→Intestinal obstruction' and 'Pain→Jaundice→Abnormal results of function studies'. These results provide a comprehensive comparison of the two types of pancreatic cancer symptom trajectories, which in combination can leverage the full potential of the health data and ultimately provide a fuller picture for detection of early risk factors for pancreatic cancer.

Keywords: cancer biology; computational biology; disease progression; human; longitudinal analysis; pancreas cancer; patient stratification; symptomology; systems biology.

Plain language summary

Pancreatic cancer is one of the deadliest cancer types. Scientists predict it will become the second largest cause of cancer-related deaths by 2030. It has few or no symptoms at early stages and often goes undetected for an extended period. As a result, patients are often diagnosed at an advanced stage when they have few treatment options and lower survival rates. Only 11% of patients with pancreatic cancer survive five years past their diagnosis. Earlier detection and surgery to remove the tumor increase patient survival to 42% at five years. Those who undergo surgery at the earliest stage have an 84% survival rate at five years. Developing ways to screen for and detect pancreatic cancer early could improve patient survival. Identifying early symptoms is critical. So far, studies show links between pancreatic cancer and weight loss, abdominal pain, lower back pain, and new-onset diabetes. But clinicians often overlook these symptoms or do not associate them with cancer. National health registries may be data sources that scientists can use to zoom in on early pancreatic cancer symptoms and create alerts for clinicians. Hjaltelin, Novitski et al. identified potential pancreatic cancer symptoms using patient registry data and electronic health records. The team extracted potential pancreatic cancer-related disease or symptom trajectories from 7 million patients listed in the Danish National Patient Registry. They also scoured clinical notes in 34,000 patients’ electronic health records for symptoms. The electronic health records yielded more promising symptoms than the registry. But both data sources produced complementary information. The analysis showed that some symptoms, like jaundice, were associated with higher survival rates because they may lead to earlier diagnosis. The data so far suggest that symptoms leading up to a pancreatic cancer diagnosis may be nonspecific and not occur in a particular order. As the cancer progresses, symptoms may become more specific and severe. Further assessment of the study’s results is necessary. Tools like artificial intelligence or advanced text mining may allow scientists to identify more definitive early symptom trajectories and help clinicians identify patients earlier.

MeSH terms

  • Denmark / epidemiology
  • Electronic Health Records
  • Humans
  • Jaundice*
  • Pancreatic Neoplasms* / diagnosis
  • Pancreatic Neoplasms* / epidemiology
  • Retrospective Studies
  • Routinely Collected Health Data