Machine learning to predict major bleeding during anticoagulation for venous thromboembolism: possibilities and limitations

Br J Haematol. 2023 Jun;201(5):971-981. doi: 10.1111/bjh.18737. Epub 2023 Mar 21.

Abstract

Predictive tools for major bleeding (MB) using machine learning (ML) might be advantageous over traditional methods. We used data from the Registro Informatizado de Enfermedad TromboEmbólica (RIETE) to develop ML algorithms to identify patients with venous thromboembolism (VTE) at increased risk of MB during the first 3 months of anticoagulation. A total of 55 baseline variables were used as predictors. New data prospectively collected from the RIETE were used for further validation. The RIETE and VTE-BLEED scores were used for comparisons. External validation was performed with the COMMAND-VTE database. Learning was carried out with data from 49 587 patients, of whom 873 (1.8%) had MB. The best performing ML method was XGBoost. In the prospective validation cohort the sensitivity, specificity, positive predictive value and F1 score were: 33.2%, 93%, 10%, and 15.4% respectively. F1 value for the RIETE and VTE-BLEED scores were 8.6% and 6.4% respectively. In the external validation cohort the metrics were 10.3%, 87.6%, 3.5% and 5.2% respectively. In that cohort, the F1 value for the RIETE score was 17.3% and for the VTE-BLEED score 9.75%. The performance of the XGBoost algorithm was better than that from the RIETE and VTE-BLEED scores only in the prospective validation cohort, but not in the external validation cohort.

Keywords: haemorrhage; machine learning; outcomes; pulmonary embolism; venous thrombosis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Anticoagulants / adverse effects
  • Hemorrhage / chemically induced
  • Hemorrhage / complications
  • Humans
  • Predictive Value of Tests
  • Pulmonary Embolism* / complications
  • Registries
  • Venous Thromboembolism* / etiology

Substances

  • Anticoagulants