Supervised Machine Learning for Predicting Length of Stay After Lumbar Arthrodesis: A Comprehensive Artificial Intelligence Approach

Christine M Etzel; Ashwin Veeramani; Andrew S Zhang; Christopher L McDonald; Kevin J DiSilvestro; Eric M Cohen; Alan H Daniels

doi:10.5435/JAAOS-D-21-00241

Supervised Machine Learning for Predicting Length of Stay After Lumbar Arthrodesis: A Comprehensive Artificial Intelligence Approach

J Am Acad Orthop Surg. 2022 Feb 1;30(3):125-132. doi: 10.5435/JAAOS-D-21-00241.

Authors

Christine M Etzel¹, Ashwin Veeramani, Andrew S Zhang, Christopher L McDonald, Kevin J DiSilvestro, Eric M Cohen, Alan H Daniels

Affiliation

¹ From the Department of Orthopedic Surgery (Etzel, Zhang, McDonald, DiSilvestro, Cohen, and Daniels), Warren Alpert Medical School of Brown University, Rhode Island Hospital, and the Division of Applied Mathematics (Veeramani), Brown University, Providence, RI.

PMID: 34928886
DOI: 10.5435/JAAOS-D-21-00241

Abstract

Introduction: Few studies have evaluated the utility of machine learning techniques to predict and classify outcomes, such as length of stay (LOS), for lumbar fusion patients. Six supervised machine learning algorithms may be able to predict and classify whether a patient will experience a short or long hospital LOS after lumbar fusion surgery with a high degree of accuracy.

Methods: Data were obtained from the National Surgical Quality Improvement Program between 2009 and 2018. Demographic and comorbidity information was collected for patients who underwent anterior, anterolateral, or lateral transverse process technique arthrodesis procedure; anterior lumbar interbody fusion (ALIF); posterior, posterolateral, or lateral transverse process technique arthrodesis procedure; posterior lumbar interbody fusion/transforaminal lumbar interbody fusion (PLIF/TLIF); and posterior fusion procedure posterior spine fusion (PSF). Machine learning algorithmic analyses were done with the scikit-learn package in Python on a high-performance computing cluster. In the total sample, 85% of patients were used for training the models, whereas the remaining patients were used for testing the models. C-statistic area under the curve and prediction accuracy (PA) were calculated for each of the models to determine their accuracy in correctly classifying the test cases.

Results: In total, 12,915 ALIF patients, 27,212 PLIF/TLIF patients, and 23,406 PSF patients were included in the algorithmic analyses. The patient factors most strongly associated with LOS were sex, ethnicity, dialysis, and disseminated cancer. The machine learning algorithms yielded area under the curve values of between 0.673 and 0.752 (PA: 69.6% to 80.1%) for ALIF, 0.673 and 0.729 (PA: 66.0% to 81.3%) for PLIF/TLIF, and 0.698 and 0.749 (PA: 69.9% to 80.4%) for PSF.

Conclusion: Machine learning classification algorithms were able to accurately predict long LOS for ALIF, PLIF/TLIF, and PSF patients. Supervised machine learning algorithms may be useful in clinical and administrative settings. These data may additionally help inform predictive analytic models and assist in setting patient expectations.

Level iii: Diagnostic study, retrospective cohort study.

MeSH terms

Artificial Intelligence
Humans
Length of Stay
Lumbar Vertebrae / surgery
Retrospective Studies
Spinal Fusion* / methods
Supervised Machine Learning