Prediction of general medical admission length of stay with natural language processing and deep learning: a pilot study

Intern Emerg Med. 2020 Sep;15(6):989-995. doi: 10.1007/s11739-019-02265-3. Epub 2020 Jan 2.

Abstract

Length of stay (LOS) and discharge destination predictions are key parts of the discharge planning process for general medical hospital inpatients. It is possible that machine learning, using natural language processing, may be able to assist with accurate LOS and discharge destination prediction for this patient group. Emergency department triage and doctor notes were retrospectively collected on consecutive general medical and acute medical unit admissions to a single tertiary hospital from a 2-month period in 2019. These data were used to assess the feasibility of predicting LOS and discharge destination using natural language processing and a variety of machine learning models. 313 patients were included in the study. The artificial neural network achieved the highest accuracy on the primary outcome of predicting whether a patient would remain in hospital for > 2 days (accuracy 0.82, area under the received operator curve 0.75, sensitivity 0.47 and specificity 0.97). When predicting LOS as an exact number of days, the artificial neural network achieved a mean absolute error of 2.9 and a mean squared error of 16.8 on the test set. For the prediction of home as a discharge destination (vs any non-home alternative), all models performed similarly with an accuracy of approximately 0.74. This study supports the feasibility of using natural language processing to predict general medical inpatient LOS and discharge destination. Further research is indicated with larger, more detailed, datasets from multiple centres to optimise and examine the accuracy that may be achieved with such predictions.

Keywords: Artificial intelligence; Deep learning; Machine learning; Natural language processing; Neural network; Prognostication.

MeSH terms

  • Aged
  • Aged, 80 and over
  • Deep Learning
  • Female
  • Forecasting / methods*
  • Hospitalization / statistics & numerical data*
  • Humans
  • Length of Stay / statistics & numerical data*
  • Length of Stay / trends
  • Male
  • Middle Aged
  • Natural Language Processing*
  • Patients' Rooms / organization & administration
  • Patients' Rooms / statistics & numerical data
  • Pilot Projects
  • Retrospective Studies