An ASER AI/ML expert panel formative user research study for an interpretable interactive splenic AAST grading graphical user interface prototype

Emerg Radiol. 2024 Apr;31(2):167-178. doi: 10.1007/s10140-024-02202-8. Epub 2024 Feb 2.

Abstract

Purpose: The AAST Organ Injury Scale is widely adopted for splenic injury severity but suffers from only moderate inter-rater agreement. This work assesses SpleenPro, a prototype interactive explainable artificial intelligence/machine learning (AI/ML) diagnostic aid to support AAST grading, for effects on radiologist dwell time, agreement, clinical utility, and user acceptance.

Methods: Two trauma radiology ad hoc expert panelists independently performed timed AAST grading on 76 admission CT studies with blunt splenic injury, first without AI/ML assistance, and after a 2-month washout period and randomization, with AI/ML assistance. To evaluate user acceptance, three versions of the SpleenPro user interface with increasing explainability were presented to four independent expert panelists with four example cases each. A structured interview consisting of Likert scales and free responses was conducted, with specific questions regarding dimensions of diagnostic utility (DU); mental support (MS); effort, workload, and frustration (EWF); trust and reliability (TR); and likelihood of future use (LFU).

Results: SpleenPro significantly decreased interpretation times for both raters. Weighted Cohen's kappa increased from 0.53 to 0.70 with AI/ML assistance. During user acceptance interviews, increasing explainability was associated with improvement in Likert scores for MS, EWF, TR, and LFU. Expert panelists indicated the need for a combined early notification and grading functionality, PACS integration, and report autopopulation to improve DU.

Conclusions: SpleenPro was useful for improving objectivity of AAST grading and increasing mental support. Formative user research identified generalizable concepts including the need for a combined detection and grading pipeline and integration with the clinical workflow.

Keywords: AAST grade; Artificial intelligence; Hemorrhage; Spleen; Trauma; User acceptance.

MeSH terms

  • Artificial Intelligence
  • Humans
  • Machine Learning
  • Reproducibility of Results
  • Tomography, X-Ray Computed* / methods
  • Wounds, Nonpenetrating*