Workplace-Based Entrustment Scales for the Core EPAs: A Multisite Comparison of Validity Evidence for Two Proposed Instruments Using Structured Vignettes and Trained Raters

Michael S Ryan; Asra R Khan; Yoon Soo Park; Cody Chastain; Carrie Phillipi; Sally A Santen; Beth A Barron; Vivian Obeso; Sandra L Yingling; Core Entrustable Professional Activities for Entering Residency Pilot Program

doi:10.1097/ACM.0000000000004222

Acad Med. 2022 Apr 1;97(4):544-551. doi: 10.1097/ACM.0000000000004222.

Authors

Michael S Ryan¹, Asra R Khan², Yoon Soo Park³, Cody Chastain⁴, Carrie Phillipi⁵, Sally A Santen⁶, Beth A Barron⁷, Vivian Obeso⁸, Sandra L Yingling⁹; Core Entrustable Professional Activities for Entering Residency Pilot Program

Affiliations

¹ M.S. Ryan is associate professor and assistant dean, Clinical Medical Education, Department of Pediatrics, Virginia Commonwealth University, Richmond, Virginia; ORCID: https://orcid.org/0000-0003-3266-9289 .
² A.R. Khan is associate professor, director, Doctoring and Clinical Skills course, and clerkship director, Department of Internal Medicine, University of Illinois College of Medicine, Chicago, Illinois; ORCID: https://orcid.org/0000-0002-2306-4643 .
³ Y.S. Park is director, Health Professions Education Research, and member of the faculty, Harvard Medical School and Massachusetts General Hospital, Boston, Massachusetts; ORCID: https://orcid.org/0000-0001-8583-4335 .
⁴ C. Chastain is assistant professor, Department of Internal Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee.
⁵ C. Phillipi is professor and vice chair of education, Department of Pediatrics, Oregon Health & Science University, Portland, Oregon.
⁶ S.A. Santen is professor and senior associate dean, Assessment, Evaluation, and Scholarship, Department of Emergency Medicine, Virginia Commonwealth University, Richmond, Virginia.
⁷ B.A. Barron is associate professor and associate director, Simulation, Department of Internal Medicine, Columbia University School of Medicine, New York, New York.
⁸ V. Obeso is associate professor and assistant dean, Curriculum and Medical Education, Department of Internal Medicine, Florida International University, Miami, Florida.
⁹ S.L. Yingling is assistant professor and associate dean, Educational Planning and Quality Improvement, Department of Medical Education, University of Illinois College of Medicine, Chicago, Illinois; ORCID: https://orcid.org/0000-0002-9072-7590 .

Abstract

Purpose: In undergraduate medical education (UME), competency-based medical education has been operationalized through the 13 Core Entrustable Professional Activities for Entering Residency (Core EPAs). Direct observation in the workplace using rigorous, valid, reliable measures is required to inform summative decisions about graduates' readiness for residency. The purpose of this study is to investigate the validity evidence of 2 proposed workplace-based entrustment scales.

Method: The authors of this multisite, randomized, experimental study used structured vignettes and experienced raters to examine validity evidence of the Ottawa scale and the UME supervisory tool (Chen scale) in 2019. The authors used a series of 8 cases (6 developed de novo) depicting learners at preentrustable (less-developed) and entrustable (more-developed) skill levels across 5 Core EPAs. Participants from Core EPA pilot institutions rated learner performance using either the Ottawa or Chen scale. The authors used descriptive statistics and analysis of variance to examine data trends and compare ratings, conducted interrater reliability and generalizability studies to evaluate consistency among participants, and performed a content analysis of narrative comments.

Results: Fifty clinician-educators from 10 institutions participated, yielding 579 discrete EPA assessments. Both Ottawa and Chen scales differentiated between less- and more-developed skill levels (P < .001). The intraclass correlation was good to excellent for all EPAs using Ottawa (range, 0.68-0.91) and fair to excellent using Chen (range, 0.54-0.83). Generalizability analysis revealed substantial variance in ratings attributable to the learner-EPA interaction (59.6% for Ottawa; 48.9% for Chen), suggesting that variability in ratings was appropriately associated with performance on individual EPAs.

Conclusions: In a structured setting, both the Ottawa and Chen scales distinguished between preentrustable and entrustable learners; however, the Ottawa scale demonstrated more desirable characteristics. These findings represent a critical step forward in developing valid, reliable instruments to measure learner progression toward entrustment for the Core EPAs.
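
Illustrative note: the interrater reliability reported in the Results is typically summarized with an intraclass correlation coefficient (ICC). The abstract does not state which ICC form the authors used, so the sketch below assumes the two-way random-effects, absolute-agreement, single-rater form, often written ICC(2,1). The function name `icc2_1` and the ratings matrix are hypothetical and are not data from the study.

```python
from statistics import mean


def icc2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` is a list of rows, one row per rated case (target) and one
    column per rater. Every case must be scored by every rater.
    """
    n = len(ratings)        # number of cases
    k = len(ratings[0])     # number of raters
    grand = mean(x for row in ratings for x in row)
    row_means = [mean(row) for row in ratings]
    col_means = [mean(row[j] for row in ratings) for j in range(k)]

    # Sums of squares from a two-way ANOVA without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)                # between-case mean square
    ms_cols = ss_cols / (k - 1)                # between-rater mean square
    ms_error = ss_error / ((n - 1) * (k - 1))  # residual mean square

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical entrustment ratings: 4 vignettes (rows) scored by 3 raters.
example = [
    [1, 2, 1],
    [4, 4, 5],
    [2, 2, 3],
    [5, 4, 5],
]
print(round(icc2_1(example), 2))
```

Values closer to 1 indicate that raters assign similar scores to the same vignette. The qualitative labels used in the Results (fair, good, excellent) depend on cutoffs that the abstract does not specify.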

Publication types

Randomized Controlled Trial

MeSH terms

Clinical Competence
Competency-Based Education
Education, Medical, Undergraduate*
Humans
Internship and Residency*
Reproducibility of Results
Workplace

Grants and funding

UL1 TR002649/TR/NCATS NIH HHS/United States