Real-time test-retest and interrater reliability of select physical performance measures in physically active college-aged students

Int J Sports Phys Ther. 2014 Dec;9(7):874-87.

Abstract

Background: A new 16-item physical performance measure screening battery (16-PPM) was developed in order to expand on established movement based qualitatively scored functional screening batteries to encompass a broader spectrum of quantitatively scored functional constructs such as strength, endurance, and power.

Purpose/hypothesis: The purpose of this study was quantify the real-time tests-retest and expert versus novice interrater reliability of the 16-PPM screen on a group of physically active college-aged individuals. The authors' hypothesized that the test-retest and interrater reliability of quantitatively-scored performance measures would be highly correlated (ICC ≥ 0.75) and that qualitatively-scored movement screening tests would be moderately correlated (K w = 0.41-0.60).

Study design: Cohort reliability study.

Methods: Nineteen (8 males, 11 females) healthy physically active college-aged students completed the 16-PPM on two days, one week apart.

Results: The majority of the quantitatively scored components of the 16-PPMs demonstrated good expert-novice interrater reliability (ICC > 0.75), while qualitatively scored tests had moderate (K w = 0.41-0.60) to substantial (K w = 0.61-0.80) agreement. Test-retest reliability was consistent between raters, with most quantitatively scored PPMs exhibiting superior reliability to the qualitatively scored PPMs.

Conclusions: The 16-PPM test items showed good test-retest and interrater reliability. However, results indicate that expert raters may be more reliable than novice raters for qualitatively scored tests. The validity of this 16-PPM needs to be determined in future studies.

Clinical relevance: Physical performance screening batteries may be used to help identify individuals at risk for future athletic injury; however, current PPMs that rely on qualitatively scored movement screens have exhibited inconsistent and questionable injury prediction validity. The addition of reliable quantitatively scored PPMs may complement qualitatively scored PPMs to improve the battery's predictive ability.

Level of evidence: Level III.

Keywords: Functional screen; physical performance measures; reliability.