Scoring based on item response theory did not alter the measurement ability of EORTC QLQ-C30 scales

Morten Aa Petersen; Mogens Groenvold; Neil Aaronson; Elisabeth Brenne; Peter Fayers; Julie Damgaard Nielsen; Mirjam Sprangers; Jakob B Bjorner; European Organisation for Research and Treatment of Cancer Quality of Life Group

doi:10.1016/j.jclinepi.2005.02.008

Scoring based on item response theory did not alter the measurement ability of EORTC QLQ-C30 scales

J Clin Epidemiol. 2005 Sep;58(9):902-8. doi: 10.1016/j.jclinepi.2005.02.008.

Authors

Morten Aa Petersen¹, Mogens Groenvold, Neil Aaronson, Elisabeth Brenne, Peter Fayers, Julie Damgaard Nielsen, Mirjam Sprangers, Jakob B Bjorner; European Organisation for Research and Treatment of Cancer Quality of Life Group

Affiliation

¹ The research unit, Department of Palliative Medicine, Bispebjerg Hospital, Bispebjerg bakke 23, 2400 Copenhagen, Denmark. map01@bbh.hosp.dk

PMID: 16085193
DOI: 10.1016/j.jclinepi.2005.02.008

Abstract

Background and objectives: Most health-related quality-of-life questionnaires include multi-item scales. Scale scores are usually estimated as simple sums of the item scores. However, scoring procedures utilizing more information from the items might improve measurement abilities, and thereby reduce the needed sample sizes. We investigated whether item response theory (IRT)-based scoring improved the measurement abilities of the EORTC QLQ-C30 physical functioning, emotional functioning, and fatigue scales.

Methods: Using a database of 13,010 subjects we estimated the relative validities of IRT scoring compared to sum scoring of the scales.

Results: The mean relative validities were 1.04 (physical), 1.03 (emotional), and 0.97 (fatigue). None of these were significantly larger than 1. Thus, no gain in measurement abilities using IRT scoring was found for these scales. Possible explanations include that the items in the scales are not constructed for IRT scoring and that the scales are relatively short.

Conclusion: IRT scoring of the three longest EORTC QLQ-C30 scales did not improve measurement abilities compared to the traditional sum scoring of the scales.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't

MeSH terms

Adult
Databases, Factual
Emotions
Fatigue
Female
Health Status*
Humans
Male
Middle Aged
Models, Statistical
Neoplasm Staging
Neoplasms / physiopathology
Neoplasms / psychology
Quality of Life*
Reproducibility of Results
Sample Size
Surveys and Questionnaires*