How to measure the diagnostic accuracy of noninvasive liver fibrosis indices: the area under the ROC curve revisited

Jerome Lambert; Philippe Halfon; Guillaume Penaranda; Pierre Bedossa; Patrice Cacoub; Fabrice Carrat

doi:10.1373/clinchem.2007.097923

How to measure the diagnostic accuracy of noninvasive liver fibrosis indices: the area under the ROC curve revisited

Clin Chem. 2008 Aug;54(8):1372-8. doi: 10.1373/clinchem.2007.097923. Epub 2008 Jun 6.

Authors

Jerome Lambert¹, Philippe Halfon, Guillaume Penaranda, Pierre Bedossa, Patrice Cacoub, Fabrice Carrat

Affiliation

¹ Université Pierre et Marie Curie-Paris; INSERM, UMR-S 707; unité de santé publique, Assistance Publique Hôpitaux de Paris, Hôpital Saint-Antoine, Paris, France.

PMID: 18539647
DOI: 10.1373/clinchem.2007.097923

Abstract

Background: The area under the ROC curve (AUC) is widely used to measure the diagnostic accuracy of noninvasive fibrosis indices. However, use of the AUC assumes a binary gold standard, whereas fibrosis staging is based on an ordinal scale and also depends on the distribution of fibrosis stages in the study sample. We explored other fibrosis staging accuracy measures designed for ordinal gold standards, the C-statistic and the Obuchowski measure.

Methods: We performed a simulation study to assess the bias in estimating the accuracy measures when the distribution of fibrosis stages in the study sample do not fit the reference distribution in the population to which the indices are applied. We also estimated the type I error of the tests comparing these measures in 2 samples with different distributions of fibrosis stages. We illustrated the practical use of these measures by reanalyzing real data.

Results: Compared with the AUC or the C-statistic, the Obuchowski measure showed limited bias when the distribution of fibrosis stages in the study sample differed from the reference distribution. The type I error was strongly inflated with the AUC or the C-statistic but was preserved in the Obuchowski measure. When we compared noninvasive indices on real data, AUC analysis led to discordant results depending on how the fibrosis stages were grouped together. One single conclusion was drawn from the analysis based on the Obuchowski measure.

Conclusions: We recommend using the Obuchowski measure for assessing the diagnostic accuracy of noninvasive indices of fibrosis.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bias
Biomarkers / blood
Computer Simulation
Data Interpretation, Statistical*
Diagnostic Tests, Routine* / methods
Diagnostic Tests, Routine* / statistics & numerical data
Hepatitis C, Chronic / complications
Humans
Liver / pathology
Liver Cirrhosis / blood
Liver Cirrhosis / diagnosis*
Liver Cirrhosis / etiology
Liver Cirrhosis / pathology
ROC Curve*
Reference Standards
Sensitivity and Specificity
Severity of Illness Index

Substances

Biomarkers