Elsevier

Academic Radiology

Volume 19, Issue 12, December 2012, Pages 1491-1498
Academic Radiology

Receiver operating characteristic analysis
An Analytic Expression for the Binormal Partial Area under the ROC Curve

https://doi.org/10.1016/j.acra.2012.09.009Get rights and content

Rationale and Objectives

The partial area under the receiver operating characteristic (ROC) curve (pAUC) is a useful summary measure for diagnostic studies. Unlike most summary measures that are functions of the ROC curve, researchers have not been aware of an analytic expression available for computing the pAUC for an ROC curve based on a latent binormal model. Instead, the pAUC has been computed using numerical integration or a rational polynomial approximation. Our purpose is to provide analytic expressions for the two forms of pAUC.

Materials and Methods

We discuss the two fundamentally different types of pAUC. We present analytic expressions for both types, provide corresponding proofs, and illustrate their application with an example comparing the performances of spin echo and cine magnetic resonance imaging for detecting thoracic aortic dissection.

Results

We provide an example of using the pAUC as the outcome in a multireader multicase analysis. We find that using the pAUC results in a more significant finding.

Conclusions

We have provided analytic expressions for both types of pAUC, making it easier to compute the pAUCs corresponding to binormal ROC curves.

Introduction

In diagnostic radiology, receiver operating characteristic (ROC) curves are commonly used to quantify the accuracy with which a reader (typically a radiologist) can discriminate between images from nondiseased (or normal) and diseased (or abnormal) cases. Although the ROC curve concisely describes the tradeoffs between sensitivity and specificity, typically accuracy is summarized by a summary index that is a function of the ROC curve. Commonly used summary indices include the area under the ROC curve (AUC), the partial area under the ROC curve (pAUC), sensitivity for a given specificity, and specificity for a given sensitivity. See Zou et al (1) for a concise introduction to ROC analysis.

A common method for estimating the ROC curve is likelihood estimation under the assumption of a latent binormal model 2, 3, 4, 5; alternatively, a generalized linear model approach can also be used 6, 7 based on the binormal model assumption. Under the latent binormal model assumption the ROC curve can be described by two parameters. Except for the pAUC, analytic expressions have been routinely employed for expressing the indices previously mentioned as a function of the binormal ROC curve parameters. It is generally believed that the pAUC, assuming a latent binormal model, cannot be expressed as an analytic expression. For example, Pepe (8) states: “Unfortunately, a simple analytic expression does not exist for the pAUC summary measure. It must be calculated using numerical integration or a rational polynomial approximation.” Similarly, Zhou et al (9) state: “This partial area as it is known, is evaluated by numerical integration (McClish, 1989).” Although these methods can be programmed, having a simple expression for the pAUC would be much more convenient.

It is generally not known that Pan and Metz (10) provided analytic expressions for the two forms of pAUC. However, the expressions they provided were incorrect and they did not provide proofs for their results. More importantly, it is generally not known that Thompson and Zucchini (11) provided a correct analytic expression for one form of pAUC as well as the proof. In fact, we only became aware of this latter result during the final stage of submitting this article. The purpose of this article is to bring to the attention of the reader the result provided by Thompson and Zucchini, extend their result to the second form of pAUC, and illustrate use of both pAUC expressions with a real data set that compares the relative performance of single spin-echo magnetic resonance imaging (SE MRI) to cinematic presentation of MRI (CINE MRI) for the detection of thoracic aortic dissection. In addition, we provide proofs for both results that are more accessible to radiology researchers and clinicians than the proof given by Thomas and Zucchini.

Section snippets

Two Different pAUCs

Let FPF and TPF denote false- and true-positive fractions for a given classification threshold such that an image with a test result equal or greater than the threshold is classified as diseased, and otherwise nondiseased. That is, FPF is the probability that a test result for a nondiseased subject exceeds the threshold and TPF is the probability that a test result for a diseased subject exceeds the threshold. The ROC curve is a plot of TPF versus FPF for all possible thresholds. FPF and TPF

Results

The ROC curves computed for the example data set are presented in Figure 2. Table 1 presents the corresponding binormal parameter estimates for a and b and estimates and standard errors for AUC, pAUCFPF for FPF intervals (0.0, 0.2) and (0.0, 0.1), and pAUCTPF for TPF intervals (0.8, 1.0) and (0.9, 1.0). The pAUCs have been normalized by dividing by the length of the defining interval; thus, the pAUC values represent average sensitivity or specificity over the corresponding defining FPF or TPF

Discussion

For the two types of pAUCs, we derived analytic expressions under the assumption of a latent binormal model. Previously it was believed that analytic expressions did not exist, even though Thompson and Zucchini (11) had stated and proved Equation 2, and thus numerical methods have been used to solve for pAUC values. The formulas presented in this article greatly simplify computation of pAUCs.

We illustrated use of these expressions with a real data set in which, using a multireader analysis, we

Acknowledgments

I thank Carolyn Van Dyke, MD, for sharing her data set for the example. I thank the reviewers for helpful suggestions that clarified the presentation.

References (28)

  • M. Pepe

    The statistical evaluation of medical tests for classification and prediction

    (2003)
  • X.-H. Zhou et al.

    Statistical methods in diagnostic medicine

    (2011)
  • M.L. Thompson et al.

    On the statistical analysis of ROC curves

    Stat Med

    (1989)
  • D.K. McClish

    Analyzing a portion of the ROC curve

    Med Decision Making

    (1989)
  • Cited by (14)

    • Diagnostic accuracy of technologies for glaucoma case-finding in a community setting

      2015, Ophthalmology
      Citation Excerpt :

      Initial diagnostic accuracy estimates of each index test to detect glaucoma suspect/definite POAG combined and definite POAG were evaluated using the predefined cutoffs for abnormality to generate sensitivity, specificity, and likelihood ratios with 95% confidence intervals (CIs). To compare index test performance within a clinically relevant range for detection of a low-prevalence disease, we determined the sensitivity at 90% specificity and normalized the partial area under the receiver operating characteristic curves (AUROC) to determine the average sensitivity27 between 90% and 100% specificity. To test for any statistically significant differences between sensitivity at a set specificity and partial AUROC curve estimates, the Wald test was used.28

    • Mixtures of receiver operating characteristic curves

      2013, Academic Radiology
      Citation Excerpt :

      Therefore, estimation of the AUC for the p-mixture ROC curve requires only the estimation of the AUCs of its component ROC curves. There is no closed form expression for the pAUC of a binormal curve (26); however, it can be written as a function of the cumulative density function of the standard bivariate normal (27). It follows that there is no closed form expression for the pAUC of the mixture ROC curve either, but this is not a burden: an estimate of pAUC(p) can be obtained by numerically integrating the component ROC curves or using the bivariate formulation (27).

    • An additive selection of markers to improve diagnostic accuracy based on a discriminatory measure

      2013, Academic Radiology
      Citation Excerpt :

      Following the proposed selection procedure, a subset of algorithms can be selected and measurements from these algorithms can be combined to generate a composite marker which tends to have better accuracy than an individual algorithm. As one referee points out, our proposed algorithm can be easily extended to marker selection in terms of the partial AUC (pAUC) (30) and/or multireader multicase (MRMC) ROC. In the first case, we can provide clinically relevant values of test sensitivity or specificity and replace full AUC in discriminatory score of the marker by the pAUC.

    View all citing articles on Scopus

    Stephen Hillis was supported by the National Institute of Biomedical Imaging and Bioengineering grants R01EB000863 and R01EB013667.

    Deceased.

    View full text