Review of ROC analysis
Some of the material was modified from the review of the application of ROC analysis, "Measuring the Observer Performance of Digtial Systems" by Philip F Judy, Cornella M Schaefer, Jorg-Wilhem Oestmann and Reginald E Greene. In:Computed Digital Radiolgraphy in Clical Practice. Editors RE Greene and JW Ostmann, Thieme Medical Publishers, New York, 1992:59-69. (Reprint available from first author.)
Maximum likelihood fitting procedure
Experimental images are divided into two
classes of stimuli such as A (control images) and B (images with
pulmonary nodules). In an ROC experiment, observers make an
ordinal rating to indicate their confidence that the stimulus
belongs to alternative A rather than alternative B. The rating
is assumed to be related to a continuous decision variable that
is monotonically related to the observer's estimate of the likelihood
ratio that the stimuli is alternative A rather than alternative
B. The decision variable has a separate probability density
distribution for each alternative. Each probability distribution
can be used to determine the fraction of stimuli A or B that the
observer will assign to each ordinal rating. It is the area under
the probability density distribution between two values of the
decision variable, called criteria.
Primarily for the convenience of fitting the
rating data, gaussian distributions of the decision variable
are usually assumed. The parameters of distribution and quantities
derived from them are useful as summary statistics of the observers'
performance. A computer program originally developed by Dorfman
and Alf1 and widely distributed by Charles Metz of the Department
of Radiology at the University of Chicago is used to analyze rating
data. We have extended the procedure to treat more the two alternatives.2
Area under the ROC curve
Receiver operating characteristic (ROC)
analysis is the standard approach to evaluate the sensitivity
and specificity of diagnostic procedures.3 ROC analysis estimates
a curve, which describes the inherent tradeoff between sensitivity
and specificity of a diagnostic test. Each point on the ROC curve
is associated with a specific diagnostic criterion. This point
will vary among observers because their diagnostic criteria will
vary even when their ROC curves are the same. The area under
the ROC curve (A-z) has become a particularly important metric
for evaluating diagnostic procedures because it is the average
sensitivity over all possible specificities.4,5,6
Notes
1 Dorfmann DD, Alf E. Maximum likelihood estimation of parameters of signal detection theory and determination of confidence intervals - Rating-method data. J Math Psychol 1969;6:487-496.
2 Kijewski MF, Swensson RG, Judy PF. Analyisis of rating data from multiple-alternative tasks. J MAth Psychol 1989;33(4):428-451.
3 Swets JA, Pickett RM. Evaluation of diagnostic systems: Methods from signal detection theory. Academic Press, New York, 1992.
4 Swets JA. ROC anaylis applied to the evaluation of medical imaging techniques. Invest Radiol 1979;14:109-121.
5 Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic curve. Radiology 1982;143:29-36.
6 Metz CE. ROC methodology in radiologic imaging. Invest Radiolo 1986;21:720-733.
[Format Density] | [Introduction]
Influence of CT Image Size and
Format on Accuracy of Lung Nodule Detection
SE Seltzer, PF Judy, U Feldman,
L Scarff, FL Jacobson. Radiology 1998;206:617-622.
Is this a mirrored page?
The official URL is http://brighamrad.harvard.edu/research/topics/vispercep/ROC.html
Posted February 23, 1998