Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Gur D, Bandos AI, Rockette HE, Zuley ML, Hakim CM, Chough DM, Ganott MA, Sumkin JH. Is an ROC-type response truly always better than a binary response in observer performance studies? Acad Radiol 2010;17:639-45. [PMID: 20236840 DOI: 10.1016/j.acra.2009.12.012] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Received: 10/05/2009] [Revised: 12/17/2009] [Accepted: 12/27/2009] [Indexed: 01/20/2023]

Cited by Other Article(s)

Tabata K, Uraoka N, Benhamida J, Hanna MG, Sirintrapun SJ, Gallas BD, Gong Q, Aly RG, Emoto K, Matsuda KM, Hameed MR, Klimstra DS, Yagi Y. Validation of mitotic cell quantification via microscopy and multiple whole-slide scanners. Diagn Pathol 2019;14:65. [PMID: 31238983 PMCID: PMC6593538 DOI: 10.1186/s13000-019-0839-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Received: 09/26/2018] [Accepted: 06/11/2019] [Indexed: 01/03/2023] Open

Abstract

BACKGROUND

The establishment of whole-slide imaging (WSI) as a medical diagnostic device allows pathologists to evaluate mitotic activity with this new technology. Furthermore, image digitization provides an opportunity to develop algorithms for automatic quantification, ideally leading to improved reproducibility compared with naked-eye examination by pathologists. To implement such algorithms effectively, the accuracy of mitotic figure detection using WSI should be investigated. In this study, we aimed to measure pathologist performance in detecting mitotic figures (MFs) using multiple platforms (multiple scanners) and compare the results with those obtained using a brightfield microscope.

METHODS

Four slides of canine oral melanoma were prepared and digitized using 4 WSI scanners. In these slides, 40 regions of interest (ROIs) were demarcated, and five observers identified the MFs using different viewing modes: microscopy and WSI. We evaluated the inter- and intra-observer agreements between modes with Cohen's Kappa and determined "true" MFs with a consensus panel. We then assessed the accuracy (agreement with truth) using the average of sensitivity and specificity.

RESULTS

In the 40 ROIs, 155 candidate MFs were detected by five pathologists; 74 of them were determined to be true MFs. Inter- and intra-observer agreement was mostly "substantial" or greater (Kappa = 0.594-0.939). Accuracy was between 0.632 and 0.843 across all readers and modes. After averaging over readers for each modality, we found that mitosis detection accuracy for 3 of the 4 WSI scanners was significantly less than that of the microscope (p = 0.002, 0.012, and 0.001).

CONCLUSIONS

This study is the first to compare WSIs and microscopy in detecting MFs at the level of individual cells. Our results suggest that WSI can be used for mitotic cell detection and offers similar reproducibility to the microscope, with slightly less accuracy.
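The two summary measures named in the METHODS paragraph above, Cohen's kappa for agreement and the average of sensitivity and specificity for accuracy, can be illustrated with a minimal sketch. The function names and the toy calls below are hypothetical and are not taken from the study; the authors' actual analysis (including the reader-averaged significance tests) is not reproduced here.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two equally long sequences of categorical calls."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(rater_a)
    # Observed agreement: fraction of items on which both raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of the raters' marginal label frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


def mean_sensitivity_specificity(truth, calls):
    """Average of sensitivity and specificity against a binary truth."""
    pairs = list(zip(truth, calls))
    tp = sum(1 for t, c in pairs if t and c)
    fn = sum(1 for t, c in pairs if t and not c)
    tn = sum(1 for t, c in pairs if not t and not c)
    fp = sum(1 for t, c in pairs if not t and c)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2.0


if __name__ == "__main__":
    # Hypothetical toy data: 1 = candidate called a mitotic figure, 0 = not.
    consensus = [1, 1, 0, 0]
    reader = [1, 0, 0, 0]
    print(cohens_kappa(consensus, reader))
    print(mean_sensitivity_specificity(consensus, reader))
```

Averaging sensitivity and specificity weights the positive and negative classes equally, which keeps the score from being dominated by whichever class has more candidates.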

Affiliation(s)

Kazuhiro Tabata Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA; Department of Pathology, Nagasaki University Hospital, 1-7-1 Sakamoto, Nagasaki, Nagasaki 8528501 Japan
Naohiro Uraoka Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA
Jamal Benhamida Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA
Matthew G. Hanna Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA
Sahussapont Joseph Sirintrapun Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA
Brandon D. Gallas Center For Devices and Radiological Health, Office of Science and Engineering Laboratories, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993 USA
Qi Gong Center For Devices and Radiological Health, Office of Science and Engineering Laboratories, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993 USA
Rania G. Aly Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA; Department of Pathology, Faculty of Medicine, Alexandria University, 22 El-Guish Road, El-Shatby, Alexandria, 21526 Egypt
Katsura Emoto Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA; Thoracic Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA
Kant M. Matsuda Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA
Meera R. Hameed Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA
David S. Klimstra Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA
Yukako Yagi Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA

The Reproducibility of Changes in Diagnostic Figures of Merit Across Laboratory and Clinical Imaging Reader Studies. Acad Radiol 2017;24:1436-1446. [PMID: 28666723 DOI: 10.1016/j.acra.2017.05.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 01/09/2017] [Revised: 04/28/2017] [Accepted: 05/01/2017] [Indexed: 11/23/2022]

Harvey S, Gallagher AM, Nolan M, Hughes CM. Listening to Women: Expectations and Experiences in Breast Imaging. J Womens Health (Larchmt) 2016;24:777-83. [PMID: 26390380 DOI: 10.1089/jwh.2015.29001.swh] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Indexed: 01/23/2023] Open

Value of gadolinium-enhanced MRI in detection of acute appendicitis in children and adolescents. AJR Am J Roentgenol 2015;203:W543-8. [PMID: 25341169 DOI: 10.2214/ajr.13.12093] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Indexed: 11/18/2022]

Estimating the receiver operating characteristic curve in studies that match controls to cases on covariates. Acad Radiol 2013;20:863-73. [PMID: 23601953 DOI: 10.1016/j.acra.2013.03.004] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Received: 01/30/2013] [Revised: 03/07/2013] [Accepted: 03/08/2013] [Indexed: 11/23/2022]

Samuelson FW. Inference based on diagnostic measures from studies of new imaging devices. Acad Radiol 2013;20:816-24. [PMID: 23643364 DOI: 10.1016/j.acra.2013.03.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 01/07/2013] [Revised: 03/01/2013] [Accepted: 03/07/2013] [Indexed: 10/26/2022]

Nishikawa RM, Pesce LL. Estimating sensitivity and specificity for technology assessment based on observer studies. Acad Radiol 2013;20:825-30. [PMID: 23660073 DOI: 10.1016/j.acra.2013.03.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 01/23/2013] [Revised: 03/23/2013] [Accepted: 03/26/2013] [Indexed: 11/17/2022]

Abbey CK, Eckstein MP, Boone JM. Estimating the relative utility of screening mammography. Med Decis Making 2013;33:510-20. [PMID: 23295543 DOI: 10.1177/0272989x12470756] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Indexed: 11/17/2022]

Eng J. Teaching receiver operating characteristic analysis: an interactive laboratory exercise. Acad Radiol 2012;19:1452-6. [PMID: 23040502 DOI: 10.1016/j.acra.2012.09.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Received: 09/01/2012] [Revised: 09/09/2012] [Accepted: 09/10/2012] [Indexed: 11/16/2022]

Rafferty EA, Park JM, Philpotts LE, Poplack SP, Sumkin JH, Halpern EF, Niklason LT. Assessing radiologist performance using combined digital mammography and breast tomosynthesis compared with digital mammography alone: results of a multicenter, multireader trial. Radiology 2012;266:104-13. [PMID: 23169790 DOI: 10.1148/radiol.12120674] [Citation(s) in RCA: 284] [Impact Index Per Article: 23.7] [Indexed: 11/11/2022]

Abstract

PURPOSE

To compare radiologists' diagnostic accuracy and recall rates for breast tomosynthesis combined with digital mammography versus digital mammography alone.

MATERIALS AND METHODS

Institutional review board approval was obtained at each accruing institution. Participating women gave written informed consent. Mediolateral oblique and craniocaudal digital mammographic and tomosynthesis images of both breasts were obtained from 1192 subjects. Two enriched reader studies were performed to compare digital mammography with tomosynthesis against digital mammography alone. Study 1 comprised 312 cases (48 cancer cases) with images read by 12 radiologists; study 2, 312 cases (51 cancer cases) with 15 radiologists. Study 1 readers recorded only that an abnormality requiring recall was present; study 2 readers had additional training and recorded both lesion type and location. Diagnostic accuracy was compared with receiver operating characteristic analysis. Recall rates of noncancer cases, sensitivity, specificity, and positive and negative predictive values determined by analyzing Breast Imaging Reporting and Data System scores were compared for the two methods.

RESULTS

Diagnostic accuracy for combined tomosynthesis and digital mammography was superior to that of digital mammography alone. Average difference in area under the curve in study 1 was 7.2% (95% confidence interval [CI]: 3.7%, 10.8%; P < .001) and in study 2 was 6.8% (95% CI: 4.1%, 9.5%; P < .001). All 27 radiologists increased diagnostic accuracy with addition of tomosynthesis. Recall rates for noncancer cases for all readers significantly decreased with addition of tomosynthesis (range, 6%-67%; P < .001 for 25 readers, P < .03 for all readers). Increased sensitivity was largest for invasive cancers: 15% and 22% in studies 1 and 2 versus 3% for in situ cancers in both studies.

CONCLUSION

Addition of tomosynthesis to digital mammography offers the dual benefit of significantly increased diagnostic accuracy and significantly reduced recall rates for noncancer cases.

SUPPLEMENTAL MATERIAL

http://radiology.rsna.org/lookup/suppl/doi:10.1148/radiol.12120674/-/DC1.

Wunderlich A, Noo F. A nonparametric procedure for comparing the areas under correlated LROC curves. IEEE Trans Med Imaging 2012;31:2050-61. [PMID: 22736638 PMCID: PMC3619029 DOI: 10.1109/tmi.2012.2205015] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Indexed: 05/23/2023]

Chakraborty DP. Measuring agreement between rating interpretations and binary clinical interpretations of images: a simulation study of methods for quantifying the clinical relevance of an observer performance paradigm. Phys Med Biol 2012;57:2873-904. [PMID: 22516804 DOI: 10.1088/0031-9155/57/10/2873] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 11/11/2022]

Abstract

Laboratory receiver operating characteristic (ROC) studies, which are often used to evaluate medical imaging systems, differ from 'live' clinical interpretations in several respects that could compromise their clinical relevance. The aim was to develop methodology for quantifying the clinical relevance of a laboratory ROC study. A simulator was developed to generate ROC ratings data and binary clinical interpretations classified as correct or incorrect for a common set of images interpreted under clinical and laboratory conditions. The area under the trapezoidal ROC curve (AUC) was used as the laboratory figure-of-merit and the fraction of correct clinical decisions as the clinical figure-of-merit. Conventional agreement measures (Pearson, Spearman, Kendall and kappa) between the bootstrap-induced fluctuations of the two figures of merit were estimated. A jackknife pseudovalue transformation applied to the figures of merit was also investigated as a way to capture agreement existing at the individual image level that could be lost at the figure-of-merit level. It is shown that the pseudovalues define a relevance-ROC curve. The area under this curve (rAUC) measures the ability of the laboratory figure-of-merit-based pseudovalues to correctly classify incorrect versus correct clinical interpretations. Therefore, rAUC is a measure of the clinical relevance of an ROC study. The conventional measures and rAUC were compared under varying simulator conditions. It was found that design details of the ROC study, namely the number of bins, the difficulty level of the images, the ratio of disease-present to disease-absent images and the unavoidable difference between laboratory and clinical performance levels, can lead to serious underestimation of the agreement as indicated by conventional agreement measures, even for perfectly correlated data, while rAUC showed high agreement and was relatively immune to these details. At the same time, rAUC was sensitive to factors such as intrinsic correlation between the laboratory and clinical decision variables and differences in reporting thresholds that are expected to influence agreement both at the individual image level and at the figure-of-merit level. Suggestions are made for how to conduct relevance-ROC studies aimed at assessing agreement between laboratory and clinical interpretations. The method could be used to evaluate the clinical relevance of alternative scalar figures of merit, such as the sensitivity at a predefined specificity.
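Two building blocks mentioned in the abstract above, the trapezoidal AUC and jackknife pseudovalues, can be sketched as follows. This is an illustration under stated assumptions only: the function names and toy ratings are hypothetical, the pseudovalue formula is the standard leave-one-case-out jackknife over the pooled cases, and whether the paper used exactly this pooling is not confirmed by the abstract. The simulator and the relevance-ROC (rAUC) analysis are not reproduced.

```python
def empirical_auc(diseased, nondiseased):
    """Trapezoidal (empirical) AUC, computed as the Mann-Whitney statistic.

    Each diseased/nondiseased pair scores 1 if the diseased rating is
    higher, 0.5 if the ratings tie, and 0 otherwise.
    """
    if not diseased or not nondiseased:
        raise ValueError("need at least one case in each class")
    total = 0.0
    for d in diseased:
        for h in nondiseased:
            if d > h:
                total += 1.0
            elif d == h:
                total += 0.5
    return total / (len(diseased) * len(nondiseased))


def jackknife_pseudovalues(diseased, nondiseased):
    """Leave-one-case-out pseudovalues of the AUC, one per case.

    Assumption: pseudovalue_i = n * AUC - (n - 1) * AUC_without_case_i,
    with n the pooled number of cases. Requires at least two cases per
    class so that every reduced data set still has both classes.
    """
    diseased, nondiseased = list(diseased), list(nondiseased)
    if len(diseased) < 2 or len(nondiseased) < 2:
        raise ValueError("need at least two cases in each class")
    n = len(diseased) + len(nondiseased)
    full = empirical_auc(diseased, nondiseased)
    pseudo = []
    for i in range(len(diseased)):
        reduced = diseased[:i] + diseased[i + 1:]
        pseudo.append(n * full - (n - 1) * empirical_auc(reduced, nondiseased))
    for j in range(len(nondiseased)):
        reduced = nondiseased[:j] + nondiseased[j + 1:]
        pseudo.append(n * full - (n - 1) * empirical_auc(diseased, reduced))
    return pseudo


if __name__ == "__main__":
    # Hypothetical confidence ratings on a 1-5 scale.
    diseased_ratings = [3, 4, 5]
    nondiseased_ratings = [1, 2, 3]
    print(empirical_auc(diseased_ratings, nondiseased_ratings))
    print(jackknife_pseudovalues(diseased_ratings, nondiseased_ratings))
```

Each pseudovalue attaches a per-image contribution to a figure of merit that is otherwise defined only for the whole case set, which is what allows agreement to be examined at the individual image level.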

Samuelson F, Gallas BD, Myers KJ, Petrick N, Pinsky P, Sahiner B, Campbell G, Pennello GA. The importance of ROC data. Acad Radiol 2011;18:257-8; author reply 259-61. [PMID: 21232688 DOI: 10.1016/j.acra.2010.10.016] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Received: 10/18/2010] [Revised: 10/18/2010] [Accepted: 10/20/2010] [Indexed: 11/19/2022]

Gur D, Bandos AI, Rockette HE. Reply. Acad Radiol 2011. [DOI: 10.1016/j.acra.2010.11.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/30/2022]