Skip to main content
Erschienen in: Journal of Digital Imaging 3/2012

01.06.2012

Consensus Versus Disagreement in Imaging Research: a Case Study Using the LIDC Database

verfasst von: Dmitriy Zinovev, Yujie Duo, Daniela S. Raicu, Jacob Furst, Samuel G. Armato

Erschienen in: Journal of Imaging Informatics in Medicine | Ausgabe 3/2012

Einloggen, um Zugang zu erhalten

Abstract

Traditionally, image studies evaluating the effectiveness of computer-aided diagnosis (CAD) use a single label from a medical expert compared with a single label produced by CAD. The purpose of this research is to present a CAD system based on Belief Decision Tree classification algorithm, capable of learning from probabilistic input (based on intra-reader variability) and providing probabilistic output. We compared our approach against a traditional decision tree approach with respect to a traditional performance metric (accuracy) and a probabilistic one (area under the distance–threshold curve—AuCdt). The probabilistic classification technique showed notable performance improvement in comparison with the traditional one with respect to both evaluation metrics. Specifically, when applying cross-validation technique on the training subset of instances, boosts of 28.26% and 30.28% were noted for the probabilistic approach with respect to accuracy and AuCdt, respectively. Furthermore, on the validation subset of instances, boosts of 20.64% and 23.21% were noted again for the probabilistic approach with respect to the same two metrics. In addition, we compared our CAD system results with diagnostic data available for a small subset of the Lung Image Database Consortium database. We discovered that when our CAD system errs, it generally does so with low confidence. Predictions produced by the system also agree with diagnoses of truly benign nodules more often than radiologists, offering the possibility of reducing the false positives.
Literatur
1.
Zurück zum Zitat Bankier AA, Levin D, Halpern EF, Kressel HY: Consensus interpretation in imaging research: is there a better way? Radiology 257:14–17, 2010PubMedCrossRef Bankier AA, Levin D, Halpern EF, Kressel HY: Consensus interpretation in imaging research: is there a better way? Radiology 257:14–17, 2010PubMedCrossRef
2.
Zurück zum Zitat Mower WR: Evaluating bias and variability in diagnostic test reports. Ann Emerg Med 33(1):85–91, 1999PubMedCrossRef Mower WR: Evaluating bias and variability in diagnostic test reports. Ann Emerg Med 33(1):85–91, 1999PubMedCrossRef
3.
Zurück zum Zitat Turner DA: Observer variability: what to do until perfect diagnostic tests are invented. J Nucl Med 19(4):435–437, 1978PubMed Turner DA: Observer variability: what to do until perfect diagnostic tests are invented. J Nucl Med 19(4):435–437, 1978PubMed
4.
Zurück zum Zitat Jarvik JG, Deyo RA: Moderate versus mediocre: the reliability of spine MR data interpretations. Radiology 250(1):15–17, 2009PubMedCrossRef Jarvik JG, Deyo RA: Moderate versus mediocre: the reliability of spine MR data interpretations. Radiology 250(1):15–17, 2009PubMedCrossRef
5.
Zurück zum Zitat Carrino JA, Lurie JD, Tosteson AN, et al: Lumbar spine: reliability of MR imaging findings. Radiology 250(1):161–170, 2009PubMedCrossRef Carrino JA, Lurie JD, Tosteson AN, et al: Lumbar spine: reliability of MR imaging findings. Radiology 250(1):161–170, 2009PubMedCrossRef
6.
Zurück zum Zitat MacMahon H, Engelmann R, Behlen F, Hoffmann K, Ishida T, Roe C, Metz C, Doi K: Computer-aided diagnosis of pulmonary nodules: Results of a large-scale observer test. Radiology 13:723–726, 1999 MacMahon H, Engelmann R, Behlen F, Hoffmann K, Ishida T, Roe C, Metz C, Doi K: Computer-aided diagnosis of pulmonary nodules: Results of a large-scale observer test. Radiology 13:723–726, 1999
7.
Zurück zum Zitat Matsuki Y, Nakamura K, Watanabe H, Aoki T, Nakata H, Katsuragawa S, Doi K: Usefulness of an artificial neural network for differentiating benign from malignant pulmonary nodules on high-resolution CT: evaluation with receiver operating characteristic analysis. Am J Roentgenol 178(3):657–663, 2002 Matsuki Y, Nakamura K, Watanabe H, Aoki T, Nakata H, Katsuragawa S, Doi K: Usefulness of an artificial neural network for differentiating benign from malignant pulmonary nodules on high-resolution CT: evaluation with receiver operating characteristic analysis. Am J Roentgenol 178(3):657–663, 2002
8.
Zurück zum Zitat Li F, Aoyama M, Shiraishi J, et al: Radiologists’ performance for differentiating benign from malignant lung nodules on high-resolution CT using computer estimated likelihood of malignancy. Am J Roentgenol 183:1209–1215, 2004 Li F, Aoyama M, Shiraishi J, et al: Radiologists’ performance for differentiating benign from malignant lung nodules on high-resolution CT using computer estimated likelihood of malignancy. Am J Roentgenol 183:1209–1215, 2004
9.
Zurück zum Zitat Marten K, Grillhösl A, Seyfarth T, Obenauer S, Rummeny EJ, Engelke C: Computer-assisted detection of pulmonary nodules: evaluation of diagnostic performance using an expert knowledge-based detection system with variable reconstruction slice thickness settings. Eur Radiol 15:203–212, 2005PubMedCrossRef Marten K, Grillhösl A, Seyfarth T, Obenauer S, Rummeny EJ, Engelke C: Computer-assisted detection of pulmonary nodules: evaluation of diagnostic performance using an expert knowledge-based detection system with variable reconstruction slice thickness settings. Eur Radiol 15:203–212, 2005PubMedCrossRef
10.
Zurück zum Zitat Peldschus K, Herzog P, Wood SA, Cheema JI, Costello P, Schoepf UJ: Computer-aided diagnosis as a second reader—spectrum of findings in CT studies of the chest interpreted as normal. Chest Journal 128:1517–1523, 2005CrossRef Peldschus K, Herzog P, Wood SA, Cheema JI, Costello P, Schoepf UJ: Computer-aided diagnosis as a second reader—spectrum of findings in CT studies of the chest interpreted as normal. Chest Journal 128:1517–1523, 2005CrossRef
11.
Zurück zum Zitat Baker JA, Rosen EL, Lo JY, Gimenez EI, Walsh R, Soo MS: Computer-aided detection (CAD) in screening mammography: sensitivity of commercial CAD systems for detecting architectural distortion. Am J Roentgenol 181:1083–1088, 2003 Baker JA, Rosen EL, Lo JY, Gimenez EI, Walsh R, Soo MS: Computer-aided detection (CAD) in screening mammography: sensitivity of commercial CAD systems for detecting architectural distortion. Am J Roentgenol 181:1083–1088, 2003
12.
Zurück zum Zitat Ciatto S, Turco MR, Risso G, et al: Comparison of standard reading and computer-aided detection (CAD) on a national proficiency test of screening mammography. Eur J Radiol 45:135–138, 2003PubMedCrossRef Ciatto S, Turco MR, Risso G, et al: Comparison of standard reading and computer-aided detection (CAD) on a national proficiency test of screening mammography. Eur J Radiol 45:135–138, 2003PubMedCrossRef
13.
Zurück zum Zitat Karssemeijer N, Risso G, Catarzi S, et al: Computer-aided detection versus independent double reading of masses on mammograms. Radiology 227:192–200, 2003PubMedCrossRef Karssemeijer N, Risso G, Catarzi S, et al: Computer-aided detection versus independent double reading of masses on mammograms. Radiology 227:192–200, 2003PubMedCrossRef
14.
Zurück zum Zitat Muramatsu C, Li Q, Suzuki K, et al: Investigation of psychophysical measure for evaluation of similar images for mammographic masses: Preliminary results. Medical Physics 32:2295–2304, 2005PubMedCrossRef Muramatsu C, Li Q, Suzuki K, et al: Investigation of psychophysical measure for evaluation of similar images for mammographic masses: Preliminary results. Medical Physics 32:2295–2304, 2005PubMedCrossRef
15.
Zurück zum Zitat Fletcher JW, Kymes SM, Gould M, Alazraki N, Coleman RE, Lowe VJ, et al: A comparison of the diagnostic accuracy of 18FFDG PET and CT in the characterization of solitary pulmonary nodules. J Nucl Med 49:179–185, 2008PubMedCrossRef Fletcher JW, Kymes SM, Gould M, Alazraki N, Coleman RE, Lowe VJ, et al: A comparison of the diagnostic accuracy of 18FFDG PET and CT in the characterization of solitary pulmonary nodules. J Nucl Med 49:179–185, 2008PubMedCrossRef
16.
Zurück zum Zitat Tao Y, Lo S-C B, Freedman M T, Xuan J: Joint segmentation and spiculation detection for ill-defined and spiculated mammographic masses. Proc. SPIE, doi:10.1117/12.844045, February 16, 2010 Tao Y, Lo S-C B, Freedman M T, Xuan J: Joint segmentation and spiculation detection for ill-defined and spiculated mammographic masses. Proc. SPIE, doi:10.1117/12.844045, February 16, 2010
17.
Zurück zum Zitat Sahiner B, Hadjiiski L M, Chan H P, Paramagul C, Nees A, Helvie M, Shi J: Concordance of Computer-Extracted Image Features with BI-RADS Descriptors for Mammographic Mass Margin. Proc. SPIE, doi: 10.1117/12.770752, March 17, 2008 Sahiner B, Hadjiiski L M, Chan H P, Paramagul C, Nees A, Helvie M, Shi J: Concordance of Computer-Extracted Image Features with BI-RADS Descriptors for Mammographic Mass Margin. Proc. SPIE, doi: 10.1117/12.770752, March 17, 2008
18.
Zurück zum Zitat Ochs R, Kimb HJ, Angel E, Panknin C, McNitt-Gray M, Brown M: Forming a reference standard from LIDC data: impact of reader agreement on reported CAD performance. Proc. SPIE, DOI: 10.1117/12.707916, March 30, 2007 Ochs R, Kimb HJ, Angel E, Panknin C, McNitt-Gray M, Brown M: Forming a reference standard from LIDC data: impact of reader agreement on reported CAD performance. Proc. SPIE, DOI: 10.1117/12.707916, March 30, 2007
19.
Zurück zum Zitat Opfer R, Wiemker RD: Performance Analysis For Computer-Aided Lung Nodule Detection On LIDC Data. Proc. SPIE, DOI: 10.1117/12.708210, February 21, 2007 Opfer R, Wiemker RD: Performance Analysis For Computer-Aided Lung Nodule Detection On LIDC Data. Proc. SPIE, DOI: 10.1117/12.708210, February 21, 2007
20.
Zurück zum Zitat Armato III, SG, Roberts RY, Kocherginsky M, Aberle DR, Kazerooni EA, MacMahon H, van Beek EJR, Yankelevitz DF, McLennan G, McNitt-Gray MF, Meyer CR, Reeves AP, Caligiuri P, Quint LE, Sundaram B, Croft BY, Clarke LP: Assessment of radiologist performance in the detection of lung nodules: dependence on the definition of “truth”. Acad Radiol 16:28–38, 2009PubMedCrossRef Armato III, SG, Roberts RY, Kocherginsky M, Aberle DR, Kazerooni EA, MacMahon H, van Beek EJR, Yankelevitz DF, McLennan G, McNitt-Gray MF, Meyer CR, Reeves AP, Caligiuri P, Quint LE, Sundaram B, Croft BY, Clarke LP: Assessment of radiologist performance in the detection of lung nodules: dependence on the definition of “truth”. Acad Radiol 16:28–38, 2009PubMedCrossRef
21.
Zurück zum Zitat Armato III, SG, et al: Lung Image Database Consortium: developing a resource for the medical imaging research community. Radiology 232:739–748, 2004PubMedCrossRef Armato III, SG, et al: Lung Image Database Consortium: developing a resource for the medical imaging research community. Radiology 232:739–748, 2004PubMedCrossRef
22.
Zurück zum Zitat Hillman BJ: ACRIN—lessons learned in conducting multi-center trials of imaging and cancer. Cancer Imaging 5(Spec No A):S97–S101, 2005PubMedCrossRef Hillman BJ: ACRIN—lessons learned in conducting multi-center trials of imaging and cancer. Cancer Imaging 5(Spec No A):S97–S101, 2005PubMedCrossRef
23.
Zurück zum Zitat Elouedi Z, Mellouli K, Smets P: Belief decision trees: theoretical foundations. International Journal of Approximate Reasoning 28:91–124, 2001CrossRef Elouedi Z, Mellouli K, Smets P: Belief decision trees: theoretical foundations. International Journal of Approximate Reasoning 28:91–124, 2001CrossRef
24.
Zurück zum Zitat Wu TF, Lin CJ, Weng RC: Probability estimates for multi-class classification by pairwise coupling. J Mach Learn Res 5(August):975–1005, 2004 Wu TF, Lin CJ, Weng RC: Probability estimates for multi-class classification by pairwise coupling. J Mach Learn Res 5(August):975–1005, 2004
25.
Zurück zum Zitat Robinson PJA: Radiology’s Achilles’ heel: error and variation in the interpretation of the Rontghen image. Br J Radiol 70:1085–1098, 1997PubMed Robinson PJA: Radiology’s Achilles’ heel: error and variation in the interpretation of the Rontghen image. Br J Radiol 70:1085–1098, 1997PubMed
26.
Zurück zum Zitat Reiner B: Uncovering and improving upon the inherent deficiencies of radiology reporting through data mining. J Digit Imaging 23:109–118, 2010PubMedCrossRef Reiner B: Uncovering and improving upon the inherent deficiencies of radiology reporting through data mining. J Digit Imaging 23:109–118, 2010PubMedCrossRef
27.
Zurück zum Zitat Zinovev D, Raicu D, Furst J, Armato III, SG: Predicting radiological panel opinions using a panel of machine learning classifiers. Algorithms Journal 2:1473–1502, 2009CrossRef Zinovev D, Raicu D, Furst J, Armato III, SG: Predicting radiological panel opinions using a panel of machine learning classifiers. Algorithms Journal 2:1473–1502, 2009CrossRef
28.
Zurück zum Zitat Quinlan JR: Improved use of continuous attributes in C4.5. Journal of Artificial Intelligence Research 4:77–90, 1996 Quinlan JR: Improved use of continuous attributes in C4.5. Journal of Artificial Intelligence Research 4:77–90, 1996
29.
Zurück zum Zitat Spackman KA: Signal detection theory: Valuable tools for evaluating inductive learning. Proc. 6th Int. Workshop on Machine Learning 160–163, 1989 Spackman KA: Signal detection theory: Valuable tools for evaluating inductive learning. Proc. 6th Int. Workshop on Machine Learning 160–163, 1989
30.
Zurück zum Zitat Liu H, Song D, Rüger S, Hu R, Uren V: Comparing dissimilarity measures for content-based image retrieval. Proc. 4th Asia Inf. Ret. Conf. on Information Retrieval Technology 44–50, 2008 Liu H, Song D, Rüger S, Hu R, Uren V: Comparing dissimilarity measures for content-based image retrieval. Proc. 4th Asia Inf. Ret. Conf. on Information Retrieval Technology 44–50, 2008
Metadaten
Titel
Consensus Versus Disagreement in Imaging Research: a Case Study Using the LIDC Database
verfasst von
Dmitriy Zinovev
Yujie Duo
Daniela S. Raicu
Jacob Furst
Samuel G. Armato
Publikationsdatum
01.06.2012
Verlag
Springer-Verlag
Erschienen in
Journal of Imaging Informatics in Medicine / Ausgabe 3/2012
Print ISSN: 2948-2925
Elektronische ISSN: 2948-2933
DOI
https://doi.org/10.1007/s10278-011-9445-3

Weitere Artikel der Ausgabe 3/2012

Journal of Digital Imaging 3/2012 Zur Ausgabe

Screening-Mammografie offenbart erhöhtes Herz-Kreislauf-Risiko

26.04.2024 Mammografie Nachrichten

Routinemäßige Mammografien helfen, Brustkrebs frühzeitig zu erkennen. Anhand der Röntgenuntersuchung lassen sich aber auch kardiovaskuläre Risikopatientinnen identifizieren. Als zuverlässiger Anhaltspunkt gilt die Verkalkung der Brustarterien.

S3-Leitlinie zu Pankreaskrebs aktualisiert

23.04.2024 Pankreaskarzinom Nachrichten

Die Empfehlungen zur Therapie des Pankreaskarzinoms wurden um zwei Off-Label-Anwendungen erweitert. Und auch im Bereich der Früherkennung gibt es Aktualisierungen.

Fünf Dinge, die im Kindernotfall besser zu unterlassen sind

18.04.2024 Pädiatrische Notfallmedizin Nachrichten

Im Choosing-Wisely-Programm, das für die deutsche Initiative „Klug entscheiden“ Pate gestanden hat, sind erstmals Empfehlungen zum Umgang mit Notfällen von Kindern erschienen. Fünf Dinge gilt es demnach zu vermeiden.

„Nur wer sich gut aufgehoben fühlt, kann auch für Patientensicherheit sorgen“

13.04.2024 Klinik aktuell Kongressbericht

Die Teilnehmer eines Forums beim DGIM-Kongress waren sich einig: Fehler in der Medizin sind häufig in ungeeigneten Prozessen und mangelnder Kommunikation begründet. Gespräche mit Patienten und im Team können helfen.

Update Radiologie

Bestellen Sie unseren Fach-Newsletter und bleiben Sie gut informiert.