Skip to main content
Erschienen in: European Radiology 4/2022

Open Access 18.11.2021 | Editorial

How to show that a new imaging method can replace a standard method, when no reference standard is available?

verfasst von: Patrick Omoumi, Nancy A. Obuchowski

Erschienen in: European Radiology | Ausgabe 4/2022

download
DOWNLOAD
print
DRUCKEN
insite
SUCHEN
Hinweise

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Assessing the performance of a new diagnostic method is a common problem in radiology. With technical advances impacting image acquisition and post-processing, including applications of artificial intelligence, newer ways to perform imaging have emerged and need to be compared to the methods of reference.
Practical examples include low-dose CT examinations providing similar image quality as standard dose acquisitions thanks to improved image reconstruction [13], or accelerated MRI protocols by using novel image acquisition or image reconstruction methods [4, 5].
While the assessment of image quality and phantom studies might be the first steps of the evaluation, the diagnostic performance of the new imaging method should eventually be compared to that of the old method, typically using a reference standard such as surgery. The statistical methods for this type of analysis are well established [6, 7].
However, more often than not, there is no suitable or convenient reference standard against which the performance of the new diagnostic method can be tested. An example of this is imaging of the degenerative spine for which a surgical correlation is often missing. When performed, surgery only provides a limited amount of information compared to the extensive assessment that can be done by imaging (for example, no assessment of bone marrow can usually be done at spine surgery).
In such situations, investigators have tried a variety of approaches to quantify performance of the new method relative to the current one, such as reporting accuracy using the current test as the reference standard, assessing correlation of the new test’s findings with the current test, estimating intra- or inter-reader agreement of the new and existing tests, and testing for lack of significant differences between the findings of the tests.
These approaches, however, can provide misleading results. Alternatively, interchangeability is a statistical method to assess whether a new diagnostic method can replace a conventional method when there is no reference standard available. To illustrate, let us consider four studies on degenerative spine MRI taken from the recent literature [811].
These four studies aimed to show that MRI of the degenerative spine in the sagittal plane may be limited to a fast spin echo/turbo spin echo (FSE/TSE) Dixon fluid-sensitive sequence, with no need to perform additional T1-weighted (T1w) sequences. While the conclusions were the same, the approaches used were different, some being potentially misleading.
Sollmann et al tested for lack of significant differences between the number of abnormalities detected on the new protocol including fat-only (FO), in-phase (IP), and water-only (WO) images derived from a FSE/TSE Dixon T2-weighted sequence and the standard protocol (T1w, IP, WO images) [9]. However, this approach focuses on pooled results rather than individual subject results and suffers from low power to detect small but important differences.
Saiffudin et al studied the agreement between the new and standard protocols with different readers, and compared the inter-reader agreement obtained with the two protocols. This approach may also be misleading in various ways. First, poor inter-technique agreement may be due to poor intra-reader agreement for variables which are subjectively graded, thereby underestimating the potential for the new protocol to replace the standard one. In that study, kappa statistics for inter-protocol agreement were as low as 0.39 for some variables, but intra-reader agreement was not reported. Second, measures of intra- and inter-reader agreement, e.g., kappa statistic, tell us nothing about diagnostic performance.
In the study by Yang et al, diagnostic performance of the new protocol was quantified using the standard protocol as the reference standard. Because the new test often makes similar errors as the standard test, this approach usually leads to gross exaggeration of accuracy of the new protocol. Furthermore, when the new protocol disagrees with the standard one, it is misleading to assume that the standard protocol’s findings are always correct since it is not a true reference standard.
The first paper of the series, on the other hand, had used interchangeability to demonstrate that the new simplified protocol could replace the standard protocol [8]. Interchangeability is a statistical method that describes the effect of replacing the current test with the new test, without the need for a reference standard. The idea is to first quantify the ability of the current test to agree with itself (i.e., measure the inter-reader agreement where all readers use the current test), then compare this with the ability of the new test to agree with the current test. If the frequency of agreement and types of disagreements between new and current tests are similar to when the current test is compared with itself, then we conclude that the new test is interchangeable with the current test (Fig. 1). Interchangeability may be tested for both qualitative and quantitative data [12].
It is important that the definition of agreement and the maximum allowable difference between new and current vs. current with itself are defined a priori, and that the study is powered to detect these differences.
Apart from the initial study by Zanchi et al, there are several other examples of the use of interchangeability of imaging tests in the literature [8, 13, 14], as well as papers describing the statistical methods [12, 15, 16].
Finally, beyond these statistical considerations, it should be mentioned that interchangeability needs to be proven for all types of information that the standard method is supposed to provide in order for the new method to be able to replace it. For instance, in degenerative spine MRI, T1w images are also used for the assessment of bone marrow pathology. If the new protocol does not include T1w sequences, the information must be included in the other images [14]. Furthermore, the detailed acquisition parameters should also be specified in the study and taken into account when replacing a standard protocol by a new one. Indeed, what might be true for a certain sequence on a certain scanner might not be generalizable to all sequences on all scanners.

Declarations

Guarantor

The scientific guarantor of this publication is Dr Patrick Omoumi.

Conflict of interest

No conflict of interest to declare.

Statistics and biometry

One of the authors is a statistician.
Not applicable.

Ethical approval

Not applicable

Methodology

• Editorial
Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://​creativecommons.​org/​licenses/​by/​4.​0/​.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Unsere Produktempfehlungen

e.Med Interdisziplinär

Kombi-Abonnement

Für Ihren Erfolg in Klinik und Praxis - Die beste Hilfe in Ihrem Arbeitsalltag

Mit e.Med Interdisziplinär erhalten Sie Zugang zu allen CME-Fortbildungen und Fachzeitschriften auf SpringerMedizin.de.

e.Med Radiologie

Kombi-Abonnement

Mit e.Med Radiologie erhalten Sie Zugang zu CME-Fortbildungen des Fachgebietes Radiologie, den Premium-Inhalten der radiologischen Fachzeitschriften, inklusive einer gedruckten Radiologie-Zeitschrift Ihrer Wahl.

Literatur
1.
Zurück zum Zitat Messerli M, Kluckert T, Knitel M et al (2017) Ultralow dose CT for pulmonary nodule detection with chest X-ray equivalent dose - a prospective intra-individual comparative study. Eur Radiol 27:3290–3299CrossRef Messerli M, Kluckert T, Knitel M et al (2017) Ultralow dose CT for pulmonary nodule detection with chest X-ray equivalent dose - a prospective intra-individual comparative study. Eur Radiol 27:3290–3299CrossRef
2.
Zurück zum Zitat Omoumi P, Becce F, Ott JG, Racine D, Verdun FR (2015) Optimization of radiation dose and image quality in musculoskeletal CT: emphasis on iterative reconstruction techniques (part 1). Semin Musculoskelet Radiol 19:415–421CrossRef Omoumi P, Becce F, Ott JG, Racine D, Verdun FR (2015) Optimization of radiation dose and image quality in musculoskeletal CT: emphasis on iterative reconstruction techniques (part 1). Semin Musculoskelet Radiol 19:415–421CrossRef
3.
Zurück zum Zitat Omoumi P, Verdun FR, Becce F (2015) Optimization of radiation dose and image quality in musculoskeletal CT: emphasis on iterative reconstruction techniques (part 2). Semin Musculoskelet Radiol 19:422–430CrossRef Omoumi P, Verdun FR, Becce F (2015) Optimization of radiation dose and image quality in musculoskeletal CT: emphasis on iterative reconstruction techniques (part 2). Semin Musculoskelet Radiol 19:422–430CrossRef
4.
Zurück zum Zitat Roux M, Hilbert T, Hussami M, Becce F, Kober T, Omoumi P (2019) MRI T2 mapping of the knee providing synthetic morphologic images: comparison to conventional turbo spin-echo MRI. Radiology 293:620–630CrossRef Roux M, Hilbert T, Hussami M, Becce F, Kober T, Omoumi P (2019) MRI T2 mapping of the knee providing synthetic morphologic images: comparison to conventional turbo spin-echo MRI. Radiology 293:620–630CrossRef
5.
Zurück zum Zitat Del Grande F, Rashidi A, Luna R et al (2021) Five-minute five-sequence knee MRI using combined simultaneous multislice and parallel imaging acceleration: comparison with 10-minute parallel imaging knee MRI. Radiology 299:635–646CrossRef Del Grande F, Rashidi A, Luna R et al (2021) Five-minute five-sequence knee MRI using combined simultaneous multislice and parallel imaging acceleration: comparison with 10-minute parallel imaging knee MRI. Radiology 299:635–646CrossRef
6.
Zurück zum Zitat Dwyer AJ (1991) Matchmaking and McNemar in the comparison of diagnostic modalities. Radiology 178:328–330CrossRef Dwyer AJ (1991) Matchmaking and McNemar in the comparison of diagnostic modalities. Radiology 178:328–330CrossRef
7.
Zurück zum Zitat Obuchowski NA (2003) Receiver operating characteristic curves and their use in radiology. Radiology 229:3–8CrossRef Obuchowski NA (2003) Receiver operating characteristic curves and their use in radiology. Radiology 229:3–8CrossRef
8.
Zurück zum Zitat Zanchi F, Richard R, Hussami M, Monier A, Knebel JF, Omoumi P (2020) MRI of non-specific low back pain and/or lumbar radiculopathy: do we need T1 when using a sagittal T2-weighted Dixon sequence. Eur Radiol 30:2583–2593CrossRef Zanchi F, Richard R, Hussami M, Monier A, Knebel JF, Omoumi P (2020) MRI of non-specific low back pain and/or lumbar radiculopathy: do we need T1 when using a sagittal T2-weighted Dixon sequence. Eur Radiol 30:2583–2593CrossRef
9.
Zurück zum Zitat Sollmann N, Mönch S, Riederer I, Zimmer C, Baum T, Kirschke JS (2020) Imaging of the degenerative spine using a sagittal T2-weighted DIXON turbo spin-echo sequence. Eur J Radiol 131:109204CrossRef Sollmann N, Mönch S, Riederer I, Zimmer C, Baum T, Kirschke JS (2020) Imaging of the degenerative spine using a sagittal T2-weighted DIXON turbo spin-echo sequence. Eur J Radiol 131:109204CrossRef
10.
Zurück zum Zitat Saifuddin A, Rajakulasingam R, Santiago R, Siddiqui M, Khoo M, Pressney I (2021) Comparison of lumbar degenerative disc disease using conventional fast spin echo T2W MRI and T2 fast spin echo dixon sequences. Br J Radiol 94:20201438CrossRef Saifuddin A, Rajakulasingam R, Santiago R, Siddiqui M, Khoo M, Pressney I (2021) Comparison of lumbar degenerative disc disease using conventional fast spin echo T2W MRI and T2 fast spin echo dixon sequences. Br J Radiol 94:20201438CrossRef
11.
Zurück zum Zitat Yang S, Lassalle L, Mekki A et al (2021) Can T2-weighted Dixon fat-only images replace T1-weighted images in degenerative disc disease with Modic changes on lumbar spine MRI. Eur Radiol 31(12):9380–9389CrossRef Yang S, Lassalle L, Mekki A et al (2021) Can T2-weighted Dixon fat-only images replace T1-weighted images in degenerative disc disease with Modic changes on lumbar spine MRI. Eur Radiol 31(12):9380–9389CrossRef
12.
Zurück zum Zitat Obuchowski NA, Subhas N, Schoenhagen P (2014) Testing for interchangeability of imaging tests. Acad Radiol 21:1483–1489CrossRef Obuchowski NA, Subhas N, Schoenhagen P (2014) Testing for interchangeability of imaging tests. Acad Radiol 21:1483–1489CrossRef
13.
Zurück zum Zitat Subhas N, Benedick A, Obuchowski NA et al (2017) Comparison of a fast 5-minute shoulder MRI protocol with a standard shoulder MRI protocol: a multiinstitutional multireader study. AJR Am J Roentgenol 208:W146–W154CrossRef Subhas N, Benedick A, Obuchowski NA et al (2017) Comparison of a fast 5-minute shoulder MRI protocol with a standard shoulder MRI protocol: a multiinstitutional multireader study. AJR Am J Roentgenol 208:W146–W154CrossRef
14.
Zurück zum Zitat Maeder Y, Dunet V, Richard R, Becce F, Omoumi P (2018) Bone marrow metastases: T2-weighted Dixon Spin-echo fat images can replace T1-weighted spin-echo images. Radiology 286:948–959CrossRef Maeder Y, Dunet V, Richard R, Becce F, Omoumi P (2018) Bone marrow metastases: T2-weighted Dixon Spin-echo fat images can replace T1-weighted spin-echo images. Radiology 286:948–959CrossRef
15.
Zurück zum Zitat Barnhart HX, Kosinski AS, Haber MJ (2007) Assessing individual agreement. J Biopharm Stat 17:697–719CrossRef Barnhart HX, Kosinski AS, Haber MJ (2007) Assessing individual agreement. J Biopharm Stat 17:697–719CrossRef
16.
Zurück zum Zitat Obuchowski NA, Subhas N, Polster J (2017) Statistics for radiology research. Semin Musculoskelet Radiol 21:23–31CrossRef Obuchowski NA, Subhas N, Polster J (2017) Statistics for radiology research. Semin Musculoskelet Radiol 21:23–31CrossRef
Metadaten
Titel
How to show that a new imaging method can replace a standard method, when no reference standard is available?
verfasst von
Patrick Omoumi
Nancy A. Obuchowski
Publikationsdatum
18.11.2021
Verlag
Springer Berlin Heidelberg
Erschienen in
European Radiology / Ausgabe 4/2022
Print ISSN: 0938-7994
Elektronische ISSN: 1432-1084
DOI
https://doi.org/10.1007/s00330-021-08325-7

Weitere Artikel der Ausgabe 4/2022

European Radiology 4/2022 Zur Ausgabe

Akuter Schwindel: Wann lohnt sich eine MRT?

28.04.2024 Schwindel Nachrichten

Akuter Schwindel stellt oft eine diagnostische Herausforderung dar. Wie nützlich dabei eine MRT ist, hat eine Studie aus Finnland untersucht. Immerhin einer von sechs Patienten wurde mit akutem ischämischem Schlaganfall diagnostiziert.

Screening-Mammografie offenbart erhöhtes Herz-Kreislauf-Risiko

26.04.2024 Mammografie Nachrichten

Routinemäßige Mammografien helfen, Brustkrebs frühzeitig zu erkennen. Anhand der Röntgenuntersuchung lassen sich aber auch kardiovaskuläre Risikopatientinnen identifizieren. Als zuverlässiger Anhaltspunkt gilt die Verkalkung der Brustarterien.

S3-Leitlinie zu Pankreaskrebs aktualisiert

23.04.2024 Pankreaskarzinom Nachrichten

Die Empfehlungen zur Therapie des Pankreaskarzinoms wurden um zwei Off-Label-Anwendungen erweitert. Und auch im Bereich der Früherkennung gibt es Aktualisierungen.

Fünf Dinge, die im Kindernotfall besser zu unterlassen sind

18.04.2024 Pädiatrische Notfallmedizin Nachrichten

Im Choosing-Wisely-Programm, das für die deutsche Initiative „Klug entscheiden“ Pate gestanden hat, sind erstmals Empfehlungen zum Umgang mit Notfällen von Kindern erschienen. Fünf Dinge gilt es demnach zu vermeiden.

Update Radiologie

Bestellen Sie unseren Fach-Newsletter und bleiben Sie gut informiert.