International Journal of Radiation Oncology*Biology*Physics
Physics ContributionEvaluation of Automatic Atlas-Based Lymph Node Segmentation for Head-and-Neck Cancer
Introduction
Intensity-modulated radiation therapy (IMRT) has allowed for multiple advances in the treatment of head-and-neck cancer (HNC), including improved parotid gland sparing 1, 2 and higher radiation doses for tumors located near critical structures. To fully exploit the advantages of IMRT, all target volumes and critical structures must be contoured before treatment planning. This time-consuming process may be repeated multiple times during a treatment course because of tumor response or changes in patient weight or anatomy. Automatic segmentation can reduce physician contouring time, with time reductions up to 30–40% seen in studies of HNC and breast contouring 3, 4.
Another potential advantage of automatic segmentation is reduction in intra- and interobserver variability in anatomical volume delineation. Variability of contouring among physicians has been noted in a number of studies 5, 6, 7, 8. The impact of such inconsistencies may be especially evident in HNC radiotherapy, where the range of interobserver variability is somewhat larger and may exceed the errors due to position uncertainty and organ motion (9). Interobserver variability may not affect an individual radiation oncologist's contours, but it does impact the field as a whole in regards to interpretation of clinical trials results and consistency across the specialty.
Automatic segmentation has been shown to reduce variability of contours among physicians and improve efficiency for multiple disease sites 3, 4. The gains in efficiency and consistency are valuable only if accuracy is not compromised. Assessment of accuracy is a complex issue, because there is no objective volume for comparison. A standard approach in the evaluation of automatic segmentation for radiotherapy planning has been to use individual expert physician segmentations for comparison. The shortcoming of this approach is that it does not address interobserver variability.
The Simultaneous Truth and Performance Level Estimation (STAPLE) algorithm is a widely accepted tool that adjusts for intra- and interobserver variability in image segmentation (10). It takes a collection of segmentations and calculates a probabilistic estimate of the true segmentation. Using this algorithm, we took a collection of physician manual contours and generated an estimate of the true segmentation, to use as the “reference standard” for contour comparisons. We compared manual, automatic, and automatic-modified (AM) contours to this standard.
This study focused on the target volume of lymph node regions for HNC patients, as manually contouring these volumes is a time-intensive task. The goal of this study was to use multiple assessment tools, including the STAPLE algorithm, to evaluate if automatic segmentation could decrease inter-physician variability while maintaining accuracy. Using these same methods, we analyzed how physicians modify automatic anatomical segmentations in terms of size, shape, and position.
Section snippets
Study overview
We selected 5 adult patients with non-bulky neck nodes who were treated with IMRT for HNC of either the oropharynx or nasopharynx. For each patient, a three-step process was performed: physicians manually contoured designated regions of interest (ROIs) on the planning CT scans; HNC atlas was automatically registered to the planning CT and delineated atlas-based ROI; and physicians reviewed and modified the atlas-based ROI.
Creation of automatic contours
A commercially available HNC atlas (Velocity Medical Systems, Atlanta,
Accuracy
The physicians' qualitative assessments of the automatic contours deemed 32% of the contours to be acceptable for treatment planning without modification. Four of the five physicians had consistent answers for all the patients, answering either yes or no for the automatic contours for all five of the patients. All of the physicians who answered no to the acceptability question indicated that the CTVs were too large.
Figure 2(a, b) and Table 1 show the comparisons between the STAPLE-manual and
Discussion
By creating “true” contours from multiple experienced physicians' manual contours, we have demonstrated that the use of atlas-based automatic lymph node segmentation can improve efficiency and decrease interobserver variability while maintaining accuracy. Variability is one of the most challenging issues in the IMRT era and recognition of this fact has motivated recent efforts to quantify variability and develop systematic approaches to improve consistency (15). The ability to accurately and
Conclusion
By creating a ground truth from multiple segmentations, the STAPLE algorithm provides a unique tool to assess variability in contouring. With the application of STAPLE, we have shown that atlas-based automatic LNS in HNC is accurate, efficient, and reduces interobserver variability. Further analysis of the variability in IMRT contouring may help improve consistency across the field and augment the education process for physicians learning IMRT.
References (18)
- et al.
Quality of life and survival outcome for patients with nasopharyngeal carcinoma receiving three-dimensional conformal radiotherapy vs. intensity-modulated radiotherapy—A longitudinal study
Int J Radiat Oncol Biol Phys
(2008) - et al.
Intensity-modulated radiation therapy reduces late salivary toxicity without compromising tumor control in patients with oropharyngeal carcinoma: A comparison with conventional techniques
Radiother Oncol
(2001) - et al.
Automatic segmentation of whole-breast using atlas approach and deformable image registration
Int J Radiat Oncol Bio Phys
(2009) - et al.
Reduce in variation and improve efficiency of target volume delineation by a computer-assisted system using a deformable image registration approach
Int J Radiat Oncol Biol Phys
(2007) - et al.
An evaluation of the tumor-shape definition by experienced observers from CT images of supraglottic carcinomas (ACRIN Protocol 6658)
Int J Radiat Oncol Bio Phys
(2007) - et al.
Variations in target delineation for head and neck IMRT: An international multi-institutional study
Int J Radiat Oncol Biol Phys
(2004) - et al.
Laryngeal tumor volume measurements determined with CT: A study on intra- and interobserver variation
Int J Radiat Oncol Biol Phys
(1998) - et al.
The potential impact of CT-MRI matching on tumor volume delineation in advanced head and neck cancer
Int J Radiat Oncol Biol Phys
(1997) - et al.
CT-based delineation of lymph node levels and related CTVs in the node-negative neck: DAHANCA, EORTC, GORTEC, NCIC, RTOG consensus guidelines
Radiother Oncol
(2003)
Cited by (115)
Reference standard for the evaluation of automatic segmentation algorithms: Quantification of inter observer variability of manual delineation of prostate contour on MRI
2024, Diagnostic and Interventional ImagingClinical acceptability of automatically generated lymph node levels and structures of deglutition and mastication for head and neck radiation therapy
2024, Physics and Imaging in Radiation OncologyEvaluation of different algorithms for automatic segmentation of head-and-neck lymph nodes on CT images
2023, Radiotherapy and OncologyA simple single-cycle interactive strategy to improve deep learning-based segmentation of organs-at-risk in head-and-neck cancer
2023, Physics and Imaging in Radiation Oncology
Dr. Fox is entitled to royalties derived from Velocity Medical Solution's sale of products. The terms of this agreement have been reviewed and approved by Emory University in accordance with its conflict of interest policies.