QuickNAT: A fully convolutional network for quick and accurate segmentation of neuroanatomy

doi:10.1016/j.neuroimage.2018.11.042

NeuroImage

Volume 186, 1 February 2019, Pages 713-727

https://doi.org/10.1016/j.neuroimage.2018.11.042

Highlights

• Introduces QuickNAT, a deep-learning-based whole-brain segmentation tool that processes a 3D T1-weighted MRI brain scan in 20 s.
• The segmentation accuracy of QuickNAT was evaluated on 5 benchmark datasets covering a wide age range, subjects with different diagnoses (AD, MCI and CN), and different scanner field strengths (1.5T and 3.0T).
• QuickNAT can be effectively used for longitudinal studies as it performs well in test-retest and multi-center experiments.

Abstract

Whole brain segmentation from structural magnetic resonance imaging (MRI) is a prerequisite for most morphological analyses, but is computationally intensive and can therefore delay the availability of image markers after scan acquisition. We introduce QuickNAT, a fully convolutional, densely connected neural network that segments an MRI brain scan in 20 s. To enable training of the complex network with millions of learnable parameters using limited annotated data, we propose to first pre-train on auxiliary labels created from existing segmentation software. Subsequently, the pre-trained model is fine-tuned on manual labels to rectify errors in auxiliary labels. With this learning strategy, we are able to use large neuroimaging repositories without manual annotations for training. In an extensive set of evaluations on eight datasets that cover a wide age range, pathology, and different scanners, we demonstrate that QuickNAT achieves superior segmentation accuracy and reliability in comparison to state-of-the-art methods, while being orders of magnitude faster. The speed-up facilitates processing of large data repositories and supports translation of imaging biomarkers by making them available within seconds for fast clinical decision making.

Introduction

Magnetic Resonance Imaging (MRI) provides detailed in-vivo insights about the morphology of the human brain, which is essential for studying development, aging, and disease (Giedd et al., 1999; Draganski et al., 2004; Shaw et al., 2006; Raznahan et al., 2012; Alexander-Bloch and Giedd, 2013; Wachinger et al., 2016; Lerch et al., 2017). In order to access measurements like volume, thickness, or shape of a structure, the neuroanatomy needs to be segmented, which is a time-consuming process when performed manually (Fischl et al., 2002). Computational tools have been developed that can fully automatically segment brain MRI scans by warping a manually segmented atlas to the target scan (Fischl et al., 2002; Ashburner and Friston, 2005; Rohlfing et al., 2005; Svarer et al., 2005). Such approaches have two potential shortcomings: (i) the estimation of the 3D deformation field for warping is computationally intense, and (ii) lack of homologies may result in erroneous segmentations of the cortex (Lerch et al., 2017). Due to these drawbacks, existing atlas-based methods require hours of processing time for each scan and may result in sub-optimal solutions.

We propose a method for the Quick segmentation of NeuroAnaTomy (QuickNAT) in MRI T1 scans based on a deep fully convolutional neural network (F-CNN) that runs in seconds on GPUs, compared to hours for existing atlas-based methods. We believe that this speed-up by several orders of magnitude can have a wide impact on neuroimaging: large datasets can be processed on a single GPU workstation instead of a computing cluster, and quantitative morphological measurements can be derived from a scan within seconds, boosting their clinical translation. Furthermore, the fast processing speed allows for sampling multiple segmentations in a reasonable amount of time to estimate segmentation uncertainty for automated quality control (Roy et al., 2018). Besides its speed, QuickNAT produces state-of-the-art segmentation accuracy as demonstrated on multiple datasets covering a wide age range, different field strengths, and pathologies. Moreover, it yields effect sizes that are closer to those of manual segmentations and therefore offers advantages for group analyses. Finally, QuickNAT exhibits high test-retest accuracy, making it useful for longitudinal studies.

Deep learning models have been highly successful in recent years, but require vast amounts of annotated data for effective training (LeCun et al., 2015). The task of semantic image segmentation is dominated by F-CNN models in computer vision (Long et al., 2015). The limited availability of training data with manual annotations presents the main challenge in extending F-CNN models to brain segmentation. To address this challenge, we introduce a new training strategy (Fig. 2) that exploits large brain repositories without manual labels and small repositories with manual labels. First, we apply existing software tools (e.g., FreeSurfer (Fischl et al., 2002)) to segment scans without annotations. We refer to these automatically generated segmentations as auxiliary labels, which we use to pre-train the network. Auxiliary labels may not be as accurate as expert annotations; however, they allow us to efficiently leverage the vast amount of initially unlabeled data for supervised training of the network. Pre-training on them also exposes the network to the wide range of morphological variations of brain structures that exist across a large population. In the second step, we fine-tune (i.e., continue training) the pre-trained network on a smaller set of manually annotated data. Pre-training provides a good initialization of the network, such that scarce manual annotations are optimally utilized to achieve high segmentation accuracy. As a side note, we observed that a network trained only on FreeSurfer segmentations can produce more accurate results than FreeSurfer itself.
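The two-step schedule described above can be illustrated with a deliberately small stand-in. The sketch below is not the QuickNAT network: it uses a two-parameter logistic classifier in NumPy, and every name in it (`train`, `x_aux`, `y_aux`, `x_man`, `y_man`), the 10% label-noise rate, the learning rates, and the epoch counts are assumptions chosen only for illustration. It shows the pattern of pre-training on many imperfect auxiliary labels and then continuing training from those weights on a few trusted labels.

```python
import numpy as np


def train(weights, features, labels, lr, epochs):
    """Gradient descent on the logistic loss; returns a new weight vector."""
    w = weights.copy()
    for _ in range(epochs):
        probs = 1.0 / (1.0 + np.exp(-(features @ w)))
        grad = features.T @ (probs - labels) / len(labels)
        w -= lr * grad
    return w


rng = np.random.default_rng(0)
true_w = np.array([2.0, -1.0])  # hypothetical ground-truth decision rule

# Large pool labelled by an imperfect automatic tool ("auxiliary labels").
x_aux = rng.normal(size=(2000, 2))
y_exact = (x_aux @ true_w > 0).astype(float)
flipped = rng.random(2000) < 0.1  # assumption: 10% of auxiliary labels are wrong
y_aux = np.where(flipped, 1.0 - y_exact, y_exact)

# Small pool with trusted ("manual") labels.
x_man = rng.normal(size=(30, 2))
y_man = (x_man @ true_w > 0).astype(float)

# Step 1: pre-train from scratch on the auxiliary labels.
w_pre = train(np.zeros(2), x_aux, y_aux, lr=0.5, epochs=200)
# Step 2: fine-tune, i.e. continue training from w_pre on the manual labels.
w_fine = train(w_pre, x_man, y_man, lr=0.1, epochs=50)
```

The only structural point carried over from the text is that step 2 starts from the pre-trained weights rather than from a fresh initialization; the smaller learning rate in step 2 is a common fine-tuning choice, not something stated in the source.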

QuickNAT consists of three 2D F-CNNs operating on coronal, axial and sagittal views followed by a view aggregation step to infer the final segmentation (Fig. 3). Each F-CNN has the same architecture and is inspired by the traditional encoder/decoder based U-Net architecture with skip connections (Ronneberger et al., 2015), enhanced with unpooling layers (Noh et al., 2015) (Fig. 1). We also introduce dense connections (Huang et al., 2016) within each encoder/decoder block to aid gradient flow and to promote feature re-usability, which is essential given the limited amount of training data. The network is optimized using a joint loss function of multi-class Dice loss and weighted logistic loss, where weights compensate for high class imbalance in the data and encourage proper estimation of anatomical boundaries.
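To make the joint loss more concrete, here is a minimal NumPy sketch for a single 2D slice. It is written under stated assumptions rather than taken from the authors' code: the soft Dice formulation, the median-frequency class weights, the neighbour-difference boundary detector, and the `boundary_bonus` value are all illustrative choices, and the function names are hypothetical.

```python
import numpy as np


def pixel_weights(labels, num_classes, boundary_bonus=2.0):
    """Per-pixel weights: rarer classes and boundary pixels count more."""
    freq = np.bincount(labels.ravel(), minlength=num_classes) / labels.size
    # Assumption: median-frequency balancing as the class-imbalance term.
    class_w = np.median(freq[freq > 0]) / np.maximum(freq, 1e-12)
    weights = class_w[labels]
    # Mark pixels whose right or lower neighbour carries a different label.
    edge = np.zeros(labels.shape, dtype=bool)
    edge[:, :-1] |= labels[:, :-1] != labels[:, 1:]
    edge[:-1, :] |= labels[:-1, :] != labels[1:, :]
    return weights + boundary_bonus * edge


def joint_loss(probs, labels, num_classes, eps=1e-7):
    """Weighted logistic loss plus multi-class soft Dice loss.

    probs:  (num_classes, H, W) softmax output of the network
    labels: (H, W) integer label map
    """
    onehot = np.eye(num_classes)[labels].transpose(2, 0, 1)  # (C, H, W)
    true_prob = np.clip((probs * onehot).sum(axis=0), eps, 1.0)
    weighted_ce = -(pixel_weights(labels, num_classes) * np.log(true_prob)).mean()

    inter = (probs * onehot).sum(axis=(1, 2))
    total = probs.sum(axis=(1, 2)) + onehot.sum(axis=(1, 2))
    dice = 1.0 - (2.0 * inter / (total + eps)).mean()
    return weighted_ce + dice


# Toy 3x3 slice with three classes, compared against an uninformative prediction.
labels = np.array([[0, 0, 1],
                   [0, 2, 1],
                   [2, 2, 1]])
uniform = np.full((3, 3, 3), 1.0 / 3.0)
loss_uniform = joint_loss(uniform, labels, num_classes=3)
```

The Dice term works on class overlap and is therefore less sensitive to class size, while the weighted logistic term gives a per-pixel signal that can be increased near anatomical boundaries; combining them is one way to obtain the behaviour the paragraph describes.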

The two main methodological innovations of QuickNAT are the training strategy with auxiliary labels and the F-CNN architecture. To the best of our knowledge, this is the first work to conduct such a large number of experiments on highly heterogeneous datasets to evaluate the robustness of an F-CNN for brain segmentation. The code and trained model are available as extensions of MatConvNet (Vedaldi and Lenc, 2015) at https://github.com/abhi4ssj/QuickNATv2. This is an extension of our early work (Roy et al., 2017), where we introduced the concept of pre-training with auxiliary labels. In this work, we improve the architecture, segment more brain structures, and present extensive experiments across a wide range of settings to substantiate the effectiveness of the framework.

Section snippets

Methods

Given an input MRI brain scan $I$, we want to infer its segmentation map $S$, which indicates 27 cortical and subcortical structures. Given a set of scans $I = \{I_{1}, \dots, I_{n}\}$ and their corresponding segmentations $S = \{S_{1}, \dots, S_{n}\}$, we want to learn a function $f_{seg} : I \to S$. We express this function as an F-CNN model, termed QuickNAT, which is detailed below.
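Since QuickNAT combines three view-specific 2D networks, one way to picture how a scan is mapped to a label map $S$ is as a merge of per-view class probabilities. The sketch below assumes a weighted average followed by a per-voxel argmax; the source only states that a view aggregation step exists, so the averaging rule, the equal weights, the 28-class count (27 structures plus an assumed background class), and the function name `aggregate_views` are illustrative assumptions.

```python
import numpy as np


def aggregate_views(prob_maps, view_weights):
    """Merge per-view class probabilities into one label volume.

    prob_maps:    sequence of arrays shaped (num_classes, D, H, W), one per
                  view, already resampled to the same volume grid
    view_weights: one non-negative weight per view
    """
    weights = np.asarray(view_weights, dtype=float)
    weights = weights / weights.sum()
    stacked = np.stack(prob_maps)                        # (views, C, D, H, W)
    mean_prob = np.tensordot(weights, stacked, axes=1)   # (C, D, H, W)
    return mean_prob.argmax(axis=0)                      # (D, H, W) label map


# Toy volume: random probabilities standing in for three network outputs.
rng = np.random.default_rng(0)
num_classes, shape = 28, (2, 3, 4)
views = [
    np.moveaxis(rng.dirichlet(np.ones(num_classes), size=shape), -1, 0)
    for _ in range(3)
]
segmentation = aggregate_views(views, view_weights=(1.0, 1.0, 1.0))
```

Averaging probabilities rather than voting on hard labels keeps each view's confidence in play, so a view that is unsure about a voxel contributes less to the final decision there.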

Experimental datasets

We use nine brain MRI datasets in our experiments. We use five datasets with manual annotations to evaluate segmentation accuracy. Three datasets were used for testing reliability of the segmentation framework. Table 1 summarizes the number of subjects per dataset, the age range, the diagnosis, and the annotated structures. Present diagnoses are Alzheimer's disease (AD), mild cognitive impairment (MCI), and psychiatric disorders. Details about acquisition protocol used in each of the datasets

Experiments and results

We evaluate QuickNAT in a comprehensive series of eight experiments to assess accuracy, reproducibility, and sensitivity on a large variety of neuroimaging datasets, summarized in Table 2. In all experiments, we pre-train QuickNAT on 581 MRI volumes from the IXI dataset, using auxiliary segmentations obtained from FreeSurfer (Fischl et al., 2002). We conducted 5 experiments to evaluate the segmentation accuracy (experiments 1 to 5; Sec. 4.1 and Sec. 4.2), and another 3 experiments (experiments 6 to 8;

Comparison with deep learning approaches

Recently, convolutional neural networks have been proposed for brain segmentation (Chen et al., 2018; Dolz et al., 2018; Fedorov et al., 2017; Wachinger et al., 2018; Moeskops et al., 2016). DeepNAT (Wachinger et al., 2018) reported competitive results on the MALC data, but as shown in Table 3, QuickNAT yields significantly higher accuracy, while requiring only seconds (Fig. 6). Dolz et al. (2018) proposed a network for segmenting 8 structures based on skull-stripped and intensity normalized

Conclusion

We have introduced QuickNAT, a deep learning based method for brain segmentation that runs in seconds, achieving superior performance with respect to existing methods and being orders of magnitude faster in comparison to patch-based CNNs and atlas-based approaches. We have demonstrated that QuickNAT generalizes well to other, unseen datasets (training data different to testing) and yields high segmentation accuracy across diagnostic groups, scanner field strengths, and age, while producing

Acknowledgment

Support for this research was provided in part by the Bavarian State Ministry of Education, Science and the Arts in the framework of the Center Digitisation.Bavaria (ZD.B). We thank Neuromorphometrics Inc. for providing manual annotations, neuroimaging initiatives for sharing data, and NVIDIA corporation for GPU donation. We would also like to thank Dr. Sebastian Pölsterl for proofreading the manuscript and providing feedback. Data collection and sharing was funded by the Alzheimer's Disease

References (51)

J. Ashburner et al. Unified segmentation. Neuroimage (2005)
B.B. Avants et al. A reproducible evaluation of ANTs similarity metric performance in brain image registration. Neuroimage (2011)
M. Boccardi et al. Training labels for hippocampal segmentation based on the EADC-ADNI harmonized hippocampal protocol. Alzheimer's Dementia (2015)
H. Chen et al. VoxResNet: deep voxelwise residual networks for brain segmentation from 3d MR images. Neuroimage (2018)
A.M. Dale et al. Cortical surface-based analysis: I. Segmentation and surface reconstruction. Neuroimage (1999)
J. Dolz et al. 3d fully convolutional networks for subcortical segmentation in MRI: a large-scale study. Neuroimage (2018)
B. Fischl et al. Whole brain segmentation: automated labeling of neuroanatomical structures in the human brain. Neuron (2002)
B. Fischl et al. Cortical surface-based analysis: II: inflation, flattening, and a surface-based coordinate system. Neuroimage (1999)
M. Havaei et al. Brain tumor segmentation with deep neural networks. Med. Image Anal. (2017)
C.R. Jack et al. Tracking pathophysiological processes in Alzheimer's disease: an updated hypothetical model of dynamic biomarkers. Lancet Neurol. (2013)
M. Jenkinson et al. FSL. Neuroimage (2012)
K. Kamnitsas et al. Efficient multi-scale 3d cnn with fully connected crf for accurate brain lesion segmentation. Med. Image Anal. (2017)
M.J. Kempton et al. A comprehensive testing protocol for MRI neuroanatomical segmentation techniques: evaluation of a novel lateral ventricle segmentation method. Neuroimage (2011)
B. Patenaude et al. A Bayesian model of shape and appearance for subcortical brain segmentation. Neuroimage (2011)
C. Svarer et al. MR-based automatic delineation of volumes of interest in human brain PET images using probability maps. Neuroimage (2005)
B. Thyreau et al. Segmentation of the hippocampus by transferring algorithmic knowledge for large cohort processing. Med. Image Anal. (2018)
S. Valverde et al. Improving automated multiple sclerosis lesion segmentation with a cascaded 3d convolutional neural network approach. Neuroimage (2017)
C. Wachinger et al. DeepNAT: deep convolutional neural network for segmenting neuroanatomy. Neuroimage (2018)
A. Alexander-Bloch et al. Imaging structural co-variance between human brain regions. Nat. Rev. Neurosci. (2013)
A.J. Asman et al. Formulating spatially varying performance in the statistical fusion framework. IEEE Trans. Med. Imag. (2012)
V. Badrinarayanan et al. Segnet: a Deep Convolutional Encoder-decoder Architecture for Image Segmentation (2015)
T. Bartsch et al. The hippocampus in Aging and Disease: from Plasticity to Vulnerability (2015)
T. Brosch et al. Deep 3d convolutional encoder networks with shortcuts for multiscale feature integration applied to multiple sclerosis lesion segmentation. IEEE Trans. Med. Imag. (2016)
B. Draganski et al. Neuroplasticity: changes in grey matter induced by training. Nature (2004)
A. Fedorov et al. Almost Instant Brain Atlas Segmentation for Large-scale Studies (2017)

¹: Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.
