MSFCN-multiple supervised fully convolutional networks for the osteosarcoma segmentation of CT images

doi:10.1016/j.cmpb.2017.02.013

Computer Methods and Programs in Biomedicine

Volume 143, May 2017, Pages 67-74

https://doi.org/10.1016/j.cmpb.2017.02.013

Highlights

• The proposed model is a deep end-to-end network for medical image segmentation.
• Multiple supervised side output layers were introduced into the network to guide multi-scale feature learning.
• A large number of feature channels were used in the up-sampling portion to capture more context information.
• The segmentation method achieved an average DSC of 87.80%, an average sensitivity of 86.88%, an average HM of 19.81%, and an F1-measure of 0.9080. These results are better than those of some existing studies.

Abstract

Background and objective

Automatic osteosarcoma tumor segmentation on computed tomography (CT) images is a challenging problem, as tumors show large spatial and structural variability. In this study, an automatic tumor segmentation method based on a fully convolutional network with multiple supervised side output layers (MSFCN) is presented.

Methods

Image normalization was applied as a pre-processing step to reduce the differences among images. Within the fully convolutional network framework, supervised side output layers were added to three layers to guide multi-scale feature learning in the contracting structure, which was then able to capture both local and global image features. Multiple feature channels were used in the up-sampling portion to capture more context information, which helps to segment tumors accurately where the contrast with the surrounding soft tissue is low. The results of all the side outputs were fused to determine the final boundaries of the tumors.

Results

A quantitative comparison with the manual segmentation results for 405 osteosarcoma CT images showed that the average Dice similarity coefficient (DSC), average sensitivity, average Hammoude distance (HM) and F1-measure were 87.80%, 86.88%, 19.81% and 0.908, respectively. Compared with other learning-based algorithms (for example, the fully convolutional network (FCN), U-Net, and holistically-nested edge detection (HED) methods), the MSFCN performed best in terms of DSC, sensitivity, HM and F1-measure.
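The overlap measures reported above can be illustrated with a minimal sketch. The definitions used here are the common ones and are assumptions, since this excerpt does not give the formulas: DSC = 2|A∩B| / (|A| + |B|), sensitivity = TP / (TP + FN), and Hammoude distance HM = (|A∪B| − |A∩B|) / |A∪B|, where A is the predicted mask and B the manual mask. The function name `segmentation_metrics` is hypothetical, and the F1-measure is omitted because the excerpt does not state how it was computed.

```python
import numpy as np


def segmentation_metrics(pred, truth):
    """Compare a predicted binary mask with a manual (ground-truth) mask.

    Returns DSC, sensitivity and Hammoude distance as fractions in [0, 1].
    The formulas are the commonly used definitions (an assumption here).
    """
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)

    tp = np.count_nonzero(pred & truth)    # tumor pixels correctly found
    fp = np.count_nonzero(pred & ~truth)   # background labelled as tumor
    fn = np.count_nonzero(~pred & truth)   # tumor pixels that were missed
    union = tp + fp + fn                   # |A ∪ B|

    if tp + fn == 0:
        raise ValueError("the manual mask contains no tumor pixels")

    return {
        "dsc": 2 * tp / (2 * tp + fp + fn),
        "sensitivity": tp / (tp + fn),
        "hm": (fp + fn) / union,           # (|A ∪ B| - |A ∩ B|) / |A ∪ B|
    }


# Toy example: a 4 x 4 tumor and a prediction shifted by one column.
truth = np.zeros((8, 8), dtype=int)
truth[2:6, 2:6] = 1
pred = np.zeros((8, 8), dtype=int)
pred[2:6, 3:7] = 1
print(segmentation_metrics(pred, truth))
```

In this toy case 12 of the 16 tumor pixels overlap, so DSC and sensitivity are both 0.75 and HM is 0.4. Higher DSC and sensitivity and lower HM indicate better agreement with the manual segmentation.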

Conclusion

The results indicated that the proposed algorithm contributed to the fast and accurate delineation of tumor boundaries, which could potentially assist doctors in making more precise treatment plans.

Introduction

Osteosarcoma is one of the most prevalent types of bone tumor, and occurs most often in children and adolescents [1]. The present standard practice for the management of osteosarcoma is a combination of neoadjuvant chemotherapy and surgical intervention on the primary tumor [2].

Accurate tumor segmentation from osteosarcoma CT images is crucial not only for treatment planning before neoadjuvant chemotherapy, but also for the subsequent evaluation of therapeutic efficacy. Manual delineation of tumor tissue on each slice by an experienced radiologist is time-consuming and laborious, and the results are subjective and non-reproducible. For these reasons, an accurate automatic or semi-automatic tumor segmentation method is required. Tumor tissue segmentation from osteosarcoma CT images presents many challenges, which can be grouped into three aspects: (1) The specificity of osteosarcoma. Osteosarcoma arises from bones as well as the soft tissues of the extremities [3], which makes it difficult to identify tumor boundaries. Furthermore, tumors in different patients may vary greatly in size and position, and may have a variety of shapes and appearances; (2) The heterogeneity of the tumors. The gray-scale and texture features are not uniform inside a tumor, and the distribution of necrotic tumor tissue varies. In addition, the gray-level differences between tumor tissue and the surrounding normal tissues on osteosarcoma CT images are usually very small; and (3) The diversity of CT imaging equipment and protocols. Osteosarcoma CT images are acquired on different imaging equipment, which may use different imaging protocols. These differing parameters can introduce non-negligible differences among the images. Together, these factors make osteosarcoma CT image segmentation a challenging task.

During the past decade, a number of methods have been proposed to segment tumor regions from osteosarcoma images. Generally speaking, these can be divided into two categories: (1) Cluster-based methods [4], [5], [6]. In these methods, object and background seed points are chosen interactively and the properties of each class are described. These methods have high computational efficiency. However, they are sensitive to initialization and noise. In addition, because they lack an object prior, they can only process images with simple structures and orderly textures. (2) Traditional learning-based methods [7], [8], [9]. These methods treat segmentation as a per-pixel classification task and learn a pixel classification model based on handcrafted features. However, they have some limitations. To improve the accuracy of the classifier, a large number of features must be calculated, which makes the computation slow and costly in terms of memory. To make the algorithms more efficient, techniques such as dimensionality reduction [10] or feature selection [11] have been employed to reduce the number of features. However, reducing the number of features often comes at the cost of reduced accuracy [12]. Therefore, limited by handcrafted features, these methods do not work well on large sets of osteosarcoma CT images, as they fail to extract osteosarcoma tumor regions characterized by complex structures and disorderly textures.

Recently, new learning-based segmentation methods based on convolutional neural networks (CNN) have been introduced [13], [14], [15], [16], [17], [18]. These CNN-based methods learn a hierarchy of increasingly complex features directly from patches (a local region around a pixel), and predict the class label of each pixel according to the learned features [19]. Because a CNN operates over patches using kernels, there is no need to extract handcrafted features, and the segmentation accuracy can be significantly improved. However, these patch-based CNNs consume too much time and memory, and a large amount of redundancy exists due to overlapping patches [20]. Furthermore, the receptive field size is limited by the patch size, so only local features can be extracted. More elegant networks, referred to as fully convolutional networks (FCN) [21], have been proposed to overcome these limitations. An FCN uses a CNN model pre-trained on ImageNet to make accurate image segmentations. Some FCN-based segmentation methods have achieved good results [20], [22]. However, these methods were found to be unable to identify some smaller object regions.
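The redundancy of patch-based classification can be made concrete with a rough count. The sketch below assumes a stride of one pixel and a padded image, so that every pixel gets its own patch. The image and patch sizes are hypothetical and were chosen only for illustration. It counts how many input pixels are read, not the actual network cost.

```python
def patchwise_pixel_reads(height, width, patch):
    """Pixels read when every pixel is classified from its own patch.

    Assumes stride 1 and a padded image, so there is one patch per pixel.
    """
    return height * width * patch * patch


def single_pass_pixel_reads(height, width):
    """Pixels read when the whole image is fed to the network once."""
    return height * width


# Hypothetical sizes, chosen only for illustration.
h, w, p = 512, 512, 32
ratio = patchwise_pixel_reads(h, w, p) / single_pass_pixel_reads(h, w)
print(f"patches: {h * w}, redundancy factor: {ratio:.0f}x")
```

Under these assumptions each input pixel is read once per patch that covers it, so the redundancy grows with the square of the patch size. A fully convolutional network avoids this by processing the whole image in one pass.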

Therefore, the development of a fast, fully automatic and more accurate segmentation method for osteosarcoma CT images was the prime motivation behind this study. In this study, a multiple supervised fully convolutional network (MSFCN) for the segmentation of tumor areas on osteosarcoma CT images is presented. In the MSFCN, supervised side output layers are added to the middle hidden layers, which enables the network to learn rich hierarchical features directly from the images and to accurately identify the tumor regions in osteosarcoma CT images.
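The idea of supervising several side outputs and fusing them can be sketched as follows. This is a minimal illustration and not the authors' implementation. The nearest-neighbour up-sampling, the equal fusion weights, the binary cross-entropy loss and all function names are assumptions. In a trained network the side outputs would come from hidden layers, and the fusion weights might be learned.

```python
import numpy as np


def sigmoid(x):
    """Map raw scores (logits) to probabilities in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def upsample_nearest(score_map, factor):
    """Enlarge a 2-D map by an integer factor using nearest-neighbour repetition."""
    return np.repeat(np.repeat(score_map, factor, axis=0), factor, axis=1)


def fuse_side_outputs(side_logits, factors, weights=None):
    """Bring every side output to full resolution and average the probabilities.

    side_logits: list of 2-D arrays, one per supervised layer (hypothetical).
    factors:     integer up-sampling factor for each side output.
    weights:     fusion weights; equal weights are assumed when omitted.
    """
    if weights is None:
        weights = [1.0 / len(side_logits)] * len(side_logits)
    probs = [sigmoid(upsample_nearest(s, f)) for s, f in zip(side_logits, factors)]
    fused = sum(w * p for w, p in zip(weights, probs))
    return fused, probs


def binary_cross_entropy(prob, target, eps=1e-7):
    """Mean per-pixel binary cross-entropy between probabilities and a 0/1 mask."""
    prob = np.clip(prob, eps, 1.0 - eps)
    return float(-np.mean(target * np.log(prob) + (1 - target) * np.log(1 - prob)))


def multi_supervised_loss(fused, probs, target):
    """Loss on each side output plus the loss on the fused map."""
    side_loss = sum(binary_cross_entropy(p, target) for p in probs)
    return side_loss + binary_cross_entropy(fused, target)


# Toy example: three side outputs at full, 1/2 and 1/4 resolution.
rng = np.random.default_rng(0)
target = np.zeros((8, 8))
target[2:6, 2:6] = 1
sides = [rng.normal(size=(8, 8)), rng.normal(size=(4, 4)), rng.normal(size=(2, 2))]
fused, probs = fuse_side_outputs(sides, factors=[1, 2, 4])
print(fused.shape, multi_supervised_loss(fused, probs, target))
```

Because each side output has its own loss term, the intermediate layers get a direct training signal at their own scale instead of relying only on gradients from the final prediction.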

Section snippets

Data acquisitions

The dataset used in this study consisted of 2305 osteosarcoma CT images from 23 patients aged between 8 and 30 years. This dataset was split into 1900 training images and 405 testing images. The testing images were divided into two groups, based on their lesion locations. There were 109 bone lesion images, in which the tumor was located in bone, and 296 mixed lesion images, in which the tumors were located in both bone and soft tissues. All of the osteosarcoma CT images were obtained

Model initialization parameters

In this study, the first nine groups of parameters of the pre-trained VGG-16 model were adopted to initialize the filters of the convolutional part of the MSFCN. The remaining hyperparameters of the MSFCN are shown in Table 1:

Data augmentation

When medical data are too scarce to train a deep network, artificial data augmentation is a common way to generate sufficient training data. It can also teach the network the desired invariance and robustness properties when the data set is relatively small. In this

Discussion and conclusion

In this study, a novel multiple supervised fully convolutional network (MSFCN) method for the segmentation of osteosarcoma in CT images was presented. The MSFCN displayed two advantages in identifying the boundaries of the osteosarcoma: (1) A large number of feature channels (128) were used in the up-sampling, which preserved more context information; and (2) Multiple supervision layers were introduced to guide the multi-scale feature learning, which was found to be helpful in capturing the

Acknowledgment

This work is supported by the National Natural Science Foundation of China [81571772].

References (35)

J. Ritter et al. Osteosarcoma. Ann. Oncol. (2010)
A.F. Frangi et al. Bone tumor segmentation from MR perfusion images with neural networks using multi-scale pharmacokinetic features. Image Vision Comput. (2001)
J.O. Glass et al. Hybrid artificial neural network segmentation and classification of dynamic contrast-enhanced MR imaging (DEMRI) of osteosarcoma. Magn. Reson. Imaging (1998)
R. Tripathy et al. Gaussian processes with built-in dimensionality reduction: Applications to high-dimensional uncertainty propagation. J. Comput. Phys. (2016)
X. Zhang et al. Feature selection in mixed data: A method using a novel fuzzy rough set-based information entropy. Pattern Recognit. (Aug 2016)
W. Zhang et al. Deep convolutional neural networks for multi-modality isointense infant brain image segmentation. NeuroImage (2015)
P.A. Meyers et al. Osteosarcoma: the addition of muramyl tripeptide to chemotherapy improves overall survival—a report from the Children's Oncology Group. J. Clin. Oncol. (2008)
C.D. Fletcher et al. Pathology and Genetics of Tumours of Soft Tissue and Bone. IARC (2002)
R. Mandava et al. Osteosarcoma segmentation in MRI using dynamic harmony search based clustering
C. Chen et al. Osteosarcoma segmentation in CT images based on hybrid relative fuzzy connectedness
J. Ma et al. Segmentation of multimodality osteosarcoma MRI with vectorial fuzzy-connectedness theory
C.-x. Chen et al. Osteosarcoma Segmentation in MRI Based on Zernike Moment and SVM. Chin. J. Biomed. Eng. (2013)
M. Havaei et al. Brain tumor segmentation with deep neural networks. Med. Image Anal. (2016)
R. Girshick et al. Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. (2016)
O.Z. Kraus et al. Classifying and segmenting microscopy images with deep multiple instance learning. Bioinformatics (2016)
P. Moeskops et al. Automatic segmentation of MR brain images with a convolutional neural network. IEEE Trans. Med. Imaging (2016)
G. Ertas et al. Computerized detection of breast lesions in multi-centre and multi-instrument DCE-MR data using 3D principal component maps and template matching. Phys. Med. Biol. (2011)

