Next Article in Journal
CD5-Positive B Lymphocytes after Kidney Transplantation
Next Article in Special Issue
Diagnostic Value of Contrast-Enhanced Ultrasound for Evaluation of Transjugular Intrahepatic Portosystemic Shunt Perfusion
Previous Article in Journal
Deep Learning-Based Prediction of Paresthesia after Third Molar Extraction: A Preliminary Study
Previous Article in Special Issue
Computed Tomography-Navigation™ Electromagnetic System Compared to Conventional Computed Tomography Guidance for Percutaneous Lung Biopsy: A Single-Center Experience
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Stability and Reproducibility of Radiomic Features Based Various Segmentation Technique on MR Images of Hepatocellular Carcinoma (HCC)

by
Nurin Syazwina Mohd Haniff
1,
Muhammad Khalis Abdul Karim
1,*,
Nurul Huda Osman
1,
M Iqbal Saripan
2,
Iza Nurzawani Che Isa
3 and
Mohammad Johari Ibahim
4
1
Department of Physics, Faculty of Science, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia
2
Department of Computer Engineering, Faculty of Engineering, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia
3
Programme of Diagnostic Imaging and Radiotherapy, Universiti Kebangsaan Malaysia, Kuala Lumpur 50300, Wilayah Persekutuan, Malaysia
4
Department Biochemistry & Molecular Medicine, Faculty of Medicine, Universiti Teknologi MARA, Sungai Buloh 47200, Selangor, Malaysia
*
Author to whom correspondence should be addressed.
Diagnostics 2021, 11(9), 1573; https://doi.org/10.3390/diagnostics11091573
Submission received: 9 July 2021 / Revised: 21 August 2021 / Accepted: 23 August 2021 / Published: 30 August 2021
(This article belongs to the Special Issue Novel Approaches in Oncologic Imaging)

Abstract

:
Hepatocellular carcinoma (HCC) is considered as a complex liver disease and ranked as the eighth-highest mortality rate with a prevalence of 2.4% in Malaysia. Magnetic resonance imaging (MRI) has been acknowledged for its advantages, a gold technique for diagnosing HCC, and yet the false-negative diagnosis from the examinations is inevitable. In this study, 30 MR images from patients diagnosed with HCC is used to evaluate the robustness of semi-automatic segmentation using the flood fill algorithm for quantitative features extraction. The relevant features were extracted from the segmented MR images of HCC. Four types of features extraction were used for this study, which are tumour intensity, shape feature, textural feature and wavelet feature. A total of 662 radiomic features were extracted from manual and semi-automatic segmentation and compared using intra-class relation coefficient (ICC). Radiomic features extracted using semi-automatic segmentation utilized flood filling algorithm from 3D-slicer had significantly higher reproducibility (average ICC = 0.952 ± 0.009, p < 0.05) compared with features extracted from manual segmentation (average ICC = 0.897 ± 0.011, p > 0.05). Moreover, features extracted from semi-automatic segmentation were more robust compared to manual segmentation. This study shows that semi-automatic segmentation from 3D-Slicer is a better alternative to the manual segmentation, as they can produce more robust and reproducible radiomic features.

1. Introduction

Hepatocellular carcinoma (HCC) is of the common type of malignant tumour of the liver and its incidence has increased in recent years. In Malaysia, HCC is one of the top five cancers with 4.4% new cases in 2018, and it ranks fourth in leading causes of cancer death [1]. HCC known to have a poor survival rate despite availability varieties of clinical treatments [2]. Medical imaging techniques such as computed tomography (CT) and magnetic resonance imaging (MRI) are vital procedures in oncological treatment. Both CT scans and MRI have higher spatial resolution that has advantage in the early detection of tissues’ abnormality in patients [2]. Imaging is extensively used in detection, diagnosis, and staging of cancer. Several studies have been conducted to improve the tumour diagnosis through proposed methods [3,4,5,6]. Although MRI is acknowledged for its advantages, the false-negative diagnosis from the examinations is inevitable, although several quantitative techniques have been introduced which are mostly pertaining to MR study. It is crucial for optimizing quantitative imaging feature extraction through computational approaches and developing decision support systems to accurately estimate the cancer stage. Hence, radiomics study are introduced and used to extract high number of quantitative image features [7].
Feature extraction is essential to obtain relevant information on input images and represents data in lower dimensionality space [8]. These features are extracted by using advanced mathematical algorithm which describes phenotypes of tumour that might not be able to perceive by naked eye. Previous studies have shown the importance of quantitative feature extraction in classifying the cancer stage of various types of cancer, including HCC [9,10,11,12]. In 2016, investigation of extraction of radiomics data was performed to predict the pathological response after chemotherapy [13]. Incorporations of radiomics and machine learning are essential for information extraction to aid in better prognosis. From shape, statistics, and texture features, 20 features were selected based on stability and variance.
Recently, uses of radiomics in cancer related field shows significant progress. Radiomics application is said to be one of the fundamental methods for machine learning development in the medical imaging field [14,15]. Extraction of radiomic features from various sources of medical images also overcomes the limitation of visual image interpretation [16]. Several literature reviews show data mining and predictive analysis have widened the scope of medical imaging [17,18,19,20]. This can facilitate prognostic models used in oncology. However, the poor reliability of radiomic features could affect research outcomes and become an obstacle for further use in models [21].
It is important to ensure the accuracy of quantitative features extracted from medical images. The process of tumour segmentation is one of the main obstacles for radiomics. Conventional manual segmentation is considered quite tedious and time consuming where quick and reliable segmentation techniques are required. Previous studies show that semi-automatic segmentation methods are preferred, as they are better alternatives to manual segmentation [22,23,24]. Comparative analysis of different types of image segmentation had been done for mammographic images in 2019. This study concludes flood filling algorithm has the best segmentation result compared to the watershed algorithm, the mean-shift algorithm, and the k-means algorithm [25]. Recently, a study was done for non-small cell lung cancer, radiomic features extracted for semi-automatic segmentation using a grow-cut algorithm implemented in 3D-Slicer, has high reproducibility and is more robust compared to manual segmentation [26]. Studies on CT and MR images prove that semi-automatic segmentation is robust, which indicates its ability to produce reliable and reproducible radiomic models [27,28]. Hence, in this study, we aim to evaluate the reproducibility and robustness of various segmentation techniques of MR HCC images based on the extract radiomic features.

2. Materials and Methods

In this study, we analysed robustness of features extracted from 30 patients (median age: 63 y/o) diagnosed with HCC by two different methods of segmentations, manual and semi-automatic. We extracted 662 features using 3D Slicer (Boston, MA, USA) software for both segmentations [29]. Features extractions for manual and semi-automatic segmentations were compared and analyzed. Despite having small samples size (n = 30), numbers of sets for observations were examined to find the pattern on features extracted. Briefly, a total of 240 segmentations (120 manual segmentations and 120 semi-automatic segmentations) were analyzed in this study.

2.1. Magnetic Resonance Imaging (MRI) of HCC Patients

The image data was obtained from open archive for medical images of cancer, The Cancer Imaging Archive (TCIA) (Arkansas, USA) [30]. Ninety-seven patients verified with hepatocellular carcinoma, Stages I–IV, were retrieved. After reviewing the data images, patients who underwent CT scans were excluded, and only non-contrast enhanced T1-weighted (T1W) images (TR/TE = 210/1.1 ms) were chosen. A total of 30 patients diagnosed using MRI were included in this study. For the final study, 17 male (median age: 64 years; range 23–77 years) and 13 females (median age: 62 years; range 45–77 years). Stages of HCC are divided into two groups, Stages I–II and Stages III–IV, respectively.

2.2. Semi-Automatic Segmentation in 3D-Slicer

For the semi-automatic segmentation, the flood fill algorithm from 3D Slicer software was applied on the subject MR images. Prior to the installation, the Segment Editor Extra Effects extension was installed through the extension’s manager. After loading data images through the DICOM module, the observers identify the location of HCC. The nodes were added around the tumour region using a mouse cursor. Subsequently, the flood fill effects were activated, and ROI were segmented according to similar voxels intensity. To finalize the output, the segmented tumour was manually edited in the finalization phase as semi-automatic segmentation. Flood fill is an algorithm used to determine connection of an area in multi-dimensional array with the help of similarity of intensity voxels to the selected node determined by users. This algorithm is comparable with bucket tool in paint programs, which fill connected similar intensity voxels with different colours [25]. The algorithm was initiated with a start node by selecting the region of interest (ROI). Pixels connected four-directionally with the start node and pixels connected four-directionally to the former are considered in this effect. As the intensity voxels were decided, the algorithm identified the path of target node given and replaced it with different colours. Leakage prevention to other structures under this effect were manipulated using the neighbourhood size parameter.

2.3. Feature Extraction

All segmented image data were analysed and performed in the same software, 3D Slicer under the Radiomics module to extract imaging features. Features were extracted using the mathematical algorithm predicated on pixel intensities. From semi-automatic and manual segmentations, we defined 662 radiomic features for MR images that assist in quantifying tumour characteristics. The features were divided into four groups: (I) tumour intensity, (II) shape, (III) texture, and (IV) wavelet-features with examples of shape features shown in Table 1. The total features extracted from volume of interest for tumour intensity, shape and textural were 18, 14, and 54 features, respectively.
Tumour intensity represents a first order statistic, which distinguishes a histogram of voxel intensity within the tumour region on MRI. Shape features are calculated and described with volume properties of the tumour. Shape features extracted using the pyRadiomics package are independent from gray-level intensities and can only be calculated on a non-derived image, which is the original image [31]. Thus, the wavelet filter is not applied to shape features as it will not be calculated on derived images. Textural features were determined by patterns or spatial distributions of voxel intensities derived from grey level dependence matrix (GLDM), grey level co-occurrence matrix (GLCM) and grey level run-length matrices (GLRLM) [32]. Features derived from co-occurrence and run-length matrices were computed by averaging all 13 symmetric directions in three dimensions [6].
Wavelet features are obtained by transforming domain representations of tumour intensity and textural features. These features were applied as either a high (H) or low pass (L) filter in each of the three dimensions—X-axis, Y-axis, and Z-axis: wavelet-LHL, wavelet-LHH, wavelet-HLL, wavelet-LLH, wavelet-HLH, wavelet-HHH, wavelet-HHL, and wavelet-LLL [14]. Eight decomposed volumes of images were used on the intensity and textural features in the volume of interest, which resulted in a total of 576 (8 × 72) wavelet transforms features [7,23]. These filters help to identify the detail or sudden changes in intensity in the image [33]. Several fundamental formulas, such as entropy, contrast, uniformity, and correlation, are presented in the equation below:
E n t r o p y = i = 1 N g p ( i ) log 2 ( p ( i ) + ε )
Entropy measures the average amount of information required to encode the image values, where N g = number of non-zero bins and p i = normalized first order histogram.
C o n t r a s t = i = 1 N g j = 1 N g ( i j ) 2 p ( i , j )
Contrast determines local intensity variation present in the image. A larger value correlates with greater disparity in intensity values among neighboring voxels.
U n i f o r m i t y = i = 1 N g p ( i ) 2
Uniformity measures the sum of squares of each intensity value. Greater uniformity implies greater homogeneity. Correlation defines the linear dependency of grey level values to their respective voxels in GLCM. The value for correlation is between 0 (uncorrelated) and 1 (perfectly correlated). The equation of correlation is shown below:
C o r r e l a t i o n = i = 1 N g j = 1 N g p ( i , j ) i j μ x μ y σ x ( i ) σ y ( j )
where μ x , μ y and σ x , σ y are mean grey level intensity and standard deviation of px and py, respectively.

2.4. Statistical Analysis

The intra-class correlation coefficient (ICC) is referred to as correlations within a class of data. It was calculated to quantify the reproducibility of features obtained. Figure 1 shows the flowcharts for reproducibility analysis in this study. ICC is a well-known statistical tool among researchers, ranging between 0 to 1. There are three type of ICC models which can be chosen appropriately according to experimental situation. For this study, variance estimates were obtained to determine the ICC for inter-observer segmentations by using a two-way mixed effect model of analysis of variance (ANOVA) [25]. Equation below defines the ICC:
I C C ( A , 1 ) = M S R M S E M S R + ( k + 1 ) M S g + k n ( M S C M S E )
One-way analysis of variance (ANOVA) was used to obtained the ICC values for intra-observer segmentation [15,16]. Equation below defines ICC (C,1):
I C C ( C , 1 ) = M S R M S W M S R + ( k 1 ) M S W
where MSR = mean square for rows, MSW = mean square for residual sources of variance, MSE = mean square error, MSC = mean square for columns, k and n are number of observers involved and subjects.
We evaluate intra-observers’ reproducibility by letting one observer segmentized two months apart of 30 patients and this demonstrates variation of data measured from two different sets of segmentation. Furthermore, this also helps to assess multiple initializations of segmentation algorithm from the same observer. For inter-observers’ reproducibility, the delineation was accomplished by several observers using the same methods of segmentation and the degree of agreement between different observers were analyzed. The difference of reproducibility for each segmentation were assessed using the Wilcoxon rank-sum test with p-value set at 0.05. All data were expressed in mean ± SD. All analyses of data were accomplished using Statistical Package for Social Sciences (SPSS, also known as IBM SPSS statistics) version 25 (SPSS Chicago, IL, USA).

3. Results

In order to investigate the reliability of radiomic features extracted from 3D Slicer segmentation on MR images, a total of 662 features under four groups, (I) tumour intensity, (II) shape, (III) texture, and (IV) wavelet-features, were assessed. Of 30 subjects, the size of tumours for Stages III–IV (average size = 8.9 ± 3.4 cm) are larger than Stages I–II (average size = 3.7 ± 0.7 cm). We extracted radiomic from volume of interest determined by two independent observers twice using 3D Slicer software semi-automatic segmentation and compared them to manual segmentation by four independent observers. The semi-automatic segmentations were divided into two sets, each having two segmentations. Figure 2 shows comparisons of intra-class correlation coefficient between semi-automatic and manual segmentation in terms of categorized features. We observed that features extracted from semi-automatic segmentation had significantly higher reproducibility (average ICC = 0.952 ± 0.009, p < 0.05) compared to features extracted from manual segmentation (average ICC = 0.897 ± 0.011, p > 0.05).
There are 576 features of wavelet features, we separate the features into their respective feature types such as GLDM, GLCM, GLRLM and first order statistics for both semi-automatic and manual segmentations to ease the analysis process. There are 8 wavelet transforms in total. Figure 3 shows comparisons for ICC values of GLDM and first order features from both techniques. We observed that ICC values for intensity and texture features applied with wavelet transforms were high for semi-automatic segmentation compared to manual segmentation. Most ICC values for manual segmentations that undergoes wavelet transforms seen to have lower value which at lower level of the graphs. The ICC values of GLDM, GLCM, GLRLM, and intensity statistics for semi-automatic segmentation are 0.935 ± 0.008, 0.921 ± 0.013, 0.943 ± 0.011, and 0.948 ± 0.008, respectively. In contrary, ICC values for manual segmentations shows fluctuation in all four parameters (GLDM, GLCM, GLRLM, and intensity statistics) with average ICC equals to 0.796 ± 0.032, 0.848 ± 0.021, 0.941 ± 0.012, and 0.578 ± 0.037. Of the 662 features, 354 features (53%) showed higher ICC values for semi-automatic segmentation compared to manual one. Table 2, Table 3 and Table 4 show the reproducibility of ICC for both segmentations in regards with shape features and wavelet features.
Features that had been extracted are classified into three groups, high reproducibility (ICC ≥ 0.8), medium reproducibility (0.8 > ICC ≥ 0.5), and low reproducibility (ICC < 0.5), based on their ICC values. Out of 662 features, manual segmentation had 78.1% high, 6.5% medium, and 15.4% low reproducibility. However, semi-automatic segmentation, which is the flood filling effect, had 96.7% features in high reproducibility, 1.8% medium reproducibility, and 1.5% low reproducibility. Table 5 summarizes features extracted from both segmentations in reproducibility groups. Hence, reproducibility of the extracted features was higher for semi-automatic segmentations.
The robustness of each technique was evaluated by analysing the ICC of features extracted from inter- and intra-observers. Based on Figure 4, high ICC values for inter-observer from semi-automatic segmentation were observed (ICC = 0.976 ± 0.006 and ICC = 0.978 ± 0.003, respectively). Table 6 tabulates the inter-observer reproducibility of shape features for one subject.

4. Discussion

MR images were used in this study, as it produces high quality images compared to CT scans [34]. It is reported that resolution of input images influences the robustness of radiomic features as higher resolution images enhance visualization for segmentations [35]. In 2014, Chintan et al. assessed reproducibility of feature extracted using 3D-Slicer segmentation (GrowCut algorithm) which resulted in semi-automatic segmentation higher than manual delineation of CT images [7]. Therefore, additional features such as wavelet transform are implemented to study MR images using different types of algorithms. Despite increasing development in application of radiomic studies, robustness and reproducibility of radiomic features extracted from MR images are still one of the main challenges’ oncologist encounters.
In this study, we investigate the reproducibility and robustness of radiomic features of HCC between two types of segmentations, manual and semi-automatic, using 3D-Slicer software. A total of 662 features were extracted from volume of interest and can be classified into four main groups (18 tumour intensity, 14 shape features, 54 textural features, and 576 wavelet features). Based on the results, semi-automatic segmentation has higher ICC values compared to manual segmentation. Semi-automatic segmentation, which we used for flood filling algorithm, is also more robust and significantly higher compared to manual segmentation. Flood filling algorithm was performed when the observer chose several points around the volume of interest, and the tumour region was digitized according to the uniform colour which results in accurate segmentation [30]. This algorithm is proven to be the best segmentation method by comparing their accuracy in selection of objects [25].
All four quantitative imaging features extracted from semi-automatic segmentation had higher reproducibility score (p < 0.05) compared to manual segmentation. For most tumour intensity feature, semi-automatic segmentation had higher reproducibility compared to manual segmentation. There were a few negative ICC values for wavelet features in both segmentations shown in Figure 3, which indicates that particular features vary than any features randomly chosen from the whole population [36]. Despite that, the reproducibility of wavelet features from semi-automatic segmentation is still higher compared to manual segmentation. Furthermore, semi-automatic segmentation was consistent with ICC values (97% good reproducibility) compared to manual segmentation (78% good reproducibility). In 2020, Wang et al. implied that the features’ reliability increased using wavelet filters through their proposed deep learning tools compared to normal features [21].
We also analysed intra- and inter-observer reproducibility to evaluate the performance for both segmentations. Two sets from two different observers are shown in Figure 4 with one observer segments without formal clinical training. Despite using semi-automatic segmentation, experience of observers affects the segmentations of tumours as human interaction’s influence the process of segmentation. this indicates the difference in radiomic features. However, semi-automatic segmentation demonstrates high ICC values for both intra- and inter-observers compared to manual segmentation. This suggests that features extracted from semi-automatic segmentation are more robust and reproducible. There are two limitations in this study. First, this study does not involve image descriptors for prognostic models due to the insufficient images of HCC patients from the accessible source. Secondly, this study only comprises of single MRI sequence. The contrast enhanced MRI and infiltrative HCC are not included in this study.

5. Conclusions

This study assesses the reproducibility of radiomic features between two techniques. Semi-automatic segmentation using the flood fill algorithm produces more reproducible features, and this indicates it could be a better alternative to the current manual segmentation. Thus, this algorithm could be further applied for prognostic and classification models with sufficient data taken from several centers. Note that the study was focused on robustness and reproducibility of features extracted from semi-automatic and manual segmentation. Hence, for adapting machine learning, the study should be in larger scales with more data so the application will be much accurate.

Author Contributions

Conceptualization, M.K.A.K.; formal analysis, N.S.M.H.; funding acquisition, M.K.A.K. and M.J.I.; investigation, M.K.A.K. and N.H.O.; methodology, N.S.M.H. and M.I.S.; resources, I.N.C.I.; software, I.N.C.I.; validation, M.K.A.K.; writing—original draft, N.S.M.H.; writing—review and editing, M.K.A.K. and M.J.I. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Ministry of Higher Education (MOHE) Malaysia under the Fundamental Research Grant Scheme (FRGS) with grant number FRGS/1/2020/STG07/UPM/02/3.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

The authors would like to acknowledge Universiti Putra Malaysia and Ministry of Higher Education Malaysia (MOHE) for the financial support of this research.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Azizah, A.M.; Hashimah, B.; Nirmal, K.; Siti Zubaidah, A.R.; Puteri, N.A.; Nabihah, A.; Sukumaran, R.; Balqis, B.; Nadia, S.M.R.; Sharifah, S.S.S.; et al. Malaysian National Cancer Registry Report; Ministry of Health: Putrajaya, Malaysia, 2017; pp. 1–116.
  2. Kharrat, A.; Gasmi, K.; Ben Messaoud, M.; Benamrane, N.; Abid, M. A Hybrid Approach for Automatic Classification of Brain MRI Using Genetic Algorithm and Support Vector Machine An Hybrid Approach for Automatic Classification of Brain MRI Using Genetic Algorithm and Support Vector Machine. Leonardo J. Sci. 2010, 17, 71–82. [Google Scholar]
  3. Ma, J.; Wang, Q.; Ren, Y.; Hu, H.; Zhao, J. Automatic lung nodule classification with radiomics approach. In Medical Imaging 2016: PACS and Imaging Informatics: Next Generation and Innovations; International Society for Optics and Photonics: California, CA, USA, 2016; Volume 9789, p. 978906. [Google Scholar] [CrossRef]
  4. Vauthey, J.N.; Lauwers, G.Y.; Esnaola, N.F.; Do, K.A.; Belghiti, J.; Mirza, N.; Curley, S.A.; Ellis, L.M.; Regimbeau, J.M.; Rashid, A.; et al. Simplified staging for hepatocellular carcinoma. J. Clin. Oncol. 2002, 20, 1527–1536. [Google Scholar] [CrossRef] [PubMed]
  5. Scrivener, M.; De Jong, E.E.C.; van Timmeren, J.E.; Pieters, T.; Geets, X. Radiomics applied to lung cancer: A review. Transl. Cancer Res. 2016, 5, 398–409. [Google Scholar] [CrossRef]
  6. Xu, L.; Gao, Q.; Yousefi, N. Brain tumor diagnosis based on discrete wavelet transform, gray-level co-occurrence matrix, and optimal deep belief network. Simulation 2020, 96, 867–879. [Google Scholar] [CrossRef]
  7. Parmar, C.; Velazquez, E.R.; Leijenaar, R.; Jermoumi, M.; Carvalho, S.; Mak, R.H.; Mitra, S.; Shankar, B.U.; Kikinis, R.; Haibe-Kains, B.; et al. Robust radiomics feature quantification using semiautomatic volumetric segmentation. PLoS ONE 2014, 9, e0102107. [Google Scholar] [CrossRef] [PubMed]
  8. Kumar, G.; Bhatia, P.K. A detailed review of feature extraction in image processing systems. In Proceedings of the 2014 Fourth International Conference on Advanced Computing & Communication Technologies, Rohtak, India, 8–9 February 2014; pp. 5–12. [Google Scholar] [CrossRef]
  9. Chen, W.; Liu, B.; Peng, S.; Sun, J.; Qiao, X. Computer-Aided Grading of Gliomas Combining Automatic Segmentation and Radiomics. Int. J. Biomed. Imaging 2018, 2018, 2512037. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  10. Delzell, D.A.P.; Magnuson, S.; Peter, T.; Smith, M.; Smith, B.J. Machine Learning and Feature Selection Methods for Disease Classification with Application to Lung Cancer Screening Image Data. Front. Oncol. 2019, 9, 1–8. [Google Scholar] [CrossRef] [Green Version]
  11. Oyama, A.; Hiraoka, Y.; Obayashi, I.; Saikawa, Y.; Furui, S.; Shiraishi, K.; Kumagai, S.; Hayashi, T.; Kotoku, J. Hepatic tumor classification using texture and topology analysis of non-contrast-enhanced three-dimensional T1-weighted MR images with a radiomics approach. Sci. Rep. 2019, 9, 2–11. [Google Scholar] [CrossRef]
  12. Dutra, L.V.; Huber, R. Feature extraction and selection for ers-1/2 insar classification. Int. J. Remote Sens. 1999, 20, 993–1016. [Google Scholar] [CrossRef]
  13. Coroller, T.P.; Agrawal, V.; Huynh, E.; Narayan, V.; Lee, S.W.; Mak, R.H.; Aerts, H.J.W.L. Radiomic-Based Pathological Response Prediction from Primary Tumors and Lymph Nodes in NSCLC. J. Thorac. Oncol. 2017, 12, 467–476. [Google Scholar] [CrossRef] [Green Version]
  14. Çinarer, G.; Gürsel, B.; Haşim, A. Prediction of glioma grades using deep learning with wavelet radiomic features. Appl. Sci. 2020, 10, 6296. [Google Scholar] [CrossRef]
  15. Izdihar, K.; Karim, M.K.A.; Aresli, N.N.; Radzi, S.F.M.; Sabarudin, A.; Yunus, M.M.; Rahman, M.A.; Shamsul, S. Detection of Novel Coronavirus from Chest X-Ray Radiograph Images via Automated Machine Learning and CAD4COVID. In Proceedings of the 2021 International Congress of Advanced Technology and Engineering (ICOTEN), Taiz, Yemen, 4–5 July 2021; pp. 1–4. [Google Scholar]
  16. Haniff, N.S.M.; Karim, M.K.B.A.; Ali, N.S.; Rahman, M.A.A.; Osman, N.H.; Saripan, M.I. Magnetic Resonance Imaging Radiomics Analysis for Predicting Hepatocellular Carcinoma. In Proceedings of the 2021 International Congress of Advanced Technology and Engineering (ICOTEN), Taiz, Yemen, 4–5 July 2021; pp. 1–5. [Google Scholar]
  17. Cao, X.H.; Stojkovic, I.; Obradovic, Z. A robust data scaling algorithm to improve classification accuracies in biomedical data. BMC Bioinf. 2016, 17, 359. [Google Scholar] [CrossRef] [Green Version]
  18. El-Dahshan, E.A.S.; Mohsen, H.M.; Revett, K.; Salem, A.B.M. Computer-aided diagnosis of human brain tumor through MRI: A survey and a new algorithm. Expert Syst. Appl. 2014, 41, 5526–5545. [Google Scholar] [CrossRef]
  19. Zhou, T.; Ruan, S.; Canu, S. A review: Deep learning for medical image segmentation using multi-modality fusion. Array 2019, 3–4, 100004. [Google Scholar] [CrossRef]
  20. Simi, V.R.; Joseph, J. Segmentation of Glioblastoma Multiforme from MR Images—A comprehensive review. Egypt. J. Radiol. Nucl. Med. 2015, 46, 1105–1110. [Google Scholar] [CrossRef] [Green Version]
  21. Wang, L.; Tan, J.; Ge, Y.; Tao, X.; Cui, Z.; Fei, Z.; Lu, J.; Zhang, H.; Pan, Z. Assessment of liver metastases radiomic feature reproducibility with deep-learning-based semi-automatic segmentation software. Acta Radiol. 2021, 62, 291–301. [Google Scholar] [CrossRef] [PubMed]
  22. McGraw, K.O.; Wong, S.P. “Forming inferences about some intraclass correlations coefficients”: Correction. Psychol. Methods 1996, 1, 390. [Google Scholar] [CrossRef]
  23. Zhao, S.; Ren, W.; Zhuang, Y.; Wang, Z. The Influence of Different Segmentation Methods on the Extraction of Imaging Histological Features of Hepatocellular Carcinoma CT. J. Med. Syst. 2019, 43, 1–7. [Google Scholar] [CrossRef] [PubMed]
  24. Mat Radzi, S.F.; Abdul Karim, M.K.; Saripan, M.I.; Abd Rahman, M.A.; Osman, N.H.; Dalah, E.Z.; Mohd Noor, N. Impact of Image Contrast Enhancement on Stability of Radiomics Feature Quantification on a 2D Mammogram Radiograph. IEEE Access 2020, 8, 127720–127731. [Google Scholar] [CrossRef]
  25. Podgornova, Y.A.; Sadykov, S.S. Comparative analysis of segmentation algorithms for the allocation of microcalcifications on mammograms. CEUR Workshop Proc. 2019, 2391, 122–127. [Google Scholar] [CrossRef]
  26. Qiu, Q.; Duan, J.; Gong, G.; Lu, Y.; Li, D.; Lu, J.; Yin, Y. Reproducibility of radiomic features with GrowCut and GraphCut semiautomatic tumor segmentation in hepatocellular carcinoma. Transl. Cancer Res. 2017, 6, 940–948. [Google Scholar] [CrossRef]
  27. Haarburger, C.; Müller-Franzes, G.; Weninger, L.; Kuhl, C.; Truhn, D.; Merhof, D. Radiomics feature reproducibility under inter-rater variability in segmentations of CT images. Sci. Rep. 2020, 10, 1–10. [Google Scholar] [CrossRef] [PubMed]
  28. Lebre, M.A.; Vacavant, A.; Grand-Brochier, M.; Rositi, H.; Strand, R.; Rosier, H.; Abergel, A.; Chabrot, P.; Magnin, B. A robust multi-variability model based liver segmentation algorithm for CT-scan and MRI modalities. Comput. Med. Imaging Graph. 2019, 76, 101635. [Google Scholar] [CrossRef]
  29. Fedorov, A.; Beichel, R.; Kalpathy-Cramer, J.; Finet, J.; Fillion-Robin, J.C.; Pujol, S.; Bauer, C.; Jennings, D.; Fennessy, F.; Sonka, M.; et al. 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magn. Reson. Imaging. 2012, 30, 1323–1341. [Google Scholar] [CrossRef] [Green Version]
  30. Clark, K.; Vendt, B.; Smith, K.; Freymann, J.; Kirby, J.; Koppel, P.; Moore, S.; Phillips, S.; Maffitt, D.; Pringle, M.; et al. The cancer imaging archive (TCIA): Maintaining and operating a public information repository. J. Digit. Imaging 2013, 26, 1045–1057. [Google Scholar] [CrossRef] [Green Version]
  31. 3D Slicer. Slicer-Community 3D Slicer Documentation; 3D Slicer: Boston, MA, USA, 2019; pp. 2–48. [Google Scholar]
  32. Zhou, J.; Lu, J.; Gao, C.; Zeng, J.; Zhou, C.; Lai, X.; Cai, W.; Xu, M. Predicting the response to neoadjuvant chemotherapy for breast cancer: Wavelet transforming radiomics in MRI. BMC Cancer 2020, 20, 100. [Google Scholar] [CrossRef]
  33. Sudarshan, V.K.; Mookiah, M.R.K.; Acharya, U.R.; Chandran, V.; Molinari, F.; Fujita, H.; Ng, K.H. Application of wavelet techniques for cancer diagnosis using ultrasound images: A Review. Comput. Biol. Med. 2016, 69, 97–111. [Google Scholar] [CrossRef] [PubMed]
  34. Kwong, R.Y.; Yucel, E.K. Computed Tomography Scan and Magnetic Resonance Imaging. Circulation 2003, 108, 1–3. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Coenegrachts, K. Magnetic resonance imaging of the liver: New imaging strategies for evaluating focal liver lesions. World J. Radiol. 2009, 1, 72. [Google Scholar] [CrossRef]
  36. Taylor, P.J. An Introduction to Intraclass Correlation that Resolves Some Common Confusions. Programs Sci. Technol. Values Crit. Creat. Thinking Public Policy 2009, 7656, 1–9. [Google Scholar]
Figure 1. Schematic diagram of the analysis.
Figure 1. Schematic diagram of the analysis.
Diagnostics 11 01573 g001
Figure 2. Comparisons of intra-class correlation coefficient (ICC) values in (a) shape based features, (b) textural features, and (c) first order statistics features.
Figure 2. Comparisons of intra-class correlation coefficient (ICC) values in (a) shape based features, (b) textural features, and (c) first order statistics features.
Diagnostics 11 01573 g002aDiagnostics 11 01573 g002b
Figure 3. Comparisons of wavelet features for semi-automatic and manual segmentation in; (a) Gray-level dependence matrix (GLDM) features and, (b) first order statistics features.
Figure 3. Comparisons of wavelet features for semi-automatic and manual segmentation in; (a) Gray-level dependence matrix (GLDM) features and, (b) first order statistics features.
Diagnostics 11 01573 g003
Figure 4. Boxplot of the ICC value represents by inter-observers’ reproducibility on the radiomic features.
Figure 4. Boxplot of the ICC value represents by inter-observers’ reproducibility on the radiomic features.
Diagnostics 11 01573 g004
Table 1. Composition of 662 radiomic features extracted using 3D-Slicer.
Table 1. Composition of 662 radiomic features extracted using 3D-Slicer.
FeaturesOriginal (n = 1)Wavelet (n = 8)
Shape (n = 14)14 × 1-
Texture (n = 54)54 × 154 × 8
Tumour Intensity (n = 18)18 × 118 × 8
Table 2. Intra-class correlation coefficient (ICC) of semi-automatic and manual segmentation for shape and grey level run length matrix (GLRLM) features.
Table 2. Intra-class correlation coefficient (ICC) of semi-automatic and manual segmentation for shape and grey level run length matrix (GLRLM) features.
FeaturesOriginalSemi-AutomaticManual
ShapeVoxel Volume *0.9940.996
Maximum 3D-Diameter0.9640.973
Mesh Volume *0.9940.856
Major Axis Length0.9720.909
Sphericity0.9400.909
Least Axis Length *0.9970.909
Elongation0.9680.909
Surface Volume Ratio *0.9900.909
Maximum 2D-Diameter Slice0.9600.909
Flatness0.9830.909
Surface Area0.9710.908
Minor Axis Length *0.9960.909
Maximum 2D-Diameter Column0.9610.909
Maximum 2D-Diameter Row0.9550.909
Gray-level run-length matrix (GLRLM)Short Run Low Gray-Level Emphasis0.9250.909
Gray-Level Variance0.9510.909
Low Gray-Level Run Emphasis *0.9720.909
Gray-Level Non-Uniformity Normalized0.9700.909
Run Variance *0.9870.909
Gray-Level Non-Uniformity *0.9960.909
Long-Run Emphasis0.9850.909
Short Run High Gray-Level Emphasis0.9460.909
Run-Length Non-Uniformity *0.9910.909
Short Run Emphasis0.9800.909
Long Run High Gray-Level Emphasis0.9370.906
Run Percentage *0.9780.909
Long Run Low Gray-Level Emphasis *0.9890.907
Run Entropy0.9540.909
High Gray-Level Run-Emphasis0.9460.909
Run-Length Non-Uniformity Normalized *0.9780.909
Tumour Intensity (First Order Statistics)Interquartile Range *0.9800.909
Skewness0.9470.909
Uniformity0.9680.909
Median0.9250.909
Energy *0.9510.514
Robust Mean Absolute Deviation0.9450.909
Mean Absolute Deviation0.9710.909
Total Energy *0.9780.053
Maximum *0.9940.909
Root Mean Squared0.9720.909
90-Percentile0.9490.909
Minimum0.9680.909
Entropy0.9800.909
Range0.9050.909
Variance0.9750.909
10-Percentile0.9370.909
Kurtosis0.9220.909
Mean0.8810.909
* Statistically significant (p < 0.05).
Table 3. Wavelet features for semi-automatic segmentation.
Table 3. Wavelet features for semi-automatic segmentation.
FeaturesWaveletSemi-Auto Segmentation
1.HLL2.LHL3.LHH4.LLH5.HLH6.HHH7.HHL8.LLL
Gray-level run-length matrix (GLRLM)Short Run Low Gray-Level Emphasis0.9920.9830.9890.9470.9810.9790.9700.915
Gray-Level Variance0.9110.9340.9300.9880.9170.9180.9210.953
Low Gray-Level Run Emphasis0.9950.9790.9860.9490.9780.9740.9610.915
Gray-Level Non-Uniformity Normalized0.9600.9800.9860.9900.9680.9790.9820.967
Run Variance0.9840.9830.9880.9970.9780.9900.9900.863
Gray-Level Non Uniformity0.9960.9970.9960.9960.9950.9950.9960.997
Long Run Emphasis0.9820.9790.9850.9960.9750.9890.9900.912
Short Run High Gray-Level Emphasis0.9190.9460.8360.8990.8210.9200.9260.935
Run-Length Non-Uniformity0.9900.9950.9960.9940.9920.9960.9970.883
Short Run Emphasis0.9750.9860.9900.9890.9770.9880.9890.000
Long Run High Gray-Level Emphasis0.9250.9470.8340.8720.8210.9220.9270.768
Run Percentage0.9740.9890.0020.9880.9780.9890.9900.980
Long Run Low Gray-Level Emphasis0.9980.9710.9710.9360.9710.9660.9430.908
Run Entropy0.9510.9730.9760.9830.9580.9770.9800.971
High Gray-Level Run Emphasis0.9200.9470.8360.8960.8200.9210.9270.935
Run-Length Non-Uniformity Normalized0.9740.9900.9930.9870.9780.9880.9890.971
First Order StatisticsInterquartile Range 0.9890.988−0.0010.9910.9750.9260.9900.989
Skewness0.9450.9680.9660.9860.9450.9500.9560.935
Uniformity0.9670.9870.9880.9840.9670.9760.9830.951
Median0.9030.9360.6650.9770.9160.9170.9210.923
Energy 0.9120.9340.9290.9880.9160.9170.9210.953
Robust Mean Absolute Deviation0.9190.9460.8360.8690.8200.9210.9270.935
Mean Absolute Deviation0.9490.9610.9700.9810.9450.9740.9730.973
Total Energy 0.9940.9930.9960.9930.9940.9950.9970.996
Maximum 0.9940.9940.9940.9940.9940.9940.9940.994
Root Mean Squared0.9760.9860.9830.9840.9780.9690.9760.974
90-Percentile0.9100.9230.8330.9520.8280.9060.9120.940
Minimum0.9760.9830.9700.9820.9750.9330.9630.965
Entropy0.9770.9870.9920.9910.9780.9910.9910.933
Range0.9990.9760.9770.9650.9700.9650.9420.845
Variance0.9890.9960.9870.9860.9930.9530.9750.867
10-Percentile0.9540.9090.8580.7700.9870.8980.9130.949
Kurtosis0.9710.9640.9560.9630.9940.9880.9910.946
Mean0.9950.9800.9860.9540.9970.9740.9610.922
Table 4. Wavelet features for manual segmentation.
Table 4. Wavelet features for manual segmentation.
FeaturesWaveletManual Segmentation
1.HLL2.LHL3.LHH4.LLH5.HLH6.HHH7.HHL8.LLL
Gray-level run-length matrix (GLRLM)Short Run Low Gray-Level Emphasis0.9260.9690.9740.9870.9800.9420.9630.960
Gray-Level Variance0.9980.9990.9990.9990.9980.8871.0000.998
Low Gray-Level Run Emphasis0.9240.9650.9720.9870.9790.9380.9570.950
Gray-Level Non-Uniformity Normalized0.9800.9910.9930.9970.9830.9890.9860.993
Run Variance0.9840.9760.9890.9780.9800.9870.9810.968
Gray-Level Non Uniformity0.9960.9960.9960.9970.9960.9960.9960.998
Long Run Emphasis0.9850.9790.9880.9790.9790.9870.9810.972
Short Run High Gray-Level Emphasis0.8720.5160.9270.7160.5060.9990.9800.984
Run-Length Non-Uniformity0.9900.9910.9950.9910.9930.9960.9940.991
Short Run Emphasis0.9830.9900.9940.9960.9840.9900.9860.989
Long Run High Gray-Level Emphasis0.8690.5070.9250.6750.4900.9960.9790.981
Run Percentage0.9820.9920.9950.9960.9860.9910.9880.988
Long Run Low Gray-Level Emphasis0.9150.9390.9620.9550.9690.9380.9450.914
Run Entropy0.9920.9890.9930.9990.9920.9960.9950.994
High Gray-Level Run Emphasis0.8720.5150.9270.7080.5040.9990.9800.984
Run-Length Non-Uniformity Normalized0.9830.9930.9950.0010.9870.9920.9890.991
First Order StatisticsInterquartile Range −0.0050.0210.0870.0330.0100.026−0.007−0.008
Skewness0.9940.9980.9980.9990.9950.9990.9990.999
Uniformity0.9720.9850.9880.9920.9820.9840.9840.957
Median0.9980.9990.9990.9990.9991.0001.0000.997
Energy 0.0000.0000.0000.0000.0000.0000.0000.000
Robust Mean Absolute Deviation0.8720.5150.9270.7080.5040.9990.9800.984
Mean Absolute Deviation0.0960.9460.9580.9180.8670.9710.9250.057
Total Energy0.0000.0010.0300.0000.0020.0510.0020.000
Maximum 0.9960.9960.9960.9970.9960.9960.9960.997
Root Mean Squared−0.0180.0340.3150.010−0.0070.2420.031−0.007
90-Percentile0.8730.5170.9280.8530.5040.9990.9800.990
Minimum0.0010.0060.0070.0130.0000.004−0.001−0.025
Entropy0.9770.9720.9760.9720.9700.9640.9650.949
Range0.5480.9530.9790.7870.9650.9270.9390.183
Variance0.1020.9120.9810.8370.5600.9740.6300.065
10-Percentile0.8880.7180.8900.6000.5870.9640.9690.921
Kurtosis−0.005−0.006−0.007−0.002−0.006−0.005−0.0100.000
Mean0.9180.9610.9810.4230.9780.9300.9570.002
Table 5. Features extracted in different reproducibility groups.
Table 5. Features extracted in different reproducibility groups.
Reproducibility GroupsSemi-AutomaticManual
High (ICC ≥ 0.8)639 (96.7%)517 (78.1%)
Medium (0.8 > ICC ≥ 0.5)12 (1.8%)43 (6.5%)
Low (ICC < 0.5)11 (1.5%)102 (15.4%)
Table 6. ICC of radiomic features.
Table 6. ICC of radiomic features.
FeaturesSemi_1Semi_2Manual
ShapeVoxel Volume *0.9840.9960.996
Maximum 3D-Diameter0.8780.9650.973
Mesh Volume 0.9850.9960.857
Major Axis Length0.9390.9560.909
Sphericity0.7560.9030.909
Least Axis Length 0.9910.9990.909
Elongation *0.9410.9320.909
Surface Volume Ratio 0.9620.9770.909
Maximum 2D-Diameter Slice0.9240.9440.909
Flatness0.9700.9540.909
Surface Area0.9410.9880.908
Minor Axis Length 0.9840.9980.909
Maximum 2D-Diameter Column0.8860.9950.909
Maximum 2D-Diameter Row0.8660.9650.909
Gray-level run-length matrix (GLRLM)Short Run Low Gray-Level Emphasis *0.9640.9270.909
Gray-Level Variance0.9990.9980.909
Low Gray-Level Run Emphasis 0.9830.9620.909
Gray-Level Non-Uniformity Normalized0.9970.9980.909
Run Variance 0.9930.9990.909
Gray-Level Non-Uniformity 0.9950.9950.909
Long-Run Emphasis0.9920.9990.909
Short Run High Gray-Level Emphasis0.9430.9270.909
Run-Length Non-Uniformity 0.9960.9980.909
Short Run Emphasis0.9990.9990.909
Long Run High Gray-Level Emphasis *0.9370.9220.909
Run Percentage 0.9990.9990.909
Long Run Low Gray-Level Emphasis 0.9860.9980.909
Run Entropy0.9970.9980.909
High Gray-Level Run-Emphasis *0.9430.9270.909
Run-Length Non-Uniformity Normalized 0.9990.9990.909
Tumour Intensity (First Order Statistics)Interquartile Range 0.980.9890.909
Skewness0.9990.9960.909
Uniformity10.9940.909
Median10.9930.909
Energy 0.9990.9980.805
Robust Mean Absolute Deviation0.9420.9260.909
Mean Absolute Deviation0.9910.9980.909
Total Energy10.9920.849
Maximum 0.9990.9920.909
Root Mean Squared0.9970.9950.909
90-Percentile *0.9450.9190.909
Minimum0.9950.9870.909
Entropy0.9990.9990.909
Range0.9980.9980.909
Variance0.9990.9970.909
10-Percentile0.9440.9280.909
Kurtosis0.8880.90.909
Mean0.9880.9750.909
* Statistically significant (p < 0.05).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Haniff, N.S.M.; Abdul Karim, M.K.; Osman, N.H.; Saripan, M.I.; Che Isa, I.N.; Ibahim, M.J. Stability and Reproducibility of Radiomic Features Based Various Segmentation Technique on MR Images of Hepatocellular Carcinoma (HCC). Diagnostics 2021, 11, 1573. https://doi.org/10.3390/diagnostics11091573

AMA Style

Haniff NSM, Abdul Karim MK, Osman NH, Saripan MI, Che Isa IN, Ibahim MJ. Stability and Reproducibility of Radiomic Features Based Various Segmentation Technique on MR Images of Hepatocellular Carcinoma (HCC). Diagnostics. 2021; 11(9):1573. https://doi.org/10.3390/diagnostics11091573

Chicago/Turabian Style

Haniff, Nurin Syazwina Mohd, Muhammad Khalis Abdul Karim, Nurul Huda Osman, M Iqbal Saripan, Iza Nurzawani Che Isa, and Mohammad Johari Ibahim. 2021. "Stability and Reproducibility of Radiomic Features Based Various Segmentation Technique on MR Images of Hepatocellular Carcinoma (HCC)" Diagnostics 11, no. 9: 1573. https://doi.org/10.3390/diagnostics11091573

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop