Prediction of Pseudoprogression versus Progression using Machine Learning Algorithm in Glioblastoma

Jang, Bum-Sup; Jeon, Seung Hyuck; Kim, Il Han; Kim, In Ah

doi:10.1038/s41598-018-31007-2

Download PDF

Article
Open access
Published: 21 August 2018

Prediction of Pseudoprogression versus Progression using Machine Learning Algorithm in Glioblastoma

Bum-Sup Jang¹^na1,
Seung Hyuck Jeon¹^na1,
Il Han Kim^1,3 &
…
In Ah Kim^2,3

Scientific Reports volume 8, Article number: 12516 (2018) Cite this article

6050 Accesses
86 Citations
3 Altmetric
Metrics details

Subjects

Abstract

We aimed to investigate the feasibility of machine learning (ML) algorithm to distinguish pseudoprogression (PsPD) from progression (PD) in patients with glioblastoma (GBM). We recruited the patients diagnosed as primary GBM who received gross total resection (GTR) and concurrent chemoradiotherapy in two institutions from April 2010 to April 2017 and presented suspicious contrast-enhanced lesion on brain magnetic resonance imaging (MRI) during follow-up. Patients from two institutions were allocated to training (N = 59) and testing (N = 19) datasets, respectively. We developed a convolutional neural network combined with a long short-term memory ML structure. MRI data, which was 9 axial post-contrast T1-weighted images in our study, and clinical features were incorporated (Model 1). In the testing set, the trained Model 1 resulted in AUC of 0.83, AUPRC of 0.87, and F1-score of 0.74 using optimal threshold. The performance was superior to that of Model 2 (CNN-LSTM model with MRI data alone) and Model 3 (random forest model with clinical feature alone). The developed algorithm involving MRI data and clinical features could help making decision during follow-up of patients with GBM treated with GTR and concurrent CCRT.

Discriminating pseudoprogression and true progression in diffuse infiltrating glioma using multi-parametric MRI data through deep learning

Article Open access 23 November 2020

A Neural Network Approach to Identify the Peritumoral Invasive Areas in Glioblastoma Patients by Using MR Radiomics

Article Open access 16 June 2020

Pseudoprogression prediction in high grade primary CNS tumors by use of radiomics

Article Open access 08 April 2022

Introduction

Even after the introduction of a standard regimen consisting of concurrent chemoradiotherapy (CCRT) and adjuvant temozolomide, most patients with glioblastoma multiforme (GBM) experience disease progression¹. Clinicians often encounter a situation where they need to distinguish progressive disease (PD) from pseudoprogression (PsPD) following CCRT. PsPD is resulted from disruption of blood-brain barrier by CCRT and subsequent leakage of contrast material outside blood vessel. The discrimination is challenging because both lesions demonstrate similar contrast enhancement (CE) on gadolinium-enhanced T1-weighted magnetic resonance imaging (MRI)^2,3. Although pathologic confirmation is the most reliable method to diagnose PD or PsPD, numerous non-invasive attempts have been made for discrimination using diffusion-weighted imaging^4,5,6, perfusion imaging^7,8, or positron emission tomography (PET)^9,10. While these advanced imaging techniques have shown certain values, most experts do not agree with that traditional MRI such as T1-weighted or T2-weighted MRI can distinguish PsPD from PD¹¹. The conventional images, however, may give us a clue when they are analyzed with modern tools such as machine learning (ML).

Recently, ML algorithms are actively employed in the field of oncology. Convolutional neural network (CNN) is one of the ML algorithms that imitates a human visual cortex. CNN is designed to extract the feature maps that are compressed and abstracted from the input images and to perform given tasks with these feature maps. Thus, this model has proved its advantages in pulmonary nodule detection¹², mitosis detection in microscopic images¹³, and skin cancer classification¹⁴. As another popular ML algorithm, long short-term memory (LSTM)¹⁵ was recently introduced to effectively train recurrent neural networks by preventing explosion and vanishment of gradient problems that are common in deep recurrent neural networks¹⁶. Therefore, LSTM is prominently used in challenging sequence predictions such as automatic image caption generation¹⁷, automatic translation of text¹⁸, and automatic handwriting generation¹⁹. Regarding sequence, combination of CNN with LSTM (CNN-LSTM) were found to predict RNA-protein sequence and structure binding preferences²⁰. This architecture has been introduced in visual recognition, image description, and video description²¹.

To the best of our knowledge, there are no studies that investigate the potential role of CNN-LSTM structure in discrimination of PsPD from PD. The specific aim of our study was to demonstrate the feasibility of the ML algorithm in predicting PsPD with conventional images, especially gadolinium-enhanced T1-weighted MRI, in patients with GBM after CCRT.

Methods

Patient Selection

The institutional review board at SNUH (Seoul National University Hospital) and SNUBH (Seoul National University Bundang Hospital) approved this study protocol with a waiver of the written informed consent. All methods were performed in accordance with the relevant guidelines and regulations. We retrospectively reviewed patients with primary GBM who underwent gross total resection of enhancing tumor (GTR) followed by CCRT and adjuvant temozolomide from April 2010 to April 2017 at two institutions: SNUH and SNUBH. All patients who exhibited single measurable CE lesion of any size on gadolinium-enhanced T1-weighted MRI within 80% isodose line after CCRT (based on Response Assessment in Neuro-Oncology criteria²²) were included in the study. The exclusion criteria were as follows: (1) Demonstration of CE lesion on MRI not per institutional protocol, (2) No sufficient follow-up to determine the identity of lesion, (3) Detection of CE lesion before CCRT, (4) Suspicious residual CE lesion at immediate post-operative MRI, indicating incomplete resection, and (5) Incomplete CCRT. Finally, 78 patients (SNUH, N = 59 and SUNBH, N = 19) were included in the present study.

Data Collection and Preprocessing

In both the institutions, initial and follow-up images were obtained according to the specific protocol for glioma patients. All the images included T1-weighted 3D magnetization-prepared rapid acquisition gradient echo (MPRAGE) sequence before and after administration of gadolinium. Nine successive axial images of post-contrast MPRAGE sequence were selected by clinicians where the fifth image best represents the suspicious CE lesion. MRIs were acquired using 1.5-T (N = 11) or 3-T (N = 67) scanner. The 3D-MPRAGE images were obtained with matrix ranged from 256 × 256 to 1024 × 1024. The median slice thickness was 1-mm (range, 0.86–1.50 mm). Detailed information of imaging parameters is provided in Supplementary Table S1. Because pixel size and field of view (FOV) varied, input images were normalized as follow. First, they were resized into 200 × 200 (mm) images by cropping or padding. This size was selected since FOV was greater than 200 × 200 (mm) in all images but one with FOV of 193 × 193 (mm). The resized images were resampled into 256 × 256 pixels. The intensities of pixels were linearly scaled to have zero mean and unit norm.

The following clinical features were collected from medical records: age at the time of surgery, gender, methylation status of the O6-methylguanine-DNA-methyltransferase (MGMT) promoter, mutational status of the isocitrate dehydrogenase (IDH) gene, the total dose and number of fractions of radiotherapy, and the interval between the end of CCRT and the appearance of CE lesion. All clinical parameters were normalized and ranged between 0 and 1.

Forty-eight CE lesions that were surgically confirmed to be PD (N = 20), increased without spontaneous decrease on follow-up MRI (N = 25), or showed significant uptake on PET (N = 3) were classified as PD, and 30 CE lesions that were pathologically proved to be PsPD (N = 3), reduced on follow-up MRI before intervention (N = 21), remained stable for at least 120 days after appearance (N = 5), or no significant uptake on PET (N = 1) were considered as PsPD. The discrimination of lesions in our study was in accord with the multi-disciplinary assessment and treatment planning of the two institutions.

ML Network Structure

In the present analysis, we utilized the deep CNN-LSTM structure because CNN can learn features from brain MRI and LSTM recognizes the spatial sequence of images. Along with MRI, clinical factors are important when clinicians decide the identity of a lesion. Therefore, clinical parameters including age, gender, total radiation dose, number of fractions, interval between CCRT and appearance of lesion, MGMT methylation status, and IDH mutation status were also utilized in our study.

A total of three models were built to evaluate and compare the performance of the models and parameters. In ‘Model 1’, both MR images and clinical parameters were incorporated into the CNN-LSTM structure. All the nine axial images were passed through each three CNN layer that contains 2 × 2 kernels to create 64, 128, and 256 filters. The binary cross-entropy loss function was minimized using the classical stochastic gradient descent optimizer²³ at a learning rate of 0.001. ReLu nonlinear function was applied at every CNN layer. Batch normalization and max pooling with 2 × 2 kernel size were applied after every CNN layer. The flatten layer was added at the end of the CNN layers, and the nine flattened patches entered LSTM sequentially. Clinical factors were passed into two successive fully connected layers with four nodes, which were activated by the ReLu function. The outputs of LSTM and the fully connected layer were merged by concatenation. Finally, the merged information was connected to the output of the fully connected layer with one node activated by sigmoid function to determine PsPD or PD.

‘Model 2’ and ‘Model 3’ were developed as a benchmark against ‘Model 1’. In Model 2, MR images but not clinical parameters were used as input of the CNN-LSTM structure. The structure of ‘Model 2’ was identical to that of ‘Model 1’, except the layers of clinical parameters. ‘Model 3’ was built with random forest (RF) classifier to evaluate the ability of clinical parameters without MR images. The number of trees used to train was 1,000 and number of variables randomly sampled at each split was 2. The schematic representation of the structures of the three models is shown in Fig. 1.

Deep learning architecture was implemented using “Keras” wrapper library version 2.0.8 in Python version 3.3 environment with Tensorflow version 1.4 as backend.

Analysis

Patients collected from SNUH (N = 59) and SNUBH (N = 19) were allocated to training and testing sets, respectively. Because the distribution of binary cases was not uniform, we estimated the area under the ROC curve (AUC) and the area under the precision-recall curve (AUPRC) values to evaluate the trained model in the testing set. Furthermore, we generated the confusion matrix and estimated the precision, recall, and F1-score to compare the performance among three models²⁴.

Results

Patient Characteristics and Treatment

The clinical characteristics of study patients are presented in Table 1. Thirty (38.5%) and 48 (61.5%) of the CE lesions were PsPD and PD, respectively. There was no significant difference between training and testing sets, except IDH mutation status (p = 0.04, Fisher’s exact test). Female patients tended to present PD rather than PsPD (p = 0.03, Chi-squared test), and the interval between CCRT and CE appearance was significantly shorter in PsPD than PD (p = 0.02, t-test).

Table 1 Patient Characteristics.

Full size table

Negative Control

To identify negative control considering class imbalance, we performed 10-fold internal validation in the scrambled training set. The resulted mean AUC value was 0.47 (Supplementary Fig. S1A). Next, we trained the Model 1 with scrambled training set and tested the finalized model in the testing set (N = 19). The estimated AUC value was approximately 0.5 (Supplementary Fig. S1B). Thus, we considered the ‘luck’ as which results AUC of 0.5 in our downstream analysis.

Parameter Tuning

We tuned parameters including the number of epochs, batch size, memory cell size of LSTM, and learning rate of Model 1. First, we sought to find the optimal the number of epochs. We traced the performances of the model in the training set along with the number of epochs in five iterations, plotting train and validation loss when learning rate was 0.001, the number of memory cells in LSTM was 24, and batch size was 8. Twenty percent of the training set was used as the validation set. We assumed that train loss and validation loss would meet at the optimal epoch number. In current study, we found that 25 was the adequate number of epochs to train the model. Figure 2A shows the train and validation traces from each epoch, representing the behavior of the model over time.

Subsequently, we determined the memory cell size by comparing AUC value from 5-fold validation in case of the memory cell sizes of 18, 20, 22, and 24. The statistics of AUC value for each memory cell size are shown in box and whisker plots (Fig. 2B); we selected the most appropriate memory cell size to be 24. To determine the optimal batch size, similar comparison was performed. The batch size was adjusted not to exceed 9 to avoid errors from memory shortage. On the other hand, if the batch size was below 6, training time was significantly long. Therefore, we compared AUC values with batch sizes of 6, 7, 8, and 9 (Fig. 2C); batch size of 8 resulted in the highest AUC value and determined as the optimal value. Lastly, appropriate learning rate was determined to be 0.01 by comparing results from various learning rates which were 0.0001, 0.001, 0.01, and 0.1 (Fig. 2D). The tuned parameters were applied to both Model 1 and Model 2.

Training

We performed a 10-fold cross validation in the training set. ROC curve and precision-recall curve of each fold in Model 1 are shown in Fig. 3A,B, respectively. The estimated values of mean AUC and micro average AUPRC were 0.72 and 0.92, respectively. Approximately 15 minutes to perform 10-fold validation in the training set and 2 minutes to finalize the model were required with 11GB Geforce 1080Ti GPU. Training procedure of Model 2 was same as Model 1. Model 3 was trained in RF classifier with parameters described above.

Testing and Benchmarking

The ROC curve, precision-recall curve, and the normalized confusion matrix of Model 1 in the testing set are depicted in Fig. 4A. The estimated values of AUC and AUPRC were 0.83 and 0.87 in testing set, respectively. The optimal threshold value was determined when the true positive rate (TPR) was high and the difference of TPR and (1-false positive rate) was nearly zero. As a result, the average precision, average recall, and average F1-score of ‘Model 1’ were 0.74, 0.74, and 0.74, respectively.

The results of Model 2 in the testing set is demonstrated in Fig. 4B. The average precision, average recall, and average F1-score of this model were 0.58, 0.58, and 0.85, respectively. The estimated values of AUC and AUPRC was 0.69 and 0.81, respectively. As shown in Fig. 4C, the estimated values of AUC and AUPRC of Model 3 were 0.52 and 0.59, respectively, indicating that the performance was the best in Model 1. Results of all models are summarized in Table 2.

Table 2 Summary of Result and Model Comparison.

Full size table

Discussion

In current study, we presented a novel ML algorithm to predict PsPD in GBM patients after adjuvant CCRT, given both MRI showing a suspicious CE lesion and clinical factors. Our algorithm achieved a moderate predictability in the unseen testing set (AUC = 0.83, AUPRC = 0.87, and F1-score = 0.74) collected from the independent institution. To our knowledge, the present study is the first to use deep ML algorithm for the identification of PsPD in GBM patients.

Early differentiation between PsPD and PD is extremely important in terms of salvage treatment. Many researchers have investigated the usefulness of radiologic features for the prediction. Several authors have reported that diffusion and perfusion MRI have additional roles in detecting PsPD. By combining parameters from diffusion tensor imaging and perfusion imaging, Wang et al. built a model differentiating PsPD from non-PsPD with AUC of 0.807²⁵. Prager et al. also demonstrated a model using diffusion and perfusion MRI that yielded 93.1% sensitivity and 83.3% specificity in predicting PD²⁶. Recent meta-analysis revealed that PET provided better accuracy in detecting recurrent tumors than conventional MRI²⁷. Despite the role of diffusion and perfusion images, there are still no specific guideline involving the advanced imaging modalities²². Therefore, the model using the conventional MRI may have potency in terms of widespread use and easy validation. Chen et al.²⁸ attempted to predict PsPD using texture features of T1-weighted and T2-weighted MRI and showed an accuracy of 86.4% using the model, suggesting the potential role of conventional images. However, they included 22 patients and did not validate the model.

Recently, ML algorithms have been an attractive tool in analyzing medical images. Several investigators utilized support vector machine classifier and multi-parametric MR images^29,30, but the results were not externally validated. The deep and sequential ML algorithm adopted in our analysis is widely used. CNN has been utilized for segmentation or classification of brain tumor on MR image^{31,32,33,34,35}. We avoided to use modern CNN models, such as ‘GoogLeNet’³⁶, because deeper structures may cause worse performance with small number of samples. Nevertheless, the F1-score of our algorithm was acceptable in external validation.

One of the strengths of our study is incorporation of clinical parameters. No previous ML studies included clinical factors in their models. It is well known that some clinical variables are associated with the likelihood of PD or PsPD. For instance, methylation of MGMT and IDH mutation is associated with the formation of PsPD and disease progression^37,38. Consequently, we hypothesized that clinical parameters could improve the performance of ML model, and the hypothesis was tested by developing the three models. Our results suggest that model including both MR and clinical features performs better than models including only one of them. Given that CNN-LSTM is a black-box technique, we cannot quantify exact importance of features among input data. However, it implies that combining radiologic and clinical data in discriminating PsPD is necessary not only in clinic but also in future investigations. Minimization of intervention by clinician is another advantage of our models where CE lesions were not needed to be segmented. The segmentation process is subjective to operator, labor-intensive, and difficult to be automated.

We selected 9 axial images to be incorporated in the models. The number of input images depends on training resources such as data availability, time, cost, or algorithm. Nine images are easy to handle and make developers use moderate-scale algorithm without expensive high-end computers with less time to train and predict. Moreover, using whole MR image set may cause ‘curse of dimensionality’³⁹ because the number of samples is small and could possibly consume more computational resources. On the other hand, we utilized MR images with only enhanced T1-weighted sequence. The MPRAGE sequence is widely used in daily practice and can be acquired with both 1.5-T and 3-T MR scanners, facilitating validation and application of the developed algorithm⁴⁰. Furthermore, including other sequences may also increase the input dimension compared to the sample size. Eventually, 9 input images with one sequence were selected as input for the algorithm, which is acceptable given the sample size and the clinical applicability of the model.

Difficulty in defining PD and PsPD is an intrinsic limitation of the study. While only surgical resection can confirm the identity of lesion, the majority of patients (70.5%, 55/78) did not receive the second resection due to poor performance of patients. Instead, we used strict criteria in discriminating PD and PsPD, which was in concordance with multi-disciplinary decision.

Compared to other ML studies dealing with a binary decision problem, however, metrics estimated from the present study appears not remarkable. Small number of cases was one of the possible reasons. Basically, the incidence of GBM is low and the inclusion criteria of the study were strict. We excluded patients who received partial resection of tumor because they cannot exhibit pure PsPD. Those with short follow-up period were also excluded. Due to small datasets, our model could have been overfitted; hence, further validation with more samples is required to confirm the clinical utility of our model.

In addition, one may consider adversarial examples, which were recently reported to attack deep neural network⁴¹, in our study. Existence of adversarial examples and defense against them are under investigation. Concerning the existence of adversarial examples, which might lead to false decision, Szegedy et al.⁴² addressed that the probability of adversarial examples is extremely low and they are hardly seen in testing sets. Another study⁴³ reported that adversarial examples are mainly distributed with low probability compared to clean data. Nevertheless, we attempted to generate adversarial images with fast gradient sign method⁴⁴, which adds perturbation to original images. However, perturbation cannot be simply calculated from backpropagation since our CNN is involved with LSTM and neural network taking clinical features. Most adversarial cases focus only on image classification using deep neural network and, to our knowledge, there is no adversarial examples about CNN-LSTM model to date. If adversarial examples of our model exist and can be found in future studies, the performance of our model would be improved.

In conclusion, we developed a deep ML algorithm that can be applied to differentiate PsPD and PD in GBM patients who had completed current standard therapy. With 9 selected axial MR images and clinical factors, the model showed acceptable performance in the independent dataset. Our algorithm could help making decision during follow-up. Further validation studies with larger samples from various institutions is necessary to ensure the clinical utility of this model.

Data availability

The Python source codes of the process are free and available at https://github.com/bigwiz83/PsPDvsPD. However, the analyzed datasets cannot be opened publicly due to the law for handling of medical information in Korea. Reasonable request following approval from institutional review board is required to access to the datasets.

References

Stupp, R. et al. Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. New England Journal of Medicine 352, 987–996 (2005).
Article PubMed CAS Google Scholar
Brandsma, D., Stalpers, L., Taal, W., Sminia, P. & van den Bent, M. J. Clinical features, mechanisms, and management of pseudoprogression in malignant gliomas. The lancet oncology 9, 453–461 (2008).
Article PubMed Google Scholar
Topkan, E., Topuk, S., Oymak, E., Parlak, C. & Pehlivan, B. Pseudoprogression in patients with glioblastoma multiforme after concurrent radiotherapy and temozolomide. American journal of clinical oncology 35, 284–289 (2012).
Article PubMed Google Scholar
Chu, H. H. et al. Differentiation of true progression from pseudoprogression in glioblastoma treated with radiation therapy and concomitant temozolomide: comparison study of standard and high-b-value diffusion-weighted imaging. Radiology 269, 831–840 (2013).
Article PubMed Google Scholar
Park, J. E., Kim, H. S., Goh, M. J., Kim, S. J. & Kim, J. H. Pseudoprogression in patients with glioblastoma: assessment by using volume-weighted voxel-based multiparametric clustering of MR imaging data in an independent test set. Radiology 275, 792–802 (2015).
Article PubMed Google Scholar
Reimer, C. et al. Differentiation of pseudoprogression and real progression in glioblastoma using ADC parametric response maps. PloS one 12, e0174620 (2017).
Article PubMed PubMed Central CAS Google Scholar
Suh, C., Kim, H., Choi, Y., Kim, N. & Kim, S. Prediction of pseudoprogression in patients with glioblastomas using the initial and final area under the curves ratio derived from dynamic contrast-enhanced T1-weighted perfusion MR imaging. American Journal of Neuroradiology 34, 2278–2286 (2013).
Article PubMed CAS Google Scholar
Thomas, A. A. et al. Dynamic contrast enhanced T1 MRI perfusion differentiates pseudoprogression from recurrent glioblastoma. Journal of neuro-oncology 125, 183–190 (2015).
Article PubMed PubMed Central Google Scholar
Galldiks, N. et al. Diagnosis of pseudoprogression in patients with glioblastoma using O-(2-[18F] fluoroethyl)-L-tyrosine PET. European journal of nuclear medicine and molecular imaging 42, 685–695 (2015).
Article PubMed CAS Google Scholar
Kebir, S. et al. Unsupervised consensus cluster analysis of [18F]-fluoroethyl-L-tyrosine positron emission tomography identified textural features for the diagnosis of pseudoprogression in high-grade glioma. Oncotarget 8, 8294 (2017).
Article PubMed Google Scholar
Abdulla, S., Saada, J., Johnson, G., Jefferies, S. & Ajithkumar, T. Tumour progression or pseudoprogression? A review of post-treatment radiological appearances of glioblastoma. Clinical radiology 70, 1299–1312 (2015).
Article PubMed CAS Google Scholar
Setio, A. A. A. et al. Pulmonary nodule detection in CT images: false positive reduction using multi-view convolutional networks. IEEE transactions on medical imaging 35, 1160–1169 (2016).
Article PubMed Google Scholar
Cireşan, D. C., Giusti, A., Gambardella, L. M. & Schmidhuber, J. In International Conference on Medical Image Computing and Computer-assisted Intervention. 411–418 (Springer, 2013).
Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118 (2017).
Article ADS PubMed CAS Google Scholar
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural computation 9, 1735–1780 (1997).
Article PubMed CAS Google Scholar
Min, S., Lee, B. & Yoon, S. Deep learning in bioinformatics. Briefings in bioinformatics 18, 851–869 (2017).
PubMed Google Scholar
Vinyals, O., Toshev, A., Bengio, S. & Erhan, D. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3156–3164 (2015).
Sutskever, I., Vinyals, O. & Le, Q. V. In Advances in neural information processing systems. 3104–3112 (2014).
Graves, A. Generating sequences with recurrent neural networks. arXiv preprint arXiv:1308.0850 (2013).
Pan, X., Rijnbeek, P., Yan, J. & Shen, H.-B. Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks. bio Rxiv, 146175 (2017).
Donahue, J. et al. Long-Term Recurrent Convolutional Networks for Visual Recognition and Description. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 677–691, https://doi.org/10.1109/TPAMI.2016.2599174 (2017).
Article PubMed Google Scholar
Wen, P. Y. et al. Updated response assessment criteria for high-grade gliomas: response assessment in neuro-oncology working group. Journal of Clinical Oncology 28, 1963–1972 (2010).
Article PubMed Google Scholar
Bengio, Y. In Neural networks: Tricks of the trade 437–478 (Springer, 2012).
Hripcsak, G. & Rothschild, A. S. Agreement, the f-measure, and reliability in information retrieval. Journal of the American Medical Informatics Association 12, 296–298 (2005).
Article PubMed PubMed Central Google Scholar
Wang, S. et al. Differentiating tumor progression from pseudoprogression in patients with glioblastomas using diffusion tensor imaging and dynamic susceptibility contrast MRI. American Journal of Neuroradiology 37, 28–36 (2016).
Article PubMed CAS Google Scholar
Prager, A. et al. Diffusion and perfusion MRI to differentiate treatment-related changes including pseudoprogression from recurrent tumors in high-grade gliomas with histopathologic evidence. American Journal of Neuroradiology 36, 877–885 (2015).
Article PubMed CAS Google Scholar
Nihashi, T., Dahabreh, I. & Terasawa, T. Diagnostic accuracy of PET for recurrent glioma diagnosis: a meta-analysis. American Journal of Neuroradiology 34, 944–950 (2013).
Article PubMed CAS Google Scholar
Chen, X. et al. Differentiation of true-progression from pseudoprogression in glioblastoma treated with radiation therapy and concomitant temozolomide by GLCM texture analysis of conventional MRI. Clinical imaging 39, 775–780 (2015).
Article PubMed Google Scholar
Hu, X., Wong, K. K., Young, G. S., Guo, L. & Wong, S. T. Support vector machine multiparametric MRI identification of pseudoprogression from tumor recurrence in patients with resected glioblastoma. Journal of Magnetic Resonance Imaging 33, 296–305, https://doi.org/10.1002/jmri.22432 (2011).
Article PubMed PubMed Central Google Scholar
Qian, X. et al. Stratification of pseudoprogression and true progression of glioblastoma multiform based on longitudinal diffusion tensor imaging without segmentation. Medical physics 43, 5889–5902 (2016).
Article ADS PubMed PubMed Central Google Scholar
Hussain, S., Anwar, S. M. & Majid, M. Segmentation of Glioma Tumors in Brain Using Deep Convolutional Neural Network (2017).
Soltaninejad, M., Zhang, L., Lambrou, T. & Allinson, N. Multimodal MRI brain tumor segmentation using random forests with features learned from fully convolutional neural network (2017).
Pereira, S., Pinto, A., Alves, V. & Silva, C. A. In International Workshop on Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries. 131–143 (Springer, 2016).
Havaei, M. et al. Brain tumor segmentation with Deep Neural Networks. Medical Image Analysis 35, 18–31, https://doi.org/10.1016/j.media.2016.05.004 (2017).
Article PubMed Google Scholar
Pan, Y. et al. Brain Tumor Grading Based on Neural Networks and Convolutional Neural Networks. 699–702 (2015).
Szegedy, C. et al. Going Deeper with Convolutions (Cvpr, 2015).
Brandes, A. A. et al. MGMT promoter methylation status can predict the incidence and outcome of pseudoprogression after concomitant radiochemotherapy in newly diagnosed glioblastoma patients. Journal of Clinical Oncology 26, 2192–2197 (2008).
Article PubMed Google Scholar
Li, H., Li, J., Cheng, G., Zhang, J. & Li, X. IDH mutation and MGMT promoter methylation are associated with the pseudoprogression and improved prognosis of glioblastoma multiforme patients who have undergone concurrent and adjuvant temozolomide-based chemoradiotherapy. Clinical neurology and neurosurgery 151, 31–36 (2016).
Article PubMed Google Scholar
Trunk, G. V. A problem of dimensionality: A simple example. IEEE Transactions on pattern analysis and machine intelligence, 306–307 (1979).
Ellingson, B. M. et al. Consensus recommendations for a standardized Brain Tumor Imaging Protocol in clinical trials. Neuro-oncology 17, 1188–1198 (2015).
Article PubMed PubMed Central Google Scholar
Yuan, X., He, P., Zhu, Q., Bhat, R. R. & Li, X. Adversarial Examples: Attacks and Defenses for Deep Learning. arXiv preprint arXiv:1712.07107 (2017).
Szegedy, C. et al. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 (2013).
Song, Y., Kim, T., Nowozin, S., Ermon, S. & Kushman, N. PixelDefend: Leveraging Generative Models to Understand and Defend against Adversarial Examples. arXiv preprint arXiv:1710.10766 (2017).
Goodfellow, I. J., Shlens, J. & Szegedy, C. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 (2014).

Download references

Acknowledgements

This work was supported by Seoul National University Big Data Institute via The Data Science Research Project 2017 and SNUBH Research Fund (#14-2018-003) to In Ah Kim.

Author information

Bum-Sup Jang and Seung Hyuck Jeon contributed equally.

Authors and Affiliations

Department of Radiation Oncology, Seoul National University Hospital, Seoul, Korea
Bum-Sup Jang, Seung Hyuck Jeon & Il Han Kim
Department of Radiation Oncology, Seoul National University Bundang Hospital, Seongnamsi, Korea
In Ah Kim
Institute of Radiation Medicine, Cancer Research Institute, Seoul National University College of Medicine, Seoul, Korea
Il Han Kim & In Ah Kim

Authors

Bum-Sup Jang
View author publications
You can also search for this author in PubMed Google Scholar
Seung Hyuck Jeon
View author publications
You can also search for this author in PubMed Google Scholar
Il Han Kim
View author publications
You can also search for this author in PubMed Google Scholar
In Ah Kim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.-S.J. and I.A.K. wrote the main manuscript text. Two first authors (B.-S.J. and S.H.J.) contributed equally to this work. Specifically, B.-S. developed machine learning model, performed training/testing it, and drafted a part of the manuscript. S.H.J. collected and prepared data samples from the two independent institutions and drafted another part of the manuscript. I.H.K. acquired the approval of institutional review board and edited the manuscript. I.A.K. supervised this research, acquired the funding source, and edited the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to In Ah Kim.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jang, BS., Jeon, S.H., Kim, I.H. et al. Prediction of Pseudoprogression versus Progression using Machine Learning Algorithm in Glioblastoma. Sci Rep 8, 12516 (2018). https://doi.org/10.1038/s41598-018-31007-2

Download citation

Received: 16 March 2018
Accepted: 09 August 2018
Published: 21 August 2018
DOI: https://doi.org/10.1038/s41598-018-31007-2

This article is cited by

A deep learning model for discriminating true progression from pseudoprogression in glioblastoma patients
- Mana Moassefi
- Shahriar Faghani
- Bradley J. Erickson
Journal of Neuro-Oncology (2022)
Clinical applications of artificial intelligence and radiomics in neuro-oncology imaging
- Ahmed Abdel Khalek Abdel Razek
- Ahmed Alksas
- Eman Helmy
Insights into Imaging (2021)
Differentiation of Pseudoprogression from True Progressionin Glioblastoma Patients after Standard Treatment: A Machine Learning Strategy Combinedwith Radiomics Features from T1-weighted Contrast-enhanced Imaging
- Ying-Zhi Sun
- Lin-Feng Yan
- Guang-Bin Cui
BMC Medical Imaging (2021)
The predictive value of absolute lymphocyte counts on tumor progression and pseudoprogression in patients with glioblastoma
- Jing Xi
- Bilal Hassan
- Jian L. Campian
BMC Cancer (2021)
MRI biomarkers in neuro-oncology
- Marion Smits
Nature Reviews Neurology (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.