1 Introduction

Stroke ranks second as leading cause of deaths worldwide, with ischemic stroke being the most common type. Ischemic stroke arises from a sudden occlusion of a cerebral artery. Diagnosis and treatment begins with the acquisition of multi-modal MRI or CT images, followed by appropriate medical intervention. While several clinical trials have proven the efficacy of mechanical thrombectomy, the treating physician must carefully evaluate the associated risks and benefits: the volume of ill-perfused tissue potentially salvageable, versus the risk of causing haemorrhage or other complications [9, 14]. Hence, predicting the outcome of a stroke lesion (i.e. lesion status at three-month follow-up), and thereby evaluating the effect of a successful or unsuccessful mechanical thrombectomy, has a great potential to guide the decision of the physician.

MRI perfusion and diffusion sequences have been gaining importance in the characterization of ischemic stroke, which provides important information for the revascularization therapy [2, 12]. Based on MRI sequences, some machine learning approaches have been recently proposed for ischemic stroke lesion prediction. The majority is based on multivariate linear regression models [5, 11], or through more advanced models such as decision trees [7] and CNN-based deep learning architectures [6]. However, up to our knowledge none of the approaches takes into consideration the temporal Perfusion Weighted Imaging data (4D PWI) for stroke lesion prediction. We hypothesize that a data-driven approach to model the raw perfusion imaging data can unveil information complementing the standard clinical perfusion maps derived using kinetic analysis.

In this paper, we propose a novel end-to-end deep learning multi-data branched network that incorporates information from the 4D PWI alongside the standard clinical perfusion and diffusion maps. From the time-stamp acquisitions of the 4D PWI, that characterize the bolus passage, we aim to learn brain blood flow hemodynamics (principal and collateral) to characterize tissue at risk of infarction (penumbra), and the unsalvageable tissue (ischemic core). Since standard perfusion and diffusion maps are generated from kinetic models followed by thresholding (based on clinical knowledge and experience), we hypothesize that there might be loss of relevant information. Hence, we aim to enhance the clinical MRI diffusion and perfusion maps with data-driven maps for stroke lesion outcome prediction. The proposed architecture was evaluated using the publicly-available ISLES 2017 dataset.

The remainder of the paper is organized as follows: Sect. 2 describes the methods of the proposed approach. Section 3 addresses the setup for stroke lesion outcome prediction. Section 4 contains the obtained results and its discussion. Finally, Sect. 5 contains the conclusions.

2 Methods

4D PWI. At the arrival of the contrast agent to the brain, during the acquisition of the 4D PWI, the healthy tissue will present a drop in the signal intensity value, which then increases as contrast agent starts diluting throughout the system. However, in the presence of ill-perfused tissue the signal intensity values barely changes, since there is no propagation of the contrast agent to the damaged tissue [12]. Figure 1 depicts such signal intensity behaviour in a patient with ischemic stroke. The perfusion blood flow dynamic, captured by the temporal slices of the 4D PWI, is responsible for the generation of the 3D MRI perfusion maps, through the application of kinetic models, deconvolutions in the time space, and clinical thresholding. Therefore, rCBF, rCBV, MTT, TTP, and Tmax perfusion maps can be viewed as surrogate parametric summaries of the raw 4D PWI, encompassing specific blood flow dynamics. From this knowledge emerged the intention to evaluate the encoding of the blood flow hemodynamics directly from 4D PWI, considering altogether complementary information over the diffusion and perfusion maps. Our approach was based on the peak concentration of contrasting agent, which is of extreme importance, since it characterizes the point where the differences of perfusion between healthy tissue and the ill-perfused tissue are higher, allowing a better detection of the penumbra [4]. We developed an automatic approach to detect the time slice where the concentration of contrast agent is higher, which corresponds to a lower signal intensity. We detect automatically the peak of concentration using k-means on the mean signal intensity and standard deviation. The peak is used to define a temporal window to retrieve specific temporal acquisitions regarding the blood dynamics needed to estimate the tissue at risk of infarction. Besides reducing the total number of temporal slices, we also enforce the same spatial-temporal space across patients. Aligning patient data across the peak concentration of contrasting agent yields a common time interval for the retrieval of information. A fixed temporal window size of 26 slices was used, based on the sampling rate of the MRI acquisition.

Fig. 1.
figure 1

General 4D PWI signal intensity over time acquisitions in a patient with an ischemic stroke lesion. The dashed line corresponds to the temporal slicing performed automatically. The system automatically detects and selects a temporal window of interest, modeling temporal acquisitions characterizing blood dynamics of interest.

Baseline Architecture. For stroke outcome lesion prediction, we based our network on the U-Net [10], which has proved to be competitive in many biomedical image segmentation applications. The output of the U-Net is fed to a bi-dimensional GRU layer [3] that processes the information in four directions (superior-inferior, inferior-superior, anterior-posterior and posterior-anterior), to enforce a greater spatial context in stroke lesion outcome prediction. The baseline architecture only considers standard diffusion and perfusion MRI maps (as employed by the state of the art approaches).

Multi-Data Branched Network. To merge information from the standard perfusion and diffusion maps and the data-driven 4D PWI data, the proposed architecture fuses two U-Nets as shown in Fig. 2. The top branch U-Net models the raw 4D PWI information, where the temporal information is coded as input channels. The first two layers consist of a feature expansion and feature reduction by 4. Recombining the feature maps allows complex interactions between temporal slices within the temporal window [15]. Since each branch is able to learn different specific features, we then merge the output of each branch, which is fed into a smaller architecture in order to take advantage from complementary information for stroke lesion outcome prediction. The final portion of the network encompasses also a bi-dimensional GRU layer present in both branches of the network.

Fig. 2.
figure 2

Overview of the proposed architecture for stroke lesion outcome prediction. Blue feature maps result from bi-dimensional convolutions.

Table 1. Methods from ISLES 2017 testing dataset. Each metric contains the mean ± standard deviation.

3 Setup

Our proposal was validated on the publicly available ISLES 2017 database, with a total of 75 cases divided into two datasets: Training dataset (n = 43) and Challenge dataset (n = 32). Each case contains a raw 4D PWI, five 3D MRI perfusion maps (rCBF, rCBV, MTT, TTP, Tmax), one 3D MRI diffusion map (ADC), and the final lesion outcome, which was manually segmented by a clinician on a 90-day follow-up (only available for the training dataset). All MRI maps are already co-registered and skull-stripped [6].

Since MRI acquisitions are from different centers, all maps were resized to the same volume space: \(256 \times 256 \times 32\). \(T_{max}\) was clipped to \(\left[ 0, 20s \right] \), and the ADC was clipped to be within the range \(\left[ 0, 2600 \right] \times 10^{-6} \mathrm{mm}^{2}/s\), as values beyond these ranges are known to be biologically meaningless [7]. Afterwards, a linear scaling was performed between \(\left[ 0, 255 \right] \). Bias field correction was performed to the 4D PWI using the N4ITK method [13] before the resizing and scaling steps.

The training dataset was divided into 36 cases for training and 7 cases for validation. For each case 550 patches of size \(88 \times 88\) were randomly extracted. The network was trained with ADAM optimizer (learning rate\(=1e-5\)), with a mini-batch of size 4, using as loss function the soft-dice loss [8]. The sum is performed for the N voxels of the patch both in the binary prediction \(p_i \in P\) and the ground truth \(\text {g}_i \in G\). The gradient of the Dice score for the \(j-th\) voxel of prediction, was calculated as in Eq. 1.

$$\begin{aligned} \frac{\delta Dice }{\delta p_j} = \frac{g_j (\sum _{i}^{N} p_{i}^{2} + \sum _{i}^{N} g_{i}^{2}) - 2 p_j \sum _{i}^{N} p_i g_i}{(\sum _{i}^{N} p_{i}^{2} + \sum _{i}^{N} g_{i}^{2})^2} \end{aligned}$$
(1)

The tests were conducted using Keras with Theano, on a Nvidia GeForce GTX-1070 8 GB, where each prediction took around 30 s.

We compare the performance of our proposal with three different studies: Standard Branch, Data-Driven Branch, and Multi-Data Single Branch. The Standard Branch architecture considers diffusion and perfusion maps. The Data-Driven Branch studies the 4D PWI. In the Multi-Data Single Branch we combined the inputs from both branches into a single network. In Table 1 we report results on ISLES 2017 test dataset, which enables us to compare with state of the art methods.

4 Results and Discussion

Table 1 contains the results obtained on the ISLES 2017 testing dataset. The Standard Branch and the Data-Driven Branch achieved the same Dice score, but with differences in the distance metrics. The Data-Driven Branch was capable of predicting the lesion outcome with higher robustness, since the Hausdorff and ASSD are lower when compared to the Standard Branch. Nevertheless, both approaches are not capable of reaching state of the art performance. However, when we fuse the information of both models, as proposed in this paper, we observe an improvement on the average Dice score, but also on Hausdorff and ASSD. Being so, both models provide distinct information of value for stroke lesion outcome prediction. Additionally, we also study the performance of a single model with all the inputs combined, referred to as the Multi-Data Single Branch architecture. Such approach reached higher Dice score than the two branches separately, and with lower distance metrics. However, it was not capable of reaching the same performance of our proposal. Therefore, having 2 U-Nets for different input data has benefits on modularity and specificity on how the information is modelled. Using separate deep learning models to directly learn intrinsic biological phenomena allowed a higher robustness and accuracy in lesion outcome location and delineation, which is sustained by the lower distance metric values and higher Dice score.

In addition, we compare our proposal with other approaches. However, such comparison needs to consider the fact that top rank approaches use multiple models (i.e. ensembling). Our proposal was able to achieve a Dice score among the top 5 ranking methods, being within the same Hausdorff range of those methods, with just a single network. From this comparison we highlight the low distance metric values obtained, and also a Dice score in the same level of the 4th ranked method. Figure 3 shows the average Dice score and respective Hausdorff for each method.

Fig. 3.
figure 3

Hausdorff Distance versus Dice score from methods of ISLES 2017 in the testing set. Note that closer to the x axis and further away from the origin is better. Ensemble methods are marked with a cross.

From this analysis we emphasize the benefits of the proposed approach to extract and model information that might not be fully characterized by the standard perfusion and diffusion maps. To assess the complementarity of the data-driven perfusion maps, we computed the normalized mutual information between all the standard perfusion maps and each of the learned feature maps from the data-driven raw 4D PWI branch. As shown in Fig. 4, low association values (less than 0.2) were obtained for all the extracted feature maps, meaning that both branches introduce new and complementary features.

Fig. 4.
figure 4

Normalized Mutual Information between the standard perfusion/diffusion maps and the feature maps from the data-driven branch, in training case 36. Values closer to 0 mean low mutual information and closer to 1 represent a high association.

Fig. 5.
figure 5

Example of extracted features for training case 33 (a) and case 36 (b), alongside the corresponding ground-truth overlapped over the ADC map.

Figure 5 shows the extracted features from the data-driven raw 4D PWI branch. Visually, feature 10 can reflect some descriptions of collateral blood flow, where features 16 and 18 focus on the surrounding lesion area itself in a complementary way. This analysis is particularly important since it can provide complementary information in the decision making process performed by the clinician, providing a better prediction of potentially salvageable tissue.

5 Conclusions and Future Work

Parametric perfusion maps can be affected by intrinsic patient physiology [12]. To cope with this effect mathematical models are applied to standardize the behaviour of the contrast bolus passing. Nonetheless, it cannot be independent from patient specific blood flow hemodynamics, which can highly affect the perfusion parametric maps by adding a wide variability in the penumbra delineation [12]. Therefore, in this work, we propose a deep learning architecture, that can process the information from raw 4D PWI data and generate complementary information to the perfusion parametric sequences, which as shown here can increase stroke lesion outcome prediction.

In the future, we intend to perform an interpretability analysis of the data-driven perfusion maps as well as an analysis of the learning patterns of the architecture to ensure the correctness of the predictions with respect to the data being used to drive the lesion outcome predictions.