In this study, we showed that in over one third of the brain tumours voxels (35.5%), standard model fitting of DCE-MRI data was inconclusive and therefore fitting these models to the data would lead to incorrect perfusion parameters. Considering a CV of 20%, only ≈ 25% of remaining voxels were found to be reliable. The reproducibility of this technique and, as a consequence, its reliability can be improved by improving the main sources of variability in quantitative DCE-MRI (the acquisition method and the quantification process) [
16]. Nonetheless, in most studies, the intrinsic heterogeneity of the lesion is ignored by quantifying the perfusion with one single pharmacokinetic model and, more importantly, carrying out statistical analyses on one single whole tumour statistic (usually the average). In this study, we investigated the reliability of DCE-MRI focusing on the quantification analysis. We took into consideration the particularly heterogeneous nature of brain tumour vascular permeability due to the presence of the BBB, as well as existence of necrotic regions and provided a method to identify robust DCE-MRI data based on a model selection procedure and a stability test.
DCE-MRI models
MR scanners usually employ post processing perfusion tools which fit DCE data with the TM. This model (together with its extended version [
32]) considers the system in a fast exchange limit [
36,
37], assuming an infinitely fast transcytolemmal water exchange between the EES and the intracellular space, which does not affect the overall signal decrease [
36,
37]. Therefore, many studies on the cell membrane water permeability coefficient have shown FXL to be physiologically unreasonable and inconsistent [
38]. The shutter speed model was introduced to reflect a more realistic tissue environment. The model accounts for the intercompartmental water exchange effect, modelling this non-infinitely-fast exchange with the mean intracellular water molecule lifetime τ
i. In 2005, Li et al. introduced a second generation of the shutter speed model which considers also a non-infinitely fast equilibrium transendothelial water exchange.
Model comparison and stability
The heterogeneity that exists in brain tumours means that one model is insufficient in explaining the different biologies that exist in different tumour regions. Multiple pharmacokinetic models are required for a complete description of the tissue. This variability is testified by the model selection procedure which showed how, in a single slice of one tumour, multiple models perform better. This result confirms the study of Bagher-Ebadian where they implemented a selection method based on nested models [
3]. They found that in the necrotic core of the tumour, models describing vascular filling with no microvascular leakage (similar to the TM) and leakage without vascular reabsorption were selected because of the lack of blood flow. They also hypothesised that the model describing leakage with reabsorption (similar to the ETM) would be selected in the fast growing rims of the lesion. Our results show that there are a number of regions in the tumour where the CA exudation is prevented by the BBB and where the concentration of CA is so low that the evidence of perfusion is missing. In this case, the use of the NEM is recommended as the use of different models could result only in overfitting the data. In fact, our results showed that no leakage of the CA into the interstitium and the lack of flow of the CA through the tissue made the NEM the model of choice for the majority of regions, particularly in the non-enhancing lesions (37.5% of voxels). The result is very close to the ETM (32%), which was the model of choice in the enhancing lesions (54.8%). This suggests that, in areas where there is enhancement, a model with three parameters performs better and that the choice is dependent on the underlying state of the tissue. In fact, both the ETM and SSM are fitted by three parameters but the third parameter is very different between the two models (
vp for the ETM describing a vascular component in the tissue, and τ
i for the SSM describing the transcytolemmal water exchange). Furthermore, with the implementation of the ESSM, we saw that the transendothelial water exchange did not have any impact on the signal (compared with parameters derived by the fitting of simpler models). It is necessary to consider that the ESSM required nine parameters to be fitted and that the cost of fitting extra parameters is often contrary to the principle of parsimony. In fact, in fitting data to a noise-limited dataset, the estimation could be very poor and dependent on the optimization procedure itself (the initial conditions, for example) [
3]. We compared the AIC values from the different fitting procedures to check whether a model with more parameters is more appropriate than a simple one. The ESSM was selected as the model of choice by < 0.1% of voxels, indicating that a model with three parameters performed better in the description of brain tumours and further confirming the poor quality of fit observed for the ESSM model. Our outcome agreed with the results of Duan et al. [
39]. Using representative in silico and clinical (cervical cancer) DCE-MRI data, they demonstrated the sensitivity of complicated models (parameters > 3) to noise and their decreasing probability of being selected in low signal-to-noise data [
39].
The reliability of DCE-MRI data is not only based on the goodness of fit of the chosen pharmacokinetic model, but also on the robustness of the extracted parameters. For this reason, we assessed, for each lesion and for each parameter, the coefficient of variation. We worked in a simulation environment where we added Gaussian noise to our signal and we fitted the noisy curves 500 times. This procedure resulted with a heterogeneous distribution of CVs that was not linked to contrast enhancement. In fact, Fig.
6 shows the plot of four different tissue activity curves together with the
Ktrans value and its CV in an enhancing and non-enhancing tumour. The curves in Fig.
6b, c belong to the same enhancing lesion and while they correspond to similar
Ktrans values of 0.81 and 0.52 [1/min], they varied by 12 and 128%, respectively. On the other hand, the curves in Fig.
6e, f belong to the same non-enhancing lesion and show regions with both a low (4%) and a high (97%) variability.
We set three different thresholds for the CV to evaluate the variability of the
Ktrans. Table
1 shows the percentage of voxels and their relative mean
Ktrans value at different CVs thresholds (10, 20 and 50%), for each lesion and for each patient. Higher grade glioma tend to have more voxels with a lower CV and also a more stable value of
Ktrans while, for some of the other patients, the mean value of
Ktrans is highly affected by the portion of voxels taken into consideration (P04, P07). This result confirmed the improper use of one average value in statistical comparisons of brain tumours. Not only because of the heterogeneity of the tissue under investigation but also, and more importantly, because it is affected by the reliability of fit within voxels.
Finally, Fig.
6 gives a graphical representation of this effect showing
Ktrans values under the 20% threshold of CV covering only 25% of voxels (an average percentage value evaluated among all patients). This result suggested that only this selection of voxels represents robust values, which can be used in the following statistical analyses, as, more importantly, in clinical evaluations. The selection of the threshold that makes DCE-MRI robust is, however, dependent on the effect size that is being measured and hence will vary across studies.
The main limitation of this study is the small size of the dataset. Furthermore, the sensitivity of DCE-MRI data to water exchange effect was reduced by the 26° flip angle acquisition (exchange-minimized approach) [
40]. As a consequence, the precision of the τ
i parameter extracted might be low.
In conclusion, DCE-MRI methods hold great promise for quantitative in vivo evaluation of permeability and vascular properties under different pathophysiological conditions. It allows us to identify, and quantitatively measure, smaller changes in permeability for pathological conditions effecting the BBB, than would be observed through visuassessment of post-contrast T1-weighted images. Different models yield different pharmacokinetic parameters and, for this reason, a model selection is critical for the appropriate analysis of DCE-MRI time courses based on the regional tissue biology, specifically permeability and vasculature. Future work needs to assess the physiological basis for each model in the reliable selection of DCE-MRI data. The applicability of each model depends on the physiology, anatomy and heterogeneity of the tumour and the tumour microenvironment. In addition, due to the noisy nature of DCE-MRI data, a model selection procedure alone is not enough: pharmacokinetic parameters need to be validated with a stability test in order to give only robust results for statistical analyses and clinical evaluation.