Background
Nonalcoholic fatty liver disease (NAFLD), a worldwide epidemic, is a chronic liver disease associated with cirrhosis, which affects 25% of adults. NAFLD is correlated with components of the metabolic syndrome, such as obesity, hypertriglyceridemia, and type 2 diabetes mellitus [
1]. NAFLD is classified into simple fatty liver (SFL), nonalcoholic fatty hepatitis (NASH) and NAFLD-related cirrhosis, and NASH is a progressive form that may lead to cirrhosis or hepatocellular carcinoma (HCC) [
2,
3]. Due to that staging of fibrosis could indicate disease progression and prognosis in patients with NAFLD, and it is a critical predictors of cirrhosis, hepatocellular carcinoma, and death [
4,
5]. Hence, it is critically important for accurate objective tests to detect liver fibrosis in patients with NAFLD.
Biopsy is still considered the gold standard for the diagnosis of liver fibrosis stage in NAFLD [
4], but invasive examination may cause sampling error and intra- and inter-observer variability, and may be complicated by morbidity and even death [
6]. Therefore, it is needed to develop noninvasive tests that can detect advanced fibrosis in NAFLD patients, but there still remains no noninvasive test approved to diagnose fibrosis in NAFLD patients.
Currently, Noninvasive markers such as Cytokeratin-18, NAFLD fibrosis score, and Enhanced Liver Fibrosis (ELFTM) Test have been proposed for evaluating liver fibrosis, but may not be sufficiently accurate in routine clinical use. Ultrasound-based imaging tests, such as transient elastography (FibroScan) and acoustic radiation force impulse imaging (ARFI) elastography [
7] have high (21–50%) failure rates in obese patients [
8], and are evaluated only a limited portion of the liver, and findings may be influenced by necroinflammatory activity [
7].
Magnetic resonance elastography (MRE), a magnetic resonance-based imaging technique, could utilize shear waves to characterize liver fibrosis. MRE has made significant progress as a non-invasive test for staging liver fibrosis in NAFLD due to its high accuracy in the evaluation of liver fibrosis and also due to the possibility of evaluating a large area of the parenchyma with the option of choosing the region of interest [
9]. Although several recent studies have reported a high diagnostic accuracy of MRE in patients with NAFLD [
3,
10‐
20], but those studies had a limited sample size. In this study, we searched all related studies, and performed a systematic review to systematically evaluate the accuracy of MRE in the diagnosis of liver fibrosis in patients with NAFLD.
Methods
Literature search strategy
PubMed, Web of Science, CNKI, Embase and Cochrane library database were searched for related literatures in Chinese or English regarding the diagnosis and staging liver fibrosis of NAFLD by MRE. The publication time was from January 2008 to December 2018. The retrieval strategy was (“NAFLD” OR “nonalcoholic fatty liver disease) AND (“liver fibrosis” OR “hepatic fibrosis”) AND (“MRE” OR “MR elastography” OR “magnetic resonance elastography”). In order to perform comprehensive search, the reference lists of the eligible literatures were also searched.
Inclusion and exclusion criteria
Inclusion criteria
(1)
NAFLD Patient is older than 18 years old;
(2)
Studies evaluated the diagnostic performance of MRE, if the study includes other diagnostic test, the corresponding data will still be included in the study;
(3)
Studies used biopsy as the gold standard.
(4)
If the study population contained NAFLD and other chronic liver disease, the data of NAFLD is separately extracted;
(5)
Literatures published from January 2008 to December 2018;
(6)
True positives, false positives, false negatives and true negatives can be directly or indirectly extracted from the literature to construct a 2 × 2 table.
Exclusion criteria
(1)
Reviews, conferences, case reports, animal experiments and technical literature, etc.;
(2)
The original data is incomplete to construct the four-grid table;
Two researchers independently screened the literature and extracted the information according to the established inclusion and exclusion criteria, and cross-checked. If there is any inconsistency, it will be resolved through negotiation. The extracted information mainly includes: the first author, publication year, country, study type, the interval between gold standard and MRE examination, patient information (age, BMI, gender), and four-grid data true positive (TP), false positive (FP), false negative (FN), and true negative (TN).
Assessment of methodological quality
The quality of the included literature was assessed independently by 2 researchers, and differences were resolved through discussion. Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) tool was introduced to assess the quality of the included studies. The software Review Manager (version 5.3) was used to present the result of assessment. Each item included in the studies in QUADAS-2 tool was evaluated as yes, no or unclear [
21,
22]. The Metavir liver fibrosis staging evaluation system was used. The system was consistent with the liver fibrosis stage of Batts-Ludwing and Scheuer evaluation system, and was divided into five stages of F0, F1, F2, F3 and F4, among which F0 was non-fibrotic. F ≥ 1 was mild liver fibrosis. F ≥ 2 was significant liver fibrosis. F ≥ 3 was severe liver fibrosis. F ≥ 4 was early cirrhosis.
Statistical analysis
According to the required information, data extraction form was created to calculate relevant indicators, and the data was processed by STATA (version 14.0).
Assessment of heterogeneity and publication bias
The Q test was used to evaluate the heterogeneity of the included literatures, and the degree of heterogeneity was determined according to the I2 value. If I2 ≤ 25%: less heterogeneity, 25% < I2 ≤ 50%: moderate heterogeneity, I2 ≥ 50%: greater heterogeneity. When the heterogeneity is large, the bivariate mixed effect model should be further adopted. Deeks’ funnel plot was used to detect publication bias when detecting publication bias, and P ≤ 0.01 indicated that the publication bias was more significant.
Summery statistics
By processing the original data of the included studies, true positive, false positive, false negative and true negative were extracted. Systematic review was performed using STATA 14.0 software for F ≥ 1, F ≥ 2, F ≥ 3, and F ≥ 4 separately in the included literatures, the pooled sensitivity, the pooled specificity, the pooled positive likelihood ratio, the pooled negative likelihood ratio and the pooled diagnostic ratio were calculated. The forest graph and the hierarchical summary receive operating characteristic (HSROC) curve were drawn, and the area under the curve (AUC) was calculated to obtain the AUROC value and its confidence interval. P < 0.05 indicates statistical significance.
If there is a high heterogeneity in the included studies, the single independent regression of the continuous independent variables is performed by STATA 14.0 software, and the sub-combinations of each independent variable are calculated and the results obtained.
Sensitivity analysis
In order to observe the stability and heterogeneity of the results of the summery statistics, each included literature was excluded for pooled analysis, and the summery statistics were performed on each group. The results of the summery statistics, I2 and those obtained before the exclusion were compared to observe the results.
Discussion
In this systematic review, we performed pooled analysis of diagnostic performance of MRE in 12 studies with 910 patients with NAFLD. Regarding the overall diagnostic accuracy of MRE in patients with NAFLD, we found that the pooled sensitivities of MRE for diagnosis of liver fibrosis stage F ≥ 1, F ≥ 2, F ≥ 3, F ≥ 4 were 0.77, 0.87, 0.89, and 0.94 respectively, and the pooled specificities were 0.90, 0.86, 0.84, and 0.75, respectively. The area under the SROC curve was 0.89, 0.93, 0.93, and 0.95, among which MRE has the highest accuracy in F4 stage. Due to NAFLED-associated fibrosis is a strong predictor of mortality, cirrhosis, and hepatocellular carcinoma in patients, it is possible to further develop effective clinical treatment by measuring liver elasticity and staging of liver fibrosis in patients with MRE. And the pooled AUROC of stage F ≥ 2, F ≥ 3 and F ≥ 4was greater than 90%, suggesting excellent discriminative ability for detection of liver fibrosis stage. The optimal threshold values of F ≥ 1, F ≥ 2, F ≥ 3, and F ≥ 4 were different (F ≥ 1 group: 1.77–5.02Kpa, F ≥ 2 group: 2.38–5.37Kpa, F ≥ 3 group: 2.43–5.97Kpa, F ≥ 4 group: 2.74–6.7Kpa), and thus this study couldn’t determine the optimal threshold of fibrosis stage. The possible reasons are as follows: different study designs (10 prospective studies, 2 retrospective studies); different pathological interval time; different MRE technology, parameter setting (6 1.5 T MR, 6 3.0 T MR) and operator’s technical experience; when post-processing is used to obtain an elastic diagram, the region of interest needs to be manually drawn.
In a systematic review by Singh et al. that included 9 studies reporting on 232 patients [
5], the pooled AUROC for MRE diagnosis of NAFLD liver fibrosis stage (F ≥ 1, F ≥ 2, F ≥ 3, F ≥ 4) was 0.86, 0.87, 0.90, and 0.91 respectively, and the values of each group are lower than that in our study. The possible reason may arise from 1) we included more included studies in this study; 2) the bivariate mixed effect model was adopted; 3) as a systematic review, the study of Singh et al. is more like a data-sorted analysis without relevant heterogeneity analysis and publication bias analysis. In the meta-analysis by Xiao et al. [
23]., they compared the performance of different noninvasive methods for diagnosing liver fibrosis including APRI, FIB-4, BARD score, NAFLD fibrosis score, FibroScan, shear wave elastography (SWE) and MRE in NAFLD, and found that MRE and SWE may have the highest diagnostic accuracy for staging fibrosis in NAFLD patients (the summary AUROC values was 0.96 and 0.95), which accordingly indicated the high accuracy of MRE diagnosis of NAFLD liver fibrosis. However, the study by Xiao et al. is a meta-analysis of multiple examinations applied to NAFLD, the number of relevant studies is relatively small compared with our study, so the specific analysis of MRE is not detailed enough, and the analysis method is less, so the AUC value is lower than this study. In our study, we performed more deep analysis on diagnostic accuracy of the MRE in liver fibrosis stage of NAFLD, further conducted meta regression, subgroup analysis to find sources of heterogeneity, and sensitivity analysis to assess the influence of each study on the overall result. It was proved by many studies that liver stiffness is positively correlated with the severity of liver fibrosis. Elastography is a dynamic imaging technique for measuring the mechanical properties of tissues, which can detect the tissue elasticity (stiffness) [
24]. This feature prompts the obvious advantages MRE in the diagnosis of liver fibrosis stage of NAFLD [
2,
4].
Georges et al. concluded that hepatocyte swelling and stromal edema (ballooning) caused by inflammation would lead to increased liver stiffness, while Ichikawa et al. observed that hepatitis activity grade may also influence liver stiffness measured using MRE [
25,
26]. Therefore, large-scale prospective studies are still needed to investigate the effect of inflammation on the liver hardness measured by MRE. The diagnostic performance is various based on the different technique or machine for MR evaluation. Wagner et al. indicated that the failure rate of overall examination in NAFLD patients was about 7.7%, of which the failure rate of liver MRE was only 3.5% at 3.0 T, while the failure rate was 15.3% at 1.5 T [
27]. Loomba et al. performed a prospective study included 100 patients with NAFLD to assess the accuracy of 2D-MRE at 60 Hz, 3D-MRE at 40 Hz and 60 Hz in diagnosing advanced fibrosis [
10]. They found that at a threshold of 2.43 kPa, 3D-MRE at 40 Hz had sensitivity 1.0 and specificity 0.94 for diagnosing advanced fibrosis, and concluded that 3D MRE at 40 Hz has the highest diagnostic accuracy in diagnosing NAFLD advanced fibrosis [
10]. Compared to 2D-MRE, 3D-MRE allows for improved assessment of spatial patterns of hepatic fibrosis and focal lesions [
10]. Salomone et al. suggest that NAFLD fibrosis score can be considered an accurate tool for the stratification of the risk of death in NAFLD patients [
28]. Although the diagnostic accuracy of MRE is high as showed by results, it would be important in the future studies to assess the predictive value of MRE for mortality, the “real” outcome for patients with NAFLD.
Due to the high heterogeneity of the included studies, the bivariate mixed effect model was adopted to perform pooled analysis of diagnostic indicators of each group. The sources of heterogeneity were analyzed through sensitivity analysis, meta regression and subgroup analysis and we conclude that the heterogeneity might be related to variety in designs or quality of the included studies, study population and difference in field strength and parameters of MRE equipment. This systematic review has certain limitations: (1) 3 included studies contained other chronic liver diseases, and the amout of NAFLD cases was relatively small relatively to other included studies, and the data may have confound bias, (2) We only retrieves the study in English database, but didn’t search the study published in other languages, which may have information bias, (3) Fibrosis staging errors often occur in clinical diagnosis, so there is the possibility of information bias [
29].
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.