Introduction
Parastomal hernia (PSH) is a common complication following stoma formation and can cause discomfort, pain, strangulation, and incarceration of intestines, as well as difficulties with stoma care [
1]. The exact incidence of PSH remains unclear, but most studies report high rates of over 30%, especially in case of colostomy [
1,
2]. Still, reported rates vary widely in the literature, ranging from 0 to 86% [
1,
3,
4]. This variability depends on several factors such as the length of follow-up, patient and surgical characteristics including type of stoma, method of stoma construction, but also on definition of PSH [
5‐
8].
Moreover, several different diagnostic modalities can be used for the diagnosis of PSH, making it a factor affecting the incidence rate. In practice, clinical examination is the first method to assess the presence or absence of a PSH. In case of doubt about the diagnosis or to help plan for the surgical approach and management, an imaging modality can be chosen, such as ultrasonography (US), computed tomography (CT) scan, or magnetic resonance imaging (MRI) scan.
In addition, the lack of a clear definition and the use of several different classifications of PSH is a significant problem in PSH research [
9]. Some studies use imaging to confirm the diagnosis of PSH, whereas others only use imaging in clinically unconvincing cases [
10,
11]. Due to these differences, protocols often deviate between clinical practice and the research setting, as well as between clinical studies.
In 2014, the European Hernia Society (EHS) proposed a classification depending on the defect size and the presence of a concomitant incisional hernia [
9,
12]. With the ability to correctly compare different studies and thus to provide a uniform research reporting, this classification is recommended by the EHS to use in PSH research [
9]. However, these guidelines also emphasize the uncertainties on the accuracy of clinical and imaging diagnoses of PSH.
Therefore, the aim of this systematic review is to evaluate the accuracy of the different modalities used to identify PSH after stoma construction or after PSH repair. The secondary objective is to assess the inter-observer variation, correlations between (a) symptomatic PSH and imaging or surgical findings, and identify different definitions and classifications used for diagnosis of PSH.
Methods
The study protocol was registered in PROSPERO (CRD42018112732; International Prospective Register of Systematic Reviews). The Preferred Reporting Items for Systematic Reviews and Meta-Analyses of Diagnostic Test Accuracy (PRISMA-DTA) statement was followed [
13]. Moreover, the article by Wille-Jørgensen et al. on systematic reviews and meta-analyses in coloproctology was used for methodological guidance [
14].
Systematic literature search
A systematic search was performed by a biomedical information specialist instructed by first author (G.S.). Embase, MEDLINE, Cochrane, Web of Science, and Google Scholar databases were searched on March 5, 2019. Full search strategies and results per database are presented in Appendix
1. There was no limit on date of publication. After duplicate removal, studies were reviewed independently by two researchers (D.L. and G.S.) on title and abstract, followed by full-text review using EndNote X9®. Differences in article selection were discussed, and articles were included or excluded after consensus was reached.
Inclusion and exclusion criteria
Studies were included if they met the following criteria: (1) inclusion of patients that underwent stoma construction (ileal conduit, ileo- or colostomy) or PSH repair surgery; (2) studies assessing the performance of a diagnostic modality (clinical examination, CT, US, MRI, or diagnosis at surgery) used for the diagnosis of PSH. Only (non) randomized controlled trials, prospective, or retrospective cohort or case-control studies were included. Excluded were as follows: studies reporting on pediatric patients (< 18 years of age), studies reporting only on gastro-/oesophago- or duodenostomies, studies in which no data on diagnostic modalities were described, and studies with unclear diagnostic work-up, so that diagnostic data could not be extracted. Studies not written in English, case reports, letters, comments, abstracts, or posters were also excluded.
Data from included studies were extracted by one researcher (G.S.) and were checked independently by another researcher (S.H.) using standard forms covering study characteristics (year, journal, study design, level of evidence, and risk of bias), patient characteristics (number of patients, sex, age, body-mass index, and follow-up), surgical characteristics (indication for surgery, acute or elective, laparoscopic or open abdominal surgery, reoperation, stoma type, use of mesh, location of mesh), and outcome characteristics (definition and classification of PSH, diagnostic modalities and corresponding incidence of PSH and inter-observer variation). Since there is no gold standard modality for diagnosing a PSH, the detection rates of the different diagnostic modalities are compared within each study. The available absolute data and incidence rates of modalities are presented and compared in contingency tables. Intra-class correlation coefficient and Kappa values for inter-observer variation were extracted and presented. Inter-modality agreements were expressed as Cohen’s Kappa values for each study if possible. Statistical level of agreement per Cohen’s Kappa value range is presented in Supplemental table
1. The pooled Cohen’s Kappa value was calculated in a random effects model using inverse variance method, using meta-package for R version 3.5.1. (R Foundation for Statistical Computing, Vienna, Austria).
Study quality assessment
Two researchers (S.H. and G.S.) independently assessed the quality of included studies by assessing the level of evidence according to the Oxford Centre for Evidence-based Medicine Levels of Evidence [
15] and the possible risk of bias using the Cochrane Collaboration’s tool for assessing risk of bias [
16] and the QUADAS-2 tool [
17] with RevMan 5.3 (Cochrane Centre, Denmark).
Discussion
Today, in both clinical practice and research there is no gold standard modality to examine patients for the presence of PSH. The literature on this subject is diverse and inconclusive. Facilitating comparison between studies on PSH remains challenging, due to, among others, the number of existing definitions, imaging modalities, and classifications. Indeed, this systematic review shows a great variance in detection rates of PSH between different diagnostic modalities.
Most included studies compared CT with clinical examination. The majority of these studies found higher incidence rates by using CT [
2,
19‐
21,
23,
26,
28,
30,
32‐
34,
37‐
40]. However, some studies showed contradictory results in favor of clinical examination [
24,
27,
36]. This discrepancy between studies could be explained by the technical differences in examination of the patients’ abdominal wall, bearing in mind that a patients’ body position and the use of Valsalva maneuver during examination might affect detection rates [
31]. It is possible to use Valsalva maneuver in case of patients undergoing CT imaging. However, this is rarely reported in studies.
Gurmu et al. found a low inter-observer reliability when patients were clinically examined by surgeons, indicating that PSH is difficult to diagnose by clinical examination [
25]. This was also stated by Sjödahl et al. who found poor correlation between US and findings at clinical examination [
41]. If these examinations are performed correctly, the use of dynamic modalities such as US and clinical examination may have some advantages compared with the more static and expensive CT or MRI. However, the inter-observer variation and diagnostic accuracy of US have not been investigated thoroughly. In contrast, more evidence is available on the diagnostic performance of clinical examination and CT. For research purposes, the combined use of these two modalities might be recommended since multiple studies found significant disagreements in detection rates between both modalities [
27,
33,
34].
This is the first review to date that provides a complete overview of the research of the available literature on different diagnostic modalities for PSH diagnosis. Nevertheless, it is important to note that this systematic review covers studies that investigate the PSH incidence rates in the setting of a research protocol that might not always fully reflect standard clinical practice. Also, the minority of included studies has the accuracy of the used diagnostic modality as primary outcome [
20,
23,
25,
31,
34,
35,
41,
42]. In clinical practice the main goal is to identify symptomatic PSHs that might require treatment and for asymptomatic patients it seems unnecessary to follow a full diagnostic workup. Therefore, the clinical approach might differ from that in a research setting. In general, patients with stoma problems such as pain, appliance leakage, bowel obstruction, or symptoms of incarceration first undergo clinical examination by a stoma nurse and/or clinician. When PSH is identified clinically or the diagnosis is inconclusive the clinician can consider an imaging modality to confirm the diagnosis, taking into account patient safety, patient comfort, availability, and costs, whereas for research purposes, factors as costs and availability might play a less important role in the decision on imaging modality.
Intrastomal 3D US is a relatively new imaging modality for diagnosing PSH or other stoma-related pathology [
44]. 3D US seems to be an accurate imaging modality with a sensitivity of 83% when compared with peroperative diagnosis [
35]. With this imaging modality it is possible to examine patients in erect position and without the use of radiation, providing potential advantages over CT. There is, however, too little available evidence for this technique to consider this as standard imaging modality for the diagnosing of PSH.
In contrast to diagnosing incisional hernia, traditional two-dimensional ultrasonography (2D US) is not often used for diagnosing PSH in both research and in clinical setting. However, 2D US is the most patient-friendly, inexpensive, and practical modality of all imaging modalities. This systematic review included only one study comparing 2D US to clinical examination for diagnosing PSH. However, to make any recommendations on 2D US, it would be interesting to compare ultrasonography with other imaging modalities in the future.
Another important aspect of clinical practice with regard to the use of diagnostic modalities is that many stoma patients have a stoma created after oncological resection, and for these patients a CT is routinely made during follow-up to detect potential cancer recurrence. Although some PSHs occur many years after stoma construction, most PSHs develop within the first years after stoma construction and are thus likely to be identified with follow-up CT [
5]. This is one of the main reasons why most included studies used CT instead of MRI or US. However, with the patient in supine position a CT is not a reliable tool for diagnosing PSH and a CT with the patient in prone position is associated with higher inter-observer agreement and an increase in sensitivity [
31]. By using CT routinely for cancer follow-up, asymptomatic PSHs will appear more frequently. Although not entirely insignificant, studies do not often distinguish between symptomatic and asymptomatic when reporting PSH incidence rates.
Evidently, patient-reported outcomes are of paramount importance in the context of stoma-related complications. Patients know their own bodies in a way no physician possibly can, and have to take care of the stoma several times a day, whereas the physician examines the patients’ stoma once or maybe twice. Any physical differences of the stoma will be noticed by the patient, which probably makes it more reliable than the studied modalities on the existence of bulging at some time point during follow-up. Currently, prospective cohort studies, such as the PROPHER and CIPHER studies (ISRCTN17573805; ISRCTN registry), are assessing the value of subjective and objective outcomes after stoma construction or for parastomal hernia treatment, respectively.
Despite the increased interest in PSH care and research in recent decades, there is still no consensus regarding the definition of PSH or a gold standard for diagnosis [
9]. Although many definitions consisted of similar terms and contexts, some definitions differ considerably which can lead to discrepancies in detection rates. Moreover, the fact that five included studies have not even described the definition of PSH, emphasizes the need for uniform reporting in studies regarding PSH [
18,
23,
24,
29,
35]. This heterogeneity in diagnostic procedures makes it difficult to compare studies and to determine an accurate incidence of PSH. Therefore, a clear and standard definition and diagnosis of PSH is of paramount importance. The European Hernia Society (EHS) acknowledged this problem and proposed to use the definition of PSH introduced by Muysoms et al. [
45]: “An incisional hernia through the abdominal wall defect created during placement of a colostomy, ileostomy or ileal conduit stoma”. Furthermore, the EHS proposed a new classification for PSH, which might help to facilitate more uniform reporting of outcomes in PSH research (Appendix 4 Table
10) [
12].
Limitations
This systematic review has some limitations. Firstly, the low level of evidence of included studies is an important limitation. Eleven studies have a retrospective study design, which is prone for selection and information bias. Also, most studies presented small study populations. Nevertheless, to give a complete overview of diagnostic accuracy and variation of the different modalities, studies of low quality or studies with small samples were not excluded and a comprehensive overview of study characteristics and study quality assessment was provided.
Secondly, significant heterogeneity between studies was demonstrated, as operation and stoma types, use of mesh reinforcement, patient characteristics (e.g., age and BMI), and follow-up duration differed between included studies. Besides the choice of diagnostic modality, all these factors also influence the PSH incidence rates. Although it was not possible to account for this, these factors would be of less importance for the within-study diagnostic performance, since diagnostic modalities were only compared within each study. However, some studies did not investigate the PSH incidence rate or the accuracy of the diagnostic modalities as primary objective. As a result, the incidence could easily be underestimated. Accordingly, the results of diagnostic performance may also be affected.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.