Background
Pancreatic cancer is the fourth largest cancer-related cause of death worldwide, with an overall 5‑year survival rate of 2–7% [
1‐
4]. Although approximately 15% of patients are diagnosed with a seemingly resectable tumour, 10–35% of these tumours prove to be irresectable during explorative laparotomy [
5]. Also, although some studies suggest that the percentage of resectable tumours may be increased by preoperative radiochemotherapy, those studies were mainly single-arm phase II trials and many did not report using the intention-to-treat approach [
5].
The Dutch Pancreatic Cancer Group (DPCG) started the multicentre, randomised, controlled PREOPANC trial (Netherlands Trial Registry: NTR 3709), which became open for accrual in April 2013 [
6]. In this phase III trial, patients with (borderline) resectable pancreatic tumours are randomized between (1) direct explorative laparotomy and (2) preoperative radiochemotherapy followed by explorative laparotomy.
For pancreatic cancer, it is well known that target volume delineation can be difficult [
7]. In our department, the planning CT scan is combined with a contrast medium and a four-dimensional CT scan (4DCT) to determine the respiratory-induced motion of the tumour and neighbouring organs [
8]. Because of the large day-to-day variation in tumour position, in our clinic we currently use intratumoural fiducial markers for delineation and daily online position verification [
8‐
12]. The radiation oncologist delineates the gross tumour volume (GTV) on the CT scan and expands this volume to encompass the GTV in all respiratory phases. This expanded volume is defined as the internal gross tumour volume (iGTV). The iGTV combined with an additional margin forms the internal clinical target volume (iCTV), which is then expanded to create the planning target volume (PTV).
A dummy run procedure at the beginning of a clinical trial is a good tool to improve protocol compliance with the radiotherapy prescriptions and to improve uniformity of the treatment. In multicentre trails, a dummy run can diminish heterogeneity in treatment quality and can also be used to detect correctable variations in treatment [
13‐
18].
The aim of this dummy run procedure was to evaluate compliance with the radiotherapy protocol, and the uniformity of delineation and treatment planning, among institutions participating in the DPCG PREOPANC trial.
Methods
General
All 11 radiation oncology departments participating in the PREOPANC trial were asked to participate in the dummy run and were encouraged to complete the dummy run before the start of patient accrual. The opening for accrual varied between the different institutions: the first opened in April 2013 and the last in November 2015. In the PREOPANC trial, the preoperative radiochemotherapy consists of a total dose of 36 Gy in 15 fractions of 2.4 Gy (5 fractions/week), with concurrent gemcitabine (1000 mg/m
2 on days 1, 8 and 15) preceded and followed by a cycle of gemcitabine (1000 mg/m
2 on days 1 and 8) with one week rest in between the three cycles [
6].
Dummy run procedure
The ‘dummy’ patient had a borderline resectable pancreatic tumour in the pancreatic tail and was selected by the Quality Assurance (QA) team. This team consisted of one radiation oncologist (GT), one radiation oncology resident (EV), two medical physicists (AB, JV) and two researchers (EL, AH). Institutions were provided with a set of images and clinical data of this ‘dummy’ patient to be uploaded to the local delineation and treatment planning system. The data contained the diagnostic CT scan (4 phases with vascular contrast) with the radiological report, the planning CT scans, including a contrast-enhanced 3DCT (fast scan) and a 4DCT, the dummy run instructions and a questionnaire.
Prior to the planning CT scan, the patient had three intratumoural fiducial markers (Visicoil, Core Oncology, Santa Barbara, CA, USA) implanted by the gastroenterologist during endoscopic ultrasound. In our institution, intratumoural fiducial markers are standard practice for daily position verification of the tumour with cone beam CT [
11,
12]. The planning CT scan images were acquired using a GE LightSpeed RT16 scanner (General Electric Co., Waukesha, WI, USA): slice thickness 2.5 mm. First, the CT scan with vascular contrast was obtained (fast scan), followed by a 4DCT scan which provided the respiratory-induced motion of the tumour in 10 respiratory phase scans. In addition, a maximum intensity projection and an average intensity projection (Ave-IP) were reconstructed from the data of the 10 respiratory phase scans. On the provided planning CT scan, the organs at risk (OARs; kidneys, liver and spinal cord) were already delineated to exclude dosimetric differences due to differences in OAR delineation.
The responsible radiation oncologists were instructed to delineate and expand the target volumes (GTV, iGTV, iCTV and PTV) and supply a radiation plan according to the PREOPANC trial protocol (Electronic Supplementary Material; [
6]). The radiation oncologists received this protocol at the moment the institution had received approval for inclusion of patients in the PREOPANC trial. Furthermore, the responsible radiation oncologists were asked to fill out a questionnaire concerning treatment facilities and procedures. The QA team performed the quality evaluation of all submissions.
Delineation prescriptions
Observers were asked to delineate the GTV on the Ave-IP projection according to the protocol (Electronic Supplementary Material). This GTV had to be expanded to encompass the tumour on all respiratory phases of the 4DCT scan, thus creating the iGTV. The iCTV was defined as the iGTV with a 5-mm uniform margin to account for possible microscopic tumour extensions. An alternative (but more time-consuming) possibility was to delineate the GTV in all respiratory phases, expanding each of these GTVs with 5 mm and summing these volumes, forming the internal target volume (ITV). The PTV included the iCTV (or ITV) plus a 10-mm uniform margin. Consultation of a diagnostic radiologist during the delineation process was allowed.
Treatment planning technique
In the PREOPANC trial, at least a 3D conformal treatment delivery technique was required (Electronic Supplementary Material; [
6]). More sophisticated techniques, such as intensity-modulated radiation therapy (IMRT) and volumetric modulated arc therapy (VMAT) were also allowed. The dose of 3600 cGy in 15 fractions should be specified according to the International Commission on Radiation Units and Measurements (ICRU) guidelines; at least 95% of the prescribed dose should cover ≥98% of the PTV [
19‐
21]. For the OARs, the prescriptions in the PREOPANC trial were recommended in equivalent doses (fraction dose of 2 Gy). Due to a fraction dose of 2.4 Gy, we also supplied the physical doses of the constraints for the OARs to calculate treatment planning. If the mean physical dose to one kidney exceeded 1680 cGy, irradiation of the contralateral kidney should be avoided as much as possible. The mean physical dose of the liver was not to exceed 2640 cGy. Since the total dose in the slightly hypofractionated schedule of 36 Gy is relatively low, the dose to the spinal cord will (at maximum) be well below the tolerance. Similarly, the dose constraints for stomach and small bowel will not be exceeded.
Dummy run assessment
For assessment of the first three institutions, we arranged a meeting with all members of the QA team to ensure that the procedure of evaluating the target delineation and treatment planning/dosimetry was uniform, and conform the dummy run protocol. The first five delineations were evaluated by the senior radiation oncologist (GT) and the radiation oncology resident (EV) together (both experienced in delineation of pancreatic tumours) This evaluation determined whether the delineation was adequate with respect to the iGTV location and delineation, compared with the tumour seen on the diagnostic CT and described in the diagnostic report. After a standard procedure of evaluating the delineations was drafted between the radiation oncologist and the resident, the radiation oncology resident alone assessed the remaining six delineations. The mean volumes of the iGTV and PTV, as well as the ratio of the smallest and largest iGTV and PTV, were calculated.
The treatment plans were evaluated based on dose coverage of the PTV, dose to the OARs, and the radiation delivery technique used. The conformity index (CI) described by the Radiation Oncology Therapy Group (RTOG), defined as the ratio between the volume enclosed by the reference isodose (95%) and the PTV [
22], was used as a measure for conformity with the high-dose region to the PTV. The first three submitted treatment plans were evaluated by the entire QA team to assess the standard procedure that was drafted for this dummy run in the protocol. After evaluating the procedure, the remaining treatment plans and dosimetric parameters were evaluated by one of the researchers (EL) and the radiation oncologist resident (EV).
An overall conclusion concerning protocol compliance was made for each institution. Feedback was provided to each participating institution on an individual basis within 6 weeks after submission. In case of deviations that occur in the majority of the participating institutions, the protocol instructions were to be modified.
Discussion
In multicentre clinical trials, variation in protocol interpretation and noncompliance are well-known pitfalls that may lead to protocol deviations and discrepancies between participating institutions. A modern protocol prescription of radiotherapy leaves some freedom with regard to radiation technique, but clearly describes target volumes, dose requirements and constraints for OARs [
23]. Quality assurance in the form of a dummy run can be beneficial because this can minimise clinical variations in treatment within the trial. Previous dummy run studies (in various tumour groups) showed that a dummy run ensures optimal radiotherapy delivery based on recommendations and protocol adaptations [
13,
15,
16]. Our results showed acceptable deviations, i. e. essentially, inevitable interobserver variations in target delineation, and no prescription adaptations were required for the PREOPANC protocol. Because no adaptations to the protocol were required we did not repeat the dummy run and did not perform a central review of individual patients prospectively.
In the present dummy run, we considered our assessment (EV, GT) as the standard for evaluation of the pancreatic tumour delineations of the ‘dummy’ patient. Consultation between the QA team and the diagnostic radiologist concerning the borders of the tumour took place before evaluation of the delineations. We are aware that no gold standard is available for the delineation of pancreatic tumours and that evaluation of the delineations by two clinicians is subjective and prone to error. Creating a consensus delineation with more observers as a gold standard may be helpful, as was shown in cervical cancer [
24]. In pancreatic cancer, the study of Carvatta et al. [
25] proposed guidelines for high-risk nodal areas and CTV delineation, showing acceptable interobserver variation. No guideline is available for the GTV delineation in pancreatic cancer and large interobserver variation in pancreatic cancer delineation is reported [
26,
27]. For that reason, we did not provide a reference contour.
The volumes of GTV and iGTV can be used as an objective measure to indicate variation [
28]. Our results show that, for this ‘dummy’ patient, the mean iGTV volume of the pancreatic tumour was 41.5 cm
3, with a ratio between the largest and the smallest iGTV of 4.0. Two earlier dummy runs, in three different cases of pancreatic cancer, reported ratios of the largest to the smallest GTV volume of 9, 3 and 6.8 [
29,
30]. In the present study, the questionnaire revealed that there is little experience in the Netherlands with preoperative radiotherapy of pancreatic tumours. The variation in delineation between observers may decrease in time when institutions gain more experience.
Tumour extension outside the pancreas is an important problem in the delineation of the GTV. Tumour extension may result in only slight differences in density compared with the surrounding fat tissue [
29]. Also, no delineation guidelines are available for pancreatic cancer, only for postoperative radiotherapy of pancreatic cancer [
31].
In our study, institution #10, applied the CTV margin before expanding the GTV to account for respiratory motion. This suggested that the CTV was manually expanded in the 10 respiratory phases to include the visible tumour; this means that the microscopic tumour extensions are not properly accounted for in the CTV. As a result, the PTV margin used may be insufficient to account for all remaining uncertainties. In this institution, the PTV volume (241 cm
3) was not the smallest of all the institutions. On an annual basis, this institution treats (on average) only 5 patients with pancreatic cancer, implying that the experience in delineation can be considered only moderate as compared with the other institutions. Unfortunately we only noticed this deviation in a later stage of the study, so this feedback was reported in a later stage of the dummy run to the concerned institution. The consequences of the minor deviations in iGTV delineation for the PREOPANC trial outcome are unclear, as the impact of interobserver variation on a trial outcome is not yet fully explored [
28]. However, by producing a clear protocol and using a dummy run to test protocol compliance we aimed to reduce this variation between institutions. RTOG studies reporting on deviations in delineation in pancreatic trials showed unacceptable deviations in around 5% of the cases in a total of four studies, without knowing the impact on outcome [
32]. In a multivariate analysis, one study showed that the radiation therapy quality assurance (per-protocol failure of adherence to guidelines) was significant for survival of patients with pancreatic tumours [
33].
One of our institutions used conformal 3‑field treatment planning, with a CI of 1.52 and higher doses to the spinal cord and left kidney compared with the other institutions. This illustrates that conformity and OAR sparing can be improved with techniques such as VMAT or IMRT, as described by Nabavizadeh et al. [
34]. These latter authors showed that the conformity for both VMAT and IMRT techniques was superior compared with a 3D-CRT technique. VMAT provided a comparable CI to IMRT, with reduced mean doses to the PTV and an overall reduction in treatment time [
34].
It is known that respiratory-induced abdominal tumour motion can be substantial. Almost all our institutions used 4DCT in the treatment of patients with pancreatic cancer, taking into account the differences in position of the GTV during the respiratory cycle. This can result in better coverage of the pancreatic tumour within one fraction. Intratumoural fiducial markers and daily online position verification are used to compensate for day-to-day position variation [
11,
35]. The above-mentioned procedures in radiotherapy of pancreatic cancer are recommended in the PREOPANC trial. Also, the interobserver variation in pancreatic tumour delineation is currently being quantified in the Netherlands, by means of a study comparing delineation on 3DCT and 4DCT.
Unfortunately, no quantitative evaluation of the variation in delineations, such as the kappa index, could be performed with the software used for the present study. This is a limitation of this study. The kappa index can be used for measuring observer agreement when there is no gold standard [
36]. However the added value of the kappa index in this study would be limited since we only had a single dummy patient. No other radiotherapy quality assurance measures were undertaken, as the compliance to protocol prescriptions was acceptable and the observed deviations were inevitable interobserver variations in target delineation. Although guidelines are available for the postoperative setting [
31], no gold standard exists for preoperative target delineation for pancreatic cancer. Also, since extensive QA procedures are known to hamper accrual [
37], we considered the value of adding prospective individual case review (or other measures) to be too small to implement them.