Background
MicroRNAs (miRNAs) are ~22 nucleotides long non-protein-coding RNAs involved in post-transcriptional regulation of gene expression [
1,
2]. Mature miRNAs join the RNA-induced silencing complex (RISC) in the cytoplasm and bind to messenger RNAs (mRNAs), whereby they block translation or induce degradation of the mRNA transcript. Each miRNA targets specific genes through sequence complementarity between the miRNAs "seed" region (nucleotides 2–7) and a miRNA recognition element (MRE) in the mRNA, most often located in the 3'-untranslated region (3'UTR). More than 2,500 mature human miRNA sequences have been annotated so far (http://www.mirbase.org) [
3]. Because of the targeting of the miRNA seed region to a specific 7-nucleotide MRE in the mRNA, each miRNA can potentially regulate the expression of hundreds of genes. Indeed, it is estimated that most of the protein-coding genes are regulated by miRNAs [
2,
4]. Accordingly, most, if not all, developmental, physiological and disease processes, such as cancer, are regulated by miRNAs [
5,
6]. MiRNAs are involved in all the hallmark capabilities of cancer [
7‐
9]. Deregulation of miRNA expression is associated with cancer development, and changes in miRNA expression are associated with survival in patients with cancer [
9].
MiRNAs have been investigated intensely as potential biomarkers in cancer. A commonly used method for determination of miRNA expression is the reverse transcription quantitative polymerase chain reaction (RT-qPCR). RT-qPCR can be utilized to measure either single- or multiple miRNAs per experiment. One of the most important and challenging issues in miRNA expression experiments is normalization. The purpose of normalization is to remove as much non-biological variation, "noise" and bias, from the data as possible and to make it possible to compare results within or between experiments. In large microarray studies in which the expression of hundreds of miRNAs is measured, global mean normalization is the gold standard [
10]. This normalization method uses the average expression of all miRNAs in each sample for normalization. In experiments with a smaller number of miRNAs, global mean normalization is not an option, and instead reference genes are needed for normalization. Traditionally, small nuclear- or nucleolar RNAs like
RNU6B have been used for normalization in miRNA experiments. Yet, these have been shown to be inferior to the use of stably expressed global mean-associated miRNAs as reference genes [
11‐
13]. Only a few studies have identified suitable reference miRNAs for use in frozen blood or tissue samples [
10‐
15], and no published studies have identified reference miRNAs in FFPE tissue in an unbiased manner. Because miRNAs are highly tissue specific [
14], reference miRNAs need to be validated within each tissue and tumor type, and for some prevalent malignancies like pancreatic cancer (PC), no publications regarding suitable reference miRNAs exist.
In the present study, we sought to identify stable reference miRNAs useful for normalization of RT-qPCR-determined miRNA expression in FFPE tissue samples from patients with colorectal cancer (CRC) and PC, and also to quantify the sources of variability associated with measurements of miRNA expression in archival FFPE samples.
Methods
Cohorts and clinical data
The miRNA measurements used for identifying reference miRNAs in this paper were produced in two previously published studies of CRC and PC [
16‐
18]. For details regarding the clinical study populations, readers are referred to these papers. Importantly, all of the samples used were resected before any systemic treatment was initiated. Briefly, the 203 CRC samples (“CRC cohort”) were collected retrospectively from patients with metastatic CRC (mCRC) who had started first line treatment with capecitabine, oxaliplatin, and bevacizumab from 2006 to 2011 at one of 10 departments of oncology in Denmark. The endpoint overall survival (OS) was measured from initiation of first-line treatment to death from any cause. The vital status of all patients was updated in July 2013. The 256 PC samples (“PC cohort”) were retrospectively collected from patients undergoing surgery for pancreatic ductal adenocarcinoma or ampullary adenocarcinoma at the Department of Surgical Gastroenterology, Herlev University Hospital, from 1976 to 2008. Control tissue samples from resected normal pancreas (
n = 20) and chronic pancreatitis (
n = 20) were also included. OS in this cohort was measured from surgical resection to death from any cause, and participant vital status was updated October 2010. Patients in both cohorts who were alive at the time of last vital status update were censored. The 37 methodological samples were all from patients with CRC who had undergone surgery at the Department of Surgical Gastroenterology at Herlev University Hospital. The methodological samples that were used for the comparison of frozen- and FFPE tissue were anonymized samples acquired from the Danish CancerBiobank at Herlev University Hospital.
Tissue sample handling and -preparation
All the samples in the CRC- and PC cohorts were FFPE samples from primary tumors handled according to the standard procedures at each local department of pathology. In general, resected tumor specimens were transported to the pathology department right after surgery. The specimens were then inspected and described by the pathologist, and the tumors were fixed in 10 % formalin-fixation solution for at least 48 hours, most often 2–3 days, but sometimes up to 5 days. After fixation, tumors sections were embedded in paraffin, and then stored at room temperature in a dry environment.
A collection of methodological samples from nine different CRC tumors was also used. These samples were treated differently: frozen or formalin fixed and paraffin embedded and serially sectioned. Within 30 minutes after surgery, tumor tissue was partitioned into three or four sections. One tumor section was immediately frozen in liquid isopentane. When frozen, the tumor was transferred to a container and kept in the freezer at -80 °C. The remaining tumor sections were fixed in formalin for 2, 3, or 6 days and then embedded in paraffin and kept at room temperature.
The diagnosis of carcinoma was confirmed by an experienced gastro-intestinal pathologist (DL) by review of a 3-μm hematoxylin and eosin (HE)-stained section from each tumor block. Three 10-μm sections were then cut from each tumor block without micro- or macro-dissection, and the sections were placed in sterile Eppendorf tubes. An overview of how the individual methodological samples were handled is provided in Table S1 in Additional file
1. From tumors 1–3, adjacent FFPE sections were cut and placed in two separate tubes. From tumors 4–9, sections were cut from both frozen samples and from FFPE tissue samples that were fixed for 2 to 6 days. From tumor 4, additional adjacent FFPE sections were cut and placed in five separate tubes.
The tissue sectioning was performed by experienced laboratory technicians at the Department of Pathology, Herlev University Hospital.
MiRNA purification
All miRNA purification and expression analyses procedures were performed by the biotech service provider AROS Applied Biotechnology, Aarhus, Denmark (
www.arosab.com) using commercially available reagents. The company was blinded to all clinical information.
For the CRC cohort and the methodological samples, RNA was purified using the miRNeasy FFPE Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions (miRNeasy FFPE Handbook September 2010, www.qiagen.com). Briefly, samples were deparaffinized and then lysed with proteinase K digestion followed by heat treatment. After centrifugation, the supernatant was recovered and treated with DNase. After mixing with buffer and ethanol, part of the mixture was transferred to an RNeasy MinElute spin column where total RNA was bound. After washing, the RNA was eluted and normalized to 70 ng/μl manually. For the PC cohort samples, RNA was purified using the High Pure miRNA Isolation Kit (Roche, Basel, Switzerland) according to the manufacturer’s instructions. Briefly, the tissue sections were deparaffinized in xylene and ethanol, then treated with proteinase K, and finally RNA was isolated using the one-column spin column protocol for total RNA. After washing, the RNA was eluted and an aliquot was normalized to 133 ng/μl. A few samples (approximately 10) were below 133 ng/μl and were therefore concentrated by speed vac. The purity and concentration of RNA were assessed by absorbance spectrophotometry on a NanoDrop 8000 (Thermo Fisher Scientific, Waltham, MA, USA). Samples with a 260/280 nm absorbance ratio below 1.8 were discarded and new sections from the corresponding tissue block were cut and purified, if possible. Purified samples were stored at -80 °C until they were used for miRNA expression analysis.
MiRNA reverse transcription and expression analysis
The TaqMan® Array Human MicroRNA A + B Cards Set version 3.0 (Applied Biosystems/Life Technologies, Carlsbad, CA, USA) was used to quantify expression of 754 miRNAs in the CRC cohort samples and in the methodological samples. The same array in version 2.0 was used for the PC cohort samples. The A-card (377 miRNAs) contained the same miRNA assays in the two versions, whereas there were minor differences between the B-cards. The instructions and reagents from the manufacturer were used in all steps (
http://www.lifetechnologies.com/). Briefly, the procedure utilized for the array analysis was as follows. RNA was reverse transcribed (RT) using the TaqMan® MicroRNA Reverse Transcription Kit into cDNA in two multiplex reactions each containing 3 μl of the small RNA preparation, corresponding to 200 ng total RNA, and either Megaplex RT Primer Pool A or Pool B in a total reaction volume of 7.5 μl. The RT reaction was run at 16 °C for 2 min, 42 °C for 1 min, and 50 °C for 1 sec for 40 cycles, then at 85 °C for 5 min, and held at 4 °C. Prior to loading the arrays, a 12-cycle pre-amplification reaction was performed using 2.5 μl cDNA in a 25-μl reaction and using Megaplex PreAmp Primers Pool A or B. The preamplification was run at 95 °C for 10 min, 55 °C for 2 min, 72 °C for 2 min, and then 12 cycles at 95 °C for 15 sec and 60 °C for 4 min, and finally 99.9 °C for 10 min and held at 4 °C. The preamplified solution was then diluted with 75 μl 0.1x TE buffer to a total volume of 100 μl. Each of the arrays was loaded with 1/50 (8 μl) of the preamplification reaction which was mixed with TaqMan Gene Expression Master Mix in a total reaction volume of 800 μl and run on the 7900HT Fast Real-Time PCR System. The PCR reaction was run using the same program as for the RT. The quantification cycle (Cq) was defined as the fractional cycle number at which the fluorescence passed the fixed threshold. All raw C
q values >32 were discarded. MiRNA expression values were transformed to 40-C
q so that higher values corresponded to higher expression. Data from samples that had been analyzed in spite of a 260/280 nm absorbance ratio <1.8 were removed from the final data set. Data from methodological samples with a 260/230 nm absorbance ratio <1.5 were also removed.
The technical replicates were all from the same purification, and they were analyzed on the same day on the same machine, consecutively.
Statistical analysis
Normalization: Global mean miRNA expression was defined as the sample-wise arithmetic mean C
q of all included miRNA measurements. In analyses in which low expression measurements and inconsistently expressed miRNAs were removed, only the remaining measurements were used to calculate global mean. Global mean normalization was performed as previously described [
10]. Briefly, the mean of all C
q values for sample “i” (global mean) was subtracted from each individual C
q value from sample “i”. Normalization using reference miRNAs was performed in a similar manner using the arithmetic mean of the reference miRNAs instead of the global mean.
Identification of reference miRNAs (CRC cohort and PC cohort)
Calculations were performed separately for each tumor type. Only miRNAs with less than 5 % missing values were considered. Missing/undetermined = no signal after 40 PCR cycles. A list of 20 miRNAs with the best correlation to global mean miRNA expression was identified using Spearman's rank correlation coefficient,
r
s
. The association between the expression of each of the 20 miRNAs and OS was investigated using a Cox proportional hazards (CPH) model [
19,
20] to test if the expression of individual candidate reference miRNAs was independent of prognosis. The stability measure
M was calculated for each of the 20 miRNA according to the formula by Vandesompele et al
.[
21]. If more than one miRNA from the same miRNA family was included in the list, only the miRNA with the lowest
M value was kept (miRNA family list found at http://www.mirbase.org/ftp.shtml). Removing miRNAs from the same family is recommended because miRNAs from the same family are expected to be co-regulated [
21,
22]. Consequently, adding additional co-regulated reference genes would not significantly improve the combined value of the genes as normalizers. The optimal number of reference miRNAs to use was then calculated using the pairwise variation measure
V as described by Vandesompele et al
. [
21], and Chang et al
. [
11].
V = 0.15 has been suggested as a suitable threshold below which the inclusion of additional reference miRNAs is not required, i.e. addition of further reference miRNAs will not significantly improve stability [
21].
The performance of previously identified candidate prognostic miRNAs for each cancer type [
16,
18] was tested by analyzing the association between OS and expression of the candidate prognostic miRNAs using both global mean normalization and normalization with the identified reference miRNAs.
Effect of sample age (CRC cohort and PC cohort)
Global mean miRNA expression was plotted for each sample according to the age of the tumor block, and the effect of age was estimated using linear regression with age as a continuous variable.
Methodological samples
Correlations between related samples were described using correlation coefficients,
r
s
. Linear mixed effects models with miRNA and samples as random effects were used to quantify technical variability corresponding to the between-sample and within-sample variation and were reported as standard deviations (SD) in C
q. SDs after normalization with global mean [
10], the mean of the optimal group of miRNAs, or the mean of the two best reference miRNAs were also calculated. Hierarchical clustering was performed using the miRNAs measured on all samples. Euclidean distance and complete linkage were used with four different normalization strategies: raw values (no normalization), global mean, mean of optimal reference miRNAs, and mean of the two best reference miRNAs. Additionally, 1-Pearson correlation was used as distance metric. Dendrograms were made for each of the mentioned clustering procedures. Clustering was performed using the function
hclust in the
stats package in R, and clusters were defined by
cutreeDynamicTree in the package
dynamic-TreeCut [
23].
P < 0.05 was considered statistically significant. The statistical software package R [
24] version 3.0.2 (
www.r-project.org) was used for all analyses. An overview of the sample cohorts and associated analyses is presented in Figure S1 in Additional file
1.
Ethics
The study was approved by the Regional Scientific Ethics Committee of the Capital Region of Denmark (
http://www.regionh.dk/vek, approval numbers: H-KA-20060181 and H-1-2010-081). The CRC study was a retrospective study wherein the ethics committee waived the requirement for obtaining informed consent. The PC study included participants prospectively and all participants signed an informed consent form allowing for publication of the results.
Discussion
MiRNAs hold great promise as biomarkers for the diagnosis, prognosis, and prediction of treatment benefit in patients with cancer, and they are also potential targets for new cancer drugs. In order to utilize miRNAs for these purposes, it is paramount to first learn about the precise dysregulation of individual miRNAs in various disease states. To this end, optimal normalization is essential. In whole “miRNome” studies, global mean normalization is the established standard. But in studies with fewer miRNAs, normalization with stable global mean-associated reference miRNAs is the best option [
10‐
15].
To the best of our knowledge, we have presented in this study the first unbiased identification of stable miRNA reference genes in FFPE cancer tissue. This is also the first study to identify miRNA reference genes in PC tissue, and it is the largest study of its kind to date. Normalization with the selected CRC reference miRNAs was able to slightly reduce variability between biological replicates and between frozen and FFPE samples, and it improved hierarchical clustering results. The reason why the reference miRNAs did not decrease variability for technical replicates could be that the variability in the raw data was very low. As shown in Table S4 in Additional file
1, 10 out of the 20 candidate reference miRNAs for CRC tissue and 8 out of the 20 candidate reference miRNAs for PC tissue have been identified as stable global mean-associated reference miRNAs in previous studies on frozen samples. In fact, at least one reference miRNA from each of the previously reported studies was represented in our top-20 lists (Table S4 in Additional file
1). This indicates that some reference miRNAs are suitable for normalization of both frozen- and FFPE samples, which has been suggested previously [
12].
Reference miRNAs are not necessarily equally suited for normalization of miRNA expression in cancer cells and in normal cells. However, the expression of the identified CRC reference miRNAs was independent of tumor cell content. Moreover, the expression of the identified PC reference miRNAs was strongly correlated with global mean miRNA expression in both cancer and non-cancer samples. Therefore, these reference miRNAs could be suitable for normalization regardless of the ratio of tumor- versus normal cells in the sample. Further, several of the candidate reference miRNAs we identified have previously been shown to be stably expressed in normal tissue from multiple organs [
10,
12,
14]. The most stable global mean-associated miRNA in both of our cohorts was miR-103a-3p. This miRNA has also been identified as a stable normalizer in frozen kidney and lung cancer samples [
12,
13]. Interestingly, in the study by Peltier et al. [
12], the authors found that miR-103a-3p was only the fourth most stable reference miRNA in frozen lung cancer tissue, while it was the most stable reference miRNA in FFPE lung cancer tissue. Thus, this miRNA could be especially suitable for normalization in FFPE cancer tissue. In the same study, miR-16 – a commonly used reference miRNA – was the least stable of seven candidate miRNAs in both frozen and FFPE lung cancer samples. MiR-16 was not identified as a candidate reference miRNA in our study, and its relevance as a normalizer in FFPE cancer tissue is uncertain.
We identified a significant but modestly sized decrease in miRNA expression as a function of storage time in PC. The trend was similar in CRC samples, but it was not significant. The effect in PC amounts to a 50 % reduction of the miRNA expression over 20 years. Siebolts et al. found a very similar rate of decline of miR-16 expression in FFPE blocks stored for up to 27 years [
25]. An effect of this size should not have a detrimental impact on the use of FFPE samples for clinical biomarkers in PC or CRC, because these samples are often only a few months to a couple of years old when used. Also, normalization would mitigate some of the difference, as demonstrated in two other studies in which the authors did not see any effect of storage time in miRNA expression normalized to
RNU6B or to other miRNAs [
26,
27].
An excellent reproducibility of the chosen platform was demonstrated with highly correlated miRNA expression in technical replicates. The biological replicates were different sections from the same tumor block. The variability in these replicates is the sum of biological differences in the tumor block and variability in sectioning, purification, reverse transcription, and amplification. Even with all these additional sources of variability, we found an excellent correlation between these samples, with only a minor increase in SD, from 0.31 to 0.52 Cq, compared to the technical variability caused by the amplification process alone. This suggests that miRNA expression is not very heterogeneous within an FFPE tumor block.
Formalin fixation and paraffin embedding caused a decline in miRNA expression of between 0.5 and ~1.5 C
q; yet, correlation between miRNA expression in frozen- and FFPE tissue from the same tumor was still high. Hoefig et al. also found a 1.0–1.5 C
q decline in miRNA expression in formalin-fixed compared to frozen liver- and tonsil samples [
28]. The decrease in global mean miRNA expression could be a result of miRNA degradation, excessive fixation of miRNA in the FFPE tissue with suboptimal purification, or small fragmented ribosomal- and messenger RNA interfering with the miRNA signal on the array. Many previous studies have reported a good correlation between frozen- and FFPE samples [
28‐
32]. The variability between frozen and FFPE samples was high in our study, and frozen samples from different tumors tended to cluster together in the hierarchical clustering analysis. Moreover, global mean normalization did not greatly improve the clustering of frozen samples. These findings could be a result of formalin-fixation causing non-uniform changes in miRNA measurability. They highlight the importance of doing large scale studies in both frozen and FFPE samples and not relying on direct portability of results between the two.
The quality of the purified RNA was lower in the PC cohort than in the CRC cohort, resulting in a higher proportion of excluded samples in the PC cohort. The PC samples were purified and analyzed in smaller batches distributed over a longer period compared to the CRC samples, which could have decreased overall quality of this sample cohort. In addition, the purification kit used for the PC cohort, which differed from the kit used in the CRC cohort, could have been less effective. Finally, the samples in the PC cohort were older which could also have influenced RNA quality.
Conclusions
Low expression measurements (C
q > 32) were removed before undertaking the analyses. This was done to reduce noise in the data, but it could also introduce a bias to the results. Therefore, all of the analyses were repeated without removal of low expression measurements, but this did not result in any major changes in the results, apart from the anticipated increase in variability. It is also important to note that several different technologies are used for miRNA expression quantification apart from RT-qPCR, e.g. hybridization-based arrays and sequencing technologies [
33], and our findings may not be applicable to all such technologies. In summary, we have identified stable global mean-associated reference miRNAs for use in miRNA expression studies on FFPE cancer tissue from patients with colorectal and pancreatic cancer. This is the first study to search specifically for reference miRNAs for use in FFPE cancer tissue. We also found that the length of storage is not a significant determinant of miRNA abundance in FFPE cancer tissue and that intra-block miRNA expression heterogeneity seems to be low. Formalin fixation caused a decline in miRNA expression, but expression profiles from frozen and FFPE samples from the same tumor were generally still highly correlated. These results should provide valuable information for the planning and execution of future miRNA biomarker studies in patients with cancer.
Availability of data and materials
All source data can be found in the additional supporting files.
Acknowledgments
We would like to thank Mel C. Heeran, PhD for excellent technical assistance with the sectioning of the FFPE tissue samples, Mogens Kruhøffer, PhD, from AROS Applied Biotechnology, Denmark, for help with the miRNA analyses, and the Danish CancerBiobank (DCB) for biological material and for the data regarding handling and storage. We would also like to thank Roche Denmark and The Capital Region of Denmark Health Research Fund for financial support.
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
MKB and CD had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. MKB and JSJ designed the study. MKB, NAS, BVJ and JSJ collected and assembled the clinical data, while DL and EVSH collected and evaluated pathology data. CD performed all of the statistical analyses and performed the initial data interpretation together with MKB and JSJ. The remaining authors also participated in the later stages of the data interpretation. MKB was responsible for writing most of the text for the initial draft of the paper but all the other authors contributed with individual text sections to the manuscript, according to their field of expertise. All authors critically revised the manuscript and approved the final version.