Background
Multiple sclerosis (MS) is the most common chronic inflammatory disease of the central nervous system (CNS) leading to demyelination, axonal damage, and neuronal loss [
1,
2]. The diagnosis of MS relies on clinical symptoms, magnetic resonance imaging (MRI) findings, and laboratory tests, such as detection of oligoclonal bands in the cerebrospinal fluid (CSF) [
3]. MS has a heterogeneous and unpredictable clinical course spanning decades; the different rates of progression and the different responses of patients with relapsing-remitting MS (RRMS) to therapy remain unexplained.
It is widely accepted that MS pathology is caused by an inappropriate T-cell-mediated immune response that is induced in secondary lymphoid organs upon encounter with still unknown antigens [
4]. Leukocyte migration and activation inside the brain and spinal cord is accompanied by persistent intrathecal B-cell activation and antibody production whose role in MS pathology is not understood yet [
5]. At the cellular level, MS-associated inflammation is characterized by mild-to-moderate CSF pleiocytosis, perivascular accumulation of leukocytes (predominantly lymphocytes and myeloid cells) in the white matter and in the meninges, organization of lymphoid-like structures in the subarachnoid space, and microglia/macrophage activation in the neural parenchyma [
1,
2]. The relationship between peripheral immune system activation and CNS inflammation is highlighted by the therapeutic efficacy of natalizumab, which blocks leukocyte trafficking in the CNS and markedly reduces disease activity and leukocyte number, cytokine, and IgG levels in CSF [
6‐
9].
The identification of sensitive and specific biomarkers for diagnosis, prognosis, and treatment efficacy of MS is a relentless effort [
10]. At the protein level, CSF biomarkers for inflammation, like the B-cell attracting chemokine C-X-C motif chemokine 13 (CXCL13) and the extracellular matrix-degrading enzyme metalloprotease-9 (MMP-9) [
11], and for CNS tissue damage, like myelin basic protein and neurofilament light chain subunit [
12], have been identified. It has been shown that CSF levels of chitinase-3-like-1 [
13,
14] and neurofilament light [
13] chain are significant predictors of MS development and neurological disability. Several microarray-based gene expression studies have been carried out in whole blood or peripheral blood mononuclear cells (PBMC), aiming at detecting differences in gene signatures between MS patients and control subjects and between patients with clinically or radiologically active and inactive disease [
15‐
20]. However, owing to small sample size, disease heterogeneity, and differences in microarray technology and data analysis, reproducibility across studies has been extremely limited [
21]. Due to the small number of cells collected from the CSF, comprehensive gene expression studies in CSF cells are sparse. Compared with healthy controls or patients with non-inflammatory neurological diseases, MS patients show increased expression of genes involved in T-, NK-, and B-cell function in CSF cells [
17,
22‐
24]. Only a few studies have examined gene expression in paired CSF cells and PBMC from patients with MS and non-inflammatory neurological diseases, confirming poor correlation between intrathecal and peripheral immune activation [
17,
22,
23].
Real-time reverse-transcription polymerase chain reaction (RT-PCR) incorporating a target gene pre-amplification (PreAmp) step has the double advantage to improve detection of low-frequency transcripts and to enable analysis of a large number of transcripts even with low amounts of starting RNA [
25]. Here, we report the application of this enhanced RT-PCR method to paired CSF cell and PBMC samples from patients with RRMS and the results of a preliminary analysis on nearly 50 genes, including immune-related genes and genes expressed by Epstein-Barr virus (EBV). This ubiquitous DNA herpesvirus establishes a life-long latent infection and shows strong association with MS [
26,
27]. MS risk is higher after infectious mononucleosis, and immune reactivity to EBV is increased or deregulated in MS patients compared to healthy subjects, indicating a disturbance in virus-host interactions [
27‐
29]. It is debated whether an active EBV infection in the CNS of MS patients can cause an immunopathological response [
27,
30‐
36]. Although EBV DNA load in CSF and peripheral blood does not differ significantly between MS patients and healthy donors or patients with other neurological diseases [
37‐
40], some studies support an association between increased EBV load in peripheral blood and clinical MS attacks [
38,
41,
42]. It has not been established yet whether the study of EBV gene expression, which defines more precisely the different phases of viral infection [
43], might be a better strategy to investigate EBV perturbation in MS [
38,
44].
Thus, the goal of this study has been to evaluate whether the combined analysis of immune-related and EBV genes in CSF cells and PBMC obtained from clinically and radiologically characterized, therapy-free RRMS patients could provide novel information on the relationship between immune status, EBV infection, and MS disease features. After analysis of the expression levels of each selected gene according to sex, clinical, MRI, and CSF findings, all the collected data have been extensively analyzed using multivariate statistical methods in the attempt to identify gene expression patterns representative of underlying immunopathological processes and potentially useful for patient classification.
Methods
Subjects
Patients were recruited at the MS centers of the University Hospital San Luigi Gonzaga in Orbassano, University of Cagliari, and University of Firenze. The study was approved by the ethic committees of the three participating MS centers and of the Istituto Superiore di Sanità, and carried out according to the Declaration of Helsinki. Written informed consent was obtained from all study participants.
Thirty-one patients with RRMS were included in this study [
45,
46]. None of the patients received immunomodulatory or immunosuppressive treatment at the time of CSF and blood sample collection and had not received such treatments for at least 12 months. Demographic and clinical information were derived from medical records and are summarized in Table
1. MS disease onset was defined as the first episode of focal neurological dysfunction indicative of MS; relapses were defined as the development of new or recurrent neurological symptoms not associated with fever or infection and lasting for at least 24 h [
46]. Patients were categorized on the basis of the presence of a relapse or a condition of remission at the time of sampling. The expanded disability status scale (EDSS) score was calculated on the basis of a complete neurological examination by a neurologist expert in MS.
Table 1
Demographic, clinical, and CSF data of the analyzed patients
Relapsing remitting MS (n = 31) | 20/11 (1.8) | 33 years (20–65) | 1 (0–4.5) | 12 months (0.1–144) | 12 (38.7) | 13 (41.9) | 7 (0.5–45) | 0.81 (0.43–2.2) |
All patients were examined by a routine brain MRI protocol before or after sample collection (median time interval =26 days; range 1–90 days). The time interval between sample collection and MRI was significantly shorter in patients in clinical relapse (median =11 days, range 1–37 days) than in patients in clinical remission (median =42 days, range 2–90) (
p = 0.0015 by Student’s t test). MRI scans (T2-weighted and T1-weighted pre- and post-gadolinium administration, slice thickness 5 or 3 mm) were obtained in all patients using a standardized scanning protocol [
47] with 1.5 T MR scanners.
Sample collection
All CSF and peripheral blood samples were obtained for routine diagnostic work-up. CSF and blood from each patient were always drawn on the same day. CSF samples were processed according to the BioMS-eu consortium guidelines [
48]. A total of 3 to 17 ml of CSF (median =11 ml) were obtained by lumbar spinal tap with an atraumatic needle. Within 30 min after lumbar puncture, CSF samples were centrifuged at 1200 rpm for 10 min at room temperature to separate the cellular component from cell-free supernatant; the cell pellets were stored at −80 °C in RNAlater (Qiagen) or RNA was immediately extracted (see below). Blood (10 ml) was drawn in EDTA tubes, and PBMC were isolated using Lymphoprep, preserved in RNAlater and frozen at −80 °C. CSF samples were routinely analyzed for cell counts. Quantitative (IgG index) and qualitative (oligoclonal bands) analysis of intrathecal IgG synthesis after lumbar puncture was performed using standard methods.
Pre-amplification real-time RT-PCR
Total RNA was extracted from CSF cells (median =4.5 × 10
4, range 7 × 10
3–5 × 10
5) and PBMC (4 × 10
5) using the AMBION RNAqueous micro kit (Life Technologies, Grand Island, NY, USA) according to the manufacturer’s instructions, including genomic DNA digestion. Total RNA from PBMC was quantified by Nanodrop 2000 (Thermo Fisher Scientific, Waltham, MA, USA) and 200 ng was reverse transcribed for each sample. Because of the very low and highly variable RNA yield from CSF cells, the entire volume (15 μl) of RNA extracted from each CSF sample was reverse transcribed. Reverse-transcription (RT) was performed using the high capacity reverse transcription kit with RNase inhibitor (Life Technologies). The resulting cDNA was diluted to a final volume of 50 μl and splitted into four 12.5 μl aliquots. To increase the number of targeted copies, each cDNA aliquot was amplified for the specific gene assays by pre-amplification reaction (14 cycles) using the TaqMan PreAmp Master Mix (Life Technologies) and pooled gene-specific primers, and the reaction conditions indicated by the manufacturer. Inventoried and self-designed TaqMan gene expression assays were used to study cellular and EBV genes, respectively (see Additional files
1 and
2). Cellular gene assays were pre-amplified together with the housekeeping gene GAPDH; viral gene assays were pre-amplified separately together with GAPDH and the B-cell-specific genes CD19 and CD20. The pre-amplification product was diluted 1:5 up to 250 μl in TE buffer, and 4 μl of this dilution was used as template for a single real-time PCR analysis. Quantitative PCR experiments were performed in triplicates with the same inventoried or self-designed TaqMan assays used in the pre-amplification step (250 nM probe and 900 nM each primer), using the 7500 Real-Time PCR System (Life Technologies) for cellular genes and the StepOne Plus Real-Time PCR System (Life Technologies) for viral genes. Thermocycling parameters were 50 °C (2 min), 95 °C (10 min), followed by 40 cycles of 95 °C (15 s) and 60 °C (1 min) for both cellular and viral genes. The results of gene expression analysis are expressed as Ct values (Ct = threshold cycle of PCR at which the amplified product is detected). The ΔCt is the difference in Ct values derived from the gene of interest and the reference gene GAPDH; the factor 2^-ΔCt is used to express the ratio between the gene of interest and the internal reference gene. To rule out cross-contamination of reagents and primers, all RT, pre-amplification, and real-time PCR experiments included a NTC sample, containing all the components of each reaction except for the template. Considering that 12 μl of pre-amplified cDNA was analyzed for each transcript and that the available volume of each pre-amplified aliquot was 250 μl, we were able to analyze in triplicates up to 20 transcripts per aliquot.
To check that all amplicons were amplified uniformly without bias, we performed pre-amplification uniformity experiments using non-limiting cDNA from a human non pathological pulmonary hilar lymph node (obtained from Dr. Egidio Stigliano, Institute of Pathological Anatomy, Policlinico A. Gemelli, Rome, Italy), as control for cellular genes, and from an EBV transformed B-lymphoblastoid cell line (LCL), as control for EBV genes. The EBV+ LCL (L5) was generated by infecting 5 × 106 PBMC obtained from a patient with MS with B95.8 EBV strain in a medium containing cyclosporin A (1 μg/ml, Calbiochem); the outgrowth of B95.8-infected PBMC was monitored twice a week, and after 5 weeks post-infection, the LCL was permanently established. Amplification of pre-amplified cDNA from lymph node and EBV+ LCL was compared with that of non pre-amplified cDNA. Primer uniformity was calculated by the formula ΔΔCt = ΔCt (Preamp) − ΔCt (cDNA). A ΔΔCt value within ±1.5 is considered acceptable, as indicated in the manufacturer’s instructions. Pre-amplification uniformity values related to the reference gene GAPDH were very close to zero for all the investigated gene assays (mean ΔΔCt values ± SD were 0.33 ± 0.35 and 0.90 ± 0.41 for EBV and cellular transcripts, respectively), indicating optimal pre-amplification uniformity. PCR efficiency by direct and PreAmp real-time PCR was checked for viral genes and found to be similar (range 0.97–1.08; optimal efficiency = 100 ± 10 %) over serial dilutions of EBV+ LCL cDNA (from 100 to 0.1 ng, corresponding to approximately 10.000 to 1 cells). Importantly, similar data were obtained for each target gene after pre-amplification from pooled and single assays.
Statistical analysis
Demographic, clinical, radiological, CSF, and gene expression data of 31 MS patients were analyzed by univariate and multivariate statistical techniques. In univariate analyses, comparisons between groups of patients were carried out by Student’s t test and Mann-Whitney test for continuous variables, and by Fisher’s exact probability test for categorical variables. Correlation between variables was assessed by Spearman’s rank correlation coefficient. Means and SE, or medians and interquartile ranges, were used to summarize continuous data, and percentages were used for categorical variables.
To unravel complex gene interactions that may better capture pathological processes in MS, the collected gene expression data were analyzed using two multivariate statistical techniques: cluster analysis aiming to group subjects into clusters and factor analysis aiming to define artificial factors, that is underlying latent variables, adequately describing the correlation structure of the original variables. Cluster analysis was carried out by average linkage method with Euclidean similarity measure. Clustering of patients was visualized by dendrogram and the choice of number of groups was based on Calinski/Harabasz pseudo-F index and Duda/Hart index stopping rules. Factor analysis was carried out using the principal factor method. Factor loadings, that is, correlations of the original variables with factors, were used for interpretation of artificial factors. Scores of subjects on artificial factors were entered in further analyses. The influence of demographic, clinical, or radiological parameters on patient clustering and factor scores was investigated by univariate analyses. Finally, the discriminating power of artificial factors to predict patient clustering was assessed. Classification accuracy was assessed through receiver operating characteristic (ROC) curve analysis, by calculating the area under ROC curves (AUC) and its 95 % confidence interval (CI). These analyses were carried out separately on immune gene expression data obtained in CSF and PBMC samples.
The level of confidence was set at 0.05 and statistical significance was assessed by adopting the Bonferroni correction for multiple testing. Number of comparisons considered patient subgroups differing for demographic, clinical, and radiological characteristics, and resulting from cluster analysis. For correlation analyses, multiplicity due to two inflammatory CSF parameters (IgG index and CSF cell count) was considered. Stata 11 was used for statistical analyses.
Discussion
The establishment of a relatively simple procedure to perform large-scale gene expression studies in both CSF and peripheral blood is an important step forward towards a better understanding of immunopathological mechanisms and biomarker identification in MS. Here, we have explored the reliability and usefulness of PreAmp real-time RT-PCR to analyze expression of 41 immune-related genes and 7 EBV genes expressed during viral latent and lytic infection in CSF cells and PBMC obtained from 31 therapy-free RRMS patients with relatively short disease duration since diagnosis (median time =12 months). Due to improved sensitivity, PreAmp RT-PCR allowed relative quantification of low level cellular transcripts, such as IL-2, IL-4, IL-6, p40, IL-15, and IL-17A, which are usually undetectable in CSF and/or peripheral blood cells from MS patients using conventional RT-PCR methods. By confirming well-established differences in immune cell composition and mediators of immune responses between CSF cells and PBMC and the association of B-cell/plasmablast enrichment with inflammatory CSF parameters, like CSF cell counts and IgG index, this preliminary study indicates that Pre-Amp RT-PCR can provide reliable information on the abundance and activation status of different immune cell types in both the innate and adaptive branch.
Univariate analysis revealed no or only minor differences in immune-related gene expression between MS patients stratified by sex, clinical, or MRI status. Higher expression of CD4 in CSF cells from female and remitting patients suggests a relatively higher frequency of T helper cells, the predominant population in the CSF, compared to male and relapsing patients. In PBMC, higher expression of BDCA-2 during clinical remission and of IL-10 during clinical relapse may reflect an increased frequency of circulating plasmacytoid DC, the main source of type-1 IFN, and activation of immune regulatory/suppressive mechanisms, respectively. To date, no relevant differences in gene expression have been reproducibly demonstrated in whole blood cells or PBMC when comparing MS patients and controls, patients in clinical relapse and remission, or patients with different disease courses [
15‐
21]. A higher type-1 IFN signature has been detected in the blood, specifically in monocytes [
60,
61], of a subset of treatment naïve patients with RRMS and has been associated with a poor response to IFN-β [
60,
62,
63]. Recently, differences in PBMC gene expression profiles were detected between male and female patients with RRMS [
44], although results interpretation is complicated by different treatment regimens.
Owing to the multivariate dimension, cluster and factor analyses of immune-related gene expression data yielded more relevant results, allowing for gene signature-based subgrouping of patients and interpretation of underlying biological processes and their interaction. The results obtained with the multivariate approach completely differed between CSF cells and PBMC, confirming poor correlation between intrathecal and systemic immune responses. Cluster analysis carried out on CSF gene expression data yielded three clusters of patients. Of these, cluster 1 and cluster 2 (representing 77 and 19 % of the study population, respectively) significantly differed by gene expression but not by sex, clinical condition, disease activity on MRI, or inflammatory CSF parameters. Specifically, cluster 2 showed relatively higher expression of genes encoding MHC class II, macrophage (CD68) and T helper cell (CD4) markers, the type 1 IFN-regulated molecule OAS1, indicators of inflammation (COX-2, NAMPT), and the macrophage-derived pro-inflammatory cytokine IL-1β. By factor analysis, correlated genes were grouped into artificial factors providing information on specific biological processes. Of these, two factors potentially mirror different and interacting biological processes that predominate in the CSF of cluster 2 compared to cluster 1. Factor 1 strongly associates with genes related to a type-1 IFN response (IRF7, MxA, PKR, Usp18, OAS1, IFI6, IFIT1, IFN-αR1, BAFF, BDCA-2), cytotoxic/Th1 T-cell activation (CD8, IFN-γ), and inflammation (NAMPT, COX-2), while the main contribution of CD68, IL-10, MHC class II, and CD4 to factor 4 likely reflects an inhibitory circuit involving immune regulatory cells. This interpretation, along with the finding that assignment of patients to cluster 1 or cluster 2 depends on the ratio between factor 1 and factor 4, is consistent with a tight balancing of pro- and anti-inflammatory immune responses in CSF. Because ROC analysis showed excellent accuracy of some differentially expressed genes, particularly MHC class II, CD4, CD68, and OAS1, as well as of factor 1 and factor 4, in classifying cluster 1 and cluster 2 patients, future studies should ask whether these CSF gene signatures, alone or in combination, may have a prognostic value or be useful to predict a therapeutic response. Multivariate analysis carried out on PBMC gene expression data neither allowed patient clustering nor revealed any significant association of immune-related genes grouped into artificial factors with demographic, clinical, MRI, or CSF characteristics.
Despite the use of an enhanced RT-PCR method, EBV RNA was detected in a minority of CSF cell (10 %) and PBMC (14 %) samples obtained from RRMS patients, a finding that is in line with most previous studies assessing EBV DNA load in MS [
37‐
40]. As we have shown that EBV gene expression was detectable only in CSF cell samples with higher RNA content (GAPDH Ct values <19), it cannot be excluded that low RNA amount remains a major limiting factor for accurate evaluation of EBV infection status in CSF. However, it is worth noting that in all CSF samples (
n = 3) and half of the PBMC samples (two out of four) with detectable viral RNA, EBV gene expression was indicative of a deregulated infection. Perturbation of EBV infection was inferred by detection of gene products that are associated with different phases of viral latency activation and lytic cycle and are usually not detected in healthy subjects, even when using highly sensitive PCR techniques [
64,
65].
Higher antibody- and T-cell-mediated immune responses to EBV in MS patients than in control subjects indicate that EBV infection is perturbed in MS [
27‐
29]. Low prevalence of EBV nucleic acids in CSF and peripheral blood of MS patients and absence of marked differences in EBV DNA load between MS patients and controls [
37‐
40] indicate no persistent or substantial EBV perturbation in these body fluids. However, a higher EBV DNA load was found in PBMC of patients with CIS [
66] and during MS clinical exacerbations when serial blood samples were analyzed [
38,
41,
42]. Furthermore, a significantly higher incidence of EBV-induced B-lymphocyte transformation in MS patients compared to healthy subjects supports the presence of higher numbers of circulating EBV latently infected B cells in MS [
67,
68]. In normal conditions, the EBV life cycle mainly occurs inside the lymphoid tissue, particularly in mucosa-associated lymphoid tissue, like tonsils, where EBV reactivation can occur at very low frequency in plasma cells leading to release of viral particles; the released virus can infect naïve B cells and establish latency in memory B cells [
43,
69]. Latently infected memory B cells leaving the lymphoid tissue and entering the blood circulation are extremely rare and shut down expression of viral genes and proteins to avoid detection by cytotoxic T cells, thereby maintaining a life-long infection [
70]. Asymptomatic EBV reactivation in healthy individuals may lead to an increase in viral DNA load in the blood in the absence of detectable EBV latent and lytic transcripts, reflecting viral replication in remote lymphoid tissue [
64]. The presence of EBV latent and lytic transcripts in CSF cells and/or PBMC from a minority of MS patients described in this study should be interpreted as perturbance of the normal EBV life cycle, which may be transient and therefore difficult to capture in cells circulating through these body fluids, particularly in studies with a single sampling design. The results obtained in autoptic tissue samples, though still controversial [
30‐
32], suggest that an active EBV infection in MS could be mainly confined to brain intraparenchymal perivascular spaces, subarachnoid space where B cells accumulate and organize into B-follicle-like structures [
30], and/or CNS-draining lymph nodes [
71]. Selective enrichment of CD4+ and/or CD8+ T cells specific for EBV antigens in the CSF of patients with CIS and definite MS has been demonstrated in several studies, supporting the idea of a localized T-cell response to EBV in MS [
33,
35,
36].
Due to the low prevalence of EBV RNA+ samples in the analyzed MS cohort, the putative link between EBV infection status, cellular gene expression, and MS disease features could not be evaluated. However, it is worth noting that the only patient displaying EBV reactivation in CSF cells and PBMC was clinically and radiologically active. In PBMC from this patient, a transcript profile suggestive of profound deregulation of viral latency (EBNA-1/EBNA-3A) and virion production (EBER/gp350/220 RNA) was accompanied by the activation of a cellular transcript profile (CD20, CD56, NKp46, perforin, IFN-αR1, MHC class II, NAMPT, IL-1β, CXCL13) that is compatible with B-cell expansion and early induction of a robust innate immune response by EBV reactivation [
72]. Productive EBV infection in the peripheral blood in the presence of a disrupted blood-brain barrier could facilitate entry of viral particles and/or newly infected B cells into the CNS. It is intriguing that in CSF cells from the same patient, only BZLF1 RNA, which is associated with the EBV early lytic cycle, was detectable in the absence of any comparable sign of immune arousal. This finding may suggest abortive EBV reactivation and impaired/delayed virus recognition by the immune system in CSF compared to the peripheral blood. A longitudinal study with serial PBMC sampling could help understand whether EBV reactivation recurs in the peripheral blood of MS patients and is associated with immune-related gene signatures and disease activity. It is envisaged that MS patients displaying more frequent EBV reactivation could benefit more from the treatment with last generation B-cell depleting antibodies [
5] or antiviral drugs [
73,
74].
Competing interests
FM and LG received honoraria for attending and speaking at national and international meetings from Biogen Idec, Merck Serono, and EUROIMMUN. GC received speaking fees from Teva and Admirall. EC received honoraria for serving in the scientific advisory boards and speaking fees from Bayer, Biogen, Merck Serono, Novartis, Sanofi-Genzyme, and Teva. AB received honoraria for serving in the scientific advisory boards of Almirall, Bayer, Biogen Idec, and Genzyme and received speaking fees from Biogen Idec, Genzyme, Novartis, and TEVA with approval of the Director of AOU San Luigi Gonzaga University Hospital; his institution has received grant support from Bayer, Biogen Idec, Merck, Novartis, Teva, and from the Italian Multiple Sclerosis Society, Associazione Ricerca Biomedica ONLUS, and San Luigi Gonzaga ONLUS. All other authors declare that they have no competing interests.
Authors’ contributions
FA conceived and coordinated the study. AMR, EC, GC, and AB were responsible for patient care and clinical documentation. FM, CB, LG, and LS collected and processed the samples. CV and FM performed the laboratory analyses. EA established the EBV+ LCL. MP planned and performed statistical analysis. CV, FM, MP, and FA analyzed and interpreted the results. FA wrote the manuscript. All authors contributed to the critical review of the manuscript. All authors read and approved the final version of the manuscript.