Background
Coding mutations like the well-known A53T α-synuclein and G2019S LRRK2 mutations cause familial Parkinson’s disease (PD) [
38]. Dopaminergic neurons carrying these mutations obtained via an induced pluripotent stem cell (iPSC) intermediate have been shown to recapitulate hallmarks of the neurodegenerative process in PD like increased susceptibility to oxidative [
42] and nitrosative stress [
53].
Much less is known about the possibility to model sporadic PD with iPSC based models. In sporadic PD that constitutes about 90% of cases and where none of the well-known genes causing familial Parkinsonism is mutated, no single variant in the whole coding sequence of the human genome was found to be associated with PD in a recent study [
55]. In contrast, alterations in non-coding regions, which are enriched in PD-related genes, are well established risk factors for PD [
12]. In addition, alterations of the epigenome in sporadic Parkinson’s disease have been reported in brain tissue on methylation [
25] and small RNA level [
26] and epigenetic alterations are also present in the patients’ peripheral blood [
24]. Although iPSC reprogramming has been reported to remove all marks associated with cellular ageing [
39], disease-specific alterations may survive reprogramming and then can impact iPSC function [
16], even allowing for imprinting disorders to be modelled with iPSCs [
7]. Importantly, both small RNAs [
62] as well as methylation [
43] can contribute to somatic memory.
Differential expression of disease associated small RNAs in cultured, differentiated neurons would allow to recapitulate and to study epigenome-mediated pathological alterations. Mature miRNAs, which repress target gene functions both by regulating target mRNA levels as well as by repression of target mRNA translation, have already been implicated in the pathogenesis of Parkinson’s disease, e.g. in animal models [
47]. PIWI interacting RNAs, a class of small regulatory RNAs first described in the male germline [
13], are less well studied in neuronal cells but have been implicated in retrotransposon silencing in the brain [
41] as well as regulation of epigenetic modifications [
49]. Importantly, piRNAs were recently described to be differentially regulated in Alzheimer’s disease brain tissue [
48,
52] connecting them to the process of neurodegeneration.
Most phenotypes observed in neurons differentiated from cells carrying a mutation that causes familial PD can only be observed after the cells have been stressed, implying that the alterations in the coding sequence lead to a stress-susceptible, primed state in the neurons. More specifically, increased cell death in neurons carrying the A53T mutation in α-synuclein is observed after treatment with mitochondrial toxins [
53] and in neurons carrying the G2019S LRRK2 mutation after H
2O
2 application [
42].
Under certain circumstances, dopaminergic neurons derived from sporadic Parkinson’s disease patients have been reported to show phenotypes similar to genetic cases including cellular abnormalities [
54], altered methylation patterns [
12] as well as abnormalities in protein turnover [
18]. Importantly, the epigenomic alterations observed were present in pathways important for physiological neurodevelopment [
12] supporting the hypothesis that these pathways are altered in PD [
29] as has been reported for Alzheimer’s disease [
14]. The alterations in protein turnover were already present in fibroblasts [
18] and a large subset of fibroblasts derived from sporadic PD cases showed increased sensitivity to valinomycin treatment [
58]. As fibroblasts are frequently used for reprogramming to iPSCs followed by differentiation into neuronal cells, it is possible that an epigenetic signature survives reprogramming that potentially impacts the function of transplanted cells. More importantly, this signature -if present- might provide unique insights into the disease aetiology of sporadic PD. Therefore, the Bavarian Research Network Induced Pluripotent Stem Cells (ForIPS) was initiated to test the hypothesis, which -if any- phenotypes can be observed in cells derived from sporadic PD-patients.
Based on these premises, we examined a large region specific cohort of matched fibroblasts, iPSCs and differentiated neuronal cells from healthy individuals and Parkinson’s disease patients and applied rigorous quality control standards. Previous to our analyses, we confirmed the PD-patients to be negative for all known monogenic forms of the disease. We aimed to determine in a global and unbiased approach if there were any epigenetic marks that distinguish cells from healthy and diseased individuals. Indeed, we found that cells derived from sporadic Parkinson’s disease patients show a unique small RNA signature and that pathways that are known to be altered in diseased tissue as well as genetic cases, like the pathway regulating PGC1α, are deregulated.
Methods
Cell culture, iPSC generation and differentiation to midbrain neurons
Induced pluripotent stem cells (iPSC) were generated from eight sporadic PD patients and six healthy controls by retroviral transduction of the four transcription factors (Oct 3/4, c-Myc, Sox2 and Klf4) as previously described [
15]. The Institutional Review Board approval (Nr. 4120: Generierung von humanen neuronalen Modellen bei neurodegenerativen Erkrankungen) and informed consent forms are on file at the movement disorder clinic at the Department of Molecular Neurology, Universitätsklinikum Erlangen (Erlangen, Germany). All the hiPSC lines were cultured in mTeSR (Stemcell Technologies) or MACSbrew (Miltenyi) on Geltrex (Gibco® Thermo Fisher Scientific) and splitted every five to seven days using Collagenase IV or Gentle Cell Dissociation Reagent (Stemcell Technologies). Pluripotency and a stable karyotype were tested by flow cytometry and G-banding chromosomal analysis, respectively. One to two hiPSC clones per individual (Additional file
1: Table S1) were differentiated into small molecule neural precursor cells (smNPCs) following the protocol published by others [
50] with some adaption as described in [
59]. The smNPCs were further differentiated to midbrain neurons within three weeks of maturation [
50,
59]. Briefly, 70% confluent iPSC were detached by collagenase IV (Gibco® Thermo Fisher Scientific) treatment for 20 min at 37 °C, 5% CO
2. Cell colonies were cultured as free-floating aggregates in human embryonic stem cells (hESC) medium (80% KO-DMEM, 20% KO serum replacement, 1% non-essential amino acids, 1% Penicillin/Streptavidin (all from Thermo Fisher Scientific), 1 mM ß-Mercaptoethanol (Sigma-Aldrich) supplemented with the small molecules 1 μM LDN (Stemgent), 10 μM SB, 3 μM Chir, and 0.5 μM Purmorphoamine (PMA, all from Tocris) on ultra-low adhesion plates. After two days of incubation at 37 °C, 5% CO2, the cell colonies were centred and the medium was changed to N2B27 medium (50% DMEM/F12, 50% Neurobasal Medium, 1:200 N2, 1:100 B27 (all from Thermo Fisher Scientific) supplemented with the same small molecules. On day four, the medium was changed to smNPC medium (N2B27 medium supplemented with 3 uM Chir, 0.5 uM PMA and 150 uM Ascorbic acid (AA; Sigma-Aldrich). After a total of six days of suspension culture, cell colonies were replaced on geltrex-coated (Gibco® Thermo Fisher Scientific) 12-well plates in smNPC medium supplemented with Rho kinase inhibitor Y27532 (RI, Axxora) for 24 h. Medium was changed every other day and cells were passaged once a week by accutase treatment. After at least five passages, smNPCs were differentiated into MN. Therefore, two days after passaging, the medium was exchanged to N2B27 medium supplemented with 100 ng/ml FGF8 (Peprotech), 1 μM PMA and 200 μM AA. On day 10 of differentiation, medium was supplemented with 100 ng/ml FGF8, 10 ng/ml GDNF (Peprotech), 10 ng/ml TGFb (eBioscience), 200 uM AA, and 500 μM Dibutyryl-cAMP (dbcAMP; Sigma-Aldrich. On the next day, cells were passaged at ratios of 1:2–1:3 as single cells after accutase treatment (Sigma-Aldrich), plated onto geltrex-coated four-well chamber slides (Ibidi) or 12-well plates and further cultured for at least two weeks in maturation medium (N2B27 medium plus 100 ng/ml FGF8, 10 ng/ml GDNF (Peprotech), 10 ng/ml TGFb (eBioscience), 200 uM AA, and 500 μM Dibutyryl-cAMP (dbcAMP; Sigma-Aldrich) with two times media change per week.
Poly-a RNA library preparation
Libraries for next-generation sequencing were prepared from 1 μg total RNA with the TrueSeq RNA library preparation kit v2 according to the manufacturer’s instructions (Illumina, San Diego, CA, USA). Briefly, poly-A RNA was purified from the total RNA preparation with magnetic oligo-dT beads. The RNA-bead mixture was incubated at 65 °C for five minutes to denature the RNA. Then, the mixture was incubated for five minutes at RT to allow the RNA to bind to the beads. Afterwards, the beads were washed with bead washing buffer and resuspended in elution buffer for two minutes at 80 °C. Bead binding buffer was added to allow rebinding of the eluted RNA. The beads were washed with bead washing buffer and resuspended in elute, prime, fragment mix. The RNA was eluted at 94 °C for eight minutes, and first strand cDNA synthesis was performed with SuperScript II (Thermo Fisher Scientific, Waltham, MA, USA) in first strand mix supplied by Illumina. Second strand synthesis was performed with the second strand mix for one hour at 16 °C and the resulting double stranded cDNA was purified using AMPure XP beads (Beckman Coulter, Brea, CA, USA). Then, end repair was performed for 30 min at 30 °C with the provided end repair mix. The end repaired DNA was again cleaned up with AMPure XP beads. A-tailing was performed with the A-tailing mix at 37 °C for 30 min followed by 70 °C for five minutes. Indexed adapters were ligated to the end-repaired A-tailed cDNA at 30 °C for ten minutes. Next, stop ligation buffer was added and the libraries were cleaned up with AMPure XP beads. PCR amplification was performed with the provided PCR reagents and the following cycling conditions: Denaturation at 98 °C for 30 s and then 15 cycles of1) 98 °C for 10 s, 2)60 °C for 30 s and 3)72 °C for 30 s. Afterwards, a final extension at 72 °C for five minutes was performed and the amplified libraries were purified again with AMPure XP beads. Finally, quality control was performed with a Bioanalyser® (Agilent).
RNA library preparation for tissue samples
For all tissue samples (RIN usually < 8), we used the TrueSeq RNA Access Kit (Illumina) to prepare libraries from 100 ng total RNA, which is suitable for degraded RNA. First, first strand cDNA was synthetized with the Elute, Prime, Fragment High Mix and super script II in First Strand Master Mix (25 °C for 10 min, 42 °C for 15 min and 70 °C for 15 min). Then, 20 μl Second Strand Marking Master Mix were added and the second strand was synthetized for 1 h at 16 °C. The cDNA was cleaned up with AMPure XP beads and A-tailing was performed with A-tailing mix (37 °C for 30 min followed by 5 min at 70 °C). Following this, adapter ligation was performed with Ligation Mix for 10 min at 30 °C after which the reaction was stopped with Stop Ligation Buffer. The libraries were cleaned up with AMPure XP beads and amplified with PCR Master Mix and PCR Primer Cocktail (98 °C for 30 s, 15 cycles of 98 °C for 10 s, 60 °C for 30 s, 72 °C for 30 s and 72 °C for 5 min). Following a further clean up with AMPure XP beads, the libraries were quantitated with a Bio-Analyser and 4 libraries were pooled at equal concentrations (200 ng each) for exome capture, which was repeated twice. For exome capture, Coding Exome Oligos were added to the pooled libraries together with Capture Target Buffer 3 and incubated for 95 °C for 10 min and 18 cycles of one minute incubations, starting at 94 °C, then decreasing 2 °C per cycle with a final incubation for 58 °C for 90 min. Immediately afterwards, the hybridized probes were captured with streptavidin magnetic beads for 25 min at RT and washed twice with Enrichment Wash Solution (incubation at 50 °C for 20 min). Finally an Elution Premix was prepared with Enrichment Elution Buffer 1 and NaOH. The streptavidin beads were resuspended in this Elution Premix, incubated for two minutes at RT, the supernatant was separated from the beads on a magnetic stand and Elute Target Buffer 2 was added to the supernatant. After the second exome capture, the libraries were cleaned up with AMPure XP beads. Finally, a second enrichment was performed with the Enhanced PCR Mix (98 °C for 30 s followed by 10 cycles of 98 °C for 10 s, 60 °C for 30 s and 72 °C for 30 s, with a final extension at 72 °C for 5 min) and the amplified libraries were purified again with AMPure XP beads. Finally, quality control was performed with a Bioanalyser® (Agilent).
Reduced representation bisulfite sequencing (RRBS) library preparation
Reduced representation bisulfite sequencing allows for highly accurate and efficient analysis of methylation patterns at single base pair resolution with a focus on CpG islands [
21]. Digestion was performed with 2.5 μg gDNA in buffer 4 (NEB) and 400 units MSPI O/N at 37 °C. Afterwards, the DNA was purified from this reaction with the PCR purification Kit (Qiagen) according to the manufacturer’s instructions. Briefly, 5 volumes buffer PB were added to the MSPI digest, applied to a spin column provided in the kit and centrifugation was performed for 1 min at 9500
× g. Then, the column was washed with 750 μl buffer PE, dried by centrifugation for 1 min at 9500
× g and the DNA was eluted with 30 μl buffer EB. Library preparation was then performed with the NEXTflex™ Bisulfite Library Prep Kit (BIOO Scientific) according to the manufacturer’s instructions with some modifications. Briefly, end repair was performed with 500 ng digested, purified DNA in end repair buffer mix and end repair enzyme mix in a total volume of 50 μl. The reaction was incubated at 22 °C for 30 min and then cleaned up with the MinElute® PCR Cleanup Kit. Then, 16.5 μl of the eluate were mixed with 4.5 μl of adenylation mix and the reaction was incubated for 30 min at 37 °C. Afterwards, 31.5 μl ligation mix and 2.5 μl of individual adapters (diluted 1:2) were added, and adapter ligation was performed for 15′ at 22 °C. Afterwards, the DNA was cleaned with AMPure XP beads and size selection for fragments from 175 to 400 bp was performed with a gel purification step. The libraries were separated on a 2% low melt agarose gel (Sigma-Aldrich), the cut out gel fragments were dissolved for 10 min at RT in DNA binding buffer and 150 μl ethanol were added. Then, the solution was applied to a clean-up spin column and centrifuged at 18500
xg until the complete volume was processed. Afterwards, the column was washed twice with DNA wash buffer, dried by centrifugation and the DNA was eluted with column elution buffer. Then, bisulfite conversion of the DNA was performed with the EZ Methylation Gold Kit (Zymo Research) according to the manufacturer’s instructions. Briefly, 130 μl conversion reagent were added to 20 μl purified DNA. The reaction was incubated for 10 min at 98 °C and for 2.5 h at 64 °C. Then, the samples were loaded on spin columns containing 600 μl M-Binding buffer and mixed by inverting. The DNA was bound to the column by centrifugation for 30 s at 18620
x g. Then, the column was washed with 100 μl wash buffer, and 200 μl desulphonation buffer were added. The desulphonation buffer was incubated for 17 min at RT, and then removed by centrifugation. The column was washed twice with 200 μl wash buffer, and dried by centrifugation for 10 s. Finally, 17 μl elution buffer were added to the column, incubated for 1 min and the DNA was eluted by centrifugation. Afterwards, PCR amplification of the bisulfite converted libraries was performed with PfuCx Hot Start (Agilent). 15 μl DNA were amplified with the NEXTflex™ Primer Mix. Cycling conditions were: Initial denaturation for five minutes at 95 °C. Then 18 cycles of 95 °C for 30 s, 65 °C for 30 s and 72 °C for 45 s. Afterwards a final extension at 72 °C for 7 min. The libraries were purified again using AMPure XP beads (Beckmann Coulter). Finally, quality control with a Bioanalyser® (Agilent) was performed.
Small RNA library preparation
Small RNA libraries were prepared from 1 μg total RNA containing small RNAs with the TrueSeq Small RNA kit (Illumina) according to the manufacturer’s instructions. Briefly, a 3’adapter was added to the RNA, denaturation was performed for two minutes at 70 °C and afterwards the mixture was put on ice immediately. Then, the adapter was ligated with a T4 RNA Ligase2 deletion mutant (epicentre) for 1 h at 28 °C. Then the reaction was stopped with stop solution for 15 min at 28 °C, the previously denatured 5′ adapter was added together with ATP and T4 DNA Ligase and ligated to the RNA for 1 h at 28 °C. After that, cDNA synthesis was performed with super script II and Illumina supplied dNTPs for 1 h at 50 °C. Afterwards, the cDNA was amplified and indexed with the primers and PCR mix supplied in the kit (eleven cycles of denaturation at 98 °C for 10 s, annealing 60 °C for 30 s and extension at 72 °C for 15 s with a final extension at 72 for 10 min) and size selection was performed on 5% TBE acrylamide gels (Bio-Rad, Munich, Germany). Here, the region marked by Illumina’s custom ladder between 145 bp and 160 bp were cut out and pooled for sequencing. The gel was homogenized by centrifugation at 20000 x g through a gel breaker tube (Bio-Cat) and 300 μl ultrapure H2O were added to elute the DNA O/N in a DNA LoBind tube (Eppendorf). The following day, the gel debris was separated from the water by centrifugation for 10 s at 600 xg through a 5 μm filter tube (Bio-Cat). Then, 2 μl glycogen (CALBIOCHEM), 30 μl sodium acetate (Thermo-Fisher Scientific), 2 μl 0.1× pellet paint and 975 μl prechilled 100% ethanol were added and the mixture was incubated for 20 min at − 80 °C. Then, the library was pelleted by centrifugation for 20 min at 4 °C and 20,000 xg, washed with 70% ethanol and recentrifuged for 5 min. Finally, the ethanol was removed, the pellet dried for five minutes at 37 °C and resuspended in 15 μl ultrapure water.
Next-generation sequencing
Illumina deep sequencing as well as quantification of small RNA content were performed at a genomics core facility: Center of Excellence for Fluorescent Bioanalytics (KFB, University of Regensburg, Germany). For deep sequencing, all libraries were quantified using the KAPA SYBR FAST ABI Prism Library Quantification Kit (Kapa Biosystems, Woburn, MA, USA). Equimolar amounts of each library were used for cluster generation on the cBot with the TruSeq SR Cluster Kit v3 (Illumina, San Diego, CA, USA). The sequencing run was performed on a HiSeq 1000 instrument (Illumina, San Diego, CA, USA) using the indexed, 50 cycles single read (SR) protocol and the TruSeq SBS v3 Kit (Illumina, San Diego, CA, USA). Image analysis and base calling resulted in .bcl files which were then converted into .fastq files by the CASAVA1.8.2 software.
Data analysis for (small) RNA libraries
Analysis of poly-A RNA, RNA enriched with the coding exome oligos and small RNA data was performed using the Genomatix software (Genomatix, Munich, Germany). For poly-A RNA and RNA enriched with the coding exome oligos, the .fastq files were mapped to the human genome assembly GRCh38 (annotation based on ElDorado 6–2015) using the Genomatix Mining Station Mapper v3.7.6.3 allowing one mismatch. We sequenced all poly-A RNA libraries until at least 15 * 10
6 and all exome capture libraries until at least 10 × 10
6 unique hits per sample could be mapped. All unique hits were further processed using the Genomatix Genome Analyser v3.51106 which was used to create count tables and RPKM expression values for all samples. Reads were counted locus-based, i.e. for unions of exons of genes. All further analyses based on these count tables were performed with the free software R v3.1.1, Bioconductor v3.0 [
20] and the package DESeq2 v1.6.3 [
36]. Gene set enrichment analysis [
53] was performed with GenePattern and the GSEA module v18 [
53] with RPKM (reads per kilobase of exon model per million mapped reads) values and gene sets c2.Biocarta as well as c2.KEGG using standard settings. For small RNA analysis, we mapped .fastq files against a small RNA library based on GRCh38 (annotation based on ElDorado 6–2015) allowing one mismatch. Afterwards, count tables were created for each small RNA type (piRNA, mature miRNA, miRNA hairpin structures) with the small RNA workflow available in the Genomatix Genome Analyser. We sequenced all samples until at least 1 × 10
6 counts for mature miRNAs were reached when reads were counted, with a mean of unique hits for mature miRNAs of 3,037,413 ± 2088,481and for piRNA 9,754,231 ± 4,096,709. All further single locus based analyses were performed as described for poly-A RNA. For further analysis of piRNAs, we matched the RNAdb2.0 identifiers [
45], on which Genomatix is based, to the pIRbase [
66], which provides information on the genomic localization, sequence as well as the elements from which the respective piRNAs are derived. Parts of the poly-A RNA-Seq dataset were generated in a collaborative project and three poly-A RNA-Seq runs from neurons are also contained/analysed in [
59].
RRBS analysis
For RRBS, .fastq files were trimmed with TRIM Galore v0.4.2 in the rrbs mode and mapped with BISMARK v0.16.3 [
28] to the human genome GRCh38.84 as obtained from
www.ensembl.org. Then, the methylation values were extracted with the BISMARK methylation extractor again removing 2 bp at the five prime end of the reads to remove the methylation bias in untrimmed reads stemming from the end repair procedure. We sequenced the libraries until at least 1 × 10
6 alignments with a unique best hit were found by BISMARK. All further analyses were performed based on extracted coverage files with RnBeads v1.2.0 [
4].
Data availability
All normalized NGS data were deposited in GEO (URL:
https://www.ncbi.nlm.nih.gov/geo) under the super series accession GSE110720. Coding exome RNA-Seq data is deposited under accession GSE110716, Poly-A RNA-Seq data is deposited under accession GSE110717, RRBS data is deposited under accession GSE110718 and small RNA-Seq data under accession GSE110719.
Quality control and inclusion criteria
Due to historical reasons, the mRNA data for quality control analyses of iPSCs were mapped to the human genome GRCh37 without mismatch. We included all samples that we analysed, as long as they met the following criteria: iPSCs had to show a marker expression of the pluripotency markers DNMT3B, SOX2, NANOG, OCT4 and REX1 within the range of a previously analysed high quality ESC cohort and to fall at least in the safety margin of a previously established database driven PluriTest adaption [
57]. Furthermore, viral transgene silencing had to be comparable to a published cohort for which NGS raw data was freely available [
1]. We analysed viral transgene silencing both by comparison of RPKM values of established markers of viral transgene silencing [
27] as well as by direct mapping of the plasmids used for reprogramming counting multiple and unique hits when mapping against the vector sequence (excluding the coding regions of the pluripotency factors) normalized to the number of unique hits of the respective sequencing run when mapped to the genome. The upper-limit cut-offs shown in the supplementary material were calculated as described in [
57]. SNPs were called within the Genomatix Genome analyser with a workflow based on samtools, with at least 4 x coverage per SNP and exclusion of indels. We excluded five iPSC lines based on these criteria.
Patient samples, tissue samples and embryonic stem cells
The Institutional Review Board approval (Nr. 4120: Generierung von humanen neuronalen Modellen bei neurodegenerativen Erkrankungen) and informed consent forms are on file at the movement disorder clinic at the Department of Molecular Neurology, Universitätsklinikum Erlangen (Erlangen, Germany). All procedures involving patient samples (tissues or cells) were approved by the local institutional review board (Ethikkommission Regensburg), approval 14–101-0216. The experiments involving embryonic stem cells were approved by the Central Ethics Committee for Stem Cell Research in Germany according to StZG (AZ: 3.04.02/0121). Tissue samples were obtained from the Netherlands brain bank as fresh frozen tissue. iPSCs were generated from skin biopsies of PD- and control-patients by the ForIPS core project as described elsewhere [
15].
Immunohistochemistry
The presence of Lewy bodies in the substantia nigra and more importantly in the cingulum was verified with stained sections from the Netherlands brain bank (NBB). For those cases were no staining was available, we obtained paraffin sections from the NBB and performed a staining with an antibody directed against aggregated α-synuclein (anti-human α-synuclein 5G4, mouse monoclonal, analytikjena, Jena, Germany). After deparaffinization, antigen retrieval was performed by cooking in citrate buffer for 20 min and DAB staining was performed with the Envision Dual Link System-HRP DAB+ Kit (Agilent, Santa Clara, CA, USA) according to the manufacturer’s instructions. Briefly, the sections were blocked with Dual Endogenous Enzyme Block for 10 min and rinsed with PBS. Then, primary antibody was applied (diluted 1:500 in PBS + 1% BSA) for 40 min and again the slides were washed with PBS. Afterwards, labelled polymer was added for 30 min. After a further washing step, substrate was added for five minutes. Afterwards, the slides were washed in running tap water, counterstained in hemalaun (Dako/Agilent) and again rinsed in running tap water. Finally, the slides were dehydrated in increasing ethanol concentrations and xylol and mounted in entellan mounting medium (Merck).
Methyl-cytosine staining
For analysis of methyl-cytosine content in tissue, we cut frozen tissue sections from the cingulate gyrus material. The DAB staining was performed with the Envision Dual Link System-HRP DAB+ Kit (Agilent, Santa Clara, CA, USA) according to the manufacturer’s instructions with some modifications. After cutting and thawing, the sections were first fixed with 4% PFA for 15 min, which was necessary as we retrieved unfixed material. Then the sections were washed three times for five minutes in TBS. Afterwards, the slides were incubated for 30 min in 2 N HCl for antigen retrieval. The slides were washed twice with PBS and blocking was performed with the Dual Endogenous Enzyme Block reagent for 10 min at RT. After further washing in TBS, anti-methyl-cytosine (Epigentek, mouse monoclonal, clone 33D3) antibody was added in 1:400 dilution and incubated O/N at 4 °C in 1% normal goat serum (PAN Biotech, Aidenbach, Germany) in TBS + 0.3% TritonX. A mouse IgG (Thermo-Fisher) was used as negative control. The next day, the slides were washed three times, covered with labelled polymer and incubated for 30 min at RT. Then, after one washing with TBS, the sections were covered with chromogen for 10 min. Afterwards, the slides were washed in running tap water, counterstained in Mayer’s hemalaun and again rinsed in running tap water. Finally, the slides were dehydrated in increasing ethanol concentrations and xylol (Carl-Roth) and mounted in entellan mounting medium (Merck).
cDNA synthesis, real-time PCR and semiquantitative PCR
cDNA synthesis was performed from 200 ng total RNA using random hexamer primers (Gene Link, Hawthorne, NY, USA) and the SuperScript™ II Reverse Transcriptase (Life Technologies) according to the manufacturer’s instructions. Real-time RT-PCR was performed with the SensiFAST™ SYBR Hi-Rox Kit (Bioline, London, UK) on the StepOnePlus™ cycler (Life Technologies). Relative expression values were calculated with the ΔΔC
T (analysis of relative gene expression) method [
35] using
ARF-1 as the reference transcript. Primers used for
ARF-1 were 5´-GACCACGATCCTCTACAAGC (forward) and 5´-TCCCACACAGTGAAGCTGATG (reverse), for
TH 5´-CCAAGACCAGACGTACCAGT (forward) and 5′- CGTGAGGCATAGCTCCTGAG (reverse). Primers for analysis of SYP, MAP2, TUBB3, EN1, NURR1, KCNJ6 and FOXA2 were described by others [
34]. Primers for WNT3 5´-GGAGAGGGACCTGGTCTACTA (forward) and 5′- CTTGTGCCAAAGGAACCCGT (reverse) were and downloaded from the PrimerBank database [
63]. Analysis of mtDNA content was performed with the real-time PCR conditions as described above and primers for mtDNA and nuclear DNA as described by others [
51], as was analysis of mtDNA deletions [
11]. Semiquantitative PCR was performed with the HotStarTaq polymerase (Qiagen) according to the manufacturer’s instructions and the equivalent of 125 ng cDNA per reaction. An annealing temperature of 55 °C and 35 cycles were used for amplification. Primers for semiquantitative PCR were designed with Primer3Plus [
61]. Primer sequences for PIWIL2 were 5´-TTGGATTCGAAAATGGCTTC (forward) and 5´-AGCCAGGAAGCGGTTATTTT (reverse), PIWIL4 5´-CAAGGACGTGATGGTTGTTG (forward) and 5´-ACCGACAGTCTTGAGCTGGT (reverse). GAPDH was used as reference with the primers 5´-CCATCTTCCAGGAGCGAGAT (forward) and 5´-ATGATGTTCTGGAGAGCCCC (reverse).
Statistics
All statistical analyses of NGS data were performed with the software packages described above and standard settings, unless otherwise indicated. A (small-) RNA locus was defined as differentially regulated when it was deregulated at a log2FC of ≥0.6 and an adjusted p-value of < 0.1. For analysis of methylation data a cytosine was accepted as differentially methylated when it passed the threshold Δ-meth ≥0.2 and an adjusted p-value of < 0.1. A gene set was accepted as differentially enriched at a p-value < 0.05 and a false discovery rate (FDR) < 0.25. Otherwise, all statistical tests used are provided in the figure legends and the results part together with the sample size. All bar graphs show the mean + standard deviation (SD) unless otherwise indicated.
Discussion
Systematic screening for phenotypes in cells established from well-defined cohorts of sporadic PD-patients has not been performed, yet. Therefore, as part of the ForIPS consortium, we aimed to elucidate if sporadic PD-patient derived cells carry any alterations as compared to matched control patients. We can show that PD-patient derived cells show a specific small RNA signature in every cell type examined.
Several studies have suggested similarities between cells established from genetic and sporadic cases in certain assays [
12,
18,
54]. To identify the molecular basis of such potential disease phenotypes we performed a comprehensive analysis of mRNA and small RNA expression patterns as well as methylation analysis at single base resolution in a unique cohort of fibroblasts, iPSCs and differentiated midbrain neurons from sporadic PD-patients. In classical phenotypic assays, the midbrain neurons differentiated from sporadic PD-patients did not show a cellular phenotype [
59].
Nonetheless, on the mRNA level, the pathway regulating PGC1α (peroxisome proliferator-activated receptor-γ coactivator-1α) is inactivated in PD-patient derived midbrain neurons, which was previously reported to be involved in disease-specific phenotypes in an A53T model of PD [
53] and is a hallmark of PD pathology [
67]. PGC1α is a master regulator of mitochondrial function and protects neurons from apoptotic cell death under stress conditions in in vitro models of PD [
53,
67]. In addition, the CREB-(cAMP response element binding protein) pathway, which is a known neuroprotective pathway [
23], was impaired in PD-patient derived neurons. CREB proteins, which are transcription factors mediating cAMP responses, (besides their function in cell survival) are involved in numerous processes in the nervous system, e.g. memory formation and neurogenesis [
44]. Importantly, CREB-activity is modulated by PGC1α [
9] connecting the identified pathways on a functional level. On a single gene level we found a significant PD-associated effect on the transcriptome in differentiated neurons, while fibroblasts or iPSCs showed no differences. Among the genes deregulated in neuronal cells, there were several WNT-pathway members. These have been reported to be hypermethylated in PD [
65] and upregulation of these genes in our midbrain neurons might serve as a protective mechanism as reported in a PD mouse model [
31]. In addition, genes and pathways regulating neurodevelopment that function downstream of WNT signals and play a role in midbrain development, i.e.
LMX1B [
29] and
OTX2 [
12], have been suggested to be involved in PD pathogenesis. We add the WNT-pathway members WNT3, ANT3A and WNT9B to this catalogue.
We furthermore show that neuronally differentiated cells show striking similarities with diseased tissue on the level of small RNAs. Here we detected the differential regulation of many PIWI interacting RNAs and/or piRNA-like molecules. Importantly, it has been reported by others that the orthologues of PIWI-proteins are expressed in the mammalian brain [
37,
41]. Although piRNAs were first described in testes where they show the highest abundance [
2], a large number of subsequent studies have identified their presence in the mammalian brain including studies on human tissue [
10,
30,
41,
48,
52]. PiRNAs play an important role for retrotransposon silencing in the brain [
41] and retrotransposon activation contributes to the genetic mosaicism in neurons in
Drosophila [
46]. Retrotransposon encoding loci are hypomethylated in mice deficient for a mouse orthologue of PIWI (MILI) [
41]. On the functional level, chromatin modification and transcriptional repression are guided by piRNAs [
19]. Moreover, piRNAs modulate synaptic plasticity in
Aplysia neurons via CREB2 in response to serotonin [
49] as well as dendritic spine size in mammalian cells [
30]. Of note, others have recently described deregulated piRNA expression in Alzheimer’s disease, emphasizing the relevance of piRNAs for neurodegenerative diseases [
48,
52]. We identfied a number of piRNAs that are shared between diseased PD-patient brain tissue and cultured neuronal cells. Importantly, the overrepresentation of LINE- and SINE-derived piRNAs among the downregulated piRNAs points towards a failure of PD-patient derived neurons to properly silence these elements. In addition, by analysing the size fraction of the piRNAs, we can show that both bona fide piRNAs as well as piRNA-like molecules contribute to the pool of deregulated sequences. This is important, as the catalogues of deregulated piRNAs in brain tissue from other diseases [
10,
52,
56] will most likely also contain a significant number of piRNA-like sequences. These are abundant outside of the testes [
64], but their function is less well understood than that of canonical piRNAs.
It has been reported that sporadic PD-patient derived neurons show aberrant protein turnover, altered morphology and methylation patterns when compared to control-patient derived cells [
12,
18,
54]. We extend these findings by completely tracing the cells on epigenomic and transcriptomic level from the primary fibroblasts through the iPSC stage to the neuronally differentiated stage. More importantly, we can show that the alterations are detectable with the widely used protocols of OKSM reprogramming followed by differentiation to midbrain neurons. Interestingly, neither on the methylation nor the small RNA level there were changes that would comprise a significant epigenetic memory. Only the PD-patient derived neurons had alterations that were comparable to the diseased tissue. However, we cannot completely rule out the possibility that epigenetic memory persists in epigenetic modifications that were not analysed in our study, e.g. on histones, as has been reported in other systems [
17].
There may be some other limitations to our study that should be shortly mentioned: 1) We used a classical retroviral four factor reprogramming cocktail. Reprogramming with non-integrating vectors like Sendai-viruses or pure RNA may provide a platform for functional studies less prone to effects due to reprogramming factor integration into the genome. 2) Our study is focused on the analysis of mRNA/small RNA expression and DNA methylation patterns. Biochemical validation of the effects of deregulated piRNAs/piRNA-like molecules and pathways will be an active area of future research when the appropriate tools are available. 3) Further examination of cortically differentiated neurons and/or laser capture microdissection of purified dopaminergic neurons from the substantia nigra could support and refine the conclusions on disease etiology of different types of neurons both in the in vitro and in vivo setting.
Nonetheless, our study implies that disease modelling with cells derived from sporadic PD cases is possible. We conclude that although four factor reprogramming removes ageing associated marks [
39] and iPSCs from healthy individuals are equivalent to ESCs [
5,
8], Parkinson’s disease derived neuronal cells show disease-specific alterations on transcriptomic and epigenetic level. More importantly, we show that both in neurons and in post-mortem brain tissue from PD-patients, piRNAs are deregulated in a similar fashion, which is a striking new finding both in the field of PD as well as cell biology research. We support this findings with a total number of more than 200 next-generation sequencing runs (DNA methylation, microRNAs, mRNAs) on one of the largest collectives of high-quality PD-derived cell preparations that is available in the field.
Acknowledgements
We thank André Reis and Mandy Krumbiegel (Department of Human Genetics, FAU, Erlangen University Hospital, Erlangen, Germany) for critical discussion on the topic. We thank Shunmin He and Peng Zhang (Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China) for their help on piRNAs.