Introduction

Sudden infant death syndrome (SIDS) is defined as the sudden and unexpected death of an infant younger than one year of age, with the onset of the fatal episode apparently occurring during sleep.1 The cause of death remains unexplained after a thorough investigation, including performance of a complete autopsy,2 review of the circumstances of death and the clinical history. Although the incidence rate of SIDS cases drastically decreased in the last years, SIDS is still one of the leading causes of postneonatal infant death in developed countries with a prevalence between 0.1 and 0.8 deaths per 1000 live births.3 The occurrence of SIDS is described by a triple risk model involving (1) a critical developmental period in the first months after birth, (2) a vulnerable infant and (3) exogenous stress factors.4 Environmental risk factors such as the prone sleeping position, sharing bed with parents, or smoking exposure during pregnancy are widely accepted stressors to expose a vulnerable infant at risk for cardiorespiratory failure or other homeostatic imbalance.5 However, the pathophysiological mechanisms responsible for SIDS still remain poorly understood.6, 7 Genetic studies in SIDS cohorts collectively suggest that up to 15% of SIDS cases might be explained by inherited cardiac diseases not detectable during conventional forensic autopsy investigations.8, 9, 10 Ion channelopathies such as Brugada syndrome (BrS), long QT syndrome (LQTS), short QT syndrome or catecholaminergic polymorphic ventricular tachycardia (CPVT), are described as disrupted channel functions causing disturbed ion current flow and lethal cardiac arrhythmias.11 Cardiomyopathies are characterized by structural abnormalities in the heart, such as hypertrophic cardiomyopathy, dilated cardiomyopathy, arrhythmogenic right ventricular cardiomyopathy (ARVC) and left ventricular non-compaction cardiomyopathy.12 Additionally, undiagnosed inherited metabolic diseases such as medium-chain-acyl-CoA dehydrogenase (MCAD) deficiency or glucose metabolism deficiency might contribute to the cause of death in another 1% of the SIDS infants.13

Post-mortem genetic testing by using massive parallel sequencing (MPS) approaches represents an efficient and rapid strategy to investigate potential disease-causing mechanisms that remained undetected during conventional autopsy and may help to identify the cause of death in some of the SIDS infants and to detect families at risk for further sudden deaths.14, 15 A first MPS-based genetic investigation in 104 genes associated with sudden cardiac death had identified likely pathogenic variants in two cardiomyopathy-associated genes (PKP2 and VCL) in one representative SIDS case.10 A second MPS-based targeted sequencing study in 47 Danish cases of sudden unexpected death in infancy reported likely causative variants in cardiac disease-associated genes in 16 cases (34%), demonstrating the potential of performing a molecular autopsy in sudden death cases.16 As whole-exome sequencing (WES) is not substantially more expensive than a targeted sequencing approach, this sequencing strategy allows extended data analysis in cases of a negative result within a predefined gene list. Therefore, the aim of this study was to perform a WES analysis in our large SIDS cohort of 161 infant cases with focus on 192 genes associated with cardiovascular and metabolic disorders.

Materials and methods

SIDS study population

Our study population consisted of 161 SIDS cases collected between 1985 and 2014 at the Zurich Institute of Forensic Medicine, Zurich, Switzerland. Most of the SIDS cases were examined by the same forensic pathologist, ensuring a high level of uniformity in autopsy procedures and case reporting. The classification of SIDS cases has always been performed according to the generally accepted international definitions of SIDS, including a complete autopsy, review of the circumstances of death, and examination of the clinical history.1

Forty-one infants were determined as genuine SIDS cases belonging to SIDS category I, including infants with normal clinical history, normal growth and development, no similar deaths among siblings and found in a safe sleeping environment with no evidence of accidental death. The remaining 120 infants were classified into SIDS category II due to slight infections before death, preterm birth or other deviations to category I requirements. The median age of the 161 SIDS infants was 15.03±8.3 weeks (range 0.6–48.1 weeks) and 60.2% were boys (97 males/64 females). All of the SIDS infants were Europeans, most of them Swiss. A targeted MPS approach (HID-Ion AmpliSeq Ancestry Panel, Thermo Fisher, Rotkreuz, Switzerland) using the Ion Torrent PGM platform (Thermo Fisher) was applied to verify the geographical origin of SIDS cases where no information on the family origin was available. Eleven SIDS cases were excluded from our original cohort of 172 individuals, because of non-European ethnicities. Even small amounts of population admixture can shift the results toward an association, and therefore, it is important to have a well-defined study population in regard to ethnic and geographic background.17 Additional epidemiological data of the SIDS cohort are illustrated in Supplementary Table 1.

Ethical approval for this study was provided by the local ethics committee (KEK-ZH-Nr. 2013–0086), and the study was conducted in full conformance with Swiss laws and regulations. Family members were not available for co-segregation analysis.

DNA extraction and quantification

Genomic DNA of the SIDS infants was obtained from tissues stored in alcohol or from alcohol-fixed and paraffin-embedded tissue blocks.18 In most of the cases kidney or tongue was used (otherwise heart, muscle or brain) because of reported good post-mortem DNA stability in these tissues.19 DNA extractions were performed using the QIAamp DNA Mini Kit (Qiagen, Hombrechtikon, Switzerland) according to the manufacturer’s protocol. All DNA quantities were determined with a Qubit 1.0 fluorometric quantification device (Thermo Fisher).

Exome sequencing and bioinformatics

DNA library preparation and exome capture were performed with the SureSelect target enrichment and SureSelect All Exon V5+UTR’s kits (Agilent Technologies AG, Basel, Switzerland), using the protocol for 200 ng input amount of genomic DNA. Sequencing as well as sequence alignment and variant calling were performed at the Functional Genomics Center Zurich, Switzerland. Sequencing was done on the Illumina HiSeq2500 platform (Illumina, San Diego, USA), generating 2 × 100 bp paired-end reads. Sequences were aligned to the reference genome (GRCh37/hg19) using BWA.20 MEM algorithm with default setting and quality control of the exome coverage was performed with Bioconductor package TecQC.21 A filter was set so that a sample was required to have at least 80% of exome covered at ≥20 × read depth. Variants discovery was performed by means of GATK,22 following the GATK best practices workflow.23 In particular, groups were reassigned using PicardTools,24 duplicate reads were removed using Samtools25 and local realignment, variants discovery and filtering (minimum 20 × coverage, minimum 20% alternate allele frequency) was done with GATK walker and the dbSNP database.26

Data analysis

Within our WES results, we focused on a gene panel of 192 genes associated with cardiovascular or metabolic disease-associated genes (Supplementary Table 2). Variants with <50 × bidirectional coverage and/or an alternate allele frequency ratio <0.4 were additionally confirmed by standard Sanger sequencing methods. Annotation of the variants was performed with the Software Alamut Batch version 1.4.2 (Interactive Biosoftware, Rouen, France). Output results were reported in an Excel-sheet for data analysis. Variants in TTN were not further evaluated due to reported difficulties in sequencing and variant interpretation.27 We adapted our previously published filter strategy28 as follows: (1) a global minor allele frequency value (MAF) of ≤0.01 based on NCBI dbSNP,26 (2) focus on exonic and splice site variants and (3) the exclusion of synonymous variants (Supplementary Figure 1). In addition, Human Gene Mutation Database (HGMD, Qiagen)29 was consulted to check already reported variants in the literature. Alamut Visual Version 2.7.1 (Interactive Biosoftware) was used to visualize coverage of variants and to review conservation of the variants across a variety of species. Pathogenicity of variants was assigned according to an adapted scoring scheme originally described by van Spaendonck-Zwarts et al.30 and Hertz et al.31 Our scoring scheme was based on the assessment of variant types (null-variants, splice site variants, missense variants), in silico protein predictions, and MAF in three European control populations namely Exome Sequencing Project (ESP), 1000 Genomes Project and Exome Aggregation consortium (ExAC) database (Table 1).26, 32, 33, 34, 35, 36, 37, 38, 39 A small proportion of the 60 706 individuals in the ExAC database is originally coming from smaller databases possibly leading to an overlap of the European individuals.40 ESP is not well-powered to filter at 0.1% allele frequency without removing many genuinely rare variants, however the majority of ESP European singletons are not seen a second time in ExAC. Therefore, we still used the European MAF of all three databases. Based on the scoring scheme, variants were classified into five separate subcategories, designated as variants of unknown significance VUS0-4 (Table 2). Co-segregation and functional analyses would have been required to classify a variant as pathogenic;30 therefore, the highest score a variant in our study could get was VUS4. DNA variants were numbered according to reference sequences using HGVS nomenclature (http://varnomen.hgvs.org). Variants in subcategories VUS4/VUS3 have been submitted to the Leiden Open Variation Database (Individual IDs: 00064759 to 00064769/00065124 to 00065157) (http://databases.lovd.nl/shared/diseases/02087).

Table 1 Scoring scheme for variant evaluation (adapted from Hertz et al.31)
Table 2 Subcategories of variants based on scoring scheme

Variant confirmation

Potential disease-causing variants not reported in the mentioned databases were confirmed by Sanger sequencing. Additionally, allele frequencies were checked in an in-house exome database of 118 European patients with eye diseases.

Results

Whole-exome sequencing and data analysis was successfully completed for 155 SIDS cases. DNA library preparation failed for six SIDS samples due to low DNA quantities or poor DNA qualities resulting from decayed post-mortem tissues and/or the fixation method.

Overall, 72.3% of the bases had a coverage of ≥20 reads and the average on-target coverage was 90.2% at ≥20 reads. The average depth within our gene panel was 100.64±34.25 (Supplementary Table 3). By focusing on 192 genes of interest, an average of 1960±467 variants per case were obtained for further data analysis. After the filtering steps, an average of 14.6±9.8 variants per sample were manually checked with the Alamut Visual v2.7 software and evaluated according to our scoring scheme. Sanger sequencing confirmed all VUS4/VUS3 variants with <50 × coverage and/or not reported in the databases (Supplementary Table 4). The majority of these variants were missense variants (97.1%) followed by splice site variants (1.5%), nonsense variants (0.9%), and frameshift variants (0.6%). Based on our scoring scheme, 11 variants (1.3%) were classified into sub-category VUS4 and 24 variants (2.7%) into VUS3 indicating variants with probably pathogenic effects.

Thirty-one (20%) out of the 155 SIDS cases had at least one variant with likely pathogenic functional effects (Table 3). Four of these cases had two likely causative variants. Details of the variants are available in Supplementary Table 4. Of the 31 SIDS cases, 17 (54.8%) were males and the median age of death was 4 months (range 1–9 months). Most of the variants were detected in SIDS category II infant cases (67.7%).

Table 3 Variants with likely disease-causing effects in the SIDS cohort

Among the 31 SIDS infants with likely causative variants, 14 infants (9%) carried putative pathogenic variants in genes associated with ion channelopathies and 11 SIDS infants (7%) had disease-causing variants in genes associated with cardiomyopathies (Figure 1). Additional four SIDS infants (2.5%) had variants in genes associated with mitral valve prolapse, aortic valve disease, Marfan syndrome or Ehlers-Danlos syndrome. Variants in genes associated with metabolic diseases were found in two SIDS cases (1%), in glycogen storage disease and systemic primary carnitine deficiency.

Figure 1
figure 1

Percentage of SIDS infants with likely causative variants in genes associated with cardiomyopathies, ion channelopathies, other cardiovascular diseases, and metabolic diseases. VUS, variant of unknown significance.

Most of the variants were detected in genes associated with BrS (2.5%), followed by dilated cardiomyopathy (2.1%), hypertrophic cardiomyopathy (1.4%), LQTS (1.4%), ARVC (1.0%) and CPTV (1.0%).

Discussion

High-throughput sequencing provides a comprehensive and time-efficient sequencing strategy to identify rare DNA sequence variants in the genome/exome of patients with complex disorders or to discover underlying genetic causes in large heterogeneous study populations as for example in SIDS cases.

Starting with tissue collection of SIDS infants already in the early 1980s, we have a valuable and well-defined SIDS cohort of 161 infant cases at our institute. Although alcohol-fixed and paraffin-embedded tissue blocks do not provide optimal DNA qualities and quantities, exome sequencing was successfully completed for 155 out of 161 SIDS cases. By focusing on a gene list of 192 genes associated with cardiovascular or metabolic diseases, we identified potentially disease-causing variants in 20% of the 155 SIDS cases. The majority of these cases had variants in genes associated with channelopathies (9%) and cardiomyopathies (7%).

The main cardiac genes reported in other SIDS studies are CAV3, GJA1, GPD1-L, KCNE2, KCNJ8, KCNQ1, KCNH2, MYBPC3, RYR2, SCN5A and TNNI3.6 We detected potentially causative variants in two of our SIDS cases in SCN5A p.(Arg1897Trp) and RYR2 (c.2907-1G>C). SCN5A is primarily expressed in the cell membrane of cardiac tissue where it encodes sodium-gated channels. Although SCN5A p.(Arg1897Trp) has been reported in patients with LQTS and atrial fibrillation, functional studies have indicated no effect on QTc intervals, syncope propensity, and overall mortality assuming that this variant is less likely associated with a dominant monogenic form of the disease.41 RYR2 encodes a ryanodine receptor found in cardiac sarcoplasmic reticulum and causative variants are described in stress-induced CPTV and ARVC.42 RYR2 (c.2907-1G>C) is located at the 5′-end of exon 26 and causes an altered acceptor site, however the variant was not described in ARVC or CPTV patients so far. Further variants with less likely functional effects were detected in KCNE2, CAV3, RYR2 and MYBPC3 (Supplementary Table 4), but no variants in GJA1, GPD1-L, KCNH2, KCNJ8 and TNNI3.

Additional variants with likely functional effects were found in genes related to different cardiac diseases or sudden death such as ANK2 p.(Glu148Gly), ACTN2 p.(Glu891*), DSP (c.273+5G>A), TRPM4 p.(Ala83Glfs*13), TRPM4 p.(Trp525*), and TSFM p.(Gln324Argfs*11). Interestingly, ANK2 p.(Glu148Gly) and TRMP4 p.(Trp525*) were already described in a Danish SIDS cohort.16 ANK2 encodes ankyrin-B, which has an essential role in the localization and membrane stabilization of ion transporters and ion channels in cardiomyocytes.43 The same variant was first reported in a large French family with LQTS including sinus node dysfunction and episodes of atrial fibrillation and one individual who suffered sudden death,44 but ANK2 p.(Glu148Gly) was also detected in eight Danish control individuals with normal mean QTc interval.41 TRPM4 belongs to the melastatin-related transient receptor channel family and encodes calcium-permeable cation channels localized predominantly in the plasma membrane.45 Variants in TRPM4 were described in patients with progressive familial heart block and BrS. Both alterations represent interesting candidate variants involved in the sudden death event of SIDS cases.

The most investigated gene with regard to metabolic diseases in SIDS is ACADM, which catalyzes the first step in the beta-oxidation of fatty acids.13 The most prevalent variation causing MCAD deficiency is ACADM p.(Lys329Glu), which is present in 80% of individuals who clinically are diagnosed with MCAD.7 The only variant detected in our SIDS cohort with regard to ACADM was ACADM p.(Arg53Cys) (Supplementary Table 4). Although this variant has been reported in one MCAD-patient in combination with the most common ACADM p.(Lys329Glu) pathogenic variant,46 our scoring scheme predicts little functional effect for this variant.

Altered ion channel functions causing lethal arrhythmias may represent the most plausible and comprehensible cause in infant death cases.10 Many channelopathies are characterized by incomplete penetrance and variable expressivity where sudden cardiac death is often the first manifestation of the disease.47 In contrast, cardiomyopathies are mainly caused by variants in genes encoding desmosomal cell adhesion proteins or in sarcomeric proteins involved in heart contraction inducing structural heart abnormalities. However, a growing number of studies have established links between desmosomes and components of cardiac electrical machinery.48 Consequently, variants in cardiomyopathy-associated genes may contribute more generally to cardiac diseases and might be involved in the cause of death in some of the SIDS infants even in absence of morphological abnormalities in the heart.

Today, exome sequencing is not substantially more expensive compared to targeted gene panels, but represents a more efficient and comprehensive sequencing method to investigate sudden unexplained death cases in absence of a specific phenotype. To our knowledge, this is the first WES study in a large SIDS cohort. Although we exclusively report the findings within predefined genes of interest, one major advantage of exome sequencing is the alternative of extended data analysis in cases without any results providing a chance to identify new candidate genes in SIDS. The underlying cause of death in the majority of SIDS cases still remains elusive and might be explained by a multifactorial etiology due to a combination of different genetic and environmental risk factors. Therefore, further analyses could focus on SIDS-related predisposing genetic factors in genes involved in early brain development, respiratory regulation, nicotine response, immune system, metabolic and energy production, thermoregulation and mitochondrial activity.6, 7

The main current challenge in exome sequencing studies is the clinical interpretation of genetic variants identified. The categorization of variants in our study was based on a stringent scoring scheme involving different population-specific databases and in silico protein prediction tools. Nevertheless, the different VUS categories included single variants with a higher allele frequency than expected for specific cardiovascular diseases meaning that a part of the here reported VUS might not be severe enough to cause death in infancy; still some of them could act as predisposing risk factors whereas others might be reclassified as benign based on prospective findings. Therefore, further assessments such as functional studies are required and strongly recommended for an evidence-based classification of the pathogenicity.49 Recently, mutations in genes previously associated with SIDS were identified in exome data from population studies indicating that many variants might have some pathological influence, but are most likely not the exclusive genetic cause of SIDS.50 Therefore, caution is needed when translating such exome sequencing results from research to diagnostic applications. Genetic counseling of first-degree relatives should be based on a multidisciplinary approach, involving forensic pathologists, geneticists and cardiologists, to inform the family in case of positive genetic findings and to discuss further steps regarding genetic testing of family members and/or to monitor the affected person.2

A limitation of this study is the lack of functional assays in order to verify the potentially pathogenic role of detected variants, in particular amino acid substitutions. Also, family members were not available for co-segregation analyses due to the sample anonymity required by the ethical committees. This would be necessary to determine the mode of inheritance, to classify variants into the pathogenic category,30 and to identify other genetic carriers at risk for sudden cardiac death. Our case reports only included clinical records on sudden death cases in siblings but not in other family members, which would be an important point to consider. Finally, exome sequencing data reveal lower sequencing coverage compared to targeted gene panels potentially leading to a loss of important low-coverage variants and more false negative/positive calls.

Additional MPS studies combined with functional assessment in large SIDS cohorts are inevitable to better understand the etiology of SIDS and to identify additional pathophysiologic mechanisms involved in this tragic death event.