Background
Cardiomyopathy is defined as the presence of a structural or functional impairment in the myocardium and is classified as either primary or secondary. Cardiomyopathy has been formally classified into five distinct forms: hypertrophic (HCM), dilated (DCM), restrictive (RCM), arrhythmogenic right ventricular cardiomyopathies (ARVC), and left ventricular non-compaction (LVNC) [
1]. Genetic aberrations may contribute to a significant percentage of primary cardiomyopathy patients. Approximately 60 genes have been reported to be disease-related in inherited cardiomyopathy (IC), As such, a fast, effective genetic screening approach for IC cases would be useful. With a precise molecular diagnosis, physicians can provide accurate treatment strategies and genetic counseling for patients and their family members.
Targeted sequencing of multiple genes of interest is a rapid, cost-effective alternative to whole exome sequencing (WES) or whole genome sequencing and is becoming more commonly used in clinical laboratories. Bench-top sequencers have the advantages of low cost, flexible sequencing options, and easy-to interpret results compared to high throughput sequencers, and have significant cost and time savings over the conventional Sanger sequencing method [
2].
The aim of this study was detection of pathogenic variants in 64 candidate genes associated with IC in a cohort of patients with various type of primary cardiomyopathies, and explored the potential clinical application of Ion AmpliSeq™ custom designed panel and the Ion Personal Genome Machine (PGM) system in IC.
Methods
Inclusion and exclusion criteria
Patients diagnosed with IC phenotypes as HCM, DCM, RCM, ARVC/D, LVNC, and overlapping or undefined phenotypes were included in this study. HCM, DCM, RCM, ARVC/D and LVNC were defined based on guidelines described by a the American Heart Association [
1]. Overlapping or undefined cardiomyopathies were defined as cardiac manifestations exhibited at least two phenotypes (such as a significantly thickened interventricular septum together with left ventricular dilation) that cannot be ascribed to a single classical phenotype.
Exclusion criteria included secondary cardiomyopathies or cardiomyopathies with pathogenic mechanisms that have already been described, including inflammatory (myocarditis), stress-provoked (Tako-tsubo), peripartum, ischemic, hypertensive, valvular, hyperthyroid/hypothyroid, alcoholic, and diabetic cardiomyopathies.
Sample collection and DNA extraction
A total of 110 unrelated patients diagnosed with IC, including HCM (n = 34), DCM (n = 22), RCM (n = 13), ARVC/D (n = 7), LVNC (n = 9), and overlapping or undefined cardiomyopathies (n = 25), were identified and enrolled at the Cardiology Clinic at the Peking Union Medical College from January 2012 to December 2015. All patients were determined to not have secondary cardiomyopathy. Of these patients, 18 had a family history of cardiomyopathy or sudden death, while 92 were sporadic cases. Clinical evaluation consisted of a medical history, family history, physical examination, 12-lead echocardiogram (ECG), transthoracic and/or transesophageal ECG, and/or cardiac magnetic resonance imaging. Peripheral blood samples were collected from patients and their family members (if available). Genomic DNA was extracted from peripheral blood leukocytes using a QIAamp DNA Blood Midi Kit (Qiagen, Hilden, Germany), according to the manufacturer’s instructions. The study was approved by the Peking Union Medical College Hospital Institutional Review Board, and all individuals signed a written informed consent.
Panel design and library preparation
According to Online Mendelian Inheritance in Man (OMIM,
http://omim.org) and PubMed literature retrieval, 64 candidate genes have been reported to be causes of inherited cardiomyopathy, and were selected for panel design (Additional file
1: Table S1). Primers of overlapping amplicons covering the coding sequence (CDS) region, untranslated regions (UTR), and flanking sequences (padding +25 base pairs) of each targeted gene were automatically generated by Ion AmpliSeq designer software. This produced 2231 amplicons, which were divided into 2 primer pools (Life Technologies; Thermo Fisher Scientific). Although Titin (
TTN) is a major causative gene for DCM [
3,
4], it was not included in the 64-gene panel. As such, we used a separate 6-gene panel (including
DMD,
TTN,
OBSCN,
FBN1,
TGFBR2 and
TGFBR1) for 23 patients with DCM, including one patient with overlapping DCM phenotype. Amplicon libraries were prepared using the Ion AmpliSeq Library Kit v2.0 and custom designed primer pools, according to the manufacturer’s instructions. DNA fragments from different samples were ligated with barcoded sequencing adaptors using the Ion Xpress Barcode Adapter 1-16 Kit. The library was quantified with a Qubit 2.0 fluorometer (Invitrogen; Thermo Fisher Scientific).
Next generation sequencing and data analysis
Fifteen barcoded samples were pooled in equimolar amounts. Amplified libraries were subjected to emulsion polymerase chain reaction performed on the Ion OneTouch system using the Ion PGM Hi-Q OT2 Kit. Next, ion sphere particles (ISPs) were recovered and enriched using the Ion OneTouch ES system. Enriched template-positive ISPs were sequenced on an Ion 318 V2 chip using the Ion PGM HI-Q SEQ Kit by the Ion Torrent PGM. Data from PGM runs were processed using Ion Torrent Suite 4.4 software to generate sequence reads. After sequence alignment and variant calling, synonymous variants, intronic variants far away from the exon/intron boundaries, and variants with a minor allelic frequency (MAF) ≥ 1% in the 1000 Genomes Project, the dbSNP database, and the Exome Aggregation Consortium (ExAC) database were removed from further analysis. Variants were subsequently selected according to the prevalence of each type of cardiomyopathy (for example, MAF < 0.4%, < 0.3%, < 0.2%, < 0.05% for DCM, LVNC, HCM, and ARVC, respectively) in the general population [
5,
6]. NGS reads were visualized using an integrated genomic viewer (IGV).
Variant classification
We placed verified variants into the following categories according to guidelines from the American College of Medical Genetics and Genomics (ACMG) and Association of Molecular Pathology (AMP) [
7]: pathogenic (P), likely pathogenic (LP), variant of uncertain significance (VUS), likely benign (LB) and benign (B).
Discussion
More than 60 genes have been described to cause inherited cardiomyopathy. Genetic screening of these genes individually using traditional Sanger sequencing is labor-intensive and expensive. High-throughput, cost-effective genetic testing methods are urgently needed. The NGS approach provides opportunities to identify and investigate thousands of genetic aberrations simultaneously among diverse cardiomyopathy phenotypes, and is an important auxiliary tool for clinicians regarding decision-making, diagnosis, and treatment. In this study, we developed a 64-gene Ampliseq-based targeted resequencing panel, and subsequently analyzed using next generation semiconductor sequencing technology. We obtained a diagnostic yield of 23.6% (26/110). The mutation detection rate was significantly higher in familial (50%) vs. sporadic (21.7%) cardiomyopathy. Several studies applying NGS have shown different molecular diagnostic rates for different types of cardiomyopathies. For example, Cuenca et al. [
8] utilized a panel of 126 genes and identified DCM-causing mutations in 73% of patients with familial dilated cardiomyopathy undergoing heart transplantation. Furthermore, Gómez et al. [
9] reported a diagnostic rate of 25% when investigating 76 patients with HCM using a 9-gene panel. The divergence in diagnostic mutation rate may be ascribed to patient selection, size of panel design, and/or choice of NGS platform. In the present study,
TTN truncating mutations were not as prevalent in patients with DCM as was previously reported, perhaps because our sample size was too small and does not represent conclusive results in the Chinese population evaluated.
High-throughput sequencing technology enable researchers to find an abundance of variants in individual cases, although determining the pathogenicity of each variant identified by NGS remains a challenge. When selecting appropriate criteria for filtering (e.g. excluding the common variants in databases, such as dbSNP and 1000 Genome), the morbidity of the disease must be considered. On the other hand, variants listed in mutation databases that may have previously been regarded as disease-causing mutations may later be proven to be benign or VUS. For example, 38 variants found in our patients were also recorded in HGMD and/or the ClinVar database. According to ACMG and AMP guidelines, only 15 variants were classified as pathogenic or likely pathogenic; 3 were reclassified as benign or likely benign, and the remaining 20 variants were reclassified as VUS (Table
1 and Additional file
5: Table S4). Thus, extreme caution needs to be used when defining a variant as disease-causing.
Forty novel single nucleotide variations (SNVs) were found in this cohort of patients that have not been reported in public mutation databases. According to ACMG guidelines, and familial co-segregation analysis, 11 novel variants were classified as pathogenic or likely pathogenic mutations (Table
1). One novel variants was classified as likely benign. The remaining 28 novel variants were classified as VUS because there was no evidence supporting classification as either pathogenic or benign.
Target sequencing of a gene panel can revise the clinical diagnosis and guidelines for management. For example, we found a homozygous
SLC25A4 (also called
ANT1, c.358G>A, p.Gly120Ser) mutation in one patient with an HCM and DCM overlapping phenotype. The mother of the proband was a heterozygous mutation carrier and the father’s blood sample was not available. This variant was not found in the 1000 Genome, ESP6500, and ExAC databases. All three bioinformatics analyses classified this variant as damaging. A homozygous
SLC25A4 (c.368C>A, p.Ala123Asp) mutation was previously identified in a patient with mitochondrial myopathy and cardiomyopathy [
10]. An in vitro study showed that the mutant produced a loss-of-function effect on SLC25A4 activity. Both of the two amino acid substitutions (p.Gly120Ser and p.Ala123Asp) occurred in conserved residues, with the position of two mutations nearby. Gly120 is located in the third transmembrane domain of ANT1 [
11] within the dimerization motif “GXXXG”. This sequence is thought to be involved in high affinity association between transmembrane domains [
12]. As such, we consider this to be a disease-causing mutation. A 42-year-old patient who presented with undefined cardiomyopathy had a p.Gly120Ser mutation and was born to non-consanguineous parents who were both reported to be unaffected. His cardiac manifestations included a significantly thickened interventricular septum together with left ventricular dilation and noncompaction, with subsequent development of heart failure. Histological examination of a muscle biopsy showed ragged-red fibers. Based on genetic testing results, the diagnosis was revised to mitochondrial DNA depletion syndrome 12B (cardiomyopathic type). The patient was nominated as a candidate for future heart transplantation.
There are no formal standards for classifying a variant as causative. All filtered results are based on genotype quality, frequencies, bioinformatics tools, and published data from databases. Additional criteria are needed to support or refute pathogenicity, such as in vitro functional studies or long-term follow-up during the clinical care of each patient. Familial co-segregation studies play a crucial role in determining the variants’ pathogenicity, but incomplete penetrance and variable expressivity should be considered in cardiomyopathy. The common occurrence of sudden death in cardiomyopathy made co-segregation analysis more difficult in the present study, and most variants were classified as VUS. This result still valuable, because it can provide genetic data for primary cardiomyopathies to disease-specific databases. Genetic aberrations identified in this study provided a precise clinical diagnosis, appropriate genetic counseling, and proper medical management for some IC patients with pathogenic mutation. The impacts of genetic variants on prognosis will require long-term follow-up by clinicians. Clinicians and geneticists must collaborate to determine whether a variant can be considered pathogenic following identification by NGS.
Although the rapid, economic characteristics of panel-based NGS are compelling, ampliseq-based NGS has several weaknesses. Firstly, only 94% of the target region (UTR+CDS) can be covered at the panel-design stage. Secondly, newly identified causative genes might not be included in the panel. For example, mutations in
FLNC were reported to cause HCM and RCM after the panel was designed in this study [
13,
14], so this gene was not included. Thirdly, at the PCR stage, high GC content can influence amplification efficiency. Thus, a fraction of the coding regions remains unsequenced. Moreover, false-positive rates of indel variants identified by PGM are higher than other NGS platforms, especially in homopolymer regions. In order for the application to be used in clinical testing, alternative methods must be developed to overcome these disadvantages.
Another weakness of this study is that we did not performed variants detection in healthy individuals from Chinese population in the same fashion. In public online database, such as the 1000 Genomes Project, the dbSNP database, and the Exome Aggregation Consortium (ExAC) database, the allele frequency for all candidate variants were listed, including the allele frequency in East Asian population (showed in 1000G_EAS and ExAC_EAS column in Additional file
3: Table S2). We think these data showed the allele frequency of detected variants in healthy population, to a certain extent. In order to save the cost of this study, we used the information in these databases as control.
Authors’ contributions
XZ and SZ conceived and designed the study. CL, FL, KY, JL, YL, RW, NS performed the experiments and analyzed the NGS data. WW, PG and YL carried out the study of the clinical part. CL, WW and XZ prepared and revised the manuscript. All authors approved the final manuscript.