Background
Autism spectrum disorder (ASD) is a common neurodevelopmental disorder with an increasing prevalence worldwide [
1,
2]. ASD manifests the wide range of symptoms and severity in perceivability and socialization with others, such as limited and repetitive patterns of behavior. Both genetic and environmental factors are involved in ASD pathogenesis. Environmental factors, including viral infections, medications during pregnancy, and air pollutants, may contribute to ASD risks [
3]. Compared with environmental factors, genetic factors appear to be a prerequisite for ASD development: genetic changes (mutations) may increase ASD risks; additionally, genes, such as
CHD8 [
4],
CNTNAP2 [
5],
DCC [
6], neurexin genes [
7],
SHANK1 [
8],
SHANK2 [
9],
SHANK3 [
10], and
WNT2 [
11] may affect brain development or brain-cells communication. ASD heritability has been estimated to be 50%, reflecting that genetic factors afford main components in ASD etiology [
12].
ASD begins in early childhood. Children with ASD usually show symptoms of autism within the first year, and regress during a period between one and two years of age. Although there is no specific medication for ASD patients [
13], early treatment can confer the lives of children with ASD beneficially. Gene-based test provides an impressive opportunity to identify potential infants with ASD [
8].
Accumulating whole-genome, association, and linkage studies have strongly documented the roles of genes in ASD [
14‐
16]. Copy-number variants (CNVs) are defined as deletions and duplications of DNA segments in the genome greater than one kilobase (Kb) [
17,
18]. De novo CNV events have been found to be implicated in the etiology of depression, schizophrenia, bipolar disorder, attention deficit hyperactivity disorder, and ASD [
19‐
22]. Array-based comparative genomic hybridization (aCGH) technology has proven to be a rapid method to detect the association between CNVs and ASD risks [
23,
24]. Large simplex ASD cohort studies show that the rate of rare de novo CNVs is significantly higher in affected siblings (5.8–7.9%) than that in unaffected siblings (1.7–1.9%) [
25,
26]. CNVs at 1q21.1, 2p16.3, 3q29, 7q11.23, 15q11.2–13.1, 16p11.2, 17p11.2, 17q12, and 22q11.2 are associated with ASD risks [
24,
27]. Moreover, CNVs in
NRXN1,
SETD5,
HDAC9, and
PARK2 are found to be associated with ASD risks [
28‐
30]. However, there exist substantial differences in terms of location and frequency of some CNVs in the general Asian population [
31]. In this paper, we investigated in CNVs in Northeast Han Chinese individuals with ASD.
Discussion
In the present study, we identified that 22 kinds of CNVs (six deletions and 16 duplications), eight protein-coding genes, and 12 miRNAs-coding genes are associated with ASD risks in northeast Chinese Han from Jilin province, China.
CNVs have repeatedly been found to correlate with ASD risks [
40,
41]. In our study, we filtered 22 potential pathogenic CNVs. Individuals with deletions and duplications of 15q13.3 have been found to manifest neuropsychiatric disease and cognitive deficits [
42]. In line with the discoveries of Bitar et al
. [
43], Bremer et al
. [
44], Celestino-Soper et al
. [
45], Chen et al
. [
23], Chen et al
. [
46], Pinto et al
. [
28], and Rosenfeld et al
. [
47], we further documented that CNVs at 5p15.33, 5p15.33-p15.2, 7p22.3, 7p22.3-p22.2, 7q22.1-q22.2, 10q26.2-q26.3, 11q25, 12p12.1-p11.23, 15q13.3, 16p13.3, 22q13.31-q13.33, and Xq12-q13.1 were associated with ASD risks. Autism-related phenotypes are common in patients with deletion or duplication at 22q13.3 [
48‐
51]. Most of the defects are due to haploinsufficiency of
SHANK3 [
49]. Chen et al
. found a deletion at 22q13.3 in two male children with ASD and a duplication at 22q13.31-q13.33 in one male child with ASD from Taiwan, China [
46]. In our study, we found a duplication at 22q13.31-q13.33 that overlaps
SHANK3 from two male children with ASD, indicating that the duplication at 22q13.31-q13.33 may play a key role in ASD etiology in our population. CNVs at 15q13.3 have been found to be involved in a variety of neuropsychiatric diseases, including intellectual disability/developmental delay, epilepsy, schizophrenia, and ASD [
42,
52‐
54]. The relation between
CHRNA7 at 15q13.3 and neuropsychiatric disorder phenotype has been validated intensively [
53]. In accordance with the discovery of Pinto et al
. [
28], we also found that a deletion of
CHRNA7 was associated with ASD risks.
Except
CHRNA7 and
SHANK3, we found CNVs-duplications (
DRD4,
HRAS,
OPHN1,
SLC6A3, and
TSC2) and CNVs-deletions (
PTEN). For
DRD4 and
HARS, we found seven children with ASD had duplications at 11p15.5, which overlaps
DRD4 and
HARS. Mutations in
DRD4 are associated with ASD risks [
55‐
57]. The mRNA expression levels of
DRD4 in peripheral blood lymphocytes are higher in people with ASD than those in healthy controls [
58,
59]. Herault et al
. also found positive association between
HRAS and autism in French-Caucasian [
60,
61]. For
OPHN1 at Xq12-q13.1, Celestino-Soper et al
. found a deletion of exons 7–15 of
OPHN1 at Xq12 in a male child with ASD [
45]. In contrast, we found a male child with ASD had a duplication at Xq12-q13.1. For
SLC6A3 at 5p15.33-p15.2, Bowton et al
. found
SLC6A3 coding variant Ala559Val is related to ASD [
62]. We further found a child with ASD had a duplication at 5p15.33-p15.2. For
TSC2 at 16p13.3 and
PTEN at 10q23.2-q23.31, Bourgeron et al
. found that mutations in
TSC2 and
PTEN activate the mTOR/PI3K pathway, associating with ASD risks [
63]. We found duplications at 16p13.3 in two female children with ASD. PTEN loss involved in white matter pathology in human with ASD is consistent with that in mouse models [
64]. We revealed that deletions at 10q23.2-q23.31 overlapping
PTEN in 13 male children with ASD, rather than 3 female children with ASD. Thus, these eight genes may be implicated in ASD etiology.
MiRNAs encoded within CNVs are important functional variants, providing a new dimension to recognize the association between genotype and phenotype [
65]. MiRNAs play vital roles in governing essential aspects of inhibitory transmission and interneuron development in nervous system [
66]. Deletion or duplication of a chromosomal loci changes the levels of miRNAs which further impact on neuronal function and communication [
36]. In our study, 12 candidate-susceptible miRNAs-coding genes of ASD were identified (ten duplications [
MIR202,
MIR210,
MIR3178,
MIR339,
MIR4516,
MIR4717,
MIR483,
MIR675,
MIR6821, and
MIR940] and two deletions [
MIR107 and
MIR558]).
BDNF, a brain-derived neurotrophic factor and a member of the neurotrophic factor family, is a target gene of miR-202 [
67]. Moreover, we further predicted that miR-4717-5p, miR-483-3p, and miR-940 also targeted
BNDF. Skogstrand et al
. found that lower BDNF levels in serum correlate with ASD risks [
68,
69]. miR-339-5p has been found to be a drug target for Alzheimer's disease, and is low expressed in mature neurons and related to axon guidance [
70,
71]. In our study, we found that miR-339-5p targets 42 genes associated with ASD risks. Among these genes, the association of
DIP2A and ASD risks has been validated by our team [
72]; moreover,
Dip2a knockout mice exhibit autism-like behaviors, including excessive repetitive behavior and social novelty defects [
73]. Notably, autism-like behaviors and germline transmission in
MECP2 transgenic monkeys corroborate association between miR-339-5p and
MECP2 [
74]. In addition, miR-202-5p, miR-483-3p, and miR-940 also targets
MECP2. For these reasons, miRNAs encoded within CNVs may be implicated in ASD etiology.
For enrichment analysis, we found that genes were enriched in synapse, synapse-related signal regulation, neurotransmitter activity, neurotransmitter transport, and neurotransmitter binding. Mutations in synapse-related or neurotransmitter-related genes are associated with ASD risks in multiple unbiased, targeted sequencing, and neuropathological studies, evidencing that dysregulation in synaptogenesis and neurotransmission is implicated in the pathogenesis of ASD [
75‐
78]. We corroborated that ASD pathogenesis was related to dopaminergic synapse, mTOR signaling pathway, insulin signaling pathway, and cholinergic synapse [
79‐
82]. Dopamine affects ASD-related-brain regions (basal ganglia, cortex, and amygdala) via dopaminergic synapse [
79]. mTOR is involved in integrating signaling from ASD synaptic and regulatory proteins, such as SHANK3, FMRP and the glutamate receptors mGluR1/5 [
63,
83]. Dysfunction in mTOR signaling affords one of mechanisms of ASD — an imbalance between excitatory and inhibitory currents [
80]. Insulin signaling pathway is feasible for development of autism [
81]. Neurochemical abnormalities in the cholinergic system are involved in ASD pathogenesis, highlighting the potential for intervention-targeted cholinergic synapses [
82].
Functional network analysis of the 219 CNVs-encoded-miRNAs-targeted genes elicited that a novel regulating mechanism of these CNVs-encoded miRNAs consisted of synapse-related functions (glutamatergic synapse, dopaminergic synapse, serotonergic synapse, and GABAergic synapse), axon guidance, ion channel (ion-gated channel and cation channel complex), and Wnt signaling pathway. Synaptic function and Wnt signaling pathway are affected by mutations in diverse ASD-related genes, and altered Wnt pathway signaling may confer an involvement in ASD pathogenesis [
78]. Interestingly, dysfunction of axon-guidance signaling is integral to the microstructural abnormalities of the brain in people with ASD [
84]. Notably, the involvement of ion channel-related genes has been found in ASD etiology [
85]. Mutations in ion channel genes contribute to low-to-moderate susceptibility of ASD [
85].
Both GO and Pathway enrichment analyses showed that CNVs-relating genes and CNVs-encoded-miRNAs-targeted genes mapped synapse-related functions. Additionally, CNVs-relating genes also enriched in mTOR signaling pathway and insulin signaling pathway. In contrast, CNVs-encoded-miRNAs-targeted genes enriched in axon guidance, ion channel, and Wnt signaling pathway. These results documented the high complexity and heterogeneity of ASD, suggesting that different genomic alteration on same chromosomal location may confer distinct but complementary effects on the brain of people with ASD.
Our study had some limitations: (1) the sample size in our study may confer limited statistical power to discover significant findings; (2) genetic and environmental factors contribute to ASD risk; however, environmental factors were not available for us; and (3) de novo or inherited of the CNVs were not be classified because of the lack of data from parents.
Despite these limitations above, our study also had some strength. Firstly, we found eight de novo CNVs (duplications at 1p36.31, 1p36.33, 1q42.13, 11p15.5, and 16q21; deletions at 2p23.1-p22.3, 10q23.2-q23.31, and 14q11.2) and 12 validated CNVs (duplication at 5p15.33, 5p15.33-p15.2, 7p22.3, 7p22.3-p22.2, 10q26.2-q26.3, 11q25, 16p13.3, 22q13.31-q13.33, and Xq12-q13.1; deletion at 7q22.1-q22.2, 12p12.1-p11.23, and 15q13.3), further documenting that ASD is of high genetic heterogeneity after comparing our results and previous findings (Supplementary Table
12). Secondly, we identified 20 genes (eight protein-coding genes supported by SFARI and AutismKB and 12 microRNAs-coding genes that refine understanding of involving approach of ASD-susceptible-genes in etiology) are implicated in ASD risks. Thirdly, we performed GO and KEGG pathway analyses of CNVs-relating genes and CNVs-encoded-miRNAs-targeted genes, providing a new dimension to revealing ASD etiology.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.