Literature DB >> 28281572

Genome-wide copy number variation analysis in a Chinese autism spectrum disorder cohort.

Hui Guo1,2, Yu Peng1, Zhengmao Hu1, Ying Li1, Guanglei Xun3, Jianjun Ou2, Liangdan Sun4, Zhimin Xiong5, Yanling Liu1, Tianyun Wang1, Jingjing Chen1, Lu Xia1, Ting Bai1, Yidong Shen2, Qi Tian1, Yiqiao Hu1, Lu Shen1, Rongjuan Zhao1, Xuejun Zhang4, Fengyu Zhang2,6, Jingping Zhao2, Xiaobing Zou7, Kun Xia1,8,9.   

Abstract

Autism spectrum disorder (ASD) describes a group of neurodevelopmental disorders with high heritability, although the underlying genetic determinants of ASDs remain largely unknown. Large-scale whole-genome studies of copy number variation in Han Chinese samples are still lacking. We performed a genome-wide copy number variation analysis of 343 ASD trios, 203 patients with sporadic cases and 988 controls in a Chinese population using Illumina genotyping platforms to identify CNVs and related genes that may contribute to ASD risk. We identified 32 rare CNVs larger than 1 Mb in 31 patients. ASD patients were found to carry a higher global burden of rare, large CNVs than controls. Recurrent de novo or case-private CNVs were found at 15q11-13, Xp22.3, 15q13.1-13.2, 3p26.3 and 2p12. The de novo 15q11-13 duplication was more prevalent in this Chinese population than in those with European ancestry. Several genes, including GRAMD2 and STAM, were implicated as novel ASD risk genes when integrating whole-genome CNVs and whole-exome sequencing data. We also identified several CNVs that include known ASD genes (SHANK3, CDH10, CSMD1) or genes involved in nervous system development (NYAP2, ST6GAL2, GRM6). Besides, our study also implicated Contactins-NYAPs-WAVE1 pathway in ASD pathogenesis. Our findings identify ASD-related CNVs in a Chinese population and implicate novel ASD risk genes and related pathway for further study.

Entities:  

Mesh:

Year:  2017        PMID: 28281572      PMCID: PMC5345089          DOI: 10.1038/srep44155

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Autism spectrum disorders (ASDs) are a group of conditions characterized by impairments in social interaction and communication and by restricted, repetitive patterns of interest or behaviour1. The disorder presents clinically in the first 3 years of life. Its precise prevalence has proven difficult to estimate due to changing practices of diagnosis and ascertainment; rates have increased from no more than 5 per 10,000 individuals throughout the 1980 s to approximately 1 in 68 children today, according to a report from America’s CDC. Genetic epidemiology studies have indicated that ASDs have a high heritability234, suggesting a strong genetic basis for ASDs. Evidence from different cases supports that de novo and rare chromosomal abnormalities, copy number variations (CNVs), single nucleotide variations (SNVs)/small insertion and deletions (InDels) and multiple common genetic variants contribute to the risk of ASD5. Dozens of de novo and rare inherited copy number variations (CNVs) have been found through large-cohort genome-wide studies that involved more than 5000 ASD families of primary European ancestry678910111213. The most significant CNVs are 16p11.2 deletions and duplications, 7q11.23 duplications, 15q11–13 duplications, 1q21.1 duplications, 3q29 deletions, 22q11.2 deletions and duplications, deletions at NRXN1 and deletions at CDH136. These structural variants, many of which have large effects but are individually rare, together may account for approximately 4% of ASD cases6. Recently, whole-exome sequencing studies and candidate resequencing studies of large cohorts have also identified multiple genes with recurrent de novo, likely gene-disruptive (LGD) mutations in ASD patients, such as CHD8, SYNGAP1, DYRK1A, ARID1B, SCN2A, DSCAM, ANK2, ADNP, POGZ, GRIN2B and CHD214151617181920. Common genetic variants have also been reported at 5p14.1 (CDH10-CDH9)21, 5p15.2 (SEMA5A)22, MACROD223, CNTNAP224 and 1p13.2 (CSDE1, TRIM33) loci25 through genome-wide association studies (GWAS). Together, these results suggest that ASD has a complex inheritance pattern; its genetic architecture and underlying mechanism are still largely unknown. Although several large-cohort genome-wide CNV studies have been performed, few studies have investigated CNVs related to ASD in Chinese populations2627. Previously, we performed a GWAS of autism in a Han Chinese population for common genetic variants associated with ASD25. Here, with additional samples, we present a genome-wide study in an attempt to identify CNVs that may contribute to the aetiology of ASD in the Han Chinese population. We found 32 rare, large CNVs longer than 1 Mb accounting for 5.68% of the total patients, of which five CNVs were recurrently identified in ASD patients.

Results

Burden of rare, large CNVs in ASD patients of a Chinese Han population

After strict quality control and CNV calling, 546 ASD subjects (343 trios) and 988 normal controls were analysed (Fig. 1). On average, 18 CNV calls per individual were made for samples genotyped using the HumanCNV370 BeadChip, 24.7 CNV calls for samples genotyped using the HumanCNV610 BeadChip and 357.3 CNV calls for samples genotyped using the HumanCNV660 BeadChip (the HumanCNV660 BeadChip includes many common CNV loci) (Figure S1). As we used low-density BeadChips and multiple genotyping platforms to perform the whole-genome genotyping, small CNV calls would not be confident and would be difficult to integrate from multiple platforms. Therefore, we only considered large CNVs (>1 Mb) in the analysis.
Figure 1

Pipeline of CNV discovery and analysis.

ASD patients and parents were genotyped by an Illumina 370 K or 660 K BeadChip. Control subjects were genotyped using an Illumina 610 K BeadChip. CNV calling and quality control were performed using the PennCNV program. Rare, large CNVs (>1 Mb) were used for validation and analysis. The global burden was subsequently determined. De novo CNVs and inherited CNVs were characterized.

We observed a higher CNV burden in patients with ASD. A total of 33 rare CNVs (<1%) larger than 1 Mb were identified in ASD patients by PennCNV, but one was not consistently validated (Figure S2). Finally, 32 rare CNVs were identified in 31 ASD probands (5.68% of the patients, Table 1), and 19 CNVs were identified in 19 control subjects (1.92% of the control subjects) (Table S1). The ASD patients had a significantly higher number of CNVs than the control subjects (odds ratio: 3.05, 95% CI: 1.66–5.74, p = 1.55E-04, Fisher’s exact test, Fig. 2, Table S2). The CNV burden was higher when considering CNVs larger than 2 Mb (odds ratio: 28.9, 95% CI: 4.47–1208.24, p = 8.7E-07, Fisher’s exact test). Both deletions and duplications contributed to the CNV burden. None of the 32 rare CNVs detected in this study were present in the control subjects. Among the 32 case-private CNVs, there were 16 de novo CNVs, 11 inherited CNVs, and five CNVs with unknown inheritance because of the absence of parental samples. Five CNVs were recurrently identified in 13 ASD patients (Table 1).
Table 1

Rare, large CNVs (>1 Mb) identified in this study.

Region1BandLength (bp)CNV status2Patient IDInheritanceRecurrent3Candidate Gene4
de novo       
chr1:93416265-1056488011p22.1-21.112,232,537DelM11409de novoN56 genes (including OLFM3)
chr5:80564-87232195p15.33-15.28,642,656DelM8820de novoN55 genes (including SLC9A3)
chr6:148602550-1704699346q24.3-q2721,867,385DupM16084de novoN126 genes (including ARID1B)
chr8:791912-160658398p23.3-p2215,273,928DelM9118de novoN102 genes (including RP1L1/XKR6)
chr15:19157192-2619410115q11.2-q13.37,036,910Dup = 4M8145de novoY(5)142 gene (including UBE3A, GABRB3)
chr15:19767013-3069043715q11.2-q13.310,923,425Dup = 4M15042de novoY(5)162 gene (including UBE3A, GABRB3)
chr15:20049770-3050069815q11.2-q13.310,450,929Dup = 4M16079de novoY(5)155 gene (including UBE3A, GABRB3)
chr15:20306549-2621967315q11.2-q13.15,913,125DupM10117de novoY(5)119 gene (including UBE3A, GABRB3)
chr15:20314760-2620886115q11.2-q13.15,894,101DupM16147de novoY(5)109 gene (including UBE3A, GABRB3)
chr15:26762141-2815353915q13.1-13.21,391,399DelM15147de novoY(2)6 genes (including APBA2)
chr15:66041595-7036256315q234,320,969DelM16229de novoN34 (including GRAMD2)
chr16:32090048-3324008716p11.21,150,040DupM8302de novoN6 genes
chr20:55665989-6242615720q13.31-13.336,760,169DupM9118de novoN96 genes
chr22:46871209-4949859022q13.31-13.332,627,381DelM16087de novoN40 genes (including SHANK3)
chrX:3944205-7480499Xp22.33-22.313,536,295DelM8590de novoY(2)6 genes (including NLGN4X)
chrX:4482028-8177903Xp22.32-22.313,695,876DelM15199de novoY(2)10 genes (including NLGN4X)
inheritance unknown
chr2:81088888-822139572p121,125,070DupM8725unknownY(2).
chr3:1137041-29966633p26.31,859,623DupM8560unknownY(2)2 genes (including CNTN4)
chr3:2610044-50126353p26.3-26.22,402,592DupM10120unknownY(2)10 genes (including CNTN4)
chr8:2014561-30789918p23.3-23.21,064,431DupM16201unknownN2 genes (including CSMD1)
chr10:17559724-1866346910q12.331,103,746DupM12457unknownN7 genes (including STAM)
Inherited
chr2:106245033-1077892422q12.2-12.31,544,210DupM16053paternalN5 genes (including ST6GAL2)
chr2:226225481-2272980242q36.31,072,544DupM10045paternalN1 gene (including NYAP2)
chr2:81240319-827404702p121,500,152DupM13360paternalY(2).
chr4:189683236-1906872994q35.21,004,064DelM11400paternalNLINC01060
chr5:177795689-1788658755q35.31,070,187DupM10006paternalN11 genes (including GRM6)
chr5:24638407-266602735p14.2-14.12,021,867DelM8479maternalN1 gene (including CDH10)
chr5:69041250-706722985q13.21,631,049DupM13395maternalN12 genes
chr8:112622028-1144245128q23.31,802,485DupM12449maternalN2 genes (including CSMD3)
chr15:26798209-2815644515q13.1-13.21,358,237DelM12315maternalY(2)6 genes (including APBA2)
chr15:32021107-3329814315q141,277,037DupM11403maternalN19 genes
chr17:14059029-1539903317p121,340,005DelM8767maternalN7 genes

Notes: 1. RefSeq version is hg18; 2. All Del is one copy number, all Dup is three copy numbers except M8145, M15042 and M16079, which are four copy numbers; 3. N represents private CNVs, Y represents recurrent CNVs, the numbers in the braces represent recurrent numbers; 4. All refSeq gene numbers, ASD risk genes or implicated ASD risk genes in the CNV regions are listed in the braces.

Figure 2

Burden analysis of rare, large CNVs in patients and controls.

Deletions, duplications and the combined rate for all CNVs are shown. The CNV size was categorized as >1 Mb and >2 Mb. For each event type, the significance between patients and controls is given at the bottom.

Recurrent de novo or case-private CNVs

The 15q11–13 duplication was found to be one of the most common CNVs in the ASD patients. Of 32 CNVs, five de novo duplications were found at 15q11–13 (Table 1), which accounts for approximately 1% of the ASD patients in our study. We compared the frequency of this CNV with the Simons Simplex Collection (SSC) and Autism Genome Project (AGP) samples with primary European ancestry67 (Table S4). The incidence of 15q11–13 duplications was significantly higher in the Chinese population in our study (5-fold, p = 0.021, Fisher’s exact test). Among the five individuals with 15q11–13 duplications, two individuals carried three copy numbers of 15q11–13 respectively, and three individuals carried four copy numbers respectively. We then performed a karyotyping analysis using the blood samples of the patients and confirmed that the three incidences with four repeats were due to a partial tetrasomy of 15q (idic(15)) (Figure S3). A methylation assay confirmed that four of the duplications originated on maternally derived chromosomes (Table S3). The parent-of-origin of the last one is not confirmed because of the unsufficiency of the DNA quantity. Several other recurrent CNVs were observed at four distinct chromosome regions. First, two deletions (3.5 Mb and 3.7 Mb) were found on chromosome Xp22.3, which includes the NLGN4X gene, in two ASD patients. The first proband (M8590) is a female with a de novo deletion; the other proband (M15199) is a male (hemizygous) whose mother carried the heterozygous deletion. Second, two deletions (1.39 Mb and 1.35 Mb) were found in 15q13.1–13.2, which includes the APBA2 gene (Table 1, Figure S4). One of these deletions was de novo, whereas the other was maternally inherited. This is the first report of deletions involving the APBA2 gene in ASD patients. Third, two duplications (1.9 Mb and 2.4 Mb) were detected on chromosome 3p26, which have been reported in our previous paper28. Both duplications disrupted the gene encoding Contactin, CNTN4, which has been implicated in developmental delays and ASDs. Finally, two duplications (1.1 Mb and 1.5 Mb) were detected on 2p12 in two ASD patients (Table 1). One was paternally inherited; the inheritance of the other is unknown. However, no known gene was found in this region.

Integration of CNV and SNV data identified novel ASD candidate risk genes

In addition to the recurrent CNVs, 19 case-private CNVs were identified in individual ASD patients (Table 1). This included disruptions in four known ASD risk genes, ARID1B, SHANK3, CDH10 and CSMD1. One large de novo deletion (21.9 Mb) at 6q24.3–6q27 contains the ARID1B gene, and one de novo deletion (2.6 Mb) at 22q13.3 includes SHANK3 (Table 1). Two duplications disrupted genes encoding CUB and sushi domain-containing proteins (CSMDs), one of which (1.1 Mb) disrupted CSMD1, which has been implicated in ASD risk by a whole-exome sequencing study29. Notably, we identified another duplication (1.8 Mb) at 8q23.3 disrupting another gene encoding the CUB and sushi domain-containing protein CSMD3 (Table 1). A maternally inherited deletion (2 Mb) at 5p14.2–14.1 disrupted a single gene, CDH10 (Table 1), of which common variants have been implicated in ASD risk21. To identify novel potential ASD risk genes in these case-private CNVs (Table 1), we integrated de novo CNV data (Table S4) from large-cohort whole-genome CNV studies6789101113 and SNV/InDel data (Table S5) from the SSC and Autism Sequencing Consortium (ASC) whole-exome sequencing studies1415 to refine the CNV regions and search for potential de novo mutations or private LGD mutations of the genes represented by our case-private CNVs that were shorter than 5 Mb (n = 11). Finally, three published de novo CNVs were identified as overlapping with a de novo CNV (15q23) in this study (Fig. 3). The overlapped region includes only seven refSeq genes (Fig. 3). We analysed these genes in the SSC and ASC whole-exome sequencing data, and two de novo variants were identified in GRAMD2 (Fig. 3). One of them is a frameshift mutation (p.K59EfsX11); the other is located in the 3’ UTR (c.*37 T > C). No other gene in this region was identified with a de novo mutation, and no de novo mutation of GRAMD2 was identified in the unaffected SSC siblings. In addition, a de novo frameshift mutation (p. F526SfsX3) of CDH10 was identified in an SSC simplex quad family, and an LGD mutation was identified in an ASC patient (Fig. 4). No CDH10 de novo mutation was identified in the unaffected SSC siblings or ASC controls. In addition, a de novo frameshift mutation (p.S261LfsX4) of STAM was identified in an SSC simplex quad family, and a missense de novo mutation was identified in an unaffected sibling (Fig. 4). A private LGD mutation of STAM was identified in an ASC patient. No LGD mutation was found in unaffected siblings and ASC controls.
Figure 3

Convergence of de novo CNVs at 15q23 and de novo mutations of GRAMD2 within the overlapped region.

Red bars indicate the chromosome locations of the four deletions identified in this study (M16229) and other studies (11233.p1, AU008, SK0243-003) (9, 13, 43). Two de novo mutations of GRAMD2 (a frameshift and an SNV in the 3’ UTR) were identified in two SSC simplex quad families (16). SSC family IDs and pedigree plots are presented at the bottom.

Figure 4

Convergence of rare private CNVs in this study and LGD mutations in SSC and ASC families or patients for STAM and CHD10.

(a) Displayed for the STAM gene is a RefSeq gene model (larger ticks are exons), a loss-of-function deletion identified in this study, a de novo frameshift mutation (orange arrows) identified in an SSC quad simplex family and a nonsense mutation (orange arrows) identified in an ASC patient; (b). Displayed for the CDH10 gene is a RefSeq gene model (larger ticks are exons), a disrupted deletion identified in this study and a de novo frameshift mutation (orange arrows) identified in SSC quad simplex families. SSC family IDs and pedigree plots are presented.

Discussion

We have performed a large-cohort genome-wide CNV study of ASD patients in a Chinese population to search for de novo and rare private CNVs that cause or contribute to the risk of ASD. Our study revealed the ASD-related CNVs (>1 Mb) in a Chinese population, which accounted for approximately 4% of the ASD cases. This study revealed multiple known ASD CNVs or genes and implicated novel ASD candidate CNVs or genes. Significantly, the 15q11–13 de novo duplication is more frequently found in Chinese samples compared with primary European samples. There are three plausible explanations for this finding: 1) the power of this study is low, and the finding will not be replicated with large populations; 2) the population analysed here is more severely affected than equivalent populations (SSC/AGP); and 3) there is a higher rate of mutation or penetrance at this locus in Han populations. Unfortunately, we do not have IQ data or other neurological examination data for the patients in our study and are therefore unable to address the second possibility. This is a limitation of our study. A larger detailed phenotyping cohort is needed to further explore this difference. Another limitation of this study is the low-density genotyping array and multiple platforms used. Because of this limitation, small CNVs (<1 Mb) were not investigated in this study. Another limitation is that the sample size is small considering the high genetic heterogeneity of ASD. Further genome-wide CNV studies using high-resolution approaches should be conducted in larger ASD cohorts from Chinese populations. In this study, we identified known ASD risk CNVs or genes and implicated novel ASD risk CNVs or genes. The known ASD risk CNVs include those at 15q11–13 (UBE3A, GABRB3), Xp22.3–22.3 (NLGN4X), CNTN4, 22q13.31–13.33 (SHANK3) and CDH10. The overlapped region of 15 duplications on 15q11–13 identified from this study and other large-cohort studies include UBE3A, GABRB3 and 11 other genes (Figure S5). The role of UBE3A has been clearly demonstrated by both functional studies and mouse model studies. GABRB3 was recently implicated as another risk gene in this region6. NLGN4X has long been recognized as an ASD risk gene. Jamain et al. first identified NLGN4X as an ASD causative gene in 200330. Subsequently, NLGN4X de novo and inherited mutations have been occasionally reported3132. In total, 15 NLGN4X mutations were identified in the ASD patients in this study, including three missense mutations previously reported by our group (Figure S4). CNTN4 has been implicated in development delays and ASDs. Up to this point, three studies, involving six patients or families with CNTN4-disruptive deletions or duplications, have been published283334. The function of APBA2 also supports the potential involvement of this gene in the pathogenesis of ASD. APBA2 encodes a neuronal adaptor protein essential to synaptic transmission. It can form a complex that is able to couple synaptic vesicle exocytosis to neuronal cell adhesion. In addition, APBA2 protein deficiency is associated with impaired social interaction in mice35. By integrating large-scale CNV and SNV data, our study also implicated several new ASD risk genes, including GRAMD2 and STAM, which encode signal transducing adaptor molecules. The function of GRAMD2 and STAM are still unclear. Further studies with both genetic and functional perspectives should be conducted to replicate the finding that these genes are involved in ASD risk. Several genes in the CNV regions that were not specifically described in the result section are also worthy of mention here. The first one is NYAP2, which is disrupted by a paternally inherited duplication (1.07 Mb) at 2q36.3 (Table 1). NYAPs are expressed predominantly in developing neurons. Three genes, NYAP1, NYAP2, NYAP3, encoding the Neuronal tYrosine-phosphorylated Adaptor for the PI3-kinase (NYAP) family of phosphoproteins, have been recognized. Interestingly, the NYAP1 locus (7q22.1) and NYAP3 locus (13q33.3) have suggestive linkage and association signals with ASD2136. Upon stimulation with Contactins (such as CNTN4, CNTNAP2 and CNTN5), NYAPs activate the downstream pathway including WAVE1 complex and PI3K-mTOR pathway3738. The phosphalated NYAPs can active WAVE1 complex which function by interaction with several proteins, including NCKAP1, ABI2, CYFIP2 and HSP300. Recently, recurrent de novo LGD mutations of NCKAP1 were identified in ASD patients15. We further analyzed the SSC and ASC whole-exome sequencing data, two de novo missense mutations of ABI2 were identified in two ASD probands (simplex quad families) but not for siblings (Fig. 5). CYFIP1 was also implicated in ASD previous by both genetic and functional studies3940. Therefore, disruption of Contactins-NYAPs-WAVE1 pathway (Fig. 5) appeared to be an important pathogenesis pathway in ASD etiology. In addition, there are two other genes worthy of mention. One is GRM6 at 5q35.3, and the other is ST6GAL2 at 2q12.2. GRM6 encodes metabotropic glutamate receptor (mGluR). Interestingly, structural defects in the mGluR gene family interaction networks are significantly enriched in ASD patients41. In addition, common variants of ST6GAL2 are associated with the risperidone response of schizophrenic patients42. Both of these genes are good candidates for further study.
Figure 5

Genetic identification of genes involved in Contactins-NYAPs-WAVE1 pathway.

Disruption of Contactin gene, CNTN4, was recurrently identified in this study and other studies. Case-private rare CNV disrupting NYAP2 was also detected in this study. Red bars indicate deletions. The pedigree plots of SSC families with recurrent de novo mutations of NCKAP1 and ABI2 were presented. Orange arrows indicated the mutation location.

Conclusion

We performed the largest genome-wide copy number variation analysis of 546 ASD patients with 343 trios and 988 controls in a Han Chinese population. ASD patients were found to carry a higher global burden of rare, large CNVs. Recurrent de novo or case-private CNVs were found in five loci. The frequency of de novo 15q11–13 duplication was found to be significant higher compared with cohorts of European ancestry. In addition, several genes were implicated as novel ASD risk genes. Our findings identify ASD-related CNVs in a Chinese population, which could facilitate early disease testing and diagnosis and, most importantly, early intervention. Our study also implicated novel ASD risk genes, providing motivation for further functional and translational studies.

Methods

Study subjects

All the subjects who participated in this study completed informed consent before the original sample collection. ASD subjects were diagnosed independently by two experienced psychiatrists according to DSM-IV-TR criteria (American Psychiatric Association, 2000). The diagnostic procedure also included the assessment using a series of tools for neurological examination, mental status examination. A total of 406 ASD case-parent triad families (one affected offspring and two healthy parents) and 225 children with sporadic ASD were recruited for this study. The mean age of the patients was 5.065 years, and the mean age of onset was 2.5 years. An additional 1000 unrelated control subjects were also recruited. These control subjects had no history of ASDs or any other psychiatric diseases, nor did they have a familial history of psychiatric, neurological or autoimmune diseases. The mean age for the control subjects was 34.3 years. In summary, the study subjects include 406 ASD trios, 225 patients with sporadic cases and 1000 healthy controls of Chinese ancestry. This study was approved by the Institutional Review Board (IRB) of the State Key Laboratory of Medical Genetics, School of Life Sciences at Central South University, Changsha, Hunan, China and adhered to the tenets of the Declaration of Helsinki.

Whole-genome genotyping

Genomic DNA was extracted from the whole blood for genome-wide genotyping. The DNA was extracted using a standard proteinase K digestion and phenol-chloroform method. All subject samples were genotyped using an Illumina BeadChip. The samples used in the initial GWAS of autism (290 autism trios and 174 autism cases) were genotyped using the Illumina HumanCNV370-Quad BeadChip including approximately 370 K SNPs, as demonstrated in our GWAS study25. The additional 126 trios and 51 sporadic cases were genotyped using the Illumina Human660W-Quad BeadChip, which included approximately 660 K genetic variations. A cohort of 1,000 healthy controls was genotyped using the Illumina Human610-Quad BeadChip, which included approximately 610 K genetic variations. All SNPs in the HumanCNV370 BeadChip are covered in the Human610 BeadChip. All experiments were carried out in accordance with relevant guidelines.

Quality control and CNV calling

Quality control was first performed based on the whole-genome genotyping data before CNV calling. We excluded individuals with likely poor DNA samples to avoid possible false positive CNVs. Individuals with a missing SNP call rate >2% were excluded. To reduce the noise of the genome-wide intensity signal, we only included samples whose standard deviation (SD) of normalized intensity was less than 0.3. Because wave artefacts roughly correlating with GC content resulting from the hybridization bias of low full-length DNA quantity are known to interfere with the accurate inference of copy number variations, only samples for which the correlation between the LRR and the wave model ranged between −0.2 < X < 0.4 were used in the CNV calling analysis. CNVs were called using the PennCNV algorithm43, which combines multiple sources of information, including the Log R Ratio (LRR) and B Allele Frequency (BAF) at each SNP marker, along with SNP spacing and population frequency of the B allele to generate CNV calls.

CNV verification

The CNVs were validated by quantitative PCR (qPCR). Quantitative PCR validation was performed using the ABI PrismTM 7900HT Sequence Detection System (Applied Biosystems, Foster City, CA). Three pairs of primers were selected from the start, middle and end of each CNV, separately. The sample was analysed in triplicate in a 10 μl reaction mixture (200 nM each primer, Maxima® SYBR Green/ROX qPCR Master Mix (2X) from Fermentas, and 5 or 10 ng of genomic DNA). The values were evaluated using System 7900 Software SDS2.3 (Applied Biosystems, CA). Further data analysis was performed using the qBase method. Reference genes, chosen from COBL, GUSB, and SNCA, were included based on the minimal coefficient of variation, and the data were then normalized by setting a normal control to a value of 1.

Published CNV and exome sequencing data analysis

De novo CNV data were collected from six whole-genome CNV studies6789101113 (Table S3). De novo SNV/InDel data were collected from the SSC and ASC whole-exome sequencing studies1415 (Table S4). All CNVs and genes within our CNV regions of interest were analysed in the de novo CNV and SNV/InDel datasets both in cases and controls/unaffected siblings. We also examined ExAC data for the inferring variant deleteriousness and gene constraint of the implicated genes.

Additional Information

How to cite this article: Guo, H. et al. Genome-wide copy number variation analysis in a Chinese autism spectrum disorder cohort. Sci. Rep. 7, 44155; doi: 10.1038/srep44155 (2017). Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
  43 in total

1.  PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.

Authors:  Kai Wang; Mingyao Li; Dexter Hadley; Rui Liu; Joseph Glessner; Struan F A Grant; Hakon Hakonarson; Maja Bucan
Journal:  Genome Res       Date:  2007-10-05       Impact factor: 9.043

2.  A genomewide screen for autism: strong evidence for linkage to chromosomes 2q, 7q, and 16p.

Authors: 
Journal:  Am J Hum Genet       Date:  2001-07-30       Impact factor: 11.025

3.  Sibling recurrence and the genetic epidemiology of autism.

Authors:  John N Constantino; Yi Zhang; Thomas Frazier; Anna M Abbacchi; Paul Law
Journal:  Am J Psychiatry       Date:  2010-10-01       Impact factor: 18.112

4.  De novo mutations revealed by whole-exome sequencing are strongly associated with autism.

Authors:  Stephan J Sanders; Michael T Murtha; Abha R Gupta; John D Murdoch; Melanie J Raubeson; A Jeremy Willsey; A Gulhan Ercan-Sencicek; Nicholas M DiLullo; Neelroop N Parikshak; Jason L Stein; Michael F Walker; Gordon T Ober; Nicole A Teran; Youeun Song; Paul El-Fishawy; Ryan C Murtha; Murim Choi; John D Overton; Robert D Bjornson; Nicholas J Carriero; Kyle A Meyer; Kaya Bilguvar; Shrikant M Mane; Nenad Sestan; Richard P Lifton; Murat Günel; Kathryn Roeder; Daniel H Geschwind; Bernie Devlin; Matthew W State
Journal:  Nature       Date:  2012-04-04       Impact factor: 49.962

5.  Autism genome-wide copy number variation reveals ubiquitin and neuronal genes.

Authors:  Joseph T Glessner; Kai Wang; Guiqing Cai; Olena Korvatska; Cecilia E Kim; Shawn Wood; Haitao Zhang; Annette Estes; Camille W Brune; Jonathan P Bradfield; Marcin Imielinski; Edward C Frackelton; Jennifer Reichert; Emily L Crawford; Jeffrey Munson; Patrick M A Sleiman; Rosetta Chiavacci; Kiran Annaiah; Kelly Thomas; Cuiping Hou; Wendy Glaberson; James Flory; Frederick Otieno; Maria Garris; Latha Soorya; Lambertus Klei; Joseph Piven; Kacie J Meyer; Evdokia Anagnostou; Takeshi Sakurai; Rachel M Game; Danielle S Rudd; Danielle Zurawiecki; Christopher J McDougle; Lea K Davis; Judith Miller; David J Posey; Shana Michaels; Alexander Kolevzon; Jeremy M Silverman; Raphael Bernier; Susan E Levy; Robert T Schultz; Geraldine Dawson; Thomas Owley; William M McMahon; Thomas H Wassink; John A Sweeney; John I Nurnberger; Hilary Coon; James S Sutcliffe; Nancy J Minshew; Struan F A Grant; Maja Bucan; Edwin H Cook; Joseph D Buxbaum; Bernie Devlin; Gerard D Schellenberg; Hakon Hakonarson
Journal:  Nature       Date:  2009-04-28       Impact factor: 49.962

6.  Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci.

Authors:  Stephan J Sanders; Xin He; A Jeremy Willsey; A Gulhan Ercan-Sencicek; Kaitlin E Samocha; A Ercument Cicek; Michael T Murtha; Vanessa H Bal; Somer L Bishop; Shan Dong; Arthur P Goldberg; Cai Jinlu; John F Keaney; Lambertus Klei; Jeffrey D Mandell; Daniel Moreno-De-Luca; Christopher S Poultney; Elise B Robinson; Louw Smith; Tor Solli-Nowlan; Mack Y Su; Nicole A Teran; Michael F Walker; Donna M Werling; Arthur L Beaudet; Rita M Cantor; Eric Fombonne; Daniel H Geschwind; Dorothy E Grice; Catherine Lord; Jennifer K Lowe; Shrikant M Mane; Donna M Martin; Eric M Morrow; Michael E Talkowski; James S Sutcliffe; Christopher A Walsh; Timothy W Yu; David H Ledbetter; Christa Lese Martin; Edwin H Cook; Joseph D Buxbaum; Mark J Daly; Bernie Devlin; Kathryn Roeder; Matthew W State
Journal:  Neuron       Date:  2015-09-23       Impact factor: 17.173

7.  Strong association of de novo copy number mutations with autism.

Authors:  Jonathan Sebat; B Lakshmi; Dheeraj Malhotra; Jennifer Troge; Christa Lese-Martin; Tom Walsh; Boris Yamrom; Seungtai Yoon; Alex Krasnitz; Jude Kendall; Anthony Leotta; Deepa Pai; Ray Zhang; Yoon-Ha Lee; James Hicks; Sarah J Spence; Annette T Lee; Kaija Puura; Terho Lehtimäki; David Ledbetter; Peter K Gregersen; Joel Bregman; James S Sutcliffe; Vaidehi Jobanputra; Wendy Chung; Dorothy Warburton; Mary-Claire King; David Skuse; Daniel H Geschwind; T Conrad Gilliam; Kenny Ye; Michael Wigler
Journal:  Science       Date:  2007-03-15       Impact factor: 47.728

8.  Common genetic variants on 1p13.2 associate with risk of autism.

Authors:  K Xia; H Guo; Z Hu; G Xun; L Zuo; Y Peng; K Wang; Y He; Z Xiong; L Sun; Q Pan; Z Long; X Zou; X Li; W Li; X Xu; L Lu; Y Liu; Y Hu; D Tian; L Long; J Ou; Y Liu; X Li; L Zhang; Y Pan; J Chen; H Peng; Q Liu; X Luo; W Su; L Wu; D Liang; H Dai; X Yan; Y Feng; B Tang; J Li; Z Miedzybrodzka; J Xia; Z Zhang; X Luo; X Zhang; D St Clair; J Zhao; F Zhang
Journal:  Mol Psychiatry       Date:  2013-11-05       Impact factor: 15.992

9.  Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders.

Authors:  Brian J O'Roak; Laura Vives; Wenqing Fu; Jarrett D Egertson; Ian B Stanaway; Ian G Phelps; Gemma Carvill; Akash Kumar; Choli Lee; Katy Ankenman; Jeff Munson; Joseph B Hiatt; Emily H Turner; Roie Levy; Diana R O'Day; Niklas Krumm; Bradley P Coe; Beth K Martin; Elhanan Borenstein; Deborah A Nickerson; Heather C Mefford; Dan Doherty; Joshua M Akey; Raphael Bernier; Evan E Eichler; Jay Shendure
Journal:  Science       Date:  2012-11-15       Impact factor: 47.728

10.  Convergence of genes and cellular pathways dysregulated in autism spectrum disorders.

Authors:  Dalila Pinto; Elsa Delaby; Daniele Merico; Mafalda Barbosa; Alison Merikangas; Lambertus Klei; Bhooma Thiruvahindrapuram; Xiao Xu; Robert Ziman; Zhuozhi Wang; Jacob A S Vorstman; Ann Thompson; Regina Regan; Marion Pilorge; Giovanna Pellecchia; Alistair T Pagnamenta; Bárbara Oliveira; Christian R Marshall; Tiago R Magalhaes; Jennifer K Lowe; Jennifer L Howe; Anthony J Griswold; John Gilbert; Eftichia Duketis; Beth A Dombroski; Maretha V De Jonge; Michael Cuccaro; Emily L Crawford; Catarina T Correia; Judith Conroy; Inês C Conceição; Andreas G Chiocchetti; Jillian P Casey; Guiqing Cai; Christelle Cabrol; Nadia Bolshakova; Elena Bacchelli; Richard Anney; Steven Gallinger; Michelle Cotterchio; Graham Casey; Lonnie Zwaigenbaum; Kerstin Wittemeyer; Kirsty Wing; Simon Wallace; Herman van Engeland; Ana Tryfon; Susanne Thomson; Latha Soorya; Bernadette Rogé; Wendy Roberts; Fritz Poustka; Susana Mouga; Nancy Minshew; L Alison McInnes; Susan G McGrew; Catherine Lord; Marion Leboyer; Ann S Le Couteur; Alexander Kolevzon; Patricia Jiménez González; Suma Jacob; Richard Holt; Stephen Guter; Jonathan Green; Andrew Green; Christopher Gillberg; Bridget A Fernandez; Frederico Duque; Richard Delorme; Geraldine Dawson; Pauline Chaste; Cátia Café; Sean Brennan; Thomas Bourgeron; Patrick F Bolton; Sven Bölte; Raphael Bernier; Gillian Baird; Anthony J Bailey; Evdokia Anagnostou; Joana Almeida; Ellen M Wijsman; Veronica J Vieland; Astrid M Vicente; Gerard D Schellenberg; Margaret Pericak-Vance; Andrew D Paterson; Jeremy R Parr; Guiomar Oliveira; John I Nurnberger; Anthony P Monaco; Elena Maestrini; Sabine M Klauck; Hakon Hakonarson; Jonathan L Haines; Daniel H Geschwind; Christine M Freitag; Susan E Folstein; Sean Ennis; Hilary Coon; Agatino Battaglia; Peter Szatmari; James S Sutcliffe; Joachim Hallmayer; Michael Gill; Edwin H Cook; Joseph D Buxbaum; Bernie Devlin; Louise Gallagher; Catalina Betancur; Stephen W Scherer
Journal:  Am J Hum Genet       Date:  2014-04-24       Impact factor: 11.025

View more
  20 in total

1.  Maternal fatty acid concentrations and newborn DNA methylation.

Authors:  Sonia L Robinson; Sunni L Mumford; Weihua Guan; Xuehuo Zeng; Keewan Kim; Jeannie G Radoc; Mai-Han Trinh; Kerry Flannagan; Enrique F Schisterman; Edwina Yeung
Journal:  Am J Clin Nutr       Date:  2020-03-01       Impact factor: 7.045

Review 2.  LINE-1 retrotransposons in healthy and diseased human brain.

Authors:  Nicole A Suarez; Angela Macia; Alysson R Muotri
Journal:  Dev Neurobiol       Date:  2017-12-29       Impact factor: 3.964

3.  Genetic analysis of very obese children with autism spectrum disorder.

Authors:  Herman D Cortes; Rachel Wevrick
Journal:  Mol Genet Genomics       Date:  2018-01-11       Impact factor: 3.291

4.  Corpus Callosum Volumes in Children with Autism Spectrum Disorders: Sex-Associated Differences.

Authors:  Yun Zhang; Bin Qin; Longlun Wang; Ke Zhang; Cui Song; Jie Chen; Jinhua Cai; Tingyu Li
Journal:  J Autism Dev Disord       Date:  2022-03-29

5.  Maternal-fetal stress and DNA methylation signatures in neonatal saliva: an epigenome-wide association study.

Authors:  Ritika Sharma; Martin G Frasch; Silvia M Lobmaier; Marta C Antonelli; Camila Zelgert; Peter Zimmermann; Bibiana Fabre; Rory Wilson; Melanie Waldenberger; James W MacDonald; Theo K Bammler
Journal:  Clin Epigenetics       Date:  2022-07-14       Impact factor: 7.259

6.  A general framework for identifying oligogenic combinations of rare variants in complex disorders.

Authors:  Vijay Kumar Pounraja; Santhosh Girirajan
Journal:  Genome Res       Date:  2022-03-17       Impact factor: 9.438

7.  Rare Copy Number Variations and Predictors in Children With Intellectual Disability and Epilepsy.

Authors:  Miriam Kessi; Juan Xiong; Liwen Wu; Lifen Yang; Fang He; Chen Chen; Nan Pang; Haolin Duan; Wen Zhang; Ahmed Arafat; Fei Yin; Jing Peng
Journal:  Front Neurol       Date:  2018-11-19       Impact factor: 4.003

8.  Genome sequencing identifies multiple deleterious variants in autism patients with more severe phenotypes.

Authors:  Hui Guo; Michael H Duyzend; Bradley P Coe; Carl Baker; Kendra Hoekzema; Jennifer Gerdts; Tychele N Turner; Michael C Zody; Jennifer S Beighley; Shwetha C Murali; Bradley J Nelson; Michael J Bamshad; Deborah A Nickerson; Raphael A Bernier; Evan E Eichler
Journal:  Genet Med       Date:  2018-12-03       Impact factor: 8.822

Review 9.  A Review of Oxytocin and Arginine-Vasopressin Receptors and Their Modulation of Autism Spectrum Disorder.

Authors:  Ilaria Cataldo; Atiqah Azhari; Gianluca Esposito
Journal:  Front Mol Neurosci       Date:  2018-02-13       Impact factor: 5.639

Review 10.  Genetic and Epigenetic Etiology Underlying Autism Spectrum Disorder.

Authors:  Sang Hoon Yoon; Joonhyuk Choi; Won Ji Lee; Jeong Tae Do
Journal:  J Clin Med       Date:  2020-03-31       Impact factor: 4.241

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.