| Literature DB >> 28281572 |
Hui Guo1,2, Yu Peng1, Zhengmao Hu1, Ying Li1, Guanglei Xun3, Jianjun Ou2, Liangdan Sun4, Zhimin Xiong5, Yanling Liu1, Tianyun Wang1, Jingjing Chen1, Lu Xia1, Ting Bai1, Yidong Shen2, Qi Tian1, Yiqiao Hu1, Lu Shen1, Rongjuan Zhao1, Xuejun Zhang4, Fengyu Zhang2,6, Jingping Zhao2, Xiaobing Zou7, Kun Xia1,8,9.
Abstract
Autism spectrum disorder (ASD) describes a group of neurodevelopmental disorders with high heritability, although the underlying genetic determinants of ASDs remain largely unknown. Large-scale whole-genome studies of copy number variation in Han Chinese samples are still lacking. We performed a genome-wide copy number variation analysis of 343 ASD trios, 203 patients with sporadic cases and 988 controls in a Chinese population using Illumina genotyping platforms to identify CNVs and related genes that may contribute to ASD risk. We identified 32 rare CNVs larger than 1 Mb in 31 patients. ASD patients were found to carry a higher global burden of rare, large CNVs than controls. Recurrent de novo or case-private CNVs were found at 15q11-13, Xp22.3, 15q13.1-13.2, 3p26.3 and 2p12. The de novo 15q11-13 duplication was more prevalent in this Chinese population than in those with European ancestry. Several genes, including GRAMD2 and STAM, were implicated as novel ASD risk genes when integrating whole-genome CNVs and whole-exome sequencing data. We also identified several CNVs that include known ASD genes (SHANK3, CDH10, CSMD1) or genes involved in nervous system development (NYAP2, ST6GAL2, GRM6). Besides, our study also implicated Contactins-NYAPs-WAVE1 pathway in ASD pathogenesis. Our findings identify ASD-related CNVs in a Chinese population and implicate novel ASD risk genes and related pathway for further study.Entities:
Mesh:
Year: 2017 PMID: 28281572 PMCID: PMC5345089 DOI: 10.1038/srep44155
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Pipeline of CNV discovery and analysis.
ASD patients and parents were genotyped by an Illumina 370 K or 660 K BeadChip. Control subjects were genotyped using an Illumina 610 K BeadChip. CNV calling and quality control were performed using the PennCNV program. Rare, large CNVs (>1 Mb) were used for validation and analysis. The global burden was subsequently determined. De novo CNVs and inherited CNVs were characterized.
Rare, large CNVs (>1 Mb) identified in this study.
| Region1 | Band | Length (bp) | CNV status2 | Patient ID | Inheritance | Recurrent3 | Candidate Gene4 |
|---|---|---|---|---|---|---|---|
| chr1:93416265-105648801 | 1p22.1-21.1 | 12,232,537 | Del | M11409 | N | 56 genes (including OLFM3) | |
| chr5:80564-8723219 | 5p15.33-15.2 | 8,642,656 | Del | M8820 | N | 55 genes (including SLC9A3) | |
| chr6:148602550-170469934 | 6q24.3-q27 | 21,867,385 | Dup | M16084 | N | 126 genes (including ARID1B) | |
| chr8:791912-16065839 | 8p23.3-p22 | 15,273,928 | Del | M9118 | N | 102 genes (including RP1L1/XKR6) | |
| chr15:19157192-26194101 | 15q11.2-q13.3 | 7,036,910 | Dup = 4 | M8145 | Y(5) | 142 gene (including UBE3A, GABRB3) | |
| chr15:19767013-30690437 | 15q11.2-q13.3 | 10,923,425 | Dup = 4 | M15042 | Y(5) | 162 gene (including UBE3A, GABRB3) | |
| chr15:20049770-30500698 | 15q11.2-q13.3 | 10,450,929 | Dup = 4 | M16079 | Y(5) | 155 gene (including UBE3A, GABRB3) | |
| chr15:20306549-26219673 | 15q11.2-q13.1 | 5,913,125 | Dup | M10117 | Y(5) | 119 gene (including UBE3A, GABRB3) | |
| chr15:20314760-26208861 | 15q11.2-q13.1 | 5,894,101 | Dup | M16147 | Y(5) | 109 gene (including UBE3A, GABRB3) | |
| chr15:26762141-28153539 | 15q13.1-13.2 | 1,391,399 | Del | M15147 | Y(2) | 6 genes (including APBA2) | |
| chr15:66041595-70362563 | 15q23 | 4,320,969 | Del | M16229 | N | 34 (including GRAMD2) | |
| chr16:32090048-33240087 | 16p11.2 | 1,150,040 | Dup | M8302 | N | 6 genes | |
| chr20:55665989-62426157 | 20q13.31-13.33 | 6,760,169 | Dup | M9118 | N | 96 genes | |
| chr22:46871209-49498590 | 22q13.31-13.33 | 2,627,381 | Del | M16087 | N | 40 genes (including SHANK3) | |
| chrX:3944205-7480499 | Xp22.33-22.31 | 3,536,295 | Del | M8590 | Y(2) | 6 genes (including NLGN4X) | |
| chrX:4482028-8177903 | Xp22.32-22.31 | 3,695,876 | Del | M15199 | Y(2) | 10 genes (including NLGN4X) | |
| chr2:81088888-82213957 | 2p12 | 1,125,070 | Dup | M8725 | unknown | Y(2) | . |
| chr3:1137041-2996663 | 3p26.3 | 1,859,623 | Dup | M8560 | unknown | Y(2) | 2 genes (including CNTN4) |
| chr3:2610044-5012635 | 3p26.3-26.2 | 2,402,592 | Dup | M10120 | unknown | Y(2) | 10 genes (including CNTN4) |
| chr8:2014561-3078991 | 8p23.3-23.2 | 1,064,431 | Dup | M16201 | unknown | N | 2 genes (including CSMD1) |
| chr10:17559724-18663469 | 10q12.33 | 1,103,746 | Dup | M12457 | unknown | N | 7 genes (including STAM) |
| chr2:106245033-107789242 | 2q12.2-12.3 | 1,544,210 | Dup | M16053 | paternal | N | 5 genes (including ST6GAL2) |
| chr2:226225481-227298024 | 2q36.3 | 1,072,544 | Dup | M10045 | paternal | N | 1 gene (including NYAP2) |
| chr2:81240319-82740470 | 2p12 | 1,500,152 | Dup | M13360 | paternal | Y(2) | . |
| chr4:189683236-190687299 | 4q35.2 | 1,004,064 | Del | M11400 | paternal | N | LINC01060 |
| chr5:177795689-178865875 | 5q35.3 | 1,070,187 | Dup | M10006 | paternal | N | 11 genes (including GRM6) |
| chr5:24638407-26660273 | 5p14.2-14.1 | 2,021,867 | Del | M8479 | maternal | N | 1 gene (including CDH10) |
| chr5:69041250-70672298 | 5q13.2 | 1,631,049 | Dup | M13395 | maternal | N | 12 genes |
| chr8:112622028-114424512 | 8q23.3 | 1,802,485 | Dup | M12449 | maternal | N | 2 genes (including CSMD3) |
| chr15:26798209-28156445 | 15q13.1-13.2 | 1,358,237 | Del | M12315 | maternal | Y(2) | 6 genes (including APBA2) |
| chr15:32021107-33298143 | 15q14 | 1,277,037 | Dup | M11403 | maternal | N | 19 genes |
| chr17:14059029-15399033 | 17p12 | 1,340,005 | Del | M8767 | maternal | N | 7 genes |
Notes: 1. RefSeq version is hg18; 2. All Del is one copy number, all Dup is three copy numbers except M8145, M15042 and M16079, which are four copy numbers; 3. N represents private CNVs, Y represents recurrent CNVs, the numbers in the braces represent recurrent numbers; 4. All refSeq gene numbers, ASD risk genes or implicated ASD risk genes in the CNV regions are listed in the braces.
Figure 2Burden analysis of rare, large CNVs in patients and controls.
Deletions, duplications and the combined rate for all CNVs are shown. The CNV size was categorized as >1 Mb and >2 Mb. For each event type, the significance between patients and controls is given at the bottom.
Figure 3Convergence of de novo CNVs at 15q23 and de novo mutations of GRAMD2 within the overlapped region.
Red bars indicate the chromosome locations of the four deletions identified in this study (M16229) and other studies (11233.p1, AU008, SK0243-003) (9, 13, 43). Two de novo mutations of GRAMD2 (a frameshift and an SNV in the 3’ UTR) were identified in two SSC simplex quad families (16). SSC family IDs and pedigree plots are presented at the bottom.
Figure 4Convergence of rare private CNVs in this study and LGD mutations in SSC and ASC families or patients for STAM and CHD10.
(a) Displayed for the STAM gene is a RefSeq gene model (larger ticks are exons), a loss-of-function deletion identified in this study, a de novo frameshift mutation (orange arrows) identified in an SSC quad simplex family and a nonsense mutation (orange arrows) identified in an ASC patient; (b). Displayed for the CDH10 gene is a RefSeq gene model (larger ticks are exons), a disrupted deletion identified in this study and a de novo frameshift mutation (orange arrows) identified in SSC quad simplex families. SSC family IDs and pedigree plots are presented.
Figure 5Genetic identification of genes involved in Contactins-NYAPs-WAVE1 pathway.
Disruption of Contactin gene, CNTN4, was recurrently identified in this study and other studies. Case-private rare CNV disrupting NYAP2 was also detected in this study. Red bars indicate deletions. The pedigree plots of SSC families with recurrent de novo mutations of NCKAP1 and ABI2 were presented. Orange arrows indicated the mutation location.