| Literature DB >> 23275889 |
Aparna Prasad1, Daniele Merico, Bhooma Thiruvahindrapuram, John Wei, Anath C Lionel, Daisuke Sato, Jessica Rickaby, Chao Lu, Peter Szatmari, Wendy Roberts, Bridget A Fernandez, Christian R Marshall, Eli Hatchwell, Peggy S Eis, Stephen W Scherer.
Abstract
The identification of rare inherited and de novo copy number variations (CNVs) in human subjects has proven a productive approach to highlight risk genes for autism spectrum disorder (ASD). A variety of microarrays are available to detect CNVs, including single-nucleotide polymorphism (SNP) arrays and comparative genomic hybridization (CGH) arrays. Here, we examine a cohort of 696 unrelated ASD cases using a high-resolution one-million feature CGH microarray, the majority of which were previously genotyped with SNP arrays. Our objective was to discover new CNVs in ASD cases that were not detected by SNP microarray analysis and to delineate novel ASD risk loci via combined analysis of CGH and SNP array data sets on the ASD cohort and CGH data on an additional 1000 control samples. Of the 615 ASD cases analyzed on both SNP and CGH arrays, we found that 13,572 of 21,346 (64%) of the CNVs were exclusively detected by the CGH array. Several of the CGH-specific CNVs are rare in population frequency and impact previously reported ASD genes (e.g., NRXN1, GRM8, DPYD), as well as novel ASD candidate genes (e.g., CIB2, DAPP1, SAE1), and all were inherited except for a de novo CNV in the GPHN gene. A functional enrichment test of gene-sets in ASD cases over controls revealed nucleotide metabolism as a potential novel pathway involved in ASD, which includes several candidate genes for follow-up (e.g., DPYD, UPB1, UPP1, TYMP). Finally, this extensively phenotyped and genotyped ASD clinical cohort serves as an invaluable resource for the next step of genome sequencing for complete genetic variation detection.Entities:
Keywords: chromosomal abnormalities; cytogenetics; gene copy number; molecular pathways; rare variants
Mesh:
Year: 2012 PMID: 23275889 PMCID: PMC3516488 DOI: 10.1534/g3.112.004689
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1 CNV analysis workflow. The ASD cases and controls were typed using the Agilent 1M CGH array, and CNVs were identified using two algorithms, DNA Analytics and DNAcopy. CNVs detected by both algorithms were defined as the stringent dataset and were used for novel rare variant discovery. Rare variants were defined as described in the Materials and Methods. The 1884 rare ASD CNVs (as reported in Table S2) were compared with the CNVs obtained from SNP microarray studies and this resulted in identification of 946 rare CNVs that were novel to the Agilent 1M CGH platform. The rare CNVs from only European ASD cases (505 cases) were then used for global rare CNV burden analysis and gene-set association tests by comparison to 1000 PDx controls.
Summary statistics of stringent CNVs found in ASD and PDx control data sets
| Unrelated ASD Cases | PDx Controls | |
|---|---|---|
| No. samples | 676 | 1000 |
| No. males/females | 560/116 | 502/498 |
| No. stringent CNVs | 23,493 | 49,524 |
| Mean no. CNVs/ sample ± SD | 34.75 ± 6.40 | 49.52 ± 5.95 |
| Median no. CNVs/sample | 34 | 49 |
| Mean CNV size, kb ± SD | 97.75 ± 229.412 | 87.50 ± 198.59 |
| Median CNV size, kb | 30 | 30 |
| % gain /% loss | 42.36/57.64 | 48.24/51.76 |
| No. CNVs of size >1 Mb (%) | 471 (2) | 942 (1.90) |
| No. CNVs of size from 100 Kb to 1 Mb (%) | 3715 (15.81) | 7982 (16.12) |
| No. CNVs of size from <100 Kb (%) | 19,307 (82.18) | 40,600 (81.98) |
ASD, autism spectrum disorder; CNV, copy number variations; PDx, control cohort DNA samples.
CNVs detected in the same individual using two algorithms, DNA Analytics and DNAcopy, were merged with the outside probes used as boundaries and defined as stringent CNV dataset containing at least five consecutive probes. Samples containing CNVs larger than 5 Mb were excluded (Table S1).
We observed a significant difference in average number of CNVs per sample in PDx controls than in ASD cases (One-tailed exact Wilcoxon test p value < 2.2e-16) possibly due to reference bias.
Summary statistics of rare CNVs found in ASD and PDx control data sets
| Unrelated ASD Cases | PDx Controls | |
|---|---|---|
| No. samples with ≥1 rare CNV | 630 | 896 |
| No. CNVs | 1884 | 2299 |
| Mean no. CNVs/sample ± SD | 2.99 ± 1.69 | 2.57 ± 1.44 |
| Median no. CNVs/sample | 3 | 2 |
| Mean CNV size, kb ± SD | 109.79 ± 314.56 | 95.62 ± 215.74 |
| Median CNV size, kb | 28.74 | 30.43 |
| % gain/% loss | 39.81/60.19 | 44.04/55.96 |
| No. CNVs of size >1 Mb (%) | 30 (1.59) | 24 (1.04) |
| No. CNVs of size from 100 Kb to 1 Mb (%) | 362 (19.21) | 498 (21.66) |
| No. CNVs of size from <100 Kb (%) | 1492 (79.19) | 1777 (77.29) |
ASD, autism spectrum disorder; CNV, copy number variations; PDx, control cohort DNA samples.
We observed significant difference in average number of rare CNVs in ASD cases compared with PDx controls (one-tailed exact Wilcoxon test p value 0.002287).
Figure 2 A Venn diagram showing comparison of Agilent 1M CNV calls with those detected by other SNP microarray platforms including Illumina 1M single/duo array, Affymetrix500K, Affymetrix6.0, and Illumina Omni 2.5M array for a total of 615 samples. Agilent 1M CGH data yielded 21,346 stringent CNVs and the SNP (other) microarray platforms yielded 21,782 stringent CNVs. A total of 7774 CNVs (36%) detected by the Agilent 1M array (AGLT1M) were detected by other platforms, whereas 8107 CNVs (37%) detected by other microarrays were detected by the Agilent 1M array. A total of 6213 (29%) Agilent 1M CNVs have less than 5 probe coverage in other SNP microarray platforms and 10,173 (47%) CNVs from the other SNP microarray platforms have less than 5 probe coverage in Agilent 1M array and are shown as smaller circles in this figure.
Details of platform comparison
| Arrays | No. Samples | No. Stringent CNVs | Average CNVs/Sample | Average CNV Size, kb | No. Overlapping calls | % Overlap/ Validation | % CNVs With Insufficient Probes in Other Array | Novel CNVs/Sample | |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Agilent 1M | 82 | 2747 | 33.5 ± 6.11 | 107 | 183 | 6 | 67.5 | 31.27 ± 5.38 |
| Affy 500K | 82 | 289 | 3.52 ± 2.28 | 520 | 189 | 65 | 6 | 2.17 ± 1.22 | |
| 2 | Agilent 1M | 262 | 8971 | 34.24 ± 6.45 | 97 | 2725 | 30 | 38 | 23.84 ± 5.30 |
| Ilmn 1M single | 262 | 4779 | 18.24 ± 4.69 | 110 | 2729 | 57 | 59.12 | 7.82 ± 3.11 | |
| 3 | Agilent 1M | 11 | 413 | 37.55 ± 6.76 | 83 | 99 | 24 | 39.70 | 28.55 ± 5.13 |
| Ilmn 1M duo | 11 | 169 | 14.36 ± 4.32 | 84 | 108 | 64 | 67.21 | 5.54 ± 1.81 | |
| 4 | Agilent 1M | 234 | 8270 | 35.49 ± 6.23 | 98 | 4439 | 53 | 5.6 | 16.44 ± 4.27 |
| Affy 6.0 | 234 | 15,315 | 65.45 ± 8.87 | 80 | 4701 | 34 | 77.46 | 45.35 ± 7.39 | |
| 5 | Agilent 1M | 26 | 945 | 36.35 ± 6.27 | 108 | 328 | 35 | 33 | 23.73 ± 4.63 |
| Ilmn Omni 2.5M | 26 | 1230 | 47.31 ± 9.77 | 61 | 380 | 31 | 81.4 | 32.69 ± 8.57 |
ASD, autism spectrum disorder; CNV, copy number variations; PDx, control cohort DNA samples.
% overlap/validation- refers to the percentage of CNVs that were also detected by the other array. For example, for the first platform comparison between Agilent 1M and Affy500K arrays, only 6% of the CNVs detected by Agilent 1M were also detected by Affy500K while 65% of the CNVs detected by Affy500K were also detected or validated by Agilent 1M array.
Novel recurrent/overlapping ASD CNVs (≥ 2 cases) and a de novo CNV
| No. | Chromosome | Sample | Gender | Size, bp | CNV | Origin | Genes | Other Rare Variants |
|---|---|---|---|---|---|---|---|---|
| 1 | 1p36.23 | 61878-L | M | 8602 | Gain | Paternal | 7q31.1 (231.3 kb) loss not in any gene | |
| 1p36.23 | 59800L | M | 12,682 | Loss | Unknown | 2q31.3 (126.4 kb) loss not in any gene;11q14.3 (62.6 kb) loss not in any gene;13q12.13 (9.8 kb) Gain not in any gene;16p13.3 (34 kb) loss | ||
| 2 | 1p21.3 | 82302 | M | 11,699 | Loss | Paternal | 13q33.2 (25.2 kb) loss not in any gene;18p11.21 (1010.9 kb) gain | |
| 1p21.3 | 82062L | M | 10,405 | Loss | Maternal | 7q31.1 (8.4 kb) Gain not in any gene | ||
| 3 | 1q43 | 117370L | F | 13,664 | Loss | Unknown | 7q21.2 (505.2kb) loss | |
| 1q43 | 52401 | M | 13,664 | Loss | Paternal | 15q25.2 (42.8 kb) gain | ||
| 4 | 2p16.3 | 87396 | F | 6925 | Loss | Maternal | 1q32.1 (18.9 kb) Gain not in any gene;2p16.3 (106.6kb) loss not in any gene;5q33.1 (349.1 kb) loss not in any gene | |
| 2p16.3 | 78391 | F | 13,027 | Loss | Maternal | 1q23.1 (21.7 kb) loss | ||
| 2p16.3 | 122686L | M | 8739 | Loss | Paternal | 1q44 (229 kb) gain | ||
| 2p16.3 | L384 | M | 81,779 | Loss | Paternal | 1q25.3 (24 kb) loss | ||
| 5 | 2q14.3 | 111520L | M | 17,780 | Loss | Maternal | 3q23 (56.4 kb) loss | |
| 2q14.3 | 129914 | M | 10,249 | Loss | Maternal | 1p36.22 (15 kb) Gain | ||
| 6 | 2q34 | 90188 | F | 6323 | Loss | Maternal | 2q33.1 (60.5 kb) gain not in any gene;17p13.2 (174.7 kb) loss | |
| 2q34 | 138145L | M | 11,613 | Loss | Not qPCR tested | 4q22.1 (97.3 kb) loss | ||
| 7 | 2q21.2 | 62257L | M | 19,521 | Gain | Paternal | 11p14.3 (36k b) gain | |
| 2q21.2 | 88032 | M | 10,109 | Loss | Maternal | 2q22.1 (23.6 kb) loss | ||
| 8 | 3p12.3 | 156900 | M | 11,610 | Loss | Maternal | 15q15.1 (8.7 kb) loss | |
| 3p12.3 | 52335 | F | 11,610 | Loss | Unknown | 8q21.13 (36.3 kb) loss not in any gene;14q23.3 (96 kb) loss not in any gene;17q25.3 (11.5 kb) loss not in any gene | ||
| 9 | 4q23 | 115813L | F | 45,723 | Gain | Paternal | 17q12 (976.7 kb) gain | |
| 4q23 | 117463L | M | 45,723 | Gain | Paternal | 1q42.13 (110.3 kb) gain | ||
| 10 | 7q31.33 | 119776 | M | 11,094 | Gain | Paternal | 7q34 (15.4 kb) loss not in any gene | |
| 7q31.33 | 44644 | M | 11,094 | Gain | Maternal | 8p11.21 (15.9 kb) gain | ||
| 11 | 7q31.33 | 128860 | M | 34,699 | Gain | Paternal | 7p22.1 (14.8 kb) loss not in any gene;18q22.1 (12.5kb) gain | |
| 7q31.33 | 146436L | M | 16,963 | Gain | Maternal | 9q34.3 (38.7 kb) loss | ||
| 7q31.33 | 130293 | F | 21,856 | Loss | Maternal | 13q21.1 (100 kb) loss not in any gene;Xq27.3 (33.2 kb) gain not in any gene | ||
| 12 | 7q32.1 | 91617 | M | 35,669 | Gain | Unknown | 17q23.2 (9.8 kb) loss | |
| 7q32.1 | 45751 | M | 37,923 | Gain | Maternal | 16q24.2 (12 kb) gain | ||
| 13 | 7q32.3 | 69180 | M | 14,254 | Loss | Paternal | 1p36.32 (40.7 kb) gain not in any gene;13q12.11,13q12.12 (383.8 kb) gain | |
| 7q32.3 | 59144 | M | 15,521 | Gain | Maternal | 2q14.2 (8.5 kb) loss not in any gene;5q35.3 (22.2 kb) gain | ||
| 14 | 8p11.21 | 44644 | M | 15,873 | Gain | Maternal | 7q31.33 (11.1 kb) gain | |
| 8p11.21 | 65690 | M | 11,965 | Loss | Unknown | 4q32.1 (8.8 kb) loss not in any gene;7q11.23 (30.3 kb) Gain not in any gene;15q26.2 (13.8 kb) loss not in any gene;17q12 (25.7 kb) loss not in any gene | ||
| 15 | 8q24.3 | 47389 | F | 12,937 | Loss | Maternal | 8q11.23 (48 kb) loss | |
| 8q24.3 | 110612L | F | 12,937 | Loss | Paternal | 12q13.11 (12.2 kb) loss COL2A1 | ||
| 16 | 9q22.31 | 60666L | M | 7480 | Loss | Paternal | 6q16.1 (87.9 kb) loss | |
| 9q22.31 | 117525L | F | 7480 | Loss | Unknown | 3p26.1 (24.8 kb) loss not in any gene | ||
| 9q22.31 | 60973L | M | 7480 | Loss | Maternal | 7p21.3 (42.5 kb) gain | ||
| 17 | 14q24.3 | 102350 | M | 14,061 | Loss | Maternal | 1q42.13 (10.2 kb) loss not in any gene;5q14.3 (96.8 kb) gain not in any gene;6p21.2 (194.5 kb) gain | |
| 14q31.1 | 95458L | M | 289,711 | Loss | Paternal | 10q25.3 (135 kb) gain | ||
| 18 | 15q25.1 | 117395L | F | 23,541 | Gain | Maternal | 12q21.2 (31.2 kb) loss not in any gene | |
| 15q25.1 | 94478 | M | 17,639 | Gain | Paternal | 19p12 (573.7 kb) loss | ||
| 15q25.1 | 132199L | M | 18,728 | Gain | Maternal | 2p14 (14.9 kb) loss not in any gene;6q12 (411.3 kb) loss | ||
| 19 | 16p13.3 | 110408 | M | 18,045 | Loss | Maternal | 3q13.2 (21.4 kb) loss not in any gene;4p13 (12.6 kb) loss | |
| 16p13.3 | 60973L | M | 18,045 | Loss | Unknown | 7p21.3 (42.5 kb) Gain | ||
| 20 | 17p13.3 | 68672 | M | 24,812 | Gain | Maternal | 4q28.3 (79.3 kb) Gain | |
| 17p13.3 | 50800L | M | 24,812 | Gain | Maternal | 7q31.1-q31.31 (11 Mb) loss encompasses 32 genes including | ||
| 21 | 17p13.1 | 59902L | M | 9003 | Loss | Maternal | 2p21 (25.6 kb) loss | |
| 17p13.1 | 114094L | F | 9003 | Loss | Maternal | − | ||
| 22 | 19p13.11 | 154267L | M | 17,268 | Gain | Paternal | 1p36.23 (11 kb) loss | |
| 19p13.11 | 66673 | F | 19,829 | Gain | Maternal | 1p36.22 (9.5 kb) loss | ||
| 23 | 19q13.32 | 44644 | M | 24,729 | Gain | Paternal | 7q31.33 (11.1 kb) gain | |
| 19q13.32 | 124475 | M | 24,729 | Gain | Unknown | 17p13.1 (428.4 kb) gain | ||
| 19q13.32 | 45554 | M | 24,729 | Gain | Paternal | 1q43,1q42.3 (96.9 kb) gain | ||
| 24 | Xp22.2 | 55310 | M | 19,171 | Gain | Maternal | 3q13.33 (29.1 kb) gain | |
| Xp22.2 | 58294L | M | 19,171 | Gain | Maternal | 7p21.3 (57.3 kb) loss not in any gene;8p22 (89.9 kb) gain | ||
| 25 | Xp21.1 | 100570L | M | 6261 | Loss | Maternal | − | |
| Xp21.1 | 91548L | M | 6917 | Loss | Maternal | 2q31.2 (15.8 kb) loss | ||
| 26 | 14q23.3 | 103018L | M | 36,180 | Loss | − |
ASD, autism spectrum disorder; CNV, copy number variations; F, female; M, male; qPCR, quantitative polymerase chain reaction; UTR, untranslated region.
The CNVs were validated by qPCR. However, accurate breakpoints have not been identified. The size of the CNVs shown is as detected by microarrays.
A CNV encompassing coding regions of the gene are defined as exonic, whereas those encompassing introns were defined as intronic. One of the CNVs intersected untranslated region of the gene (UTRs).
Other rare variants present in the individual.
Inheritance is unknown if the parents DNA were unavailable.
Figure 3 Pedigrees (A-Q) represent ASD families with overlapping/recurrent CNVs in novel loci and a de novo CNV event (from Table 4). The open symbols represent unaffected individuals, filled symbols represent individuals with ASD diagnosis and arrows indicate the probands. Individuals from which DNA was not available (N/A) for testing are denoted inside the symbols.
Figure 4 Genome browser views of a subset of novel rare CNVs occurring in two or more ASD cases or are de novo (from Table 4). The genome coordinates are from genome build 36 (hg18) and, in each panel, ASD case ID numbers are listed next to blue bars (denoting a duplication/gain) or red bar (denoting a deletion/loss). Each panel also shows, in separate tracks, the RefSeq genes, UCSC segmental duplications, and probe distributions for the different microarray platforms used for CNV detection (Affy500K, Affy6.0, Illumina 1M, and Agilent 1M). (A) Maternally inherited duplications at the 15q25.1 locus encompassing an exon of the CIB2 gene in three unrelated ASD cases. (B) Paternally inherited duplications at the 4q23 locus encompassing several exons of the DAPP1 gene in two unrelated ASD cases. (C) Maternally inherited duplications at the 17p13.3 locus encompassing exons of the YWHAE gene in two unrelated ASD cases. (D) Duplications at the 19q13.32 locus encompassing exons of SAE1 gene in four unrelated ASD cases, three of which are novel and the fourth (ASD case 90278) was previously detected using an Affymetrix 6.0 array. (E) A paternally inherited deletion and a maternally inherited duplication at the 7q32.3 locus encompassing exons of the PLXNA4 gene (F). A de novo deletion at the 14q23.3 locus encompassing an intronic region of the GPHN gene in one ASD case.
Other singleton novel rare CNVs
| Chromosome | Sample | Gender | Size, bp | CNV | Origin | Genes | Other Rare Variants |
|---|---|---|---|---|---|---|---|
| 1p36.11 | 88253 | M | 16,863 | Loss | Maternal | 9p21.3 (13.1 kb) loss | |
| 1p34.1 | 94073 | F | 9205 | Loss | Paternal | 2q35 (6.8 kb) loss 40972;8p23.1 (74.5 kb) loss | |
| 2p21 | 115818L | M | 9068 | Loss | Paternal | 16q22.3 (90.8 kb) gain not in any gene | |
| 2p16.3 | 75420 | F | 10,491 | Gain | Maternal | 2p16.3 (10.5 kb) gain | |
| 2p12 | 100678L | F | 52,069 | Gain | Paternal | 1p31.1 (53.5 kb) loss not in any gene;1q42.2 (5 kb) loss | |
| 2q11.2 | 47173L | F | 780,104 | Gain | Unknown | 11 genes | 19q13.43 (843.3 kb) loss |
| 3p22.2 | 58016 | M | 38,181 | Gain | Paternal | 6p22.1 (9.5 kb) loss | |
| 3p14.2 | L656 | M | 332,421 | Loss | Maternal | 1q32.1 (37 kb) loss | |
| 3q13.31 | 52026 | M | 10,004 | Loss | Maternal | 7p11.2 (81.7 kb) gain | |
| 3q13.33 | 55310 | M | 29,147 | Gain | Paternal | 7q21.12 (12.3 kb) loss not in any gene;10q23.1 (23.3 kb) loss | |
| 3q25.1 | 50002 | F | 7969 | Gain | Unknown | 1p36.22 (6.4 kb) loss | |
| 3q26.2 | 46685 | M | 10,633 | Loss | Paternal | 4q13.1 (32.6 kb) gain not in any gene | |
| 3q29 | 76066 | M | 27,516 | Loss | Unknown | 17q25.3 (13.4 kb) gain | |
| 4p15.31 | 100678L | F | 19,015 | Loss | Maternal | 1p31.1 (53.5 kb) loss not in any gene;1q42.2 (5 kb) loss | |
| 4q21.1 | 62345L | M | 8527 | Loss | Paternal | 5q21.1 (20.8 kb) loss not in any gene;14q21.2 (59.5 kb) loss not in any gene | |
| 4q32.1 | 55360 | M | 16,194 | Gain | Maternal | 9p21.3 (18 kb) gain | |
| 5p15.2 | 51119 | M | 12,982 | Loss | Unknown | — | |
| 5p14.3 | 167532 | F | 16,492 | Gain | Maternal | 11p15.4 (98.4 kb) loss | |
| 5q14.3 | 55262-L | M | 665,114 | Gain | Maternal | 2q37.1 (119.8 kb) gain | |
| 5q22.2 | 59269L | F | 25,834 | Loss | Maternal | 22q12.1 (225.1 kb) loss | |
| 5q31.3 | 66559 | M | 19,794 | Gain | Unknown | 6q27 (307.6 kb) gain | |
| 5q33.1 | 60560L | M | 54,785 | Gain | Paternal | 20p13 (22.4 kb) gain | |
| 6p22.1 | 58016 | M | 9549 | Loss | Paternal | 3p22.2 (38.2 kb) gain | |
| 6q26 | 75744 | M | 17,712 | Loss | Paternal | 21q21.3 (8.1 kb) loss | |
| 7p21.3 | 68687 | M | 258,183 | Loss | Maternal | 1q32.1 (145.7 kb) gain | |
| 8p21.2 | 88810 | M | 28,281 | Gain | Paternal | 6p22.1 (17.3 kb) gain not in any gene;19q13.43 (24.1 kb) gain | |
| 9p22.3 | 131240 | M | 8341 | Loss | Paternal | 5p13.2 (51.9 kb) gain | |
| 9p22.1 | 61180-L | M | 7974 | Loss | Paternal | 3p21.31 (20.4 kb) loss | |
| 9p13.3 | 59641L | F | 190,658 | Gain | Maternal | 6q24.1 (13.2 kb) loss | |
| 10q25.1 | 115816L | M | 9442 | Loss | Maternal | 11q23.3 (9.2 kb) loss | |
| 11q12.2 | 72816L | M | 15,159 | Loss | Paternal | 7q31.31 (19.4 kb) gain not in any gene;8q22.1 (19.8 kb) loss not in any gene;12q12 (25 kb) gain | |
| 11q23.3 | 96280L | M | 19,424 | Gain | Paternal | 3p12.3 (123 kb) loss not in any gene;13q33.1 (74.2 kb) loss | |
| 11q24.2 | 138952L | F | 11,398 | Loss | Maternal | 4p13 (71.3 kb) gain not in any gene;11q14.1 (28.8 kb) gain not in any gene;14q31.2 (27.9 kb) loss not in any gene | |
| 11q24.3 | 98320L | M | 8428 | Loss | Maternal | 1q31.1 (321.4 kb) loss not in any gene;4q13.1 (1326.4 kb) gain not in any gene;5q23.1 (42 kb) loss not in any gene;7q31.1 (98.6 kb) loss | |
| 12q15 | 59794L | M | 9279 | Loss | Maternal | 5q35.2 (8 kb) loss | |
| 12q24.33 | 50280 | M | 202,302 | Gain | Maternal | 3q21.3 (6.1 kb) loss | |
| 14q23.2 | 50002 | F | 8427 | Gain | Unknown | 1p36.22 (6.4 kb) loss | |
| 15q24.2 | 62261L | M | 12,936 | Loss | Paternal | 5q14.1 (15.3 kb) gain | |
| 15q25.1 | 68388 | F | 7142 | Loss | Paternal | 1p32.2 (567.4 kb) gain | |
| 16p13.3 | 47378 | M | 13,736 | Loss | Maternal | 2p23.1 (42.7 kb) loss | |
| 16p11.2 | 100564 | F | 546,709 | Gain | Maternal | 27 genes | 8q12.3 (17.9 kb) loss |
| 16q24.1 | 100678L | F | 12,352 | Loss | Maternal | 1p31.1 (53.5 kb) loss not in any gene;1q42.2 (5 kb) loss | |
| 16q24.2 | 68711 | M | 17,969 | Loss | Maternal | 2p23.1 (61.4 kb) loss | |
| 17p13.1 | 47387 | M | 7758 | Gain | Maternal | 2p23.1 (7.4 kb) loss | |
| 17q11.2 | 87042 | F | 14,177 | Loss | Maternal | − | |
| 17q25.3 | 103818L | M | 48,984 | Gain | Maternal | 8q24.13 (54.1 kb) loss not in any gene | |
| 19q13.32 | 85287L | M | 17,455 | Loss | Maternal | 16q22.1 (36.5 kb) gain | |
| 19q13.33 | 168753 | M | 61,936 | Loss | Maternal | 4p15.33 (5.4 kb) loss not in any gene | |
| 20p13 | 92540L | M | 15,645 | Loss | Paternal | 2q34 (23 kb) loss | |
| 22q11.21 | 118909L | F | 7277 | Loss | Unknown | 7q22.2 (12.4 kb) gain not in any gene;10q21.3 (8.1 kb) loss | |
| 22q11.22-q11.23 | MM0177-3 | M | 656,280 | Gain | Maternal | − | |
| 22q11.23 | 154266L | M | 6784 | Loss | Maternal | 11p14.3 (138.6 kb) loss not in any gene | |
| Xq28 | 100676L | M | 123,871 | Gain | Maternal | 3q22.3 (11.5 kb) loss not in any gene;5p14.2 (12.2 kb) loss not in any gene;22q12.3 (35.6 kb) gain not in any gene |
ASD, autism spectrum disorder; CNV, copy number variations; F, female; M, male; qPCR, quantitative polymerase chain reaction; UTR, untranslated region.
These CNVs were validated by qPCR. However accurate breakpoints have not been identified. The size of the CNVs shown is as detected by microarrays.
A CNV encompassing coding regions of the gene are defined as exonic, whereas those encompassing introns were defined as intronic.
Other rare variants present in the individual.
Inheritance is unknown if the parents DNA were unavailable.
Figure 5 Genome browser view of a subset of the novel rare CNVs found in one ASD case that impact one or more exons (Table 5). The genome coordinates are from genome build 36 (hg18) and, in each panel, the ASD case ID number is listed next to a red bar (denoting a deletion/loss). Each panel also shows, in separate tracks, the RefSeq genes and probe distributions for the different microarray platforms used for CNV detection (Affy500K, Affy6.0, Illumina 1M and Agilent 1M). (A) Paternally inherited deletion at the 9p22.1 locus impacting the 5′-end of the SLC24A2 gene. (B) Deletion at the 22q11.21 locus resulting in loss of an exon of the CECR2 gene. (C) Paternally inherited deletion at the 11q12.2 impacting the 3′-end of the DAGLA gene. (D) Maternally inherited deletion at the 22q11.23 impacting exons near the 5′-end of the UPB1 gene. (E) Pedigrees of ASD families for the CNVs in novel loci occurring in one ASD case that correspond to panels A-D. The open symbols represent unaffected individuals, filled symbols represent individuals with ASD diagnosis and arrows indicate the probands. Individuals from which DNA was not available (N/A) for testing are denoted inside the symbols.
Global burden of rare CNVs in European ASD cases vs. PDx controls
| Type | Measure | Mean ASD | Mean Controls | Mean ratio ASD/controls | |
|---|---|---|---|---|---|
| All | No. CNV | 2.680942 | 2.565848 | 1.05 | 0.0575976 |
| All | Total CNV size, kb | 124.50 | 112.92 | 1.10 | 0.1804119 |
| All | Total no. genes | 3.850107 | 2.560268 | 1.50 | 0.2345081 |
| Deletions only | No. CNV | 1.958225 | 1.773793 | 1.10 | 0.0046012 |
| Deletions only | Total CNV size, kb | 54.29 | 48.81 | 1.11 | 0.1043462 |
| Deletions only | Total number of genes | 1.741514 | 0.9834483 | 1.77 | 0.0087554 |
| Duplications only | No. CNV | 1.549383 | 1.601266 | 0.97 | 0.7716051 |
| Duplications only | Total CNV size, kb | 101.66 | 96.02 | 1.11 | 0.3778319 |
| Duplications only | Total no. genes | 3.490741 | 2.495253 | 1.40 | 0.8564186 |
Significant differences (P ≤ 0.01) are indicated.
CNV burden in known ASD genes in cases vs. PDx controls
| Type | Size | Case counts | Control counts | ASD % | CT % | |
|---|---|---|---|---|---|---|
| All | 110 | 14 | 15 | 4.307692 | 2.340094 | 0.0700680 |
| Deletions | 110 | 11 | 1 | 5.641026 | 0.288184 | 8.90E-05 |
| Duplications | 110 | 3 | 14 | 1.369863 | 3.030303 | 0.9480197 |
ASD, autism spectrum disorder; CNV, copy number variations; PDx, control cohort DNA samples.
Number of genes causally implicated in ASD (Betancur 2011).
Significant differences (p ≤ 0.01) are indicated; ASD and CT (controls) % are based on the total number of subjects with at least one rare exonic CNV of the respective type.
Figure 6 A gene-set functional enrichment map of ASD. Each node represents gene-sets with its size proportional to the total number of genes in each set. The map shows gene sets enriched for deletions with enrichment significance (permutation-based FDR) represented as a node color gradient.