| Literature DB >> 23341896 |
Nori Matsunami1, Dexter Hadley, Charles H Hensel, G Bryce Christensen, Cecilia Kim, Edward Frackelton, Kelly Thomas, Renata Pellegrino da Silva, Jeff Stevens, Lisa Baird, Brith Otterud, Karen Ho, Tena Varvil, Tami Leppert, Christophe G Lambert, Mark Leppert, Hakon Hakonarson.
Abstract
Structural variation is thought to play a major etiological role in the development of autism spectrum disorders (ASDs), and numerous studies documenting the relevance of copy number variants (CNVs) in ASD have been published since 2006. To determine if large ASD families harbor high-impact CNVs that may have broader impact in the general ASD population, we used the Affymetrix genome-wide human SNP array 6.0 to identify 153 putative autism-specific CNVs present in 55 individuals with ASD from 9 multiplex ASD pedigrees. To evaluate the actual prevalence of these CNVs as well as 185 CNVs reportedly associated with ASD from published studies many of which are insufficiently powered, we designed a custom Illumina array and used it to interrogate these CNVs in 3,000 ASD cases and 6,000 controls. Additional single nucleotide variants (SNVs) on the array identified 25 CNVs that we did not detect in our family studies at the standard SNP array resolution. After molecular validation, our results demonstrated that 15 CNVs identified in high-risk ASD families also were found in two or more ASD cases with odds ratios greater than 2.0, strengthening their support as ASD risk variants. In addition, of the 25 CNVs identified using SNV probes on our custom array, 9 also had odds ratios greater than 2.0, suggesting that these CNVs also are ASD risk variants. Eighteen of the validated CNVs have not been reported previously in individuals with ASD and three have only been observed once. Finally, we confirmed the association of 31 of 185 published ASD-associated CNVs in our dataset with odds ratios greater than 2.0, suggesting they may be of clinical relevance in the evaluation of children with ASDs. Taken together, these data provide strong support for the existence and application of high-impact CNVs in the clinical genetic evaluation of children with ASD.Entities:
Mesh:
Year: 2013 PMID: 23341896 PMCID: PMC3544904 DOI: 10.1371/journal.pone.0052239
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Case and control samples used in this study.
| case | control | |||
| male | female | male | female | |
| AGRE/AGP | 1,517 | 626 | 0 | 0 |
| CHOP | 633 | 224 | 3,992 | 2,008 |
| sub-total | 2,150 | 850 | 3,992 | 2,008 |
| grand-total | 3,000 | 6,000 | ||
Figure 1Workflow for CNV analysis for samples analyzed on the custom array.
The same process was used for both CNAM and PennCNV analyses. All samples used for CNV analysis in this study had to meet the quality control measures described. Only unrelated cases and controls were used for the final statistical analysis.
Figure 2Manhattan plot of CNVs called both by PennCNV and CNAM.
Association statistics across all regions covered on the Illumina custom array are shown. Since the array used was not a genome-wide array, the width of each chromosome on the plot is not proportional to the chromosome length. Adjacent chromosomes are displayed in alternating red and blue colors to aid in distinguishing them.
Confirmation of CNV calls by quantitative PCR.
| TaqMan CNV Validation Status | Utah Family CNVs | Utah Sequence SNP CNVs | Literature CNVs | Total |
| PASS | 24 (2 overlap with Lit. CNV) | 15 | 25 | 64 |
| FAIL | 9 | 9 | 5 | 23 |
| NoCall | 0 | 1 | 0 | 1 |
A summary of the PCR validation result is shown. Sequence SNP CNVs were discovered in this work using SNVs present on this array for sequence variant confirmation in the same cohort.
Validated CNVs discovered using affected children from Utah families.
| TaqMan validated Utah and sequence SNP CNV regions of significance | |||||||||||
| CNV Origin | Cytoband | CNV Region - Discovery Cohort | CNV Region - Replication Cohort | CNV Type | Total Cases | Total Controls | OddsRatio | P Value | Cases | Controls | Gene/Region |
| Utah CNV | 1q21.1 | chr1:145714421-146101228 | chr1:145703115-145736438 | Dup | 1542 | 5754 | 3.37 | 9.60E-03 | 9 | 10 | CD160, PDZK1 |
| Utah CNV | 1q41 | chr1:215858193-215861879 | chr1:215854466-215861792 | Del | 1540 | 5754 | 2.12 | 5.02E-03 | 22 | 39 | USH2A |
| Utah CNV | 2p16.3 | chr2:51272055-51336043 | chr2:51266798-51339236 | Del | 1542 | 5755 | 14.96 | 8.26E-03 | 4 | 1 | upstream of NRXN1 |
| Utah CNV | 3q26.31 | chr3:172596081-172617355 | chr3:172591359-172604675 | Dup | 1540 | 5754 | 3.74 | 2.11E-01 | 1 | 1 | downstream of SPATA16 |
| Utah CNV | 4q35.2 | chr4:189084983-189117429 | chr4:189084240-189117031 | Del | 1544 | 5762 | 3.74 | 1.98E-01 | 2 | 2 | downstream of TRIML1 |
| Utah CNV | 6p24.3 | chr6:7425246-7464367 | chr6:7461346-7470321 | Del | 1544 | 5762 | ∞ | 2.11E-01 | 1 | 0 | between RIOK1 and DSP |
| Utah CNV | 6q11.1 | chr6:62443739-62462295 | chr6:62426827-62472074 | Dup | 1544 | 5762 | 3.74 | 1.98E-01 | 2 | 2 | KHDRBS2 |
| Utah CNV | 6q24.3 | chr6:147588752-147664671 | chr6:147577803-147684318 | Del | 1533 | 5751 | ∞ | 2.10E-01 | 1 | 0 | STXBP5 |
| Utah CNV | 7p22.1 | chr7:6838712-6864071 | chr7:6870635-6871412 | Dup | 1544 | 5762 | 7.47 | 1.15E-01 | 2 | 1 | upstream of CCZ1B |
| Sequence SNP CNV | 7q21.3 | Not found | chr7:93070811-93116320 | Del | 1544 | 5762 | ∞ | 4.46E-02 | 2 | 0 | CALCR, MIR653, MIR489 |
| Utah CNV | 9p21.1 | chr9:28190069-28347679 | chr9:28207468-28348133 | Del | 1544 | 5761 | 3.74 | 6.72E-02 | 4 | 4 | LINGO2 |
| Utah CNV | 9p21.1 | chr9:28190069-28347679 | chr9:28354180-28354967 | Del | 1544 | 5762 | 3.73 | 3.78E-01 | 1 | 1 | LINGO2 (intron) |
| Utah CNV | 10q23.1 | chr10:83893626-84175018 | chr10:83886963-83888343 | Del | 1505 | 5640 | 3.76 | 1.54E-02 | 7 | 7 | NRG3 (intron) |
| Utah CNV | 10q23.31 | chr10:92274764-92289762 | chr10:92262627-92298079 | Dup | 1544 | 5761 | 7.47 | 1.15E-01 | 2 | 1 | downstream of BC037970 |
| Utah CNV | 12q23.2 | chr12:102097012-102106306 | chr12:102095178-102108946 | Dup | 1544 | 5762 | 7.47 | 1.15E-01 | 2 | 1 | CHPT1 |
| Utah CNV | 13q13.3 | chr13:40087689-40088007 | chr13:40089105-40090197 | Del | 1544 | 5761 | ∞ | 2.11E-01 | 1 | 0 | LHFP (intron) |
| Sequence SNP CNV | 14q32.2 | Not found | chr14:100705631-100828134 | Dup | 1544 | 5762 | 9.36 | 5.99E-03 | 5 | 2 | SLC25A29, YY1, MIR345, SLC25A47, WARS |
| Sequence SNP CNV | 14q32.31 | Not found | chr14:102018946-102026138 | Dup | 1544 | 5762 | 4.62 | 1.01E-14 | 60 | 50 | DIO3AS, DIO3OS |
| Sequence SNP CNV | 14q32.31 | Not found | chr14:102729881-102749930 | Del | 1544 | 5762 | 7.47 | 1.15E-01 | 2 | 1 | MOK |
| Sequence SNP CNV | 14q32.31 | Not found | chr14:102973910-102975572 | Dup | 1544 | 5762 | 3.82 | 8.29E-26 | 136 | 142 | ANKRD9 (RAGE) |
| Sequence SNP CNV | 15q11.2-q13.1 | Not found | chr15:25690465-28513763 | Dup | 1544 | 5762 | 41.05 | 1.82E-08 | 11 | 1 | ATP10A, GABRB3, GABRA5, GABRG3, HERC2 |
| Sequence SNP CNV | 15q13.2–15q13.3 | Not found | chr15:31092983-31369123 | Del | 1543 | 5761 | ∞ | 4.46E-02 | 2 | 0 | FAN1, MTMR10, MIR211, TRPM1 |
| Sequence SNP CNV | 15q13.3 | Not found | chr15:31776648-31822910 | Dup | 1544 | 5762 | 4.40 | 6.91E-06 | 21 | 18 | OTUD7A |
| Sequence SNP CNV | 20q11.22 | Not found | chr20:32210931-32441302 | Dup | 1544 | 5762 | 2.72 | 3.16E-02 | 8 | 11 | NECAB3, CBFA2T2, C20orf144, NECAB3, C20orf134, PXMP4, NECAB3, ZNF341, E2F1, CHMP4B |
CNVs shown here were selected based on their p value, their case/control odds ratio, or both and were subject to molecular validation.
This CNV is contiguous with the chromosome 15q11.2 CNV described in Table 4 based on TaqMan data.
Designates CNVs not previously seen in ASD, based on queries for genes included in or flanking the CNV.
Published CNVs observed in our sample population.
| Cytoband | Literature CNVs | Region of Highest Significance | CNV Type | TaqMan Validation | Total Cases | Total Controls | OddsRatio | P Value | Cases | Controls | Gene/Region |
| 1q21.1 | chr1:146555186-147779086 | chr1:146656292-146707824 | Dup | NT | 1543 | 5761 | 7.48 | 1.15E-01 | 2 | 1 | FMO5 |
| 2p24.3 | chr2:13202218-13248445 | chr2:13203874-13209245 | Del | Validated (chr2:13203874-13209245) | 1544 | 5761 | ∞ | 2.11E-01 | 1 | 0 | upstream of LOC100506474 |
| 2p21 | chr2:45455651-45984915 | chr2:45489954-45492582 | Dup | NT | 1541 | 5756 | ∞ | 4.46E-02 | 2 | 0 | between UNQ6975 and SRBD1 |
| 2p16.3 | chr2:50145644-51259671 | chr2:51237767-51245359 | Del | NT | 1544 | 5762 | ∞ | 1.99E-03 | 4 | 0 | NRXN1 |
| 2p15 | chr2:62258231-63028717 | chr2:62230970-62367720 | Dup | NT | 1543 | 5762 | ∞ | 2.11E-01 | 1 | 0 | COMMD1 |
| 2q14.1 | chr2:115139568-115617934 | chr2:115133493-115140263 | Del | NT | 1543 | 5759 | 7.47 | 1.15E-01 | 2 | 1 | between LOC440900 and DPP10 |
| 3p26.3 | chr3:1940192-1940920 | chr3:1937796-1941004 | Del | Validated (chr3:1937796-1942764) | 1544 | 5760 | 5.60 | 6.70E-02 | 3 | 2 | between CNTN6 and CNTN4 |
| 3p14.1 | chr3:67656832-68957204 | chr3:67657429-68962928 | Del | NT | 1544 | 5762 | ∞ | 2.11E-01 | 1 | 0 | SUCLG2, FAM19A4, FAM19A1 |
| 4q13.3 | chr4:73756500-73905356 | chr4:73766964-73816870 | Dup | Validated (chr4:73753294-74058988) | 1544 | 5760 | ∞ | 2.11E-01 | 1 | 0 | COX18, ANKRD17 |
| 4q33 | chr4:154087652-172339893 | chr4:171366005-171471530 | Del | NT | 1543 | 5761 | ∞ | 4.46E-02 | 2 | 0 | between AADAT and HSP90AA6P |
| 5q23.1 | chr5:118478541-118584821 | chr5:118527524-118589485 | Dup | Validated (chr5:118527524-118614781) | 1541 | 5760 | 3.74 | 1.98E-01 | 2 | 2 | DMXL1, TNFAIP8 |
| 6p21.2 | chr6:39071841-39082863 | chr6:39069291-39072241 | Del | Validated (chr6:39069291-39072241) | 1544 | 5759 | 2.37 | 1.93E-02 | 12 | 19 | SAYSD1 |
| 8q11.23 | chr8:54858496-54907579 | chr8:54855680-54912001 | Dup | Validated (chr8:54855680-54912001) | 1544 | 5762 | ∞ | 2.11E-01 | 1 | 0 | RGS20, TCEA1 |
| 10q11.22 | chr10:46269076-50892143 | chr10:49370090-49471091 | Dup | NT | 1528 | 5750 | 3.77 | 1.96E-01 | 2 | 2 | FRMPD2P1, FRMPD2 |
| 10q11.23 | chr10:50892146-51450787 | chr10:50884949-50943185 | Dup | NT | 1542 | 5760 | 3.74 | 1.98E-01 | 2 | 2 | OGDHL, C10orf53 |
| 12q13.13 | chr12:53183470-53189890 | chr12:53177144-53180552 | Del | Validated (chr12:53177144-53182177) | 1544 | 5762 | ∞ | 4.46E-02 | 2 | 0 | between KRT76 and KRT3 |
| 15q11.1 | chr15:20266959-25480660 | chr15:20192970-20197164 | Dup | Validated (chr15:20192970-20212798) | 1515 | 5632 | 4.97 | 4.06E-02 | 4 | 3 | downstream of HERC2P3 |
| 15q11.2 | chr15:20266959-25480660 | chr15:25099351-25102073 | Del | NT | 1540 | 5761 | 3.75 | 1.13E-01 | 3 | 3 | SNRPN |
| 15q11.2 | chr15:20266959-25480660 | chr15:25099351-25102073 | Dup | NT | 1541 | 5759 | 45.19 | 7.93E-08 | 12 | 1 | SNRPN |
| 15q11.2 | chr15:25582397-25684125 | chr15:25579767-25581658 | Dup | Validated (chr15:25576642-25581880) | 1540 | 5761 | ∞ | 3.86E-06 | 8 | 0 | between SNORD109A and UBE3A |
| 15q11.2 | chr15:25582397-25684125 | chr15:25582882-25662988 | Dup | NT | 1540 | 5762 | 30.08 | 2.82E-05 | 8 | 1 | UBE3A |
| 16p12.2 | chr16:21901310-22703860 | chr16:21958486-22172866 | Dup | NT | 1544 | 5761 | ∞ | 4.47E-02 | 2 | 0 | C16orf52, UQCRC2, PDZD9, VWA3A |
| 16p11.2 | chr16:29671216-30173786 | chr16:29664753-30177298 | Del | NT | 1544 | 5761 | 7.47 | 1.15E-01 | 2 | 1 | DOC2A, ASPHD1, LOC440356, TBX6, LOC100271831, PRRT2, CDIPT, QPRT, YPEL3, PPP4C, MAPK3, SPN, MVP, FAM57B, ZG16, ALDOA, INO80E, SEZ6L2, TAOK2, KCTD13, MAZ, KIF22, GDPD3, C16orf92, C16orf53, TMEM219, C16orf54, HIRIP3 |
| 16q23.3 | chr16:82195236-82722082 | chr16:82423855-82445055 | Dup | NT | 1542 | 5758 | ∞ | 4.46E-02 | 2 | 0 | between MPHOSPH6 and CDH13 |
| 17p12 | chr17:14139846-15282723 | chr17:14132271-14133349 | Dup | Validated (chr17:14132271-14133568) | 1544 | 5762 | 1.60 | 3.57E-01 | 3 | 7 | between COX10 and CDRT15 |
| 17p12 | chr17:14139846-15282723 | chr17:14132271-15282708 | Del | NT | 1544 | 5761 | 5.61 | 6.70E-02 | 3 | 2 | PMP22, CDRT15, TEKT3, MGC12916, CDRT7, HS3ST3B1 |
| 17p12 | chr17:14139846-15282723 | chr17:14952999-15053648 | Dup | NT | 1543 | 5760 | 3.74 | 1.98E-01 | 2 | 2 | between CDRT7 and PMP22 |
| 17p12 | chr17:14139846-15282723 | chr17:15283960-15287134 | Del | Validated (chr17:15283960-15287134) | 1544 | 5761 | 3.74 | 1.13E-01 | 3 | 3 | between TEKT3 and FAM18B2-CDRT4 |
| 20p12.3 | chr20:8044044-8527513 | chr20:8162278-8313229 | Dup | NT | 1544 | 5761 | 3.73 | 1.98E-01 | 2 | 2 | PLCB1 |
| Xp21.2 | chrX:28605682-29974014 | chrX:29944502-29987870 | Dup | NT | 1544 | 5760 | ∞ | 4.47E-02 | 2 | 0 | IL1RAPL1 |
| Xq27.2 | chrX:139998330-140443613 | chrX:140329633-140348506 | Del | Validated (chrX:140329633-140456325) | 1544 | 5762 | 7.48 | 2.06E-02 | 4 | 2 | SPANXC |
| Xq28 | chrX:148858522-149097275 | chrX:148882559-148886166 | Del | Validated (chrX:148882559-149020410) | 1540 | 5754 | ∞ | 4.46E-02 | 2 | 0 | MAGEA8 |
Denotes CNVs contiguous with the chromosome 15q11.2–13.1 CNV shown in Table 3.
Figure 3UCSC Genome browser view of CNVs in the NRXN1 region.
CNVs observed in the vicinity of the NRXN1-alpha transcription start site are shown. Note that most CNVs observed in ASD patients include exon 1 of NRXN1-alpha while only 1 control CNV extends into exon 1. Produced with custom tracks listing CNV calls and uploaded to http://genome.ucsc.edu.
Figure 4UCSC Genome Browser View of CNVs in the GABR Region on chromosome 15q12.
Duplications were called by both PennCNV and by CNAM in this region, however the number of duplications called by each program differed, with many additional duplications called by CNAM. Produced with custom tracks listing CNV calls and uploaded to http://genome.ucsc.edu.
Top Significant Networks Identified by Pathway Analysis using Ingenuity IPA.
| Network | Score |
| Cell-To-Cell Signaling and Interaction, Tissue Development, Gene Expression | 55 |
| Neurological Disease, Behavior, Cardiovascular Disease | 28 |
| Cell Death, Cellular Compromise, Neurological Disease | 26 |
| Cellular Development, Cell Morphology, Nervous System Development and Function | 20 |
| Behavior, Cardiovascular Disease, Neurological Disease | 18 |
Network scores are the –log P for the results of a right-tailed Fisher's Exact Test.
Top Significant Biological Functions identified by Ingenuity IPA and literature searches.
| Function | p-value range | # Genes |
| Neurological Disease | 2.71E-05 - 3.15E-02 | 14 (18) |
| Behavior | 5.93E-05 - 4.36E-02 | 10 |
| Cardiovascular Disease | 8.58E-05 - 4.30E-02 | 10 |
| Cellular Development | 1.39E-04 - 4.77E-02 | 9 |
| Inflammatory response | 4.84E-04 - 2.89E-02 | 6 |
The right-tailed Fisher's exact test was used to calculate P-values representing the probability that selecting genes associated with that pathway or network is due to chance alone. Each functional category represents a collection of associated subcategories, each of which has an associated P-value. For example, within ‘Neurological Disease,’ are subcategories of genes associated with seizures, Huntington Disease, schizophrenia, etc. The P-value range given represents the range of P-values generated for each subcategory. In the first line, 14 genes were associated with a function in Neurological Disease by Ingenuity software. An additional 4 genes were identified as having neurological functions in the literature, giving a total of 18 with known or suspected roles in neurological disease.