| Literature DB >> 19557195 |
Maja Bucan1, Brett S Abrahams, Kai Wang, Joseph T Glessner, Edward I Herman, Lisa I Sonnenblick, Ana I Alvarez Retuerto, Marcin Imielinski, Dexter Hadley, Jonathan P Bradfield, Cecilia Kim, Nicole B Gidaya, Ingrid Lindquist, Ted Hutman, Marian Sigman, Vlad Kustanovich, Clara M Lajonchere, Andrew Singleton, Junhyong Kim, Thomas H Wassink, William M McMahon, Thomas Owley, John A Sweeney, Hilary Coon, John I Nurnberger, Mingyao Li, Rita M Cantor, Nancy J Minshew, James S Sutcliffe, Edwin H Cook, Geraldine Dawson, Joseph D Buxbaum, Struan F A Grant, Gerard D Schellenberg, Daniel H Geschwind, Hakon Hakonarson.
Abstract
The genetics underlying the autism spectrum disorders (ASDs) is complex and remains poorly understood. Previous work has demonstrated an important role for structural variation in a subset of cases, but has lacked the resolution necessary to move beyond detection of large regions of potential interest to identification of individual genes. To pinpoint genes likely to contribute to ASD etiology, we performed high density genotyping in 912 multiplex families from the Autism Genetics Resource Exchange (AGRE) collection and contrasted results to those obtained for 1,488 healthy controls. Through prioritization of exonic deletions (eDels), exonic duplications (eDups), and whole gene duplication events (gDups), we identified more than 150 loci harboring rare variants in multiple unrelated probands, but no controls. Importantly, 27 of these were confirmed on examination of an independent replication cohort comprised of 859 cases and an additional 1,051 controls. Rare variants at known loci, including exonic deletions at NRXN1 and whole gene duplications encompassing UBE3A and several other genes in the 15q11-q13 region, were observed in the course of these analyses. Strong support was likewise observed for previously unreported genes such as BZRAP1, an adaptor molecule known to regulate synaptic transmission, with eDels or eDups observed in twelve unrelated cases but no controls (p = 2.3x10(-5)). Less is known about MDGA2, likewise observed to be case-specific (p = 1.3x10(-4)). But, it is notable that the encoded protein shows an unexpectedly high similarity to Contactin 4 (BLAST E-value = 3x10(-39)), which has also been linked to disease. That hundreds of distinct rare variants were each seen only once further highlights complexity in the ASDs and points to the continued need for larger cohorts.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19557195 PMCID: PMC2695001 DOI: 10.1371/journal.pgen.1000536
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
CNVs (>500 kb) on 16p11, 15q11–13, and 22q11 are present in a subset of AGRE families.
| Region | #SNP | Length (bp) | Type | AGRE ID | Scored status | Inheritance status | Shared by affected sibling? | Previous reports |
| 15q11–13 | 1246 | 5,902,313 | dup | AU010601 | parent |
| ||
| 15q11–13 | 1246 | 5,902,313 | dup | AU010604 | Autism | Inherited | No |
|
| 15q11–13 | 1246 | 5,902,313 | dup | AU1331202 | parent | |||
| 15q11–13 | 1246 | 5,902,313 | dup | AU1331302 | Autism | inherited | Yes | |
| 15q11–13 | 1246 | 5,902,313 | dup | AU1331303 | Autism | inherited | Yes | |
| 15q11–13 | 1130 | 5,008,629 | dup | AU006501 | parent | |||
| 15q11–13 | 1130 | 5,008,629 | dup | AU006503 | Spectrum | inherited | Yes | AGRE cytogenetic annotation |
| 15q11–13 | 1130 | 5,008,629 | dup | AU006504 | Autism | inherited | Yes | AGRE cytogenetic annotation, |
| 15q11–13 | 1130 | 5,008,629 | dup | AU1135202 | Autism |
| NA | |
| 15q11–13 | 1127 | 4,993,869 | dup | AU023303 | Spectrum | NA | Yes |
|
| 15q11–13 | 1127 | 4,993,869 | dup | AU023304 | Autism | NA | Yes |
|
| 15q11–13 | 1127 | 4,993,869 | dup | AU1607307 | Autism |
| No | |
| 15q11–13 | 569 | 3,540,078 | del | AU1024202 | parent | |||
| 15q11–13 | 569 | 3,540,078 | del | AU1024301 | Autism | inherited | NA | |
| 15q11–13 | 437 | 1,347,744 | dup | AU038504 | Autism |
| No |
|
| 15q11–13 | 287 | 1,578,642 | dup | AU1208301 | Autism |
| No | |
| 15q11–13 | 273 | 1,517,841 | dup | AU1875202 | parent | |||
| 15q11–13 | 98 | 572,462 | dup | AU052003 | Autism | NA | Yes |
|
| 15q11–13 | 98 | 572,462 | dup | AU052004 | Autism | NA | Yes | |
| 16p11.2 | 47 | 530,466 | del | AU0154302 | Autism |
| Yes |
|
| 16p11.2 | 47 | 530,466 | del | AU0154303 | Autism |
| Yes |
|
| 16p11.2 | 47 | 530,466 | del | AU029803 | Autism |
| No |
|
| 16p11.2 | 47 | 530,466 | del | AU041905 | Autism |
| No |
|
| 16p11.2 | 47 | 530,466 | del | AU0938301 | Autism |
| No |
|
| 16p11.2 | 47 | 530,466 | dup | AU002901 | parent |
| ||
| 16p11.2 | 47 | 530,466 | dup | AU002903 | Autism | inherited | Yes |
|
| 16p11.2 | 47 | 530,466 | dup | AU002904 | None | inherited |
| |
| 16p11.2 | 47 | 530,466 | dup | AU002905 | Autism | inherited | Yes |
|
| 22q11.21 | 512 | 2,534,567 | dup | AU001802 | parent |
| ||
| 22q11.21 | 512 | 2,534,567 | dup | AU001804 | Autism | inherited | No |
|
| 22q11.21 | 512 | 2,534,567 | dup | AU004903 | Autism |
| No |
|
| 22q11.21 | 335 | 1,429,207 | dup | AU0991301 | Autism | NA | No | |
| 22q11.21 | 177 | 728,859 | dup | AU1334201 | parent | |||
| 22q11.21 | 177 | 728,859 | dup | AU1334302 | Spectrum | inherited | No | |
| 22q11.21 | 149 | 601,423 | del | AU1555302 | Autism | NA | NA |
Figure 1TaqMan experiments validate large de novo CNV calls.
To validate results using an independent method we designed TaqMan assays to evaluate gene dosage. Results from representative experiments highlight results at loci at 1q21, 8q21, and 10q24. AGRE individual harboring deletions (red arrows) or gains (green arrows) are indicated.
Figure 2Rare exonic deletions (eDels) in NRXN1 and novel candidate genes alter predicted protein structures.
For each of BZRAP1 (a) NRXN1 (b) and MDGA2 (c) reference loci and encoded proteins (top) are contrasted against mutant loci and corresponding proteins (bottom; grey shading). Unique genomic deletions and corresponding protein truncations are highlighted in red and with black hatching, respectively. Schematized protein domains genes are as follows: BZRAP1—Src homology-3 (orange square), Fibronectin, type III (blue oval); NRXN1—Laminin G (orange hexagon), EGF-like (blue oval), 4.1 binding motif (green rectangle); MDGA2—IG-like domains (blue pentagon), MAM aka Meprin/A5-protein/PTPmu (blue oval).
A subset of eDel loci were observed to harbor rare variants in both discovery and replication cohorts, but none of 2539 controls. eDel: exonic deletion; ACRD: autism chromosome rearrangement database (http://projects.tcag.ca/autism/).
| Gene | Locus | Unrelated Discovery Cases (n = 912) | UnrelatedDiscovery Controls (n = 1488) | Unaffected Carrier Fraction | Unrelated Replication cases (n = 859) | Unrelated Replication controls (n = 1051) | ACRD? | Combined P-value (Fisher Exact) |
|
| 1p36 | 3 | 0 | 0.50 | 1 | 0 | No | 0.028 |
|
| 1q44 | 3 | 0 | 0.40 | 1 | 0 | No | 0.028 |
|
| 2p16 | 5 | 0 | 0.46 | 4 | 0 | Yes | 3.3×10−4 |
|
| 2q23.3 | 4 | 0 | 0.54 | 1 | 0 | Yes | 0.012 |
|
| 3p14 | 2 | 0 | 0.25 | 1 | 0 | Yes | 0.069 |
|
| 6p12 | 3 | 0 | 0.36 | 2 | 0 | No | 0.012 |
|
| 7q31.3 | 3 | 0 | 0.60 | 1 | 0 | Yes | 0.028 |
|
| ||||||||
|
| 11q13 | 3 | 0 | 0.43 | 1 | 0 | No | 0.028 |
|
| ||||||||
|
| 11q13 | 3 | 0 | 0.25 | 1 | 0 | No | 0.028 |
|
| 14q21.3 | 8 | 0 | 0.56 | 2 | 0 | No | 1.4×10−4 |
|
| 15q11.2 | 26 | 0 | 0.64 | 2 | 0 | Yes | 1.3×10−11 |
|
| ||||||||
|
| ||||||||
|
| 16p13 | 3 | 0 | 0.40 | 3 | 0 | No | 4.8×10−3 |
|
| ||||||||
|
| ||||||||
|
| ||||||||
|
| 17q22 | 6 | 0 | 0.50 | 2 | 0 | No | 8.0×10−4 |
|
| 19p13 | 3 | 0 | 0.40 | 8 | 0 | No | 5.5×10−5 |
|
|
Predicted event sizes in bps for unrelated AGRE* and ACC# cases are as follows: - 2317*, 12464*, 19146*, 19145#; - 44644*, 577691*, 1464677*, 44643#; - 19979*, 152437*, 241327*, 373015*, 439406*, 134010#, 161199#, 256373#, 533842#; - 14126*, 46413*, 113282*, 24100#; - 2192*, 1389749*, 1389748#; - 36354*, 36902*,157321*, 36901#, 67160#; - 33473*, 37226*, 1515817*, 43966#; – 21993*, 21993*, 21993*, 123569#; - 11410*, 19223*, 84727*, 159861#; – 19651*, 23292*, 57714*, 58528*, 122985*, 131623*, 150178*, 226468*, 194601#, 288518#; – 591007* in 26 families, 24941# and 926360#; - 40468*, 81127*, 88373*, 82786#, 82786#, 124947#; – 10102*, 10102*, 16897*, 18532*, 22806*, 34235*, 29600#, 28360#; / - 100187*, 171989*, 187147*, 98264#, 103788#, 277715#, 280201#, 292525#, 294446#, 344224#, 384324#.
For eDels at a given locus, the ratio of unaffected carriers (siblings or parents) to total number of carriers (cases and family members).
The significant difference in CNV frequency between AGRE and ACC cases (p = 2.6×10−6), along with multiple instances of similar variation in the DGV (see Tables S4 and Table S5), suggests that additional factors – including some potentially unrelated to diagnosis – may be relevant here. Sparse SNP coverage along with regional complexity (large segmental duplications) is also likely to increase false positive and false negatives at this locus. Replication data (and corresponding p Value) is for OR4N4, as only one eDel at either LOC650137 or OR4M2 was observed amongst ACC cases.
A comparable number of eDels were observed at multiple neighboring genes; carrier fraction corresponds to FLYWCH1, the lowest observed at this locus.
Joint consideration of eDels (n = 8) and eDups (n = 4) at BZRAP1 further improves statistical support for this locus (p = 2.3×10−5).
Note extreme telomeric position of this locus which may undermine/interfere with reliable calling of structural variants. CNV counts and carrier fraction corresponds to MADCAM1; fewer variants were observed at C19orf19 amongst ACC cases and carrier fraction was higher than that for MADCAM1.
Figure 3Observed replication unlikely to be attributable to chance alone.
We performed 10,000 phenotype permutation trials on replication data and determined for each the number of loci harboring CNVs in cases but not controls. Thus, within each trial, the number of loci absent from controls in the replication cohort was determined. None of the permutation trials generated as many case-specific loci as observed in our actual dataset (n = 14; p<0.0001).
Figure 4Exonic deletions, although enriched in cases versus controls, show imperfect segregation with disease in multiplex families.
Pedigrees for representative AGRE families harboring exonic deletions in BZRAP1 (A,B), kb), NRXN1 (C,D), and MDGA2 (E,F) are illustrated. Red filled circles correspond to exonic deletions. Black stars (upper right) highlight individuals for which CNV calls were not obtained (not genotyped or failing to meet criteria for quality control).