| Literature DB >> 21841781 |
Gregory M Cooper1, Bradley P Coe, Santhosh Girirajan, Jill A Rosenfeld, Tiffany H Vu, Carl Baker, Charles Williams, Heather Stalker, Rizwan Hamid, Vickie Hannig, Hoda Abdel-Hamid, Patricia Bader, Elizabeth McCracken, Dmitriy Niyazov, Kathleen Leppig, Heidi Thiese, Marybeth Hummel, Nora Alexander, Jerome Gorski, Jennifer Kussmann, Vandana Shashi, Krys Johnson, Catherine Rehder, Blake C Ballif, Lisa G Shaffer, Evan E Eichler.
Abstract
To understand the genetic heterogeneity underlying developmental delay, we compared copy number variants (CNVs) in 15,767 children with intellectual disability and various congenital defects (cases) to CNVs in 8,329 unaffected adult controls. We estimate that ∼14.2% of disease in these children is caused by CNVs >400 kb. We observed a greater enrichment of CNVs in individuals with craniofacial anomalies and cardiovascular defects compared to those with epilepsy or autism. We identified 59 pathogenic CNVs, including 14 new or previously weakly supported candidates, refined the critical interval for several genomic disorders, such as the 17q21.31 microdeletion syndrome, and identified 940 candidate dosage-sensitive genes. We also developed methods to opportunistically discover small, disruptive CNVs within the large and growing diagnostic array datasets. This evolving CNV morbidity map, combined with exome and genome sequencing, will be critical for deciphering the genetic basis of developmental delay, intellectual disability and autism spectrum disorders.Entities:
Mesh:
Year: 2011 PMID: 21841781 PMCID: PMC3171215 DOI: 10.1038/ng.909
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1CNV size distributions in affected and unaffected individuals
The population frequency of the largest CNV in a sample is displayed as a survivor function with the proportion of samples carrying a CNV of a given size displayed as a curve, with 95% confidence intervals indicated by dotted lines. (A) The distribution of large CNVs in the Signature set (filtered to only contain events detectable by the Illumina 550K array) versus our control population (downsampled to only events detectable by the Signature 97K array) is indicated for the overall population. After corrections for different array densities, we observed a >13.5% increase in CNV burden beyond 500 kbp in cases with a proportion of the burden representing potentially novel loci. (B) We also performed a similar analysis on subphenotypes; in this analysis, we included all Signature CNVs in conjunction with downsampled control CNVs as we are highlighting interphenotype differences rather than case versus control frequencies. This is demonstrated here for the autism, cardiovascular and craniofacial phenotypes, which represent fairly distinct sample sets and show an increased burden for the cardiovascular and craniofacial phenotypes, even after exclusion of karyotypically visible (>10 Mbp) events.
Frequency of known genomic disorders in cases and controls.
| Deletions (<10 Mbp) | Duplications (<10 Mbp) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||||
| chr | start | end | Deletion | Cases | Control | Penetrance | Duplication | Cases | Control | Penetrance | ||
| chr1 | 0.00 | 10.00 | 1p36 deletion syndrome ( | 79 | 0 | 2.62E-15 | 1.00 | 1p36 duplication ( | 16 | 1 | 0.0074 | 0.94 |
| chr1 | 144.00 | 144.34 | TAR deletion ( | 13 | 2 | 0.0659 | 0.87 | 1q21.1 duplication ( | 25 | 6 | 0.0511 | 0.81 |
| chr1 | 145.04 | 145.86 | 1q21.1 deletion ( | 47 | 2 | 3.28E-07 | 0.96 | 1q21.1 duplication ( | 26 | 1 | 0.0002 | 0.96 |
| chr2 | 96.09 | 97.04 | 2q11.2 deletion ( | 2 | 0 | 0.4282 | 1.00 | 2q11.2 duplication ( | 1 | 0 | 0.6543 | 1.00 |
| chr2 | 100.06 | 107.81 | 2q11.2q13 deletion ( | 0 | 0 | 1.0000 | NA | 2q11.2q13 duplication ( | 2 | 0 | 0.4282 | 1.00 |
| chr2 | 110.18 | 110.34 | 2q13 deletion ( | 78 | 30 | 0.0813 | 0.72 | 2q13 duplication ( | 118 | 32 | 0.0003 | 0.79 |
| chr2 | 239.37 | 242.12 | 2q37 deletion ( | 22 | 0 | 0.0001 | 1.00 | 2q37 duplication ( | 0 | 0 | 1.0000 | NA |
| chr3 | 197.23 | 198.84 | 3q29 deletion ( | 6 | 0 | 0.0785 | 1.00 | 3q29 duplication ( | 4 | 0 | 0.1833 | 1.00 |
| chr4 | 1.84 | 1.98 | Wolf-Hirschhorn deletion ( | 21 | 0 | 0.0001 | 1.00 | Wolf-Hirschhorn region duplication | 7 | 0 | 0.0513 | 1.00 |
| chr5 | 175.65 | 176.99 | Sotos syndrome deletion ( | 8 | 0 | 0.0336 | 1.00 | 5q35 duplication ( | 0 | 0 | 1.0000 | NA |
| chr6 | 100.92 | 101.05 | 6q16 deletion ( | 1 | 0 | 0.6543 | 1.00 | 6q16 duplication ( | 1 | 0 | 0.6543 | 1.00 |
| chr7 | 72.38 | 73.78 | Williams syndrome deletion ( | 42 | 0 | 1.80E-08 | 1.00 | Williams syndrome duplication ( | 16 | 0 | 0.0011 | 1.00 |
| chr7 | 74.80 | 76.50 | WBS-distal deletion ( | 2 | 0 | 0.4282 | 1.00 | WBS-distal duplication ( | 0 | 0 | 1.0000 | NA |
| chr8 | 8.13 | 11.93 | 8p23.1 deletion ( | 7 | 0 | 0.0513 | 1.00 | 8p23.1 duplication ( | 7 | 0 | 0.0513 | 1.00 |
| chr9 | 136.95 | 140.20 | 9q34 deletion ( | 60 | 0 | 8.54E-12 | 1.00 | 9q34 duplication ( | 4 | 0 | 0.1833 | 1.00 |
| chr10 | 81.95 | 88.79 | 10q23 deletion ( | 8 | 0 | 0.0336 | 1.00 | 10q23 duplication ( | 1 | 0 | 0.6543 | 1.00 |
| chr11 | 43.94 | 46.02 | Potocki-Shaffer syndrome ( | 5 | 0 | 0.1199 | 1.00 | 11p11.2 duplication ( | 0 | 0 | 1.0000 | NA |
| chr11 | 67.51 | 70.96 | SHANK2 FGFs deletion | 1 | 0 | 0.6543 | 1.00 | 0 | 0 | 1.0000 | NA | |
| chr12 | 63.36 | 66.93 | 12q14 deletion syndrome (GRIP1, HMGA2) | 2 | 0 | 0.4282 | 1.00 | 12q14 duplication ( | 0 | 0 | 1.0000 | NA |
| chr13 | 19.71 | 19.91 | 13q12 deletion ( | 14 | 12 | 0.9240 | 0.54 | 13q12 duplication ( | 4 | 0 | 0.1833 | 1.00 |
| chr15 | 20.35 | 20.64 | 15q11.2 deletion ( | 94 | 19 | 2.13E-05 | 0.83 | 15q11.2 duplication ( | 64 | 36 | 0.6614 | 0.64 |
| chr15 | 22.37 | 26.10 | Prader-Willi/Angelman | 16 | 0 | 0.0011 | 1.00 | Prader-Willi/Angelman region duplication | 27 | 0 | 1.06E-05 | 1.00 |
| chr15 | 28.92 | 30.27 | 15q13.3 deletion ( | 42 | 0 | 1.8E-08 | 1.00 | 15q13.3 duplication ( | 20 | 3 | 0.0200 | 0.87 |
| chr15 | 70.70 | 72.20 | 15q24 BP0-BP1 deletion ( | 4 | 0 | 0.1833 | 1.00 | 15q24 BP0-BP1 duplication ( | 1 | 0 | 0.6543 | 1.00 |
| chr15 | 70.70 | 73.58 | 15q24 BP0-BP1 ( | 4 | 0 | 0.1833 | 1.00 | 15q24 BP0-BP1 ( | 4 | 0 | 0.1833 | 1.00 |
| chr15 | 73.76 | 75.99 | 15q24 BP2-BP3 deletion ( | 1 | 0 | 0.6543 | 1.00 | 15q24 BP2-BP3 duplication ( | 0 | 0 | 1.0000 | NA |
| chr15 | 80.98 | 82.53 | 15q25.2 deletion ( | 1 | 0 | 0.6543 | 1.00 | 15q25.2 duplication ( | 0 | 0 | 1.0000 | NA |
| chr15 | 97.18 | 100.34 | None | 10 | 1 | 0.0641 | 0.91 | None | 1 | 0 | 0.6543 | 1.00 |
| chr16 | 3.72 | 3.80 | Rubinstein-Taybi Syndrome | 7 | 0 | 0.0513 | 1.00 | Rubinstein-Taybi region duplication | 6 | 0 | 0.0785 | 1.00 |
| chr16 | 15.41 | 16.20 | 16p13.11 deletion ( | 18 | 3 | 0.0361 | 0.86 | 16p13.11 duplication ( | 24 | 10 | 0.3315 | 0.71 |
| chr16 | 21.26 | 29.35 | 16p11.2p12.1 deletion | 2 | 0 | 0.4282 | 1.00 | 16p11.2p12.1 duplication | 2 | 0 | 0.4282 | 1.00 |
| chr16 | 21.85 | 22.37 | 16p12.1 deletion ( | 37 | 3 | 0.0001 | 0.93 | 16p12.1 duplication ( | 4 | 1 | 0.4368 | 0.80 |
| chr16 | 28.68 | 29.02 | 16p11.2 distal deletion ( | 15 | 1 | 0.0107 | 0.94 | 16p11.2 distal duplication ( | 14 | 2 | 0.0484 | 0.88 |
| chr16 | 29.56 | 30.11 | 16p11.2 deletion ( | 64 | 3 | 3.39E-09 | 0.96 | 16p11.2 duplication ( | 28 | 2 | 0.0004 | 0.93 |
| chr17 | 0.05 | 2.54 | 17p13.3 deletion (both YWHAE and PAFAH1B1) | 7 | 0 | 0.0513 | 1.00 | 17p13.3 duplication (both | 2 | 0 | 0.4282 | 1.00 |
| chr17 | 0.50 | 1.30 | 17p13.3 deletion (including PAFAH1B1) | 8 | 0 | 0.0336 | 1.00 | 17p13.3 duplication (including | 6 | 0 | 0.0785 | 1.00 |
| chr17 | 2.31 | 2.87 | 17p13.3 deletion (including YWHAE) | 7 | 0 | 0.0513 | 1.00 | 17p13.3 duplication (including | 4 | 0 | 0.1833 | 1.00 |
| chr17 | 14.01 | 15.44 | HNPP ( | 3 | 0 | 0.2801 | 1.00 | CMT1A ( | 9 | 2 | 0.2086 | 0.82 |
| chr17 | 16.65 | 20.42 | Smith-Magenis syndrome deletion | 16 | 0 | 0.0011 | 1.00 | Potocki-Lupski syndrome | 9 | 0 | 0.0220 | 1.00 |
| chr17 | 26.19 | 27.24 | NF1 deletion syndrome | 5 | 0 | 0.1199 | 1.00 | NF1 duplication | 2 | 0 | 0.4282 | 1.00 |
| chr17 | 31.89 | 33.28 | RCAD (renal cysts and diabetes) ( | 14 | 2 | 0.0484 | 0.88 | 17q12 duplication | 18 | 3 | 0.0361 | 0.86 |
| chr17 | 41.06 | 41.54 | 17q21.31 deletion ( | 23 | 0 | 0.0001 | 1.00 | 17q21.31 duplication ( | 2 | 0 | 0.4282 | 1.00 |
| chr22 | 17.40 | 18.67 | DiGeorge/VCFS deletion | 96 | 0 | 0.0000 | 1.00 | 22q11.2 duplication | 50 | 5 | 1.26E-05 | 0.91 |
| chr22 | 20.24 | 21.98 | 22q11.2 distal deletion ( | 13 | 0 | 0.0040 | 1.00 | 22q11.2 distal duplication ( | 7 | 0 | 0.0513 | 1.00 |
| chr22 | 49.46 | 49.52 | Phelan-McDermid syndrome deletion ( | 45 | 0 | 0.0000 | 1.00 | 22q13 duplication ( | 7 | 0 | 0.0513 | 1.00 |
All coordinates are according to build36. The genes in parentheses are potential candidate genes and identifiers of the genomic locations.
Rearrangements not mediated by segmental duplications; VCFS – velocardiofacial syndrome, WBS – Williams-Beuren syndrome, HNPP – hereditary neuropathy with liability to pressure palsies, CMT1A – Charcot-Marie-Tooth disease type 1A. No CNVs were identified in 2p15p16.1 (VRK2), 15q24 (BP1-BP2) (CLK3), 15q24 (SIN3A), 17q23 (TUBD1), and 17q23.1q23.2 (TBX2, TBX4). Note that a single CNV may encompass more than one genomic disorder.
Figure 2Maps of CNV locations for chromosomes 15 (top) and 17 (bottom)
CNVs (>400 kbp) in affected individuals are shown in the upper portion for each chromosome with control CNVs shown in the lower portion. Disease enrichment p-values are plotted just below the control CNV maps, computed in 200 kbp windows along each chromosome (step size of 50 kbp). Deletions and duplications are red and blue, respectively, with the p-value wiggle plots colored accordingly and plotted on a negative log scale. In the middle of each plot, chromosomal features are colored as depicted. Significantly enriched regions are numbered and named on the right-hand side.
Novel, potentially pathogenic loci identified by sliding window analysis.
| Chr | Start (Mb) | End (Mb) | Size (Mb) | CNV | Cases (adjusted) | Controls (adjusted) | Description | Ethnicity | |
|---|---|---|---|---|---|---|---|---|---|
| chr2 | 111.05 | 112.95 | 1.9 | del | 0.006 (0.032) | 12 (12) | 0 (1) | 2q13 | 10C,1A |
| chr10 | 81.6 | 88.9 | 7.3 | del | 0.014 (0.064) | 10 (10) | 0 (1) | 10q23.1 | 6C,1O |
| chr2 | 45.2 | 45.9 | 0.7 | dup | 0.022 (0.022) | 9 (9) | 0 (0) | 2p21 | 8C |
| chr2 | 111.05 | 112.85 | 1.8 | dup | 0.034 (0.022) | 8 (9) | 0 (0) | 2q13 | 5C,2O |
| chr4 | 9.45 | 10.45 | 1 | dup | 0.034 (0.051) | 8 (7) | 0 (0) | 4p16.1 | 6C,1A,1O |
| chr4 | 81.95 | 83.35 | 1.4 | del | 0.034 (0.034) | 8 (8) | 0 (0) | 4q21.21 - q21.22 | 6C,1A |
| chr2 | 3.25 | 3.45 | 0.2 | dup | 0.051 (0.051) | 7 (7) | 0 (0) | 2p25.3 | 3C,1O |
| chr2 | 165.4 | 166.1 | 0.7 | del | 0.051 (0.051) | 7 (7) | 0 (0) | 2q24.3 | 5C,1O |
| chr21 | 19.95 | 20.25 | 0.3 | del | 0.051 (0.079) | 7 (6) | 0 (0) | 21q21.1 | 1C,1A,2O |
| chr8 | 53.45 | 54.05 | 0.6 | dup | 0.051 (0.051) | 7 (7) | 0 (0) | 8q11.23 | 6C,1O |
| chr1 | 170 | 170.6 | 0.6 | del | 0.079 (0.079) | 6 (6) | 0 (0) | 1q24.3 | 5C |
| chr12 | 8.05 | 8.25 | 0.2 | dup | 0.079 (0.051) | 6 (7) | 0 (0) | 12p13.31 | 6C |
| chr15 | 82.9 | 83.6 | 0.7 | del | 0.079 (0.12) | 6 (5) | 0 (0) | 15q25 | 1C,2A,2O |
| chr6 | 20.85 | 21.25 | 0.4 | del | 0.079 (0.079) | 6 (6) | 0 (0) | 6p22.3 | 1E,1A,1O |
The counts and p-values are based on the single most significant 200 kb window, while the 'adjusted' counts include all samples with a CNV overlapping the region but exclude all related samples (see Supplementary Table 7).
C – Caucasian (primarily European descent), A - African-American, O – other.
Previously described loci[16,50] with uncertain pathogenicity
Hotspot regions.
Figure 3Discovery of novel microdeletions associated with genomic disorders
(A) A novel microdeletion on chromosome 15q25.2q25.3. Array CGH analysis for three individuals with a 660 kbp (chr15:82,889,423–83,552,890) deletion is shown. This microdeletion maps within a genomic hotspot flanked by high-identity SD blocks. Intrachromosomal SDs of high similarity relevant to this hotspot region are depicted as red (69.8 kbp, 98.6% identity) and green (17.6 kbp, 98.6% identity) block arrows. Note that the directly orientated SDs (red block arrows) likely mediate the underlying 15q25 rearrangements by non-allelic homologous recombination (NAHR). This region also contains a 60 kbp (chr15:82,775,465–82,835,495) gap in the current builds (build36 and build37) of the reference genome assembly. (B) Atypical 17q21.31 microdeletions refine critical interval genes. High-density array CGH for the 17q21.31 microdeletion region is shown for three individuals. Probes with log2 ratios below a threshold of 1.5 standard deviations from the normalized mean log2 ratio denote deletions (red). The typical deletions (top panel) were identified in 23 individuals while atypical deletions were identified in three individuals. Note that the smallest deletion (blue box) refines the phenotype-associated critical region (chr17:41,356,798–41,631,306) to encompass only five RefSeq genes. (C) Photographs of two individuals (9888884 and 648) with atypical deletions are shown. Patient #9888884 is a 5-year-old female child with clinical features typical of 17q21.31 microdeletion syndrome and includes distinctive dysmorphic features with a bulbous nasal tip, upslanting and almond-shaped palpebral fissures, long face, strabismus, epicanthal folds, and prominent ears; DD with limited speech; hypotonia in infancy; and a friendly disposition. Additional features are low birth weight, short stature, microcephaly, long fingers, and heart defects. She also presented with postaxial polysyndactyly, neonatal cholestasis, resolved leucopenia, dry skin with some hyperpigmented lesions, and an anteriorly split tongue. Patient #648 is 9-year-old male child and has a clinical history of generalized hypotonia, seizures, autism, mental retardation, motor DD, and dysmorphic features consistent with the 17q21.31 microdeletion syndrome (epicanthal folds; ptosis; long, pear-shaped nose; long, tapering fingers). Informed consent was obtained to publish the photographs.
Validation of smaller deletions.
| Chrom | Start | Stop | Gene | Confirmation | Identical BP |
|---|---|---|---|---|---|
| Tier 1 | |||||
| chr12 | 113316929 | 113317081 | 3 of 4 | Ambiguous | |
| chr1 | 40001351 | 40013297 | 6 of 6 | Ambiguous | |
| chr1 | 233932670 | 233932900 | 6 of 6 | Yes | |
| chr12 | 12868741 | 12873755 | 6 of 6 | Yes | |
| chr11 | 43729037 | 43732247 | 6 of 6 | Yes | |
| chr20 | 45205105 | 45205194 | 6 of 6 | Yes | |
| chr13 | 21173329 | 21173574 | 4 of 6 | Yes | |
| chr6 | 162314324 | 162314439 | 6 of 6 | No | |
| chr9 | 93525765 | 93527210 | 6 of 6 | No | |
| chr1 | 166548570 | 166548864 | 6 of 6 | Yes | |
| Tier 2 | |||||
| chr18 | 148699 | 148714 | 3 of 4 | Yes | |
| chr2 | 166518441 | 166518461 | 0 of 5 | NA | |
| chr10 | 26889040 | 26896423 | 2 of 3 | No | |
| chr4 | 110114972 | 110115164 | 4 of 5 | Yes | |
| chr4 | 77301890 | 77308653 | 2 of 4 | Yes | |
| chr9 | 883912 | 884195 | 5 of 5 | Yes | |
| chr12 | 31835960 | 31836367 | 4 of 4 | Yes | |
| chr13 | 97907423 | 97907559 | 0 of 4 | NA | |
| chr9 | 86546627 | 86546662 | 5 of 5 | Yes | |
Exon-altering variants.
Five samples harbor a non-exonic copy number polymorphism; one sample has a unique, exon-altering deletion.
Overlaps neighboring gene: FAM47D. Note that annotations are based on the UCSC gene model and not RefSeq genes. BP: breakpoints.
Figure 4Discovery of novel, exon-altering CNVs using the Signature CGH data
(A) For each coding exon (red bar), the three probes (black rectangles) nearest the exon for any given individual are used to define a cassette score. (B) Distribution of cassette intensities for exon 6 of PARK2 are sorted from lowest to highest (measured in standard deviations, Y-axis) across all samples (X-axis). Red points correspond to known, large deletion events that span the exon. (C) Validation results for the most strongly negative samples from (B) not previously known to carry deletions. Log2 ratio values (Y-axis for each individual row) for PARK2 (coordinates on the X-axis) in each of six tested samples are shown. Probes with very low intensities (< −0.5) are colored red, while those with moderately low values (< −0.3) are gray. Locations of PARK2 exons and probes on two of the most commonly used original oligonucleotide arrays are shown at the top.