| Literature DB >> 28260531 |
Ryan L Collins1,2,3, Harrison Brand1,3, Claire E Redin1,3, Carrie Hanscom1,3, Caroline Antolik1,3, Matthew R Stone1,3, Joseph T Glessner1,3, Tamara Mason3, Giulia Pregno4, Naghmeh Dorrani5, Giorgia Mandrile4, Daniela Giachino4, Danielle Perrin3, Cole Walsh3, Michelle Cipicchio3, Maura Costello3, Alexei Stortchevoi1,3, Joon-Yong An6, Benjamin B Currall1,3, Catarina M Seabra1,3,7, Ashok Ragavendran1,3, Lauren Margolin3, Julian A Martinez-Agosto5, Diane Lucente1, Brynn Levy8, Stephan J Sanders6, Ronald J Wapner9, Fabiola Quintero-Rivera5, Wigard Kloosterman10, Michael E Talkowski11,12,13.
Abstract
BACKGROUND: Structural variation (SV) influences genome organization and contributes to human disease. However, the complete mutational spectrum of SV has not been routinely captured in disease association studies.Entities:
Keywords: Autism; Chromoanagenesis; Chromothripsis; Complex chromosomal rearrangement; Copynumber variation; Germline mutation; Inversion; Neurodevelopmental disorders; Structural variation; Whole-genome sequencing
Mesh:
Year: 2017 PMID: 28260531 PMCID: PMC5338099 DOI: 10.1186/s13059-017-1158-6
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1The diverse landscape of SV in participants with ASD and other developmental disorders. We sequenced the genomes of 689 participants with ASD and other developmental disorders. a Physical coverage and (b) median insert size of liWGS libraries. c Count and distributions of large SV detected by liWGS (Additional file 1). d Distribution of SVs per participant by SV class. e Density plots of SV sizes by class. Characteristic Alu and L1 peaks are absent due to the resolution of liWGS (> ~ 5 kb) being larger than most mobile element insertions. f Cumulative distributions of SV frequencies by class. Singletons (single observation among all 686 samples) are marked with an arrow. Rare SVs are defined as those with variant frequency (VF) < 1%
Fig. 2Classifying 16 recurrent subclasses of large, complex SVs in the human genome. At liWGS resolution, we identified 16 recurrent classes of cxSV, defined here as non-canonical rearrangements involving two or more distinct SV signatures or at least three linked breakpoints. We validated 97.4% (150/154) of all cxSV sites assessed by at least one assay. Each participant harbored a median of 14 cxSVs at liWGS resolution (range: 6–23 cxSVs per participant). We identified 289 distinct cxSVs across 686 participants, totaling 9666 cxSV observations. Each row represents a subclass of cxSV, with columns representing the subclass abbreviation, number of distinct variants discovered, validation rate, total number of observed variants across all participants, the percentage of participants that were found to harbor at least one such variant in their genome, the median size of all variants in that subclass, each subcomponent SV signature that comprises the class, a linear schematic of each class of cxSV, and a simulated example of the copy-number profile as would be observed by chromosomal microarray or WGS
Fig. 3liWGS and lrWGS resolved a de novo gene-disrupting cxSV that was cryptic to standard siWGS. We performed lrWGS from 10X Genomics (Pleasanton, CA, USA) as a method of orthogonal validation for three large complex SVs detected by liWGS, two of which failed to fully validate by traditional methods. One notable example is shown here; the other two are provided in Additional file 2: Figures S4 and S5. a A de novo complex reciprocal translocation with three breakpoints between chromosomes 2 (pink) and 6 (green) was discovered by liWGS in a participant with ASD and predicted to result in LoF of PARK2 and CAMKMT. However, two of three breakpoints (breakpoints #1 and #3; orange) were not detectable by siWGS. b lrWGS heatmaps from Loupe software [113] analysis of lrWGS data showed clear evidence for each of the three SV breakpoints. c lrWGS resolved and phased all three breakpoints, including both breakpoints that failed molecular validation due to low-complexity repetitive sequence (blue), which were resolved by spanning the low-complexity sequence with 28 liWGS reads and 30 lrWGS molecules at breakpoint #1 and 12 liWGS reads and 41 lrWGS molecules at breakpoint #3
Fig. 4Rare SVs are enriched for hallmarks of deleterious biological outcomes. Comparing all rare (VF < 1%) and common (VF > 1%) SVs discovered in this cohort revealed differences in their respective functional annotations (Additional file 2: Table S2). a Rare SVs were larger on average than common SVs [1]. b Rare SVs were more likely than common SVs to disrupt genes, particularly when the disruption was predicted to result in LoF. Rare SVs were also more likely than common SVs to result in disruption of promoters [112, 114], enhancers [112, 114], and TAD boundaries [110]. c Genes predicted to harbor at least one LoF mutation due to a rare SV were enriched in many subcategories when compared to common SV, including genes predicted to be constrained against truncating mutations in healthy individuals (Constrained) [65, 66], genes predicted to be intolerant of functional variation in healthy individuals (Intolerant) [67], genes with significant burdens of exonic deletions in NDD cases versus healthy controls (NDD ExDels) [38], genes associated with an autosomal dominant disorder (Autosomal Dom.) [68, 69], and genes with at least one pathogenic variant reported in ClinVar (Disease Assoc.) [70] (Additional file 2: Table S3)
Characteristics of chromoanagenesis classes
|
|
|
| |
|---|---|---|---|
| Mutational event | Single | Single or multiple | Single or multiple |
| Chromosomes | Few (1–4) | Few (usually 1) | Many (usually ≥ 4) |
| Breakpoints | Many (≥5; sometimes > 25) | Fewer (usually 5–25) | Fewer (usually 5–25) |
| Breakpoint distribution | Clustered | Clustered | Interspersed (usually in active chromatin) |
| Breakpoint signature | Blunt ends | Microhomology | Blunt ends |
| Dosage alteration |
| Unbalanced (predominantly copy gain) | Mainly balanced (occasional deletion bridges) |
| Proposed mechanism | Micronucleus missegregation + chromosome pulverization + NHEJ | Micronucleus missegregation + chromosome pulverization + MMBIR/FoSTeS | Multiple DSBs during active transcription + NHEJ |
| Proposed parent-of-origin bias | Paternal | None | None |
| Proposed transmission bias | Maternal | None | None |
| Germline reports | 43 | 10 | 6 |
| Case:Control | 39:4 | 10:0 | 6:0 |
| References | [ | [ | [ |
Fig. 5Extreme chromoanagenesis manifests by multiple mutational mechanisms in three participants with developmental anomalies. We applied WGS to resolve microscopically visible cxSVs in three unrelated participants with developmental abnormalities. a, b Circos representations of two cases of extreme and largely balanced chromothripsis, involving > 40 breakpoints, > 40 Mb, and > 12 genes across four chromosomes [9, 115]. Points plotted around the inner ring represented estimated copy number alterations; deletions are highlighted in red. Links represent non-reference junctions on derivative chromosomes. c Circos representation of a somatic mosaic chromoanasynthesis event of chromosome 19 [115]. Duplications are shaded in blue and interspersed duplications are designated by shaded ribbons leading from the duplicated sequence to their insertion site. d CMA and WGS analysis of the mosaic chromoanasynthesis from panel c (participant TL009) revealed all nine CNVs involved in the rearrangement to have arisen on the maternal homologue and that 6/8 duplications were apparently mosaic (2.57 ± 0.02 copies, 95% CI; median coverage shown in yellow; yellow shading indicates 95% CI). Surprisingly, 2/8 duplications (outlined in teal) exhibited significantly greater copy numbers than the other six (p = 9.18 × 10–8), were linked by an underlying interstitial inversion and appeared to represent approximately three copies, suggesting this rearrangement might have originated as a de novo dupINVdup cxSV in the maternal germline (Additional file 2: Figure S7)