| Literature DB >> 31610793 |
Wentao Shi1,2, Andrea Massaia1,3, Sandra Louzada1, Juliet Handsaker1, William Chow1, Shane McCarthy1,4, Joanna Collins1, Pille Hallast1,5, Kerstin Howe1, Deanna M Church6,7, Fengtang Yang1, Yali Xue8, Chris Tyler-Smith9.
Abstract
BACKGROUND: Large palindromes (inverted repeats) make up substantial proportions of mammalian sex chromosomes, often contain genes, and have high rates of structural variation arising via ectopic recombination. As a result, they underlie many genomic disorders. Maintenance of the palindromic structure by gene conversion between the arms has been documented, but over longer time periods, palindromes are remarkably labile. Mechanisms of origin and loss of palindromes have, however, received little attention.Entities:
Keywords: Copy number variation; Inverted repeat; Sex chromosome; Structural variation; Variable Charge Y gene
Mesh:
Substances:
Year: 2019 PMID: 31610793 PMCID: PMC6790999 DOI: 10.1186/s13059-019-1816-y
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Characteristics of structural rearrangements involving palindrome P8
| Cell line | VCY CN | Hg | Rearrangement summary | Novel junction structure | Data types | Similar samples | Possible mechanisms |
|---|---|---|---|---|---|---|---|
| HG00742 | 1 | R1b-L11 | 37.8-kb deletion removing one P8 arm | 14-bp insertion | Fiber-FISH 10x | NHEJ | |
| HG01781 | 3 | J2a-M410 | 31.5-kb inverted duplication expanding P8 to 113.8 kb | Recombination within 92-bp duplicated region | Fiber-FISH 10x | HG01991 | NAHR or BIR |
| HG02390 | 3 | O2-K18 | 191.3-kb tandem duplication distal to P8 | Recombination within 2-bp microhomology region | Fiber-FISH 10x | HG01031, HG02401, HG00982 | MMBIR |
| HG01377 | 4 | E1b-M35 | Complex formation of additional 436.7-kb new palindrome | Recombination within 5-bp microhomology region | Fiber-FISH 10x | MMBIR | |
| HG01097 | 3 | G-M201 | 158.1-kb tandem duplication distal to P8 | Recombination within 6-bp microhomology region | Fiber-FISH 10x | MMBIR | |
| HG04131 | 3 | R1a-Z93 | 70.1-kb tandem duplication distal to P8 | Complex recombination within 5.1-kb duplicated region | Fiber-FISH 10x | NAHR or BIR | |
| NA18953 | 3 | O2b-M176 | 60.5-kb tandem duplication proximal to P8 | Recombination within 5.1-kb duplicated region | Fiber-FISH 10x | NAHR or BIR | |
| HG00707* | 3 | O3-M122 | 60.5-kb tandem duplication proximal to P8 | – | Fiber-FISH | – |
Abbreviations: CN copy number, Hg haplogroup (haplogroup defined from previous work [16]), NHEJ non-homologous end joining, NAHR non-allelic homologous recombination, BIR break-induced replication, MMBIR microhomology-mediated break-induced replication
*HG00707 showed a similar fiber-FISH structure to NA18953 but was not sequenced using 10x Genomics Linked-Read sequencing
Fig. 1Palindrome death. a Schematic representation of the reference sequence for P8 and its surrounding regions. The inverted orange arrows show P8, with the VCY genes indicated by blue boxes. The remaining colored arrows represent sections of the flanking sequences that facilitate interpretations of the rearranged structures shown in other figures. b Representative fiber-FISH image of HG000096, showing the reference sequence for palindrome P8 and its surrounding regions. Seven PCR probes (1–7) together with the BAC clone RP11-53 K10 (blue) produce the fiber-FISH signals shown schematically above the fiber-FISH image. c Corresponding fiber-FISH image for HG00742 carrying a deletion of one P8 arm and the central spacer and lacking one red and green signal. d Representation of the location of the deleted section on the reference sequence (top) and resulting deleted structure (bottom). The small arrows show the location of the breakpoint PCR fragment. e 10x Linked-Read sequencing results represented as a Loupe file. The green axes show the read depth, and the intensity of the heat-map the extent of barcode sharing. Read depth is reduced across P8 due to deletion of the region between the black arrows and dispersal of the remaining reads across both palindrome arms. f Breakpoint PCR showing amplification in the deleted sample HG00742, but not in HG03006 with the reference structure. g Sanger sequence of the breakpoint showing (top) two regions of the reference sequence with nucleotides present in the breakpoint sequence shown in upper case blue and nucleotides absent in lower case black; (center) the sequence trace; and (bottom) nucleotides matching the reference sequence in upper case blue and novel nucleotides in lower case orange
Fig. 2Palindrome expansion. a–e of Figs. 2, 3, and 4 show results from samples with different structural variants that correspond to sections c–g of Fig. 1. a, b Fiber-FISH image for HG01781 showing an additional copy of the short red and green+pink signals, and genomic interpretation. c Loupe file showing increased read depth in distal P8 and the distal flanking sequence, with a junction at the location of the black arrow. d Breakpoint PCR amplifies a fragment in HG01781, but not in HG03006. e The top and bottom sections show two regions of the reference sequence with a 92-bp repeated sequence indicated in upper case blue, except for differences between the two copies which are in orange and purple, respectively. The middle section shows the breakpoint sequence trace revealing recombination between the two 92-bp repeats
Fig. 3Tandem duplication of part of a palindrome. a, b Fiber-FISH image for HG02390 showing an additional copy of the short red, green+pink, and long red signals, and genomic structure. c Loupe file showing increased read depth in distal P8 and a long section of distal flanking sequence, with a junction at the location of the black arrow. d Breakpoint PCR amplifies a fragment in HG02390, but not in HG03006. e The top and bottom sections show two regions of the reference sequence with sections detected in the breakpoint PCR sequence in upper case blue. The middle section shows Sanger sequence traces from the breakpoint PCR product, each ending before the breakpoint in a T- or A-stretch. f The middle section shows a 10x Genomics sequencing read spanning the breakpoint and linking the two breakpoint Sanger sequence traces in e, revealing a recombination between the sequences in the upper and lower sections at the orange TC microhomology breakpoint
Fig. 4Palindrome birth. a Fiber-FISH image for HG01377 showing a group of multiple new signals distal to P8. Because of the length spanned by these signals, a composite of three fibers is shown, with the correspondences indicated by lines. b Genomic interpretation, showing that the resulting structure can be accounted for by two duplications, which could be parts of a single mutational event. c Loupe file showing increased read depth in distal P8 and a long section of distal flanking sequence, with a junction at the location of the black arrow. d Breakpoint PCR amplifies a fragment in HG01377, but not in HG03006. e The top and bottom sections show two regions of the reference sequence with sections detected in the breakpoint PCR sequence in upper case blue, or upper case orange for a 5-bp microhomology region at the breakpoint. The middle section shows the Sanger sequence trace from the breakpoint PCR product
Fig. 5Location of VCY copy number variants on the Y-chromosomal phylogeny. A Y-chromosomal phylogeny at the level of the major haplogroups. The branching order is shown, but the line lengths have no meaning. B Clusters of lineages including each of the VCY CNVs identified (brown) and the sample representing the reference sequence (HG03006). These clusters lie within the major haplogroup indicated by the dotted line