| Literature DB >> 24958239 |
Julia Vogt, Kathrin Bengesser, Kathleen B M Claes, Katharina Wimmer, Victor-Felix Mautner, Rick van Minkelen, Eric Legius, Hilde Brems, Meena Upadhyaya, Josef Högel, Conxi Lazaro, Thorsten Rosenbaum, Simone Bammert, Ludwine Messiaen, David N Cooper, Hildegard Kehrer-Sawatzki.
Abstract
BACKGROUND: Genomic disorders are caused by copy number changes that may exhibit recurrent breakpoints processed by nonallelic homologous recombination. However, region-specific disease-associated copy number changes have also been observed which exhibit non-recurrent breakpoints. The mechanisms underlying these non-recurrent copy number changes have not yet been fully elucidated.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24958239 PMCID: PMC4229983 DOI: 10.1186/gb-2014-15-6-r80
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Location of the breakpoints of the 17 atypical deletions. At the top is a schematic representation of the NF1 gene and its flanking regions. The relative positions of the genes located within this region are denoted by horizontal black bars. Below, the extents of the 17 NF1 deletions analyzed are indicated by horizontal bars. The centromeric breakpoints of the deletions depicted by red bars are located within SUZ12P. None of these deletions had telomeric breakpoints located within SUZ12. The deletions depicted by blue bars exhibit breakpoints located within NF1-REPa. Two deletions (grey bars) extended beyond the region indicated here in a centromeric direction (indicated by dotted lines). The patient identification numbers are given on the left. cen, centromere; tel, telomere.
Breakpoint positions and sequence features of the 17 atypical deletions
| 08D2261 | 29,102,848 (30,079,302) | 976,455 | − | – | – | Yes | – | NHEJ | |
| 100206 | 29,065,415 (30,016,354) | 950,940 | – | – | Yes | – | NHEJ/ RBM | ||
| D1008345 | 29,094,424 (30,218,204) | 1,123,781 | 1 | – | – | No | – | NHEJ | |
| D05.2678 | 28,142,439 (34,112,082) | 5,969,644 | 1 | – | – | No | – | NHEJ | |
| R84329 | 29,074,557 (30,223,384) | 1,148,828 | 1 | TGTCCCCTCTG | Yes | – | NHEJ/RBM | ||
| 70969 | 29,092,903 (30,175,393) | 1,082,491 | 1 | GGCCAGGTT | – | No | – | NHEJ/RBM | |
| 619 | 28,946,218 (31,954,580) | 3,008,363 | 2 | GTAGCAGAAT | – | No | NF1REPa ( | – | NHEJ/RBM |
| 61541 | 29,082,032 (30,187,273) | 1,105,242 | 2 | – | Yes | – | NHEJ/RBM | ||
| 2535 | 29,101,686 (30,250,762) | 1,149,077 | 2 | – | – | No | – | NHEJ | |
| R48018 | 29,084,006 (30,241,383) | 1,157,378 | 2 | – | – | No | – | NHEJ | |
| Ak-47055 | 29,082,023 (30,243,011) | 1,160,989 | 4 | – | – | Yes | – | NHEJ | |
| D06.1047 | 29,264,225 (29,783,515) | 519,291 | 6 | – | – | Yes | – | MMEJ/RBM | |
| 659g | 28,948,946 (30,345,260) | 1,396,315 | 20 | – | – | Yes | NF1REPa (NF1REPc) | ||
| 1106 | 29,001,813 (29,765,892) | 764,080 | 24 | – | + | No | NF1REPa ( | ||
| D0801587 | 27,726,501 (29,729,864) | 2,003,364 | 52 | – | – | Yes | |||
| DA-77/grandmother | 29,100,005 (30,101,550) | 1,001,546 | 2 | SVA_F1 element | – | Yesh | – | SVA insertion | |
| ASB4-55 | 29,103,071 (29,969,839) | 866,769 | – | SVA_F element | – | Yes | – | SVA insertion |
aIndicated are the genomic positions of the centromeric breakpoints and, in parentheses, the positions of the telomeric breakpoints (hg19). The indicated genomic positions correspond to the nucleotides immediately before and immediately after the deleted DNA sequence. Where microhomology (100% sequence identity) was present at the deletion breakpoints, the position of the centromeric breakpoint was defined as the last nucleotide adjacent to the region of microhomology whilst the position of the telomeric breakpoint was defined as the first nucleotide adjacent to the region of microhomology.
bIndicated are the numbers of nucleotides exhibiting microhomology at the corresponding breakpoints. Microhomology was defined as one or more perfectly matching basepairs.
cPlus signs indicate single nucleotide changes (SNCs) identified within the breakpoint-flanking regions (±75 bp) of the patients but absent from the reference sequence of the human genome (hg19). Where an SNC was identified within the breakpoint-flanking sequence, a replication-based mechanism (RBM) was considered to have been responsible for causing the SNC as well as the large NF1 deletion. Dashes indicate that SNCs were not detected in the respective breakpoint-flanking sequences.
dSINEs and LINEs spanning the breakpoints were identified by means of the Repeat Masker track of the UCSC Genome Browser (date: 04.11.2013). Indicated are the Alu elements that were located at both breakpoints and in direct orientation with respect to each other. Dashes denote that directly oriented SINEs or LINEs located at both breakpoints and harboring the respective deletion breakpoints at homologous positions were not identified.
eIndicated is the homology between the Alu elements as well as the length of the Alu sequences exhibiting the indicated homology.
fNHEJ, non-homologous end joining; MMEJ, microhomology-mediated end joining; Alu-mediated NAHR,Alu-mediated nonallelic homologous end joining; RBM, replication-based mechanisms such as fork stalling and template-switching (FoSTeS) and microhomology-mediated break-induced replication (MMBIR). SVA insertion, SVA insertion-associated deletion.
The deletions in patients 100206 and 61541 may have been mediated by either NHEJ or RBM. In both patients, SNCs were detected in breakpoint-flanking regions suggestive of the involvement of a low-fidelity DNA polymerase and a RBM. However, the involvement of NHEJ in the occurrence of these deletions cannot be excluded. In addition, the deletions identified in patients R84239, 70969 and 619 may have been caused by either NHEJ or RBM. The microinsertions identified at the breakpoints exhibit homology to sequences closely flanking the breakpoints and hence could have been caused by multiple template switching events indicative of a replication-based mechanism. However, we cannot exclude the possibility that the microinsertions occurred at random and that the deletions were mediated by NHEJ.
The deletion identified in patient D06.1047 may have been mediated by MMEJ since microhomology > 5 bp is indicative of MMEJ [29]. However, microhomology has also been observed at rearrangement breakpoints mediated by a replication-based mechanism and hence the deletion in this patient could also have been caused by an RBM.
In the case of patients 659, 1106 and D0801587, several mechanisms (Alu-mediated NAHR, MMEJ or RBM) could have caused the respective deletions. We observed directly oriented Alu elements at the deletion breakpoints of all three patients. Hence, the deletions may have been caused by Alu-mediated NAHR. Alternatively, the deletions could have been caused by MMEJ or RBM since microhomologies of 20 to 52 bp were observed at the breakpoints.
gThe centromeric deletion breakpoint in patient 659 is located within intron 7 of the LRRC37Bpseudogene in NF1-REPa whereas the telomeric breakpoint is located within intron 2 of the functional LRRC37B gene located in NF1-REPc. The breakpoints of the deletion in patient 659 do not overlap with the nonallelic homologous recombination (NAHR) hotspots of type-1 NF1 deletions, termed PRS1 and PRS2. Hence, this atypical NF1 deletion is not considered to exhibit recurrent breakpoints located within regions of extended sequence homology.
hThe grandmother of patient DA-77 exhibits somatic mosaicism since she possesses cells harboring the deletion alongside normal cells. Her granddaughter, patient DA-77, harbors the deletion in all of her cells.
Figure 2Structure of the SVA elements inserted at the deletion breakpoints and their source elements. (A) The SVA_F1 element H10_1 spans 4,039 bp and is the likely source element of the SVA copy that inserted within SUZ12P intron 8 in the grandmother of patient DA-77. Starting at its 5′ end, H10_1 comprises a target site duplication (TSD), a transduced sequence (5′TD) and a full-length AluSc from chromosome 9p13.3, a transduced partial exon 1 of MAST2, an Alu-like region, a variable number of tandem repeats (VNTR) region, a SINE-R, a polyA(17) tract, the second TSD, an AluSp element, a second polyA(17) tract and a non-repetitive, unique sequence resulting from a 3′ transduction that harbors two polyadenylation signals (AATAAA). The size of each region is given in basepairs. (B) A copy of the source element H10_1 integrated within SUZ12P intron 8 in the grandmother of patient DA-77. The SVA insertion was associated with a deletion of approximately 1 Mb. The inserted SVA spans 1.7 kb and is 5′ truncated. (C) Structure of the putative source SVA element H6_1084, which spans 2,691 bp and belongs to the SVA_F subfamily. A copy of H6_1084 is presumed to have integrated within SUZ12P intron 8 in patient ASB4-55. Full-length H6_1084 has the following structure starting from the 5′ end: a TSD, a 5′TD from chromosome 12p11.21, a CCCTCT(4) repeat, a 343-bp Alu-like region, a GC-rich VNTR region, a SINE-R element, two polyadenylation signals, a polyA(11) tract and the second TSD. The length of each region is indicated in basepairs. (D) Structure of the 5′ truncated copy of H6_1084 that has integrated within SUZ12P intron 8 in patient ASB4-55. The SVA insertion was associated with an atypical NF1 deletion of 867 kb. The SVA integration sites within SUZ12P intron 8 demarcate the centromeric breakpoints of the atypical NF1 deletions.
Figure 3Locations of the telomeric breakpoints identified in the 17 atypical deletions. A schematic representation of the genes located within the region is given on top. The extent of each of the 17 atypical NF1 deletions is indicated by a red bar. The centromeric breakpoints of these deletions differ from each other and are not indicated on this schema. The numbering of the breakpoint locations is according to the human GRCh37/hg19 assembly. Five deletions exhibited breakpoints that were located within a 32.6 kb region (demarcated by a grey box). tel, telomeric direction.
Figure 4Centromeric breakpoint positions of the 11 atypical deletions with centromeric breakpoints located within . The exon-intron structure of SUZ12P is indicated as well as the numbering of the exons presented as vertical black lines. The extent of each of the 11 atypical NF1 deletions is shown by red bars. The telomeric breakpoints of these deletions differ from each other and are not indicated on this schema. The numbering of the centromeric breakpoint locations is given according to the human GRCh37/hg19 assembly. tel, telomeric direction.
Figure 5Putative mechanism underlying the large atypical deletions identified in patient ASB4-55 and the grandmother of patient DA-77. The deletions were associated with the insertion of an SVA element mediated by the LINE 1 protein machinery via target-primed reverse transcription. (A) SUZ12P intron 8 is indicated in lilac whereas the telomeric part of the NF1 region is shown in green. The dotted lines indicate the approximately 1-Mb distance between these two regions. The SVA insertion within SUZ12P intron 8 is likely to have been initiated by the L1 endonuclease (L1-EN), which will have introduced a nick at the consensus cleavage site 5′-CTTT/A-3′. (B) Next, the SVA mRNA annealed to the T-overhang by means of its polyA-tail. Then, the L1 reverse transcriptase used the SVA mRNA as a template for reverse transcription to synthesize the SVA cDNA (blue). Second strand cleavage by the L1-EN occurred upstream of the first-strand cleavage site. Independently, a double strand break (DSB) occurred in the telomeric region of 17q11.2. (C,D) After dissociation of the SVA mRNA, the integration process was not finalized by recombinational repair using the downstream SUZ12P intron 8 sequence. Instead, the DNA ends were ligated by NHEJ to the open-ended DNA sequence located within the telomeric 17q11.2 region, between the RAB11FIP4 and COPRS genes, which resulted in the deletion of the intervening sequence and hence the occurrence of the atypical NF1 deletion (D).
Known pathogenic retrotransposon insertions associated with deletions ≥ 100 bp in the human genome
| 6p22.1 | Leukemiaa | SVA_F1 (2 kb) | ~14 kb | [ | |
| Xq28 | Adrenoleukodystrophy | 4,726 bp | [ | ||
| 1q25.1 | Antithrombin deficiency type 1 | 1,444 bp | [ | ||
| 8p21.3 | Lipoprotein lipase deficiency | 2.2 kb | [ | ||
| 8q12.2 | CHARGE syndrome | 10 kb | [ | ||
| 16p13.2 | Congenital disorders of glycosylation type-Ia | 28 kb | [ | ||
| 5q22.2 | Familial adenomatous polyposis | 1,599 bp | [ | ||
| 8q13.3 | Branchio-oto-renal syndrome | L1 Hs (3,756 bp) | 17 kb | [ | |
| 11p13 | Pyruvate dehydrogenase complex deficiency | L1 Hs (6,086 bp) | 46 kb | [ | |
| 17q21.31 | Hereditary breast/ovarian cancer | 23,363 bp | [ |
aThe germline SVA insertion-associated deletion was identified in three unrelated Japanese families. Of the individuals harboring the SVA insertion-associated deletion, one individual in each family presented with leukemia.
bThe affected family members harbored an intragenic 1,444 bp deletion and an insertion of a polyT tract of 40 nucleotides followed by a 6 bp sequence (5′-GAGACG-3′). This 6 bp sequence, located at the 3′end of the insertion, was homologous to the consensus sequence of the free right Alu monomer (FRAM).