| Literature DB >> 18067655 |
Solomon G Nergadze1, Marco Andrea Santagostino, Alberto Salzano, Chiara Mondello, Elena Giulotto.
Abstract
BACKGROUND: In vertebrates, tandem arrays of TTAGGG hexamers are present at both telomeres and intrachromosomal sites (interstitial telomeric sequences (ITSs)). We previously showed that, in primates, ITSs were inserted during the repair of DNA double-strand breaks and proposed that they could arise from either the capture of telomeric fragments or the action of telomerase.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18067655 PMCID: PMC2246262 DOI: 10.1186/gb-2007-8-12-r260
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Mechanisms of ITS insertion
| Number of loci | |||||||
| Rodents | Primates | ||||||
| Flanking sequence modification | Mouse | Rat | Total (%) | Human | Chimp | Both* | Total (%) |
| No modification | 15 | 16 | 31 (23,2) | 1 | 0 | 3 | 4 (16) |
| Deletion | 27 | 38 | 65 (50) | 2 | 2 | 9 | 13 (52) |
| Addition | 10 | 12 | 22 (17) | 1 | 1 | 2 | 4 (16) |
| Random sequence | 8 | 12 | |||||
| TERC sequence† | 2 | 0 | |||||
| Duplication | 2 | 4 | 6 (4,6) | 1 | 1 | 2 | 4 (16) |
| Addition and deletion | 4 | 2 | 6 (4,6) | 0 | 0 | 0 | 0 (0) |
| Random sequence addition | 1 | 2 | |||||
| TERC sequence addition† | 3 | 0 | |||||
| Total | 58 | 72 | 130 (100) | 5 | 4 | 16 | 25 (100) |
*These ITSs are present in both primate species and were inserted within repetitive elements. Their insertion mechanism was defined from the repetitive element consensus. †TERC sequence additions are present only in rodents.
Figure 1Length of ITSs. Comparison of ITS length in (a) the two primate and (b) the two rodent species.
ITS age
| Number of ITS (%) | ||||
| ITS type | Human | Chimpanzee | Mouse | Rat |
| Conserved ITS loci (old) | 75 (90.4) | 75 (94.9) | 58 (23.8) | 58 (23.2) |
| Species-specific ITS loci (young) | 5 (6.0) | 4 (5.1) | 58 (23.8) | 72 (28.8) |
| Not informative ITS loci* | 3 (3.6) | 0 (0) | 128 (52.5) | 120 (48.0) |
| Total | 83 | 79 | 244 | 250 |
*ITS loci for which the orthologous loci were not found or were grossly rearranged.
Telomeric sequence mutation
| Number of mismatches per TTAGGG unit | ||||
| ITS type | Human | Chimpanzee | Mouse | Rat |
| Conserved ITS loci (old) | 0.29 ± 0.07 | 0.30 ± 0.08 | 0.40 ± 0.13 | 0.34 ± 0.09 |
| Species specific ITS loci (young) | 0.13 ± 0.12 | 0 | 0.14 ± 0.03 | 0.12 ± 0.03 |
Figure 2Microhomology between break sites and inserted telomeric repeats. Telomeric repeats are in red; in the empty ancestral loci the nucleotides in register with the inserted telomeric repeats are boxed. (a) Mouse specific ITS at the MMU12qA1 locus; an AGGG tetranucleotide from the orthologous rat empty locus RNO6q15 is in register with the inserted telomeric repeats. (b) Rat specific ITS at RNO14q21; a GGG trinucleotide from the orthologous mouse locus MMU3qB3 is in register with the inserted telomeric repeats. (c) The human specific ITS at HSA11q24 was inserted together with seven random nucleotides; a TA dinucleotide from the orthologous chimpanzee PTR9 locus is in register with the inserted telomeric repeats.(d) The insertion of the chimpanzee specific ITS at PTR22 occurred together with a 7 bp deletion; an AGG trinucleotide from the orthologous human locus HSA21q22 is in register with the inserted telomeric repeats.
Number of loci containing nucleotides in register with the telomeric insertion*
| Number of observed loci (%) | |||||
| No. of nucleotides in register with telomeric insertion | Mouse | Rat | Human | Chimpanzee | No. of expected loci (%) |
| 0† | 11 (23) | 16 (25) | 1 (20) | 1 (33) | (75) |
| 1 or more‡ | 36 (77) | 47 (75) | 4 (80) | 2 (67) | (≤ 25) |
| 2 or more§ | 26 (55) | 31 (49) | 3 (60) | 2 (67) | (≤ 6.25) |
| 3 or more¶ | 16 (34) | 21 (33) | 2 (40) | 1 (33) | (≤ 1.56) |
| 4 or more¥ | 8 (17) | 11 (17) | 1 (20) | 0 (0) | (≤ 0.39) |
* For this analysis we utilized the informative species-specific loci listed in Additional data files 4 and 6, namely 47 mouse, 63 rat, 5 human and 3 chimpanzee ITS loci. †This class includes the loci in which the last nucleotide at the 3' end of the break is not in register with the inserted telomeric repeat insertion. ‡This class includes all loci with in register nucleotides. This class, together with the previous class comprises the totality of the loci. §These loci are also included in the '1 or more' class. ¶These loci are also included in the '1 or more and 2 or more' classes. ¥These loci are also included in the '1 or more, 2 or more and 3 or more' classes.
Mouse loci containing TERC-like sequences
| Mouse locus organization | Orthologous rat locus organization | |||||||
| Chromosomal localization | Starting nucleotide of fragment homologous to 3' TERC domain (length) | Position within TERC sequence | No. of nucleotides complementary to sequence preceding template | ITS length | Chromosomal localization | Starting nucleotide of TERC fragment (length) | Starting nucleotide of ITS (length) | |
| 1 | MMU8qA2 | 21522357 (38) | 351-388 | 0 | 213 | RNO16q12 | 73726184 (38) | 73726146 (58) |
| 2 | MMU13qA1 | 3475939 (54) | 331-384 | 0 | 22 | RNO17q12 | 77825660 (53) | 77825608 (22) |
| 3 | MMUXqC3 | 94682056 (52) | 328-377 | 7 | 25 | RNOXq31 | 88164322 (52) | 88164265 (43) |
| 4 | MMU5qA3 | 23908490 (37) | 357-393 | 6 | 21 | RNO3q41 | 139145891 | No TERC, no ITS |
| 5 | MMU9qA5 | 47975305 (60) | 314-373 | 6 | 68 | RNO8q23 | 51245798 | No TERC, no ITS |
| 6 | MMU1qC1 | 47024551 (31) | 341-374 | 3 | 57 | RNO9q22 | 52312295 | No TERC, no ITS |
| 7 | MMU4qD2 | 119006678 (42) | 351-392 | 6 | 53 | RNO5q36 | 140562692 | No TERC, no ITS |
| 8 | MMU10qB4 | 58505103 (118) | 271-388 | 0 | 27 | RNO20q11 | 37230663 | No TERC, no ITS |
| 9 | MMU12qF1 | 106800391 (81) | 308-388 | 0 | 13 | RNO6q32 | 136092775 | No TERC, no ITS |
| 10 | MMUXqA6 | 61359852 (74) | 322-395 | 6 | 23 | RNOXq37 | 152798431 | No TERC, no ITS |
| 11 | MMU1qC3 | 69326421 (50) | 346-395 | 6 | 139 | RNO9q32 | 68032617 | No TERC, no ITS |
| 12 | MMU10qA3 | 20387038 (98) | 289-388 | 6 | 108 | RNO1p12 | 15780743 | No TERC, no ITS |
| 13 | MMU6qC1 | 68259720 (44) | 351-394 | 0 | 93 | Not found | - | - |
| 14 | MMU11qC | 86742217 (38) | 343-381 | 0 | 55 | Not found | - | - |
Figure 3Organization of TERC-ITS loci. RNA sequences are in italic. The RNA sequences involved in the events and the DNA sequences corresponding to them (that is, complementary to retrotranscribed sequences) are in light colors (orange, grey and light blue) while the DNA sequences derived from retrotranscription of the RNA are in dark colors (red, black and dark blue). (a) Sequence of the mouse telomerase RNA component. The nucleotides of the canonical telomerase template, located near the 5' end, are shown in orange (nucleotides 3-10). Nucleotides adjacent to the template that are retrotranscribed together with the first inserted hexamer are grey underlined. The nucleotides of the 3' domain of TERC involved in the TERC-ITS loci are indicated in light blue. The 17 nt core sequence, present in all TERC-ITSs, has a blue background. In the 3' domain of the RNA, the mouse TERC sequence homologous to the human TERC sequence interacting with Ku is underlined. (b) Example of a mouse specific TERC-ITS locus (MMU9qA5). The top row shows the 5' domain of TERC containing the canonical template (orange) and the adjacent sequence (grey underlined). The second row shows the sequence of the mouse locus: telomeric repeats are in red; the nucleotides complementary to those adjacent to the hexameric template are black underlined; the light blue nucleotides indicate the region derived from the 3' domain of TERC. The third row reports, in light blue, the sequence of the 3' domain of TERC from nucleotides 314 to nucleotides 373. The bottom row shows the sequence of the orthologous empty rat locus RNO8q23. The CG dinucleotide (yellow) is present both in the ancestral rat sequence, at the 3' end of the break, and in the region of the TERC RNA immediately preceding the retrotranscribed 3' domain. (c) Overall organization of TERC-ITS loci. At the top is the structure of TERC: orange oval, canonical template; grey square, adjacent nucleotides; light blue strip, 3' domain. At the bottom is the organization of the double-stranded DNA at TERC-ITS loci: light blue strip, sequence corresponding to the 3' domain of TERC; blue strip, complementary sequence; black square, sequence complementary to the nucleotides adjacent to the canonical template; grey square, sequence corresponding to the nucleotides adjacent to the canonical template; red ovals, TTAGGG repeats; orange ovals, complementary repeats.
Figure 4Model for TERC-ITS insertion. (a) TERC interaction at DNA double strand break. (b) Retrotranscription of TERC 3' domain. (c) Second strand synthesis. (d) Retrotranscription of TERC 5' domain. (e) Digestion of DNA/RNA junction and addition of canonical telomeric repeats. (f) Gap filling. Curved thin lines represent the telomerase RNA (TERC) in which the orange oval corresponds to the canonical telomeric template, the grey line to the nucleotides immediately adjacent to the 3' side of the template, the yellow line to nucleotides homologous to the last nucleotides of the break site; the light blue line represents the retrotranscribed 3' region of TERC. Straight thick lines represent DNA strands. The DNA involved in the double-strand break is in black, the yellow boxes correspond to nucleotides homologous to the region of TERC preceding the sequence retrotranscribed from the 3' end, the dark blue line represents the DNA strand retrotranscribed from the 3' end of TERC and the light blue line is the complementary strand. Red and orange ovals represent TTAGGG and CCCTAA repeats, respectively. Black and grey lines correspond to the sequence homologous to the nucleotides immediately adjacent to the telomeric template.