| Literature DB >> 32325780 |
Marco Santagostino1, Francesca M Piras1, Eleonora Cappelletti1, Simone Del Giudice1, Ornella Semino1, Solomon G Nergadze1, Elena Giulotto1.
Abstract
Interstitial telomeric sequences (ITSs) are short stretches of telomeric-like repeats (TTAGGG)n at nonterminal chromosomal sites. We previously demonstrated that, in the genomes of primates and rodents, ITSs were inserted during the repair of DNA double-strand breaks. These conclusions were derived from sequence comparisons of ITS-containing loci and ITS-less orthologous loci in different species. To our knowledge, insertion polymorphism of ITSs, i.e., the presence of an ITS-containing allele and an ITS-less allele in the same species, has not been described. In this work, we carried out a genome-wide analysis of 2504 human genomic sequences retrieved from the 1000 Genomes Project and a PCR-based analysis of 209 human DNA samples. In spite of the large number of individual genomes analyzed we did not find any evidence of insertion polymorphism in the human population. On the contrary, the analysis of ITS loci in the genome of a single horse individual, the reference genome, allowed us to identify five heterozygous ITS loci, suggesting that insertion polymorphism of ITSs is an important source of genetic variability in this species. Finally, following a comparative sequence analysis of horse ITSs and of their orthologous empty loci in other Perissodactyla, we propose models for the mechanism of ITS insertion during the evolution of this order.Entities:
Keywords: DNA repair; evolution; horse genome; human genome; insertion polymorphism; interstitial telomeres; telomerase; telomeres
Year: 2020 PMID: 32325780 PMCID: PMC7215372 DOI: 10.3390/ijms21082838
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Human-specific interstitial telomeric sequence (ITS) loci analyzed for insertion polymorphism. Coordinates of the loci, country of origin and number of individuals analyzed for each locus are indicated. NA, not assayed.
| Locus, Number of Individuals | ||||||||
|---|---|---|---|---|---|---|---|---|
| chr3:159412014-159412054 | chr11:129573442-129573473 | chr15:42243201-42243234 | chr16:75368098-75368138 | |||||
| Macro Area (Country/Population) | No. of Individuals | No. of Empty Loci | No. of Individuals | No. of Empty Loci | No. of Individuals | No. of Empty Loci | No. of Individuals | No. of Empty Loci |
| West Africa (Senegal/Wolof) | 17 | 0 | 25 | 0 | 24 | 0 | 22 | 0 |
| West Africa (Senegal/Mandenka) | 22 | 0 | 26 | 0 | 28 | 0 | 28 | 0 |
| East Africa (Kenya) | 10 | 0 | 9 | 0 | 10 | 0 | 11 | 0 |
| East Africa (Ethiopia) | 11 | 0 | 13 | 0 | 12 | 0 | 13 | 0 |
| Near East (Lebanon) | 31 | 0 | NA | NA | 31 | 0 | 30 | 0 |
| Asia (Turkey) | 17 | 0 | NA | NA | 20 | 0 | 14 | 0 |
| Asia (China/Han) | 6 | 0 | 11 | 0 | 7 | 0 | 7 | 0 |
| Europe (Croatia) | 11 | 0 | NA | NA | 11 | 0 | 10 | 0 |
| Europe (Hungary) | 6 | 0 | NA | NA | 9 | 0 | 7 | 0 |
| Europe (Italy) | 19 | 0 | 12 | 0 | 26 | 0 | 23 | 0 |
| Europe (Poland) | 15 | 0 | 10 | 0 | 13 | 0 | 11 | 0 |
| Europe (Ukraine) | 10 | 0 | 10 | 0 | 8 | 0 | 10 | 0 |
| Europe (Others) | 12 | 0 | 11 | 0 | 10 | 0 | 10 | 0 |
|
| 187 | 0 | 127 | 0 | 209 | 0 | 196 | 0 |
Figure 1Fluorescence in situ hybridization with telomeric probe on two metaphase spreads from horse primary fibroblasts. Chromosomes are labelled with DAPI. Hybridization signals (red) at chromosome ends mark telomeres. Some of the chromosomes showing faint hybridization signals corresponding to short interstitial telomeric sequences are indicated (white arrows). Magnifications of selected chromosomes are reported below each metaphase, and some of the interstitial telomeric signals are marked by arrowheads.
Figure 2ITS loci heterozygous in Twilight. For each locus, sequence alignment of the empty allele (top) with the ITS-containing allele (bottom) is shown. The sequence indicated as Reference corresponds to the allele found in the horse reference genome EquCab3.0, the alternative allele is indicated by the NCBI Trace ID number. Telomeric repeats in TTAGGG and CCCTAA orientation are indicated in red and orange, respectively. Flanking sequence modifications that occurred together with telomeric repeat insertions are indicated using different colors: nucleotides deleted from the flanking sequence are in blue; random nucleotides sequence insertions are in green; duplications of the insertion site are shaded in grey. Nucleotides at break sites in register with the inserted telomeric repeats are boxed.
Insertion polymorphism of ITS loci in seven horse populations.
| Frequency of the Empty Allele | ||||
|---|---|---|---|---|
| Population | Number of Individuals | chr2:13178780-13178780 | chr15:23487997-23488047 | chr19:32741-32803 |
| Show Jumping horse | 18 | 0.53 | 0.53 | 0.89 |
| Quarter horse | 20 | 0.47 | 0.62 | 0.87 |
| Lipizzaner | 23 | 0.83 | 0.96 | 1.00 |
| Norwegian Fjord | 20 | 0.85 | 0.63 | 1.00 |
| Andalusian | 25 | 0.66 | 0.56 | 1.00 |
| Icelandic pony | 26 | 1.00 | 0.85 | 1.00 |
| Przewalski’s horse | 20 | 1.00 | 0.75 | 1.00 |
Variable number of tandem repeats at 11 ITS loci in 18 horses.
| ITS Locus | No. of Telomeric Repeats | Frequency |
|---|---|---|
|
| ||
| Allele 1 | 11 | 0.03 |
| Allele 2 | 12 | 0.14 |
| Allele 3 | 13 | 0.61 |
| Allele 4 | 14 | 0.11 |
| Allele 5 | 15 | 0.08 |
| Allele 6 | 16 | 0.03 |
|
| ||
| Allele 1 | 5 | 0.03 |
| Allele 2 | 7 | 0.31 |
| Allele 3 | 8 | 0.25 |
| Allele 4 | 9 | 0.36 |
| Allele 5 | 10 | 0.06 |
|
| ||
| Allele 1 | 6 | 0.72 |
| Allele 2 | 7 | 0.11 |
| Allele 3 | 8 | 0.17 |
|
| ||
| Allele 1 | 5 | 0.06 |
| Allele 2 | 7 | 0.78 |
| Allele 3 | 8 | 0.17 |
|
| ||
| Allele 1 | 9 | 0.72 |
| Allele 2 | 10 | 0.28 |
|
| ||
| Allele 1 | 7 | 0.11 |
| Allele 2 | 8 | 0.89 |
|
| ||
| Allele 1 | 6 | 1.00 |
|
| ||
| Allele 1 | 16 | 1.00 |
|
| ||
| Allele 1 | 0 | 0.53 |
| Allele 2 | 15 | 0.11 |
| Allele 3 | 16 | 0.31 |
| Allele 4 | 17 | 0.06 |
|
| ||
| Allele 1 | 0 | 0.53 |
| Allele 2 | 6 | 0.17 |
| Allele 3 | 7 | 0.31 |
|
| ||
| Allele 1 | 0 | 0.89 |
| Allele 2 | 10 | 0.11 |
Horse ITS loci informative for the mechanisms of telomeric repeat insertion. Loci are ranked by coordinate. Legend: ITS, telomeric repeats are present at the insertion site; Empty, the orthologous locus is empty at the insertion site; NF, orthologous locus not found.
| Loci Orthologous to Horse ITS | ||
|---|---|---|
| Horse ITS Locus | Donkey | White Rhinoceros |
| chr1:27309501-27309532 | ITS | Empty |
| chr1:90843617-90843649 | Empty | Empty |
| chr1:98488870-98488928 | ITS | Empty |
| chr1:156995264-156995361 | ITS | Empty |
| chr2:661976-662028 | ITS | Empty |
| chr2:13178780-13178780 * | Empty | Empty |
| chr2:30174747-30174793 | ITS | Empty |
| chr3:6047114-6047139 | ITS | Empty |
| chr3:45427234-45427258 | ITS | Empty |
| chr3:82188585-82188664 | ITS | Empty |
| chr3:92686382-92686405 | ITS | Empty |
| chr3:106340791-106340815 | ITS | Empty |
| chr4:65502437-65502468 | ITS | Empty |
| chr4:68540207-68540247 | ITS | Empty |
| chr4:81320764-81320806 | ITS | Empty |
| chr4:90006773-90006813 | Empty | NF |
| chr5:36007702-36007752 | Empty | NF |
| chr5:88367091-88367115 | ITS | Empty |
| chr6:8289417-8289447 | ITS | Empty |
| chr6:38922364-38922401 | ITS | Empty |
| chr7:29297817-29297852 | ITS | Empty |
| chr7:51713092-51713124 | ITS | Empty |
| chr7:100764221-100764245 | ITS | Empty |
| chr8:24251768-24251863 | Empty | Empty |
| chr8:64629752-64629778 | ITS | Empty |
| chr8:94527922-94527948 | ITS | Empty |
| chr9:1570127-1570127 * | Empty | Empty |
| chr9:74200299-74200328 | ITS | Empty |
| chr9:80878399-80878422 | ITS | Empty |
| chr10:7134943-7135113 | ITS | Empty |
| chr10:22622806-22622833 | ITS | Empty |
| chr11:42736447-42736474 | ITS | Empty |
| chr11:44810227-44810253 | ITS | Empty |
| chr12:32574287-32574323 | ITS | Empty |
| chr15:23487997-23488047 * | ITS | NF |
| chr15:29677583-29677653 | Empty | Empty |
| chr15:51951378-51951403 | ITS | Empty |
| chr17:74095846-74095878 | ITS | Empty |
| chr17:24696793-24696817 | ITS | Empty |
| chr17:76046120-76046146 | ITS | Empty |
| chr18:79615942-79615971 | ITS | Empty |
| chr19:32741-32803 * | Empty | Empty |
| chr19:1124529-1124602 | ITS | Empty |
| chr19:10034261-10034295 | Empty | NF |
| chr20:18458893-18458969 | Empty | Empty |
| chr21:15781540-15781563 | ITS | Empty |
| chr21:52191569-52191606 | ITS | Empty |
| chr22:8731063-8731104 | ITS | Empty |
| chr23:37412257-37412345 | ITS | Empty |
| chr23:40710560-40710607 | Empty | Empty |
| chr23:44283331-44283381 | ITS | Empty |
| chr23:45515290-45515320 | ITS | Empty |
| chr24:26535796-26535829 | ITS | Empty |
| chr25:15560895-15560949 | ITS | Empty |
| chr26:28375514-28375542 | ITS | Empty |
| chr27:11505874-11505908 | ITS | Empty |
| chr27:21217687-21217745 * | Empty | NF |
| chr27:24213772-24213798 | ITS | Empty |
| chr27:29189054-29189097 | ITS | Empty |
| chr28:18062208-18062233 | ITS | Empty |
| chr28:41719878-41719912 | ITS | Empty |
| chr29:23755261-23755308 | ITS | Empty |
| chr30:11679429-11679454 | ITS | Empty |
| chr30:25048051-25048089 | ITS | Empty |
| chrX:95073117-95073140 | ITS | Empty |
| chrX:118861332-118861368 | Empty | Empty |
* ITS locus heterozygous in Twilight.
Figure 3Examples of data used to describe the insertion mechanism of telomeric repeats. For each locus, the alignment of the empty ancestral sequence from donkey or white rhinoceros with the ITS sequence in the horse reference genome assembly is shown. A sketch of the mechanism responsible for telomeric repeat insertion is shown on top of each sequence alignment. Telomeric repeats in TTAGGG and CCCTAA orientation are indicated in red and orange, respectively. At empty loci, nucleotides in register with the inserted telomeric repeats are boxed. (a) Interstitial telomeric repeat insertion occurred without modification of the sequences flanking the double-strand break. The orthologous locus from donkey is empty at the insertion site. The double-strand break exposed a GGG trinucleotide in register with the inserted telomeric repeats. (b) Interstitial telomeric repeat insertion accompanied by the deletion of nucleotides from the flanking sequence (blue nucleotides, blue strip in sketch). The double-strand break exposed a GGT trinucleotide in register with the inserted telomeric repeats. (c) Interstitial telomeric repeat insertion accompanied by the addition of a nucleotide sequence (green nucleotides, green strip in sketch). Telomeric repeat insertion was accompanied by the addition of a random nucleotide sequence, and the double-strand break exposed an AG dinucleotide in register with the inserted telomeric repeats. The ITS was inserted together with 17 bp homologous to a region of horse TERC 91 nucleotides away from the telomeric repeat template. (d) Interstitial telomeric repeat insertion at a staggered double-strand break followed by flanking sequence duplication (nucleotides shaded in grey, grey strip in sketch). The double-strand break exposed a GG dinucleotide in register with the inserted telomeric repeats. (e) Interstitial telomeric repeat insertion accompanied by a complex modification of the insertion site involving the simultaneous deletion of nucleotides and addition of a random sequence. A TA dinucleotide in register with the inserted telomeric repeats was exposed by the double-strand break.
Figure 4ITS insertions accompanied by complex rearrangements. Telomeric repeat insertions were accompanied by complex modifications of the break site involving addition of random nucleotide sequences, deletion of nucleotides from the insertion site, inversions or generation of inverted duplications. (a) Telomeric repeat insertion was accompanied by inversion of a single copy sequence (light green arrow) and generation of inverted duplications (yellow and light blue arrowheads) of sequences present at the empty locus. The rearrangement also caused the formation of telomeric stretches with head-to-head orientation. (b) The ancestral ITS-less locus from donkey contains a single copy sequence (light green arrow) flanked by short direct repeats (grey arrowheads). Telomeric repeat insertion at the horse locus was accompanied by duplication and inversion of the upstream direct repeat and of the single copy sequence. (c) The empty locus in rhinoceros contains a Mammalian-wide interspersed repeat (Mir, purple arrow) and a single copy sequence (light green arrow). Telomeric repeat insertion was accompanied by deletion of a small sequence from the insertion site (blue strip), duplication and inversion of the single copy sequence and of a portion of the Mir and insertion of small random nucleotide sequences (green strips).
Flanking sequence modification.
| Flanking Sequence Modification | Number of Loci (%) |
|---|---|
| No modification | 11 (16.7) |
| Deletion | 20 (30.3) |
| Sequence insertion (Total) | 13 (19.7) |
| Random sequence | 12 (18.2) |
| TERC sequence | 1 (1.5) |
| Duplication | 3 (4.5) |
| Complex modification (Total) | 19 (28.8) |
| Deletions and additions | 14 (21.2) |
| Deletion, addition, duplication | 2 (3.0) |
| Complex modifications including inversions | 3 (4.5) |
|
| 66 (100) |
Number of loci containing nucleotides in register with the telomeric insertion.
| No. of Nucleotides in Register with Telomeric Insertion | Number of Observed Loci (%) | Expected Loci (%) |
|---|---|---|
| 0 | 9 (22.50) | (75) |
| 1 or more | 31 (77.50) | (≤25) |
| 2 or more | 24 (60.00) | (≤6.25) |
| 3 or more | 14 (35.00) | (≤1.56) |
| 4 or more | 6 (15.00) | (≤0.39) |
| 5 or more | 2 (5.00) | (≤0.16) |