| Literature DB >> 32957642 |
Werica P Bernardo1, Renata T Souza1, André G Costa-Martins2, Eden R Ferreira1, Renato A Mortara1, Marta M G Teixeira2, José Luis Ramirez3, José F Da Silveira1.
Abstract
Retrotransposon Hot Spot (RHS) is the most abundant gene family in Trypanosoma cruzi, with unknown function in this parasite. The aim of this work was to shed light on the organization and expression of RHS in T. cruzi. The diversity of the RHS protein family in T. cruzi was demonstrated by phylogenetic and recombination analyses. Transcribed sequences carrying the RHS domain were classified into ten distinct groups of monophyletic origin. We identified numerous recombination events among the RHS and traced the origins of the donors and target sequences. The transcribed RHS genes have a mosaic structure that may contain fragments of different RHS inserted in the target sequence. About 30% of RHS sequences are located in the subtelomere, a region very susceptible to recombination. The evolution of the RHS family has been marked by many events, including gene duplication by unequal mitotic crossing-over, homologous, as well as ectopic recombination, and gene conversion. The expression of RHS was analyzed by immunofluorescence and immunoblotting using anti-RHS antibodies. RHS proteins are evenly distributed in the nuclear region of T. cruzi replicative forms (amastigote and epimastigote), suggesting that they could be involved in the control of the chromatin structure and gene expression, as has been proposed for T. brucei.Entities:
Keywords: Retrotransposon Hot Spot (RHS) multigene family; Trypanosoma cruzi; chromosome distribution; evolution; gene mosaic structure; nuclear protein; recombination
Mesh:
Substances:
Year: 2020 PMID: 32957642 PMCID: PMC7563717 DOI: 10.3390/genes11091085
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Circos diagram depicting the genomic organization and recombination events of the RHS family in the whole genome of Trypanosoma cruzi clone CLB. Inner track 1 represents the recombination between RHS genes. The recombinant sequences are linked to putative major and minor parental, using purple and green lines, respectively. Track 2 shows the genomic organization of RHS genes in chromosomes. Genes on forward and reverse strands are colored in blue and red, respectively. Track 3 shows the genomic organization of RHS pseudogenes in chromosomes. Pseudogenes on forward and reverse strands are colored in green and orange, respectively. Track 4 depicts a heat map of RHS genes’ and pseudogenes’ density for each chromosome. Values were obtained by summing the length (bp) of RHS genes and pseudogenes and were divided by the chromosome size. Outer track 5 shows the representation of T. cruzi CLB chromosomes for Esmeraldo (haplotype S) and non-Esmeraldo (haplotype P) allelic loci.
Figure 2Phylogeny and classification of transcribed RHS sequences. Phylogenetic analysis was carried out using the RaxML v 8.2.9 program with an automatic search for substitution models (PROTGAMMAAUTO) selected using the Akaike information criterion (AIC) (auto-prot = AIC), with 1000 bootstrap replicates. Groups 1–10 comprise RHS sequences, with supported values separated by colors, and RHS sequences with bootstrap values below the cutoff (unclassified groups) are indicated in black.
Distribution of the members of RHS groups across the chromosomes of clone CLB.
| Group | Gene ID TriTrypDB 1 | CDS (bp) 2 | Peptide (aa) 3 | Direction of Transcription 4 | Subtelomeric Region 5 | Chromosome 6 |
|---|---|---|---|---|---|---|
| 1 | TcCLB.511845.10 | 270 | 90 | Sense | - | TcChr20-P (580,762–581,031) |
| TcCLB.509717.176 | 402 | 134 | Sense | - | TcChr4-P (157,230–157,631) | |
| TcCLB.509295.90 | 771 | 256 | Sense | Tel 6 | TcChr28-P (746,714–747,484) | |
| TcCLB.510479.11 | 1701 | 567 | Sense | - | TcChr38-P (1,335,682–1,337,382) | |
| TcCLB.506961.10 | 1929 | 642 | Anti-Sense | - | TcChr18-S (118–2046) | |
| TcCLB.506001.90 | 2763 | 920 | Sense | - | TcChr4-P (166,550–169,312) | |
| TcCLB.507167.70 | 2772 | 923 | Sense | Tel 6 | TcChr28-P (837,994–840,765) | |
| TcCLB.508479.500 | 2892 | 963 | Anti-Sense | - | TcChr40-P (1,914,173–1,917,064) | |
| 2 | TcCLB.509875.11 | 819 | 273 | Sense | Tel 13 | TcChr26-P (793,295–794,113) |
| TcCLB.509873.10 | 831 | 276 | Sense | Tel 13 | TcChr26-P (794,215–795,045) | |
| TcCLB.508285.10 | 1767 | 588 | Sense | Tel 3 | TcChr19-S (653,962–655,728) | |
| TcCLB.506421.10 | 1038 | 345 | Anti-Sense | Tel 49 | TcChr31-P (53,479–54,51) | |
| TcCLB.509915.60 | 1767 | 588 | Anti-Sense | Tel 49 | TcChr31-P (64,469–66,235) | |
| TcCLB.506443.150 | 2400 | 799 | Sense | Tel 24 | TcChr11-P (510,464–512,863) | |
| TcCLB.507555.80 | 2757 | 918 | Anti-Sense | Tel 35 | TcChr35-S (510,464–512,863) | |
| 3 | TcCLB.459199.10 | 2820 | 939 | Anti-Sense | Tel 28 | TcChr15-P (5578–8397) |
| TcCLB.506047.20 | 1815 | 604 | Sense | Tel 9 | TcChr35-S (1,183,688–1,185,502) | |
| TcCLB.506017.51 | 1122 | 374 | Sense | - | TcChr29-P (869,711–870,832) | |
| TcCLB.507167.20 | 2835 | 944 | Sense | Tel 6 | TcChr28-P (849,015–851,849) | |
| TcCLB.507611.10 | 2841 | 946 | Anti-Sense | Tel 17 | TcChr37-S (1391–4231) | |
| TcCLB.506393189 | 2274 | 758 | Sense | - | TcChr14-P (596,251–598,524) | |
| TcCLB.506323.30 | 2790 | 929 | Anti-Sense | Tel 4 | TcChr22-P (62,292–65,081) | |
| TcCLB.509429.4 | 2613 | 871 | Sense | - | TcChr6-P (364,778–367,390) | |
| TcCLB.511773.110 | 2472 | 995 | Anti-Sense | - | TcChr17-P (301–2772) | |
| TcCLB.508037.10 | 1146 | 381 | Anti-Sense | Tel 48 | TcChr27-S (1297–2442) | |
| TcCLB.511929.30 | 2781 | 926 | Sense | - | TcChr25-P (736,933–739,713) | |
| TcCLB.504109.200 | 3294 | 1097 | Anti-Sense | - | TcChr39-P (599–3892) | |
| TcCLB.508473.10 | 4512 | 1503 | Sense | Tel 30 | TcChr39-S (1,847,980–1,852,491) | |
| TcCLB.507625.10 | 4149 | 1382 | Sense | Tel 45 | TcChr40-S (1,133,828–1,137,976) | |
| TcCLB.39997.10 | 1053 | 350 | Anti-Sense | - | TcChr37-P (33,320–34,372) | |
| 4 | TcCLB.504343.30 | 1779 | 592 | Anti-Sense | - | TcChr7-S (60,071–61,849) |
| TcCLB.507.907.30 | 1779 | 592 | Anti-Sense | - | TcChr7-S (73,533–75,311) | |
| TcCLB.507.907.60 | 1779 | 592 | Anti-Sense | - | TcChr7-S (62,859–64,637) | |
| TcCLB.505207.30 | 1626 | 541 | Anti-Sense | - | TcChr41-P (8244–9869) | |
| 5 | TcCLB.511019.80 * | 1500 | 499 | Sense | - | TcChr35-P (101,616–103,187) |
| TcCLB.503881.30 | 1509 | 502 | Sense | - | TcChr33-S (730,729–732,237) | |
| TcCLB.508119.140 | 1503 | 500 | Anti-Sense | - | TcChr33-P (724,554–726,056) | |
| TcCLB.511907.330 | 1503 | 500 | Sense | - | TcChr26-P (250,686–252,188) | |
| TcCLB.506529.680 | 444 | 148 | Sense | - | TcChr6-S (201,683–202,126 | |
| TcCLB.510889.352 | 510 | 170 | Sense | - | TcChr6-P (201,577–202,086 | |
| 6 | TcCLB.509085.120 | 1896 | 631 | Anti-Sense | - | TcChr15-P (164,566–166,461 |
| TcCLB.509437.110 | 1896 | 631 | Sense | - | TcChr15-P (256,827–258,722 | |
| TcCLB.509349.20 | 1893 | 630 | Anti-Sense | Tel 2 | TcChr11-S (115,973–117,865) | |
| TcCLB.508479.80 | 1947 | 648 | Sense | - | TcChr40-P (1,993,454–1,995,400) | |
| TcCLB.509163.110 | 1962 | 653 | Sense | - | TcChr35-P (1,138,639–1,140,600) | |
| TcCLB.511871.130 | 1896 | 631 | Sense | - | TcChr15-S (101,636–103,531) | |
| TcCLB.511861.90 | 1896 | 631 | Sense | - | TcChr15-P (118,605–120,500) | |
| TcCLB.511863.4 | 1572 | 524 | Sense | - | TcChr15-P (101,616–103,187) | |
| 7 | TcCLB.506809.5 | 354 | 117 | Sense | - | TcChr16-P (453,466–453,819) |
| TcCLB.509575.10 | 2763 | 920 | Sense | - | TcChr16-P (389,424–392,186) | |
| TcCLB.424771.10 | 873 | 290 | Sense | - | TcChr16-P (477,440–478,312) | |
| TcCLB.507843.10 | 1779 | 592 | Sense | - | TcChr16-S (390,540–392,318) | |
| TcCLB.509827.4 | 962 | 320 | Sense | - | TcChr16-S (389,477–390,438) | |
| TcCLB.507841.14 | 2562 | 854 | Sense | - | TcChr16-S (452,820–455,381) | |
| 8 | TcCLB.511019.13 | 1548 | 516 | Sense | - | TcChr35-P (446,350–447,897) |
| TcCLB.509219.20 | 3633 | 1210 | Sense | - | TcChr20-P (567,813–571,445) | |
| TcCLB.506271.30 | 324 | 108 | Sense | - | TcChr20-P (586,959–587,282) | |
| TcCLB.510643.190 | 2496 | 831 | Sense | - | TcChr16-P (642,806–645,301) | |
| TcCLB.505997.60 | 2316 | 771 | Anti-Sense | Tel 1 | TcChr9-P (12,280–14,595) | |
| TcCLB.506595.149 | 2465 | 821 | Anti-Sense | - | TcChr33-P (101–2565) | |
| TcCLB.511371.10 | 1785 | 594 | Sense | - | TcChr5-S (200,095–201,879) | |
| TcCLB.511415.11 | 1095 | 365 | Anti-Sense | - | TcChr9-S (30,121–31,215) | |
| TcCLB.508559.90 | 1821 | 606 | Sense | Tel 21 | TcChr25-S (700,188–702,008) | |
| TcCLB.511585.320 | 1932 | 643 | Anti-Sense | Tel 14 | TcChr33-S (31,331–33,262) | |
| TcCLB.507015.10 * | 2988 | 995 | Anti-Sense | Tel 10 | TcChr13-P (1626–4613) | |
| TcCLB.509917.19 | 1815 | 605 | Anti-Sense | Tel 49 | TcChr31-P (54,619–56,433) | |
| 9 | TcCLB.503401.11 | 243 | 81 | Sense | - | TcChr22-S (214,572–214,814) |
| TcCLB.506629.240 | 327 | 109 | Anti-Sense | - | TcChr39-P (389,442–389,768) | |
| TcCLB.509829.9 | 909 | 303 | Anti-Sense | - | TcChr39-S 392,244–393) | |
| TcCLB.509329.9 | 752 | 250 | Sense | - | TcChr22-P (339,264–340,015) | |
| TcCLB.509463.41 * | 1209 | 403 | Anti-Sense | - | TcChr22-P (391,811–393,019) | |
| TcCLB.509843.10 | 1503 | 500 | Sense | - | TcChr22-S (214,918–216,420) | |
| 10 | TcCLB.506139.200 | 1674 | 557 | Sense | - | TcChr18-P (357,746–359,419) |
| TcCLB.510845.10 | 1824 | 608 | Anti-Sense | - | TcChr19-S (28,739–30,562) |
1 TriTrypDB [41]. 2 CDS (coding DNA sequence), size in bp. 3 The translated peptide, size in amino acid (aa). 4 The direction of transcription. 5 RHS is located in the subtelomeric regions of the chromosomes of clone CLB [45]. 6 Genomic coordinates at the in silico chromosome of clone CLB (TcChr) [40]. * The other allele at the same locus is a pseudogene.
Figure 3Gene duplication events in the RHS sequences of clone CLB. The figure shows the physical map of the chromosome regions involved in the recombination event. For clarity, only RHS sequences are shown. The direction of transcription is indicated by blue (sense) and red (anti-sense) arrows. (A,B) Groups 1 and 4: duplication of RHS genes by unequal crossing-over with loss of tandem counterparts in one of the haplotypes (TcCh4-S and TcChr7-P). (C) Group 6: duplication of the RHS genes by unequal crossing-over with the conservation of one of the RHS counterparts in the TcChr15-S haplotype. (D) Group 7: duplication followed by genetic conversion between paralogous genes located in the TcChr16-P and TcChr16-S haplotypes (interlocus nonallelic gene conversion). The identity between homologous RHS proteins of the P and S haplotypes is indicated in the figure. The identity between paralogous RHS proteins ranged from 93 to 100%. The physical maps showing the position of RHS sequences were downloaded from the public genome database TriTrypDB [41].
Figure 4Detection of potential recombination events in T. cruzi RHS sequences. Recombination analysis was performed using the RDP4 program composed of non-parametric recombination detection methods by the algorithms: RDP, GENECONV, MaxChi, Chimera, Bootscan, SiSscan, and 3Seq. RHS sequences of groups 1–10 (parental sequences) are highlighted in different colors and unclassified groups (recombinant sequences) are presented in black. All RHS sequences are also indicated by their access number in the TriTrypDB [41].
Figure 5Analysis of the expression of RHS by Western blot in T. cruzi and other trypanosomes and cellular localization in T. cruzi by indirect immunofluorescence. (A) Protein extracts of epimastigotes of T. cruzi (CLB and strain G), T. cruzi marinkellei (Tcm), and T. rangeli (Tr) and procyclic forms of T. brucei (Tb) were separated by SDS-PAGE, transferred to nitrocellulose membranes, and incubated with anti-RHS polyclonal antibodies (diluted 1:500). The RHS recombinant protein was included as a positive control. The molecular masses of the reference proteins are indicated on the left in kDa. (B) Confocal microscopy images from indirect immunofluorescence reaction with anti-RHS antibodies (diluted 1:1000) in permeabilized epimastigotes of clone CLB. The labeling of the nucleus and kinetoplast DNA (DAPI) and RHS proteins is shown in blue and green, respectively. At the top, the reaction with two epimastigotes is shown at 3 μm scale. In the lower panel, the image shows epimastigotes (scale bar 10 μm). N, nucleus; K, kinetoplast.
Figure 6Cellular localization of RHS in the amastigote of T. cruzi. Confocal microscopy images from indirect immunofluorescence reaction with anti-RHS antibodies (diluted 1:1000) in permeabilized epimastigotes and amastigotes of clone CLB. The labeling of the nucleus and kinetoplast DNA (DAPI) and RHS proteins is shown in blue and green, respectively. Scale bar 3 μm. N, nucleus; K, kinetoplast.