| Literature DB >> 33066599 |
Alfonso Herreros-Cabello1, Francisco Callejas-Hernández1, Núria Gironès1,2, Manuel Fresno1,2.
Abstract
Chagas disease caused by the parasite Trypanosoma cruzi affects millions of people. Although its first genome dates from 2005, its complexity hindered a complete assembly and annotation. However, the new sequencing methods have improved genome annotation of some strains elucidating the broad genetic diversity and complexity of this parasite. Here, we reviewed the genomic structure and regulation, the genetic diversity, and the analysis of the principal multi-gene families of the recent genomes for several strains. The telomeric and sub-telomeric regions are sites with high recombination events, the genome displays two different compartments, the core and the disruptive, and the genome plasticity seems to play a key role in the survival and the infection process. Trypanosoma cruzi (T. cruzi) genome is composed mainly of multi-gene families as the trans-sialidases, mucins, and mucin-associated surface proteins. Trans-sialidases are the most abundant genes in the genome and show an important role in the effectiveness of the infection and the parasite survival. Mucins and MASPs are also important glycosylated proteins of the surface of the parasite that play a major biological role in both insect and mammal-dwelling stages. Altogether, these studies confirm the complexity of T. cruzi genome revealing relevant concepts to better understand Chagas disease.Entities:
Keywords: Trypanosoma cruzi strain; gene expression; genome plasticity; mucins; sequencing methods; trans-sialidases
Year: 2020 PMID: 33066599 PMCID: PMC7602482 DOI: 10.3390/genes11101196
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Different classifications of Trypanosoma cruzi since 1999. (A) Classification of the meeting of 1999. (B) First consensus classification of 2009. (C) Second consensus classification of 2009. (D) Alternative classification proposed in 2016.
Data of the most recent genomes of the best-studied strains of T. cruzi and the B7 strain of T. cruzi marinkellei. BNEL: CL Brener Non-Esmeraldo-like; BEL: CL Brener Esmeraldo-like; PacBio: Pacific Biosciences. Contig N50: is a statistic median such that the 50% of the whole assembly is contained in contigs equal to or larger than this value.
| Strain | DTU | Size (Mbp) | Contigs | Contig N50 | %GC | Date of Version | Sequencing Method | References |
|---|---|---|---|---|---|---|---|---|
| G | I | 25.17 | 1450 | 74,655 | 47.40 | November 2018 | Roche 454 | [ |
| Dm28c | I | 53.27 | 636 | 317,638 | 51.60 | May 2018 | Illumina + PacBio | [ |
| Sylvio X10/1 | I | 38.59 | 27,019 | 2307 | 51.20 | October 2012 | Roche 454 + Illumina | [ |
| Berenice | II | 40.80 | 934 | 148,957 | 51.20 | June 2020 | Illumina + Nanopore | [ |
| Y | II | 39.34 | 10,127 | 11,782 | 51.43 | October 2017 | Illumina | [ |
| 231 | III | 35.36 | 8469 | 14,202 | 48.60 | January 2018 | Illumina | [ |
| Bug2148 | V | 55.22 | 934 | 196,760 | 51.63 | October 2017 | PacBio | [ |
| CL | VI | 65.00 | 7764 | 73,547 | 39.80 | November 2018 | Roche 454 | [ |
| TCC | VI | 87.06 | 1236 | 264,196 | 51.70 | May 2018 | Illumina + PacBio | [ |
| CL Brener | VI | 89.94 | 32,746 | 14,669 | 51.70 | July 2005 | Sanger | [ |
| BNEL | VI | 32.53 | 41 | 870,934 | 43.94 | December 2015 | Sanger | [ |
| BEL | VI | 32.53 | 41 | 870,934 | 40.35 | December 2015 | Sanger | [ |
| --- | 38.65 | 23,154 | 2846 | 50.90 | October 2012 | Roche 454 + Illumina | [ |
Figure 2Transcription process of T. cruzi. RNA polymerase II produces polycistronic RNAs that are modified by trans-splicing and polyadenylation. The final mature mRNAs contain the Cap with the SL and the poly A tail. SL: spliced leader.
Figure 3Genome copy number of the most abundant multi-gene families of T. cruzi and the B7 strain of T. cruzi marinkellei. BNEL: CL Brener Non-Esmeraldo-like; BEL: CL Brener Esmeraldo-like; DGF-1: Dispersed Gene Family 1; GP63: Glycoprotein 63; MASP: Mucin-Associated Surface Proteins; RHS: Retrotransposon Hot Spot genes.
Figure 4First classification of TS members. Four groups were described according to their sequence similarity and functional properties. The structure and functions of each group are displayed as well as the known members with their host and parasite-stage in which they are expressed. BT: bloodstream trypomastigotes; A: amastigotes; MT: metacyclic trypomastigotes; E: epimastigotes.
Figure 5Classification of 2011 of TS members according to a sequence cluster analysis. Each group is defined by specific motifs. Logos of each Asp-box and canonical TS motifs are displayed. Adapted from Freitas, L. M. et al., 2011 [110].
Presence of at least one member of each TS group in different Trypanosoma species.
| TS Groups of | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Group I | Group II | Group III | Group IV | Group V | Group VI | Group VII | Group VIII | ||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
| ||||||
|
|
|
| |||||||
|
|
| ||||||||
|
|
| ||||||||
|
|
| ||||||||
|
|
| ||||||||
|
|
| ||||||||
|
|
| ||||||||
Figure 6TS group distribution in genomes of different strains of T. cruzi and B7 strain of T. cruzi marinkellei. The percentage of each TS group is displayed. BNEL: CL Brener Non-Esmeraldo-like; BEL: CL Brener Esmeraldo-like.