| Literature DB >> 30964924 |
Lorena Becco1, Pablo Smircich1,2, Beatriz Garat1.
Abstract
Trypanosoma cruzi, the protozoan parasite that causes Chagas' disease, exhibits peculiar biological features. Among them, the presence of a unique mitochondrion is remarkable. Even though the mitochondrial DNA constitutes up to 25% of total cellular DNA, the structure and functionality of the mitochondrion are dependent on the expression of the nuclear genome. As in other eukaryotes, specific peptide signals have been proposed to drive the mitochondrial localization of a subset of trypanosomatid proteins. However, there are mitochondrial proteins encoded in the nuclear genome that lack of a peptide signal. In other eukaryotes, alternative protein targeting to subcellular organelles via mRNA localization has also been recognized and specific mRNA localization towards the mitochondria has been described. With the aim of seeking for mitochondrial localization signals in T. cruzi, we developed a strategy to build a comprehensive database of nuclear genes encoding predicted mitochondrial proteins (MiNT) in the TriTryps (T. cruzi, T. brucei and L. major). We found that approximately 15% of their nuclear genome encodes mitochondrial products. In T. cruzi the MiNT database reaches 1438 genes and a conserved peptide signal, M(L/F) R (R/S) SS, named TryM-TaPe is found in 60% of these genes, suggesting that the canonical mRNA guidance mechanism is present. In addition, the search for compositional signals in the transcripts of T. cruzi MiNT genes produce a list, being worth to note a conserved non-translated element represented by the consensus sequence DARRVSG. Taking into account its reported interaction with the T. brucei TRRM3 protein which is enriched in the mitochondrial membrane fraction, we here suggest a putative zip code role for this element. Globally, here we provide an inventory of the mitochondrial proteins in T. cruzi and give evidence for the existence of both peptide and mRNA signals specific to nuclear encoded mitochondrial proteins.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30964924 PMCID: PMC6456187 DOI: 10.1371/journal.pone.0215160
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Schematic representation of the in-silico strategy to build a database of the nuclear genes encoding mitochondrial proteins in T. cruzi.
The genomes of the mentioned trypanosomes were interrogated under selected categories (annotated as mitochondrial, GO and Text). For each, the output number of genes is shown underneath in brackets. The braces point to the number of genes obtained by the union of the involved outputs. When performed on the T. cruzi genome, the search yields only 474 genes. The same strategy in T. brucei genome, due to the state of the annotation, allowed to define a database containing 1501 nuclear genes encoding mitochondrial proteins. By comparing this later with the T. cruzi genome, a database of 1438 nuclear genes encoding mitochondrial proteins could be obtained.
Fig 2Analysis of MiNT vs No-MiNT CDS parameters.
Box-plot with whiskers (percentile 1–99%) of (A) G+C Content ratio (p-value 0.4321), (B) G+C in the third codon position ratio (p-value < 0.002), (C) MELP, Measure Independent of Length and Composition -based Expression Level Predictor, (p-value < 0.001) and (D) MFE (minimum free energy)/Length ratio (p-value < 0.001).
Fig 3Analysis of MiNT vs No-MiNT UTRs parameters in epimastigote stage.
Box-plot with whiskers (percentile 1–99%) of (A) Length; (B) G+C Content ratio (C) MFE (minimum free energy)/Length ratio. Multiple comparisons amongst groups were performed by Dunn’s multiple comparison test and differences were seen by comparing mean ranks.
Fig 4Comparison of expression data for MiNT and No-MiNT genes in T. cruzi life cycle.
(A) Transcriptome data obtained from the microarray approach of Minning, et al. [40] as a ratio of the particular stage to an equal mixture of all four life cycle stages. (B) Transcriptome data obtained from RNA-Seq from Smircich, et al. [25] as number of reads per kilobase nRPK. (C) Ribosome footprinting data from RNA-Seq from Smircich, et al. [25] as number of footprints per transcript (translational efficiency). Parasite stages: Ama = amastigote; Trypo = trypomastigote; Epi = epimastigote; Meta = Metacyclic.
Fig 5TryM-TaPe definition strategy.
Mitochondrial targeting sequences available in [44] were used to obtain a preliminary signal (A) that was then used to search among the mitochondrial annotated proteins. This later search output was used as an input to obtain the final signal named as TryM-TaPe (B).
Fig 6Venn diagram of the different strategies used to define the subset of nuclear encoded mitochondrial protein carrying a signal peptide in T. cruzi.
The signal peptide predictors TargetP [16] and PredSL [37] predictors with the default parameters selecting non-plant sequence, and FIMO using TryM-TaPe were used on the T. cruzi MiNT database.
Enriched motifs found in MiNT-NoMTS database.
| MiNT-NoMTS vs No-MiNT | ||||||
|---|---|---|---|---|---|---|
| Motif 1 | MAAAR | 4.8e-008 | Tbg972.9.5210 | polyadenylate-binding protein 1 | TcCLB.506885.70 | polyadenylate-binding protein 1, putative |
| LmjF.35.4130 | polyadenylate-binding protein 2 | TcCLB.506885.70 | polyadenylate-binding protein 1, putative | |||
| Motif 2 | TTGTKATT | 4.0e-009 | Tbg972.7.6230 | Double RNA binding domain protein 12 | TcCLB.506825.10 | Double RNA binding domain protein 12 |
| Motif 3 | TAGAATGG | 2.6e-008 | Tbg972.3.3970 | Triple RNA binding domain protein 3 | TcCLB.510149.140 | Triple RNA binding domain protein 3 |
| Motif 4 | AGAGAGGT | 9.3e-010 | Tbg972.6.2300 | RNA-binding protein, putative | TcCLB.509965.180 | RNA-binding protein, putative |
| Motif 5 | AAMARA | 2.5e-020 | Tbg972.9.5210 | polyadenylate-binding protein 1 | TcCLB.506885.70 | polyadenylate-binding protein 1, putative |
| LmjF.35.4130 | polyadenylate-binding protein 2 | TcCLB.506885.70 | polyadenylate-binding protein 1, putative | |||
| Motif 6 | GDAAA | 5.8e-014 | Tbg972.9.5210 | polyadenylate-binding protein 1 | TcCLB.506885.70 | polyadenylate-binding protein 1, putative |
| Motif 7 | TTGTYGTT | 2.4e-008 | Tbg972.6.2300 | RNA-binding protein, putative | TcCLB.509965.180 | RNA-binding protein, putative |
| Motif 8 | HGTAG | 5.5e-008 | Tbg972.11.17950 | RNA-binding protein, putative | TcCLB.507037.20 | RNA-binding protein, putative |
| Motif 9 | AGAW | 1.1e-012 | Tbg972.11.17950 | RNA-binding protein, putative | TcCLB.507037.20 | RNA-binding protein, putative |
| Motif 10 | TTMTTW | 7.2e-011 | Tbg972.9.4840 | Double RNA binding domain protein 3 | TcCLB.506649.80 | Double RNA binding domain protein 3 |
| Tbg972.7.6230 | Double RNA binding domain protein 12 | TcCLB.506825.10 | Double RNA binding domain protein 12 | |||
| Motif 11 | ARRGGG | 2.0e-019 | Tbg972.3.3970 | Triple RNA binding domain protein 3 | TcCLB.510149.140 | Triple RNA binding domain protein 3 |
| Motif 12 | WAGG | 3.8e-018 | Tbg972.3.3970 | Triple RNA binding domain protein 3 | TcCLB.510149.140 | Triple RNA binding domain protein 3 |
| Motif 13 | GAAGCCC | 9.8e-009 | Tbg972.3.3970 | Triple RNA binding domain protein 3 | TcCLB.510149.140 | Triple RNA binding domain protein 3 |
Enriched motif found in MiNT-MTS database.
| MiNT-MTS vs No-MiNT | ||||||
|---|---|---|---|---|---|---|
| 3'UTR | ||||||
| Motif | Consensus Sequence | P-value | TomTom Result | Product Description | Ortholog gene | Product Description |
| Motif 14 | MAAA | 6.6e-008 | Tbg972.9.5210 | polyadenylate-binding protein 1 | TcCLB.506885.70 | polyadenylate-binding protein 1, putative |
| LmjF.35.4130 | polyadenylate-binding protein 2 | TcCLB.506885.70 | polyadenylate-binding protein 1, putative | |||
| Motif 15 | AARRA | 1.4e-015 | Tbg972.9.5210 | polyadenylate-binding protein 1 | TcCLB.506885.70 | polyadenylate-binding protein 1, putative |
| LmjF.35.4130 | polyadenylate-binding protein 2 | TcCLB.506885.70 | polyadenylate-binding protein 1, putative | |||
| Tbg972.3.3970 | Triple RNA binding domain protein 3 | TcCLB.510149.140 | Triple RNA binding domain protein 3 | |||
| Motif 16 | CTTWTT | 6.7e-014 | Tbg972.9.4840 | Double RNA binding domain protein 3 | TcCLB.506649.80 | Double RNA binding domain protein 3 |
| Motif 17 | GAACGCCT | 1.2e-007 | LmjF.35.2550 | Double RNA binding domain protein 9 | TcCLB.510747.80 | Double RNA binding domain protein 9 |