| Literature DB >> 34468164 |
Santiago Chávez1,2,3, Michael D Urbaniak4, Corinna Benz4, Pablo Smircich1,2, Beatriz Garat1, José R Sotelo-Silveira3,5, María Ana Duhagon1,2.
Abstract
Trypanosoma cruzi is the etiological agent for Chagas disease, a neglected parasitic disease in Latin America. Gene transcription control governs the eukaryotic cell replication but is absent in trypanosomatids; thus, it must be replaced by posttranscriptional regulatory events. We investigated the entrance into the T. cruzi replicative cycle using ribosome profiling and proteomics on G1/S epimastigote cultures synchronized with hydroxyurea. We identified 1,784 translationally regulated genes (change > 2, false-discovery rate [FDR] < 0.05) and 653 differentially expressed proteins (change > 1.5, FDR < 0.05), respectively. A major translational remodeling accompanied by an extensive proteome change is found, while the transcriptome remains largely unperturbed at the replicative entrance of the cell cycle. The differentially expressed genes comprise specific cell cycle processes, confirming previous findings while revealing candidate cell cycle regulators that undergo previously unnoticed translational regulation. Clusters of genes showing a coordinated regulation at translation and protein abundance share related biological functions such as cytoskeleton organization and mitochondrial metabolism; thus, they may represent posttranscriptional regulons. The translatome and proteome of the coregulated clusters change in both coupled and uncoupled directions, suggesting that complex cross talk between the two processes is required to achieve adequate protein levels of different regulons. This is the first simultaneous assessment of the transcriptome, translatome, and proteome of trypanosomatids, which represent a paradigm for the absence of transcriptional control. The findings suggest that gene expression chronology along the T. cruzi cell cycle is controlled mainly by translatome and proteome changes coordinated using different mechanisms for specific gene groups. IMPORTANCE Trypanosoma cruzi is an ancient eukaryotic unicellular parasite causing Chagas disease, a potentially life-threatening illness that affects 6 to 7 million people, mostly in Latin America. The antiparasitic treatments for the disease have incomplete efficacy and adverse reactions; thus, improved drugs are needed. We study the mechanisms governing the replication of the parasite, aiming to find differences with the human host, valuable for the development of parasite-specific antiproliferative drugs. Transcriptional regulation is essential for replication in most eukaryotes, but in trypanosomatids, it must be replaced by subsequent gene regulation steps since they lack transcription initiation control. We identified the genome-wide remodeling of mRNA translation and protein abundance during the entrance to the replicative phase of the cell cycle. We found that translation is strongly regulated, causing variation in protein levels of specific cell cycle processes, representing the first simultaneous study of the translatome and proteome in trypanosomatids.Entities:
Keywords: Chagas’ disease; Trypanosoma; Trypanosoma cruzi; cell cycle; cell proliferation; genomics; mass spectrometry; posttranscription; proteomics; regulon; ribosome profiling; translational control
Mesh:
Substances:
Year: 2021 PMID: 34468164 PMCID: PMC8550152 DOI: 10.1128/mSphere.00366-21
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
FIG 1Ribosome profiling data set. (A) A fluorescence-activated cell sorting analysis of DNA content staining with propidium iodide was carried out for HU-synchronized T. cruzi populations. The distributions of G1 (2C cells, white), S (2-4C cells, gray), and G2/M (4C cells, black) are presented for an asynchronous parasite culture (Asyn), a G1-enriched population (0 h post-HU), and an S-phase-enriched population (6 h post-HU). (B) Principal-component analysis of the gene expression values for the replicates of G1-enriched population (gray squares) and S-enriched population (black diamonds). (C) Volcano plot showing the distribution of FDR values for each gene versus the fold change in expression. Red dots are genes classified as DEGs (FC > 2, FDR < 0.05, nRFPs > 40). (D) Heatmap representing the 50 up- and downregulated genes (log2 nRFP values are presented). (E) Observed (obs)-to-expected (exp) ratio of the 5′-end footprint mapping distribution in the three reading frames, analyzed for the ribosome profiling (R) and the transcriptome (T). (F) Average mapping position was taken and counted among coding (CDS, dark gray) and noncoding (non-CDS, light gray) sequences for the ribosome profiling (R) and the transcriptome (T).
FIG 2Proteomic data set. (A) A fluorescence-activated cell sorting analysis of DNA content staining with propidium iodide was carried out for HU-synchronized T. cruzi populations. The distributions of G1 (2C cells, white), S (2-4C cells, gray), and G2/M (4C cells, black) are presented for an asynchronous parasite culture (Asyn), a G1-enriched population (0 h post-HU), and an S-phase-enriched population (6 h post-HU). (B) Principal-component analysis of the LFQ values for the duplicates of G1-enriched population (gray squares) and S-enriched population (black diamonds). (C) Volcano plot showing the distribution of FDR values for each gene versus the fold change in expression. Red dots are genes classified as DEGs. (D) Heatmap representing the top 50 up- and downregulated genes (transformed log2 LFQ values are presented).
FIG 3Data set comparisons. (A) The threshold for detection was 15 nCounts for both the transcriptome (T, violet) and translatome (R, red), and at least 1 unique peptide for the proteome (P, blue). (B.) The histogram presents the mRNA levels for the set of detected genes in each data set. (C) Dot plots for pairwise gene expression comparisons of the two data sets. R values for Pearson correlations for the comparisons are presented above the arrows connecting the data sets analyzed.
FIG 4Correlations of S/G1 changes. Log2 fold change comparisons between the 3 data sets. Transcriptome data set is from the work of Chavez et al., 2017 (44). For this analysis, we included genes with 15 normalized reads for the sequencing studies (transcriptome [T] and translatome [R]) and at least 1 valid LFQ value for one replicate of each of the two cell cycle phases studied for the proteome (P). The number of genes analyzed in each diagram resulting from the threshold mentioned above is presented at the bottom. Fold changes were taken from average gene expression for the replicates for both R and P. Genes displaying changes are colored as described below the plots; the thresholds considered are those used to define DEGs in further analysis (1.5-fold for T and P and 2-fold for R). (A) Translatome versus transcriptome. (B) Proteome versus transcriptome. (C) Translatome versus proteome.
Enriched Gene Ontology terms of G1/S DEGs
| G1-enriched GO terms | S-enriched GO terms | ||||||
|---|---|---|---|---|---|---|---|
| Name | No. of genes | FE | Name | No. of genes | FE | ||
|
| |||||||
| Monosaccharide binding (MF) | 3 | 56.6 | 1.8E−03 | DNA binding (MF) | 13 | 6.0 | 3.0E−05 |
| Ligase activity, forming carbon-nitrogen bonds (MF) | 4 | 17.4 | 9.8E−03 |
| 8 | 11.6 | 1.4E−04 |
| Nucleoside phosphate catabolic process (BP) | 4 | 18.9 | 1.7E−02 | Chromatin (CC) | 5 | 11.8 | 6.4E−03 |
|
| 7 | 5.9 | 2.1E−02 | Nucleic acid binding (MF) | 21 | 2.5 | 6.9E−03 |
|
| 11 | 4.0 | 2.2E−02 | Nucleus (CC) | 32 | 1.9 | 8.1E−03 |
|
| 5 | 10.1 | 4.0E−02 | Protein folding (BP) | 8 | 6.0 | 2.9E−02 |
| Regulation of macromolecule metabolic process (BP) | 11 | 3.5 | 7.8E−02 |
| 4 | 12.2 | 3.2E−02 |
| Chromosomal part (CC) | 6 | 6.0 | 5.0E−02 | ||||
| Kinetoplast (CC) | 9 | 3.8 | 5.7E−02 | ||||
|
| |||||||
|
| 87 | 5.3 | 5.9E−44 | Cytoskeleton (CC) | 113 | 2.5 | 1.8E−19 |
|
| 102 | 3.4 | 2.0E−30 | ATP binding (MF) | 107 | 2.0 | 1.8E−11 |
|
| 97 | 3.5 | 6.9E−30 | Nucleoside triphosphatase activity (MF) | 72 | 2.4 | 1.5E−10 |
| Biosynthetic process (BP) | 167 | 2.2 | 1.1E−24 | Ciliary basal body (CC) | 48 | 3.0 | 1.7E−10 |
| Protein-containing complex | 192 | 1.9 | 1.1E−19 | Microtubule organizing center (CC) | 48 | 2.9 | 8.3E−10 |
| Vacuole (CC) | 42 | 3.1 | 5.9E−10 | Microtubule motor activity (MF) | 29 | 4.2 | 1.4E−09 |
| Oxidoreductase activity (MF) | 65 | 2.3 | 2.8E−09 | Chromosome segregation (BP) | 15 | 6.5 | 2.1E−07 |
| Mitochondrion (CC) | 143 | 1.5 | 4.1E−07 | Protein binding (MF) | 120 | 1.7 | 3.7E−07 |
|
| 72 | 2.0 | 9.6E−07 | Small-molecule binding (MF) | 119 | 1.7 | 3.9E−07 |
|
| 38 | 2.5 | 3.9E−05 | DNA helicase activity (MF) | 12 | 7.2 | 1.2E−06 |
|
| 21 | 3.3 | 2.8E−04 | DNA repair (BP) | 24 | 3.4 | 2.9E−05 |
| Glucose metabolic process (BP) | 8 | 6.4 | 2.9E−03 | Cell projection (CC) | 130 | 1.5 | 8.8E−05 |
| Cytochrome complex (CC) | 6 | 7.5 | 3.7E−03 | Telomere organization (BP) | 7 | 9.5 | 2.4E−04 |
| Cofactor binding (MF) | 30 | 2.2 | 1.1E−02 | Kinetochore (CC) | 10 | 5.7 | 3.7E−04 |
| Proteasome complex (CC) | 12 | 3.2 | 4.4E−02 | Axoneme (CC) | 40 | 2.0 | 3.2E−03 |
| Nucleotide catabolic process (BP) | 10 | 4.0 | 5.5E−02 | DNA recombination (BP) | 9 | 5.4 | 6.2E−03 |
| NADP binding (MF) | 7 | 5.1 | 5.9E−02 |
| 20 | 2.5 | 2.0E−02 |
|
| 69 | 1.5 | 7.3E−02 |
| 15 | 2.8 | 8.5E−02 |
|
| |||||||
|
| 22 | 3.22 | 3.6E−04 |
| 12 | 7.3 | 3.7E−06 |
|
| 14 | 3.65 | 5.3E−03 | DNA packaging (BP) | 9 | 7.8 | 3.9E−04 |
|
| 18 | 2.81 | 1.2E−02 | Small-molecule metabolic process (BP) | 41 | 2.2 | 9.2E−04 |
| Calmodulin binding (MF) | 4 | 15 | 1.9E−02 | Mitochondrion (CC) | 78 | 1.6 | 1.1E−03 |
| Translation elongation factor activity (MF) | 7 | 6.11 | 2.5E−02 | Chromatin assembly (BP) | 7 | 9.6 | 1.2E−03 |
| Entry into host cell (BP) | 3 | 28.13 | 2.8E−02 | Ciliary transition zone (CC) | 14 | 3.9 | 2.3E−03 |
| Nucleic acid binding (MF) | 34 | 1.86 | 4.7E−02 | Oxidoreductase activity (MF) | 33 | 2.3 | 2.3E−03 |
|
| 23 | 2.18 | 6.0E−02 | Nucleotide metabolic process (BP) | 19 | 3.2 | 3.5E−03 |
| Ciliary plasm (CC) | 67 | 1.7 | 4.4E−03 | ||||
| Coenzyme binding (MF) | 16 | 3.1 | 1.4E−02 | ||||
| Voltage-gated channel activity (MF) | 5 | 10.3 | 1.5E−02 | ||||
| Cofactor binding (MF) | 15 | 2.9 | 3.7E−02 | ||||
|
| 15 | 2.9 | 4.7E−02 | ||||
The top significantly enriched (adjusted P values < 0.01) terms are presented. The number of genes, the fold enrichment (FE), and the Bonferroni -adjusted P values (P-val) are indicated. Term redundancy was manually curated. The type of ontological term is presented next to the term name in parentheses as follows: MF, molecular function; BP, biological process; CC, cellular component. Repeated GO terms on the same cell cycle phase (bold italic) or across stages (gray shading) are highlighted.
FIG 5Expression profiles of selected cell cycle regulators. Heatmaps for transcriptome-translatome-proteome (T-R-P) log2 fold change of selected differentially expressed genes. (A) Genes coding for cyclins. (B) Genes coding for cdc2-related kinases (CRKs). (C) Genes coding for RNA binding proteins (RBPs). The white asterisk denotes the data set that displayed regulation. Gray shading implies fold change could not be calculated from the proteomic data. Black shading indicates that no protein was detected in any of the replicates.
FIG 6Putative sets of coregulated genes. (A) (Top) Pearson correlation clustering for a set of 2,757 genes with valid fold change values for both the translatome (R) and the proteome (P) data sets. (Middle) The number of genes comprising each cluster. (Bottom) Schematic representation of the location of the genes of each cluster on the fold change diagram presented in Fig. 4C. (B) Top 3 nonredundant gene ontology terms overrepresented in each cluster of genes. Bar heights represent fold enrichment values, and color represents statistical significance (Bonferroni-corrected P value) ranging from 3.5E−8 (green) to 7.7E−2 (red). *1, nucleoside triphosphatase activity. *2, small-molecule metabolic process.