| Literature DB >> 24828104 |
Raúl O Cosentino1, Fernán Agüero1.
Abstract
In Trypanosoma cruzi the isoprenoid and sterol biosynthesis pathways are validated targets for chemotherapeutic intervention. In this work we present a study of the genetic diversity observed in genes from these pathways. Using a number of bioinformatic strategies, we first identified genes that were missing and/or were truncated in the T. cruzi genome. Based on this analysis we obtained the complete sequence of the ortholog of the yeast ERG26 gene and identified a non-orthologous homolog of the yeast ERG25 gene (sterol methyl oxidase, SMO), and we propose that the orthologs of ERG25 have been lost in trypanosomes (but not in Leishmanias). Next, starting from a set of 16 T. cruzi strains representative of all extant evolutionary lineages, we amplified and sequenced ∼ 24 Kbp from 22 genes, identifying a total of 975 SNPs or fixed differences, of which 28% represent non-synonymous changes. We observed genes with a density of substitutions ranging from those close to the average (∼ 2.5/100 bp) to some showing a high number of changes (11.4/100 bp, for the putative lathosterol oxidase gene). All the genes of the pathway are under apparent purifying selection, but genes coding for the sterol C14-demethylase, the HMG-CoA synthase, and the HMG-CoA reductase have the lowest density of missense SNPs in the panel. Other genes (TcPMK, TcSMO-like) have a relatively high density of non-synonymous SNPs (2.5 and 1.9 every 100 bp, respectively). However, none of the non-synonymous changes identified affect a catalytic or ligand binding site residue. A comparative analysis of the corresponding genes from African trypanosomes and Leishmania shows similar levels of apparent selection for each gene. This information will be essential for future drug development studies focused on this pathway.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24828104 PMCID: PMC4020770 DOI: 10.1371/journal.pone.0096762
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Genes in the sterol biosynthesis pathway (SBP) of yeast.
| Gene ID/Locus | Molecular Function | EC Number | Pathway Order | Ortholog Group |
| ERG10/YPL028W | Acetoacetyl-CoA thiolase | 2.3.1.9 | 1 | OG4_10214 |
| ERG13/YML126C | 3-hydroxy-3-methylglutaryl-CoA (HMG-CoA) synthase | 2.3.3.10 | 2 | OG4_11016 |
| HMG1/2/YML075C, YLR450W | HMG-CoA reductase | 1.1.1.34 | 3 | OG4_11458 |
| ERG12/YMR208W | Mevalonate kinase | 2.7.1.36 | 4 | OG4_11698 |
| ERG8/YMR220W | Phosphomevalonate kinase | 2.7.4.2 | 5 | OG4_15366 |
| ERG19/YNR043W | Mevalonate pyrophosphate decarboxylase | 4.1.1.33 | 6 | OG4_11688 |
| ERG20/YJL167W | Farnesyl pyrophosphate synthetase | 2.5.1.10 | 7 | OG4_11009 |
| IDI1/YPL117C | Isopentenyl diphosphate isomerase (IPP isomerase) | 5.3.3.2 | 7' | OG4_12197 |
| ERG9/YHR190W | Farnesyl-diphosphate farnesyl transferase (squalene synthase) | 2.5.1.21 | 8 | OG4_13084 |
| ERG1/YGR175C | Squalene epoxidase | 1.14.99.7 | 9 | OG4_13490 |
| ERG7/YHR072W | Lanosterol synthase (oxidosqualene cyclase) | 5.4.99.7 | 10 | OG4_11381 |
| ERG11/YHR007C | Lanosterol 14-alpha-demethylase | 1.14.13.70 | 11 | OG4_12975 |
| ERG24/YNL280C | C-14 sterol reductase | 1.3.1.70 | 12 | OG4_12018 |
| ERG25/YGR060W | C-4 methyl sterol oxidase | 1.14.13.72 | 13 | OG4_13007 |
| ERG26/YGL001C | C-3 sterol dehydrogenase | 1.1.1.170 | 14 | OG4_12533 |
| ERG27/YLR100W | 3-keto sterol reductase | 1.1.1.270 | 15 | OG4_16167 |
| ERG6/YML008C | Delta(24)-sterol C-methyltransferase | 2.1.1.41 | 16 | OG4_13307 |
| ERG2/YMR202W | C-8 sterol isomerase | 5.-.-.- | 17 | OG4_14573 |
| ERG3/YLR056W | C-5 sterol desaturase (lathosterol oxidase) | 1.3.3.- | 18 | OG4_12421 |
| ERG5/YMR015C | C-22 sterol desaturase | 1.14.14.- | 19 | OG4_14688 |
| ERG4/YGL012W | C-24(28) sterol reductase | 1.3.1.71 | 20 | OG4_16908 |
The table lists the reference genes used in this work for mapping the corresponding T. cruzi genes. Information for this table was derived from the Saccharomyces Genome Database (SGD, yeastgenome.org). Ortholog group identifiers are from the OrthoMCL Database, version 4 (orthomcl.org).
Genes in the Trypanosoma cruzi sterol biosynthesis pathway.
| SBP Ortholog | T. cruzi Locus Identifiers(s) | Current annotation | Ortholog Group | In KEGG? | Gene Name |
| ERG10 | TcCLB.511003.60 | Hypothetical protein | OG4_10214 | Yes | TcACAT |
| ERG13 | TcCLB.511903.40, TcCLB.511071.50 | Hypothetical protein, conserved | OG4_11016 | No | TcHMGS |
| HMG1, HMG2 | TcCLB.509167.20, TcCLB.506831.40 | 3-hydroxy-3-methylglutaryl-CoA reductase, putative | OG4_11458 | Yes | TcHMGR |
| ERG12 | TcCLB.436521.9, TcCLB.509237.10 | Mevalonate kinase, putative | OG4_11698 | Yes | TcMK |
| ERG8 | TcCLB.508277.140, TcCLB.507913.20 | Phosphomevalonate kinase e-like protein, putative | OG4_15366 | Yes | TcPMK |
| ERG19 | TcCLB.507993.330, TcCLB.511281.40 | Diphosphomevalonate decarboxylase, putative | OG4_11688 | Yes | TcMVD |
| ERG20 | TcCLB.511823.70, TcCLB.508323.9 | Farnesyl pyrophosphate synthase, putative | OG4_11009 | Yes | TcFPPS |
| IDI1 | TcCLB.408799.19, TcCLB.510431.10 | Isopentenyl-diphosphate delta-isomerase, putative | OG4_14499 | Yes 6 | TcIDI |
| ERG9 | TcCLB.507897.20, TcCLB.508369.20 | Farnesyl transferase, putative; squalene synthase, putative | OG4_13084 | Yes | TcSQS |
| ERG1 | TcCLB.509589.20, TcCLB.503999.10 | Squalene monooxigenase, putative | OG4_13490 | Yes | TcSQLE |
| ERG7 | TcCLB.508175.70, TcCLB.506825.170 | Lanosterol synthase, putative | OG4_11381 | No | TcOSC |
| ERG11 | TcCLB.506297.260, TcCLB.510101.50 | Lanosterol 14- | OG4_12975 | Yes | Tc14DM, cCYP51 |
| ERG24 | TcCLB.507969.60, TcCLB.507129.30 | C-14 sterol reductase, putative | OG4_12018 | Yes | Tc14SR |
| – | TcCLB.511339.20, TcCLB.511895.69, TcCLB.509235.20 | C-5 sterol desaturase, putative | OG4_20087 | No | TcSMO-like |
| ERG26 | TcCLB.510873.10 | NAD(P)-dependent steroid dehydrogenase protein, putative | OG4_12533 | No | TcNDSDHL |
| ERG27 | – | – | – | No | – |
| ERG6 | TcCLB.504191.10, TcCLB.505683.10, TcCLB.510185.10 | Sterol 24-C-methyltransferase, putative | OG4_13307 | Yes | Tc24SMT |
| ERG2 | TcCLB.510329.90 | C-8 sterol isomerase, putative | OG4_14573 | Yes | Tc8SI |
| ERG3 | TcCLB.473111.10, TcCLB.507853.10 | Lathosterol oxidase, putative | OG4_12421 | No | TcSC5D |
| ERG5 | TcCLB.506945.190 | Cytochrome p450-like protein, putative | OG4_14688 | No | TcSC22D |
| ERG4 | TcCLB.506577.120, TcCLB.507709.90 | Sterol C-24 reductase, putative | OG4_16908 | Yes | TcSC24R |
The table lists T. cruzi genes mapped to either the corresponding KEGG maps, or the yeast SBPs (see Table 1). OrthoMCL identifiers are from the OrthoMCL Database version 4. T. cruzi gene identifiers are those currently available at the TriTrypDB resource. The fifth column shows whether the gene was mapped to the corresponding KEGG metabolic map at the time of this writing. The last column shows the nomenclature used in this work. Some of these gene names were previously used in the literature, and others are used here for the first time, based on the gene names of relevant orthologs. In the case of the TcIDI gene, the two listed alleles are truncated copies of the gene. The full-length gene is deposited in GenBank as AJ866772. While in the case of the TcNSDHL gene, the copy annotated by the genome project is truncated due to genome assembly problems. The full-length gene is deposited in GenBank as JN050853 (this work).
Figure 1Phylogenetic tree of selected ERG3/ERG25 orthologs.
Seleced orthologs from kinetoplastids, plants, vertebrates, invertebrates and fungi were aligned with t_coffee. A phylogenetic reconstruction was calculated using PhyML (LG model, bootstrap resampling with N = 1000). For clarity, highly similar genes/alleles were not included in the final tree. Organism abbreviations are: lbra = L. braziliensis; lmex = L. mexicana; linf = L. infantum; lmaj = L. major; tcru = T. cruzi; tviv = T. vivax; tcon = T. congolense; tbru = T. brucei; agam = Anopheles gambiae; aaeg = Aedes aegypti; cpip = Culex pipiens; hsap = Homo sapiens; calb = Candida albicans; scer = S. cerevisiae; spom = Schizosaccharomyces pombe; atha = A. thaliana; drer = Danio rerio; trub = Takifugu rubripes; tnig = Tetraodon nigroviridis.
Figure 2The T. cruzi isoprenoid biosynthesis pathway.
The figure shows the metabolic steps leading to farnesyl-diphosphate, from acetyl-CoA, and the corresponding yeast and T. cruzi enzymes that catalyze these steps. Gene names used are those in Tables 1 and 2.
Figure 3The T. cruzi sterol biosynthesis pathway.
The figure shows the metabolic steps leading from farnesyl-diphosphate to ergosterol (in yeast, and in T. cruzi epimastigotes) or to different 24-alkylsterols (in T. cruzi amastigotes [64]), and the corresponding yeast and T. cruzi enzymes that catalyze these steps. The two methyl groups at C4 are removed in two rounds of successive C4-oxidation, C4-decarboxylation and C3-ketoreduction (*). Gene names used are those in Tables 1 and 2. Unknown/hypothetical assignments are shown with question marks.
Comparative analysis of the quantity, density and type of SNPs identified in the sterol biosynthesis pathway of Trypanosoma cruzi.
| Polymorphisms (SNPs) | Length (bp) | ||||||||
| Synonymous (S) | Non synonymous (NS) | Ratio | Total | CDS | Resequenced | ||||
| Genes | No. (%) | density | No. (%) | density | NS/S | No. | bp | bp (%) | PCR Fragments |
| TcACAT | 35 (75.00) | 3.27 | 11 (25.00) | 1.09 | 0.33 | 48 | 1188 | 1100 (92.59) | 2 |
| TcHMGS | 37 (86.05) | 2.66 | 6 (13.95) | 0.43 | 0.16 | 43 | 1497 | 1390 (92.85) | 2 |
| TcHMGR | 27 (79.41) | 2.18 | 7 (20.59) | 0.56 | 0.26 | 34 | 1356 | 1240 (91.45) | 2 |
| TcMK | 31 (79.49) | 3.20 | 12 (30.77) | 1.24 | 0.39 | 50* | 1086 | 970 (89.32) | 2 |
| TcPMK | 35 (53.85) | 2.65 | 33 (50.77) | 2.50 | 0.94 | 69* | 1428 | 1320 (92.44) | 2 |
| TcMVD | 32 (62.75) | 3.08 | 19 (37.25) | 1.83 | 0.59 | 51 | 1140 | 1040 (91.23) | 2 |
| TcIDI | 25 (78.13) | 2.50 | 7 (21.88) | 0.70 | 0.28 | 32 | 1071 | 1000 (93.37) | 2 |
| TcFPPS | 22 (66.67) | 2.27 | 11 (33.33) | 1.13 | 0.50 | 36* | 1275 | 970 (76.08) | 2 |
| TcSQS | 32 (69.57) | 2.88 | 14 (30.43) | 1.26 | 0.44 | 46 | 1212 | 1110 (91.58) | 2 |
| TcSQLE | 50 (72.46) | 3.07 | 21 (30.43) | 1.29 | 0.42 | 73* | 1716 | 1630 (94.99) | 3 |
| TcOSC | 61 (66.30) | 2.53 | 31 (33.70) | 1.29 | 0.51 | 92 | 2706 | 2410 (89.06) | 4 |
| Tc14DM | 47 (87.04) | 3.46 | 7 (12.96) | 0.51 | 0.15 | 54 | 1443 | 1360 (94.25) | 2 |
| Tc14SR | 33 (63.46) | 2.58 | 19 (36.54) | 1.48 | 0.58 | 52 | 1371 | 1280 (93.36) | 2 |
| TcSMO-like | 14 (51.85) | 2.09 | 13 (48.15) | 1.94 | 0.93 | 27 | 837 | 670 (80.05) | 1 |
| TcNSDHL | 21 (70.00) | 1.75 | 9 (30.00) | 0.75 | 0.43 | 30 | 1221 | 1200 (98.28) | 2 |
| Tc24SMT | 17 (77.27) | 2.07 | 5 (22.73) | 0.61 | 0.29 | 22 | 1050 | 820 (78.10) | 1 |
| Tc8SI | 18 (78.26) | 3.16 | 5 (21.74) | 0.88 | 0.28 | 23 | 654 | 570 (87.16) | 1 |
| TcSC5D | 70 (89.74) | 9.72 | 11 (14.10) | 1.53 | 0.16 | 82* | 834 | 720 (86.33) | 1 |
| TcSC22D | 29 (69.05) | 2.09 | 13 (30.95) | 0.94 | 0.45 | 42 | 1518 | 1390 (91.57) | 2 |
| TcSC24R | 53 (75.71) | 3.79 | 17 (24.29) | 1.21 | 0.32 | 70 | 1467 | 1400 (95.43) | 2 |
|
| 692 (71.71) | 2.93 | 273 (28.29) | 1.16 | 0.39 | 975 | 26070 | 23590 (90.50) | 39 |
SNP counts marked with * indicate totals that do not result from the sum of synonymous + non-synonymous substitutions, in some cases because of the presence of two substitutions in the same codon, and in other cases, because some SNPs fall in the non-coding region of the gene.
Comparative analysis of the genetic diversity present in the sterol biosynthesis pathway of different kinetoplastids.
| dN/dS ( | |||
| Yeast gene, Std symbol | Af Tryps | Leish | Tcr |
| ERG10/ACAT | – | – |
|
| ERG13/HMGS | 0.057 | 0.048 | 0.054 |
| HMG1/HMGR | 0.065 | 0.052 | 0.057 |
| ERG12/MK | 0.077 | 0.076 | 0.077 |
| ERG8/PMK |
|
|
|
| ERG19/MVD | 0.067 | 0.072 | 0.059 |
| IDI1/IDI |
|
|
|
| ERG20/FPPS | 0.060 | 0.061 | 0.065 |
| ERG9/FDFT, SQS | 0.085 | 0.087 | 0.093 |
| ERG1/SQLE |
|
|
|
| ERG7/LSS, OSC |
|
|
|
| ERG11/14DM, CYP51 |
|
|
|
| ERG24/14SR, TM7SF2 |
|
|
|
| ERG25/SMO/SC4MOL | – |
| – |
| –/SMO-like | 0.080 | 0.082 | 0.092 |
| ERG26/NSDHL |
| 0.057 | 0.058 |
| ERG27/3KSR | – | – | – |
| ERG6/24SMT | 0.065 | 0.062 | 0.070 |
| ERG2/8SI |
| 0.100 | 0.097 |
| ERG3/SC5D | 0.069 | 0.059 | 0.060 |
| ERG5/SC22D | – |
|
|
| ERG4/SC24R | – | 0.097 | 0.070 |
| Average | 0.080 | 0.075 | 0.079 |
| Standard deviation | 0.028 | 0.028 | 0.026 |
The table shows the dN/dS (ω) ratio observed in the african trypanosome lineage (Af Tryps) (T. brucei brucei, T. brucei gambiense, T. evansi, T. congolense, T. vivax), in the Leishmania lineage (Leish) (L. major, L. infantum, L. braziliensis, and L. mexicana), and in the T. cruzi lineage (Tcr, sequences from TcI{TcVI DTUs). The values that are above (†) or under (*) a standard deviation from the average of each column are marked.
Figure 4Alignment of T. cruzi, T. brucei, M. tuberculosis and human Lanosterol 14-α demethylases, showing the non-synonymous changes identified in this work (red arrows).
Important residues either in Tc14DM or in the CYP51 family are noted [100], [101], as well as residues associated with resistance to azoles in C. albicans and U. necator [56], [57]. PS00086 is the Prosite Cytochrome 450 motif (cysteine heme-iron ligand signature).