| Literature DB >> 21829751 |
Carlos A Flores-López1, Carlos A Machado.
Abstract
BACKGROUND: The genetic diversity of Trypanosoma cruzi, the etiological agent of Chagas disease, has been traditionally divided in two major groups, T. cruzi I and II, corresponding to discrete typing units TcI and TcII-VI under a recently proposed nomenclature. The two major groups of T. cruzi seem to differ in important biological characteristics, and are thus thought to represent a natural division relevant for epidemiological studies and development of prophylaxis. To understand the potential connection between the different manifestations of Chagas disease and variability of T. cruzi strains, it is essential to have a correct reconstruction of the evolutionary history of T. cruzi. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2011 PMID: 21829751 PMCID: PMC3149036 DOI: 10.1371/journal.pntd.0001272
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
The main Trypanosoma cruzi strains used in this study.
| Strain | DTU | Zymodeme | Isoenzyme type | 1999 nomenclature | New nomenclature |
| SO34 cl4 | I | Z1 | 20 | Tc I | TcI |
| SC13 | I | Z1 | ? | Tc I | TcI |
| EP 255 | IIa | Z3 | nd(27) | Tc II | TcIV |
| CBB cl3 | IIb | Z2 | 32 | Tc II | TcII |
| M6241 cl6 | IIc | Z2 | 35 | Tc II | TcIII |
| SO3 cl5 | IId | Z2 | 39 | Tc II | TcV |
| CL Brener (CL F11F5) | IIe | Z2 | 43 | Tc II | TcVI |
Discrete typing unit (DTU) [8];
[4];
[6];
[30];
[32].
List of loci included in this study.
| Locus ID | N | bp | Chromosome | Gene location in Chr. (strand), gene length | Predicted function | Topology |
| COII-ND1 | 48 | 1226 | Maxicircle (mtDNA) | N/A | Cytochrome oxidase subunit II-NADH dehydrogenase subunit 1 | A |
| Tc00.1047053503555.30 | 11 | 1290 | Chr 37 | 713055–714533 (−), 1479 bp | Trypanothione reductase (TR) | A |
| Tc00.1047053509153.90 | 40 | 1473 | Chr 27 | 718463–720028 (+), 1566 bp | Dihydrofolate reductase-thymidylate synthase (DHFR-TS) | A |
| HSP70 | 11 | 508 | Chr 32 | 699686–700540 (−), 855 bp | Intergenic region | A |
| Tc00.1047053503885.80 | 12 | 946 | Chr 26 | 163788–164993 (+), 1206 bp | Hypothetical protein, conserved | A |
| Tc00.1047053503891.50 | 10 | 813 | Chr 20 | 75320–76489 (−), 1170 bp | Hypothetical protein, conserved | A |
| Tc00.1047053503909.76 | 12 | 614 | 556434–557156 (+), 723 bp | Ferric reductase transmembrane protein, putative | B | |
| Tc00.1047053504013.40 | 25 | 805 | Chr 34 | 465693–466718 (−), 1026 bp | Serine acetyltransferase, putative | A |
| Tc00.1047053504045.100 | 10 | 886 | Chr 40 | 1854961–1856415 (−), 1455 bp | Hypothetical protein, conserved | A |
| Tc00.1047053504057.80 | 10 | 858 | Chr 34 | 417310–418677 (−), 1368 bp | Hypothetical protein, conserved | D |
| Tc00.1047053504059.20 | 13 | 896 | Chr 14 | 465730–467526 (−), 1797 bp | Endomembrane protein, putative | A |
| Tc00.1047053506247.200 | 10 | 920 | Chr 37 | 133811–136708 (+), 2898 bp | Beta-adaptin, putative | A |
| Tc00.1047053506525.150 | 13 | 821 | Chr 40 | 593462–594415 (+), 954 bp | Hypothetical protein, conserved | A |
| Tc00.1047053506529.310 | 27 | 727 | Chr 6 | 97318–98676 (−), 1359 bp | Hypothetical protein, conserved | C |
| Tc00.1047053506739.20 | 11 | 810 | Chr 3 | 25655–27589 (−), 1935 bp | Hypothetical protein, conserved | F |
| Tc00.1047053507801.70 | 11 | 677 | Chr 23 | 535126–535959 (+), 834 bp | Protein kinase, putative | A |
| Tc00.1047053508153.540 | 13 | 774 | Chr 36 | 699363–700391 (+), 1029 bp | Hypothetical protein, conserved | A |
| Tc00.1047053508461.80 | 20 | 838 | Chr 39 | 1187987–1189126 (−), 1140 bp | Prostaglandin F2alpha synthase | G |
| Tc00.1047053508719.70 | 24 | 709 | Chr 37 | 375185–376402 (+), 1218 bp | Hypothetical protein, conserved | A |
| Tc00.1047053509007.30 | 13 | 815 | Chr 31 | 573767–574690 (+), 924 bp | Hypothetical protein, conserved | E |
| Tc00.1047053509105.70 | 24 | 897 | Chr 37 | 769449–770786 (−), 1338 bp | Thiol-dependent reductase 1, putative | A |
| Tc00.1047053509561.20 | 23 | 880 | Chr 12 | 285842–287581 (−), 1740 bp | Flagellum-adhesion glycoprotein, putative | A |
| Tc00.1047053509967.50 | 11 | 595 | Chr 10 | 184622–185329 (+), 708vbp | Hypothetical protein, conserved | A |
| Tc00.1047053510101.480 | 12 | 829 | Chr 27 | 190063–191427 (−), 1365 bp | Hypothetical protein, conserved | B |
| Tc00.1047053510123.24 | 12 | 880 | Chr 20 | 372476–373429 (+), 954 bp | Hypothetical protein, conserved | A |
| Tc00.1047053510131.90 | 12 | 936 | Chr 30 | 340360–342003 (+), 1644 bp | Hypothetical protein, conserved | A |
| Tc00.1047053510765.50 | 13 | 817 | Chr 39 | 1780396–1781763 (+), 1368 bp | Hypothetical protein, conserved | C |
| Tc00.1047053510877.190 | 8 | 453 | Chr 34 | 493531–494328 (−), 798 bp | Hypothetical protein, conserved | A |
| Tc00.1047053510889.210 | 25 | 693 | Chr 6 | 154383–156290 (−), 1908 bp | Hypothetical protein, conserved | A |
| Tc00.1047053510889.310 | 23 | 763 | Chr 6 | 193929–196025 (+), 2097 bp | Hypothetical protein, conserved | A |
| Tc00.1047053511153.124 | 12 | 513 | Chr 27 | 412720–413271 (+), 552 bp | Hypothetical protein, conserved | A |
| Tc00.1047053511529.200 | 13 | 667 | Chr 35 | 170438–171232 (−), 795 bp | Hypothetical protein, conserved | A |
N: number of haplotypes sequenced.
bp: sequenced region (in base pairs).
[52].
See Figure 1.
Figure 1Phylogenetic topologies obtained from the 32 analyzed loci.
Number on top of each topology represents the number of times that particular topology was observed (Table 2). All internal branches shown had bootstrap support values >70%. The topologies are depicted with respect to the classification system that divides T. cruzi in two major lineages [30], T. cruzi I (blue) and T. cruzi II (red), and the six major DTUs are labeled. Topology H is consistent with the current classification, and represents a history of divergence in which T. cruzi I and II are reciprocally monophyletic.
Figure 2Maximum likelihood tree of concatenated data set.
Data set consists of 31 nuclear loci and 1 mitochondrial region (COII-ND1), totaling 26,329 nucleotides per strain. Numbers above and below branches are Bootstrap (from ML analyses) and Bayesian support values, respectively. Taxon names represent the six major DTUs. Scale bar in number of substitutions per site.
Results from the selection, distance and midpoint rooting analyses.
| Locus ID | N | bp | % sites ω>1 | ω | CL Brener ω | Kimura | Ks | Midpoint rooting |
| COII-ND1 | 48 | 1226 | 0 | NA | Only 1 copy | 0.095 | 0.3677 | A |
| Tc00.1047053503555.30 | 11 | 1290 | 0 | NA | 0.08 | 0.016 | 0.0520 | A |
| Tc00.1047053509153.90 | 40 | 1473 | 0.07 | 1.09 | 0.04 | 0.014 | 0.0403 | A |
| Hsp70 | 11 | 508 | NA | NA | NA | 0.053 | NA | A |
| Tc00.1047053503885.80 | 12 | 945 | 0.03 | 9.07 | 1.47 | 0.038 | 0.0216 | A |
| Tc00.1047053503891.50 | 10 | 810 | 0.06 | 6.17 | 1.54 | 0.053 | 0.0268 | A |
| Tc00.1047053503909.76 | 12 | 612 | 0.47 | 1.09 | 1.27 | 0.023 | 0.0231 | B |
| Tc00.1047053504013.40 | 25 | 804 | 0 | NA | 0.37 | 0.029 | 0.0451 | A |
| Tc00.1047053504045.100 | 10 | 885 | 0 | NA | Only 1 copy | 0.019 | 0.0428 | A |
| Tc00.1047053504057.80 | 10 | 855 | 0 | NA | Only 1 copy | 0.015 | 0.0150 | ND |
| Tc00.1047053504059.20 | 13 | 894 | 0.02 | 2.88 | 0.74 | 0.019 | 0.0316 | ND |
| Tc00.1047053506247.200 | 10 | 918 | 0.03 | 2.67 | Only 1 copy | 0.018 | 0.0080 | ND |
| Tc00.1047053506525.150 | 13 | 819 | 0 | NA | 2.06 | 0.019 | 0.0049 | A |
| Tc00.1047053506529.310 | 27 | 726 | 0.01 | 13.97 | 0.42 | 0.028 | 0.0501 | A |
| Tc00.1047053506739.20 | 11 | 807 | 0.33 | 1.41 | Only 1 copy | 0.029 | 0.0120 | B |
| Tc00.1047053507801.70 | 11 | 675 | 0.03 | 4.79 | ∞ | 0.021 | 0.0002 | ND |
| Tc00.1047053508153.540 | 13 | 786 | 0.02 | 3.41 | 6.35 | 0.030 | 0.0036 | H |
| Tc00.1047053508461.80 | 20 | 699 | 0 | NA | - | 0.017 | 0.0388 | G |
| Tc00.1047053508719.70 | 24 | 708 | 0.19 | 1.16 | 0.33 | 0.020 | 0.0292 | H |
| Tc00.1047053509007.30 | 13 | 813 | 0.58 | 1.25 | 1.59 | 0.028 | 0.0166 | E |
| Tc00.1047053509105.70 | 24 | 849 | 0.07 | 4.07 | 0.34 | 0.037 | 0.0531 | H |
| Tc00.1047053509561.20 | 23 | 879 | 0.11 | 5.62 | 0.95 | 0.045 | 0.0318 | A |
| Tc00.1047053509967.50 | 11 | 591 | 0 | NA | 2.22 | 0.024 | 0.0103 | C |
| Tc00.1047053510101.480 | 12 | 828 | 0.01 | 3.57 | 2.19 | 0.025 | 0.0109 | G |
| Tc00.1047053510123.24 | 12 | 879 | 0.09 | 3.21 | 3.76 | 0.037 | 0.0098 | A |
| Tc00.1047053510131.90 | 12 | 933 | 0.01 | 2.54 | 2.81 | 0.022 | 0.0051 | B |
| Tc00.1047053510765.50 | 13 | 813 | 0.29 | 2.00 | 1.58 | 0.025 | 0.0145 | A |
| Tc00.1047053510877.190 | 8 | 453 | 0.49 | 1.58 | Only 1 copy | 0.048 | 0.014 | A |
| Tc00.1047053510889.210 | 25 | 693 | 0.02 | 5.96 | 1.01 | 0.030 | 0.0234 | A |
| Tc00.1047053510889.310 | 23 | 762 | 0.16 | 2.13 | 0.82 | 0.022 | 0.0343 | A |
| Tc00.1047053511153.124 | 12 | 510 | 0.29 | 1.69 | 5.32 | 0.034 | 0.0063 | C |
| Tc00.1047053511529.200 | 13 | 666 | 0.008 | 20.94 | 3.51 | 0.035 | 0.0089 | H |
N: number of strains (haplotypes) sequenced. bp: number of aligned nucleotides used in the PAML analyses.
The percent of sites with ω = dN/dS>1. ω estimated from the M8 model implemented in PAML. NA: Non-applicable, non-coding intergenic region.
Average ω (dN/dS) for sites with dN/dS>1. NA: Non-applicable, since no sites had dN/dS>1.
dN/dS estimated for the two haplotypes of CL Brener using PAML's codeml program with the pairwise distance estimation option (runmode = −2).
Estimate of the % corrected distance for all sites (Kimura 2-parameter) or for synonymous sites only (Ks) between a strain of TcI (SC13) and a strain of TcII (CBB cl3), corresponding to the largest genetic distance within T. cruzi. NA: Non-applicable, non-coding intergenic region.
The topology obtained with midpoint rooting (See Figure 1 for Topology definitions). ND (Non Described topology): the topology obtained was different from the topologies described in Figure 1.
HSP70 is an intergenic region, thus selection tests were not conducted.
The midpoint rooting topology was different from the topology reconstructed with an outgroup (Table 2).
*: Only significant for M8 vs M8a (p ≤ 0.05).
**: Significant for M8 vs M8a (p ≤ 0.01), M1 vs M2 (p ≤ 0.05), and M7 vs M8 (p ≤ 0.05).
***: p value ≤0.0001 in all three tests (M1 vs M2, M7 vs M8, M8 vs M8a).
Bayesian estimates of divergence time (in mya) for different T. cruzi lineages.
| Nuclear loci (20 loci) | |||||||
| Clockmodel | Tryps |
| TcI | TcI, TcIII-VI | TcII-Hybrids | TcIII-Hybrids | PosteriorLikelihood |
| Strict | 6.23(4.7–7.7) | 1.36(1–1.7) | 0.15(0.09–0.2) | 1.11(0.8–1.4) | 0.49(0.3–0.6) | 0.49(0.3–0.6) | −57232.7477 |
| Relaxedlognormal | 8.3(3.8–13.8) | 2.18(0.9–3.7) | 0.25(0.08–0.5) | 1.69(0.77–2.9) | 0.8(0.3–1.4) | 0.73(0.3–1.3) | −57183.9809 |
Times to the most recent common ancestor (tMRCA) are shown in mya. In parentheses are 95% HPD (highest posterior density) intervals.
tMRCA of T. cruzi and its two outgroups (T. c. marinkellei, T. verspertilionis).
tMRCA of extant T. cruzi lineages.
tMRCA of TcI (nuclear data: SO34, SC13; mtDNA: TEH cl2, CEPA EP, Vin C6, X10 cl1, SABP3, A80, A92, MA-V, OPS21 cl11, CUTIA cl1, 133 79 cl7, V121, 26 79, CUICA cl1, SO34 cl4, P209 cl1, 85/818, P0AC, Esquilo cl1, SC13).
tMRCA of strains SO34, SC13, CL35, EP225, CLA39-Haplotype1 and CL_Brener-Haplotype1.
tMRCA of strains Florida C16, CANIII, M6241, CM 17, EP 255, 86-1, SO3, EPP, PSC-O, Tulahuen, CL F11F5, VM V4, P63, 86/2036, P251, X9/3, XII0/8 and XI09/2.
tMRCA of TcII or TcIII and the respective closest haplotypes from both hybrid DTUs (TcV, TcVI).
tMRCA of strains Esmeraldo, X-300, CBB, MCV, MSC2, TU18, MVB.
The Relaxed Lognormal clock model for the mitochondrial data set did not converge even after combining 10 independent runs in Tracer. Therefore, the estimates from these analyses are not reliable and not shown here.
Figure 3Divergence times for main DTU clades of T. cruzi using nuclear loci with the relaxed clock model.
Data set consists of an alignment of 22 concatenated nuclear loci for which the molecular clock was not rejected (Table S4), and that had a homolog in T. brucei. Taxon names represent the six major DTUs. Scale bar in millions of years ago (mya).