| Literature DB >> 25054834 |
Nicolás Tomasini1, Juan José Lauthier1, Francisco José Ayala2, Michel Tibayrenc3, Patricio Diosque1.
Abstract
The model of predominant clonal evolution (PCE) proposed for micropathogens does not state that genetic exchange is totally absent, but rather, that it is too rare to break the prevalent PCE pattern. However, the actual impact of this "residual" genetic exchange should be evaluated. Multilocus Sequence Typing (MLST) is an excellent tool to explore the problem. Here, we compared online available MLST datasets for seven eukaryotic microbial pathogens: Trypanosoma cruzi, the Fusarium solani complex, Aspergillus fumigatus, Blastocystis subtype 3, the Leishmania donovani complex, Candida albicans and Candida glabrata. We first analyzed phylogenetic relationships among genotypes within each dataset. Then, we examined different measures of branch support and incongruence among loci as signs of genetic structure and levels of past recombination. The analyses allow us to identify three types of genetic structure. The first was characterized by trees with well-supported branches and low levels of incongruence suggesting well-structured populations and PCE. This was the case for the T. cruzi and F. solani datasets. The second genetic structure, represented by Blastocystis spp., A. fumigatus and the L. donovani complex datasets, showed trees with weakly-supported branches but low levels of incongruence among loci, whereby genetic structuration was not clearly defined by MLST. Finally, trees showing weakly-supported branches and high levels of incongruence among loci were observed for Candida species, suggesting that genetic exchange has a higher evolutionary impact in these mainly clonal yeast species. Furthermore, simulations showed that MLST may fail to show right clustering in population datasets even in the absence of genetic exchange. In conclusion, these results make it possible to infer variable impacts of genetic exchange in populations of predominantly clonal micro-pathogens. Moreover, our results reveal different problems of MLST to determine the genetic structure in these organisms that should be considered.Entities:
Mesh:
Year: 2014 PMID: 25054834 PMCID: PMC4108389 DOI: 10.1371/journal.pone.0103131
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of main features of analyzed datasets.
| Datasets | |||||||
| Tc | Fs | Af | B3 | Ld | Ca | Cg | |
| Number of strains | 47 | 51 | 98 | 98 | 38 | 1386 | 212 |
| Number of STs | 24 | 41 | 28 | 58 | 27 | 1000 | 68 |
| Number of polymorphysms | 125 | 213 | 40 | 181 | 47 | 165 | 125 |
| Number of fragments | 7 | 5 | 7 | 5 | 5 | 7 | 6 |
| Typing efficiency | 0.2 | 0.19 | 0.70 | 0.32 | 0.47 | 6.06 | 0.54 |
Tc, Trypanosoma cruzi; Fs, Fusarium solani complex; Af, Aspergillus fumigatus; B3, Blastocystis spp ST3; Ld, Leishmania donovani complex; Ca, Candida albicans; Cg, Candida glabrata.
Typing efficiency: defined as the number of STs per polymorphic site.
Figure 1Consensus support distribution for standardized datasets.
The color scale-bar represents the level of consensus support that varies from 0 fragment trees (white bars) to ≥3 fragment trees (black bars) supporting the branch in the tree for concatenated alignments. The values are calculated as the mean of 10 replications.
Figure 2Bootstrap support distribution for standardized datasets.
The color scale-bar represents the level of bootstrap support that varies from 0–50% (white bars) to more than 90% (black bars) supporting each branch. The values are calculated as the mean of 10 replications.
Figure 3Topological incongruence distribution for standardized datasets.
The color scale-bar represents the number of fragments topologically incompatible with certain branch. It varies from n incongruent fragments (black bars) to less than n-3 (white bars), where n is the number of fragments of the dataset. The values are calculated as the mean of 10 replications.
Figure 4Significance distribution of NJ-LILD test for standardized datasets.
The color scale-bar represents the p-value significance level for the test. NS, not significant at alpha = 0.05; NSB, not significant after Bonferroni correction; SB, significant after Bonferroni correction.
Congruence among distance matrices of different MLST fragments (AAT1, ACC, ADP, MPB, SYA, VPS, ZWF) for two different datasets of Candida albicans.
| Null Hypothesis (H0) | Datasets | |
| 18 STs | 60 STs | |
| All matrices incongruent | 0.0002 | 0.0002 |
| AAT1 incongruent | 0.0128 | 0.0102 |
| ACC incongruent | 0.0002 | 0.0004 |
| ADP incongruent | 0.0002 | 0.0002 |
| MPB incongruent | 0.0002 | 0.0002 |
| SYA incongruent | 0.0546 | 0.0008 |
| VPS incongruent | 0.0002 | 0.0002 |
| ZWF incongruent | 0.0002 | 0.0002 |
*p value calculated from 5000 permutations.
Figure 5MLST clade architecture proposed by [28] for Candida albicans.
Singleton STs are excluded and only the clades are shown. MLST clade 1 (red), MLST clade 2 (blue) and MLST clade 3 (green) were analyzed in the present work.
Significance of different near-clades in distance matrices of the 60 STs dataset of Candida albicans.
| Locus | ||||||||
| Near-Clade | Concat | AAT | ACC | ADP | MPB | SYA | VPS | ZWF |
| 1 |
| 3.27 | 0.06 | 0.45 | 0.08 |
| 1.80 |
|
| 2 | 0.86 | 0.15 | 1.61 | 6.14 | 6.40 | 0.18 | 3.55 | 5.96 |
| 3 | 0.26 | 6.75 | 1.31 |
|
| 6.32 | 0.98 | 1.86 |
Distance matrix for concatenated dataset.
Bonferroni corrected p value for Mantel test with 5,000 random permutation. A significant value (p<0.05) means congruence between the distance matrix and a binary distance matrix that discriminate just one of the proposed near-clades.
Significant p values before Bonferroni correction.
Summary of dataset structure types found in our comparative analyses.
| Structure type | |||
| 1 | 2 | 3 | |
| Consensus support | Moderate to high | low | low |
| Bootstrap | Moderate to high | low | low |
| BIONJ-ILD pval | variable | <0.01 | <0.01 |
| Topological incongruence | Low to moderate | Low to moderate | High |
| Branches with significant LILD | few | Few | More than 40% |
| Datasets |
|
|
|
Consensus support was arbitrarily considered moderate for datasets with 30–60% of branches supported by at least two fragments and High for datasets with more than 60%.
Bootstrap support was arbitrarily considered moderate for datasets with 40%–60% of branches supported by bootstrap higher than 80% and High for datasets with more than 60% with bootrstrap value higher than 80%.
Topological incongruence was considered moderate for datasets with 20–40% of branches with n-1 fragments topologically incompatible with the validity of the near-clade in the concatenated tree and high incongruence was considered for datasets with more than 40% of branches with n-1 fragments topologically incompatible.
Significant NJ-LILD after Bonferroni correction.
*Thresholds are only used to define limits to different genetic structure types, which clearly emerge from a visual comparison of Figures 1, 2 and 3, S2, S3 and S4. It is important to note that the used thresholds are applicable to define structure types when only datasets around 24 STs are used.