| Literature DB >> 35058485 |
Jeffrey C Long1, Sarah E Taylor2, Lucio M Barbosa3,4, Luciano K Silva4, Mitermayer G Reis4,5,6, Ronald E Blanton7.
Abstract
Patterns of diversity in pathogen genomes provide a window into the spatiotemporal spread of disease. In this study, we tested the hypothesis that Schistosoma mansoni parasites form genetic clusters that coincide with the communities of their human hosts. We also looked for genetic clustering of parasites at the sub-community level. Our data consists of 14 microsatellite DNA markers, typed from pooled DNA samples from [Formula: see text] infected individuals living in three Brazilian communities. We found a one-to-one correspondence between genetic clusters found by K-means cluster analysis and communities when [Formula: see text]. These clusters are also easily identified in a neighbor-joining tree and principal coordinates plots. K-means analysis with [Formula: see text] also reveals genetic clusters of parasites at the sub-community level. These sub-clusters also appear on the neighbor-joining tree and principal coordinates plots. A surprising finding is a genetic relationship between subgroups in widely separated human communities. This connection suggests the existence of common transmission sites that have wide influence. In summary, the genetic structure of S. mansoni in Brazil juxtaposes local isolation that is occasionally broken by long-range migration. Permanent eradication of schistosomes will require both local efforts and the identification of regional infection reservoirs.Entities:
Mesh:
Year: 2022 PMID: 35058485 PMCID: PMC8776789 DOI: 10.1038/s41598-022-04776-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Genetic loci and characteristics.
| Locus | Alleles | ||||||
|---|---|---|---|---|---|---|---|
| SMMS2 | 4 | 0.63 | 0.62 | 0.62 | 0.027 | 0.000 | 0.026 |
| SMMS13 | 5 | 0.696 | 0.678 | 0.677 | 0.056 | 0.004 | 0.06 |
| SMMS16 | 8 | 0.245 | 0.23 | 0.207 | 0.019 | 0.029 | 0.048 |
| SMMS3 | 13 | 0.169 | 0.152 | 0.147 | 0.021 | 0.005 | 0.026 |
| SMMS18 | 15 | 0.251 | 0.226 | 0.203 | 0.033 | 0.029 | 0.061 |
| SMMS21 | 9 | 0.557 | 0.542 | 0.376 | 0.034 | 0.266 | 0.291 |
| SMDA23 | 12 | 0.308 | 0.286 | 0.286 | 0.031 | 0.000 | 0.032 |
| 1F8A | 11 | 0.287 | 0.265 | 0.251 | 0.029 | 0.019 | 0.048 |
| 13TAGA | 11 | 0.288 | 0.272 | 0.253 | 0.022 | 0.025 | 0.047 |
| SM13-410 | 8 | 0.45 | 0.433 | 0.296 | 0.029 | 0.194 | 0.218 |
| SMU31768 | 14 | 0.200 | 0.193 | 0.19 | 0.008 | 0.003 | 0.012 |
| 15J15A | 10 | 0.275 | 0.263 | 0.261 | 0.016 | 0.003 | 0.019 |
| 29E6A | 9 | 0.281 | 0.259 | 0.25 | 0.03 | 0.012 | 0.041 |
| SM13-478 | 13 | 0.286 | 0.209 | 0.204 | 0.097 | 0.006 | 0.103 |
| Multiple Locus | – | 0.352 | 0.331 | 0.302 | 0.032 | 0.041 | 0.072 |
| Lower 95% | – | 0.275 | 0.255 | 0.231 | 0.022 | 0.010 | 0.038 |
| Upper 95% | – | 0.443 | 0.424 | 0.394 | 0.046 | 0.086 | 0.116 |
Figure 1Two examples of loci with alleles that reach a high frequency in one community while being rare or absent in the other communities. Arrows point to the diagnostic markers. At SMMS18 the 189 and 198 base-pair alleles are clear markers of infrapopulations sampled in the Volta do Rio community. The 177 base-pair allele at SMMS21 is a clear marker of infrapopulations sampled in the São Bartolomeu community.
Figure 2Principal Coordinates of Genetic Distance Matrix. (a) PC1 versus PC2 with infrapopulations color coded according to community sampled. (b) PC1 versus PC3 with the same coding as in (a). (c) PC1 versus PC2 with infrapopulations color coded according to cluster analysis results with . (d) PC1 versus PC3 with the same coding as in (c).
Figure 3(a) Fan plot of the neighbor-joining tree with tips labeled according to communities. (b) Fan plot of the neighbor-joining tree with tips labeled according to results for K-means cluster analysis with . (c) Genetic distances computed as the sum of branch lengths on the neighbor-joining tree plotted against genetic distances computed directly from allele frequencies. The orange-filled points indicate comparisons between infrapopulations in Cluster 4 with those in Cluster 5.
Figure 4Heatmap of Gene Identity Matrix. Each ‘pixel’ on the diagonal corresponds to the gene identity within an infrapopulation. Each ‘pixel’ on the off-diagonal corresponds to the gene identity between two infrapopulations. Rows and columns are ordered using the results from K-means clustering with K = 5. VdR = Volta do Rio, C4 is a subgroup of infrapopulations sampled in São Bartolomeu and C5 is a subgroup of infrapopulations sampled in Jenipapo. The arrow identifies a block of increased gene identity between infrapopulations in C4 and C5.