| Literature DB >> 35152493 |
Roy N Platt1, Winka Le Clec'h1, Frédéric D Chevalier1, Marina McDew-White1, Philip T LoVerde2, Rafael Ramiro de Assis3, Guilherme Oliveira3,4, Safari Kinung'hi5, Amadou Garba Djirmay6, Michelle L Steinauer7, Anouk Gouvras8, Muriel Rabone8, Fiona Allan9,10, Bonnie L Webster8,10, Joanne P Webster9,10, Aidan M Emery8,10, David Rollinson8,10, Timothy J C Anderson1.
Abstract
Schistosoma mansoni, a snail-borne, blood fluke that infects humans, was introduced into the Americas from Africa during the Trans-Atlantic slave trade. As this parasite shows strong specificity to the snail intermediate host, we expected that adaptation to South American Biomphalaria spp. snails would result in population bottlenecks and strong signatures of selection. We scored 475,081 single nucleotide variants in 143 S. mansoni from the Americas (Brazil, Guadeloupe and Puerto Rico) and Africa (Cameroon, Niger, Senegal, Tanzania, and Uganda), and used these data to ask: (i) Was there a population bottleneck during colonization? (ii) Can we identify signatures of selection associated with colonization? (iii) What were the source populations for colonizing parasites? We found a 2.4- to 2.9-fold reduction in diversity and much slower decay in linkage disequilibrium (LD) in parasites from East to West Africa. However, we observed similar nuclear diversity and LD in West Africa and Brazil, suggesting no strong bottlenecks and limited barriers to colonization. We identified five genome regions showing selection in the Americas, compared with three in West Africa and none in East Africa, which we speculate may reflect adaptation during colonization. Finally, we infer that unsampled populations from central African regions between Benin and Angola, with contributions from Niger, are probably the major source(s) for Brazilian S. mansoni. The absence of a bottleneck suggests that this is a rare case of a serendipitous invasion, where S. mansoni parasites were pre-adapted to the Americas and able to establish with relative ease.Entities:
Keywords: Africa; Brazil; codispersal; exome; human parasite; migration
Mesh:
Year: 2022 PMID: 35152493 PMCID: PMC9305930 DOI: 10.1111/mec.16395
Source DB: PubMed Journal: Mol Ecol ISSN: 0962-1083 Impact factor: 6.622
FIGURE 1Sampling localities. Location and number of samples for Schistosoma mansoni (yellow circles) and S. rodhaini (green squares) samples included in this study. Members of the genus Biomphalaria are the predominant intermediate hosts with specific distributions of the different species involved in transmission varying across Africa and South America. Intermediate snail vectors are listed in greyed boxes
Whole genome summary statistics for Schistosoma mansoni populations and S. rodhaini
|
|
|
|
| Tajima's | Θ |
|
| Tajima's | |
|---|---|---|---|---|---|---|---|---|---|
|
| 9 | 5.61E‐04 | Na | Na | 0.479 | 4.29E‐04 | 13,226 | Na | Na |
| Sm (Brazil) | 45 | 6.93E‐04 | 2.10E‐03 | 0.984 | 0.034 | 5.84E‐04 | 18,032 | 2.22E‐04 | −1.218 |
| Sm (Niger) | 10 | 6.00E‐04 | 5.89E‐03 | 0.978 | −0.579 | 6.07E‐04 | 18,737 | 2.46E‐04 | −1.018 |
| Sm (Senegal) | 25 | 4.97E‐04 | 4.72E‐03 | 0.997 | −1.417 | −1.375 | 21,992 | 1.75E‐04 | −1.816 |
| Sm (Tanzania) | 55 | 1.45E‐03 | 7.25E‐03 | 1.0 | −0.729 | −0.739 | 51,508 | Na | Na |
"n"—number of samples; "π"—nucleotide diversity; "H"—haplotype diversity; "Θ"—Watterson estimator; "N e"—effective population size.
Abbreviations: PRS, putative region of selection; Sm, Schistosoma mansoni.
Eight of nine S. rodhaini samples came from a single laboratory population: population statistics are probably biased.
No S. rodhaini reads mapped to the S. mansoni mitochondria.
Not calculated.
F ST between Schistosoma species and populations
| Pop1 | Pop2 |
|
|
|---|---|---|---|
|
| Sm (Caribbean) | 0.929 | 0.0012 |
|
| Sm (Tanzania) | 0.844 | 0.0016 |
|
| Sm (Senegal) | 0.937 | 0.0013 |
|
| Sm (Niger) | 0.931 | 0.0011 |
|
| Sm (Brazil) | 0.919 | 0.0013 |
| Sm (Caribbean) | Sm (Tanzania) | 0.279 | 0.0036 |
| Sm (Caribbean) | Sm (Senegal) | 0.323 | 0.0085 |
| Sm (Caribbean) | Sm (Niger) | 0.236 | 0.0071 |
| Sm (Caribbean) | Sm (Brazil) | 0.154 | 0.0067 |
| Sm (Tanzania) | Sm (Senegal) | 0.416 | 0.0032 |
| Sm (Tanzania) | Sm (Niger) | 0.348 | 0.0031 |
| Sm (Tanzania) | Sm (Brazil) | 0.379 | 0.0034 |
| Sm (Senegal) | Sm (Niger) | 0.135 | 0.0042 |
| Sm (Senegal) | Sm (Brazil) | 0.235 | 0.0047 |
| Sm (Niger) | Sm (Brazil) | 0.152 | 0.0036 |
Abbreviation: Sm, Schistosoma mansoni.
FIGURE 2Linkage disequilibrium (LD) decay and diversity within populations—(a) LD between single nucleotide variants was quantified with r 2 values for each population. Mean r 2 values were taken in 500‐bp windows and loess smoothed. Vertical dotted lines indicate the distance where r 2 = .5 for each population. LD decayed to r 2 = .5 in 28 bp (Tanzania), 15,150 bp (Niger), 19,318 bp (Senegal) and 26,196 bp (Brazil). (b) Nucleotide diversity (π) varied between Schistosoma mansoni populations with the highest levels of diversity occurring in East Africa (Tanzania). π was measured in 100‐kb windows across the autosomal chromosomes. Outliers are not shown
Admixture statistics
| Comparison | Test Stat |
|
|
|---|---|---|---|
|
| |||
| (((Br, Tz)Sr), Smr) | −0.033 | 0.0404 | −0.807 |
| (((Ni, Tz)Sr), Smr) | 0.006 | 0.0437 | 0.129 |
| (((Se, Tz)Sr), Smr) | 0.051 | 0.0473 | 1.070 |
|
| |||
| Br: Tz, Sr | 0.764 | 0.0207 | 36.910 |
| Ni: Tz, Sr | 0.993 | 0.0299 | 33.214 |
| Se: Tz, Sr | 1.265 | 0.0461 | 27.428 |
|
| |||
| (Br, Tz, Sr) | −0.0047 | 0.00036 | 0.418 |
| (Ni, Tz, Sr) | −0.0046 | 0.00034 | 0.419 |
| (Se, Tz, Sr) | −0.0054 | 0.00048 | 0.353 |
| (Cm, Tz, Sr) | 0.0005 | 0.00066 | −0.022 |
| (Cr, Tz, Sr) | −0.0048 | 0.00029 | 0.517 |
| (Ug, Tz, Sr) | 0.0034 | 0.00062 | −0.174 |
"Z"—Z‐score of the mean from 0.
Population abbreviations: Br, Brazil; Cm, Cameroon; Cr, Caribbean; Ni, Niger; Se, Senegal; Smr, Schistosoma margrebowiei; Sr, Schistosoma rodhaini; Tz, Tanzania; Ug, Uganda.
FIGURE 4Population structure in Schistosoma mansoni. admixture analyses with k = 4 and k = 5 populations identified clear distinctions between each of the major sampling localities (the Americas, West Africa, East Africa and S. rodhaini). The population components in each of the whole genome samples from Uganda (UGN), Kenya (KNY), Cameroon (CMR) and the Caribbean (CRB) were more heterogeneous than samples with exome data. Cameroonian and Nigerien samples contain moderate proportions of the Brazilian population component. BRN, Burundi; BRZ, Brazil; NGR, Niger; SNG, Senegal; TNZ, Tanzania
FIGURE 3A principal component analysis (PCA) of unlinked autosomal single nucleotide variants. The PCA plot that included Schistosoma mansoni and S. rodhaini (left) clearly shows a large distinction between the two species with some variation within S. mansoni along PC2. A PCA with only S. mansoni (right) differentiates East African S. mansoni along PC1. The remaining S. mansoni samples fall along a continuum on PC2 that goes from samples in West Africa and transitions to the Americas. Inset bar charts represent the percentage variation explained along the first 20 PCs. Only PC1 and PC2 were examined (shaded)
FIGURE 5Phylogenetic relationships between Schistosoma mansoni populations. Multiple phylogenetic analyses and marker types were used to discern relationships between S. mansoni populations. (a) A median‐joining haplotype network was constructed from 815 variants across the mitochondria of all S. mansoni samples. (b) A coalescent‐based species tree from 100,819 parsimony‐informative single nucleotide variants with bootstrap values shown on each clade. Monophyletic populations are shown as a collapsed clade except in the case of East Africa which contains samples from Tanzania and Uganda. (c) A maximum‐likelihood phylogenetic network of autosomal variants identified a single, weak migration edge orientated from Brazil to Niger. All three analyses identify a relationship between Senegal, Niger and Brazil that excludes East African samples. The mitochondrial (a) and autosomal (c) networks both allow for direct relationship or allele sharing between Brazil and Niger. The species tree (b) indicates a strong relationship between Senegal and Niger that excludes Brazil (bootstrap percentage = 100)
FIGURE 6Positive selection across the Schistosoma mansoni genome. Selection across the S. mansoni genome was calculated with haplotype (H)‐ and allele frequency (likelihood ration; LR)‐based methods. Dotted lines represent positions with an H or LR value in the 99th percentile. The solid black line represents the maximum H or LR calculated from simulated data under neutral conditions. Regions of interest (blue, boxed) were identified by finding sites where the H and LR values were in the >99th percentile and were both greater than the maximum H or LR from simulated data. Once these sites were identified we combined variants within 333,333‐bp windows that showed signs of selection: an H or LR greater than the simulated threshold
Number of SNPs and regions identified in genome‐wide scans for selection
| Brazil | Niger | Senegal | Tanzania | |
|---|---|---|---|---|
| Outlier SNVs ( | 4366 | 44,024 | 9182 | 2 |
| Outlier SNVs ( | 12,871 | 41,525 | 23,379 | 4288 |
| Merged regions (1/3 Mb) | 113 | 250 | 168 | 46 |
| 99th Percentile SNPs | 703 | 239 | 176 | 0 |
| Outlier SNPs ( | 442 | 442 | 442 | Na |
| Putative regions under selection | 5 | 3 | 3 | 0 |
| Genes in selected regions | 116 | 112 | 157 | 0 |
| Genes w/99th percentile SNPs | 10 | 5 | 7 | 0 |
Abbreviations: SNP, single nucleotide polymorphism; SNV, single nucleotide variant.
Outlier SNPs are those that are greater than the maximum value derived from neutrally simulated data.
Genes in Schistosoma mansoni populations with the strongest signals of directional selection
| Gene ID | UniProtKB accession | UniProtKB description | HHsearch annotation |
|---|---|---|---|
| Brazil | |||
| Smp_060090 | G4VAJ9 | 40S ribosomal protein S12 | Ribosomal protein S12e ribosome |
| Smp_073680 | G4V701 | Putative TATA‐box binding protein | DNA‐directed RNA polymerase II subunit |
| Smp_123510 | A0A5K4EKW3 | Vacuolar protein sorting‐associated protein 16 homolog | Vps16_C |
| Smp_123520 | A0A3Q0KKK4 | Putative RNA M5U methyltransferase | rRNA (Uracil‐5‐)‐methyltransferase RumA |
| Smp_123570 | A0A3Q0KKM5 | BHLH domain‐containing protein | Aryl hydrocarbon receptor nuclear translocator |
| Smp_123590 | A0A5K4EL22 | Uncharacterized protein | Swi5‐dependent recombination DNA repair protein |
| Smp_148460 | A0A3Q0KPP3 | Putative neurofibromin | GAP‐related domain of neurofibromin |
| Smp_162000 | A0A5K4ETP0 | UBR‐type domain‐containing protein | E3_UbLigase_R4 |
| Smp_246630 | A0A5K4F427 | UBC core domain‐containing protein | E2 Ubiquitin conjugating enzyme |
| Smp_341570 | A0A5K4FC62 | Uncharacterized protein | Uncharacterized protein family UPF0183 |
| Niger | |||
| Smp_008230 | G4V7H8 | Putative rab‐18 | di‐Ras2 |
| Smp_126620 | A0A3Q0KL42 | Uncharacterized protein | Ligand‐binding domain of low‐density lipoprotein receptor |
| Smp_165060 | A0A3Q0KRY3 | Uncharacterized protein | Uncharacterized protein |
| Smp_167890 | Q6BC90 | Peptide‐methionine ( | C‐terminal MsrB domain of methionine sulphoxide reductase PilB |
| Smp_313490 | A0A5K4F3H5 | Uncharacterized protein | Transforming protein RhoA, Rho‐associated, coiled‐coil |
| Senegal | |||
| Smp_070780 | G4VEM1 | UDP‐glucose 4‐epimerase | Uridine diphosphogalactose‐4‐epimerase |
| Smp_123440 | A0A3Q0KKL2 | Putative fad oxidoreductase |
|
| Smp_123520 | A0A3Q0KKK4 | Putative RNA M5U methyltransferase | rRNA (Uracil‐5‐)‐methyltransferase RumA |
| Smp_164560 | A0A3Q0KRR1 | Uncharacterized protein | Na |
| Smp_167890 | Q6BC90 | Peptide‐methionine (R)‐S‐oxide reductase | C‐terminal MsrB domain of methionine sulfoxide reductase PilB |
| Smp_213150 | A0A5K4EZI5 | Uncharacterized protein | Ribonucleases P/MRP protein subunit POP1 |
| Smp_313490 | A0A5K4F3H5 | Uncharacterized protein | Transforming protein RhoA, Rho‐associated, coiled‐coil |
UniProtKB descriptions are from release 2020_06. HHsearch annotations are from Le Clec'h et al. (2021).
Shared between Brazil and Senegal.
Shared between Niger and Senegal.
FIGURE 7Export and importation of slaves between Africa and Brazil. During the Trans‐Atlantic slave trade, 12 million people were trafficked from Africa. More than 3.5 million were exported into Brazil at major ports along the east coast. Most slaves originated from regions along the West African coast from Benin to Angola. General locations of major embarkation and disembarkation ports are shown along the (a) Brazilian and (b) Africa coastlines. (c) The movement of people between Brazilian and African ports is shows with horizontal lines. Line width is proportional to the number of migrants. The Brazilian samples in this study were collected from Ponto dos Volantes in Minas Gerais, Brazil, between the Ports of Bahaia and South East Brazil. The primary embarkation points for Bahaia and South East Brazil were in West Central Africa and the Bight of Benin. These regions are a probable source for the S. mansoni collected in Ponto dos Volantes. Collection localities for S. mansoni are represented by grey dots. Slave exportation data are from the Slave Voyages Database (2009)