| Literature DB >> 33087522 |
Massimiliano S Tagliamonte1,2,3, Charles A Yowell1, Maha A Elbadry4,3, Jacques Boncy5, Christian P Raccurt6, Bernard A Okech4,3, Erica M Goss7,3, Marco Salemi8,3, John B Dame9,3.
Abstract
The malaria parasite, Plasmodium falciparum, was introduced into Hispaniola and other regions of the Americas through the slave trade spanning the 16th through the 19th centuries. During this period, more than 12 million Africans were brought across the Atlantic to the Caribbean and other regions of the Americas. Since malaria is holoendemic in West Africa, a substantial percentage of these individuals carried the parasite. St. Domingue on Hispaniola, now modern-day Haiti, was a major port of disembarkation, and malaria is still actively transmitted there. We undertook a detailed study of the phylogenetics of the Haitian parasites and those from Colombia and Peru utilizing whole-genome sequencing. Principal-component and phylogenetic analyses, based upon single nucleotide polymorphisms (SNPs) in protein coding regions, indicate that, despite the potential for millions of introductions from Africa, the Haitian parasites share an ancestral relationship within a well-supported monophyletic clade with parasites from South America, while belonging to a distinct lineage. This result, in stark contrast to the historical record of parasite introductions, is best explained by a severe population bottleneck experienced by the parasites introduced into the Americas. Here, evidence is presented for targeted selection of rare African alleles in genes which are expressed in the mosquito stages of the parasite's life cycle. These genetic markers support the hypothesis that the severe population bottleneck was caused by the required adaptation of the parasite to transmission by new definitive hosts among the Anopheles (Nyssorhynchus) spp. found in the Caribbean and South America.IMPORTANCE Historical data suggest that millions of P. falciparum parasite lineages were introduced into the Americas during the trans-Atlantic slave trade, which would suggest a paraphyletic origin of the extant isolates in the Western Hemisphere. Our analyses of whole-genome variants show that the American parasites belong to a well-supported monophyletic clade. We hypothesize that the required adaptation to American vectors created a severe bottleneck, reducing the effective introduction to a few lineages. In support of this hypothesis, we discovered genes expressed in the mosquito stages of the life cycle that have alleles with multiple, high-frequency or fixed, nonsynonymous mutations in the American populations which are rarely found in African isolates. These alleles appear to be in gene products critical for transmission through the anopheline vector. Thus, these results may inform efforts to develop novel transmission-blocking vaccines by identifying parasite proteins functionally interacting with the vector that are important for successful transmission. Further, to the best of our knowledge, these are the first whole-genome data available from Haitian P. falciparum isolates. Defining the genome of these parasites provides genetic markers useful for mapping parasite populations and monitoring parasite movements/introductions.Entities:
Keywords: Haiti; Plasmodium falciparum; adaptive mutations; evolutionary biology; malaria; phylogenetics; vector-borne diseases
Mesh:
Substances:
Year: 2020 PMID: 33087522 PMCID: PMC7580960 DOI: 10.1128/mSphere.00937-20
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
FIG 1Clustering analyses on 24 SNP markers. (A) Map of Haiti and sampling sites. Incidence data are from reference 74. Dotted lines indicate origins of samples from reference 14; solid lines indicate origins of our samples, for which whole-genome data are available. (B) sPCA using 24 SNP barcodes from reference 14; following original paper filtering protocol for consistency, one of the isolates that were filtered out was our sample from the Sud-Est Department.
FIG 2Two-dimensional plot of the principal-component analysis using 50,469 SNPs. (A) PCA performed using all samples. (B) To account for the possibility that covariation among parasites that share recent ancestry has a disproportionate impact on PC weightings, the PCA was repeated keeping only a single representative of the two dominant Haitian lineages. (C) PCA performed using a subset of 149 sequences created by removing samples resulting from apparent clonal (epidemic) expansion identified in Fig. S2 or having complexity of infection.
FIG 3Neighbor-joining tree calculated using synonymous SNPs. MNVs and SNPs in genes upregulated in the mosquito stages of the parasite life cycle were removed (13,627 sites were retained). The tree was calculated using the log-det model.
P. falciparum genes upregulated in the mosquito stage
| Gene name | Gene ID | Product description | No. of variant codons in American strains | Upregulation (fold) in stage | ||
|---|---|---|---|---|---|---|
| Gametocyte V | Ookinete | Sporozoite | ||||
| TRAP | PF3D7_1335900 | Thrombospondin-related anonymous protein | 7 | 0 | 0 | 3,235.7 |
| CTRP | PF3D7_0315200 | Circumsporozoite- and TRAP-related protein | 5 | 0 | 582.1 | 0 |
| PSOP26 | PF3D7_1244500 | Conserved | 5 | 0 | 133 | 0 |
| P47 | PF3D7_1346800 | 6-cysteine protein | 5 | 37.6 | 27.8 | 0 |
| NA | PF3D7_0511400 | Conserved | 2 | 0 | 0 | 147.4 |
| SIAP1 | PF3D7_0408600 | Sporozoite invasion-associated protein 1 | 1 | 0 | 0 | 270.7 |
| NA | PF3D7_0515500 | Amino acid transporter, putative | 1 | 27.2 | 0 | 0 |
| PBLP | PF3D7_0818600 | BEM46-like protein, putative | 1 | 0 | 0 | 33.4 |
| CRMP1 | PF3D7_0911300 | Cysteine repeat modular protein 1 | 1 | 0 | 0 | 14 |
| ICP | PF3D7_0911900 | Falstatin | 1 | 0 | 0 | 11.4 |
| NA | PF3D7_0924600 | Conserved | 1 | 29.4 | 19.8 | 0 |
| NA | PF3D7_1020200 | Conserved | 1 | 86.5 | 223.6 | 0 |
| CRMP3 | PF3D7_1208200 | Cysteine repeat modular protein 3 | 1 | 0 | 0 | 16.8 |
| P48/45 | PF3D7_1346700 | 6-cysteine protein | 1 | 25 | 15.8 | 0 |
| NA | PF3D7_1348400 | Conserved | 1 | 20 | 0 | 0 |
| NA | PF3D7_1403200 | Conserved | 1 | 158.9 | 82.1 | 0 |
| SOAP | PF3D7_1404300 | Secreted ookinete adhesive protein, putative | 1 | 0 | 70.8 | 0 |
Variant codons which have ≥0.7 frequency in America and ≤0.3 frequency in Africa.
Determined by comparison to blood stages.
FUBAR results for CTRP
| Codon no. | Amino acids | Amino acid frequencies in: | FUBAR posterior probability of positive selection | |||
|---|---|---|---|---|---|---|
| Haiti | South America | America | Africa | |||
| 17 | H, P | 1, 0 | 0.96, 0.04 | 0.97, 0.03 | 0, 1 | 0.95 |
| 319 | N, D | 1, 0 | 0.96, 0.04 | 0.97, 0.03 | 0, 1 | 0.82 |
| 659 | R, Q | 1, 0 | 1, 0 | 1, 0 | 0.3, 0.7 | 0.94 |
| 1046 | K, N | 1, 0 | 0.88, 0.12 | 0.91, 0.09 | 0, 1 | 0.83 |
| 1260 | N, S | 1, 0 | 0.92, 0.08 | 0.94, 0.06 | 0.27, 0.73 | 0.96 |
| 2093 | P, S | 1, 0 | 0.62, 0.38 | 0.73, 0.27 | 0, 1 | 0.94 |
| 2098 | Q, E | 1, 0 | 0.96, 0.04 | 0.97, 0.03 | 0.17, 0.83 | 0.96 |
PF3D7_0315200 (circumsporozoite and TRAP-related protein; 2,114 amino acids [aa]). Only codons with differential frequencies between Haiti and Africa were retained.
FUBAR results for PSOP26
| Codon no. | Amino acids | Amino acid frequencies in: | FUBAR posterior probability of positive selection | |||
|---|---|---|---|---|---|---|
| Haiti | South America | America | Africa | |||
| 209 | F, V | 1, 0 | 0.88, 0.12 | 0.91, 0.09 | 0, 1 | 0.84 |
| 494 | P, R | 1, 0 | 1, 0 | 1, 0 | 0, 1 | 0.80 |
| 664 | R, S | 1, 0 | 1, 0 | 1, 0 | 0, 1 | 0.93 |
| 722 | K, N | 1, 0 | 1, 0 | 1, 0 | 0, 1 | 0.87 |
| 736 | N, K | 1, 0 | 1, 0 | 1, 0 | 0.07, 0.93 | 0.96 |
PF3D7_1244500 (conserved Plasmodium protein, unknown function; 810 aa). Only codons with differential frequencies between Haiti and Africa were retained.
FUBAR results for TRAP
| Codon no. | Amino acids | Amino acid frequencies in: | FUBAR posterior probability of positive selection | |||
|---|---|---|---|---|---|---|
| Haiti | South America | America | Africa | |||
| 66 | K, N | 0.56, 0.44 | 0.92, 0.08 | 0.82, 0.18 | 0.1, 0.9 | 1.00 |
| 83 | E, D | 0.78, 0.22 | 1, 0 | 0.94, 0.06 | 0.23, 0.77 | 0.96 |
| 92 | I, V | 0.78, 0.22 | 0.92, 0.08 | 0.88, 0.12 | 0, 1 | 0.98 |
| 277 | L, I, T | 0.89, 0.11, 0 | 0.92, 0.08, 0 | 0.91, 0.09, 0 | 0.27, 0.7, 0.03 | 0.99 |
| 297 | Q, H, D | 0.67, 0.33, 0 | 0.92, 0.08, 0 | 0.85, 0.15, 0 | 0, 0.53, 0.47 | 0.97 |
| 509 | R, K | 0.67, 0.33 | 0.83, 0.17 | 0.79, 0.21 | 0, 1 | 0.92 |
| 541 | F, Y | 0.67, 0.33 | 0.92, 0.08 | 0.85, 0.15 | 0, 1 | 0.84 |
PF3D7_1335900 (thrombospondin-related anonymous protein; 574 aa). Only codons with differential frequencies between Haiti and Africa were retained.
FUBAR results for Pfs47
| Codon no. | Amino acids | Amino acid frequencies in: | FUBAR posterior probability of positive selection | |||
|---|---|---|---|---|---|---|
| Haiti | South America | America | Africa | |||
| 178 | V, I | 1, 0 | 0.83, 0.17 | 0.87, 0.13 | 0, 1 | 0.86 |
| 236 | I, T | 1, 0 | 1, 0 | 1, 0 | 0, 1 | 0.85 |
| 242 | L, S | 1, 0 | 1, 0 | 1, 0 | 0, 1 | 0.86 |
| 247 | A, V | 1, 0 | 1, 0 | 1, 0 | 0, 1 | 0.86 |
| 248 | L, I | 1, 0 | 1, 0 | 1, 0 | 0.17, 0.83 | 0.94 |
PF3D7_1346800 (6-cysteine protein; 439 aa). Only codons with differential frequencies between Haiti and Africa were retained.