| Literature DB >> 30127015 |
Dorothy E Loy1,2, Lindsey J Plenderleith3,4, Sesh A Sundararaman1,2, Weimin Liu1, Jakub Gruszczyk5, Yi-Jun Chen4,6, Stephanie Trimboli1, Gerald H Learn1, Oscar A MacLean3,4, Alex L K Morgan3,4, Yingying Li1, Alexa N Avitto1, Jasmin Giles1, Sébastien Calvignac-Spencer7, Andreas Sachse7, Fabian H Leendertz7, Sheri Speede8, Ahidjo Ayouba9, Martine Peeters9, Julian C Rayner10, Wai-Hong Tham5,6, Paul M Sharp3,4, Beatrice H Hahn11,2.
Abstract
Wild-living African apes are endemically infected with parasites that are closely related to human Plasmodium vivax, a leading cause of malaria outside Africa. This finding suggests that the origin of P. vivax was in Africa, even though the parasite is now rare in humans there. To elucidate the emergence of human P. vivax and its relationship to the ape parasites, we analyzed genome sequence data of P. vivax strains infecting six chimpanzees and one gorilla from Cameroon, Gabon, and Côte d'Ivoire. We found that ape and human parasites share nearly identical core genomes, differing by only 2% of coding sequences. However, compared with the ape parasites, human strains of P. vivax exhibit about 10-fold less diversity and have a relative excess of nonsynonymous nucleotide polymorphisms, with site-frequency spectra suggesting they are subject to greatly relaxed purifying selection. These data suggest that human P. vivax has undergone an extreme bottleneck, followed by rapid population expansion. Investigating potential host-specificity determinants, we found that ape P. vivax parasites encode intact orthologs of three reticulocyte-binding protein genes (rbp2d, rbp2e, and rbp3), which are pseudogenes in all human P. vivax strains. However, binding studies of recombinant RBP2e and RBP3 proteins to human, chimpanzee, and gorilla erythrocytes revealed no evidence of host-specific barriers to red blood cell invasion. These data suggest that, from an ancient stock of P. vivax parasites capable of infecting both humans and apes, a severely bottlenecked lineage emerged out of Africa and underwent rapid population growth as it spread globally.Entities:
Keywords: Plasmodium vivax; genomics; great apes; malaria; zoonotic transmission
Mesh:
Substances:
Year: 2018 PMID: 30127015 PMCID: PMC6130405 DOI: 10.1073/pnas.1810053115
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Genome features of ape P. vivax
| Genome attributes | PvSY56 | PvSY43 | PvP01 |
| Host species | Chimpanzee | Chimpanzee | Human |
| Country | Cameroon | Cameroon | Indonesia |
| Chromosomal assembly | 21,928,114 | 21,224,756 | 24,177,188 |
| Mean depth of coverage | 319 | 240 | N/A |
| Chromosomal contigs | 7,112 | 6,604 | 14 |
| G + C content, % | 44.1 | 44.6 | 43.3 |
| Core protein-coding genes | 4,883 (98.8) | 4,908 (99.3) | 4,941 (100) |
| Full length | 4,391 (88.9) | 4,350 (88.0) | N/A |
| Partial (% of PvP01) | 492 (10.0) | 558 (11.3) | N/A |
| Genes in hypervariable regions | 415 | 276 | 1,702 |
N/A, not applicable.
The genome assembly of PvSY43 represents a consensus sequence of at least two major and three minor chimpanzee P. vivax variants (Fig. 2 and ).
Chimpanzee P. vivax genomes were compared with the human P. vivax reference PvP01 (24).
Number of unambiguous bases.
Calculated by dividing the number of nucleotides in reads mapped to the assemblies by the expected genome size from PvP01.
Subtelomeric, core and internal hypervariable regions were defined as described (20).
Genes classified as full length comprised at least 90% of the length of the corresponding PvP01 ortholog.
Fig. 2.Evolutionary relationships of ape and human P. vivax strains. (A) An unrooted neighbor-joining tree constructed from a matrix of pairwise genetic distances from an alignment of 241 nuclear genes is shown for nine human (black) and six chimpanzee (red) P. vivax strains (the Inset shows the human P. vivax strains in greater detail). (B) As in A, but based on six nuclear genes with coverage in one gorilla P. vivax strain (green). The same human and chimpanzee P. vivax strains were included, except for PvSY81, which did not cover these genes. (C and D) Maximum-likelihood trees for fragments of nuclear genes PVP01_1418300 (C) and PVP01_1418500 (D) with P. cynomolgi and P. knowlesi included as outgroups. Sequences of P. carteri parasites are shown in blue. “Pv” denotes sequences from genome-wide analyses, shown in bold if generated by SWGA or derived from published data (); all other sequences except for P. cynomolgi and P. knowlesi were generated by SGA and include a code identifying their geographic origin (), ape subspecies (G.g.g., Gorilla gorilla gorilla; P.t.e., Pan troglodytes ellioti; P.t.t., Pan troglodytes troglodytes), and sample number (see for GenBank accession numbers). Bootstrap values ≥70 are shown for clades with two or more nonidentical tips. Fragment lengths in PvP01 are indicated above the trees. The scale bars indicate substitutions per site (see also for phylogenetic network representations).
Nucleotide polymorphism in ape and human P. vivax
| Parasites | πall | π0 | π4 | NS polymorphisms | S polymorphisms | NS/S | |
| Ape | 6 | 0.00698 | 0.00357 | 0.01604 | 32,364 | 47,494 | 0.68 |
| Human | 9 | 0.00085 | 0.00060 | 0.00143 | 10,530 | 7,673 | 1.37 |
n = number of strains included in the analysis (see for gene coverage among the different strains).
Mean pairwise diversity at coding sites from 4,263 genes (6.5 million sites).
Mean pairwise diversity at zerofold degenerate sites from 4,263 genes (4.0 million sites).
Mean pairwise diversity at fourfold degenerate sites from 4,263 genes (0.7 million sites).
Numbers of nonsynonymous (NS) and synonymous (S) polymorphisms were calculated by counting the number of SNPs that changed (NS) or did not change (S) the protein sequence of the respective reference (PvSY56 for ape P. vivax; PvP01 for human P. vivax).
Fig. 1.Nucleotide sequence diversity in ape and human P. vivax. (A) The π calculated across a common set of 4,260 core genes for six chimpanzee and nine human P. vivax strains (as in Table 2, but three genes with fewer than 35 aligned sites were excluded). Median and mean (weighted by gene length) π values are indicated by solid and dashed lines, respectively; box and whiskers indicate the interquartile range and 99th percentiles, respectively. Plots including outliers are shown in . (B) Density plots of NI values shown on a log2 scale for ape (red) and human (black) P. vivax genes. Values are shown for 1,585 genes with nonzero values of NI in both populations. (C) Site-frequency spectra of polymorphisms at fourfold degenerate (blue) and zerofold degenerate (orange) sites extracted from SNP data of human P. vivax samples from Southeast Asia (20).
Fig. 3.The rbp gene family in ape and human P. vivax. (A) A midpoint-rooted maximum-likelihood phylogenetic tree is shown depicting the relationships of human (black) and chimpanzee (PvSY56 and PvSY43, red) P. vivax rbp genes with their orthologs in P. knowlesi, P. cynomolgi, P. inui, P. fragile, and human P. malariae (purple). P. vivax, P. cynomolgi, and P. knowlesi genes are labeled according to their published names; genes from P. inui, P. fragile, and P. malariae are labeled according to the clade in which they are placed. Pseudogenes are indicated by yellow stars. The Inset shows the relationship of rbp1a sequences among representative human and three sequenced chimpanzee P. vivax strains, rooted using P. cynomolgi (see for details). (B) Locations of frameshift (purple) and premature stop (black) mutations in rbp2d, rbp2e, and rbp3 sequences assembled from published human P. vivax strains (20, 21), relative to the full-length coding sequence from chimpanzee P. vivax (light green). Each bar represents a set of mutations that occurred in two or more human P. vivax strains for which a full-length sequence was assembled (128, 162, and 227 sequences for rbp2d, rbp2e, and rbp3, respectively); the percentage of sequences containing the respective mutations is shown on the right, with “other” summarizing all mutations that occurred only once.
Fig. 4.Binding of RBPs to ape and human red blood cells. (A) Dot plots depict the binding of human P. vivax RBP2a and RBP2b proteins and chimpanzee P. vivax RBP2e and RBP3 proteins to human (first row), gorilla (second row), and chimpanzee (third row) red blood cells, respectively, along with antibody-only controls of human red blood cells (fourth row). RBP binding was detected using an RBP-specific polyclonal rabbit antibody and an anti-rabbit (Alexa Fluor 647-labeled) secondary antibody (y axis), and reticulocytes were identified by staining with TO (x axis). Flow cytometry gates separating normocytes from reticulocytes and protein binding from no protein binding are shown by vertical and horizontal lines, respectively. Numbers indicate the percentage of total cells within the respective gate. (B) Percentage of gorilla (green), chimpanzee (red), and human (black) normocytes bound by the respective RBP. (C) Percentage of gorilla, chimpanzee, and human reticulocytes bound by the respective RBP. Experiments were performed as three technical replicates with the background signal from the antibody-only control subtracted from each binding result.