Literature DB >> 28493919

Genomic variation in Plasmodium vivax malaria reveals regions under selective pressure.

Ernest Diez Benavente1, Zoe Ward1,2, Wilson Chan1,3, Fady R Mohareb3, Colin J Sutherland1, Cally Roper1, Susana Campino1, Taane G Clark1.   

Abstract

BACKGROUND: Although Plasmodium vivax contributes to almost half of all malaria cases outside Africa, it has been relatively neglected compared to the more deadly P. falciparum. It is known that P. vivax populations possess high genetic diversity, differing geographically potentially due to different vector species, host genetics and environmental factors.
RESULTS: We analysed the high-quality genomic data for 46 P. vivax isolates spanning 10 countries across 4 continents. Using population genetic methods we identified hotspots of selection pressure, including the previously reported MRP1 and DHPS genes, both putative drug resistance loci. Extra copies and deletions in the promoter region of another drug resistance candidate, MDR1 gene, and duplications in the Duffy binding protein gene (PvDBP) potentially involved in erythrocyte invasion, were also identified. For surveillance applications, continental-informative markers were found in putative drug resistance loci, and we show that organellar polymorphisms could classify P. vivax populations across continents and differentiate between Plasmodia spp.
CONCLUSIONS: This study has shown that genomic diversity that lies within and between P. vivax populations can be used to elucidate potential drug resistance and invasion mechanisms, as well as facilitate the molecular barcoding of the parasite for surveillance applications.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28493919      PMCID: PMC5426636          DOI: 10.1371/journal.pone.0177134

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Background

The Plasmodium vivax malaria parasite is the second most virulent malaria species after P. falciparum. Geographically, it is found throughout Asia, South and Central America, Oceania, Middle East and some parts of Africa, with nearly 2.85 billion people at risk of infection [ Although P. vivax contributes to almost half of all malaria cases outside Africa, as it kills infrequently and is not amenable to continuous in vitro culture, it has been relatively neglected compared to the more deadly P. falciparum [ However, as P. vivax drug-resistant strains emerge and spread and fatality rates increase, the need to implement better control and elimination strategies is becoming urgent. Many of the interventions used for controlling P. falciparum malaria are not as effective against P. vivax. Consequently, P. vivax has become the dominant malaria parasite in several countries where P. falciparum transmission has been successfully reduced. Hence, control and elimination of P. vivax malaria calls for additional interventions, notably against the dormant liver stage of the parasite. However, gaps in our knowledge of P. vivax epidemiology and biology may compromise its control. Genomic research can contribute greatly to enhancing our understanding of P. vivax basic biology and evolutionary history, supporting the development and surveillance of new interventions. Since the first characterisation of the P. vivax genome sequence (Sal-1, [), several population genetic studies, based on microsatellite data and more recently using whole genomes, have shown that this parasite is more polymorphic than P. falciparum [. P. vivax populations harbour high genetic diversity, even on small spatial scales, and can differ extensively between locations due to vector species, host genetics and environmental factors [. Genetic variation enables the parasite to overcome host immune responses and antimalarial drugs to establish persistent infections and increase transmission. Genomic studies in natural populations of P. vivax can pinpoint genetic regions that are under selective pressure, including those associated with resistance to antimalarial drugs. Such studies can also contribute to the identification of vaccine targets. Moreover, global genomic studies can assist with identifying sets of polymorphism private to populations, allowing the monitoring of gene flow over space and time, and the tracking of imported infections. By developing a molecular barcode of individual parasites it will also be possible to distinguish recrudescent from re-infections. Highly polymorphic microsatellites have been the preferred method of genetic analysis, revealing high levels of diversity and highlighting interesting genotypic patterns and geographical clustering across global populations [ The advancement of whole genome sequencing technologies has opened up opportunities to obtain a comprehensive picture of the epidemiology and structural variation of P. vivax. There is now the ability to perform genome-wide analysis of the various populations without the need for in vitro culture and overcoming difficulties with low parasitaemias and high human DNA contamination [ Recent studies using genome-wide SNPs highlighted that signals of natural selection suggest that P. vivax is evolving in response to antimalarial drugs and is adapting to regional differences in the human host and the mosquito vector [ Several other whole genome sequence studies have been published [, covering 10 countries. Using these and other data, we explore the genetic diversity within and between continents, identify signatures of drug pressure and molecular barcodes that could be useful for determining the source of infections and monitoring parasite populations.

Methods

Samples and sequence data

Publicly available whole genome sequence data for 74 P. vivax samples were gathered from multiple sources, and included reference strains (India VII, Mauritania X, North Korea II, Brazil I, Sal-1 from El Salvador (see [])), field and clinical isolates (Cambodia (n = 3) [], Thailand (n = 39) [], Madagascar (n = 3) [, Colombia (n = 8) [] and Peru (n = 11) [) and clinical samples from travellers (to Papua Indonesia (n = 2) [], India (n = 2) [], and Papua New Guinea (PNG, n = 6) []). All sequencing data for non-reference strains were generated using Illumina paired end technologies (read lengths ≥75bp). The raw sequence data were mapped to the Sal-1 reference genome (version 10.0) using bwa-mem with default parameters. SNPs (n = 447,232) were identified using the samtools software suite (samtools.sourceforge.net) with high quality scores (phred score >30, 1 error per 1 kbp). Genotypes were called using ratios of coverage, where the minimal heterozygous calls still present after filtering were converted to the majority genotype if the coverage ratio was 80:20 or greater [. SNPs were retained if they were biallelic, had low genotype missingness (<10%) and heterozygous (<0.4%) calls. SNPs in regions of extreme coverage and very low coverage were excluded, as well as in non-unique regions (using a k-mer approach with length of 54 bp) and highly polymorphic VIR genes. Two samples were found to have P. vivax and P. falciparum co-infections (ERR020124 and SRR828528), and were excluded from population genetic analysis. Isolates were retained if they had at least average 10-fold genome-wide coverage, and at most 10% missing genotype calls. The final high quality dataset consisted of 46 (62.2%) isolates (Thailand 22, Southeast Asia 24, South America 11; other 11; ) and 219,288 SNPs, and used as the basis of population genetic analyses. FreeC software (http://bioinfo-out.curie.fr/projects/freec/tutorial.html) was used to identify regions of the genome with a significant increase or decrease in read coverage identifying potential copy number variants (CNVs) after accounting for GC bias through coverage normalization. Regions identified as CNVs were inspected visually and assessed using de novo assembly methods [.

Population genetics

Genetic diversity was estimated using the average pairwise nucleotide diversity (π) with the R package "pegas". An in-house R script was used to compute the allele frequency-based Tajima's D test [ to identify genes under balancing selection in the individual populations (values > 2.5; [), this method was chosen over the dN/dS approach given the latter being not fit for analysis on individual populations [. To detect signals of directional selection, the integrated Haplotype Score (iHS) approach [ was applied to individual populations supported by a principal component analysis (PCA). This approach used the most frequent allele where mixed calls where found so the haplotype analysis will be based on the most abundant strain in each sample [ P-values for iHS were computed from standardised values based on a 2-tailed conversion from a Gaussian distribution [. The Salvador-I being the reference and oldest sample was used as ancestral haplotype. Multiplicity of infection was estimated using a novel method of counting the unique haplotypes formed by polymorphism on paired sequencing reads (estMOI, [). For comparisons between populations, we first applied PCA and neighbourhood joining tree clustering based on a matrix of pairwise identity by state values. These analyses were followed by applying the cross population long-range haplotype method (XP-EHH [ Rsb implementation [) and the population differentiation metric F [. P-values for the Rsb estimates were calculated using a Gaussian approximation [. A significance threshold of P < 0.001 was established for both iHS and Rsb using bootstrap- and permutation-based simulation approaches [ We used the ranked F statistics to identify the informative polymorphism for the barcoding of populations and driving the clustering observed in the PCA. Linkage disequilibrium (LD) was assessed in the two populations with the largest sample sizes (Thailand and South America) using the r and D’ metrics [, calculated for pairs of SNPs with different physical separation up to 2 kbp using a sliding window approach. The SNPs were annotated and effects of variants on genes (such as amino acid changes) were predicted using snpEFF software [ The R statistical package was used to analyse SNP data, including implementation of selection analyses using the “rehh” library.

Results

Genetic polymorphisms

The genomic coverage in the nuclear genome was high (median 103-fold, range (30-5973-fold), and in keeping with multiple organellar copies, the mitochondria and apicoplast coverage was 30-fold and 1.8 fold greater than the nuclear coverage. The density of SNPs in the nuclear genome (219,288 SNPs, 1 every 99 bp) was greater than in the mitochondrial (23 SNPs, 1 every 165 bp) and apicoplast genomes (176 SNPs, 1 every 165 bp) (). Although 60% of the annotated reference genome is coding (chromosomal range: 54%-64%), approximately half the SNPs in the isolates were found in genic regions (mean 48% per chromosome, range 43% to 52%) (). The proportion of non-synonymous sites is consistent with those found in other Plasmodium species, with 52% of coding SNP sites being non-synonymous in the nuclear genome, 36% in the mitochondrion and 56% in the apicoplast. The differences in these genomes suggest they may be subject to differential selective pressure [. The majority of SNPs are rare, with nearly half of the mutations (45%) being observed in single samples () as seen in other Plasmodium populations [ There was some evidence of polyclonality in 22 samples (Cambodia 1/2, Colombia 5/5, Madagascar 2/2, PNG 2/5, Thailand 11/22). (a) SNP locations by annotation*. (b) Minor allele frequency spectrum indicates a predominance of rare alleles. (c) Linkage disequilibrium (r2) decays rapidly with physical genetic distance. * established using snpEFF software. Analysis of structural variants and copy number variants was limited to Thai, Cambodian and Madagascan isolates, which had high and uniform genomic coverage. CNVs were located less than 1 kbp distance from the MDR1 gene (chromosome 10, PVX_080100) in Cambodian and Thai isolates (). Several MDR1 variants have previously been reported, some considered putative chloroquine- and mefloquine-resistance alleles [ At the MDR1 locus, we observed either a duplication of ~35kb (position 351kbp to 389kbp, n = 1, Thailand), a major deletion in the promoter region of the gene (n = 7, Thailand; n = 1 Cambodia), or a combination of both structural variants, including two copies, one with the deletion in the promoter and another copy with a complete promoter (n = 4, Thailand); as confirmed by the increase or decrease in coverage and accumulation of split reads in the regions where a break in the coverage occurs. The known duplication in the Duffy binding protein PvDBP (chr. 6: 974,000–982,000, PVX_110810) in Malagasy [ was confirmed in one of the two Madagascan isolates (SRR828416). The PvDBP gene is potentially involved in erythrocyte invasion, and the duplication was also observed in thirteen Thai isolates. A further duplication was observed in Pv-fam-e (a RAD gene, chr. 5: 895,000–900,000, n = 8, Thailand), a gene linked to P. vivax selectivity for young erythrocytes and/or immune evasion [

Assessing genetic diversity, LD and positive directional selection

The average polymorphism (pair-wise mismatches measured by nucleotide diversity π) was calculated by gene and chromosome. There was little difference across the chromosomes with mean 11.1x10-4 (range 6.0 x 10−4 to 19.0x10-4), which is consistent with other studies with similar sample size [ as well as larger datasets when restricted to high quality SNPs (1.5x10-3) [ LD decays rapidly for non-rare polymorphism (minor allele frequency ≥ 5%) within a few hundred base pairs, and reaches a baseline within 500bp in South American and Thai nuclear genomes (). Like P. falciparum, there is a high correlation between non-rare SNPs (median D’ 0.918, range 0.425–1) in the mitochondrial and apicoplast genomes supporting the inference that the organelles are co-inherited and supporting the view that these SNPs have potential utility for barcoding [ To examine evidence for signatures of positive natural selection we calculated the iHS metric in the Thailand and South America populations, informed by the population structure reported in . Five contiguous loci of strong positive directional selection were identified, including the MRP1 gene (PVX_097025) and its promoter region in Thailand, and a region surrounding the MRP2 gene (PVX_097025, P. falciparum homologue associated with primaquine and antifolate drug sensitivity [). Several surface proteins were identified in both populations, including the MSP7 and MSP3.1, which are thought to be under selection pressure due to their role in erythrocyte invasion and strong vaccine candidates and have been identified before by other studies using sanger sequencing [, ). In addition, some helicases showed strong signals of selection (PVX_088190 and PVX_111220) which were also detected in the same study [ reinforcing the method used. Furthermore, we identified in South America a proximal region of selection (chr14: 1,414,164–1,479,586) described elsewhere [ * |iHS| > 3. * |iHS| > 3.

Allele frequency spectrum and balancing selection

The allele frequency spectrum of different classes of nucleotide sites all show an excess of rare alleles, with coding, non-synonymous, synonymous and intergenic sites more skewed than expected under a Wright-Fisher model of constant population size [ This observation could indicate a population expansion in the recent past, where as a population grows in size, the frequency of rare alleles also increases [. The Tajima’s D method was applied to genes with at least five SNPs in the two main populations (Thailand 4,673 (91.0%) and South America 3,549 (70.0%) genes). The majority of Tajima’s D values were negative (Thailand 90.2%; South America 64.4%), reinforcing the presence of an excess of low frequency and singleton polymorphisms, potentially due to population expansion in the recent past or purifying selection. For Thailand, we identified 398 (8.5%) genes with positive Tajima’s D values, of which 14 were in excess of 2.5 and potentially under balancing selection (). Similarly, for South America, of the 1,260 (35.5%) values that were positive, 12 were in excess of 2.5 (). The loci under potential balancing selection in both populations encode proteins with predominantly roles surface proteins (e.g. MSPs) and antigens. The majority of the 26 genes identified in this study have had positive indices of balancing selection in previous studies [, or have orthologues in P. falciparum [ * Tajima’s D > 2.5 ** at least 5 SNPs per gene.

Population structure and evidence of differing directional selection in populations

Both a principal component and a neighbourhood joining tree analysis () revealed clustering by continent, in keeping with similar P. falciparum analyses [ The across population long-range haplotype method (Rsb implementation) was applied to compare Thailand to the South American population, to identify regions potentially under recent directional selection (). We detected several loci including at multidrug resistance-associated protein MRP1 (PVX_097025), and the CCR4-associated factor 1 (CAF1, PVX_123230) located within 20kb of DHPS (associated with resistance to sulfadoxine [). Five non-synonymous mutations were identified in the DHPS gene (M616T, P553A, P383A, P382R, P382A), with evidence that the P383A has driven toward fixation across all geographical regions. Except for mutation in codon 616, all the others have been previously reported [ The DHFR gene, associated with resistance to pyrimethamine (part of the SP drug combination), exhibited elevated Rsb (>3). Seven non-synonymous mutations were identified, including the previously described S58R and S117N [ that were fixed across populations, and F57I/L and T61M [ that were absent from South America (). No evidence was observed of a hard sweep around the MDR1 copy number gene. However, nine non-synonymous SNP mutations were identified, five of which have been reported previously. These included the fixed alleles T958M and M908L, F1076L at high frequency across populations, and G698S and S513R absent from South America [. There was no evidence of a sweep around the P. vivax orthologues of the falciparum chloroquine related CRT (pvcrt-o, PVX_087980) or GTP cyclohydrolase I folate pathway (GTPCH, PVX_123830) genes. No common non-synonymous mutations were identified within the CRT gene, while 7 low frequency non-synonymous SNPs where identified in the GTPCH locus. * Rsb > 3; genes in bold refer to loci related with mosquito life stages of the parasite or drug-resistance. The Rsb analysis also revealed loci associated with the diversity of vectors, including the P28 (PVX_111180) gene expressed in the surface of the ookinete stage during the mosquito part of the life cycle, pv47 (PVX_083240) and pv48/45 (PVX_083235) involved in the transmission of the parasite. There are continental-specific pv47 and pv48/45 SNPs (and haplotypes) as previously found [, consistent with the presence of different species of mosquito in each the regions [ resembling a similar pattern found in P. falciparum [

Towards molecular barcoding of P. vivax

The development of molecular barcode for P. vivax could ultimately assist with surveillance and disease control. Previous work [ has described a 42 SNP barcode to classify geographically P. vivax across 7 countries. Across the 46 isolates analysed here, we found 3 SNPs in the barcode to be either non-segregating or not passing quality control filtering. Use of the remaining 39 SNPs led to imperfect clustering by continent (). Application of the F population differentiation metric identified SNPs driving the observed differences between Thailand, South America and other populations (). These SNPs occurred in drug resistance loci, including MRP1 (PVX_097025), DHPS (PVX_123230) and UBP1 (PVX_081540) (all F > 0.72), and in close proximity (e.g. PVX_089960 within 8kb of DHFR). Population differentiation due to genetic diversity in drug resistant loci is also observed in P. falciparum [. Previous work has proposed the mitochondria and apicoplast organellar genomes as candidate regions for a barcode [ Genotyping of organellar markers would benefit from greater copy number and coverage as well as highly conserved sequences [. Eight markers across five apicoplast genes could differentiate Thai and Southeast Asian samples from the other isolates, and two non-genic markers were found to be exclusive to South America (all F>0.7, ). No informative mitochondrial markers were identified (all F<0.7). Further, as the organelle genomes are known to be highly conserved between Plasmodia species, when comparing a set of P. falciparum geographical markers [ to P. vivax sequences, we found evidence of positions close in the sequence. Two of the samples (ERR020124 and SRR828528) had a high density of mixed calls in the organellar genomes, in this case, a signature of P. falciparum overlaying onto P. vivax (). In general, this density signature is indicative of a co-infection of P. vivax with another Plasmodium spp. By comparing the sequencing reads to the Plasmodium knowlesi reference genome [ there was no evidence of any vivax and knowlesi co-infections. However, the presence of a unique triallelic SNP reinforces the potential for an organellar inter-plasmodia species barcode ().

Discussion

Several studies have previously described the genomic diversity of P. vivax populations using whole genome data, but with low sample sizes. Recently, two papers using a combined collection of over 400 isolates from 17 countries described major genomic diversity in Plasmodium vivax [. Here we analysed a complementary collection of 46 high quality isolates spanning 10 countries across 4 continents in order to position them within the context of this new work. As expected we confirmed that P. vivax genomic diversity is greater compared to P. falciparum, and even at a relatively low sample size, the samples clustered geographically. We reveal a wider genomic distance between South American and Southeast Asian continents than observed between P. falciparum African and Southeast Asian populations [, highlighted by the greater and more uniform distribution of SNPs with a high F across the genome. Hotspots of selection pressure were identified, including the previously reported MRP1, DHPS [ and other putative drug resistance genes, as well as several loci related with the mosquito stage of the parasite life cycle. The latter observation is consistent with recent work [ and the presence of different Anopheles species across continents. We identified structural variants, including extra copies and deletions in the promoter region of the MDR1 gene, a locus associated with multiple antimalarial drugs [ We also confirmed the duplication in the Duffy binding protein gene (PvDBP) in a Madagascan sample, and detected it in Thai isolates. This duplication has been found in parasites from several regions in Africa, South America and Asia [ Many of these locations are areas where Duffy-negative individuals make up >45% of the population. However other regions like Cambodia do not present Duffy-negative individuals [. It has been theorized that the duplication allows the parasite to infect Duffy negative individuals [, however more research is needed in this area. Microsatellite genotyping has been used previously to cluster geographically P. vivax isolates, and together with antigen genotyping identify mixed infections and extent of transmission, used as the basis of genetic epidemiology. In comparison, whole genome sequencing provides a higher specificity in the application of geographical clustering [ While other studies have focused on creating a barcode using the nuclear genome [, we also considered organelle genomes (mitochondrion and apicoplast), which are more stable over time, do not undergo recombination and are co-inherited [. The analysis revealed organellar markers that are potentially Southeast Asian and South American specific, and others that highlighted the presence of multi-species mixed infections. The sequencing of large numbers of isolates, beyond currently published samples sizes, will be required to establish robust intra- and inter-species organellar-based barcode. Such large-scale datasets across multiple regions will also serve to identify the high genomic diversity that lies within and between P. vivax populations, which could be exploited for biological insights, including elucidating drug resistance and invasion mechanisms, and ultimately measures of disease control.

Conclusion

This study has shown that genomic diversity that lies within and between P. vivax populations can be used to elucidate potential drug resistance and invasion mechanisms, as well as facilitate the molecular barcoding of the parasite for surveillance applications. Structural variants located around the a sample without a copy number variant or deletion (even coverage), (ii) a major deletion in the promoter region of the gene (n = 7); (iii) duplication of ~35kb (position 351kbp to 389kbp, n = 1); and (iv) a combination of both structural variants (ii) and (iii), including two copies, one with the deletion in the promoter and another copy with a complete promoter (n = 4, Thailand). The horizontal dashed line is average chromosomal coverage and the red outline encloses the promoter region of the MDR1 gene. (TIFF) Click here for additional data file. Intra-population evidence of directional selective pressure ( Thailand b) South America. iHS integrated haplotype score; see and for a summary of the hits. (TIFF) Click here for additional data file.

Principal component analysis based on 225k SNPs reveals strong clustering of isolates by continent.

(PNG) Click here for additional data file.

Identifying regions under directional selective pressure between Thailand and South America.

Blue line: |Rsb| > 3 (P<0.003); Red line represents a human GWAS cut-off; see for a summary of the hits. (PNG) Click here for additional data file.

Principal component analysis based on the previously characterised 42 barcoding SNPs* does not reveal strong population clustering.

* SNPs and genotypes are shown in (PNG) Click here for additional data file.

Signatures of a mixed species infection based on heterozygous calls in mitochondrial markers (positions: 3,736–3,935bp).

(PNG) Click here for additional data file.

The 46 study isolates.

(DOCX) Click here for additional data file.

The SNPs.

(DOCX) Click here for additional data file.

Non-synonymous mutations in candidate genes.

(DOCX) Click here for additional data file.

Previously characterised 42 barcoding SNPs* in the 46 study isolates.

(DOCX) Click here for additional data file.

Sites of population differentiation between Thailand and South America.

(DOCX) Click here for additional data file.

Population informative apicoplast variants.

(DOCX) Click here for additional data file.
Table 1

Regions under directional selective pressure in Thailand *.

ChrPosition / RangeMax iHSGeneAnnotation
12840005.083PVX_087910E3 ubiquitin-protein ligase, putative
13794303.511..
21484134.319.Promoter region MRP1
21581223.452PVX_097025multidrug resistance-associated protein 1, MRP1
45767734.289..
46298523.483PVX_003770merozoite surface protein 5 (MSP5)
56739393.566PVX_089575trafficking protein particle complex protein, TRAPPC2L
77787193.986..
713971815.829PVX_086903Plasmodium exported protein, unknown function
87666043.709PVX_095055Rh5 interacting protein, putative (RIPR)
89211043.451PVX_095235protein phosphatase inhibitor 2, putative
89271913.489PVX_095245hypothetical protein, conserved
89854543.778PVX_095305hypothetical protein, conserved
91071236.186PVX_090925protein kinase domain containing protein
93115943.590..
95265574.115PVX_091440hypothetical protein, conserved
1012226465.499PVX_097715hypothetical protein
1012255294.109..
1012616509.180.Promotor region MSP3.1
1012618523.982PVX_097670merozoite surface protein 3 (MSP3.1)
119261664.034..
127321153.531PVX_082735thrombospondin-related anonymous protein (TRAP)
12734223–7458604.901PVX_082730hypothetical protein, conserved
127465364.319..
127517735.473PVX_082710hypothetical protein
127523323.558..
127659296.849..
127667846.170PVX_082675merozoite surface protein 7 (MSP7)
128642183.561PVX_082510hypothetical protein
128657803.930PVX_082505CPW-WPC family protein, putative
1210202355.042..
1224755284.740..
1225408413.633PVX_118270serine/threonine protein kinase, putative
1226220923.446PVX_118345protein transport protein SEC7, (SEC7)
1226388744.900..
1226712993.501PVX_118380GTP-binding protein, putative
1227322683.486PVX_118460hypothetical protein, conserved
1430289865.168..

* |iHS| > 3.

Table 2

Regions under directional selective pressure in South America *.

ChrPosition / RangeMax iHSGeneAnnotation
14903693.020PVX_088190helicase, putative
16624013.050PVX_093585SF-assemblin, putative
2244790–15.316.Promoter region PVX_081315
32477913.813PVX_000860hypothetical protein, conserved
33726794.109PVX_000695hypothetical protein, conserved
45748313.029PVX_003830serine-repeat antigen 5 (SERA)
55606373.649PVX_089445RAD protein (Pv-fam-e)
510464823.358PVX_090020hypothetical protein, conserved
6627960–13.605PVX_111230hypothetical protein, conserved
7437942–604.773PVX_099005cysteine repeat modular protein 1, CRMP1
75276513.182PVX_099125pseudouridylate synthase, putative
71116251–25.712PVX_099915RNA-binding protein, putative
712141796.840PVX_087145nucleolar protein Nop52, putative
82193593.180PVX_094405hypothetical protein, conserved
97300343.611PVX_091700circumsporozoite-related antigen, EXP1
97518573.149.Promoter region PVX_091715
98298903.161PVX_091770calcium-dependent protein kinase 7, CDPK7
91042906–75.519PVX_0920356-phosphofructokinase, putative
911368733.110PVX_092160hypothetical protein, conserved
103805353.108PVX_080110G10 protein, putative
1010634323.592PVX_097895TBC domain containing protein
1012572513.291.Promoter region MSP3.2
101260441–25.201.1 Kb from MSP3.2
118227154.026PVX_114575transmembrane amino acid transporter protein
1119737086.074..
129554883.652PVX_082400myosin C, putative
1213171953.435PVX_116815hypothetical protein, conserved
132151353.283PVX_084350hypothetical protein, conserved
136116685.708PVX_084755hypothetical protein, conserved
13856226–304.519PVX_085030aspartyl protease, putative
1412758353.114PVX_123250aquaporin, putative (AQP2)
1416651913.572PVX_123685histone-lysine N-methyltransferase, SET10
1418758334.986PVX_123890hypothetical protein, conserved

* |iHS| > 3.

Table 3

Genetic regions under potential balancing selection pressure in South America (SA) and Thailand (T)*.

Chr.Gene StartGene EndTajima's DGene**AnnotationPopulation
15219785273873.265PVX_088235ferlin, putativeSA
3191873071514.166, 7.134PVX_001080hypothetical protein, conservedSA, T
42650182672162.928PVX_002785ATP-dependent acyl-CoA synthetaseT
45627555663744.871, 6.085PVX_003840serine-repeat antigen 3 (SERA)SA, T
45673135710934.944PVX_003835serine-repeat antigen 1 (SERA)T
45721725758523.36PVX_003830serine-repeat antigen 5 (SERA)T
45962836001924.224PVX_003805serine-repeat antigen (SERA), putativeSA
5129780813010103.279PVX_090285Pvstp1, putativeT
51345372135404715.907PVX_090325reticulocyte binding protein 2c (RBP2c)T
5135874813608205.762PVX_090330reticulocyte binding protein 2 (PvRBP-2)T
7115774211629975.593, 4.124PVX_099980merozoite surface protein 1 (MSP1)SA, T
9642478114.195PVX_090835hypothetical proteinT
1022046234602.925PVX_079700hypothetical protein, conservedT
1065101692502.793PVX_079750hypothetical protein, conservedT
10118763911889095.206PVX_09776060S ribosomal protein L31, RPL31SA
10121851212218452.939,5.585PVX_097720merozoite surface protein 3 (MSP3.10)SA, T
10127235412741933.869PVX_0976604-diphosphocytidyl-2-C-methyl- kinase, IspESA
10130638413081535.991PVX_097600hypothetical protein, conservedSA
127510417522045.578PVX_082710hypothetical proteinSA
1337121591813.876PVX_084160dynein heavy chain, putativeSA
131286181317514.099PVX_084260hypothetical protein, conservedSA
14304464430463392.773PVX_101575hypothetical protein, conservedT

* Tajima’s D > 2.5

** at least 5 SNPs per gene.

Table 4

Regions under directional selective pressure between Thailand and South America *.

ChrPosition/RangeRsbGeneAnnotation
2145708–15160611.80.Promoter region MRP1
2154067–1581225.230PVX_097025multidrug resistance-associated protein 1, MRP1
2175191–1768033.717PVX_081215hypothetical protein, conserved
4914575.879PVX_002550hypothetical protein, conserved
4607568–6078374.457PVX_003795serine-repeat antigen (SERA)
4629831–6301205.205PVX_003770merozoite surface protein 5
5113273610.20PVX_090105holo-[acyl-carrier-protein] synthase, putative (ACPS)
59647713.624PVX_089950bifunctional dihydrofolate reductase-thymidylate synthase, DHFR-TS
6199049–1991654.703PVX_001850hypothetical protein
6605656–6081193.788PVX_111260hypothetical protein, conserved
6635433–6355394.406PVX_111220RNA helicase, putative
66618166.048PVX_11118028 kDa ookinete surface protein, (P28)
71396929–13969614.708.Promoter region PVX_086903
713971814.700PVX_086903Plasmodium exported protein, unknown function
8219359–2202515.257PVX_094405hypothetical protein, conserved
81417014–14170384.406PVX_119515hypothetical protein, conserved
815332223.214PVX_119360hypothetical protein
9419318–4196194.971.Promoter region PVX_091307
9920056–9201664.676PVX_091880hypothetical protein, conserved
910489903.304PVX_092040geranylgeranyl pyrophosphate synthase (GGPPS)
912298333.296PVX_092275apical membrane antigen 1 (AMA1)
101251585–12572517.094.Promoter region MSP3.2
101257754–12578156.617PVX_097675merozoite surface protein 3 (MSP3.2)
1115172693.234PVX_1137756-cysteine protein (P12)
111223546–12237903.816.Promoter region PVX_114125
111383108–13831556.010PVX_113925hypothetical protein, conserved
122869603.227PVX_0832406-cysteine protein (P47)
13141889–1422865.680PVX_084280hypothetical protein, conserved
13620154–6202615.922.Promoter region PVX_084770
13731328–7925225.375PVX_084860hypothetical protein, conserved
131034635–10347184.101PVX_085235hypothetical protein
1310427745.406PVX_085245hypothetical protein, conserved
1315531134.126PVX_085835hypothetical protein, conserved
141231525–12315286.056PVX_123205CCR4-associated factor 1, (CAF1)
1414298743.903PVX_123415adrenodoxin-type ferredoxin, putative

* Rsb > 3; genes in bold refer to loci related with mosquito life stages of the parasite or drug-resistance.

  52 in total

1.  Next-Generation Sequencing of Plasmodium vivax Patient Samples Shows Evidence of Direct Evolution in Drug-Resistance Genes.

Authors:  Erika L Flannery; Tina Wang; Ali Akbari; Victoria C Corey; Felicia Gunawan; A Taylor Bright; Matthew Abraham; Juan F Sanchez; Meddly L Santolalla; G Christian Baldeviano; Kimberly A Edgel; Luis A Rosales; Andrés G Lescano; Vineet Bafna; Joseph M Vinetz; Elizabeth A Winzeler
Journal:  ACS Infect Dis       Date:  2015-08-03       Impact factor: 5.084

2.  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Authors:  F Tajima
Journal:  Genetics       Date:  1989-11       Impact factor: 4.562

3.  Assessment of the origins and spread of putative resistance-conferring mutations in Plasmodium vivax dihydropteroate synthase.

Authors:  Vivian N Hawkins; Stephanie M Suzuki; Kanchana Rungsihirunrat; Hapuarachchige C Hapuarachchi; Amanda Maestre; Kesara Na-Bangchang; Carol Hopkins Sibley
Journal:  Am J Trop Med Hyg       Date:  2009-08       Impact factor: 2.345

4.  Sensitivity to antifolates and genetic analysis of Plasmodium vivax isolates from Thailand.

Authors:  Kanchana Rungsihirunrat; Kesara Na-Bangchang; Vivian N Hawkins; Mathirut Mungthin; Carol Hopkins Sibley
Journal:  Am J Trop Med Hyg       Date:  2007-06       Impact factor: 2.345

Review 5.  Genetics in geographically structured populations: defining, estimating and interpreting F(ST).

Authors:  Kent E Holsinger; Bruce S Weir
Journal:  Nat Rev Genet       Date:  2009-09       Impact factor: 53.242

6.  Independent Origin and Global Distribution of Distinct Plasmodium vivax Duffy Binding Protein Gene Duplications.

Authors:  Jessica B Hostetler; Eugenia Lo; Usheer Kanjee; Chanaki Amaratunga; Seila Suon; Sokunthea Sreng; Sivanna Mao; Delenasaw Yewhalaw; Anjali Mascarenhas; Dominic P Kwiatkowski; Marcelo U Ferreira; Pradipsinh K Rathod; Guiyun Yan; Rick M Fairhurst; Manoj T Duraisingh; Julian C Rayner
Journal:  PLoS Negl Trop Dis       Date:  2016-10-31

7.  Chloroquine resistant Plasmodium vivax: in vitro characterisation and association with molecular polymorphisms.

Authors:  Rossarin Suwanarusk; Bruce Russell; Marina Chavchich; Ferryanto Chalfein; Enny Kenangalem; Varakorn Kosaisavee; Budi Prasetyorini; Kim A Piera; Marion Barends; Alan Brockman; Usa Lek-Uthai; Nicholas M Anstey; Emiliana Tjitra; François Nosten; Qin Cheng; Ric N Price
Journal:  PLoS One       Date:  2007-10-31       Impact factor: 3.240

8.  Multiple origins of resistance-conferring mutations in Plasmodium vivax dihydrofolate reductase.

Authors:  Vivian N Hawkins; Alyson Auliff; Surendra Kumar Prajapati; Kanchana Rungsihirunrat; Hapuarachchige C Hapuarachchi; Amanda Maestre; Michael T O'Neil; Qin Cheng; Hema Joshi; Kesara Na-Bangchang; Carol Hopkins Sibley
Journal:  Malar J       Date:  2008-04-28       Impact factor: 2.979

9.  Whole genome sequencing of field isolates reveals a common duplication of the Duffy binding protein gene in Malagasy Plasmodium vivax strains.

Authors:  Didier Menard; Ernest R Chan; Christophe Benedet; Arsène Ratsimbasoa; Saorin Kim; Pheaktra Chim; Catherine Do; Benoit Witkowski; Remy Durand; Marc Thellier; Carlo Severini; Eric Legrand; Lise Musset; Bakri Y M Nour; Odile Mercereau-Puijalon; David Serre; Peter A Zimmerman
Journal:  PLoS Negl Trop Dis       Date:  2013-11-21

10.  Whole Genome Sequencing of Field Isolates Reveals Extensive Genetic Diversity in Plasmodium vivax from Colombia.

Authors:  David J Winter; M Andreína Pacheco; Andres F Vallejo; Rachel S Schwartz; Myriam Arevalo-Herrera; Socrates Herrera; Reed A Cartwright; Ananias A Escalante
Journal:  PLoS Negl Trop Dis       Date:  2015-12-28
View more
  15 in total

Review 1.  Molecular approaches to determine the multiplicity of Plasmodium infections.

Authors:  Daibin Zhong; Cristian Koepfli; Liwang Cui; Guiyun Yan
Journal:  Malar J       Date:  2018-04-23       Impact factor: 2.979

2.  Plasmodium knowlesi as a model system for characterising Plasmodium vivax drug resistance candidate genes.

Authors:  Lisa H Verzier; Rachael Coyle; Shivani Singh; Theo Sanderson; Julian C Rayner
Journal:  PLoS Negl Trop Dis       Date:  2019-06-03

3.  Whole genome sequencing of amplified Plasmodium knowlesi DNA from unprocessed blood reveals genetic exchange events between Malaysian Peninsular and Borneo subpopulations.

Authors:  Ernest Diez Benavente; Ana Rita Gomes; Jeremy Ryan De Silva; Matthew Grigg; Harriet Walker; Bridget E Barber; Timothy William; Tsin Wen Yeo; Paola Florez de Sessions; Abhinay Ramaprasad; Amy Ibrahim; James Charleston; Martin L Hibberd; Arnab Pain; Robert W Moon; Sarah Auburn; Lau Yee Ling; Nicholas M Anstey; Taane G Clark; Susana Campino
Journal:  Sci Rep       Date:  2019-07-08       Impact factor: 4.379

4.  Evolution of the Plasmodium vivax multidrug resistance 1 gene in the Greater Mekong Subregion during malaria elimination.

Authors:  Huguette Gaelle Ngassa Mbenda; Meilian Wang; Jian Guo; Faiza Amber Siddiqui; Yue Hu; Zhaoqing Yang; Veerayuth Kittichai; Jetsumon Sattabongkot; Yaming Cao; Lubin Jiang; Liwang Cui
Journal:  Parasit Vectors       Date:  2020-02-12       Impact factor: 3.876

5.  A molecular barcode to inform the geographical origin and transmission dynamics of Plasmodium vivax malaria.

Authors:  Ernest Diez Benavente; Monica Campos; Jody Phelan; Debbie Nolder; Jamille G Dombrowski; Claudio R F Marinho; Kanlaya Sriprawat; Aimee R Taylor; James Watson; Cally Roper; Francois Nosten; Colin J Sutherland; Susana Campino; Taane G Clark
Journal:  PLoS Genet       Date:  2020-02-13       Impact factor: 5.917

6.  Distinctive genetic structure and selection patterns in Plasmodium vivax from South Asia and East Africa.

Authors:  Ernest Diez Benavente; Emilia Manko; Jody Phelan; Monica Campos; Debbie Nolder; Diana Fernandez; Gabriel Velez-Tobon; Alberto Tobón Castaño; Jamille G Dombrowski; Claudio R F Marinho; Anna Caroline C Aguiar; Dhelio Batista Pereira; Kanlaya Sriprawat; Francois Nosten; Robert Moon; Colin J Sutherland; Susana Campino; Taane G Clark
Journal:  Nat Commun       Date:  2021-05-26       Impact factor: 14.919

7.  Novel insights from the Plasmodium falciparum sporozoite-specific proteome by probabilistic integration of 26 studies.

Authors:  Lisette Meerstein-Kessel; Jeron Venhuizen; Daniel Garza; Nicholas I Proellochs; Emma J Vos; Joshua M Obiero; Philip L Felgner; Robert W Sauerwein; Marynthe Peters; Annie S P Yang; Martijn A Huynen
Journal:  PLoS Comput Biol       Date:  2021-04-30       Impact factor: 4.475

8.  Analysis of nuclear and organellar genomes of Plasmodium knowlesi in humans reveals ancient population structure and recent recombination among host-specific subpopulations.

Authors:  Ernest Diez Benavente; Paola Florez de Sessions; Robert W Moon; Anthony A Holder; Michael J Blackman; Cally Roper; Christopher J Drakeley; Arnab Pain; Colin J Sutherland; Martin L Hibberd; Susana Campino; Taane G Clark
Journal:  PLoS Genet       Date:  2017-09-18       Impact factor: 5.917

9.  Genomic analysis of a pre-elimination Malaysian Plasmodium vivax population reveals selective pressures and changing transmission dynamics.

Authors:  Sarah Auburn; Ernest D Benavente; Olivo Miotto; Richard D Pearson; Roberto Amato; Matthew J Grigg; Bridget E Barber; Timothy William; Irene Handayuni; Jutta Marfurt; Hidayat Trimarsanto; Rintis Noviyanti; Kanlaya Sriprawat; Francois Nosten; Susana Campino; Taane G Clark; Nicholas M Anstey; Dominic P Kwiatkowski; Ric N Price
Journal:  Nat Commun       Date:  2018-07-03       Impact factor: 14.919

10.  Whole genome sequencing of Plasmodium vivax isolates reveals frequent sequence and structural polymorphisms in erythrocyte binding genes.

Authors:  Anthony Ford; Daniel Kepple; Beka Raya Abagero; Jordan Connors; Richard Pearson; Sarah Auburn; Sisay Getachew; Colby Ford; Karthigayan Gunalan; Louis H Miller; Daniel A Janies; Julian C Rayner; Guiyun Yan; Delenasaw Yewhalaw; Eugenia Lo
Journal:  PLoS Negl Trop Dis       Date:  2020-10-12
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.