Literature DB >> 24069480

Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

Surendra K Prajapati1, Hema Joshi, Jane M Carlton, M Alam Rizvi.   

Abstract

The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75) from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years) and long-term population history (79,235 to 104,008) of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes) to that inferred from mitochondrial genome diversity.

Entities:  

Mesh:

Year:  2013        PMID: 24069480      PMCID: PMC3777877          DOI: 10.1371/journal.pntd.0002425

Source DB:  PubMed          Journal:  PLoS Negl Trop Dis        ISSN: 1935-2727


Introduction

Plasmodium vivax is the most prevalent malaria species outside Africa, causing widespread morbidity that can be severe and fatal [1], [2], [3], [4], [5], [6]. The fixation of the Duffy negativity trait (P. vivax resistance factor) in tropical African populations restricts infection by this parasite there [7]. P. vivax is distantly related to the more virulent Plasmodium falciparum, as inferred from mitochondrial and nuclear DNA variation [8], [9], [10], [11]; the former appears to have evolved from a monkey malaria parasite between 20–35 million years ago via host switching [12]. Various studies on the age and origin of extant P. vivax strongly support it as an ancient human malaria parasite that evolved in Southeast Asia [12], [13], [14], [15]. Single nucleotide polymorphisms (SNPs) have been used by several researchers to elucidate the population history of P. falciparum and P. vivax [12], [16], [17], [18]. These studies established the importance of using selectively neutral loci for determining the molecular age, origin, and evolutionary history of the parasites, assuming a “molecular clock” hypothesis. Many population studies have exploited polymorphisms in two kinds of neutral loci: SNPs in putative housekeeping genes, and repeat polymorphisms in tandem repeats such as microsatellites [19], [20], [21]. Genome sequencing of P. vivax has revealed a higher SNP density [22] and fewer microsatellite markers than in P. falciparum [23], [24]. India contributes about 78% of the total malaria cases in South Asia, and P. vivax accounts for 50–55% of this [25]. The paucity of information about the age and evolutionary history of P. vivax in Indian populations [15], [26] makes such investigations highly relevant. In this paper, we use an evolutionary genetics approach to unravel the evolutionary history of P. vivax on the Indian subcontinent. We have developed putative housekeeping gene, microsatellite, and minisatellite markers for investigating the population and evolutionary history of P. vivax on the Indian subcontinent.

Materials and Methods

Identification of putative housekeeping genes

Putative housekeeping genes were identified from the P. vivax genome database PlasmoDB (http://www.plasmodb.org). The major parameters used for selecting housekeeping genes from PlasmoDB were constitutive expression of gene, presence of orthologs in other malaria parasites, and an assigned metabolic function. For the detection of SNPs in putative housekeeping genes, both exons and introns from each gene were selected. PCR primers were manually designed and the location of each primer is shown in .
Figure 1

Schematic representation of Plasmodium vivax housekeeping genes and location of primers.

Red box, exon; gray box, intron; green arrow, forward primer; black arrow, reverse primer; number indicates start and end of respective gene. Abbreviations: L35e, Ribosomal protein l35e; ACP, Acyl carrier protein; Rpol, RNA polymerase-II; Dg, DNA gyrase; L34a, Ribosomal protein l34a; Stpk, Serine threonine protein kinase; Cdpk, Calcium dependent protein kinase; Ed, Exonuclease domain and Ac, Adenylate cyclase.

Schematic representation of Plasmodium vivax housekeeping genes and location of primers.

Red box, exon; gray box, intron; green arrow, forward primer; black arrow, reverse primer; number indicates start and end of respective gene. Abbreviations: L35e, Ribosomal protein l35e; ACP, Acyl carrier protein; Rpol, RNA polymerase-II; Dg, DNA gyrase; L34a, Ribosomal protein l34a; Stpk, Serine threonine protein kinase; Cdpk, Calcium dependent protein kinase; Ed, Exonuclease domain and Ac, Adenylate cyclase.

Plasmodium vivax isolates and DNA extraction

We analyzed 100 P. vivax field isolates collected between 2003–2006 from Delhi, Nadiad (Gujarat), Panna (Madhya Pradesh), Chennai (Tamil Nadu), and Kamrup (Assam), a total of twenty samples from each site (Figure S1). Human populations at these study sites do not have the Duffy negative trait [27]. Other details about the study sites such as parasite and vector species prevalence and disease transmission patterns are given in the Text S1 and reported elsewhere [28], [29]. DNA extraction was as described in reference [28], [29].

Ethics statement

The ethics committee of the National Institute of Malaria Research approved the study protocol. All subjects provided informed consent, with children providing consent via a parent or guardian.

PCR amplification, sequencing, and sequence analysis

PCR amplification reactions were carried out in a final volume of 20.0 µl that included 1–2 µl (∼3–5 ng) template DNA, 10 pM for each primer, and 2× Master Mix (Promega). PCR primers and the protocols used for amplification of the housekeeping genes are given in . PCR products were purified and sequenced commercially (Macrogen Inc, Seoul, Korea, http://dna.macrogen.com). For each gene, both forward and reverse primers were used in DNA sequencing. DNA Lasergene software (DNA Star Inc., USA) was used for editing raw DNA sequences (EditSeq Module), and sequences alignment (Clustal W module). Each mutation observed in a sample was confirmed by both forward and reverse sequences.

Single clone infection typing

As multi-clone isolates could hamper correct genotyping and lead to over-estimating genetic diversity in a multi-locus genotyping study, Pvmsp-3α PCR-RFLP analysis was used to identify single- and multi-clone infections [30]; only single clone samples (n = 100) were used for genotyping.

Tandem repeat genotyping

Tandem Repeat Finder version 4.00 [31] was used to identify microsatellites from the P. vivax genome. The characteristics of microsatellites are listed in . Each forward primer was modified with fluorescence dye, 6FAM, TET and HEX for accurate microsatellite allele sizing. Amplified PCR products were multiplexed with two different dyes and run on a DNA sequencer (ABI 3730: Applied Biosystems Inc. USA). Peak Scanner (Applied Biosystems, Foster City, CA) was used for microsatellite allele sizing.

Measures of genetic diversity and neutrality

Genetic diversity was measured in the P. vivax population using parameters including nucleotide diversity (π), haplotype diversity (Hd) and average number of nucleotide differences (K) using DnaSP version 4.10 [32]. Microsatellite genetic diversity was measured by calculating expected heterozygosity (He) for each locus using MicroSatellite Analyzer (MSA) version 4.00 [33]. Expected heterozygosity (He) was defined as [n/(n−1)] [1−Σpi 2], where n is the number of isolates analyzed and pi is the frequency of the ith allele in the population.

Genetic differentiation test

We estimated genetic differentiation between five geographical populations of P. vivax in the Indian subcontinent using DnaSP version 4.10 [32]. This software calculated chi-square (χ2) statistics of housekeeping gene haplotype frequencies between populations [34]. The significant P-value (<0.05) of χ2 statistics rejects null hypothesis. The null hypothesis assumes that all populations are genetically undifferentiated.

Test of neutrality

We calculated D statistics [35], [36] and ratio of dN/dS to detect signature of selection at putative housekeeping genes using DnaSP version 4.10 [32]. The number of segregating sites was used for inferring D test statistics. To establish the nature of selection at codons of putative housekeeping genes, we assessed the ratio of the rates of non-synonymous and synonymous substitutions.

Estimation of divergence, TMRCA, and effective population size

The divergence time of human and monkey has been deduced as 30–35 million years ago (mya) [37], [38] which is consistent with 11–41 mya divergence time between P. vivax and nonhuman primate malarias [8]. We presumed a 23–30 mya divergence time between P. vivax and P. knowlesi that is coincident with their host divergence time [37], [38]; this divergence time was used in the estimation of mutation rate and synonymous divergence rate by comparing orthologous putative housekeeping genes of P. vivax and P. knowlesi. In addition, the recent divergence estimates between P. vivax and P. knowlesi (2 to 3 mya and 3.8 to 6.3 mya) was assumed on the basis of radiation of Asian macaques [13], [39]. These divergence estimates were also used for the calculation of the above molecular rates. We estimated the Time to Most Recent Common Ancestor (TMRCA) and effective population size using SNPs present in putatively selectively neutral housekeeping genes. The TMRCA of P. vivax was determined by the equation dS = 2 μ s×Tw, where dS is synonymous substitution rate at synonymous site, μ s is synonymous mutation rate per site per year and Tw is TMRCA [18]. Synonymous mutation rate was determined by the equation Ks = 2 μ s×Tb+dS [18], where Ks is the divergence at synonymous sites between species (P. vivax and P. knowlesi), dS is the nucleotide diversity (polymorphism) within species (P. vivax), μ s is synonymous mutation rate per generation and Tb is time of the divergence between species (P. vivax and P. knowlesi). Effective population size of extant P. vivax was determined by the equation θ = 4Ne μ (for diploid organisms) and θ = 2Ne μ (for haploid and mitochondrial genomes), where θ (theta) is the mutation parameter, Ne the effective population size, and μ is the mutation rate per site per year.

Bottleneck/stable population detection

Heterozygosity deficiency and mode shift analyses were used to detect evidence of a recent population bottleneck using BOTTLENECK software 1.2.02 [40]. In brief, a recently bottlenecked population would show higher observed gene diversity than the expected equilibrium gene diversity (Heq) which was computed from the observed number of alleles (k), under the assumption of a constant-size (equilibrium) population [41]. A two-phase mutation model (TPM) was used for heterozygosity deficiency analysis at mini- and microsatellite loci. TPM is an intermediate between the stepwise mutation model (SMM) and the infinite allele model (IAM), mostly consisting of one-step mutations having a small percentage (5–10%) of multi-step changes, as per BOTTLENECK software [41]. The allele frequency distribution analysis (mode shift analysis) showed whether it was approximately L-shaped (as expected under mutation-drift equilibrium) or not (recent bottlenecks provoke a mode shift). Bottleneck analysis can be performed using tandem repeat polymorphism data; the minisatellite polymorphism data being used in this study has been taken from earlier published work [28]. We also used network analysis of tandem repeat polymorphisms to detect possible signatures of bottlenecked or stable populations [42]. Two minisatellites located on a single chromosome (Chr 6), were used to construct reduced-median (RM) network of haplotypes using Network software [42]. In brief, a star-like network indicates a recent population expansion, whereas the absence of a star-like network indicates an ancient population expansion.

Estimation of expected and observed pairwise differences

The estimation of expected and observed frequencies of pairwise differences at nucleotide sites provides a signature of whether a population is stable or has recently experienced a population bottleneck [36]. In summary, an L-shaped curve between observed and expected allele frequencies spectrum indicates a stable population whereas a non L-shaped curve suggests a population bottleneck. The pairwise nucleotide difference at individual housekeeping genes was estimated using DnaSP version 4.10 [32].

Accession numbers

Accession numbers for alleles of all housekeeping genes are: HM047879–HM047972 (ACPex), HM047973–HM048020 (ACPin), HM048021–HM048117 (DNA gyrase), HM048118–HM048217 (Exonuclease domain), HM048118–HM048217 (Enolase), HM048318–HM048418 (L34a), HM048419–HM048518 (L35e), HM048619–HM048711 (RNA polymerase-II), HM048712–HM048811 (Stpk), and HM048812–HM048829 (Ac).

Results

Screening the genome for SNPs and tandem repeats

We screened the P. vivax Salvador 1 reference genome and identified ten putative housekeeping genes, and ten minisatellites and eight microsatellites, to infer the evolutionary history of this parasite in the Indian subcontinent. The selected housekeeping genes are exonuclease domain (ed), adenylate cyclase (ac), serine/threonine protein kinase (stpk), acyl carrier protein (acp), ribosomal protein l35e (l35e), 60S ribosomal protein l34a (l34 a), DNA gyrase (dg), calcium-dependent protein kinase (cdpk), enolase and RNA polymerase-II (Rpol). The structure of the housekeeping genes (arrangement of introns and exons), gene identifier and fragments used for SNP identification are given in and . Approximately 400 to 600 bp of intron and exon sequence were used to identify SNPs. However, introns were not present in two housekeeping genes (ac and ed), therefore, partial exon sequence of these were used. The selected minisatellites and microsatellites are located at six different chromosomes and their features are listed in . Among the eight microsatellites, a single microsatellite (Gomez_1) was reported earlier by Gomez et al [43].
Table 1

Mutations in housekeeping genes from 100 Indian Plasmodium vivax isolates.

Housekeeping geneGene IDFragment size (bp)Coding sites (bp)Non-coding sites (bp)Segregating sitesSynonymous substitutionNon-Synonymous substitutionNon-coding substitutionTotal changesHaplotype diversity (Hd)Nucleotide diversity (π)
Ribosomal protein l35e PVX_09186551817034812255130.6650.00188
Exonuclease Domain PVX_0036603003000110010.3660.00119
Adenylate Cyclase PVX_0925354964960321030.6140.00146
Acyl Carrier Protein PVX_114420133670663010334100.4940.00083
Serine threonine protein kinase PVX_1134351103619484402240.1690.00016
Ribosomal protein l34a PVX_0880651134586548700770.8640.00132
RNA Polymerase-II PVX_122080984532452901890.6770.00106
Enolase PVX_0950151160557603202513200.8370.00147
DNA Gyrase PVX_1237951162507655733170.7410.00092
Calcium dependent protein kinase PVX_119610110580230220022NDND
Total9298527540227513204276

ND: not determined.

ND: not determined.

Putative housekeeping gene polymorphisms

Ten housekeeping genes were successfully amplified and sequenced to at least two-fold coverage. Analysis of the sequences revealed size polymorphisms in four of the housekeeping genes and two to six size variants were observed. These size variants were due to indels (l35e) or tandem repeat variation (l34a, DNA gyrase) in the gene's repeat region (). Analyzing 9,298 nucleotide sites revealed a substantial number of SNPs (n = 75) in both coding and non-coding regions (Table 1). The number of synonymous, non-synonymous and non-coding SNPs observed, are listed in . On average, putative housekeeping genes had a low level of nucleotide (π = 0.002076±0.00059) and haplotype diversity (Hd = 0.5894±0.108) in P. vivax field isolates. Of the 75 SNPs identified, 55 were putatively selectively neutral (13 synonymous and 42 non-coding) and 20 were non-synonymous ( ). Sequencing of the Pvcdpk gene revealed presence of tandem repeats in the central region of gene, which caused difficulty in aligning the sequences correctly, and was therefore excluded from further analysis.

Putative housekeeping genes are selectively neutral

The neutrality of housekeeping genes was inferred from the results of three tests. 1) The distribution of SNPs among exons (n = 33) and introns (n = 42) of putative housekeeping genes seems to be random (χ2 = 0.572, p = 0.449) suggesting a lack of functional constraint on exons. 2) By Tajima's D test, none of the putative housekeeping genes showed signs of significant deviation from neutrality ( ); however, Fu and Li's D test showed biased singleton mutation distribution in two of the nine putative housekeeping genes; these two genes were enolase (D = −2.920 p = <0.004) and l35e (D = −3.617, p = <0.004). 3) The dN/dS ratio showed a signature of purifying selection for two putative housekeeping genes (l35e and dg), whereas mutations in the remaining genes were selectively neutral ( ). Further, none of the housekeeping genes showed significant departure from neutrality by the above three tests conducted together. Thus, these tests suggest that these housekeeping genes are primarily evolving in a neutral fashion and therefore can be employed as genetic markers for inferring population and evolutionary history of P. vivax.
Table 2

Neutrality test and mutation parameter of Plasmodium vivax housekeeping genes on the Indian subcontinent.

GeneTajima's DFu & Li's DdN±SE (10−3)dS±SE (10−3)dN = / = dS Theta (θ) per siteNucleotide substitutions rates in PvDivergence rates between Pv & Pk
dSdNKsKa
ed 1.0270.4910.002.0±2.01.0050.00064* 0.000.002±0.0020.43340.1181
ac −0.421−0.8047.0±5.00.00−1.1510.001730.000.001±0.0010.33990.0410
acp-in −0.4341.006 - --0.001260.000.0000.29070.0257
acp-ex −1.36−1.7262.0±2.00.00−0.9550.001660.000.001±0.0010.42020.1423
l35e −1.543 −2.92 0.001.0±1.0 2.074 0.004480.004±0.0030.0000.37980.0134
l34a 0.231−0.5570.000.000-0.00120.000.0025±0.0020.23320.0068
stpk −1.487−1.4370.000.0001.4180.0007* 0.002±0.0020.0000.25750.0320
enolase −1.624 −3.617 4.0±3.00.000−1.0350.003320.0000.0000.66430.5624
RNA pol −1.031−1.6510.002.5±2.00.9920.001790.007±0.0050.0000.54960.0867
dg −0.501−0.5470.001.0±1.0 2.061 0.00117--0.69660.7130
Mean0.002076±0.000590.004±0.0010.0016±0.00030.39650.1142

Boldface values are statistically significant (p<0.05), *excluded in averaging θ, Pv: Plasmodium vivax, Pk: P. knowlesi, dS: synonymous substitution rate, dN: non-synonymous substitution rate, Ks: synonymous divergence rate, Ka: non synonymous divergence rate.

Boldface values are statistically significant (p<0.05), *excluded in averaging θ, Pv: Plasmodium vivax, Pk: P. knowlesi, dS: synonymous substitution rate, dN: non-synonymous substitution rate, Ks: synonymous divergence rate, Ka: non synonymous divergence rate.

Lack of geographical structuring between populations

To understand whether P. vivax isolates are structured in populations, we undertook a genetic differentiation test using individual housekeeping genes. Our null hypothesis is that populations of P. vivax in the Indian subcontinent are genetically un-differentiated. Analysis of seven of the eight housekeeping genes accepted this hypothesis, while only DNA gyrase rejected the null hypothesis ( ). Since DNA gyrase does not appear to be neutral by neutrality tests ( ), it may not be ideal for measuring genetic differentiation between P. vivax populations. We confirmed the suitability of using chi-square statistics for genetic differentiation by determining the critical value of haplotype diversity (<0.95) [34]. The haplotype diversity measured at each housekeeping gene was lesser than the critical value (Hd = 0.169–0.864) ( ), confirming suitability of the use of chi-square statistics for genetic differentiation. Based on the seven selectively neutral housekeeping genes, it appears that different populations of P. vivax are not geographically structured in India. Thus, P. vivax isolates collected from different geographical regions of the Indian subcontinent will not have an adverse impact on the pooled analyses of TMRCA, effective population size, and population bottleneck.
Table 3

Genetic differentiation test between Plasmodium vivax populations in India.

Housekeeping genesSample size (N)No of HaplotypesHaplotype diversity (Hd = <0.95)Chi square (χ2)Degree of freedom P value
acpex 9570.42525.241240.392
dg 98110.74168.88840 0.003
ed 10020.3664.29140.368
enolase 100220.837104.361840.065
l34a 100100.86451360.053
l35e 100140.66568.006520.067
RNA pol-II 94120.67748.65440.291
stpk 10050.16912.709120.39

Boldface values indicate significant genetic differentiation.

Boldface values indicate significant genetic differentiation.

Estimation of synonymous divergence rate and mutation rate

Since P. vivax is a close relative of P. knowlesi and the genome sequence of the latter has been determined [44], orthologous genes from P. knowlesi could be used to determine the genetic distance (divergence) between P. knowlesi and P. vivax and the pattern of evolutionary pressure (positive or negative selection) on these putative housekeeping genes. An approximately tenfold higher synonymous divergence rate over the non-synonymous divergence rate was observed between P. vivax and P. knowlesi ( ), suggesting that the putative housekeeping genes are evolving strictly under purifying selection. The average synonymous divergence and mutation rate for P. vivax was determined under various divergence time scenarios between the P. knowlesi and P. vivax. The estimated rates of synonymous divergence and mutation are given in . Based on this range, the synonymous divergence rate (per site per year) for the putative housekeeping genes was calculated as 8.61×10−9 (μsa) and 6.60×10−9 (μsb) under 23 mya and 30 mya divergence scenarios, respectively. Similarly, the mutation rates (per site per year) under the 23 mya and 30 mya divergence scenarios were determined to be 6.51×10−9 (μa) and 4.99×10−9 (μb), respectively. In contrast, few studies suggested relatively recent divergence time of P. vivax viz 2–3 mya [10] and 3.8–7 mya [13] and based on these divergence time mutation rates were also estimated. Under recent divergence time scenario, synonymous divergence rate (μ s) and mutation rate (μ) are calculated as above, and listed in .
Table 4

Estimation of mutation and synonymous divergence rates in Plasmodium vivax.

Divergence time scenarioAverage mutation rate per siteLower limits of mutation rate/site/year (μa)Upper limits of mutation rate/site/year (μb)Average synonymous divergence rate per siteLower limits of synonymous divergence rate/site/year (μsa)Upper limits of synonymous divergence rate/site/year (μsb)
[37] 23–30 mya0.149856.51×10−9 4.99×10−9 0.39658.61×10−9 6.60×10−9
[39] 2–3 mya7.49×10−8 4.99×10−8 9.90×10−8 6.60×10−8
[13] 3.8–6.3 mya3.94×10−8 2.37×10−8 5.21×10−8 3.14×10−8

TMRCA and effective population size (Ne) estimates

We determined a range of TMRCA for extant P. vivax in India that indicate for recent (20,202 to 30,303 years) or ancient (232,288 to 303,030 years) existence, based upon assumptions of recent or ancient divergence ( ). Similarly, on the basis of higher and lower mutation rate estimates, the effective population size estimate of extant P. vivax was obtained as recent (Ne = 6,929 to 10,400) and long-term (Ne = 79,235 to 104,008), respectively ( ).
Table 5

Time to Most Recent Common Ancestor (TMRCA) estimate of Plasmodium vivax on the Indian subcontinent.

Divergence time scenarioSynonymous substitutions rate (dS)TMRCA lower limits (yrs)TMRCA upper limits (yrs)Theta (θ) Ne lower limits Ne upper limitsTMRCA estimates of P. vivax in India based on mt genome (yrs)
*Jongwutiwes et al. [14] *Mu et al. [15] #Cornejo et al. [50]
23–30 mya0.004±0.001232288±58072303030±757570.002076±0.0005979235±22657104008±2955913,000–17,000122,700–164,900143 400–192 600
3.8–6.3 mya38387±959663897±1592313172±374721898±6223
2–3 mya20202±505030303±75756929±196910400±2933

: The TMRCA of Indian P. vivax is estimated by Cornejo et al, #: TMRCA estimates analyzed by joining data of Jongwutiwe et al 2005, and Mu et al 2005.

: The TMRCA of Indian P. vivax is estimated by Cornejo et al, #: TMRCA estimates analyzed by joining data of Jongwutiwe et al 2005, and Mu et al 2005.

Plasmodium vivax populations are stable

We tested the housekeeping gene polymorphism data to see if there were detectable differences between the observed and expected allele frequency, which might indicate an unstable population structure of P. vivax in India. Assuming constant population equilibrium, we determined that the observed allele frequency spectrum was similar to the expected one ( ). In addition, we calculated the pairwise difference analysis for all samples and individual populations (), and also found the observed allele frequency spectrum followed the expected frequency. Thus this analysis indicates the stable nature of P. vivax populations in the Indian subcontinent.
Figure 2

Expected and observed pairwise differences at seven Plasmodium vivax housekeeping genes.

Tandem repeat variability and population bottleneck

Seven microsatellites, three from one chromosome (Chr 6), one each from four chromosomes (Chr 2, Chr 5, Chr 7 and Chr10), were identified from the P. vivax genome sequence, and a subset of isolates (n = 40) was tested by microsatellite analysis. A substantial number of alleles (range 6–13, AE = 8.62±1.16) and high expected heterozygosity (He = 0.65 to 0.90, AE = 0.789±0.039) were observed for each locus. Using data that we had previously generated for ten minisatellites on the same sample set used here [28], bottleneck analysis (heterozygote deficiency and allele frequency mode shift tests) was undertaken to detect whether the extant P. vivax population in the Indian subcontinent reflects a signature of a recent population bottleneck or of a more ancient population. A majority of loci (80%) showed heterozygosity deficiency ( ), suggesting that they are in expected genetic diversity equilibrium. Similarly, mode shift analysis revealed a strictly L-shaped allele frequency distribution ( ). The high heterozygosity deficiency and L-shaped allele frequency distribution at minisatellites suggests that the Indian P. vivax population has not undergone a recent population bottleneck. This supports our ancient age estimates of P. vivax inferred from putative housekeeping genes.
Table 6

Observed and expected heterozygosity at mini and microsatellite loci of Plasmodium vivax on the Indian subcontinent.

LocusObserved (He)Calculated by TPM (Heq)
NK He Heq SDDH/sd
PvCDPK95220.916 0.932 0.013−1.254
MiniSat 195180.826 0.911 0.017−4.994
MiniSat 292110.842 0.84 0.0380.074
MiniSat 593160.9120.8980.0230.616
MiniSat 695160.888 0.897 0.021−0.426
MiniSat 85570.698 0.757 0.067−0.876
MiniSat 1194180.9240.910.020.7
MiniSat 1391140.887 0.88 0.0250.279
MiniSat 1496140.873 0.878 0.027−0.161
MiniSat 1695110.815 0.838 0.038−0.582
MS_383660.656 0.735 0.073−1.088
MS_404070.7860.7720.0620.223
MS_503990.779 0.833 0.041−1.327
MS_213980.734 0.808 0.050−1.497
MS_734070.8360.7720.0601.061
MS_9237110.862 0.876 0.031−0.472
MS_12838130.905 0.900 0.0350.150
Gomez_14080.760 0.808 0.052−0.922

N: number of samples, K: number of alleles observed, He: observed gene diversity and Heq: expected gene diversity equilibrium. Boldface values indicate heterozygosity deficiency.

Figure 3

Allele frequency distribution curve based on Plasmodium vivax minisatellite and microsatellite polymorphism in the Indian subcontinent.

N: number of samples, K: number of alleles observed, He: observed gene diversity and Heq: expected gene diversity equilibrium. Boldface values indicate heterozygosity deficiency.

Network analysis of tandem repeat loci

We constructed a reduced-median (RM) network of tandem repeat-based haplotypes to understand whether P. vivax populations in India expanded recently (which would produce a star-like network). A RM haplotype network derived from more than two tandem repeat loci produces a highly complex network, therefore, we limited our analysis to two tandem repeat loci on P. vivax Chr 6. No star- like network of haplotypes was produced ( ), suggesting an ancient population expansion of P. vivax has occurred in the Indian subcontinent.
Figure 4

Reduced-median haplotype network derived from Plasmodium vivax tandem repeat loci.

Discussion

This is the first study concerning P. vivax evolutionary history on the Indian subcontinent using SNPs in putative housekeeping genes and minisatellite and microsatellite markers. In this study we identified 1) polymorphisms in Indian P. vivax populations at neutral genetic loci; 2) neutral housekeeping genes suitable for inferring the evolutionary history of P. vivax in India; 3) no geographical structuring of P. vivax populations; and 4) an ancient and stable population of P. vivax on the Indian subcontinent. The high degree of genetic polymorphism observed in tandem repeat loci agrees with earlier studies which revealed tremendous genetic polymorphism among global P. vivax isolates [22], [43], [45], [46], [47], [48]. This study successfully identified neutral genetic variation in several genetic loci, and this renders them highly useful as molecular markers for population structure studies. The high density of putatively neutral SNPs observed in housekeeping genes among Indian P. vivax isolates is in accordance with observations of Feng et al. [22], suggesting SNPs in P. vivax isolates are comparatively more common than in P. falciparum [17], [22]. Recently this was confirmed by deep sequencing that showed a higher genetic variability in P. vivax than P. falciparum [49]. Our neutrality test results imply that most of the housekeeping genes are free from directed selection pressure. Neutral evolution of the housekeeping genes described here argues for their utility as molecular markers for understanding demographic events and population history of P. vivax. We also tested whether pooling the P. vivax isolates collected from five geographically disparate populations might have adversely impacted the analyses of TMRCA and effective population size estimations, due to geographic structuring of subpopulations. However, we found no evidence for significant divergence between the populations, suggest that pooling of samples will not have an adverse impact on the analyses of TMRCA and effective population size. The absence of population structure between these five populations was also confirmed in an earlier study on the basis of genetic distance data derived from ten minisatellite loci [28]. Using the neutral polymorphisms of several housekeeping genes, we deduced that the MRCA of extant P. vivax was present on the Indian subcontinent about 232,228 to 303,030 years ago. This ancient TMRCA is strongly supported by long-term effective P. vivax population size (Ne) estimates and stable population indexes. The stable population index of P. vivax in Indian populations was established earlier using other neutral genetic loci [26] that also supports our older age estimate of this species. This ancient evolutionary history of extant P. vivax in India is similar to the molecular age inferred from mitochondrial genome diversity, i.e., 162,400–464,600 years [14], [15], [50]. Our TMRCA estimate strongly supports the hypothesis that P. vivax was a hominoid primate parasite before it became a human parasite by means of a host switch [12], [15], [50] and that it has evolved in Southeast Asia. The alternative hypothesis claiming an African origin for P. vivax is based on the fixation of the Duffy negative trait in the black African population. However, this alternate hypothesis is fraught with inconsistencies. For example, the Duffy negative trait arose around 100,000 years ago [51]; this time scale is shorter than the P. vivax TMRCA estimated in the present study and others [14], [15], [50]. The Duffy receptor is used for invasion by many pathogens including bacteria [52] and viruses [53], [54], [55]; thus, P. vivax is not necessarily the only evolutionary force for the fixation of this trait in the West African native population. Finally, P. vivax infection of Duffy negative people [56], [57], [58] suggests the existence of alternate invasion mechanisms, such as occur in P. falciparum [59], [60], [61]. In contrast, a much younger age estimate of P. vivax was determined under a recent divergence model of P. vivax and P. cynomolgi [10] coincident with the radiation of Asian macaques 2–3 mya [39]. This assumption implies that the MRCA of extant P. vivax dates to just 20,202–30,303 years ago. It is of interest to mention that under this recent divergence model [10], P. vivax shows a ten-fold higher mutation rate than P. falciparum [17], and it is unlikely that such a high mutation rate difference exists between related species. The radiation of Asian macaques was inferred from analysis of a mitochondrial gene, which may be evolving at a different rate than nuclear genes. Combined, our population genetic analyses such as pairwise difference, allele frequency, reduced-median network of haplotypes, and heterozygote deficiency conducted in the present study do not reflect the signature of a recent population expansion or bottlenecked population of P. vivax in the Indian subcontinent. Therefore, an ancient divergence time (23–30 mya) of P. vivax and P. knowlesi, seems more plausible. In conclusion, our findings reveal that P. vivax isolates have an ancient evolutionary history in the Indian subcontinent. For the first time we report that neutral nuclear genome markers (housekeeping genes) displayed an evolutionary history of P. vivax similar to that inferred from mitochondrial genome diversity. Map of India showing location of Plasmodium vivax sampling sites. 1: Delhi; 2: Panna, Madhya Pradesh; 3: Nadiad, Gujarat; 4: Chennai, Tamil Nadu; and 5: Kamrup, Assam. (PPT) Click here for additional data file. Tandem repeat variation in housekeeping genes from Plasmodium vivax field isolates. A) DNA gyrase and B) Ribosomal protein l34a. Tandem repeat unit is underlined. Dash (–) represents deleted nucleotides. (PPT) Click here for additional data file. Expected and observed pairwise differences at seven Plasmodium vivax housekeeping genes. (PPT) Click here for additional data file. Primers and amplification conditions. (DOC) Click here for additional data file. Characteristic features of Plasmodium vivax mini and microsatellite markers. (DOC) Click here for additional data file. Details of study sites. (DOC) Click here for additional data file.
  58 in total

1.  Twelve microsatellite markers for characterization of Plasmodium falciparum from finger-prick blood samples.

Authors:  T J Anderson; X Z Su; M Bockarie; M Lagog; K P Day
Journal:  Parasitology       Date:  1999-08       Impact factor: 3.234

2.  Mitochondrial portraits of human populations using median networks.

Authors:  H J Bandelt; P Forster; B C Sykes; M B Richards
Journal:  Genetics       Date:  1995-10       Impact factor: 4.562

3.  Polymorphism at the merozoite surface protein-3alpha locus of Plasmodium vivax: global and local diversity.

Authors:  M C Bruce; M R Galinski; J W Barnwell; G Snounou; K P Day
Journal:  Am J Trop Med Hyg       Date:  1999-10       Impact factor: 2.345

4.  DNA hybridization evidence of hominoid phylogeny: results from an expanded data set.

Authors:  C G Sibley; J E Ahlquist
Journal:  J Mol Evol       Date:  1987       Impact factor: 2.395

5.  Phylogeny of the malarial genus Plasmodium, derived from rRNA gene sequences.

Authors:  A A Escalante; F J Ayala
Journal:  Proc Natl Acad Sci U S A       Date:  1994-11-22       Impact factor: 11.205

6.  Comparative genomics of the neglected human malaria parasite Plasmodium vivax.

Authors:  Jane M Carlton; John H Adams; Joana C Silva; Shelby L Bidwell; Hernan Lorenzi; Elisabet Caler; Jonathan Crabtree; Samuel V Angiuoli; Emilio F Merino; Paolo Amedeo; Qin Cheng; Richard M R Coulson; Brendan S Crabb; Hernando A Del Portillo; Kobby Essien; Tamara V Feldblyum; Carmen Fernandez-Becerra; Paul R Gilson; Amy H Gueye; Xiang Guo; Simon Kang'a; Taco W A Kooij; Michael Korsinczky; Esmeralda V-S Meyer; Vish Nene; Ian Paulsen; Owen White; Stuart A Ralph; Qinghu Ren; Tobias J Sargeant; Steven L Salzberg; Christian J Stoeckert; Steven A Sullivan; Marcio M Yamamoto; Stephen L Hoffman; Jennifer R Wortman; Malcolm J Gardner; Mary R Galinski; John W Barnwell; Claire M Fraser-Liggett
Journal:  Nature       Date:  2008-10-09       Impact factor: 49.962

7.  Insights into the invasion biology of Plasmodium vivax.

Authors:  Surendra K Prajapati; Om P Singh
Journal:  Front Cell Infect Microbiol       Date:  2013-03-05       Impact factor: 5.293

8.  Plasmodium vivax and mixed infections are associated with severe malaria in children: a prospective cohort study from Papua New Guinea.

Authors:  Blaise Genton; Valérie D'Acremont; Lawrence Rare; Kay Baea; John C Reeder; Michael P Alpers; Ivo Müller
Journal:  PLoS Med       Date:  2008-06-17       Impact factor: 11.069

9.  Multidrug-resistant Plasmodium vivax associated with severe and fatal malaria: a prospective study in Papua, Indonesia.

Authors:  Emiliana Tjitra; Nicholas M Anstey; Paulus Sugiarto; Noah Warikar; Enny Kenangalem; Muhammad Karyana; Daniel A Lampah; Ric N Price
Journal:  PLoS Med       Date:  2008-06-17       Impact factor: 11.069

10.  Severe vivax malaria: newly recognised or rediscovered.

Authors:  Stephen J Rogerson; Richard Carter
Journal:  PLoS Med       Date:  2008-06-17       Impact factor: 11.069

View more
  4 in total

1.  Genotyping of Plasmodium vivax by minisatellite marker and its application in differentiating relapse and new infection.

Authors:  Ram Das; Ramesh C Dhiman; Deepali Savargaonkar; Anupkumar R Anvikar; Neena Valecha
Journal:  Malar J       Date:  2016-02-24       Impact factor: 2.979

2.  New insights into the Plasmodium vivax transcriptome using RNA-Seq.

Authors:  Lei Zhu; Sachel Mok; Mallika Imwong; Anchalee Jaidee; Bruce Russell; Francois Nosten; Nicholas P Day; Nicholas J White; Peter R Preiser; Zbynek Bozdech
Journal:  Sci Rep       Date:  2016-02-09       Impact factor: 4.379

3.  Multilocus sequence typing, biochemical and antibiotic resistance characterizations reveal diversity of North American strains of the honey bee pathogen Paenibacillus larvae.

Authors:  Sasiprapa Krongdang; Jay D Evans; Jeffery S Pettis; Panuwan Chantawannakul
Journal:  PLoS One       Date:  2017-05-03       Impact factor: 3.240

4.  Whole Genome Sequencing of Field Isolates Reveals Extensive Genetic Diversity in Plasmodium vivax from Colombia.

Authors:  David J Winter; M Andreína Pacheco; Andres F Vallejo; Rachel S Schwartz; Myriam Arevalo-Herrera; Socrates Herrera; Reed A Cartwright; Ananias A Escalante
Journal:  PLoS Negl Trop Dis       Date:  2015-12-28
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.