Literature DB >> 27853513

South-East Asian strains of Plasmodium falciparum display higher ratio of non-synonymous to synonymous polymorphisms compared to African strains.

Gajinder Pal Singh1, Amit Sharma1.   

Abstract

Resistance to frontline anti-malarial drugs, including artemisinin, has repeatedly arisen in South-East Asia, but the reasons for this are not understood. Here we test whether evolutionary constraints on Plasmodium falciparum strains from South-East Asia differ from African strains. We find a significantly higher ratio of non-synonymous to synonymous polymorphisms in P. falciparum from South-East Asia compared to Africa, suggesting differences in the selective constraints on P. falciparum genome in these geographical regions. Furthermore, South-East Asian strains showed a higher proportion of non-synonymous polymorphism at conserved positions, suggesting reduced negative selection. There was a lower rate of mixed infection by multiple genotypes in samples from South-East Asia compared to Africa. We propose that a lower mixed infection rate in South-East Asia reduces intra-host competition between the parasite clones, reducing the efficiency of natural selection. This might increase the probability of fixation of fitness-reducing mutations including drug resistant ones.

Entities:  

Keywords:  Plasmodium falciparum; artemisinin resistance; non-synonymous polymorphism; resistance evolution

Year:  2016        PMID: 27853513      PMCID: PMC5089136          DOI: 10.12688/f1000research.9372.2

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


Introduction

Artemisinin combination therapy (ACT) is the frontline treatment for malaria caused by Plasmodium falciparum and has played a major role in reducing malaria mortality from an estimated 840,000 deaths in the year 2000 to 440,000 deaths in the year 2015 [1]. The emergence and spread of artemisinin resistance in South-East Asia, however, poses a serious threat to malaria control, and the containment of artemisinin resistance is thus a global public heath priority [2– 8]. One of the most important unanswered questions in anti-malarial drug resistance is why it has repeatedly emerged in South-East Asia [3, 5, 6, 9]. The resistance to chloroquine was first reported in South-East Asia in 1957 before spreading to India and Africa where it resulted in the significant increase in malaria child mortality possibly killing millions of children [10– 12]. The resistance to sulphadoxine-pyrimethamine also emerged in South-East Asia in the late 1960s following a similar route to India and Africa [9]. Worryingly, the resistance to artemisinin has emerged independently at multiple places in South-East Asia [13– 17] and is now present 25 km from the Indian border [16] threatening to follow the same trajectory as resistance to previous anti-malarial drugs. Improved understanding of the process of how and why anti-malarial drug resistance emerges in South-East Asia could provide critical information in developing strategies to prevent the spread of the current wave of artemisinin resistance. Here we ask whether there are evolutionary constraints on P. falciparum strains from South-East Asia that differ from African strains and thus might explain the higher predisposition of South-East Asia strains to evolve drug resistance. To address this question we utilized a recent large global genome sequencing data from ~3400 clinical samples which identified nearly million high-quality single nucleotide polymorphisms (SNPs) in the exonic regions of P. falciparum [18].

Results

Higher ratio of non-synonymous to synonymous polymorphism in P. falciparum from South-East Asia

Resistance to anti-malarial drugs often involves changes in the amino-acid sequence within specific proteins. Thus, we tested whether the ratio of non-synonymous (amino acid changing) to synonymous polymorphism is higher in South-East Asia (SEA). Figure 1 shows a significantly higher ratio of non-synonymous to synonymous polymorphism (N/S) in SEA samples compared to African samples with almost no overlap in their distributions. The mean and median N/S for samples from SEA were 2.33, compared to 2.06 for Africa (Wilcox test p-value 0, number SEA samples 1600, and number Africa samples 1647). The higher N/S in SEA compared to Africa was also evident at the gene level with a larger number of genes showing higher N/S in SEA than in Africa ( Figure 2). Mean and median N/S for genes in SEA samples were 2.1 and 1.9 respectively, while for African samples the mean and median N/S were 1.9 and 1.8 respectively (paired t-test p-value 1E-43, paired Wilcox-test p-value 4E-27, n = 4792). There were 75 genes with more than 3-fold higher N/S in SEA samples relative to African samples and N/S of more than four in SEA. Interestingly, most of these genes were not related to antigenic variation ( Supplementary Table 1), but perform basic housekeeping functions, suggesting that higher N/S of these genes in SEA might not be primarily driven by differential host immune selection. In addition to kelch13, -the only gene known to be causally associated with artemisinin resistance- the list includes CRT (chloroquine-resistance transporter) which shows an 8-fold higher N/S in SEA samples compared to African samples and has previously been shown to be associated with artemisinin resistance in a genome-wide association studies (GWAS) study [14]. In summary, P. falciparum strains from SEA show a higher ratio of non-synonymous to synonymous polymorphisms than African strains.
Figure 1.

Higher ratio of non-synonymous to synonymous polymorphism in P. falciparum samples from SEA.

The ratios of non-synonymous to synonymous polymorphism (N/S) for 3394 samples from 22 countries are shown. The y-axis is truncated at the top with 13 samples not shown. PNG - Papua New Guinea.

Figure 2.

P. falciparum genes show a higher rate N/S in SEA compared to Africa.

The scatter-plot shows N/S in SEA and Africa for 4792 genes. Genes previously associated with artemisinin resistance in a GWAS study [14] are shown in red, with Kelch13 and chloroquine-resistance transporter (CRT) labelled. The diagonal line is shown and the numbers of genes on both sides of the diagonal are indicated. The x and y-axes are truncated with 28 genes not shown.

Supplementary Table 1.

List of functionally annotated genes with more than 3-fold higher N/S in SEA than Africa and N/S in SEA more than 4.

GeneldDescriptionN/S SEAN/S AfricaFold Higher in SEA
PF3D7_0709000chloroquine resistance transporter (CRT)10.31.38.1
PF3D7_0302700CDGSH iron-sulfur domain-containing protein10.01.47.2
PF3D7_0816200vacuolar protein-sorting protein VPS26.00.96.4
PF3D7_0410400exosome complex component RRP4 (RRP4)7.01.25.9
PF3D7_0922900beta-ketoacyl-acyl carrier protein reductase (FabG)11.02.05.5
PF3D7_0314100.1vesicle transport v-SNARE protein6.01.15.5
PF3D7_1124800nuclear preribosomal assembly protein8.01.55.4
PF3D7_0936000ring-exported protein 2+(REX2)13.02.65.1
PF3D7_07200003'-5'+exoribonuclease Csl4 homolog6.01.25.1
PF3D7_0110900adenylate kinase8.01.74.8
PF3D7_1313700septum formation protein MAF homologue10.02.14.7
PF3D7_0304600circumsporozoite (CS) protein (CSP)12.02.74.5
PF3D7_0515100rhomboid protease ROM9 (ROM9)9.02.04.4
PF3D7_0109400tubulin-specific chaperone a7.01.64.3
PF3D7_0522700iron-sulfur assembly protein (SufA)8.01.94.1
PF3D7_0702300sporozoite threonine and asparagine-rich protein (STARP)15.53.94.0
PF3D7_1435000cytochrome c+oxidase assembly protein9.02.33.9
PF3D7_0109300fatty acid elongation protein, GNS1/SUR4 family4.51.23.9
PF3D7_1410100alpha/beta hydrolase5.31.43.9
PF3D7_0319600elongation factor 1+(EF-1)7.01.83.9
PF3D7_1143200DnaJ protein14.03.63.8
PF3D7_1028300nucleolar preribosomal assembly protein13.03.53.8
PF3D7_1358300rhomboid protease ROM7 (ROM7)5.51.53.7
PF3D7_1343700kelch protein4.21.13.7
PF3D7_0416100glutamyl-tRNA(Gln) amidotransferase subunit A8.02.23.6
PF3D7_0423400asparagine-rich protein (AARP)9.02.53.5
PF3D7_1210900GPI mannosyltransferase I+(PIG-M)5.01.43.5
PF3D7_1015800ribonucleotide reductase small subunit5.01.53.3
PF3D7_1217600anaphase promoting complex subunit 106.01.93.2
PF3D7_1143300DNA-directed RNA polymerase I4.31.43.1
PF3D7_1405200sybindin-like protein5.01.63.1
PF3D7_1023400HORMA domain protein8.02.63.1
PF3D7_1434000CCR4-associated factor 16 (CAF16)6.52.13.1

Higher ratio of non-synonymous to synonymous polymorphism in P. falciparum samples from SEA.

The ratios of non-synonymous to synonymous polymorphism (N/S) for 3394 samples from 22 countries are shown. The y-axis is truncated at the top with 13 samples not shown. PNG - Papua New Guinea.

P. falciparum genes show a higher rate N/S in SEA compared to Africa.

The scatter-plot shows N/S in SEA and Africa for 4792 genes. Genes previously associated with artemisinin resistance in a GWAS study [14] are shown in red, with Kelch13 and chloroquine-resistance transporter (CRT) labelled. The diagonal line is shown and the numbers of genes on both sides of the diagonal are indicated. The x and y-axes are truncated with 28 genes not shown.

Higher non-synonymous changes at the conserved positions in South-East Asia

Highly conserved proteins in P. falciparum show a much lower N/S, indicating the lower tolerance for non-synonymous polymorphism [18]. We tested whether the correlation between N/S and protein conservation might be different in SEA and Africa. The correlation between N/S and conservation was much weaker in SEA ( Figure 3) with Pearson correlation of -0.43 (95% CI: -0.41 to -0.46) compared to -0.69 (95% CI: -0.68 to -0.71) in Africa. The lower correlation in SEA suggests a higher ratio of non-synonymous to synonymous changes at conserved positions. Indeed, non-synonymous polymorphisms specifically observed in SEA are more likely to occur at conserved positions compared to those specific to Africa ( Figure 4). Samples from SEA show higher N/S compared to Africa when considering only conserved positions ( Figure 5). These results suggest a lower efficiency of negative selection in SEA in removing potentially deleterious mutations. This may be important for the acquisition of antimalarial drug resistance since drug-resistance mutations preferentially occur at the conserved sites [19], e.g. artemisinin resistance mutations in Kelch13 occur in the conserved region of the protein [18], resistance mutations also occur in the conserved regions in DHFR (dihydrofolate reductase), DHPS (dihydropteroate synthase), and CRT (chloroquine-resistance transporter) [19]. In summary, P. falciparum strains from SEA show a higher ratio of non-synonymous to synonymous polymorphisms at conserved sites in the protein sequences than African strains.
Figure 3.

Lower correlation between N/S and protein conservation in SEA.

A) Scatter-plot of N/S in Africa and percent protein conservation and B) Scatter-plot of N/S in SEA and percent protein conservation. Percent conservation for each protein is the percent of residues identical across orthologs in seven Plasmodium species ( P. berghei, P. chabaudi, P. cynomolgi, P. knowlesi, P. reichenowi, P. vivax, P. yoelii). Only proteins with orthologs in all Plasmodium species are shown (4075 proteins). Y-axis is truncated with 7 points not shown in Figure 3a and 112 points not shown in Figure 3b.

Figure 4.

Non-synonymous polymorphisms specifically observed in SEA are more likely to occur at conserved positions when compared to non-synonymous polymorphisms specifically observed in Africa.

A) Conserved sites were defined as sites identical across orthologs in Plasmodium species ( P. berghei, P. chabaudi, P. cynomolgi, P. knowlesi, P. reichenowi, P. vivax, P. yoelii) in multiple sequence alignment. B) Conserved sites were defined as sites identical across orthologs in diverse eukaryotes ( S. cerevisiae, D. melanogaster, C. elegans, H. sapiens) in multiple sequence alignment. Error bars indicate 95% confidence intervals of the mean from 1,000 bootstrap samples.

Figure 5.

Higher ratio of non-synonymous to synonymous polymorphism at conserved sites in P. falciparum from SEA.

A) Box-plot showing N/S for 3394 samples from 22 countries at sites identical across orthologs in seven Plasmodium species in multiple sequence alignment. B) Box-plot showing the N/S ratio at sites identical across orthologs in diverse eukaryotes ( S. cerevisiae, D. melanogaster, C. elegans and H. sapiens) in multiple sequence alignment. The y-axis is truncated at the top with 10 samples not shown in both panels. PNG - Papua New Guinea.

Lower correlation between N/S and protein conservation in SEA.

A) Scatter-plot of N/S in Africa and percent protein conservation and B) Scatter-plot of N/S in SEA and percent protein conservation. Percent conservation for each protein is the percent of residues identical across orthologs in seven Plasmodium species ( P. berghei, P. chabaudi, P. cynomolgi, P. knowlesi, P. reichenowi, P. vivax, P. yoelii). Only proteins with orthologs in all Plasmodium species are shown (4075 proteins). Y-axis is truncated with 7 points not shown in Figure 3a and 112 points not shown in Figure 3b.

Non-synonymous polymorphisms specifically observed in SEA are more likely to occur at conserved positions when compared to non-synonymous polymorphisms specifically observed in Africa.

A) Conserved sites were defined as sites identical across orthologs in Plasmodium species ( P. berghei, P. chabaudi, P. cynomolgi, P. knowlesi, P. reichenowi, P. vivax, P. yoelii) in multiple sequence alignment. B) Conserved sites were defined as sites identical across orthologs in diverse eukaryotes ( S. cerevisiae, D. melanogaster, C. elegans, H. sapiens) in multiple sequence alignment. Error bars indicate 95% confidence intervals of the mean from 1,000 bootstrap samples.

Higher ratio of non-synonymous to synonymous polymorphism at conserved sites in P. falciparum from SEA.

A) Box-plot showing N/S for 3394 samples from 22 countries at sites identical across orthologs in seven Plasmodium species in multiple sequence alignment. B) Box-plot showing the N/S ratio at sites identical across orthologs in diverse eukaryotes ( S. cerevisiae, D. melanogaster, C. elegans and H. sapiens) in multiple sequence alignment. The y-axis is truncated at the top with 10 samples not shown in both panels. PNG - Papua New Guinea.

Lower mixed infection rate in South-East Asia

Blood samples may contain more than one haploid parasite clone due to mixed infections by multiple genotypes. The rate of mixed strain infection is generally lower in areas of low-transmission such as SEA [20]. The lower efficiency of negative selection in removing potentially deleterious mutations at conserved positions in SEA could result from lower competition between parasite clones in the hosts. Indeed, the estimated rate of mixed strain infections, detected by a high proportion of heterozygous calls in the sequencing data, was much lower in South-East Asia compared to Africa ( Figure 6). We also confirmed that N/S is SEA samples was higher than samples from Africa even when separately analysing predicted single strain and mixed strain samples ( Supplementary Figure 1).
Figure 6.

Lower estimated rate of mixed strain infections in samples from SEA.

Mixed strain infection defined as samples with >10% SNP calls as heterozygous. This cut-off was determined from the distribution of heterozygous SNPs across the samples ( Supplementary Figure 3). The numbers of samples from each country are shown at the top of bar plots. PNG - Papua New Guinea.

Supplementary Figure 1.

Higher ratio of non-synonymous to synonymous polymorphism in P. falciparum samples from SEA when single strain and mixed strain samples are analysed separately..

A) Box-plot showing N/S for 1670 single strain samples. There was only one single strain sample each from Nigeria and Uganda. B) Box-plot showing N/S for 1724 mixed strain samples. There were no mixed strain samples from Colombia. Mixed strain samples defined as samples with >10% SNP calls as heterozygous. This cut-off was determined from the distribution of heterozygous SNPs across the samples ( Supplementary Figure 3).

Lower estimated rate of mixed strain infections in samples from SEA.

Mixed strain infection defined as samples with >10% SNP calls as heterozygous. This cut-off was determined from the distribution of heterozygous SNPs across the samples ( Supplementary Figure 3). The numbers of samples from each country are shown at the top of bar plots. PNG - Papua New Guinea.
Supplementary Figure 3.

The frequency distribution of percentage heterozygous calls in samples.

About half of the 3,394 samples showed percentage heterozygous calls less than ten percent and were defined as single strain samples. Rest of the samples showed roughly uniform distribution of percentage heterozygous calls and were defined as mixed strain samples.

Discussion

Here we find a higher N/S ratio in strains from SEA compared to Africa. We also find that non-synonymous mutations have a higher likelihood to occur at conserved sites in SEA strains compared to African strains. In addition, we confirm a lower rate of mixed strain infection in SEA compared to Africa in the MalariaGEN dataset, the largest whole-genome dataset on P. falciparum till date. Based on these three observations, we propose a model for the higher propensity of SEA populations to acquire drug resistance ( Supplementary Figure 2). Lower mixed strain infections in SEA may allow even less-fit parasites to be transmitted to the next set of hosts due to reduced level of intra-host competition. In contrast, the higher mixed strain infection rate in Africa may drive more intense intra-host competetion, and may therefore reduce the probability of transmission of less-fit parasites. Thus, fitness-reducing mutations including drug-resistance mutations might have a higher chance of spreading in SEA compared to Africa in patients not taking drugs. Since Africa has higher rate asymptomatic infections as well as untreated patients, this would also result in higher competition between drug resistant and drug sensitive clones in the absence of drug, further decreasing the spread of drug resistance mutations with a fitness cost.
Supplementary Figure 2.

A model for the higher propensity of SEA populations to acquire drug resistance.

Lower mixed infections in SEA may allow even less-fit parasites to be transmitted to next set of hosts due to reduced level of intra-host inter-parasite competition. Thus, fitness-reducing mutations including drug-resistance mutations might have a higher chance of spreading in SEA. In contrast, the higher mixed infection rate in Africa may drive more intense inter-clone competition within the host, thereby reducing the probability of transmission of the less-fit clones, including potentially drug-resistant clones. The public domain image of the outline of the human body was obtained from https://en.wikipedia.org/wiki/File:Outline-body.png.

This model is consistent with a number of previous studies. Our observation of higher likelihood of fixation of potentially deleterious mutations in P. falciparum strains from SEA compared to African strains is consistent with the previous observation of higher rate of potentially deleterious copy number variations in P. falciparum from SEA compared to Africa [21]. These observations suggest relaxed negative selection on P. falciparum from SEA compared to Africa and that SEA strains would have lower fitness than African strains. It would be fascinating to test this hypothesis experimentally e.g. by measuring the competitive asexual growth rate (an important component of fitness) of SEA and African P. falciparum strains. Mixed strain infection by P. falciparum has recently been demonstrated to lead to within-host competition in patients [22], the possible mechanisms of which might include strain-transcending immunity, resource competition (e.g. RBCs) or direct interference between strains [23– 26]. While within-host competition seems to be the major explanation for lower N/S in African strains, mixed strain infection would also lead to higher rate of recombination between gametes of different genotypes and efficient removal of deleterious mutations in Africa. In any case, a higher rate of mixed strain infection is expected to increase the strength of purifying selection. What are the implications of our model for the current wave of artemisinin resistance? The much larger population size of P. falciparum in Africa [21], as also evidenced by the high rate of mixed strain infection ( Figure 6) should make it easier for resistance mutations to appear. Indeed, artemisinin resistance mutations in kelch13gene were observed in samples from Africa, including the most common artemisinin resistance mutation C580Y [18]. The C580Y mutation is capable of generating artemisinin resistance in vitro in the NF54 parasite strain considered to be of African origin [27]. This raises an important question as to why artemisinin resistance is not spreading in Africa. Since artemisinin resistance is likely to incur a fitness cost in the drug-free environment [28– 30], we propose that strains with these mutations are continuously arising in Africa but get competitively removed by the fitter drug-sensitive strains [30] in hosts not taking artemisinin. This effect might be pronounced by the greater proportion of asymptomatic and untreated patients in Africa. However once a strain acquires compensatory mutations that may reduce the fitness cost of the original mutation, it may be able to spread in a more competitive environment in Africa. While compensatory mutations can occur anywhere in the genome and may even spread in South-East Asia, these could be unlinked by recombination in areas with high transmission rate such as Africa [31]. Thus, compensatory mutations in the same gene might be more likely to spread in high-transmission areas. Indeed, drug-resistance genes often acquire multiple mutations before spreading to Africa, e.g. pyrimethamine resistance gene dhfr acquired at least three different mutations in South-East Asia before it spread to Africa [9]. All chloroquine-resistant strains have the K76T mutation in CRT (chloroquine-resistance transporter) but are accompanied by a number of mutations in the same gene [32]. While at present kelch13 does not appear to have multiple mutations [33], it would be critical to monitor the acquisition of additional mutations in the kelch13 which might compensate the fitness cost of kelch13 resistant mutations in the drug-free environment. Resistance to chloroquine and sulphadoxine-pyrimethamine spread from SEA to India to Africa [3]. Interestingly we observed a higher mixed strain infection rate in Bangladesh than in neighboring SEA. The Indian subcontinent has areas with widely variable transmission rates [34]. This might allow drug-resistant P. falciparum evolved in low transmission areas in SEA to gradually adapt to higher transmission areas in the Indian subcontinent, which could then spread to the high transmission areas in Africa. Therefore, it would be critical to track the spread of artemisinin resistance in the Indian subcontinent. It is important to note that higher N/S in SEA populations does not necessarily imply higher mutation rate. Brown et al. previously found similar substitution rates in samples from Africa and SEA [35]. Mutation rate as measured by long-term in vitro culture was not higher in strains from SEA origin, either in the presence or absence of drug [36, 37]. Thus mutation rate in SEA population appears to be similar to that of African population, but a higher fraction of mutations are observed at conserved non-synonymous positions in SEA. The MalariGEN study from where we obtained the dataset reported much higher density (per sample) of both synonymous and non-synonymous polymorphisms in Africa compared to SEA [18]. It is also important to note that higher density of SNP/sample does not imply higher substitution rate in Africa, rather it reflects the higher rate of mixed strain infection in Africa, i.e. more SNPs are identified in samples from Africa because of the higher number of different parasite clones per samples ( Figure 6). The authors of the MalariaGEN also wrote that at the gene level “we found virtually identical distributions of the ratio of non-synonymous to synonymous mutations (N/S ratio) in the two regions” [18], however, no statistical test was performed by the authors. Furthermore, no comparison of N/S at the sample level was performed in the MalariaGEN study. Resistance to chloroquine and sulfadoxine-pyrimethamine appeared independently in SEA and South America [38]. While there were few samples from South-America in the MalariaGEN dataset ( Figure 6), we find that these samples also display lower mixed infection rate ( Figure 6), and N/S ratio in between the African and SEA samples ( Figure 1). Further analyses of a larger number of samples from South America could shed light on whether the mechanism we propose for a higher rate of resistance emergence in SEA might be applicable to South-America. In summary, we propose that the lower transmission rates in SEA lead to a lower rate of mixed strain infection, which leads to reduced strength of natural selection. This, in turn, allows a higher rate of fixation of potentially deleterious mutations including drug resistance mutation. However, other factors such as drug usage, the level of immunity, and social factors [3, 5, 39], could also contribute towards the faster development of resistance in SEA. Given the basic difference in the transmission rate between SEA and Africa, which is not easy to control, we should expect that SEA would remain a source of drug resistance malaria in the future.

Methods

The SNP data of P. falciparum was obtained from the MalariaGen community webpage ( https://www.malariagen.net/data/p-falciparum-community-project-jan-2016-data-release) [18]. The SNP data consist of filtered and high quality 939,687 exonic SNPs with 631,715 non-synonymous and 307,972 synonymous SNPs. The data comprised 3,394 samples from 22 countries, with roughly equal number of samples from South-East Asia (1,600 samples) and Africa (1,647 samples). The N/S ratio for each sample was obtained by dividing the number of non-synonymous SNPs by the number of synonymous SNPs in that sample. Proteome sequences of P. falciparum, P. berghei, P. chabaudi, P. cynomolgi, P. knowlesi, P. reichenowi, P. vivax, P. yoelii were downloaded from PlasmoDB database and proteome sequences of S. cerevisiae, D. melanogaster, C. elegans and H. sapiens were downloaded from European Bioinformatics Institute (EBI) database. Orthologous sequences were identified using best bidirectional hit algorithm [40] and aligned using ClustalO [41]. The conservation score for P. falciparum proteins was calculated as the percentage of positions identical across all orthologous proteins from Plasmodium species. The N/S ratio for each gene in South-East Asia and Africa was calculated by dividing the number of unique non-synonymous SNPs by the number of unique synonymous SNPs across samples from the two geographical areas. There were 136 genes with zero synonymous SNPs in SEA and thus were excluded from the analyses. The Pearson correlation between N/S for each gene and the conservation score was calculated in R. All figures were created in R version 3.2.3. Mixed infection samples were defined as samples with >10% SNP calls as heterozygous. This cut-off was determined from the distribution of heterozygous SNPs across the samples ( Supplementary Figure 3). It is important to note that this method is not likely to accurately classify each sample into a polyclonal (mixed infection) or a monoclonal sample, but the overall trend of higher rate of mixed infection in African samples compared to SEA samples is likely to be robust.

Data availability

The data referenced by this article are under copyright with the following copyright statement: Copyright: © 2016 Singh GP and Sharma A This publication uses data from the MalariaGEN Plasmodium falciparum Community Project as described in Genomic epidemiology of artemisinin resistant malaria, eLife, 2016 (DOI: http://dx.doi.org/10.7554/eLife.08714). This data is also available from the MalariaGEN website ( https://www.malariagen.net/data/p-falciparum-community-project-jan-2016-data-release). This revised article "South-East Asian strains of  Plasmodium falciparum display higher ratio of non-synonymous to synonymous polymorphisms compared to African strains" address several of the different criticisms listed by the reviewer in the recent submitted peer-review. In particular, the authors have revised their discussion section to better highlight the contrast between their conclusions and the conclusions made by a similar MalariaGen analysis. The authors have also addressed errors highlighted by the reviewer in both figure legends and discussion concerning the effects of selection. Together, these changes create a stronger article worth being indexed. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. The title is appropriate. The abstract makes some jumps in logic:  e.g “Furthermore, South-East Asian strains showed a higher proportion of non-synonymous polymorphism at conserved positions, suggesting reduced negative selection.” This pattern can equally well be explained by stronger positive selection in SEA than in Africa. Major points Results differ from the prior analysis of this dataset. The article comes to different conclusions from those reported by the MalariaGEN consortium in eLife (ref 18) who write:  "Accordingly, we found virtually identical distributions of the ratio of non-synonymous to synonymous mutations (N/S ratio) in the two regions ( Figure 3c)” page 6 bottom: MalariaGEN Plasmodium falciparum Community Project. eLife 2016;5:e08714. DOI: 10.7554/eLife.08714). It is unclear to me why this F1000 submission and the published eLife paper reach very different conclusions using the same dataset? I suggest that the authors should cite the conclusions of the eLife paper and explain why their analysis comes to a very different conclusion. This point needs urgently needs satisfactory resolution as this is the central conclusion of the paper. Methodology/analysis biases. Reduced ability to score non-synonymous variants in African populations could generate the N/S ratios observed. Such a bias could be generated because infections are more complex in Africa. Non-synonymous mutations tend to be at lower frequency than synonymous mutations. In complex mixed African infections low frequency non-synonymous mutations may be filtered out of the dataset (because very few reads show the mutation) but scored in SE Asian samples, where infections are simpler (so multiple reads show non-synonymous mutations). To eliminate this potential methodological bias, the authors should filter their data to minimize mixed infections prior to analysis. This could be done by setting a threshold for numbers of mixed base calls. Again, it is important to understand why the conclusions reached by this analysis differ from the published eLife paper. As pointed out in the eLife paper there is a dramatic difference in the allele frequency spectrum in the two locations. The excess of rare variants in Africa could also potentially alter the N/S ratios. Population genetics explanation. Assuming the central result is correct, a simpler model explains these results: The effective population size (Ne) of parasite populations is lower in SEA than in Africa. Therefore purifying selection is weaker in SEA than in Africa resulting in less efficient removal of deleterious mutations. Discussion of this simple alternative explanation would improve the paper. Also Cheeseman et al, MBE 2016 (Mol Biol Evol. 2016 Mar;33(3):603-20. doi: 10.1093/molbev/msv282.), who make this argument to explain patterns of global copy number polymorphism in Plasmodium falciparum. Statistics used: The paper counts numbers of non-synonymous and synonymous mutations and estimates the ratio (N/S). This is an unusual way to present such data – it would be more informative to show dN/dS – (Nonsynonymous changes per non-synonymous site/synonymous variants per synonymous site). This statistic is more useful because the expected ratio under neutrality is 1. This should not change the results but will be more easily interpretable with reference to a neutral model. Related questions: How is N/S ratio determined for genes in which there are NO synonymous mutations? Details of the methods for calculating this ratio, and how they differ from those used in the eLife paper should be provided. More support needed for intrahost competition explanation: The authors argue that the observed pattern may result from more within infection competition in Africa relative to Asia. If competition is invoked to explain the results, then it would be useful to examine the categories of genes contributing to the excess of NS variations. This argument would imply that this effect should be seen in subsets of genes involved in within host competition. On the other hand, if there is no particular enrichment of particular gene classes, then the simple population genetics explanation seems more likely. Lack of balance. The authors argue that drug resistance may arise in SEA rather than Africa because intrahost competition prevents emergence of resistance mutations associated with fitness costs. I agree in part – this may certainly contribute. However, greater selection for resistance to drugs in SE Asia is likely to be a critical factor. Most infections in SEA are symptomatic and infected individuals seek treatment. However, most infections in Africa are asymptomatic, so people to no seek treatment. Discussion of the difference in selection strength between continents would add balance to this paper. Other issues. I was very surprised to see this analysis of MalariaGEN data without involvement from malariaGEN authors. My understanding is that access to these data requires a "Fort Lauderdate" type agreement with MalariaGEN (https://www.malariagen.net/data/terms-use/pf3k-terms-use). The authors may have already discussed this with relevant people at MalariaGEN - I would suggest that the authors directly contact Dominic Kwiatkowski, if they have not already done so. I have read this submission. I believe that I have an appropriate level of expertise to state that I do not consider it to be of an acceptable scientific standard, for reasons outlined above. We are grateful for your valuable comments, which we believe have significantly improved our manuscript. The abstract makes some jumps in logic:  e.g “Furthermore, South-East Asian strains showed a higher proportion of non-synonymous polymorphism at conserved positions, suggesting reduced negative selection.” This pattern can equally well be explained by stronger Author response: Positive selection can indeed increase (though not in all cases) N/S for the gene under selection, as has been observed for the kelch13 gene in SEA. However, the pattern of higher N/S in strains in SEA which can be observed at genome-wide level is much more consistent with relaxed negative selection in SEA, rather than higher positive selection in SEA, which typically acts at a few sites. Furthermore, reduced constraints at conserved sites at the genome-wide level supports relaxed negative selection rather than positive selection. We thus believe that our use of word “suggesting” is appropriate in the statement. )” page 6 bottom: MalariaGEN Plasmodium falciparum Community Project. eLife 2016;5:e08714. DOI: 10.7554/eLife.08714). It is unclear to me why this F1000 submission and the published eLife paper reach very different conclusions Author response: The authors in the MalariaGEN study did not perform any statistical test to support their statement that N/S in SEA and Africa is similar at the gene level. While the difference is subtle, it is highly statistically significant, as we report in our manuscript. Furthermore, the authors in MalariaGEN study did not analyse differences in N/S at the sample level, where we clearly see the differences between SEA and African strains (Figure 1). We have now added these statements to the Discussion section of the revised manuscript. Author response: Separately analysing predicted monoclonal and polyclonal samples did not change conclusions (Supplementary Figure 1 in the revised manuscript). This is not surprising given that we observed almost no overlap in the distributions of N/S in SEA and African samples (Figure 1), as we wrote in the manuscript. We would like to emphasize that authors in MalariaGEN study did not perform N/S analyses at the sample level. As pointed out in the eLife paper there is a dramatic difference in the allele frequency spectrum in the two locations. The excess of rare variants in Africa could also potentially alter the N/S ratios. Author response: As proposed by the reviewer in the previous comment, the excess of rare variants in Africa might increase N/S in Africa, thus the higher N/S in SEA could not be explained by excess rare variants in Africa. Author response: Indeed relaxed negative selection in SEA due to lower effective population (reflected in lower polyclonal infections) is exactly what we propose in our manuscript for our observation of higher N/S in SEA. Lower Ne (effective population size) is bound to be associated with lower polyclonal infection rate, which has been shown to lead to within-host competition in P. falciparum 1. Thus we do not believe that the two explanations are alternative independent explanations, one of which needs to be discounted. Rather the observation made by Cheeseman  et al, and in our manuscript are convergent which further supports reduced negative selection in P. falciparum in SEA and we have added these statements to the Discussion section of the revised manuscript. Author response: As pointed out by the reviewer dN/dS comparison would not change the results. P. falciparum is unusual in having very high N/S compared to other organisms 2, thus the usual expectation of dN/dS of 1 for neutral evolution is not applicable for P. falciparum. Related questions: How is N/S ratio determined for genes in which there are NO synonymous mutations? Details of the methods for calculating this ratio, and how they differ from those used in the eLife paper should be provided. Author response: There were 136 genes with zero synonymous SNPs in SEA (none in Africa) and thus were excluded from the analyses. We have added this statement in the Method section of the revised manuscript. Author response: There is no dataset of genes known to be involved in within-host competition in P. falciparum. We do note however that many housekeeping genes show higher N/S in SEA than Africa (Supplementary Table 1). As discussed in response to previous comments by the reviewer, the lower Ne, lower rate of polyclonal infections and within-host completion are not mutually exclusive explanations, one of which needs to be discounted. Author response: The higher rate asymptomatic infections as well as untreated patients in Africa would mean higher competition between drug resistant and drug sensitive clones in the absence of drug, further decreasing the spread of drug resistance mutations with a fitness cost. Thus higher immunity, lower treatment rates, and higher polyclonal infections are likely to work synergistically. We have added these statements to the Discussion section in the revised manuscript. Author response: We have taken permission from the authors of the MalariaGEN study, and their contribution has been appropriately acknowledged in our publication. We would again like to thank you for your critical comments on our manuscript, and would be happy to address any further concerns that you may have in the revised version of the manuscript. Reference List 1.    Bushman, M. et al. Within-host competition and drug resistance in the human malaria parasite Plasmodium falciparum. Proc. Biol. Sci. 283, 20153038 (2016). 2.    Chang, H. H. et al. Malaria life cycle intensifies both natural selection and random genetic drift. Proc. Natl. Acad. Sci. U. S. A. 110, 20129-20134 (2013). In this article the authors conduct a detailed analysis that focused on the frequency of non-synonymous mutations across an extensive genome set derived by the MalariaGEN consortium from an extensive set of P. falciparum isolates collected mainly in Africa and SE Asia. The analyses extend the work presented recently (DOI: 10.7554/eLife.08714) where a trend was noted for a higher level of non-synonymous mutations in the SE Asian parasites a compared to that in the African parasites. Specifically, the authors have presented the analysis for the parasites from individual countries and focused on SNPs that occur at residues that are highly conserved in orthologues within a variety of Plasmodium species and across eukaryotes in general. The authors have thus provided convincing evidence for statistically significant albeit relatively subtle differences between the parasites from SE Asia and Africa (it is unclear to me why parasites from Peru and Colombia were included in the analysis, all the more as their numbers were relatively low). Whereas, the quality of the data, the analyses and the results are not into question, I have some misgivings and indeed frustrations as to the conclusions put forward in an attempt to account for the differences observed. The scenario inspired by the observations that the authors propose as an explanation for the propensity of antimalarial drug resistance to emerge and spread in P. falciparum from SE Asia first before doing so in Africa is plausible and could be supported by the data presented. However, this scenario is based on a set of assumptions whose limitations are not discussed, without any justification as to why this particular scenario is the most likely. What is the nature of the intra-host competition between the genotypes of P. falciparum that circulate in humans, and is there any evidence that this actually alters transmissibility? Immunity is the most obvious selective intra-host factor. To which extend differences in the levels of immunity among the patients that contributed the P. falciparum isolates analysed here alters the multiplicity of the infection or indeed the likelihood of genetic crosses? Is there any evidence that parasite with the higher levels of non-synonymous mutations across the genome are biologically less fit? There is evidence that this is the case for drug resistant parasites in vitro, that for parasites growing in vivo is less straightforward and varies for different drugs. To what extend are the isolates collected from a limited number of febrile patients reflect the overall parasite population across a whole country. Could a bias be introduced simply because of differences in the degree of acquired immunity on admission, the time between the onset of the infection and the time treatment was sought, or the level of admission parasitaemia? Exposure to drug is clearly a strong selective constraint for those genes implicated in overcoming the effects of the drug. What can be the nature of the selective constrains that maintain the higher levels of non-synonymous mutations across numerous genes spread throughout the genome? Clearly not all can be considered to provide some speculative compensatory effect to a potential reduction in fitness following drug selection. It should be pointed out that in the 1950’s resistance to chloroquine first appeared not only in SE Asia but also in Colombia. Similarly resistance to pyrimethamine did not first appear in SE Asia, but in all areas where the use of this drug became widespread and systematic (this was the case in both SE Asia and African countries in the 1950’s). The “unanswered” question as to the reason why drug resistance has repeatedly emerged in SE Asia, might have more to do with drug usage than with some special property of the parasites. Ultimately, I would suggest that the authors should present the limitations of their conclusions as well as alternative scenarios to account for their observations (and rank or discount them if they can). This will surely be welcome by readers. Advances in knowledge are seeded by diversity in speculation (conservative or less so) and genome wide analyses are a rich source for this. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above. We are grateful for your valuable comments, which we believe have significantly improved our manuscript. In this article the authors conduct a detailed analysis that focused on the frequency of non-synonymous mutations across an extensive genome set derived by the MalariaGEN consortium from an extensive set of P. falciparum isolates collected mainly in Africa and SE Asia. The analyses extend the work presented recently (DOI: 10.7554/eLife.08714) where a trend was noted for a higher level of non-synonymous mutations in the SE Asian parasites a compared to that in the African parasites. Specifically, the authors have presented the analysis for the parasites from individual countries and focused on SNPs that occur at residues that are highly conserved in orthologues within a variety of Plasmodium species and across eukaryotes in general. The authors have thus provided convincing evidence for statistically significant albeit relatively subtle differences between the parasites from SE Asia and Africa (it is unclear to me why parasites from Peru and Colombia were included in the analysis, all the more as their numbers were relatively low). Author response: We would like to clarify that MalariaGEN authors did not write or suggest that N/S ratio of P. falciparum strains from SEA and Africa are different. We have discussed the difference between the work reported in MalariaGEN study and our manuscript in the Discussion section of the revised manuscript. We added data from Peru and Columbia in Figures 1, Figure 5 and Figure 6 only for completeness, but we have refrained from making any firm conclusions based on these samples. We have nevertheless commented upon the observations from South-American samples in the Discussion section in response to your comment. Whereas, the quality of the data, the analyses and the results are not into question, I have some misgivings and indeed frustrations as to the conclusions put forward in an attempt to account for the differences observed. The scenario inspired by the observations that the authors propose as an explanation for the propensity of antimalarial drug resistance to emerge and spread in P. falciparum from SE Asia first before doing so in Africa is plausible and could be supported by the data presented. However, this scenario is based on a set of assumptions whose limitations are not discussed, without any justification as to why this particular scenario is the most likely. Author response: We thank the reviewer for pointing this out. There is indeed evidence of intra-host competition between genotypes of P. falciparum in humans that reduces parasite density 1. The association between parasite density and gametocyte density in humans has also been previously shown 2. The reasons for intra-host competition however remain unknown. Strain-transcending immunity, resource competition (e.g. RBCs), and direct interference between strains have been proposed as possible mechanisms responsible for within-host competition 3-6. It is also possible that lower recombination rates due to lower rate of mixed infections in SEA may reduce the removal of deleterious mutations. We have added these sentences to the Discussion section of the revised manuscript. Author response: There is currently no evidence that P. falciparum strains from SEA are biologically less fit that strains from Africa. Indeed that is the prediction of our model. It would be fascinating to test this hypothesis. We have added these sentences to the Discussion section of the revised manuscript. Author response: It is possible that samples collected may not be representative of the whole country; however the consistent results we obtain for across countries in Africa and SEA suggest that our observations of higher N/S is SEA compared to Africa is robust to random variations introduced due to sampling. What is the nature of the intra-host competition between the genotypes of P. falciparum that circulate in humans, and is there any evidence that this actually alters transmissibility? Immunity is the most obvious selective intra-host factor. To which extend differences in the levels of immunity among the patients that contributed the P. falciparum isolates analysed here alters the multiplicity of the infection or indeed the likelihood of genetic crosses? Is there any evidence that parasite with the higher levels of non-synonymous mutations across the genome are biologically less fit? There is evidence that this is the case for drug resistant parasites in vitro, that for parasites growing in vivo is less straightforward and varies for different drugs. To what extend are the isolates collected from a limited number of febrile patients reflect the overall parasite population across a whole country. Could a bias be introduced simply because of differences in the degree of acquired immunity on admission, the time between the onset of the infection and the time treatment was sought, or the level of admission parasitaemia? It is possible that some biases in sampling may be introduced that could lead to differences in the observed rate of polyclonal infections between Africa and SEA. We separately analysed predicted monoclonal and polyclonal samples and find results similar to those obtained using all samples (Supplementary Figure 1 in the revised manuscript). We thus currently have no hypothesis of how biased sampling could actually lead to higher genome-wide N/S in SEA. Author response: As further explained in the Discussion section, the higher genome scale N/S in SEA is consistent with relaxed negative selection. We hypothesise that relaxed negative selection could be due to low level of polyclonal infection and thus lower within-host competition in SEA.  Author response: Indeed resistance to chloroquine and sulphadoxine-pyrimethamine also appeared independently in South-America (but not in Africa 7). We have very few samples from South America (27 samples from 2 countries) to make robust conclusions. However it is interesting that samples from South America do show low rate of polyclonal infections and higher N/S compared to Africa (Figure 1 and Figure 6). We have added these sentences to the Discussion section of the revised manuscript. Exposure to drug is clearly a strong selective constraint for those genes implicated in overcoming the effects of the drug. What can be the nature of the selective constrains that maintain the higher levels of non-synonymous mutations across numerous genes spread throughout the genome? Clearly not all can be considered to provide some speculative compensatory effect to a potential reduction in fitness following drug selection. It should be pointed out that in the 1950’s resistance to chloroquine first appeared not only in SE Asia but also in Colombia. Similarly resistance to pyrimethamine did not first appear in SE Asia, but in all areas where the use of this drug became widespread and systematic (this was the case in both SE Asia and African countries in the 1950’s). The “unanswered” question as to the reason why drug resistance has repeatedly emerged in SE Asia, might have more to do with drug usage than with some special property of the parasites. Ultimately, I would suggest that the authors should present the limitations of their conclusions as well as alternative scenarios to account for their observations (and rank or discount them if they can). This will surely be welcome by readers. Advances in knowledge are seeded by diversity in speculation (conservative or less so) and genome wide analyses are a rich source for this. Author response: We have significantly expanded the Discussion section which we hope addresses the reviewer’s concerns. We would be happy to address any further concerns that you may have in the revised version of the manuscript. Reference List 1.   Bushman, M. et al. Within-host competition and drug resistance in the human malaria parasite Plasmodium falciparum. Proc. Biol. Sci. 283, 20153038 (2016). 2.   Ross, A., Killeen, G. & Smith, T. Relationships between host infectivity to mosquitoes and asexual parasite density in Plasmodium falciparum. Am. J. Trop. Med. Hyg. 75, 32-37 (2006). 3.   Metcalf, C. J. et al. Partitioning regulatory mechanisms of within-host malaria dynamics using the effective propagation number. Science. %19;333, 984-988 (2011). 4.   Raberg, L. et al. The role of immune-mediated apparent competition in genetically diverse malaria infections. Am. Nat. 168, 41-53 (2006). 5.   Bruce, M. C. & Day, K. P. Cross-species regulation of malaria parasitaemia in the human host. Curr. Opin. Microbiol. 5, 431-437 (2002). 6.   Yap, G. S. & Stevenson, M. M. Blood transfusion alters the course and outcome of Plasmodium chabaudi AS infection in mice. Infect. Immun. 62, 3761-3765 (1994). 7.   Roper, C. et al. Intercontinental spread of pyrimethamine-resistant malaria. Science. %20;305, 1124 (2004). This is a compelling manuscript from Gajinder Pal Singh and Amit Sharma, which conducts an analysis of the accrual of non-synonymous changes within Plasmodium falciparum ( Pf) genes across the breadth of the MalariaGen population genomic dataset. This work builds upon a trend observed in the initial MalariaGen publication(s) that South East Asian (SEA) populations exhibited a general trend of higher Non-synonymous to synonymous mutation ratio (N/S) compared to African populations. The authors replicate and expand this analysis and then compare this trend to a protein dataset comprised of Eukaryotic sequences to show that even among the most conserved sites SEA continues to accrue more NS polymorphisms than African isolates. Lastly the authors set out an argument for the mechanism behind this apparent relaxed negative selection within SEA Pf, attributing it to the lower rate of mixed genotype infections across SEA and the subsequent lack of competition within host. This would create the circumstances that would allow for multiple independent resistance mutations arising and fixing within the SEA population. Exploring general patterns of genome architecture within P. falciparum is vital to understanding the spread and prevention of resistance to anti-malarial drugs, and as a result this work is of interest to the malaria community. There are some clarifications required, and two significant areas of improvement for the paper: Major clarifications: Minor clarifications are as follows: More detailed results/discussion sections, and deeper discussion of previous literature.While the authors cite the MalariaGen publication (citation 18) they should more explicitly describe the relationship between that publication and this analysis. Specifically, highlighting Figure 5 of citation 18 and its supplemental material, which shows a similar genome-wide analysis of N/S ratio between Africa/SEA. It would be good to directly acknowledge this article/figure as the first description of this African/SEA pattern. Furthermore, it would be constructive for the reader if the authors added a deeper justification for their extension of this work. Language used to describe the accrual of Nonsynonymous sites overstates the general trend of NS SNPs at conserved sites. Figure 4 shows that NS SNPs accrue at a higher rate in SEA across the set of conserved sites within Plasmodium and at the set of conserved sites across Eukaryotes. However, the language used to describe this figure in the legend states that “NS polymorphisms… in SEA are more likely to occur at conserved positions.” This statement implies that this is a general genome-wide trend of higher rate of fixing NS SNPs at sites with high conservation. Perhaps extending this legend to mention that NS SNPs are “are more likely to occur at conserved positions when compared to Africa” would avoid this implied generalization. Additionally, on page 3 the authors state “these mutations preferentially occur at the conserved sites”. This statement repeats the highlighted issue above, and adds the word “preferentially” which may create an unintentional teleological statement about the occurrences of NS mutations. NS SNPs occur at a higher rate within the pre-selected subset of conserved sites when compared to Africa, but the figures do not show a general genome-wide preference for NS mutations at conserved sites. “Kelch” is misspelled multiple times throughout the article. “Cambodia” is misspelled in multiple figures. “PNG” Abbreviation should be written out fully. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above. This is a compelling manuscript from Gajinder Pal Singh and Amit Sharma, which conducts an analysis of the accrual of non-synonymous changes within Plasmodium falciparum (Pf) genes across the breadth of the MalariaGen population genomic dataset. This work builds upon a trend observed in the initial MalariaGen publication(s) that South East Asian (SEA) populations exhibited a general trend of higher Non-synonymous to synonymous mutation ratio (N/S) compared to African populations. The authors replicate and expand this analysis and then compare this trend to a protein dataset comprised of Eukaryotic sequences to show that even among the most conserved sites SEA continues to accrue more NS polymorphisms than African isolates. Lastly the authors set out an argument for the mechanism behind this apparent relaxed negative selection within SEA Pf, attributing it to the lower rate of mixed genotype infections across SEA and the subsequent lack of competition within host. This would create the circumstances that would allow for multiple independent resistance mutations arising and fixing within the SEA population. Exploring general patterns of genome architecture within P. falciparum is vital to understanding the spread and prevention of resistance to anti-malarial drugs, and as a result this work is of interest to the malaria community. There are some clarifications required, and two significant areas of improvement for the paper: Author response: We thank you for your review and comments. We would like to clarify that higher N/S in P. falciparum strains from SEA compared to African strains has not been stated or suggested in the MalariaGen manuscript (Elife. 2016). In fact the authors of the MalariaGEN wrote that at the gene level “we found virtually identical distributions of the ratio of non-synonymous to synonymous mutations (N/S ratio) in the two regions”, however, no statistical test was performed by the authors. Furthermore, no comparison of N/S at the sample level was performed in the MalariaGEN study. We have added these statements to the Discussion section of the revised manuscript. Major clarifications: Author response: The aim of Figure 5 of the MalariaGen manuscript (Elife. 2016) was to test whether neutral evolution could account for the pattern of mutations in kelch13. Figure 5a plots genic N/S vs. protein conservation in Africa and SEA. The Figure shows that kelch13 gene is an outlier in the scatter-plot for SEA, but follows a general trend in Africa. Figure 5b plots the ratio of N/S ratios in SEA and Africa, for all genes (with ≥5 synonymous and ≥5 non-synonymous SNPs) and shows again that kelch13 is an outlier. Figure 5—Supplementary Figure 1 is same as Figure 5a, but also highlights other drug resistance genes. Figure 5—Supplementary Figure 2a plots number of non-synonymous SNPs/kbp vs. protein conservation in Africa and SEA and show that kelch13 follows the trend among genes in Africa, but has excess non-synonymous polymorphisms in SEA, while Figure 5—Supplementary Figure 2b plots number of synonymous SNPs/kbp vs. protein conservation in Africa and SEA and shows that k elch13 follows the normal trend among genes in Africa, but has far fewer synonymous SNPs than expected in SEA. More detailed results/discussion sections, and deeper discussion of previous literature. While the authors cite the MalariaGen publication (citation 18) they should more explicitly describe the relationship between that publication and this analysis. Specifically, highlighting Figure 5 of citation 18 and its supplemental material, which shows a similar genome-wide analysis of N/S ratio between Africa/SEA. It would be good to directly acknowledge this article/figure as the first description of this African/SEA pattern. Furthermore, it would be constructive for the reader if the authors added a deeper justification for their extension of this work. Thus the authors conclude that high prevalence of non-synonymous SNPs in kelch13 in SEA is not explainable by neutral evolution, but is consistent with neutral evolution in Africa. We would like to reemphasize that nowhere in the manuscript do the authors state or imply that strains from SEA have a higher N/S than strains from Africa. Author response: We would like to thank the reviewer for pointing out this mistake. We have modified the Figure legend in the revised manuscript to “Non-synonymous polymorphisms specifically observed in SEA are more likely to occur at conserved positions when compared to non-synonymous polymorphisms specifically observed in Africa” as suggested. We have also modified the statement “This may be important for the acquisition of antimalarial drug resistance since these mutations preferentially occur at the conserved sites” to “This may be important for the acquisition of antimalarial drug resistance since drug-resistance mutations preferentially occur at the conserved sites”. Language used to describe the accrual of Nonsynonymous sites overstates the general trend of NS SNPs at conserved sites. Figure 4 shows that NS SNPs accrue at a higher rate in SEA across the set of conserved sites within Plasmodium and at the set of conserved sites across Eukaryotes. However, the language used to describe this figure in the legend states that “NS polymorphisms… in SEA are more likely to occur at conserved positions.” This statement implies that this is a general genome-wide trend of higher rate of fixing NS SNPs at sites with high conservation. Perhaps extending this legend to mention that NS SNPs are “are more likely to occur at conserved positions when compared to Africa” would avoid this implied generalization. Additionally, on page 3 the authors state “these mutations preferentially occur at the conserved sites”. This statement repeats the highlighted issue above, and adds the word “preferentially” which may create an unintentional teleological statement about the occurrences of NS mutations. NS SNPs occur at a higher rate within the pre-selected subset of conserved sites when compared to Africa, but the figures do not show a general genome-wide preference for NS mutations at conserved sites. Minor clarifications are as follows: “Kelch” is misspelled multiple times throughout the article. “Cambodia” is misspelled in multiple figures. “PNG” Abbreviation should be written out fully. Author response: Thanks for pointing out these mistakes; we have corrected these mistakes in the revised manuscript. We would be happy to address any further concerns that you may have about the revised version of the manuscript. In this paper the authors analyze the data produced by the MalariaGen community that provides high quality data on over 900,000 SNP in more than 3000 samples from 22 countries. The authors report a significantly higher ratio of non-synonymous to synonymous polymorphisms in  P. falciparum from South-East Asia compared to Africa and propose that this could explain the emergence of resistance in Asia. However this observation (and also the higher multiplicity of infections in Africa) is not novel. Furthermore the large number of K13 alleles, their dynamics and relative contributions to the clinical phenotype are likely to also be important. I think they could improve by showing that their analysis helps to understand the spread of a specific allele of K13 such as the C580Y. It would be useful if the authors could provide more explanations as to why they believe their findings are innovative and how they help us to understand the evolution of artemisinin resistance, since the mechanism is clearly different to that of other antimalarials. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above. Thank you for your review and comments. In this paper the authors analyze the data produced by the MalariaGen community that provides high quality data on over 900,000 SNP in more than 3000 samples from 22 countries. The authors report a significantly higher ratio of non-synonymous to synonymous polymorphisms in P. falciparum from South-East Asia compared to Africa and propose that this could explain the emergence of resistance in Asia. However this observation (and also the higher multiplicity of infections in Africa) is not novel. Author response: We are not aware of any manuscript that reported higher N/S ratio of P. falciparum of SEA strains of P. falciparum compared to African strains. If the reviewer is referring to the MalariaGen study (Elife. 2016), the authors in that manuscript did not report or suggest these results. In fact the authors of the MalariaGEN wrote that at the gene level “we found virtually identical distributions of the ratio of non-synonymous to synonymous mutations (N/S ratio) in the two regions”, however, no statistical test was performed by the authors. Furthermore, no comparison of N/S at the sample level was performed in the MalariaGEN study. We have added these statements to the Discussion section of the revised manuscript. We agree with the reviewer that the general conclusion of a higher multiplicity of infection in Africa is not novel, indeed we wrote “The rate of mixed infection is generally lower in areas of low-transmission such as SEA ” before we reported our results. However we have confirmed the higher mixed infection rate in Africa compared to SEA in the MalariaGEN dataset, which is the largest whole-genome dataset on P. falciparum till date. We have added these statements to the Discussion section of the revised manuscript. Furthermore the large number of K13 alleles, their dynamics and relative contributions to the clinical phenotype are likely to also be important. I think they could improve by showing that their analysis helps to understand the spread of a specific allele of K13 such as the C580Y. Author response: Our manuscript is an attempt to understand why in general anti-malarial resistance often arises and spreads in SEA and it is beyond the scope of this manuscript to understand why a specific allele of K13 is dominating in certain parts of SEA. It would be useful if the authors could provide more explanations as to why they believe their findings are innovative and how they help us to understand the evolution of artemisinin resistance, since the mechanism is clearly different to that of other antimalarials. Author response: We have modified the Discussion section of the manuscript to clarify how our results differ from the MalariaGEN study. We have also discussed the implications of our results for the current wave of artemisinin resistance. These findings have not been previously reported. As stated previously our manuscript is an attempt to understand why in general anti-malarial resistance often arises and spreads in SEA, but whether and how the evolutionary mechanism of artemisinin resistance is different from other antimalarial is beyond the scope of this manuscript. We would again like to thank you for comments on our manuscript, and would be happy to address any further concerns that you may have about the revised version of the manuscript.
  38 in total

1.  Artemisinin resistance is a clear and present danger.

Authors:  Arjen M Dondorp; Pascal Ringwald
Journal:  Trends Parasitol       Date:  2013-06-12

2.  The role of immune-mediated apparent competition in genetically diverse malaria infections.

Authors:  Lars Raberg; Jacobus C de Roode; Andrew S Bell; Panagiota Stamou; David Gray; Andrew F Read
Journal:  Am Nat       Date:  2006-05-11       Impact factor: 3.926

Review 3.  Cross-species regulation of malaria parasitaemia in the human host.

Authors:  Marian C Bruce; Karen P Day
Journal:  Curr Opin Microbiol       Date:  2002-08       Impact factor: 7.934

4.  Common origin and fixation of Plasmodium falciparum dhfr and dhps mutations associated with sulfadoxine-pyrimethamine resistance in a low-transmission area in South America.

Authors:  Andrea M McCollum; Kristen Mueller; Leopoldo Villegas; Venkatachalam Udhayakumar; Ananias A Escalante
Journal:  Antimicrob Agents Chemother       Date:  2007-02-05       Impact factor: 5.191

Review 5.  Measurement of trends in childhood malaria mortality in Africa: an assessment of progress toward targets based on verbal autopsy.

Authors:  Eline L Korenromp; Brian G Williams; Eleanor Gouws; Christopher Dye; Robert W Snow
Journal:  Lancet Infect Dis       Date:  2003-06       Impact factor: 25.071

6.  Spread of artemisinin resistance in Plasmodium falciparum malaria.

Authors:  Elizabeth A Ashley; Mehul Dhorda; Rick M Fairhurst; Chanaki Amaratunga; Parath Lim; Seila Suon; Sokunthea Sreng; Jennifer M Anderson; Sivanna Mao; Baramey Sam; Chantha Sopha; Char Meng Chuor; Chea Nguon; Siv Sovannaroth; Sasithon Pukrittayakamee; Podjanee Jittamala; Kesinee Chotivanich; Kitipumi Chutasmit; Chaiyaporn Suchatsoonthorn; Ratchadaporn Runcharoen; Tran Tinh Hien; Nguyen Thanh Thuy-Nhien; Ngo Viet Thanh; Nguyen Hoan Phu; Ye Htut; Kay-Thwe Han; Kyin Hla Aye; Olugbenga A Mokuolu; Rasaq R Olaosebikan; Olaleke O Folaranmi; Mayfong Mayxay; Maniphone Khanthavong; Bouasy Hongvanthong; Paul N Newton; Marie A Onyamboko; Caterina I Fanello; Antoinette K Tshefu; Neelima Mishra; Neena Valecha; Aung Pyae Phyo; Francois Nosten; Poravuth Yi; Rupam Tripura; Steffen Borrmann; Mahfudh Bashraheil; Judy Peshu; M Abul Faiz; Aniruddha Ghose; M Amir Hossain; Rasheda Samad; M Ridwanur Rahman; M Mahtabuddin Hasan; Akhterul Islam; Olivo Miotto; Roberto Amato; Bronwyn MacInnis; Jim Stalker; Dominic P Kwiatkowski; Zbynek Bozdech; Atthanee Jeeyapant; Phaik Yeong Cheah; Tharisara Sakulthaew; Jeremy Chalk; Benjamas Intharabut; Kamolrat Silamut; Sue J Lee; Benchawan Vihokhern; Chanon Kunasol; Mallika Imwong; Joel Tarning; Walter J Taylor; Shunmay Yeung; Charles J Woodrow; Jennifer A Flegg; Debashish Das; Jeffery Smith; Meera Venkatesan; Christopher V Plowe; Kasia Stepniewska; Philippe J Guerin; Arjen M Dondorp; Nicholas P Day; Nicholas J White
Journal:  N Engl J Med       Date:  2014-07-31       Impact factor: 91.245

7.  A tight link between orthologs and bidirectional best hits in bacterial and archaeal genomes.

Authors:  Yuri I Wolf; Eugene V Koonin
Journal:  Genome Biol Evol       Date:  2012       Impact factor: 3.416

8.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Authors:  Fabian Sievers; Andreas Wilm; David Dineen; Toby J Gibson; Kevin Karplus; Weizhong Li; Rodrigo Lopez; Hamish McWilliam; Michael Remmert; Johannes Söding; Julie D Thompson; Desmond G Higgins
Journal:  Mol Syst Biol       Date:  2011-10-11       Impact factor: 11.429

9.  Multiple populations of artemisinin-resistant Plasmodium falciparum in Cambodia.

Authors:  Olivo Miotto; Jacob Almagro-Garcia; Magnus Manske; Bronwyn Macinnis; Susana Campino; Kirk A Rockett; Chanaki Amaratunga; Pharath Lim; Seila Suon; Sokunthea Sreng; Jennifer M Anderson; Socheat Duong; Chea Nguon; Char Meng Chuor; David Saunders; Youry Se; Chantap Lon; Mark M Fukuda; Lucas Amenga-Etego; Abraham V O Hodgson; Victor Asoala; Mallika Imwong; Shannon Takala-Harrison; François Nosten; Xin-Zhuan Su; Pascal Ringwald; Frédéric Ariey; Christiane Dolecek; Tran Tinh Hien; Maciej F Boni; Cao Quang Thai; Alfred Amambua-Ngwa; David J Conway; Abdoulaye A Djimdé; Ogobara K Doumbo; Issaka Zongo; Jean-Bosco Ouedraogo; Daniel Alcock; Eleanor Drury; Sarah Auburn; Oliver Koch; Mandy Sanders; Christina Hubbart; Gareth Maslen; Valentin Ruano-Rubio; Dushyanth Jyothi; Alistair Miles; John O'Brien; Chris Gamble; Samuel O Oyola; Julian C Rayner; Chris I Newbold; Matthew Berriman; Chris C A Spencer; Gilean McVean; Nicholas P Day; Nicholas J White; Delia Bethell; Arjen M Dondorp; Christopher V Plowe; Rick M Fairhurst; Dominic P Kwiatkowski
Journal:  Nat Genet       Date:  2013-04-28       Impact factor: 38.330

10.  Population Structure Shapes Copy Number Variation in Malaria Parasites.

Authors:  Ian H Cheeseman; Becky Miller; John C Tan; Asako Tan; Shalini Nair; Standwell C Nkhoma; Marcos De Donato; Hectorina Rodulfo; Arjen Dondorp; Oralee H Branch; Lastenia Ruiz Mesia; Paul Newton; Mayfong Mayxay; Alfred Amambua-Ngwa; David J Conway; François Nosten; Michael T Ferdig; Tim J C Anderson
Journal:  Mol Biol Evol       Date:  2015-11-26       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.