| Literature DB >> 30514877 |
Lei Zhu1, Jaishree Tripathi1, Frances Maureen Rocamora1, Olivo Miotto2,3,4,5, Rob van der Pluijm2,3, Till S Voss6,7, Sachel Mok8, Dominic P Kwiatkowski2,3,4,5, François Nosten2,9, Nicholas P J Day2,3, Nicholas J White2,3, Arjen M Dondorp2,3, Zbynek Bozdech10.
Abstract
The predisposition of parasites acquiring artemisinin resistance still remains unclear beyond the mutations in Pfk13 gene and modulation of the unfolded protein response pathway. To explore the chain of casualty underlying artemisinin resistance, we reanalyze 773 P. falciparum isolates from TRACI-study integrating TWAS, GWAS, and eQTL analyses. We find the majority of P. falciparum parasites are transcriptomically converged within each geographic site with two broader physiological profiles across the Greater Mekong Subregion (GMS). We report 8720 SNP-expression linkages in the eastern GMS parasites and 4537 in the western. The minimal overlap between them suggests differential gene regulatory networks facilitating parasite adaptations to their unique host environments. Finally, we identify two genetic and physiological backgrounds associating with artemisinin resistance in the GMS, together with a farnesyltransferase protein and a thioredoxin-like protein which may act as vital intermediators linking the Pfk13 C580Y mutation to the prolonged parasite clearance time.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30514877 PMCID: PMC6279830 DOI: 10.1038/s41467-018-07588-x
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1P. falciparum transcriptome in the GMS. a Violin plot representing the distribution of estimated gametocyte proportion (GAM Prop.; left) and asexual parasite age (HPI; right) for all the GMS sites arranged in order of median parasites clearance time (in response to artemisinin treatment) (middle) from high to low. Black dot within violin indicates the statistical mean of each category. Magenta (blue) asterisks denote significantly high (low) values at p-value < 0.05 (Wilcoxon rank-sum test). b Distribution of Tcvs is displayed in density plot to show the local convergence of parasite transcriptome at each TRACI-study sites of the 773 isolates. The color from magenta to gray to blue represents the Tcvs scores from negative to zero to positive. The isolates number with Tcvs > 0 (blue number) and Tcvs ≤ 0 (magenta number) are shown together with the density plot at each site. The geographic map of Southeast Asia is originally downloaded from [https://freevectormaps.com/world-maps/southeast-asia] and modified using Photoshop
Fig. 2Genetic and transcriptomic population structure of P. falciparum isolates by geography. a Neighbor-joining tree showing parasite transcriptome population structure across GMS using HPI and GAM adjusted expression data. A magnified view of the branch structure is shown in the circular subpanel below the tree. Colored asterisks on branching point indicate an over-representation of cohort isolates from the respective sites (n > 10 and p-value < 0.05, hypergeometric test). The black dotted line is drawn at the main branch separating the majority of the w- and e-GMS samples. b Heat map of geF for 5061 genes across 11 field sites of the GMS. The length of scale bar is 0.1 as shown in bottom. Six groups (Grp1 to Grp6) were used to characterize the expression differences between w- and e-GMS which was obtained by K-mean clustering based on Euclidean distance. Density plot of geF was generated to compare expression prevalence of each group in w-GMS (blue curve) and e-GMS (black curve shaded in gray) isolates. Enriched MPM pathways for Grp1, Grp2, and Grp6 shown on the right were determined by hypergeometric test at p-value < 0.05. Full list of the enriched pathways are shown in Supplementary Data 4
Fig. 3eQTL analysis for P. falciparum isolates in the GMS. a Scatter plot of 5575 high-confidence SNP-expression linkages with 2350 gene expression (y axis) against 3972 eQTLs (x axis). The statistical significance of each linkage is represented by the size of dot and the intensity of blue. The bar graph represents target genes for each eQTL. The scatter plot below represents eQTL enrichment along the chromosomes. The enrichment score was calculated as a ratio of observed and expected eQTL counts within a 50 kb window. The magenta dots denote eQTL hotspots. b Intra-chromosomal linkages are plotted as the significance (log transformed p-values) against the distance (between eQTL and target gene) for each linkage. The black lines represent average curves for a sliding window of 20 eQTLs. c Differential eQTL effects in isolates of (sub)regions of the GMS. LOD values are plotted for the w-GMS eQTLs against the e-GMS eQTLs where yellow dots stand for the local cis linkages and blue for the distal. The top right inset represents LOD values resulted from sub-sampling analysis of the 5575 high-confidence linkages in e-GMS (Methods)
Fig. 4Integrative analysis of artemisinin-resistant parasites. a Manhattan plot of GWAS for artemisinin resistance in e-GMS (upper) and w-GMS (lower). SNPs passing p-value < 1e−5 are displayed in black circles above dot lines. b Manhattan plot of TWAS for artemisinin resistance. Upregulated and downregulated genes displayed in two directions (Up and Down) were defined by positive and negative Spearman’s rho. The horizontal dotted lines represent p-value = 0.01 (Spearman’s rho test). Genes observed in both w- and e-GMS at p-value < 0.01 (Spearman’s rho test) and FDR < 0.25 (Benjamini & Hochberg correction) are black circled and labeled with gene name/id in the e-GMS plot. The relationships between parasite clearance time (pink blocks), gene expression (blue blocks) and SNP markers (yellow blocks) in e-GMS are characterized into five putative causality modes. c SNP affects clearance time dependent on expression and vice versa. d Clearance time and expression independently affects same SNP(s). e SNP affects clearance time independent of expression. f Expression mediated genetic effects on clearance time. g Expression affects clearance time independently of SNPs. Blocks are linked at p-value < 1e−05 for SNP-expression (eQTL) and SNP-clearance time (GWAS), and at p-value < 0.01 and FDR < 0.25 for expression-clearance time (TWAS). Corresponding gene/SNP name(s)/ID(s) or numbers are listed above blocks accordingly for each mode. Examples of resistance-associated relationships are shown in scatterplots with genes of farnesyltransferase (PF3D7_1242600) and thioredoxin-like (PF3D7_0717699) for mode c, 5′–3′ exonuclease (PF3D7_0204600) for mode d, 6-cysteine (PF3D7_0404900) and MDR1 (PF3D7_0523000) for mode f. Light blue used in scatterplots stand for isolates with reference alleles (ref.) and magenta for the alternative (alt). Expression is compared between alleles in boxplot on the right for each gene with the eQTL p-values labeled above