| Literature DB >> 29669882 |
Maureen A Carey1, Vincent Covelli2, Audrey Brown3, Gregory L Medlock4, Mareike Haaren3, Jessica G Cooper3, Jason A Papin2,4, Jennifer L Guler5,3.
Abstract
Metabolomics is increasingly popular for the study of pathogens. For the malaria parasite Plasmodium falciparum, both targeted and untargeted metabolomics have improved our understanding of pathogenesis, host-parasite interactions, and antimalarial drug treatment and resistance. However, purification and analysis procedures for performing metabolomics on intracellular pathogens have not been explored. Here, we purified in vitro-grown ring-stage intraerythrocytic P. falciparum parasites for untargeted metabolomics studies; the small size of this developmental stage amplifies the challenges associated with metabolomics studies as the ratio between host and parasite biomass is maximized. Following metabolite identification and data preprocessing, we explored multiple confounding factors that influence data interpretation, including host contamination and normalization approaches (including double-stranded DNA, total protein, and parasite numbers). We conclude that normalization parameters have large effects on differential abundance analysis and recommend the thoughtful selection of these parameters. However, normalization does not remove the contribution from the parasite's extracellular environment (culture media and host erythrocyte). In fact, we found that extraparasite material is as influential on the metabolome as treatment with a potent antimalarial drug with known metabolic effects (artemisinin). Because of this influence, we could not detect significant changes associated with drug treatment. Instead, we identified metabolites predictive of host and medium contamination that could be used to assess sample purification. Our analysis provides the first quantitative exploration of the effects of these factors on metabolomics data analysis; these findings provide a basis for development of improved experimental and analytical methods for future metabolomics studies of intracellular organisms.IMPORTANCE Molecular characterization of pathogens such as the malaria parasite can lead to improved biological understanding and novel treatment strategies. However, the distinctive biology of the Plasmodium parasite, including its repetitive genome and the requirement for growth within a host cell, hinders progress toward these goals. Untargeted metabolomics is a promising approach to learn about pathogen biology. By measuring many small molecules in the parasite at once, we gain a better understanding of important pathways that contribute to the parasite's response to perturbations such as drug treatment. Although increasingly popular, approaches for intracellular parasite metabolomics and subsequent analysis are not well explored. The findings presented in this report emphasize the critical need for improvements in these areas to limit misinterpretation due to host metabolites and to standardize biological interpretation. Such improvements will aid both basic biological investigations and clinical efforts to understand important pathogens.Entities:
Keywords: Plasmodium falciparum; apicomplexan parasites; intracellular pathogen; metabolomics
Mesh:
Substances:
Year: 2018 PMID: 29669882 PMCID: PMC5907652 DOI: 10.1128/mSphere.00097-18
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
FIG 1 Metabolomics pipeline and metabolite identification. (A) Metabolomics purification and analysis pipeline. (Step 1) Laboratory-adapted P. falciparum clones are cultured in host erythrocytes. Parasite count is collected at this step (total erythrocyte number multiplied by percent parasitemia yields total parasite value; see Materials and Methods). (Step 2) If enriching for late-stage parasites is desired, cultures are passed through a magnetic column to retain paramagnetic late-stage-infected erythrocytes. Note that this was not done for the present study. iHost, infected host erythrocytes; uHost, uninfected host erythrocytes. (Step 3) Host erythrocytes are lysed using saponin, but parasites remain intact. Samples are washed to remove hemoglobin and other intracellular host material and quenched on liquid nitrogen. Total protein is quantified at this step (prior to freezing). (Step 4) Soluble metabolites are extracted from precipitated protein using methanol and centrifugation. Double-stranded DNA is quantified at this step. (Step 5) Metabolites are separated via liquid chromatography and identified using mass spectroscopy. Metabolite spectra are compared to a library of authenticated standard metabolites for high-confidence identification. (Step 6) Abundance data for each metabolite are normalized to an appropriate parameter (i.e., DNA content or parasite number), log transformed, centered with respect to the median, and scaled with respect to variances, prior to employing statistical comparisons. (B) Experimental comparison. All samples were grown in RPMI media supplemented with AlbuMAX and hypoxanthine and with one of three blood batches (matched across treatment conditions). At the early ring stage (<3 h postinvasion), 10 samples were treated with dihydroartemisinin (DHA; 700 nM) for 6 h and 10 samples were matched with respect to protocol and condition (blood batch, medium batch, and stage) without drug treatment (see Table S3). Images shown were taken at the 6-h time point (×100 magnification); dormancy was observed at 24 h. (C) Summary of identified metabolites. Metabolites (each represented by one point) from various metabolic subgroups were not uniformly detected in all five replicates for any sample group. How frequently a metabolite was measured across replicates is indicated by the metabolite point placed in data corresponding to 1 to 5 replicates (y axis). The majority of metabolites detected were lipid species, as indicated by the large number of blue dots. A full list of identified metabolites is provided in the supplemental material.
FIG 2 Host persistence is detected using multiple approaches. (A) Visualization of parasites within erythrocyte ghosts. Fluorescent imaging (×40 magnification) reveals parasites (blue, DAPI) retained within erythrocyte ghosts (red, phycoerythrin-conjugated CD235a antibody) following saponin treatment. Approximately 70% of the parasites remain associated with host membranes (see Table S2). (B) Sample characteristics. Samples were evaluated for levels of double-stranded DNA (dsDNA; quantified in micrograms per milliliter on the x axis), protein amounts (black; quantified in micrograms on the y axis [ranging from 67.0641 to 130.0936 μg] in the left panel), and parasite counts (blue; quantified on the y axis [ranging from 1,306,500 to 6,946,875 parasites] in the center panel) prior to analysis. The total number of metabolites detected per sample (red; quantified on the y axis [ranging from 182 to 267 metabolites] in the right panel) was significantly correlated with sample dsDNA quantification (P = 9.8 × 10−5; r2 = 0.76). Protein amount and parasite count were not significantly correlated with dsDNA. The fit line uses a linear model, and the shaded region represents the standard error.
Parameters in metabolomics analysis of intracellular parasites, including Plasmodium
| Parameter | Option(s) | Factor(s) to consider |
|---|---|---|
| Growth conditions | ||
| Late stage | Larger in size (3–10 µm), polyploid genome; can use magnetic | |
| Mixed stages | Effects of stage variation on data | |
| Media batches | Relevant if using serum-based media formulations | |
| Additional controls | Uninfected erythrocytes | Used to identify or control for host metabolites; used in addition to |
| Enrichment methods | ||
| Magnetic purification | Increases parasite-to-host ratio ( | |
| Metabolite detection | NMR | Limited metabolite detection but higher confidence |
| Radio labeling | Targeted approach with high confidence | |
| Single metabolite assays | High-confidence, targeted approach with low throughput | |
| Preanalysis normalization | Cell number normalization | Can be combined with any postanalysis normalization but requires sample |
| Postanalysis normalization | ||
| Internal standards | Dependent on metabolomics facilities | |
| Centering | Mean | Standard centering |
| Other | See reference | |
| Scaling | ||
| Z-scoring | Requires control samples (i.e., untreated or uninfected erythrocytes) | |
| Statistical analysis | ||
Note that most parameters do not have strict recommendations, as they are dependent on experimental design. Bolded text indicates methods that were employed and/or evaluated during this study. NMR, nuclear magnetic resonance.
FIG 3 Normalization approaches impact the final metabolite abundance. Normalization controls for sample-to-sample variation were performed. Normalization requires sample metabolite abundance to be divided by the quantified normalization factor, the sample variable (the equation is in the blue box; normalization factors are shown left of the box). The examples of results shown in the table indicate abundances of X metabolites given several different sample metrics for normalization. For example, identical samples with different cell counts (sample 1 and sample 2) reveal the importance of normalization; without it, the data corresponding to the identical samples show a 2-fold difference in the values determined for metabolite X. The values determined for identical parasite samples 3 and 4 also show a nearly 2-fold difference in metabolite abundance after normalizing to protein levels, due to host bias for protein measures.
FIG 4 Metabolomes are dependent on the normalization approach and are influenced by extraparasite contamination. (A to D) Normalization affects metabolome similarity. (A to D) Principal-component (PC) analysis was performed prior to normalization (A) as well as after using three different normalization methods (DNA normalization [B], total protein normalization [C], and parasite count normalization [D]) on all identified metabolites. PERMANOVA significance is listed for each grouping. (E) Metabolites associated with components of media. The raw abundance of 82 metabolites was correlated with phenol red levels (unnormalized column), using a two-sided Pearson’s product moment correlation with Benjamini and Hochberg false-discovery rate correction. These associations were not removed with parasite number and protein normalization. DNA normalization best removes associations with components of media (increases in numbers of insignificant [gray] correlations); only 39% of correlations remain. (F) Removal of medium-associated metabolites. Principal-component analysis (PCA) of DNA-normalized samples with phenol red-correlated metabolites removed from the data set yielded no improvement in sample clustering.
FIG 5 Random forest analysis. (A) Building a random forest classifier. Samples are randomly classified into subsets (training and test data sets); from the training subsets, decision trees are built to separate samples into groups (see panel B). Trees are evaluated by testing classification performance on the remaining samples from the test data sets. See Materials and Methods for more details on the analyses. (B) Evaluating metabolite importance. Metabolite importance is calculated by determining the effect of removal of the metabolite from the data set on classifier performance. See Materials and Methods for further details.
FIG 6 Blood batch and antimalarial treatment influence metabolomes. (A) Classifier performance. Classifiers were built to predict blood batch or treatment conditions using the metabolomics data with or without 4 normalization approaches. The classifier error rate varies with the normalization approach. (B) The normalization method determines the important metabolites. A sample consisting of five metabolites associated with improved or worsened classifier accuracy is shown. These metabolites are shown in accordance with their importance in classifier performance and their interesting behavior across classifiers. Upward-pointing arrows indicate that the metabolite improves classifier accuracy in one classifier, and downward-pointing arrows indicate they worsen accuracy in one classifier (arrows represent the normalization approaches from panel A); if the metabolite does not improve or worsen accuracy, a dash is shown. Contradictory results (both upward-pointing and downward-pointing arrows for one metabolite) indicate that the normalization method changes the importance of the metabolite. Note that valyl leucine, hypoxanthine, and phenol red were removed upon phenol red filtering and, therefore, are present in only 4 classifiers, as indicated by the four arrows and dashes.