| Literature DB >> 29323249 |
Lisette Meerstein-Kessel1,2, Robin van der Lee1,3, Will Stone2,4, Kjerstin Lanke2, David A Baker5, Pietro Alano6, Francesco Silvestrini6, Chris J Janse7, Shahid M Khan7, Marga van de Vegte-Bolmer2, Wouter Graumans2, Rianne Siebelink-Stoter2, Taco W A Kooij2, Matthias Marti8, Chris Drakeley4, Joseph J Campo9, Teunis J P van Dam1,10, Robert Sauerwein2, Teun Bousema2, Martijn A Huynen11.
Abstract
Plasmodium gametocytes are the sexual forms of the malaria parasite essential for transmission to mosquitoes. To better understand how gametocytes differ from asexual blood-stage parasites, we performed a systematic analysis of available 'omics data for P. falciparum and other Plasmodium species. 18 transcriptomic and proteomic data sets were evaluated for the presence of curated "gold standards" of 41 gametocyte-specific versus 46 non-gametocyte genes and integrated using Bayesian probabilities, resulting in gametocyte-specificity scores for all P. falciparum genes. To illustrate the utility of the gametocyte score, we explored newly predicted gametocyte-specific genes as potential biomarkers of gametocyte carriage and exposure. We analyzed the humoral immune response in field samples against 30 novel gametocyte-specific antigens and found five antigens to be differentially recognized by gametocyte carriers as compared to malaria-infected individuals without detectable gametocytes. We also validated the gametocyte-specificity of 15 identified gametocyte transcripts on culture material and samples from naturally infected individuals, resulting in eight transcripts that were >1000-fold higher expressed in gametocytes compared to asexual parasites and whose transcript abundance allowed gametocyte detection in naturally infected individuals. Our integrated genome-wide gametocyte-specificity scores provide a comprehensive resource to identify targets and monitor P. falciparum gametocytemia.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29323249 PMCID: PMC5765010 DOI: 10.1038/s41598-017-18840-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Data sets used for integration.
| Study | Reference | Species | Life Stage | Integrated Data |
|---|---|---|---|---|
| Fl02 | Florens |
| Spz, Troph, Mer, Gct | MS asexual/Gct |
| La02 | Lasonder |
| Troph, Schiz, Gct | MS asexual/Gct |
| Le03 | LeRoch |
| Spz, Ri, Troph, Schiz, Mer, Gct | T asexual/Gct |
| Ha05 | Hall |
| Ri, Troph, Schiz, Gct | MS + T asexual/Gct |
| Kh05 | Khan |
| Mixed blood stage, Gct | MS asexual/Gct |
| Yo05 | Young |
| Gct | T Gct |
| Ll06 | Llinas |
| All blood stages, synchronized | T asexual |
| Ot10 | Otto |
| All blood stages, synchronized | T asexual |
| Bo11 | Treeck |
| Schiz | MS asexual |
| Lo11 | Lopez-Barragan |
| Troph, Schiz, Gct | T asexual/Gct |
| Oe12 | Oehring |
| Ri, Troph, Schiz | MS asexual |
| Mo14 | Moreno-Perez |
| Ri, Troph, Schiz | MS asexual |
| Ot14 | Otto |
| Ri, Troph, Schiz, Gct | T asexual/Gct |
| Si14 | Silvestrini |
| Troph, Schiz, Gct | MS asexual/Gct |
| Ta14 | Tao |
| Gct | MS Gct |
| Ll15 | Llinas |
| All blood stages, synchronized | T asexual |
| La16 | Lasonder |
| Gct | MS + T Gct |
| Su16 | Suarez-Cortes |
| Gct | MS Gct |
| Mi17 | Miao |
| Gct | MS Gct |
Life stages Spz sporozoites, Ri rings, Troph trohozoites, Schiz schizonts, Mer merozoites, Gct gametocytes. MS mass spectrometry (highest unique peptide count in any of the samples), T transcriptomics (highest percentile in any of the samples).
Data from Miao et al.[35] was integrated after analyses of high scoring proteins, ranks and scores are included in Supplementary Table 2.
*Asexual microarray data by Llinas and others retrieved from plasmoDB version 28 (Data set “Pfal3D7 real-time transcription and decay”), no accompanying publication.
Figure 1Clustered data sets used in this study with genes ranked according to their protein or transcript expression. Level of expression as detected in the respective samples with unique peptide counts for MS data and percentiles for transcriptomics. The studies are clustered using complete linkage according to their overall gene expression similarities (Euclidean distance). See Table 1 for study keys. Distribution of asexual(a)/gametocyte (g) samples (red/blue) is shown in top bar, proteomics (P) and transcriptomics (T) (dark/light grey) in lower bar.
Figure 2Gametocyte-specificity scores for P. falciparum genes derived from proteomics (P) and transcriptomics (T) data sets. (A) Boxplot for integrated scores for the two gold standard sets and all other Pf genes, derived from proteomics, transcriptomics or all data sets (combined). (B) Density of P and T gametocyte scores, individual gold standard genes and their scores are indicated at the bottom (red, asexual, blue gametocyte). (C) 100 highest ranking proteins and transcripts, gametocyte gold standard in blue. (D) Correlation of the gametocyte-specificity scores derived from all integrated MS studies and Pf MS studies only.
Figure 3Validation of Bayesian gametocyte scoring with area under the curve (AUC) values. Integrated data and individual data sets are compared by 10-fold cross-validation (subsampling of gametocyte and asexual gold standard sets). Integrated proteomics (P) and transcriptomics (T) scores in bold lines. P. berghei data sets in shades of red, individual proteomics and transcriptomics studies with short and long dashes, respectively. See Table 1 for study keys.
Figure 4Comparison of reported gametocyte-specific proteins in mass spectrometry studies. (A) Proteins reported as gametocyte-specific by six individual studies, agreements on gametocyte-specificity are summarized in the table. Bayesian: gametocyte-specific proteins (n = 602) that have a score >5 after data integration. The overlap with previously published data sets is shown, but not to scale. Overlap between the individual studies is not shown for better visibility. Note that the Lasonder 2002 study includes proteins that were found in gametocytes or gametocytes and gametes. (B) Proteins that were reported as non-gametocytic and are (partially) included after data integration.
Figure 5Seroprevalence in two cohorts of parasite carriers in The Gambia. (A+B) Antibodies against the highest scoring gametocyte-specific proteins were measured on protein microarrays. Comparison of positivity (mixture-model cutoff) in gametocyte carriers (n = 164) and non-carriers (n = 63). Gametocyte presence determined by microscopy. All individuals were positive for asexual parasites. (A) Prevalence of antigens from the gold standard (n = 40) and predicted gametocyte-specific proteins (n = 30), Mann-Whitney U test (B) Antigens of five predicted gametocyte-specific proteins are preferentially recognized by gametocyte carriers. Error bars indicate the upper limit of the 95% confidence interval around the proportion. p < 0.05 Fisher’s exact test, corrected for multiple testing of a total of 70 antigens (Benjamini-Hochberg).
Properties of putative gametocyte-specific targets.
| Rank | Gene ID | Description | Name | Intron-spanning | Least Ct difference |
|---|---|---|---|---|---|
| 3 | PF3D7_1143600 | conserved Plasmodium protein, unknown function | — | no | 9.37 |
| 5 | PF3D7_1147200 | tubulin–tyrosine ligase, putative | — | no | 8.52 |
| 6 | PF3D7_1026100 | conserved Plasmodium protein, unknown function | — | yes | 12.94 |
| 7 | PF3D7_1438800 | conserved Plasmodium protein, unknown function | — | yes | 8.57 |
| 8 | PF3D7_0625100 | sphingomyelin synthase 2, putative | SMS2 | no | 11.04 |
| 12 | PF3D7_0930000 | procollagen lysine 5-dioxygenase, putative | — | no | 11.96 |
| 14 | PF3D7_0518700 | mRNA-binding protein PUF1 | PUF1 | yes | 8.31 |
| 15 | PF3D7_0303900 | phosphatidylethanolamine-binding protein, putative | — | yes | 10.87 |
| 16 | PF3D7_1466600 | conserved Plasmodium protein, unknown function | — | no | 5.67 |
| 17 | PF3D7_1107900 | mechanosensitive ion channel protein, putative | MSCS | no | 6.33 |
| 18 | PF3D7_1214500 | conserved Plasmodium protein, unknown function | — | yes | 10.61 |
| 24 | PF3D7_1131500 | conserved Plasmodium protein, unknown function | — | no | 10.41 |
| 51 | PF3D7_0929600 | G2 protein, putative | — | yes | 8.37 |
| 61 | PF3D7_0816800 | meiotic recombination protein DMC1, putative | DMC1 | yes | 13.29 |
| 75 | PF3D7_0903800 | LCCL domain-containing protein | CCp4 | yes | 12.70 |
Rank in transcriptomics (all data sets) for specificity in gametocytes. Random sample of top 100, excluding the gold standard. If primers are not intron-spanning, samples were DNase I treated. Ct difference is the difference between the lowest Ct detected in asexual samples and the highest Ct in concentration-matched stage V gametocytes, averaged across strains Pf NF54, NF135, NF166 and NF175.
Figure 6Validation of gametocyte-specific targets in qRT-PCR Targets are sorted for decreasing gametocyte-specificity in all panels, see Table 2. (A) Minimum transcript abundance in blood stage versus gametocytes in different Pf strains. 1000-fold enrichment of transcript in gametocytes over asexuals was assumed when delta-Ct was 10 or higher (dashed line), considering the lowest Ct value detected in any asexual concentration-matched sample. This threshold was not met by the transcripts with gene IDs in grey. (B) Detection limit of eight validated targets alongside Pfs25 in serial dilutions of Pf NF54 asexual stage parasites (ring stage parasites 10–20 hours post invasion). (C) Detection limit of the most sensitive targets in serial dilutions of stage V gametocytes. (A–C) For Pf NF54, all n = 3, other strains n = 2 biological replicates (error bars: standard error of the mean), all measurements in triplicates. (D) Sensitivity of eight validated targets in Kenyan blood samples of varying gametocyte densities.