| Literature DB >> 33930021 |
Lisette Meerstein-Kessel1,2, Jeron Venhuizen1, Daniel Garza1, Nicholas I Proellochs2, Emma J Vos1, Joshua M Obiero3, Philip L Felgner3, Robert W Sauerwein1, Marynthe Peters1, Annie S P Yang2, Martijn A Huynen1.
Abstract
Plasmodium species, the causative agent of malaria, have a complex life cycle involving two hosts. The sporozoite life stage is characterized by an extended phase in the mosquito salivary glands followed by free movement and rapid invasion of hepatocytes in the human host. This transmission stage has been the subject of many transcriptomics and proteomics studies and is also targeted by the most advanced malaria vaccine. We applied Bayesian data integration to determine which proteins are not only present in sporozoites but are also specific to that stage. Transcriptomic and proteomic Plasmodium data sets from 26 studies were weighted for how representative they are for sporozoites, based on a carefully assembled gold standard for Plasmodium falciparum (Pf) proteins known to be present or absent during the sporozoite life stage. Of 5418 Pf genes for which expression data were available at the RNA level or at the protein level, 975 were identified as enriched in sporozoites and 90 specific to them. We show that Pf sporozoites are enriched for proteins involved in type II fatty acid synthesis in the apicoplast and GPI anchor synthesis, but otherwise appear metabolically relatively inactive in the salivary glands of mosquitos. Newly annotated hypothetical sporozoite-specific and sporozoite-enriched proteins highlight sporozoite-specific functions. They include PF3D7_0104100 that we identified to be homologous to the prominin family, which in human has been related to a quiescent state of cancer cells. We document high levels of genetic variability for sporozoite proteins, specifically for sporozoite-specific proteins that elicit antibodies in the human host. Nevertheless, we can identify nine relatively well-conserved sporozoite proteins that elicit antibodies and that together can serve as markers for previous exposure. Our understanding of sporozoite biology benefits from identifying key pathways that are enriched during this life stage. This work can guide studies of molecular mechanisms underlying sporozoite biology and potential well-conserved targets for marker and drug development.Entities:
Year: 2021 PMID: 33930021 PMCID: PMC8115857 DOI: 10.1371/journal.pcbi.1008067
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Positive gold standard members, based on the presence of proteins in western blot or immunofluorescent assay data.
| Gene ID | Name | Evidence |
|---|---|---|
| PF3D7_0104000 | Thrombospondin-related sporozoite protein | Labaied |
| PF3D7_0107600 | Eukaryotic translation initiation factor 2-α kinase 2, putative | Matuschewski |
| PF3D7_0207300 | Serine repeat antigen 8 | Arisue |
| PF3D7_0304000 | Inner membrane complex protein 1a, putative | Khater |
| PF3D7_0304600 | Circumsporozoite (CS) protein | Singh |
| PF3D7_0408600 | Sporozoite invasion-associated protein 1 | Arevalo-Pinzon |
| PF3D7_0408700 | Perforin-like- protein 1 | Yang |
| PF3D7_0417100 | mRNA-binding protein PUF2 | Gomes-Santos |
| PF3D7_0511400 | Unknown function protein | Schlarman |
| PF3D7_0517600 | F-actin-capping protein subunit β, putative | Ganter |
| PF3D7_0615100 | Enoyl-acyl carrier reductase | Vaughan |
| PF3D7_0626300 | 3-oxoacyl-acyl-carrier protein synthase I/II | Vaughan |
| PF3D7_0702300 | Sporozoite threonine and asparagine-rich protein | Fidock |
| PF3D7_0809100 | Erythrocyte membrane protein 1 | Zanghi |
| PF3D7_0812300 | Sporozoite surface protein 3, putative | Harupa |
| PF3D7_0816500 | Small heat shock protein HSP20, putative | Montagna |
| PF3D7_0822700 | Thrombospondin-related protein 1, putative | Klug |
| PF3D7_0830300 | Sporozoite invasion-related protein 2 | Siau |
| PF3D7_1137800 | Sporozoite surface protein essential for liver stage development | Al-Nihmi |
| PF3D7_1138200 | Unknown function protein | Schlarman |
| PF3D7_1147000 | Sporozoite and liver stage asparagine-rich protein | Silvie |
| PF3D7_1201300 | Liver stage associated protein 1 | Siau |
| PF3D7_1207300 | LIMP protein, putative | Santos |
| PF3D7_1208200 | Cysteine repeat modular protein 3 | Douradinha |
| PF3D7_1216600 | Cell traversal protein for ookinetes and sporozoites | Bergmann-Leitner |
| PF3D7_1221400 | Inner membrane complex protein 1h, putative | Tremp |
| PF3D7_1302200 | Protein UIS3 | Matuschewski |
| PF3D7_1335900 | Thrombospondin-related anonymous protein | Ejigiri |
| PF3D7_1342500 | Sporozoite protein essential for cell traversal | Yang |
| PF3D7_1442600 | TRAP-like protein | Steinbuechel |
| PF3D7_1475400 | Cysteine repeat modular protein 4 | Douradinha |
Fig 1Bayesian data integration identifies sporozoite-specific genes in P. falciparum.
Bayesian score distributions of the proteins from the negative gold standard, the positive gold standard and the remaining proteins when the integration is done A) using only RNA data, B) only proteomic data and C) RNAdata plus proteomic data. D) cross validation of our predictions (5-fold cross validation) with all data sets (Black solid line) and predictions with individual data sets (dashed colored lines).
Fig 2Predicted membrane topology of Pf3D7_0104100, a sporozoite-specific protein that is homologous to the prominin/CD133 protein family.
The level of polymorphisms among P. falciparum strains is indicated for the separate regions using the color scale below the cartoon of the protein as the average number of polymorphisms per nucleotide in the strains in PlasmoDB. PF3D7_0104100 has a high density of polymorphisms within P. falciparum strains that are concentrated in the second extracellular loop. Cysteines that are conserved among the homologs in Plasmodium species are indicated. The cysteines that are conserved also in human homologs are in bold. Note that the conserved cysteines occur in close proximity to each other, suggesting the formation of disulphide bonds.
Fig 3Apicoplast fatty acid synthesis proteins are enriched among sporozoites.
The width of the arrows is determined by the Bayesian score reflecting the level of over representation of that enzyme in sporozoites, e.g. 16 for ACS2 and 8 for FabB/F (S3 Table). For PDH that consists of three proteins, the width of the arrow was determined by the average of those three. Most of FASII proteins are enriched in sporozoites, except PKII, FABZ and LipA. The scheme is a simplification of the pathway as depicted by Shears et al.[50], to which ACS2 was added as it is highly enriched in sporozoites and relevant for fatty acid synthesis.
Putative markers of exposure to sporozoites.
Genes selected by the greedy method to cover all volunteers with their gene ID, function and number of volunteers with antibodies after CPS immunization as reported by Obiero et al. [13]. The non-synonymous SNPs are given as a proxy for genetic variability, with PlasmoDB and two sequenced laboratory strains as reference.
| Gene ID | function | No. (%) of volunteers with antibodies | Non-syn SNP/kb PlasmoDB | Non-syn SNPs | |
|---|---|---|---|---|---|
| NF135 | NF166 | ||||
| PF3D7_0630600 | Conserved hyp | 20 (52.6) | 2.8 | 0 | 1 |
| PF3D7_0906500 | Arginase | 19 (50.0) | 6.5 | 1 | 1 |
| PF3D7_1456700 | Conserved hyp. | 19 (50.0) | 2.9 | 0 | 0 |
| PF3D7_0719700 | 40S ribosomal S10 | 15 (39.5) | 0 | 0 | 0 |
| PF3D7_1219100 | Clathrin heavy chain | 12 (31.6) | 5.5 | 0 | 2 |
| PF3D7_0301700 | Hypothetical exp. | 10 (26.3) | 6.3 | 0 | 1 |
| PF3D7_1122700 | Conserved hyp. | 9 (23.7) | 7.8 | 0 | 0 |
| PF3D7_1455800 | LCCL prot. | 6 (15.8) | 4.1 | 0 | 0 |