| Literature DB >> 18270564 |
Yingyao Zhou1, Vandana Ramachandran, Kota Arun Kumar, Scott Westenberger, Phillippe Refour, Bin Zhou, Fengwu Li, Jason A Young, Kaisheng Chen, David Plouffe, Kerstin Henson, Victor Nussenzweig, Jane Carlton, Joseph M Vinetz, Manoj T Duraisingh, Elizabeth A Winzeler.
Abstract
A fundamental problem in systems biology and whole genome sequence analysis is how to infer functions for the many uncharacterized proteins that are identified, whether they are conserved across organisms of different phyla or are phylum-specific. This problem is especially acute in pathogens, such as malaria parasites, where genetic and biochemical investigations are likely to be more difficult. Here we perform comparative expression analysis on Plasmodium parasite life cycle data derived from P. falciparum blood, sporozoite, zygote and ookinete stages, and P. yoelii mosquito oocyst and salivary gland sporozoites, blood and liver stages and show that type II fatty acid biosynthesis genes are upregulated in liver and insect stages relative to asexual blood stages. We also show that some universally uncharacterized genes with orthologs in Plasmodium species, Saccharomyces cerevisiae and humans show coordinated transcription patterns in large collections of human and yeast expression data and that the function of the uncharacterized genes can sometimes be predicted based on the expression patterns across these diverse organisms. We also use a comprehensive and unbiased literature mining method to predict which uncharacterized parasite-specific genes are likely to have roles in processes such as gliding motility, host-cell interactions, sporozoite stage, or rhoptry function. These analyses, together with protein-protein interaction data, provide probabilistic models that predict the function of 926 uncharacterized malaria genes and also suggest that malaria parasites may provide a simple model system for the study of some human processes. These data also provide a foundation for further studies of transcriptional regulation in malaria parasites.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18270564 PMCID: PMC2215772 DOI: 10.1371/journal.pone.0001570
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Genes upregulated 36 and 40 hpi in liver stages relative to blood stages.
|
|
| Function | Rank | Description |
| PF14_0280 | PY01269 | Signaling | 6, 110 | Phosphotyrosyl phosphatase activator |
| PFL0955c | PY01852 | Fatty-acid metabolism | 8, 9 | Phosphotidylethanolamine-binding protein |
| PFC0210c | PY03168 | Sporozoite | 14, 7 | Circumsporozoite (CS) protein |
| PFB0130w | PY00446 | AP-localized fatty acid biosynthesis | 40, 120 | Polyprenyl synthetase |
| PF08_0066 | PY00573 | AP-localized fatty acid biosynthesis | 47, 19 | Lipoamide dehydrogenase |
| MAL13P1.119 | PY01857 | Purine metabolism | 39,88 | CAMP-specific 3′,5′-cyclic phosphodiesterase 4B |
| PFL0475w | PY01829 | Purine metabolism | 38, 103 | 3′,5′-cyclic-nucleotide phosphodiesterase |
| PF14_0227 | PY07390 | Signaling | 85,31 | Calcium-dependent protein kinase |
| PFL1260w | PY03530 | Folate biosynthesis | 45, 69 | Hydrolase/phosphatase |
| PF11_0117 | PY01741 | DNA replication | 84, 222 | Replication factor C subunit 5 |
| PF10_0330 | PY00468 | Signaling | 122, 49 | Ubiquitin-conjugating enzyme |
| PFL2250c | PY00403 | Signaling | 78, 32 | Rac-beta serine/threonine protein kinase |
| MAL13P1.95 | PY03801 | Iron metabolism | 66, 374 | Ferredoxin |
| PFD0825c | PY04369 | Translation | 93, 565 | RNA-binding protein of pumilio/mpt5 family |
| PFD0215c | PY01340 | Sporozoite | 60, 119 |
|
| PFL2460w | PY01337 | Cytoskeleton | 62, 266 | Coronin |
| PFC0831w | PY00756 | AP-localized fatty acid biosynthesis | 130, 76 | Triosephophate isomerase |
| PFD0260c | PY04387 | Adhesion to CD36 | 123, 1 | Sequestrin |
| PFL2510w | PY00008 | Sporozoite | 120, 155 | Chitinase |
| PFE0175c | PY00345 | Sporozoite | 184, 278 | Unconventional myosin pfm-b |
| PFL2210w | PY05459 | Porphyrin metabolism | 208, 483 | delta-aminolevulinic acid synthetase |
| PF13_0128 | PY01586 | AP-localized fatty acid biosynthesis | 91,26 | Beta-hydroxyacyl-acp dehydratase |
| PFB0325c | PY02063 | Sporozoite | 110, 154 | Cysteine protease |
| MAL13P1.190 | PY02721 | Protein degradation | 107, 258 | Proteasome regulatory component |
| PFI0955w | PY05332 | Transport | 89, 28 | Sugar transporter |
The rank indicates the results from two independent cDNA subtraction experiments. Genes described as “hypothetical” are not listed. The rank is derived from comparing levels in P. yoelii asexual stages to liver stages and is calculated only for genes of at least six probes and changing more than 3-fold within the examined conditions (54 for 3,450 P. falciparum and P. yoelii gene pairs total). Genes listed “sporozoite” are highly expressed in salivary gland sporozoites, and their continued detection may be the result of some sporozoites failing to develop, or may indicate that there is a continued need for the gene product in the liver, potentially because the product is involved in immune response modulation.
Figure 1Temporal expression patterns were constructed from 54 P. falciparum and P. yoelii life cycle samples.
A total of 156 statistically enriched gene clusters identified by OPI analysis illustrates the transcription regulation characteristics of all key biological processes in Plasmodium species. Their yeast and human orthologs contents are represented by the white-blue heatmap, indicating parasite-specific processes generally found fewer orthologs in model organisms. The percentage of proteins that form statistically significant within-cluster networks are also white-blue color coded; most networks occur in blood stage processes. Altogether 33 manuscripts were identified with significant overlap to the clusters, nine of which [13], [27], [28], [36], [42], [60], [68], [69], [70] are referenced in the figure. Two clusters were enriched for proteins predicted to have a parasite export signal [68] and were labeled as “Exported proteins (sporozoite)” and “Exported proteins (trophozoite)”—one of which peaks in the trophozoite stage and a second which peaks in sporozoite stages (see GO:PM15591202_Trp and GO:PM15591202_Spo in Table S1-S2). S & T indicate that the P. falciparum parasites were synchronized within the asexual cycle by the thermocycling or sorbitol method [9]. The figure does not comprehensively describe all gene expression patterns contained within the data as there are ∼1,026 genes which are not found in any of the groups depicted here because they do not share expression patterns with a sufficient number of previously characterized genes.
Example of a gene expression cluster enriched for genes with roles in RNA polymerase nucleolus I activity (GO:0005730).
|
| InGO | Yeast name | Yeast function | Current |
| PF11_0305 | Y | YNL061W | Probable RNA m(5)C methyltransferase, essential for processing and maturation of 27S pre-rRNA and large ribosomal subunit biogenesis | hypothetical protein |
| PF11_0358 | Y | YPR010C | RNA polymerase | RNA polymerase, beta subunit |
| PF11_0471 | N | YCR072C | WD-repeat protein involved in ribosome biogenesis | hypothetical protein |
| PF13_0219 | N | YMR229C§ | Protein required for the synthesis of both 18S and 5.8S rRNA | hypothetical protein |
| PF07_0121 | N | YHR170w | Protein involved in nuclear export of the large ribosomal subunit | hypothetical protein |
| PF10_0200 | N | YNL132w | ribosome biogenesis and assembly | hypothetical protein |
| PF13_0165 | N | Required for 18S rRNA and 40S ribosomal subunit production in Schizosaccharomyces pombe | multidomain scavenger receptor protein PbSR precursor | |
| PF08_0055 | Y | YNL175w | u3 small nucleolar ribonucleoprotein protein, putative | hypothetical protein |
| PF11_0090 | N | YGR103w | NOP7 | hypothetical protein |
| PF14_0734 | N | hypothetical | ||
| PFD1175w | N | YMR173w | Interacts with ribosomal proteins by two hybrid | hypothetical protein |
| PF14_0677 | N | YOL010W | RNA terminal phosphate cyclase-like protein involved in rRNA processing at sites A0, A1, and A2 | RNA 3′-Terminal Phosphate Cyclase-like protein |
| PFE0465c | Y | YOR341W | RNA polymerase I; | RNA polymerase I |
| PFI1040c | N | YLR101C§ | Dubious ORF | hypothetical protein |
| PFL1345c | N | YPL086C | Subunit of Elongator complex, which is required for modification of wobble nucleosides in tRNA | hypothetical protein |
| PF11_0305 | Y | YNL061W | Probable RNA m(5)C methyltransferase, essential for processing and maturation of 27S pre-rRNA and large ribosomal subunit biogenesis | hypothetical protein |
| PF11_0358 | Y | YPR010C | RNA polymerase | RNA polymerase, beta subunit |
| PF11_0471 | N | YCR072C | WD-repeat protein involved in ribosome biogenesis | hypothetical protein |
For brevity, only P. falciparum genes (15 of 28 total) are shown. The heading “InGO” indicates whether or not the gene was annotated as being part of the GO category and used to determine cluster boundaries. Yeast genes indicated with a § were matched by BLAST searching the P. falciparum or P. yoelii protein against the Saccharomyces cerevisiae translated genome and selecting the best scoring hit, a necessity as many of these genes are small. YLR101C is antisense to an essential gene and is considered “dubious.”
Figure 2An automatic pipeline for malaria literature mining.
Approach A, full text search by literature search engines: A1) All P. falciparum and P. yoelii locus names were downloaded from PlasmoDB and searched against Google Scholar and SCIRUS one at a time; A2) URL hits were then mapped to PubMed entries. Approach B, NCBI database mining: B1) Mapping between GenBank sequence entries and PubMed entries were systematically retrieved from NCBI for four Plasmodium species; B2) Sequences were mapped to malaria locus names by BLAST alignment. The pipeline resulted in 6,428 functional associations between 3,262 malaria proteins and 1,278 PubMed papers.
Figure 3Probability density distributions of gene expression correlation coefficients.
Gene pairs that either share the same ontology term (red) or are co-cited in a paper (green) are more likely to be co-expressed than randomly paired genes (blue).
Figure 4Late schizont upregulation of P. knowlesi genes that are orthologs of uncharacterized genes in P. falciparum predicted to be involved in invasion.
Figure 5Two-hybrid interaction networks among genes that are co-expressed.
A. Invasion; B. Cellular carbohydrate catabolism; C. Cytosolic ribosome; D. RNA methyltransferase activity.