| Literature DB >> 21984903 |
Daniel Restrepo-Montoya1, David Becerra, Juan G Carvajal-Patiño, Alvaro Mongui, Luis F Niño, Manuel E Patarroyo, Manuel A Patarroyo.
Abstract
BACKGROUND: This study describes a bioinformatics approach designed to identify Plasmodium vivax proteins potentially involved in reticulocyte invasion. Specifically, different protein training sets were built and tuned based on different biological parameters, such as experimental evidence of secretion and/or involvement in invasion-related processes. A profile-based sequence method supported by hidden Markov models (HMMs) was then used to build classifiers to search for biologically-related proteins. The transcriptional profile of the P. vivax intra-erythrocyte developmental cycle was then screened using these classifiers.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21984903 PMCID: PMC3184965 DOI: 10.1371/journal.pone.0025189
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Methodology for identifying Plasmodium vivax proteins having a potential role in invasion using sequence redundancy reduction and profile hidden Markov models.
Plasmodium protein data sets used for profile HMMs construction.
| Protein existence | Organism | |||||||||
| # of proteins | Type | Query | Prot/Trans | Predicted | Putative | Homology |
|
|
|
|
| 129 | Neg | Database | 34 | 1 | 0 | 94 | 75 | 10 | 12 | 12 |
| 133 | Pos | Database | 21 | 81 | 22 | 9 | 79 | 28 | 15 | 11 |
| 118 | Pos | Lit/DB | 20 | 69 | 22 | 7 | 66 | 28 | 14 | 10 |
| 110 | Pos | Database | 21 | 81 | 0 | 1 | 71 | 21 | 8 | 10 |
| 88 | Pos | Lit/DB | 20 | 69 | 0 | 0 | 54 | 20 | 6 | 8 |
| 68 | Pos | Lit/DB | 0 | 68 | 0 | 0 | 48 | 11 | 4 | 5 |
| 67 | Pos | Literature | 6 | 48 | 8 | 5 | 67 | 0 | 0 | 0 |
| 42 | Pos | Lit/DB | 20 | 0 | 22 | 0 | 12 | 16 | 10 | 4 |
| 20 | Pos | Lit/DB | 20 | 0 | 0 | 0 | 6 | 9 | 2 | 3 |
*The remaining protein sequences correspond to P. chabaudi (10), P. knowlesi (8), P. gallinaceum (1) and P. malariae (1).
**Level of evidence that supports the existence of the protein concerned. “Prot/Trans” indicates that there is clear experimental evidence for the existence of the protein or that expression data indicate the existence of a transcript. “Homology” indicates that the existence of a protein is probable because orthologs exist in closely-related species. “Predicted” is used for entries without evidence at protein, transcript, or homology levels. ‘Putative’ indicates putative proteins having predicted existence.
Computational protein identification results.
| Protein profile datasets | ||||||||||||||||||||||||||||||||||||
| 20 | 42 | 67 | 68 | 88 | 110 | 118 | 133 | 129 | ||||||||||||||||||||||||||||
| cd-hit | pisces | cd-hit | pisces | cd-hit | pisces | cd-hit | pisces | cd-hit | pisces | cd-hit | pisces | cd-hit | pisces | cd-hit | pisces | cd-hit | pisces | |||||||||||||||||||
| ID proteins | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b | a | b |
| PVX_090110 | x | x | x | x | ||||||||||||||||||||||||||||||||
| PVX_117880 | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | |||||||||
| PVX_003905 | x | x | x | x | x | x | x | |||||||||||||||||||||||||||||
| PVX_092975 | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | ||||||||||||||||||
| PVX_121920 | x | x | x | x | x | x | x | x | x | x | x | x | x | x | ||||||||||||||||||||||
| PVX_003800 | x | x | x | x | x | x | x | x | x | x | x | x | x | |||||||||||||||||||||||
| PVX_003825 | x | x | x | x | x | x | x | x | x | x | x | |||||||||||||||||||||||||
| PVX_121885 | x | x | x | x | x | x | x | x | ||||||||||||||||||||||||||||
| PVX_003805 | x | x | x | x | x | x | x | x | x | x | x | x | ||||||||||||||||||||||||
| PVX_003850 | x | x | x | x | x | x | x | x | x | x | x | x | x | |||||||||||||||||||||||
| PVX_002510 | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | |||||||||||||||||||
| PVX_099980 | x | x | x | x | x | x | x | x | x | |||||||||||||||||||||||||||
| PVX_097670 | x | x | x | x | x | x | x | x | ||||||||||||||||||||||||||||
| PVX_097720 | x | x | x | x | x | x | x | x | x | x | x | x | ||||||||||||||||||||||||
| PVX_097680 | x | x | ||||||||||||||||||||||||||||||||||
| PVX_098585 | x | x | x | x | x | |||||||||||||||||||||||||||||||
| PVX_097715 | x | x | x | x | x | x | x | x | x | x | x | x | x | x | ||||||||||||||||||||||
| PVX_090210 | x | x | ||||||||||||||||||||||||||||||||||
| PVX_096990 | x | x | x | x | x | x | x | |||||||||||||||||||||||||||||
| PVX_097705 | x | x | x | x | x | x | x | x | x | x | ||||||||||||||||||||||||||
| PVX_084720 | x | x | x | |||||||||||||||||||||||||||||||||
| PVX_117230 | x | x | ||||||||||||||||||||||||||||||||||
| PVX_123105 | x | x | ||||||||||||||||||||||||||||||||||
| PVX_097700 | x | x | x | |||||||||||||||||||||||||||||||||
| PVX_097695 | x | x | x | x | x | x | x | |||||||||||||||||||||||||||||
| PVX_097710 | x | |||||||||||||||||||||||||||||||||||
| PVX_097675 | x | x | x | x | x | |||||||||||||||||||||||||||||||
| PVX_101555 | x | x | ||||||||||||||||||||||||||||||||||
| PVX_003830 | x | x | x | x | x | x | x | x | x | x | x | |||||||||||||||||||||||||
| PVX_085930 | x | x | x | x | x | x | x | x | x | |||||||||||||||||||||||||||
| PVX_112665 | x | |||||||||||||||||||||||||||||||||||
| PVX_101605 | x | x | x | x | ||||||||||||||||||||||||||||||||
| PVX_092995 | x | |||||||||||||||||||||||||||||||||||
| PVX_123550 | x | |||||||||||||||||||||||||||||||||||
| PVX_118525 | x | |||||||||||||||||||||||||||||||||||
| PVX_081810 | x | |||||||||||||||||||||||||||||||||||
| PVX_101505 | x | |||||||||||||||||||||||||||||||||||
| PVX_109280 | x | |||||||||||||||||||||||||||||||||||
| PVX_094425 | x | |||||||||||||||||||||||||||||||||||
| PVX_092425 | x | x | ||||||||||||||||||||||||||||||||||
| PVX_086850 | x | |||||||||||||||||||||||||||||||||||
| PVX_081845 | x | |||||||||||||||||||||||||||||||||||
| PVX_082695 | x | x | ||||||||||||||||||||||||||||||||||
| PVX_003815 | x | x | x | x | ||||||||||||||||||||||||||||||||
| PVX_097590 | x | x | x | |||||||||||||||||||||||||||||||||
| PVX_080305 | x | |||||||||||||||||||||||||||||||||||
The characters “a” and “b” represent 30% and 90% identity thresholds, respectively.
Computational protein selection results.
| Protein ID | E. Ref | Category | NAME | ESLPred2 | BaCelLo | SignalP | TM Pphob | SP Pphob | SecretomeP | TMHMM | GPI | DOM | MAAPs |
| PVX_123105 | - | Selected | hypothetical protein, conserved | Ext | Sec | ||||||||
| PVX_101605 | - | Selected | hypothetical protein | Ext | Sec | 2 | X | 2 | PFP | ||||
| PVX_123550 | - | Selected | hypothetical protein, conserved | Ext | Sec | X | |||||||
| PVX_118525 | - | Selected | hypothetical protein, conserved | Ext | Sec | 1–18 | 1 | 1–20 | 1 | ||||
| PVX_081810 | - | Selected | hypothetical protein, conserved | Ext | Cyt | ||||||||
| PVX_094425 | - | Selected | hypothetical protein, conserved | Ext | Mit | 1–27 | 11 | 1–27 | 9 | ||||
| PVX_092425 | - | Selected | hypothetical protein, conserved | Ext | Sec | 1 | 1–23 | HP | |||||
| PVX_081845 | - | Selected | hypothetical protein | Ext | Sec | 3 | X | 2 | |||||
| PVX_080305 | - | Selected | hypothetical protein, conserved | Ext | Sec | 1–22 | 1–22 | ||||||
| PVX_101555 | - | Selected | hypothetical protein | Ext | Sec | 1–27 | 3 | 1–27 | 3 | ||||
| PVX_002510 | - | Selected | Nucleosomal binding protein 1, putative | Cyt | Sec | 1 | 1 | X | |||||
| PVX_096990 | - | Selected | Pv-fam-d protein | Ext | Sec | 1–23 | 1 | 1–23 | 2 | ||||
| PVX_117230 | - | Selected | Ser/Thr protein phosphatase family protein | Ext | Sec | 2 | 1–21 | ||||||
| PVX_101505 | - | Selected | Pv-fam-d protein | Ext | Nuc | 2 | 2 | ||||||
| PVX_092975 | - | Selected | erythrocyte binding protein 1 | Nuc | Sec | 1–18 | 1 | 1–18 | 1 | X | X | ||
| PVX_003825 | - | Selected | serine-repeat antigen 4 (SERA) | Ext | Sec | 1–26 | 1–24 | X | |||||
| PVX_003805 | - | Selected | serine-repeat antigen (SERA), putative | Ext | Sec | 1–22 | 1–19 | X | |||||
| PVX_003850 | - | Selected | serine-repeat antigen 2 (SERA) | Ext | Sec | 1–21 | X | ||||||
| PVX_097715 | - | Selected | merozoite surface protein 3 (MSP3), putative | Ext | Sec | 1–18 | 1–18 | PFP | |||||
| PVX_097705 | - | Selected | merozoite surface protein 3 alpha (MSP3a), putative | Ext | Sec | 1–20 | 1–21 | ||||||
| PVX_097675 | - | Selected | merozoite surface protein 3 gamma (MSP3g), putative | Ext | Sec | 1–20 | 1–20 | X | |||||
| PVX_003830 | - | Selected | serine-repeat antigen 5 (SERA) | Ext | Sec | 1–24 | 1–22 | X | |||||
| PVX_086850 | - | Selected | variable surface protein Vir35, putative | Ext | Sec | 3 | 3 | ||||||
| PVX_003815 | - | Selected | serine-repeat antigen (SERA), truncated, putative | Ext | Cyt | X | |||||||
| PVX_003905 | - | Selected | transmission-blocking target antigen Pvs230, putative | Ext | Sec | 1–29 | 1–22 | X | |||||
| PVX_121885 | 5 | Selected-PE | cytoadherence linked asexual protein, CLAG, putative | Ext | Sec | 1–24 | 3 | 1–24 | 1 | X | |||
| PVX_084720 | 5 | Selected-PE | hypothetical protein, conserved | Ext | Sec | 1–22 | 1–22 | X | |||||
| PVX_092995 | 5 | Selected-PE | tryptophan-rich antigen (Pv-fam-a) | Ext | Sec | 1–23 | 1–23 | ||||||
| PVX_112665 | 5 | Selected-PE | tryptophan-rich antigen (Pv-fam-a) | Ext | Sec | 1–28 | 1–28 | ||||||
| PVX_085930 | 5,32 | Selected-PE | rhoptry-associated protein 1, putative | Ext | Sec | 1–22 | 1–22 | X | |||||
| PVX_082695 | 31 | Selected-PE | merozoite surface protein 7 (MSP7), putative | Ext | Sec | 1–20 | 1–22 | ||||||
| PVX_117880 | 30 | Selected-PE | hypothetical protein, conserved | Ext | Sec | 1–17 | 3 | 1–17 | 1 | ||||
| PVX_097670 | 22 | VC-AA | merozoite surface protein 3 gamma (MSP3g), putative | Ext | Sec | 1–20 | 1–20 | X | |||||
| PVX_121920 | 28 | VC-FA | reticulocyte-binding protein 2 (RBP2), like | Ext | Sec | 1–21 | 1 | 1–21 | |||||
| PVX_097590 | 26 | VC-PIA | rhoptry-associated protein 2, putative | Ext | Sec | 1–21 | 1–21 | ||||||
| PVX_109280 | 24 | VC-AA | tryptophan-rich antigen (Pv-fam-a) | Ext | Sec | 1–25 | 1–22 | ||||||
| PVX_090210 | 10 | VC-AA | hypothetical protein | Ext | Sec | 1–20 | 1–20 | PFP | |||||
| PVX_099980 | 5,10,27 | VC-PIA | major blood-stage surface antigen Pv200 | Ext | Sec | 1–19 | 1 | 1–19 | 1 | HP | X | X | |
| PVX_097680 | 10 | VC-AA | merozoite surface protein 3 beta (MSP3b) | Ext | Sec | 1–19 | 1–19 | X | |||||
| PVX_097700 | 10 | VC-AA | merozoite surface protein 3 (MSP3), putative | Ext | Cyt | X | |||||||
| PVX_097695 | 10 | VC-AA | merozoite surface protein 3 alpha (MSP3a), putative | Ext | Sec | 1–20 | 1–21 | ||||||
| PVX_097710 | 10 | VC-AA | merozoite surface protein 3 (MSP3), putative | Ext | Cyt | X | |||||||
| PVX_003800 | 25 | VC-AA | serine-repeat antigen (SERA) | Ext | Sec | 1–25 | 1–20 | X | |||||
| PVX_097720 | 23 | VC-AA | merozoite surface protein 3 alpha (MSP3a) | Ext | Mit | 1–23 | 1–23 | X | |||||
| PVX_098585 | 28,29 | VC-FA | reticulocyte-binding protein 1 (RBP1), putative | Ext | Cyt | 1–22 | 1 | 1–21 | |||||
| PVX_090110 | 5 | Negative-PE | hypothetical protein, conserved | Ext | Cyt | 3 |
Abbreviations: E. Ref. refers to the published reference where the relevant experimental data have been published for each protein. VC: Protein classified as Vaccine Candidate according to previous studies. PE: Experimental evidence of protein expression. PIA: Previous protection-induction assays. AA: Previous antigenicity assays. FA: Previous functional assays. ESLPred2 (Ext = extracellular, Cys = cytoplasm, Nuc = nuclear), BaCelLo (Sec = secretory pathway, Cyt = cytoplasm, Mit = mitochondrion, Nuc = nucleus), GPI (HP = highly probable, PFP = potential false positive), DOM = domains.