| Literature DB >> 32542363 |
Ankita Singh1,2, Rahul Kaushik1,3, Dheeraj Kumar Chaurasia1, Manpreet Singh1, B Jayaram1,4,5.
Abstract
Despite Plasmodium vivax being the main offender in the majority of malarial infections, very little information is available about its adaptation and development in humans. Its capability for activating relapsing infections through its dormant liver stage and resistance to antimalarial drugs makes it as one of the major challenges in eradicating malaria. Noting the immediate necessity for the availability of a comprehensive and reliable structural and functional repository for P. vivax proteome, here we developed a web resource for the new reference genome, PvP01, furnishing information on sequence, structure, functions, active sites and metabolic pathways compiled and predicted using some of the state-of-the-art methods in respective fields. The PvP01 web resource comprises organized data on the soluble proteome consisting of 3664 proteins in blood and liver stages of malarial cycle. The current public resources represent only 163 proteins of soluble proteome of PvP01, with complete information about their molecular function, biological process and cellular components. Also, only 46 proteins of P. vivax have experimentally determined structures. In this milieu of extreme scarcity of structural and functional information, PvP01 web resource offers meticulously validated structures of 3664 soluble proteins. The sequence and structure-based functional characterization led to a quantum leap from 163 proteins available presently to whole soluble proteome offered through PvP01 web resource. We believe PvP01 web resource will serve the researchers in identifying novel protein drug targets and in accelerating the development of structure-based new drug candidates to combat malaria. Database Availability: http://www.scfbio-iitd.res.in/PvP01.Entities:
Keywords: zzm321990 Plasmodium vivaxzzm321990 ; Computational Protein Databank; Functional Characterization; Protein Structure Prediction; Malarial Infection; Malarial Protein Targets
Mesh:
Substances:
Year: 2020 PMID: 32542363 PMCID: PMC7296392 DOI: 10.1093/database/baaa036
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Summary of various tools implemented in the development of the PvP01 web resource.
|
|
|
|
|
|---|---|---|---|
| SD Index | 1° Structure information |
| ~0.1 hours |
| PSIPRED | 2° Structure prediction |
| ~0.5 hours |
| PSSPred | 2° Structure predication |
| ~1.5 hours |
| SPIDER3 | 2° Structure prediction |
| ~2.0 hours |
| BhageerathH+ | 3° Structure prediction |
| ~24 hours |
| I-TASSER | 3° Structure prediction |
| ~48 hours |
| RaptorX | 3° Structure prediction |
| ~5 hours |
| GalaxyRefine | 3° Structure refinement |
| ~5 hours |
| ProTSAV | 3° Structure assessment |
| ~0.1 hours |
| ADDS | Ligand binding site |
| ~0.1 hours |
| FPocket | Ligand binding site |
| ~0.1 hours |
| LIGSITECSC | Ligand binding site |
| ~0.1 hours |
| InterPro | Function annotation |
| ~0.3 hours |
| SIFTER | Function annotation |
| ~0.3 hours |
| LocTree3 | Function annotation |
| ~0.1 hours |
| ProBis | Function annotation |
| ~1.5 hours |
| KEGG | Metabolic pathways |
| ~1.5 hours |
| Total compute time required for an individual protein (~250 AA) on octa-core | ~ 90 hours | ||
The standalone versions (wherever available) of the listed tools are used and the compute efficiency provided here approximates the time required by the tool on an octa-core CPU machine for an individual protein of sequence length about 250 amino acid residues
Figure 1An overall workflow of PvP01 web resource development at different stages.
Summary of potential search keywords for browsing the PvP01 web resource efficiently
| Search keyword | Example | Search keyword | Example |
|---|---|---|---|
| UniProt identifier | Q968V9 | Protein name | Fructose-bisphosphate aldolase (EC 4.1.2.13) |
| Pfam identifier | PF00274 | Gene name | PvP01_1262200 |
| RefSeq identifier | YP_009325966 | Protein sequence | MATGSE…. KKYVY |
| InterPro identifier | IPR029768 | Chromosome number | 12 |
| PvP01 identifier | PvP01_1262200 | Gene ontology term | GO:0003824 |
Figure 2A depiction of different pieces of information provided in PvP01-DB about sequence, structure, ligand binding site, function annotations and metabolic pathways.
Figure 3Sequence- and structure-based quantification of soluble proteome of PvP01 strain of P. vivax. (a) SDI-based characterization into Modelable, Difficult and Very Difficult regions of structural modelability. (b) Structure quality assessment of soluble proteome of PvP01 into highly reliable and hardly reliable regions of structural quality.
Figure 4Functional characterization of soluble proteome of PvP01 strain of P. vivax. (a) Protein function information of the soluble proteome of PvP01 currently available in public resources. (b) Protein function information of soluble proteome of PvP01 furnished through the PvP01 web resource.