| Literature DB >> 29688373 |
Ankita Singh1,2, Rahul Kaushik2,3, Himani Kuntal1, B Jayaram2,3,4.
Abstract
Database URL: http://www.scfbio-iitd.res.in/PvaxDB.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29688373 PMCID: PMC5852996 DOI: 10.1093/database/bay021
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
A brief summary of the various features compiled/predicted at protein sequence level for Plasmodium vivax proteome
| Tool/Method | Brief description | Availability | CPU compute time per protein |
|---|---|---|---|
| UniProtKB | UniprotKB is a composite repository protein sequences and functional information. | NA | |
| PSIPRED | PSIPRED is an artificial neural network machine learning-based secondary structure prediction method. | 15–20 min | |
| SD index | Structural difficulty index predicts the modelability of a protein sequence into its tertiary structure. | 4–5 min | |
| Other parameters | Other sequence features are computed via in-house programs based on parameters adopted from the literature. | NA |
A brief summary of the methods/tools used for features compilation/prediction at protein structure level along with their availability
| Method | Brief description | Availability | CPU compute time per protein |
|---|---|---|---|
| I-TASSER | Fragment assembly and simulated annealing-based multiple threading approach for protein structure prediction. | ∼48–72 h | |
| RaptorX | A remote homolog detection and threading-based proteins tertiary structure prediction method. | ∼5–6 h | |
| BhageerathH+ | An | 8–12 h | |
| ProTSAV | Implements metaserver approach for extensive protein structure quality assessment. | 2–3 min |
A brief summary of ligand binding site identification/prediction tools used in PvaxDB for ligand binding site prediction of all the 6600 modeled structures
| Method/Tool | Brief description | Availability | CPU compute time per protein |
|---|---|---|---|
| LigSitecsc | A Connolly surface and the degree of conservation-based pocket identification tool on protein surface. | 2–3 min | |
| FPocket | A Voronoi partition and alpha sphere theory-based protein pocket (cavity) detection method. | 2–3 min | |
| AADS | An automated active site detection and scoring methodology for identifying potential ligand binding site in proteins structures. | 2–3 min |
A brief description of function prediction software used for functional characterization
| Method/Tool | Brief description | Availability | ∼Compute time per-protein |
|---|---|---|---|
| SIFTER | A statistical approach using phylogenetic analysis for representing protein relationships and functional characterization. | 8–10 min | |
| InterPro | Protein sequence analysis and classification via prediction models or signatures assembled from different databases. | 5–6 min | |
| LocTree3 | A machine learning-based hierarchical system for experimental localizations and function prediction of proteins. | 3–4 min | |
| ProBiS | A protein structure surface conservation-based similar protein binding sites detection tool. | 45–60 min |
A summary of search keywords which can be used for a systematic and specific browsing of PvaxDB
| Search keyword | Brief description | Example |
|---|---|---|
| Protein sequence | User can provide amino acid sequence in single letter code of desired protein. The output will result in a Pvax identifier corresponding to input sequence. | MDGGEDEGATEESIPVVILD….ANVL |
| Chromosome number | The genome of | Any chromosome number from 1 to 14 |
| UniProt identifier | It is a unique identifier (six or ten letter alphanumeric string) assigned to each protein deposited in UniProt. | A5KAJ7 |
| Protein name | User can explore the database with the help of protein names. | 40S ribosome protein |
| Gene ontology (GO) | The GO terms are unique accession numbers of gene and gene product attributes across all species and can be used as search keywords in PvaxDB. | GO: 0000016 |
| Protein family or Pfam identifier | Name of protein families or their Pfam identifiers can be used for exploring the databank. A Pfam identifier is a seven letter alphanumeric string. | Actin family PF00022 |
| InterPro identifier | The InterPro database assigns each entry with a unique accession number in the form of an alphanumeric string starting with a prefix ‘IPR’ | IPR000001 |
| Pubmed identifier (PMID) | PMID is a unique identifier number assigned to each article record when it enters the PubMed system | 18843361 |
| Pvax identifier | Pvax identifier is a 10 letter unique alphanumeric string assigned to all proteins of | PVX_000750 |
Figure 1.A depiction of different information provided in PvaxDB about sequence, structure, ligand binding site and function annotations.
Figure 2.Pre-PvaxDB structural difficulty and post-PvaxDB quality assessment analysis. (a) Structural modelability based categorization of blood and liver stage soluble proteins of P. vivax proteome. (b) Categorization of predicted model structures of P. vivax based on an extensive quality assessment performed via ProTSAV.
Figure 3.A depiction of functional characterization of P. vivax proteome. The left panel shows functional annotation of P. vivax proteome presently available through various public resources. The right panel shows functional annotations carried out in PvaxDB based on sequence, structural and ligand binding site information-based functional annotation.