| Literature DB >> 15608169 |
Sébastien Rey1, Michael Acab, Jennifer L Gardy, Matthew R Laird, Katalin deFays, Christophe Lambert, Fiona S L Brinkman.
Abstract
Information about bacterial subcellular localization (SCL) is important for protein function prediction and identification of suitable drug/vaccine/diagnostic targets. PSORTdb (http://db.psort.org/) is a web-accessible database of SCL for bacteria that contains both information determined through laboratory experimentation and computational predictions. The dataset of experimentally verified information (approximately 2000 proteins) was manually curated by us and represents the largest dataset of its kind. Earlier versions have been used for training SCL predictors, and its incorporation now into this new PSORTdb resource, with its associated additional annotation information and dataset version control, should aid researchers in future development of improved SCL predictors. The second component of this database contains computational analyses of proteins deduced from the most recent NCBI dataset of completely sequenced genomes. Analyses are currently calculated using PSORTb, the most precise automated SCL predictor for bacterial proteins. Both datasets can be accessed through the web using a very flexible text search engine, a data browser, or using BLAST, and the entire database or search results may be downloaded in various formats. Features such as GO ontologies and multiple accession numbers are incorporated to facilitate integration with other bioinformatics resources. PSORTdb is freely available under GNU General Public License.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15608169 PMCID: PMC539981 DOI: 10.1093/nar/gki027
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Total number of proteins for each single and multiple SCL site in the ePSORTdb dataset
| Gram-negative | Gram-positive | |
|---|---|---|
| Cytoplasmic (C) | 278 | 194 |
| Cytoplasmic membrane (CM) | 309 | 103 |
| Periplasmic (P) | 276 | N/A |
| Outer membrane (OM) | 391 | N/A |
| Cell wall (CW) | N/A | 61 |
| Extracellular (EC) | 190 | 181 |
| C/CM | 16 | 15 |
| CM/P | 51 | N/A |
| P/OM | 2 | N/A |
| OM/EC | 78 | N/A |
| CM/CW | N/A | 20 |
| Total | 1591 | 574 |
Total number of proteins for each PSORTb (version 2.0) computationally predicted SCL site in the cPSORTdb dataset
| SCL site | Gram-negative | Gram-positive |
|---|---|---|
| Cytoplasmic | 94 592 | 68 948 |
| Cytoplasmic membrane | 47 284 | 26 586 |
| Periplasmic | 5 536 | N/A |
| Outer membrane | 6 097 | N/A |
| Cell wall | N/A | 1 132 |
| Extracellular | 1 203 | 4 228 |
| Potential multiple SCLs | 5 776 | 1 064 |
| Total | 160 488 | 101 958 |
Fields in ePSORTdb, the database of proteins of experimentally verified SCL
| Commona | Specific |
|---|---|
| GI numberb | Experimental subcellular localization (terse)b |
| Swiss-Prot accession number | Experimental subcellular localization (verbose)b |
| Protein nameb | GO accession IDc |
| Alternate protein name | GO accession definitionc |
| Gene nameb | PubMed ID reference |
| Alternate gene name | Reference title |
| Taxonomy ID (from NCBI) | ISBN number reference |
| Organismb | WWW reference |
| Phylum | Reference comments |
| Class | Reference summaryb,d |
| Gram stain | |
| Amino acid sequence | |
| Sequence length | |
| cPSORTdb GI link |
aCommon fields are those that are shared with cPSORTdb.
bField displayed by default in the result page.
cGO references of the experimental SCL.
dIncludes all fields relevant to references (i.e. PubMed ID, title, ISBN number, WWW and comments).
Fields in cPSORTdb, the database of proteins of computationally predicted SCL
| Commona | PSORTb module predictions details |
|---|---|
| Chromosome accession ID (from NCBI)b | SCL-BLAST predicted localization |
| GI numberb | SCL-BLAST details |
| RefSeq accession IDb | Motif predicted localization |
| Protein nameb | Motif details |
| Gene nameb | OMPMotif predicted localization |
| Alternate gene nameb | OMPMotif details |
| Taxonomy ID (from NCBI) | HMMTOP predicted localization |
| Organismb | HMMTOP details |
| Phylum | SignalP predicted localization |
| Class | SignalP details |
| Gram stain | Profile predicted localization |
| Amino acid sequence | Profile details |
| Sequence length | SCL-BLASTe predicted localization |
| SCL-BLASTe details | |
| CytoSVM predicted localization | |
| PSORTb prediction version | CMSVM predicted localization |
| Predicted localizationb | PPSVM predicted localization |
| GO accession IDc | CWSVM predicted localization |
| GO accession definitionc | OMSVM predicted localization |
| ECSVM predicted localization | |
| SublocC predicted localization | |
| Cytoplasmic score | |
| Cytoplasmic membrane score | |
| Periplasmic score | |
| Cell wall score | |
| Outer membrane score | |
| Extracellular score |
aCommon fields are those that are shared with ePSORTdb.
bField displayed by default in the result page.
cGO references of the predicted localization.