Sandro Vivona, Filippo Bernante, Francesco Filippini.
Abstract
BACKGROUND: Since a milestone work on
Year: 2006 PMID: 16848907 PMCID: PMC1570458 DOI: 10.1186/1472-6750-6-35
Source DB: PubMed Journal: BMC Biotechnol ISSN: 1472-6750 Impact factor: 2.563
Figure 1. NERVE software pipeline. The process can be divided into two parts: data production and storage (top) and data selection (bottom). Six different scripts screen the entire proteome to mine and infer information that flows into a MySQL table. A seventh script applies four filters (LOC, localization; TOP, topology; PAD, probability of being an adhesin; SHP, similarity to human proteins) to the values created by steps 1 through 5 to select and rank vaccine candidates (VCs), which are then presented in an HTML table with links to relevant data.
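The selection stage described in this caption can be illustrated with a minimal Python sketch. It is not the NERVE code: the `Protein` record, the cutoff values, the assumption that a candidate must pass all four filters, and the ranking by adhesin probability are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Protein:
    name: str
    localization: str        # LOC: predicted subcellular localization
    tm_helices: int          # TOP: number of predicted transmembrane helices
    adhesin_prob: float      # PAD: probability of being an adhesin
    human_similarity: float  # SHP: similarity score against human proteins


# Hypothetical cutoffs, chosen only for this example (not NERVE's settings).
ALLOWED_LOC = {"extracellular", "outer membrane", "cell wall"}
MAX_TM_HELICES = 2
MIN_ADHESIN_PROB = 0.5
MAX_HUMAN_SIMILARITY = 0.3


def select_candidates(proteins):
    """Keep proteins that pass all four filters, ranked by adhesin probability."""
    selected = [
        p for p in proteins
        if p.localization in ALLOWED_LOC               # LOC filter
        and p.tm_helices <= MAX_TM_HELICES             # TOP filter
        and p.adhesin_prob >= MIN_ADHESIN_PROB         # PAD filter
        and p.human_similarity <= MAX_HUMAN_SIMILARITY  # SHP filter
    ]
    # Assumed ranking criterion: highest adhesin probability first.
    return sorted(selected, key=lambda p: p.adhesin_prob, reverse=True)
```

Calling `select_candidates` on a list of `Protein` records returns only those that pass every filter, best-ranked first. Keeping the cutoffs as module-level constants mirrors the idea of tunable settings.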
Figure 2. Flowchart of the NERVE working process. Amino acid sequences from the whole bacterial proteome undergo six analytical steps: prediction of subcellular localization (1), calculation of the probability of being an adhesin (2), identification of TM domains (3), comparison to the proteome of Homo sapiens (4) and to that of a pathogen selected by the user (5), and assignment of a putative function (6). Each of these steps stores its mined data in an SQL database. After filtering and ranking, the best VCs are presented in a user-friendly HTML table (see Figure 1 and Results and Discussion for details).
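The idea of each analytical step writing its result into a shared SQL table can be sketched as follows. This is only an illustration: it uses Python's built-in `sqlite3` module as a stand-in for MySQL, and the table name, column names, and the `store_step` helper are hypothetical and not taken from NERVE.

```python
import sqlite3

# One hypothetical column per analytical step (1 through 6).
STEP_COLUMNS = (
    "localization",       # step 1
    "adhesin_prob",       # step 2
    "tm_domains",         # step 3
    "human_hit",          # step 4
    "pathogen_hit",       # step 5
    "putative_function",  # step 6
)


def create_store():
    """Create an in-memory database with one row per protein."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """CREATE TABLE proteins (
               id TEXT PRIMARY KEY,
               localization TEXT,
               adhesin_prob REAL,
               tm_domains INTEGER,
               human_hit INTEGER,
               pathogen_hit INTEGER,
               putative_function TEXT
           )"""
    )
    return conn


def store_step(conn, protein_id, column, value):
    """Record the output of one analytical step for one protein."""
    if column not in STEP_COLUMNS:
        raise ValueError(f"unknown step column: {column}")
    # Create the protein's row on first use, then fill in the step's column.
    conn.execute("INSERT OR IGNORE INTO proteins (id) VALUES (?)", (protein_id,))
    # The column name is checked against STEP_COLUMNS above, so it is safe
    # to interpolate; the value and id are passed as bound parameters.
    conn.execute(f"UPDATE proteins SET {column} = ? WHERE id = ?", (value, protein_id))
    conn.commit()
```

Because every step writes to its own column, the steps can run independently and in any order, and a later selection script only needs to read complete rows.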
Figure 3. Data concerning the ten proteomes used for tuning NERVE. The number of selected VCs is reported beside the overall number of sequences. The average size of the selected VC pools is 8.17% of the proteome (min 5.09%, max 10.73%).
Tuning NERVE settings on a known dataset including 10 proteomes.
| Strain | References |
|---|---|
| AMES ANCESTOR | [7,22] |
| ATCC 15692 | [23–25] |
| CO-92 | [26,27] |
| 2603 V/R | [8] |
| NEM316 | [8] |
| A909 | [8] |
| MC58 | [12] |
| W83 | [29–32] |
| B31 | [33–35] |
| UW-3/Cx | [36] |
Test of NERVE settings on another dataset including 6 proteomes.
| Strain | References |
|---|---|
| MW2 | [37–39] |
| ATCC 700825 | [40,41] |
| SF370 | [42] |
| NCTC 11168 | [43,44] |
| J99 | [45] |
| CWL029 | [46,46] |
Figure 4. Data concerning the six proteomes used to test NERVE settings. The number of selected VCs is reported beside the overall number of sequences. The average size of the selected VC pools is 9.32% of the proteome (min 8.17%, max 11.33%).