| Literature DB >> 19761570 |
Jonathan L Lustgarten1, Shyam Visweswaran, Robert P Bowser, William R Hogan, Vanathi Gopalakrishnan.
Abstract
BACKGROUND: The incorporation of biological knowledge can enhance the analysis of biomedical data. We present a novel method that uses a proteomic knowledge base to enhance the performance of a rule-learning algorithm in identifying putative biomarkers of disease from high-dimensional proteomic mass spectral data. In particular, we use the Empirical Proteomics Ontology Knowledge Base (EPO-KB) that contains previously identified and validated proteomic biomarkers to select m/zs in a proteomic dataset prior to analysis to increase performance.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19761570 PMCID: PMC2745687 DOI: 10.1186/1471-2105-10-S9-S16
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The methodology for the knowledge-based variable selection showing the three dataset generated. The three datasets selected by None, DS variable selection, BS variable selection were all subjected to the same rule learning process.
Average balanced accuracy (BACC) of the three variable selection methods
| Variable selection method | BACC (± std. dev) | P-Value |
| None | 66.40% (± 19.66) | - |
| DSVS | 71.84% (± 17.49) | 0.068 |
| BSVS | 78.24% (± 18.48) |
None indicates no variable selection, DSVS is disease-specific variable selection and BSVS is biofluid-specific variable selection.
Statistical comparison of the three variable selection methods using the Wilcoxon Signed Rank test and the t-test on BACC and RCI
| Methods | BACC | RCI | ||
| Wilcox | t-test | Wilcox | t-text | |
| None vs. DSVS | 0.074 + | 0.065 + | ||
| None vs. BSVS | ||||
| DSVS vs. BSVS | ||||
p-values below the 0.05 significance level are in bold font. A + favors the second method in a pair.
Average RCI performance of the three variable selection methods
| Variable selection method | RCI | P-Value |
| None | 12.72 | - |
| DSVS | 15.88 | 0.077 |
| BSVS | 20.15 |
Figure 2Venn diagram showing the overlap of the variables selected by RL for the three different variable selection methods.
m/zs that were present in the rules for None and DSVS respectively that were not present in the rules for BSVS
| Variable selection method | Percent Different | |
| None | 5.58, 9.56, 10.53, 18.69 | 50% |
| DSVS | None | 0% |