Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data.

Baolin Wu, Tom Abbott, David Fishman, Walter McMurray, Gil Mor, Kathryn Stone, David Ward, Kenneth Williams, Hongyu Zhao.

Abstract

MOTIVATION: Novel methods, both molecular and statistical, are urgently needed to take advantage of recent advances in biotechnology and the human genome project for disease diagnosis and prognosis. Mass spectrometry (MS) holds great promise for biomarker identification and genome-wide protein profiling. It has been demonstrated in the literature that biomarkers can be identified to distinguish normal individuals from cancer patients using MS data. Such progress is especially exciting for the detection of early-stage ovarian cancer patients. Although various statistical methods have been utilized to identify biomarkers from MS data, there has been no systematic comparison among these approaches in their relative ability to analyze MS data.
RESULTS: We compare the performance of several classes of statistical methods for the classification of cancer based on MS spectra. These methods include: linear discriminant analysis, quadratic discriminant analysis, k-nearest neighbor classifier, bagging and boosting classification trees, support vector machine, and random forest (RF). The methods are applied to ovarian cancer and control serum samples from the National Ovarian Cancer Early Detection Program clinic at Northwestern University Hospital. We found that RF outperforms other methods in the analysis of MS data.

Year: 2003 PMID: 12967959 DOI: 10.1093/bioinformatics/btg210

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937

Cited

85 in total

1. An automatic method for arterial pulse waveform recognition using KNN and SVM classifiers.

Authors: Tânia Pereira; Joana S Paiva; Carlos Correia; João Cardoso
Journal: Med Biol Eng Comput Date: 2015-09-24 Impact factor: 2.602

2. A high productivity/low maintenance approach to high-performance computation for biomedicine: four case studies.

Authors: Nicholas Carriero; Michael V Osier; Kei-Hoi Cheung; Perry L Miller; Mark Gerstein; Hongyu Zhao; Baolin Wu; Scott Rifkin; Joseph Chang; Heping Zhang; Kevin White; Kenneth Williams; Martin Schultz
Journal: J Am Med Inform Assoc Date: 2004-10-18 Impact factor: 4.497

3. Predicting interpretability of metabolome models based on behavior, putative identity, and biological relevance of explanatory signals.

Authors: David P Enot; Manfred Beckmann; David Overy; John Draper
Journal: Proc Natl Acad Sci U S A Date: 2006-09-21 Impact factor: 11.205

4. Classification algorithms for phenotype prediction in genomics and proteomics. (Review)

5. Processing MALDI Mass Spectra to Improve Mass Spectral Direct Tissue Analysis.

6. Is bagging effective in the classification of small-sample genomic and proteomic data?

7. Classification of bioaccumulative and non-bioaccumulative chemicals using statistical learning approaches.

8. Adaptive prediction model in prospective molecular signature-based clinical studies.

9. An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests.

Authors: Carolin Strobl; James Malley; Gerhard Tutz
Journal: Psychol Methods Date: 2009-12

10. Molecular markers of carcinogenesis for risk stratification of individuals with colorectal polyps: a case-control study.

Authors: Samir Gupta; Han Sun; Sang Yi; Joy Storm; Guanghua Xiao; Bijal A Balasubramanian; Song Zhang; Raheela Ashfaq; Don C Rockey
Journal: Cancer Prev Res (Phila) Date: 2014-08-04