Literature DB >> 18312222

A classification model for the Leiden proteomics competition.

Huub C J Hoefsloot1, Suzanne Smit, Age K Smilde.   

Abstract

A strategy is presented to build a discrimination model in proteomics studies. The model is built using cross-validation. This cross-validation step can simply be combined with a variable selection method, called rank products. The strategy is especially suitable for the low-samples-to-variables-ratio (undersampling) case, as is often encountered in proteomics and metabolomics studies. As a classification method, Principal Component Discriminant Analysis is used; however, the methodology can be used with any classifier. A data set containing serum samples from breast cancer patients and healthy controls is analysed. Double cross-validation shows that the sensitivity of the model is 82% and the specificity 86%. Potential putative biomarkers are identified using the variable selection method. In each cross-validation loop a classification model is built. The final classification uses a majority voting scheme from the ensemble classifier.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18312222     DOI: 10.2202/1544-6115.1351

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  9 in total

1.  Comments on the rank product method for analyzing replicated experiments.

Authors:  James A Koziol
Journal:  FEBS Lett       Date:  2010-01-20       Impact factor: 4.124

2.  Combination approaches improve predictive performance of diagnostic rules for mass-spectrometry proteomic data.

Authors:  Alexia Kakourou; Werner Vach; Bart Mertens
Journal:  J Comput Biol       Date:  2014-12       Impact factor: 1.479

3.  A critical assessment of feature selection methods for biomarker discovery in clinical proteomics.

Authors:  Christin Christin; Huub C J Hoefsloot; Age K Smilde; B Hoekman; Frank Suits; Rainer Bischoff; Peter Horvatovich
Journal:  Mol Cell Proteomics       Date:  2012-10-31       Impact factor: 5.911

4.  A comparison of methods for classifying clinical samples based on proteomics data: a case study for statistical and machine learning approaches.

Authors:  Dayle L Sampson; Tony J Parker; Zee Upton; Cameron P Hurst
Journal:  PLoS One       Date:  2011-09-28       Impact factor: 3.240

5.  The rank product method with two samples.

Authors:  James A Koziol
Journal:  FEBS Lett       Date:  2010-10-14       Impact factor: 4.124

6.  To aggregate or not to aggregate high-dimensional classifiers.

Authors:  Cheng-Jian Xu; Huub C J Hoefsloot; Age K Smilde
Journal:  BMC Bioinformatics       Date:  2011-05-13       Impact factor: 3.169

Review 7.  A survey of computational tools for downstream analysis of proteomic and other omic datasets.

Authors:  Anis Karimpour-Fard; L Elaine Epperson; Lawrence E Hunter
Journal:  Hum Genomics       Date:  2015-10-28       Impact factor: 4.639

8.  A Simple Rank Product Approach for Analyzing Two Classes.

Authors:  Tae Young Yang
Journal:  Bioinform Biol Insights       Date:  2015-07-16

9.  Fusing metabolomics data sets with heterogeneous measurement errors.

Authors:  Sandra Waaijenborg; Oksana Korobko; Ko Willems van Dijk; Mirjam Lips; Thomas Hankemeier; Tom F Wilderjans; Age K Smilde; Johan A Westerhuis
Journal:  PLoS One       Date:  2018-04-26       Impact factor: 3.240

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.