Literature DB >> 17512828

Assessing the statistical validity of proteomics based biomarkers.

Suzanne Smit1, Mariëlle J van Breemen, Huub C J Hoefsloot, Age K Smilde, Johannes M F G Aerts, Chris G de Koster.   

Abstract

A strategy is presented for the statistical validation of discrimination models in proteomics studies. Several existing tools are combined to form a solid statistical basis for biomarker discovery that should precede a biochemical validation of any biomarker. These tools consist of permutation tests, single and double cross-validation. The cross-validation steps can simply be combined with a new variable selection method, called rank products. The strategy is especially suited for the low-samples-to-variables-ratio (undersampling) case, as is often encountered in proteomics and metabolomics studies. As a classification method, principal component discriminant analysis is used; however, the methodology can be used with any classifier. A dataset containing serum samples from Gaucher patients and healthy controls serves as a test case. Double cross-validation shows that the sensitivity of the model is 89% and the specificity 90%. Potential putative biomarkers are identified using the novel variable selection method. Results from permutation tests support the choice of double cross-validation as the tool for determining error rates when the modelling procedure involves a tuneable parameter. This shows that even cross-validation does not guarantee unbiased results. The validation of discrimination models with a combination of permutation tests and double cross-validation helps to avoid erroneous results which may result from the undersampling.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17512828     DOI: 10.1016/j.aca.2007.04.043

Source DB:  PubMed          Journal:  Anal Chim Acta        ISSN: 0003-2670            Impact factor:   6.558


  38 in total

Review 1.  Finding biomarkers is getting easier.

Authors:  Brian Patrick Bradley
Journal:  Ecotoxicology       Date:  2012-03-13       Impact factor: 2.823

2.  Global urinary metabolic profiling procedures using gas chromatography-mass spectrometry.

Authors:  Eric Chun Yong Chan; Kishore Kumar Pasikanti; Jeremy K Nicholson
Journal:  Nat Protoc       Date:  2011-09-08       Impact factor: 13.491

3.  Double-check: validation of diagnostic statistics for PLS-DA models in metabolomics studies.

Authors:  Ewa Szymańska; Edoardo Saccenti; Age K Smilde; Johan A Westerhuis
Journal:  Metabolomics       Date:  2011-07-08       Impact factor: 4.290

4.  Multivariate paired data analysis: multilevel PLSDA versus OPLSDA.

Authors:  Johan A Westerhuis; Ewoud J J van Velzen; Huub C J Hoefsloot; Age K Smilde
Journal:  Metabolomics       Date:  2009-10-28       Impact factor: 4.290

5.  A critical assessment of feature selection methods for biomarker discovery in clinical proteomics.

Authors:  Christin Christin; Huub C J Hoefsloot; Age K Smilde; B Hoekman; Frank Suits; Rainer Bischoff; Peter Horvatovich
Journal:  Mol Cell Proteomics       Date:  2012-10-31       Impact factor: 5.911

6.  Evaluation of multiple variate selection methods from a biological perspective: a nutrigenomics case study.

Authors:  Henri S Tapp; Marijana Radonjic; E Kate Kemsley; Uwe Thissen
Journal:  Genes Nutr       Date:  2012-03-02       Impact factor: 5.523

7.  Mass Spectrometry Imaging and GC-MS Profiling of the Mammalian Peripheral Sensory-Motor Circuit.

Authors:  Stanislav S Rubakhin; Alexander Ulanov; Jonathan V Sweedler
Journal:  J Am Soc Mass Spectrom       Date:  2015-03-31       Impact factor: 3.109

8.  HDL in humans with cardiovascular disease exhibits a proteomic signature.

Authors:  Tomás Vaisar; Philip Mayer; Erik Nilsson; Xue-Qiao Zhao; Robert Knopp; Bryan J Prazen
Journal:  Clin Chim Acta       Date:  2010-03-20       Impact factor: 3.786

9.  Raman spectroscopy as a promising tool for noninvasive point-of-care glucose monitoring.

Authors:  Maarten J Scholtes-Timmerman; Sabina Bijlsma; Marion J Fokkert; Robbert Slingerland; Sjaak J F van Veen
Journal:  J Diabetes Sci Technol       Date:  2014-07-18

10.  Improving the analysis of designed studies by combining statistical modelling with study design information.

Authors:  Uwe Thissen; Suzan Wopereis; Sjoerd A A van den Berg; Ivana Bobeldijk; Robert Kleemann; Teake Kooistra; Ko Willems van Dijk; Ben van Ommen; Age K Smilde
Journal:  BMC Bioinformatics       Date:  2009-02-07       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.