Literature DB >> 12973727

Protocols for disease classification from mass spectrometry data.

Michael Wagner1, Dayanand Naik, Alex Pothen.   

Abstract

We report our results in classifying protein matrix-assisted laser desorption/ionization-time of flight mass spectra obtained from serum samples into diseased and healthy groups. We discuss in detail five of the steps in preprocessing the mass spectral data for biomarker discovery, as well as our criterion for choosing a small set of peaks for classifying the samples. Cross-validation studies with four selected proteins yielded misclassification rates in the 10-15% range for all the classification methods. Three of these proteins or protein fragments are down-regulated and one up-regulated in lung cancer, the disease under consideration in this data set. When cross-validation studies are performed, care must be taken to ensure that the test set does not influence the choice of the peaks used in the classification. Misclassification rates are lower when both the training and test sets are used to select the peaks used in classification versus when only the training set is used. This expectation was validated for various statistical discrimination methods when thirteen peaks were used in cross-validation studies. One particular classification method, a linear support vector machine, exhibited especially robust performance when the number of peaks was varied from four to thirteen, and when the peaks were selected from the training set alone. Experiments with the samples randomly assigned to the two classes confirmed that misclassification rates were significantly higher in such cases than those observed with the true data. This indicates that our findings are indeed significant. We found closely matching masses in a database for protein expression in lung cancer for three of the four proteins we used to classify lung cancer. Data from additional samples, increased experience with the performance of various preprocessing techniques, and affirmation of the biological roles of the proteins that help in classification, will strengthen our conclusions in the future.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12973727     DOI: 10.1002/pmic.200300519

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  22 in total

Review 1.  Classification algorithms for phenotype prediction in genomics and proteomics.

Authors:  Habtom W Ressom; Rency S Varghese; Zhen Zhang; Jianhua Xuan; Robert Clarke
Journal:  Front Biosci       Date:  2008-01-01

2.  Processing MALDI Mass Spectra to Improve Mass Spectral Direct Tissue Analysis.

Authors:  Jeremy L Norris; Dale S Cornett; James A Mobley; Malin Andersson; Erin H Seeley; Pierre Chaurand; Richard M Caprioli
Journal:  Int J Mass Spectrom       Date:  2007-02-01       Impact factor: 1.986

3.  A novel comprehensive wave-form MS data processing method.

Authors:  Shuo Chen; Ming Li; Don Hong; Dean Billheimer; Huiming Li; Baogang J Xu; Yu Shyr
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

4.  Correcting common errors in identifying cancer-specific serum peptide signatures.

Authors:  Josep Villanueva; John Philip; Carlos A Chaparro; Yongbiao Li; Ricardo Toledo-Crow; Lin DeNoyer; Martin Fleisher; Richard J Robbins; Paul Tempst
Journal:  J Proteome Res       Date:  2005 Jul-Aug       Impact factor: 4.466

5.  Normalization in MALDI-TOF imaging datasets of proteins: practical considerations.

Authors:  Sören-Oliver Deininger; Dale S Cornett; Rainer Paape; Michael Becker; Charles Pineau; Sandra Rauser; Axel Walch; Eryk Wolski
Journal:  Anal Bioanal Chem       Date:  2011-04-12       Impact factor: 4.142

6.  Classification of astrocytomas and oligodendrogliomas from mass spectrometry data using sparse kernel machines.

Authors:  Jacob Huang; Behnood Gholami; Nathalie Y R Agar; Isaiah Norton; Wassim M Haddad; Allen R Tannenbaum
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2011

7.  On Comprehensive Mass Spectrometry Data Analysis for Proteome Profiling of Human Blood Samples.

Authors:  Sameer Manchanda; Mikaela Meyer; Qianqian Li; Kai Liang; Yan Li; Nan Kong
Journal:  J Healthc Inform Res       Date:  2018-05-22

8.  Bootstrap classification and point-based feature selection from age-staged mouse cerebellum tissues of matrix assisted laser desorption/ionization mass spectra using a fuzzy rule-building expert system.

Authors:  Peter B Harrington; Claudine Laurent; Douglas F Levinson; Pat Levitt; Sanford P Markey
Journal:  Anal Chim Acta       Date:  2007-08-06       Impact factor: 6.558

9.  Label-free, normalized quantification of complex mass spectrometry data for proteomic analysis.

Authors:  Noelle M Griffin; Jingyi Yu; Fred Long; Phil Oh; Sabrina Shore; Yan Li; Jim A Koziol; Jan E Schnitzer
Journal:  Nat Biotechnol       Date:  2009-12-13       Impact factor: 54.908

10.  Biomarker Discovery by Imperialist Competitive Algorithm in Mass Spectrometry Data for Ovarian Cancer Prediction.

Authors:  Shiva Pirhadi; Keivan Maghooli; Niloofar Yousefi Moteghaed; Masoud Garshasbi; Seyed Jalaleddin Mousavirad
Journal:  J Med Signals Sens       Date:  2021-05-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.