Literature DB >> 12973731

Machine learning approaches to lung cancer prediction from mass spectra.

Melanie Hilario1, Alexandros Kalousis, Markus Müller, Christian Pellegrini.   

Abstract

We addressed the problem of discriminating between 24 diseased and 17 healthy specimens on the basis of protein mass spectra. To prepare the data, we performed mass to charge ratio (m/z) normalization, baseline elimination, and conversion of absolute peak height measures to height ratios. After preprocessing, the major difficulty encountered was the extremely large number of variables (1676 m/z values) versus the number of examples (41). Dimensionality reduction was treated as an integral part of the classification process; variable selection was coupled with model construction in a single ten-fold cross-validation loop. We explored different experimental setups involving two peak height representations, two variable selection methods, and six induction algorithms, all on both the original 1676-mass data set and on a prescreened 124-mass data set. Highest predictive accuracies (1-2 off-sample misclassifications) were achieved by a multilayer perceptron and Naïve Bayes, with the latter displaying more consistent performance (hence greater reliability) over varying experimental conditions. We attempted to identify the most discriminant peaks (proteins) on the basis of scores assigned by the two variable selection methods and by neural network based sensitivity analysis. These three scoring schemes consistently ranked four peaks as the most relevant discriminators: 11683, 1403, 17350 and 66107.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12973731     DOI: 10.1002/pmic.200300523

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  5 in total

1.  The parameter sensitivity of random forests.

Authors:  Barbara F F Huang; Paul C Boutros
Journal:  BMC Bioinformatics       Date:  2016-09-01       Impact factor: 3.169

Review 2.  Intelligence Algorithms for Protein Classification by Mass Spectrometry.

Authors:  Zichuan Fan; Fanchen Kong; Yang Zhou; Yiqing Chen; Yalan Dai
Journal:  Biomed Res Int       Date:  2018-11-11       Impact factor: 3.411

3.  A study of aortic dissection screening method based on multiple machine learning models.

Authors:  Lijue Liu; Caiwang Zhang; Guogang Zhang; Yan Gao; Jingmin Luo; Wei Zhang; Yi Li; Yang Mu
Journal:  J Thorac Dis       Date:  2020-03       Impact factor: 2.895

4.  Characterising phase variations in MALDI-TOF data and correcting them by peak alignment.

Authors:  Simon M Lin; Richard P Haney; Michael J Campa; Michael C Fitzgerald; Edward F Patz
Journal:  Cancer Inform       Date:  2005

Review 5.  Application of Artificial Intelligence in Lung Cancer.

Authors:  Hwa-Yen Chiu; Heng-Sheng Chao; Yuh-Min Chen
Journal:  Cancers (Basel)       Date:  2022-03-08       Impact factor: 6.639

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.