Literature DB >> 29049908

Feature selection method based on support vector machine and shape analysis for high-throughput medical data.

Qiong Liu1, Qiong Gu2, Zhao Wu3.   

Abstract

Proteomics data analysis based on the mass-spectrometry technique can provide a powerful tool for early diagnosis of tumors and other diseases. It can be used for exploring the features that reflect the difference between samples from high-throughput mass spectrometry data, which are important for the identification of tumor markers. Proteomics mass spectrometry data have the characteristics of too few samples, too many features and noise interference, which pose a great challenge to traditional machine learning methods. Traditional unsupervised dimensionality reduction methods do not utilize the label information effectively, so the subspaces they find may not be the most separable ones of the data. To overcome the shortcomings of traditional methods, in this paper, we present a novel feature selection method based on support vector machine (SVM) and shape analysis. In the process of feature selection, our method considers not only the interaction between features but also the relationship between features and class labels, which improves the classification performance. The experimental results obtained from four groups of proteomics data show that, compared with traditional unsupervised feature extraction methods (i.e., Principal Component Analysis - Procrustes Analysis, PCA-PA), our method not only ensures that fewer features are selected but also ensures a high recognition rate. In addition, compared with the two kinds of multivariate filter methods, i.e., Max-Relevance Min-Redundancy (MRMR) and Fast Correlation-Based Filter (FCBF), our method has a higher recognition rate.
Copyright © 2017 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Feature selection; High-throughput medical data; Shape analysis; Support vector machine

Mesh:

Year:  2017        PMID: 29049908     DOI: 10.1016/j.compbiomed.2017.10.008

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  4 in total

1.  Sparse support vector machines with L0 approximation for ultra-high dimensional omics data.

Authors:  Zhenqiu Liu; David Elashoff; Steven Piantadosi
Journal:  Artif Intell Med       Date:  2019-04-30       Impact factor: 5.326

2.  Clinical risk assessment in early pregnancy for preeclampsia in nulliparous women: A population based cohort study.

Authors:  Anna Sandström; Jonathan M Snowden; Jonas Höijer; Matteo Bottai; Anna-Karin Wikström
Journal:  PLoS One       Date:  2019-11-27       Impact factor: 3.240

3.  Artificial Intelligence-Based Diagnostic Model for Detecting Keratoconus Using Videos of Corneal Force Deformation.

Authors:  Zuoping Tan; Xuan Chen; Kangsheng Li; Yan Liu; Huazheng Cao; Jing Li; Vishal Jhanji; Haohan Zou; Fenglian Liu; Riwei Wang; Yan Wang
Journal:  Transl Vis Sci Technol       Date:  2022-09-01       Impact factor: 3.048

4.  Improving the Diagnosis of Phenylketonuria by Using a Machine Learning-Based Screening Model of Neonatal MRM Data.

Authors:  Zhixing Zhu; Jianlei Gu; Georgi Z Genchev; Xiaoshu Cai; Yangmin Wang; Jing Guo; Guoli Tian; Hui Lu
Journal:  Front Mol Biosci       Date:  2020-07-07
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.