Literature DB >> 31197323

A critical assessment of the feature selection methods used for biomarker discovery in current metaproteomics studies.

Jing Tang1,2, Yunxia Wang1, Jianbo Fu1, Ying Zhou1, Yongchao Luo1, Ying Zhang1, Bo Li3, Qingxia Yang1,3, Weiwei Xue3, Yan Lou4, Yunqing Qiu4, Feng Zhu1,3.   

Abstract

Microbial community (MC) has great impact on mediating complex disease indications, biogeochemical cycling and agricultural productivities, which makes metaproteomics powerful technique for quantifying diverse and dynamic composition of proteins or peptides. The key role of biostatistical strategies in MC study is reported to be underestimated, especially the appropriate application of feature selection method (FSM) is largely ignored. Although extensive efforts have been devoted to assessing the performance of FSMs, previous studies focused only on their classification accuracy without considering their ability to correctly and comprehensively identify the spiked proteins. In this study, the performances of 14 FSMs were comprehensively assessed based on two key criteria (both sample classification and spiked protein discovery) using a variety of metaproteomics benchmarks. First, the classification accuracies of those 14 FSMs were evaluated. Then, their abilities in identifying the proteins of different spiked concentrations were assessed. Finally, seven FSMs (FC, LMEB, OPLS-DA, PLS-DA, SAM, SVM-RFE and T-Test) were identified as performing consistently superior or good under both criteria with the PLS-DA performing consistently superior. In summary, this study served as comprehensive analysis on the performances of current FSMs and could provide a valuable guideline for researchers in metaproteomics.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  classification accuracy; feature selection method; metaproteomics; microbiome; spiked proteins

Year:  2020        PMID: 31197323     DOI: 10.1093/bib/bbz061

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  12 in total

1.  Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework.

Authors:  Fuyi Li; Jinxiang Chen; Zongyuan Ge; Ya Wen; Yanwei Yue; Morihiro Hayashida; Abdelkader Baggag; Halima Bensmail; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

2.  Understanding the mutational frequency in SARS-CoV-2 proteome using structural features.

Authors:  Puneet Rawat; Divya Sharma; Medha Pandey; R Prabakaran; M Michael Gromiha
Journal:  Comput Biol Med       Date:  2022-06-07       Impact factor: 6.698

3.  Identification of the gene signature reflecting schizophrenia's etiology by constructing artificial intelligence-based method of enhanced reproducibility.

Authors:  Qing-Xia Yang; Yun-Xia Wang; Feng-Cheng Li; Song Zhang; Yong-Chao Luo; Yi Li; Jing Tang; Bo Li; Yu-Zong Chen; Wei-Wei Xue; Feng Zhu
Journal:  CNS Neurosci Ther       Date:  2019-07-27       Impact factor: 5.243

4.  A Method for Identifying Vesicle Transport Proteins Based on LibSVM and MRMD.

Authors:  Zhiyu Tao; Yanjuan Li; Zhixia Teng; Yuming Zhao
Journal:  Comput Math Methods Med       Date:  2020-10-19       Impact factor: 2.238

Review 5.  The miRNA: a small but powerful RNA for COVID-19.

Authors:  Song Zhang; Kuerbannisha Amahong; Xiuna Sun; Xichen Lian; Jin Liu; Huaicheng Sun; Yan Lou; Feng Zhu; Yunqing Qiu
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

6.  Deep Learning Architecture Reduction for fMRI Data.

Authors:  Ruben Alvarez-Gonzalez; Andres Mendez-Vazquez
Journal:  Brain Sci       Date:  2022-02-08

7.  Metaproteomics characterizes human gut microbiome function in colorectal cancer.

Authors:  Shuping Long; Yi Yang; Chengpin Shen; Yiwen Wang; Anmei Deng; Qin Qin; Liang Qiao
Journal:  NPJ Biofilms Microbiomes       Date:  2020-03-24       Impact factor: 7.290

8.  Systematic Identification of Housekeeping Genes Possibly Used as References in Caenorhabditis elegans by Large-Scale Data Integration.

Authors:  Jingxin Tao; Youjin Hao; Xudong Li; Huachun Yin; Xiner Nie; Jie Zhang; Boying Xu; Qiao Chen; Bo Li
Journal:  Cells       Date:  2020-03-24       Impact factor: 6.600

9.  GIMICA: host genetic and immune factors shaping human microbiota.

Authors:  Jing Tang; Xianglu Wu; Minjie Mou; Chuan Wang; Lidan Wang; Fengcheng Li; Maiyuan Guo; Jiayi Yin; Wenqin Xie; Xiaona Wang; Yingxiong Wang; Yubin Ding; Weiwei Xue; Feng Zhu
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

10.  Residue-Residue Contact Can Be a Potential Feature for the Prediction of Lysine Crotonylation Sites.

Authors:  Rulan Wang; Zhuo Wang; Zhongyan Li; Tzong-Yi Lee
Journal:  Front Genet       Date:  2022-01-04       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.