Literature DB >> 26893083

Variable importance analysis based on rank aggregation with applications in metabolomics for biomarker discovery.

Yong-Huan Yun1, Bai-Chuan Deng2, Dong-Sheng Cao3, Wei-Ting Wang1, Yi-Zeng Liang4.   

Abstract

Biomarker discovery is one important goal in metabolomics, which is typically modeled as selecting the most discriminating metabolites for classification and often referred to as variable importance analysis or variable selection. Until now, a number of variable importance analysis methods to discover biomarkers in the metabolomics studies have been proposed. However, different methods are mostly likely to generate different variable ranking results due to their different principles. Each method generates a variable ranking list just as an expert presents an opinion. The problem of inconsistency between different variable ranking methods is often ignored. To address this problem, a simple and ideal solution is that every ranking should be taken into account. In this study, a strategy, called rank aggregation, was employed. It is an indispensable tool for merging individual ranking lists into a single "super"-list reflective of the overall preference or importance within the population. This "super"-list is regarded as the final ranking for biomarker discovery. Finally, it was used for biomarkers discovery and selecting the best variable subset with the highest predictive classification accuracy. Nine methods were used, including three univariate filtering and six multivariate methods. When applied to two metabolic datasets (Childhood overweight dataset and Tubulointerstitial lesions dataset), the results show that the performance of rank aggregation has improved greatly with higher prediction accuracy compared with using all variables. Moreover, it is also better than penalized method, least absolute shrinkage and selectionator operator (LASSO), with higher prediction accuracy or less number of selected variables which are more interpretable.
Copyright © 2016 Elsevier B.V. All rights reserved.

Keywords:  Biomarker discovery; Metabolomics; Rank aggregation; Variable importance; Variable ranking

Mesh:

Substances:

Year:  2016        PMID: 26893083     DOI: 10.1016/j.aca.2015.12.043

Source DB:  PubMed          Journal:  Anal Chim Acta        ISSN: 0003-2670            Impact factor:   6.558


  4 in total

1.  Multi-factor combined biomarker screening strategy to rapidly diagnose Alzheimer's disease and evaluate drug effect based on a rat model.

Authors:  Yanmeng Liu; Xinyue Zhang; Weiwei Lin; Nurmuhammat Kehriman; Wen Kuang; Xiaomei Ling
Journal:  J Pharm Anal       Date:  2022-04-19

2.  Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance.

Authors:  Guang-Hui Fu; Jia-Bao Wang; Min-Jie Zong; Lun-Zhao Yi
Journal:  Metabolites       Date:  2021-06-14

3.  Multi-Target Screening and Experimental Validation of Natural Products from Selaginella Plants against Alzheimer's Disease.

Authors:  Yin-Hua Deng; Ning-Ning Wang; Zhen-Xing Zou; Lin Zhang; Kang-Ping Xu; Alex F Chen; Dong-Sheng Cao; Gui-Shan Tan
Journal:  Front Pharmacol       Date:  2017-08-25       Impact factor: 5.810

4.  Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study.

Authors:  Liying Zhang; Yikang Wang; Miaomiao Niu; Chongjian Wang; Zhenfei Wang
Journal:  Sci Rep       Date:  2020-03-10       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.