Literature DB >> 32928475

Constructing bi-plots for random forest: Tutorial.

Lionel Blanchet1, Raffaele Vitale2, Robert van Vorstenbosch1, George Stavropoulos1, John Pender3, Daisy Jonkers4, Frederik-Jan van Schooten1, Agnieszka Smolinska5.   

Abstract

Current technological developments have allowed for a significant increase and availability of data. Consequently, this has opened enormous opportunities for the machine learning and data science field, translating into the development of new algorithms in a wide range of applications in medical, biomedical, daily-life, and national security areas. Ensemble techniques are among the pillars of the machine learning field, and they can be defined as approaches in which multiple, complex, independent/uncorrelated, predictive models are subsequently combined by either averaging or voting to yield a higher model performance. Random forest (RF), a popular ensemble method, has been successfully applied in various domains due to its ability to build predictive models with high certainty and little necessity of model optimization. RF provides both a predictive model and an estimation of the variable importance. However, the estimation of the variable importance is based on thousands of trees, and therefore, it does not specify which variable is important for which sample group. The present study demonstrates an approach based on the pseudo-sample principle that allows for construction of bi-plots (i.e. spin plots) associated with RF models. The pseudo-sample principle for RF. is explained and demonstrated by using two simulated datasets, and three different types of real data, which include political sciences, food chemistry and the human microbiome data. The pseudo-sample bi-plots, associated with RF and its unsupervised version, allow for a versatile visualization of multivariate models, and the variable importance and the relation among them.
Copyright © 2020 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Bi-plots; Principal coordinates analysis; Proximity matrix; Pseudo samples; Random forest interpretation

Year:  2020        PMID: 32928475     DOI: 10.1016/j.aca.2020.06.043

Source DB:  PubMed          Journal:  Anal Chim Acta        ISSN: 0003-2670            Impact factor:   6.558


  8 in total

1.  Construction of Machine Learning Models to Predict Changes in Immune Function Using Clinical Monitoring Indices in HIV/AIDS Patients After 9.9-Years of Antiretroviral Therapy in Yunnan, China.

Authors:  Bingxiang Li; Mingyu Li; Yu Song; Xiaoning Lu; Dajin Liu; Chenglu He; Ruixian Zhang; Xinrui Wan; Renning Zhang; Ming Sun; Yi-Qun Kuang; Ya Li
Journal:  Front Cell Infect Microbiol       Date:  2022-05-12       Impact factor: 6.073

2.  Machine learning-assisted ammonium detection using zinc oxide/multi-walled carbon nanotube composite based impedance sensors.

Authors:  Akshaya Kumar Aliyana; S K Naveen Kumar; Pradeep Marimuthu; Aiswarya Baburaj; Michael Adetunji; Terrance Frederick; Praveen Sekhar; Renny Edwin Fernandez
Journal:  Sci Rep       Date:  2021-12-21       Impact factor: 4.379

3.  Machine learning for infection risk prediction in postoperative patients with non-mechanical ventilation and intravenous neurotargeted drugs.

Authors:  Yi Du; Haipeng Shi; Xiaojing Yang; Weidong Wu
Journal:  Front Neurol       Date:  2022-08-01       Impact factor: 4.086

4.  Identification of potential biomarkers of inflammation-related genes for ischemic cardiomyopathy.

Authors:  Jianru Wang; Shiyang Xie; Yanling Cheng; Xiaohui Li; Jian Chen; Mingjun Zhu
Journal:  Front Cardiovasc Med       Date:  2022-08-23

Review 5.  Applications of machine learning in tumor-associated macrophages.

Authors:  Zhen Li; Qijun Yu; Qingyuan Zhu; Xiaojing Yang; Zhaobin Li; Jie Fu
Journal:  Front Immunol       Date:  2022-09-23       Impact factor: 8.786

6.  Combining comparative genomic analysis with machine learning reveals some promising diagnostic markers to identify five common pathogenic non-tuberculous mycobacteria.

Authors:  Xinmiao Jia; Linfang Yang; Cuidan Li; Yingchun Xu; Qiwen Yang; Fei Chen
Journal:  Microb Biotechnol       Date:  2021-05-21       Impact factor: 5.813

Review 7.  Strategies for Sudden Cardiac Death Prevention.

Authors:  Mattia Corianò; Francesco Tona
Journal:  Biomedicines       Date:  2022-03-10

8.  A Predictive Model for the Risk of Cognitive Impairment in Patients with Gallstones.

Authors:  Zhaofang Liu; Chuanyan Li
Journal:  Biomed Res Int       Date:  2021-07-17       Impact factor: 3.411

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.