Literature DB >> 35370452

Grafted and Vanishing Random Subspaces.

Matthew A Corsetti1, Tanzy M Love1.   

Abstract

The Random Subspace Method (RSM) is an ensemble procedure in which each constituent learner is constructed using a randomly chosen subset of the data features. Regression trees are ideal candidate learners in RSM ensembles. By constructing trees upon different feature subsets, RSM reduces correlation between trees resulting in a stronger ensemble. Furthermore, it lessens computational burden by only considering a subset of the features when building each tree. Despite its apparent advantages, RSM has a notable drawback. In some instances a randomly chosen subspace may lack informative features. This is especially true in situations in which the number of truly informative variables is small relative to the total number of variables. Trees that are constructed using feature subsets lacking informative features can be damaging to the ensemble. Here we present Grafted Random Subspaces (GRS) and Vanishing Random Subspaces (VRS), two novel ensemble procedures designed to remedy the aforementioned drawback by reusing information across trees. Both techniques borrow from RSM by growing individual trees on randomly selected feature subsets. For each tree in a GRS ensemble, the most important variable is identified and guaranteed inclusion into the next q feature subsets. This allows GRS to recycle a promising feature from one tree across several successive trees, effectively grafting the variable into the next q active subsets. In the VRS procedure the least important feature is guaranteed exclusion from the next q feature subsets. This creates a more enriched pool of candidate variables from which the successive feature subsets are drawn.

Entities:  

Keywords:  Boosting; Ensemble procedures; Feature Weighting; Random Forests; Random Subspaces; Trees

Year:  2021        PMID: 35370452      PMCID: PMC8975250          DOI: 10.1007/s10044-021-01029-0

Source DB:  PubMed          Journal:  Pattern Anal Appl        ISSN: 1433-7541            Impact factor:   2.580


  3 in total

1.  Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval.

Authors:  Dacheng Tao; Xiaoou Tang; Xuelong Li; Xindong Wu
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2006-07       Impact factor: 6.226

2.  Boosting random subspace method.

Authors:  Nicolás García-Pedrajas; Domingo Ortiz-Boyer
Journal:  Neural Netw       Date:  2008-01-06

3.  Conditional variable importance for random forests.

Authors:  Carolin Strobl; Anne-Laure Boulesteix; Thomas Kneib; Thomas Augustin; Achim Zeileis
Journal:  BMC Bioinformatics       Date:  2008-07-11       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.