Random forest construction with robust semisupervised node splitting.

Xiao Liu, Mingli Song, Dacheng Tao, Zicheng Liu, Luming Zhang, Chun Chen, Jiajun Bu.   

Abstract

Random forest (RF) is an important classifier with applications in various machine learning tasks, but its performance relies heavily on the size of the labeled training set. In this paper, we investigate constructing RFs from a small amount of labeled data and find that the performance bottleneck lies in the node splitting procedure: existing solutions fail to properly partition the feature space when training data are insufficient. To achieve robust node splitting with insufficient data, we present semisupervised splitting, which splits nodes under the guidance of both labeled and abundant unlabeled data. In particular, an accurate quality measure of node splitting is obtained by kernel-based density estimation, for which a multiclass version of the asymptotic mean integrated squared error criterion is proposed to adaptively select the optimal kernel bandwidth. To avoid the curse of dimensionality, we project the data points from the original high-dimensional feature space onto a low-dimensional subspace before estimation. A unified optimization framework is proposed to select a coupled pair of subspace and separating hyperplane such that the smoothness of the subspace and the quality of the splitting are guaranteed simultaneously. Compared with conventional margin maximization-based semisupervised methods, our algorithm efficiently avoids the overfitting caused by bad initialization and local maxima. We demonstrate the effectiveness of the proposed algorithm by comparing it with state-of-the-art supervised and semisupervised algorithms on typical computer vision applications, such as object categorization, face recognition, and image segmentation, using publicly available data sets.
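
The general idea of scoring a node split with both labeled and unlabeled data can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the function names are hypothetical, Silverman's rule of thumb stands in for the multiclass asymptotic mean integrated squared error criterion, the projection direction is fixed by hand rather than optimized jointly with the hyperplane, and entropy-based gain is assumed as the quality measure. Class densities are estimated from the few labeled points after projection to one dimension, and the resulting soft class posteriors of all points (labeled and unlabeled) are used to score a threshold.

```python
import numpy as np

def silverman_bandwidth(x):
    # Rule-of-thumb bandwidth for a 1-D Gaussian kernel.
    # Assumption: a stand-in for the paper's multiclass AMISE criterion.
    n = len(x)
    sigma = np.std(x, ddof=1) if n > 1 else 1.0
    return max(1.06 * sigma * n ** (-0.2), 1e-3)

def gaussian_kde_1d(train, query, h):
    # Evaluate a 1-D Gaussian kernel density estimate at each query point.
    z = (query[:, None] - train[None, :]) / h
    return np.exp(-0.5 * z ** 2).sum(axis=1) / (len(train) * h * np.sqrt(2 * np.pi))

def soft_posteriors(z_lab, y_lab, z_all, n_classes):
    # p(class | z) for every projected point, from per-class KDEs
    # fitted on the labeled points only.
    dens = np.zeros((len(z_all), n_classes))
    for c in range(n_classes):
        zc = z_lab[y_lab == c]
        if len(zc) == 0:
            continue
        prior = len(zc) / len(z_lab)
        dens[:, c] = prior * gaussian_kde_1d(zc, z_all, silverman_bandwidth(zc))
    dens += 1e-12  # guard against division by zero
    return dens / dens.sum(axis=1, keepdims=True)

def entropy(p):
    p = p[p > 0]
    return -(p * np.log2(p)).sum()

def split_gain(z_all, post, threshold):
    # Entropy reduction of the soft class distribution when the node
    # is split at `threshold` along the projected axis.
    left = z_all <= threshold
    n, n_left = len(z_all), int(left.sum())
    if n_left == 0 or n_left == n:
        return 0.0
    parent = entropy(post.mean(axis=0))
    h_left = entropy(post[left].mean(axis=0))
    h_right = entropy(post[~left].mean(axis=0))
    return parent - (n_left / n) * h_left - ((n - n_left) / n) * h_right

# Synthetic example: two 5-D clusters, only 5 labeled points per class.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2, 1, (100, 5)), rng.normal(2, 1, (100, 5))])
lab_idx = np.r_[0:5, 100:105]
y_lab = np.r_[np.zeros(5, dtype=int), np.ones(5, dtype=int)]

w = np.ones(5) / np.sqrt(5)  # hand-picked projection direction (assumption)
z_all = X @ w                # all points, labeled and unlabeled
post = soft_posteriors(z_all[lab_idx], y_lab, z_all, n_classes=2)

gain_mid = split_gain(z_all, post, threshold=0.0)   # between the clusters
gain_far = split_gain(z_all, post, threshold=-6.0)  # deep inside one cluster
```

In this toy setting the threshold between the two clusters should score higher than one placed deep inside a cluster, even though only ten points carry labels; the unlabeled points contribute through their estimated posteriors.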

Year:  2014        PMID: 25494503     DOI: 10.1109/TIP.2014.2378017

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  5 in total

1.  Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports.

Authors:  Po-Hao Chen; Hanna Zafar; Maya Galperin-Aizenberg; Tessa Cook
Journal:  J Digit Imaging       Date:  2018-04       Impact factor: 4.056

2.  A risk score based on baseline risk factors for predicting mortality in COVID-19 patients.

Authors:  Ze Chen; Jing Chen; Jianghua Zhou; Fang Lei; Feng Zhou; Juan-Juan Qin; Xiao-Jing Zhang; Lihua Zhu; Ye-Mao Liu; Haitao Wang; Ming-Ming Chen; Yan-Ci Zhao; Jing Xie; Lijun Shen; Xiaohui Song; Xingyuan Zhang; Chengzhang Yang; Weifang Liu; Xiao Zhang; Deliang Guo; Youqin Yan; Mingyu Liu; Weiming Mao; Liming Liu; Ping Ye; Bing Xiao; Pengcheng Luo; Zixiong Zhang; Zhigang Lu; Junhai Wang; Haofeng Lu; Xigang Xia; Daihong Wang; Xiaofeng Liao; Gang Peng; Liang Liang; Jun Yang; Guohua Chen; Elena Azzolini; Alessio Aghemo; Michele Ciccarelli; Gianluigi Condorelli; Giulio G Stefanini; Xiang Wei; Bing-Hong Zhang; Xiaodong Huang; Jiahong Xia; Yufeng Yuan; Zhi-Gang She; Jiao Guo; Yibin Wang; Peng Zhang; Hongliang Li
Journal:  Curr Med Res Opin       Date:  2021-04-10       Impact factor: 2.580

3.  WiFi Indoor Localization with CSI Fingerprinting-Based Random Forest.

Authors:  Yanzhao Wang; Chundi Xiu; Xuanli Zhang; Dongkai Yang
Journal:  Sensors (Basel)       Date:  2018-08-31       Impact factor: 3.576

4.  A biomarker basing on radiomics for the prediction of overall survival in non-small cell lung cancer patients.

Authors:  Bo He; Wei Zhao; Jiang-Yuan Pi; Dan Han; Yuan-Ming Jiang; Zhen-Guang Zhang; Wei Zhao
Journal:  Respir Res       Date:  2018-10-10

5.  MRI-Based Radiomics Models for Predicting Risk Classification of Gastrointestinal Stromal Tumors.

Authors:  Haijia Mao; Bingqian Zhang; Mingyue Zou; Yanan Huang; Liming Yang; Cheng Wang; PeiPei Pang; Zhenhua Zhao
Journal:  Front Oncol       Date:  2021-05-10       Impact factor: 6.244
