Literature DB >> 21918713

Weighted random subspace method for high dimensional data classification.

Xiaoye Li1, Hongyu Zhao.   

Abstract

High dimensional data, especially those emerging from genomics and proteomics studies, pose significant challenges to traditional classification algorithms because the performance of these algorithms may substantially deteriorate due to high dimensionality and existence of many noisy features in these data. To address these problems, pre-classification feature selection and aggregating algorithms have been proposed. However, most feature selection procedures either fail to consider potential interactions among the features or tend to over fit the data. The aggregating algorithms, e.g. the bagging predictor, the boosting algorithm, the random subspace method, and the Random Forests algorithm, are promising in handling high dimensional data. However, there is a lack of attention to optimal weight assignments to individual classifiers and this has prevented these algorithms from achieving better classification accuracy. In this article, we formulate the weight assignment problem and propose a heuristic optimization solution.We have applied the proposed weight assignment procedures to the random subspace method to develop a weighted random subspace method. Several public gene expression and mass spectrometry data sets at the Kent Ridge biomedical data repository have been analyzed by this novel method. We have found that significant improvement over the common equal weight assignment scheme may be achieved by our method.

Entities:  

Year:  2009        PMID: 21918713      PMCID: PMC3170928          DOI: 10.4310/sii.2009.v2.n2.a5

Source DB:  PubMed          Journal:  Stat Interface        ISSN: 1938-7989            Impact factor:   0.582


  6 in total

1.  Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma.

Authors:  Gavin J Gordon; Roderick V Jensen; Li-Li Hsiao; Steven R Gullans; Joshua E Blumenstock; Sridhar Ramaswamy; William G Richards; David J Sugarbaker; Raphael Bueno
Journal:  Cancer Res       Date:  2002-09-01       Impact factor: 12.701

2.  Monitoring expression of genes involved in drug metabolism and toxicology using DNA microarrays.

Authors:  D Gerhold; M Lu; J Xu; C Austin; C T Caskey; T Rushmore
Journal:  Physiol Genomics       Date:  2001-04-27       Impact factor: 3.107

3.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

4.  Use of proteomic patterns in serum to identify ovarian cancer.

Authors:  Emanuel F Petricoin; Ali M Ardekani; Ben A Hitt; Peter J Levine; Vincent A Fusaro; Seth M Steinberg; Gordon B Mills; Charles Simone; David A Fishman; Elise C Kohn; Lance A Liotta
Journal:  Lancet       Date:  2002-02-16       Impact factor: 79.321

5.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

Authors:  T R Golub; D K Slonim; P Tamayo; C Huard; M Gaasenbeek; J P Mesirov; H Coller; M L Loh; J R Downing; M A Caligiuri; C D Bloomfield; E S Lander
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

6.  Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data.

Authors:  Baolin Wu; Tom Abbott; David Fishman; Walter McMurray; Gil Mor; Kathryn Stone; David Ward; Kenneth Williams; Hongyu Zhao
Journal:  Bioinformatics       Date:  2003-09-01       Impact factor: 6.937

  6 in total
  3 in total

1.  TSG: a new algorithm for binary and multi-class cancer classification and informative genes selection.

Authors:  Haiyan Wang; Hongyan Zhang; Zhijun Dai; Ming-shun Chen; Zheming Yuan
Journal:  BMC Med Genomics       Date:  2013-01-23       Impact factor: 3.063

2.  Improving accuracy for cancer classification with a new algorithm for genes selection.

Authors:  Hongyan Zhang; Haiyan Wang; Zhijun Dai; Ming-shun Chen; Zheming Yuan
Journal:  BMC Bioinformatics       Date:  2012-11-13       Impact factor: 3.169

3.  Random Subspace Aggregation for Cancer Prediction with Gene Expression Profiles.

Authors:  Liying Yang; Zhimin Liu; Xiguo Yuan; Jianhua Wei; Junying Zhang
Journal:  Biomed Res Int       Date:  2016-11-24       Impact factor: 3.411

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.