Literature DB >> 15466910

BagBoosting for tumor classification with gene expression data.

Marcel Dettling1.   

Abstract

MOTIVATION: Microarray experiments are expected to contribute significantly to the progress in cancer treatment by enabling a precise and early diagnosis. They create a need for class prediction tools, which can deal with a large number of highly correlated input variables, perform feature selection and provide class probability estimates that serve as a quantification of the predictive uncertainty. A very promising solution is to combine the two ensemble schemes bagging and boosting to a novel algorithm called BagBoosting.
RESULTS: When bagging is used as a module in boosting, the resulting classifier consistently improves the predictive performance and the probability estimates of both bagging and boosting on real and simulated gene expression data. This quasi-guaranteed improvement can be obtained by simply making a bigger computing effort. The advantageous predictive potential is also confirmed by comparing BagBoosting to several established class prediction tools for microarray data. AVAILABILITY: Software for the modified boosting algorithms, for benchmark studies and for the simulation of microarray data are available as an R package under GNU public license at http://stat.ethz.ch/~dettling/bagboost.html.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15466910     DOI: 10.1093/bioinformatics/bth447

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  45 in total

1.  Improved mean estimation and its application to diagonal discriminant analysis.

Authors:  Tiejun Tong; Liang Chen; Hongyu Zhao
Journal:  Bioinformatics       Date:  2011-12-14       Impact factor: 6.937

2.  Identification of cancer genomic markers via integrative sparse boosting.

Authors:  Yuan Huang; Jian Huang; Ben-Chang Shia; Shuangge Ma
Journal:  Biostatistics       Date:  2011-10-31       Impact factor: 5.899

3.  Sparse partial least squares classification for high dimensional data.

Authors:  Dongjun Chung; Sunduz Keles
Journal:  Stat Appl Genet Mol Biol       Date:  2010-03-03

4.  Probabilistic classifiers with high-dimensional data.

Authors:  Kyung In Kim; Richard Simon
Journal:  Biostatistics       Date:  2010-11-17       Impact factor: 5.899

5.  Variable selection for discriminant analysis with Markov random field priors for the analysis of microarray data.

Authors:  Francesco C Stingo; Marina Vannucci
Journal:  Bioinformatics       Date:  2010-12-14       Impact factor: 6.937

6.  Mining gene expression profiles: an integrated implementation of kernel principal component analysis and singular value decomposition.

Authors:  Ferran Reverter; Esteban Vegas; Pedro Sánchez
Journal:  Genomics Proteomics Bioinformatics       Date:  2010-09       Impact factor: 7.691

7.  Patient-centered yes/no prognosis using learning machines.

Authors:  I R König; J D Malley; S Pajevic; C Weimar; H-C Diener; A Ziegler
Journal:  Int J Data Min Bioinform       Date:  2008       Impact factor: 0.667

8.  Higher criticism thresholding: Optimal feature selection when useful features are rare and weak.

Authors:  David Donoho; Jiashun Jin
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-24       Impact factor: 11.205

9.  Statistical challenges of high-dimensional data.

Authors:  Iain M Johnstone; D Michael Titterington
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2009-11-13       Impact factor: 4.226

Review 10.  Systems analysis of high-throughput data.

Authors:  Rosemary Braun
Journal:  Adv Exp Med Biol       Date:  2014       Impact factor: 2.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.