Literature DB >> 12801866

Boosting for tumor classification with gene expression data.

Marcel Dettling1, Peter Bühlmann.   

Abstract

MOTIVATION: Microarray experiments generate large datasets with expression values for thousands of genes but not more than a few dozens of samples. Accurate supervised classification of tissue samples in such high-dimensional problems is difficult but often crucial for successful diagnosis and treatment. A promising way to meet this challenge is by using boosting in conjunction with decision trees.
RESULTS: We demonstrate that the generic boosting algorithm needs some modification to become an accurate classifier in the context of gene expression data. In particular, we present a feature preselection method, a more robust boosting procedure and a new approach for multi-categorical problems. This allows for slight to drastic increase in performance and yields competitive results on several publicly available datasets. AVAILABILITY: Software for the modified boosting algorithms as well as for decision trees is available for free in R at http://stat.ethz.ch/~dettling/boosting.html.

Entities:  

Mesh:

Year:  2003        PMID: 12801866     DOI: 10.1093/bioinformatics/btf867

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  57 in total

1.  Identification of cancer genomic markers via integrative sparse boosting.

Authors:  Yuan Huang; Jian Huang; Ben-Chang Shia; Shuangge Ma
Journal:  Biostatistics       Date:  2011-10-31       Impact factor: 5.899

2.  Relationship Between Quantitative Adverse Plaque Features From Coronary Computed Tomography Angiography and Downstream Impaired Myocardial Flow Reserve by 13N-Ammonia Positron Emission Tomography: A Pilot Study.

Authors:  Damini Dey; Mariana Diaz Zamudio; Annika Schuhbaeck; Luis Eduardo Juarez Orozco; Yuka Otaki; Heidi Gransar; Debiao Li; Guido Germano; Stephan Achenbach; Daniel S Berman; Aloha Meave; Erick Alexanderson; Piotr J Slomka
Journal:  Circ Cardiovasc Imaging       Date:  2015-10       Impact factor: 7.792

3.  Patient-centered yes/no prognosis using learning machines.

Authors:  I R König; J D Malley; S Pajevic; C Weimar; H-C Diener; A Ziegler
Journal:  Int J Data Min Bioinform       Date:  2008       Impact factor: 0.667

4.  High Dimensional Classification Using Features Annealed Independence Rules.

Authors:  Jianqing Fan; Yingying Fan
Journal:  Ann Stat       Date:  2008       Impact factor: 4.028

5.  An efficient algorithm coupled with synthetic minority over-sampling technique to classify imbalanced PubChem BioAssay data.

Authors:  Ming Hao; Yanli Wang; Stephen H Bryant
Journal:  Anal Chim Acta       Date:  2013-11-06       Impact factor: 6.558

6.  Mining the literature for genes associated with placenta-mediated maternal diseases.

Authors:  Laritza M Rodriguez; Stephanie M Morrison; Kathleen Greenberg; Dina Demner Fushman
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

7.  Higher criticism thresholding: Optimal feature selection when useful features are rare and weak.

Authors:  David Donoho; Jiashun Jin
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-24       Impact factor: 11.205

8.  Simple decision rules for classifying human cancers from gene expression profiles.

Authors:  Aik Choon Tan; Daniel Q Naiman; Lei Xu; Raimond L Winslow; Donald Geman
Journal:  Bioinformatics       Date:  2005-08-16       Impact factor: 6.937

9.  A boosting method for maximizing the partial area under the ROC curve.

Authors:  Osamu Komori; Shinto Eguchi
Journal:  BMC Bioinformatics       Date:  2010-06-10       Impact factor: 3.169

10.  MALDI profiling of human lung cancer subtypes.

Authors:  Angelo Gámez-Pozo; Iker Sánchez-Navarro; Manuel Nistal; Enrique Calvo; Rosario Madero; Esther Díaz; Emilio Camafeita; Javier de Castro; Juan Antonio López; Manuel González-Barón; Enrique Espinosa; Juan Angel Fresno Vara
Journal:  PLoS One       Date:  2009-11-05       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.