Literature DB >> 15130820

Ensemble machine learning on gene expression data for cancer classification.

Aik Choon Tan1, David Gilbert.   

Abstract

Whole genome RNA expression studies permit systematic approaches to understanding the correlation between gene expression profiles to disease states or different developmental stages of a cell. Microarray analysis provides quantitative information about the complete transcription profile of cells that facilitate drug and therapeutics development, disease diagnosis, and understanding in the basic cell biology. One of the challenges in microarray analysis, especially in cancerous gene expression profiles, is to identify genes or groups of genes that are highly expressed in tumour cells but not in normal cells and vice versa. Previously, we have shown that ensemble machine learning consistently performs well in classifying biological data. In this paper, we focus on three different supervised machine learning techniques in cancer classification, namely C4.5 decision tree, and bagged and boosted decision trees. We have performed classification tasks on seven publicly available cancerous microarray data and compared the classification/prediction performance of these methods. We have observed that ensemble learning (bagged and boosted decision trees) often performs better than single decision trees in this classification task.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 15130820

Source DB:  PubMed          Journal:  Appl Bioinformatics        ISSN: 1175-5636


  58 in total

1.  Identification of Marker Genes for Cancer Based on Microarrays Using a Computational Biology Approach.

Authors:  Xiaosheng Wang
Journal:  Curr Bioinform       Date:  2014-04-01       Impact factor: 3.543

2.  Design of a multi-signature ensemble classifier predicting neuroblastoma patients' outcome.

Authors:  Andrea Cornero; Massimo Acquaviva; Paolo Fardin; Rogier Versteeg; Alexander Schramm; Alessandra Eva; Maria Carla Bosco; Fabiola Blengio; Sara Barzaghi; Luigi Varesio
Journal:  BMC Bioinformatics       Date:  2012-03-28       Impact factor: 3.169

3.  Investigating the efficacy of nonlinear dimensionality reduction schemes in classifying gene and protein expression studies.

Authors:  George Lee; Carlos Rodriguez; Anant Madabhushi
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2008 Jul-Sep       Impact factor: 3.710

4.  A classification framework applied to cancer gene expression profiles.

Authors:  Hussein Hijazi; Christina Chan
Journal:  J Healthc Eng       Date:  2013       Impact factor: 2.682

5.  Global gene expression profiling of dimethylnitrosamine-induced liver fibrosis: from pathological and biochemical data to microarray analysis.

Authors:  Li-Jen Su; Shih-Lan Hsu; Jyh-Shyue Yang; Huei-Hun Tseng; Shiu-Feng Huang; Chi-Ying F Huang
Journal:  Gene Expr       Date:  2006

6.  Simple decision rules for classifying human cancers from gene expression profiles.

Authors:  Aik Choon Tan; Daniel Q Naiman; Lei Xu; Raimond L Winslow; Donald Geman
Journal:  Bioinformatics       Date:  2005-08-16       Impact factor: 6.937

7.  Accurate molecular classification of cancer using simple rules.

Authors:  Xiaosheng Wang; Osamu Gotoh
Journal:  BMC Med Genomics       Date:  2009-10-30       Impact factor: 3.063

8.  Novel methods to identify biologically relevant genes for leukemia and prostate cancer from gene expression profiles.

Authors:  Austin H Chen; Yin-Wu Tsau; Ching-Heng Lin
Journal:  BMC Genomics       Date:  2010-04-30       Impact factor: 3.969

9.  Microarray-based cancer prediction using soft computing approach.

Authors:  Xiaosheng Wang; Osamu Gotoh
Journal:  Cancer Inform       Date:  2009-05-26

10.  ArrayMining: a modular web-application for microarray analysis combining ensemble and consensus methods with cross-study normalization.

Authors:  Enrico Glaab; Jonathan M Garibaldi; Natalio Krasnogor
Journal:  BMC Bioinformatics       Date:  2009-10-28       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.