Literature DB >> 22125385

Random forest for gene selection and microarray data classification.

Kohbalan Moorthy1, Mohd Saberi Mohamad.   

Abstract

A random forest method has been selected to perform both gene selection and classification of the microarray data. In this embedded method, the selection of smallest possible sets of genes with lowest error rates is the key factor in achieving highest classification accuracy. Hence, improved gene selection method using random forest has been proposed to obtain the smallest subset of genes as well as biggest subset of genes prior to classification. The option for biggest subset selection is done to assist researchers who intend to use the informative genes for further research. Enhanced random forest gene selection has performed better in terms of selecting the smallest subset as well as biggest subset of informative genes with lowest out of bag error rates through gene selection. Furthermore, the classification performed on the selected subset of genes using random forest has lead to lower prediction error rates compared to existing method and other similar available methods.

Entities:  

Keywords:  Random forest; cancer classification; classification; gene expression data; gene selection; microarray data

Year:  2011        PMID: 22125385      PMCID: PMC3218317          DOI: 10.6026/97320630007142

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


  11 in total

1.  A molecular signature of metastasis in primary solid tumors.

Authors:  Sridhar Ramaswamy; Ken N Ross; Eric S Lander; Todd R Golub
Journal:  Nat Genet       Date:  2002-12-09       Impact factor: 38.330

2.  A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression.

Authors:  Tao Li; Chengliang Zhang; Mitsunori Ogihara
Journal:  Bioinformatics       Date:  2004-04-15       Impact factor: 6.937

3.  Systematic variation in gene expression patterns in human cancer cell lines.

Authors:  D T Ross; U Scherf; M B Eisen; C M Perou; C Rees; P Spellman; V Iyer; S S Jeffrey; M Van de Rijn; M Waltham; A Pergamenschikov; J C Lee; D Lashkari; D Shalon; T G Myers; J N Weinstein; D Botstein; P O Brown
Journal:  Nat Genet       Date:  2000-03       Impact factor: 38.330

4.  Prediction of central nervous system embryonal tumour outcome based on gene expression.

Authors:  Scott L Pomeroy; Pablo Tamayo; Michelle Gaasenbeek; Lisa M Sturla; Michael Angelo; Margaret E McLaughlin; John Y H Kim; Liliana C Goumnerova; Peter M Black; Ching Lau; Jeffrey C Allen; David Zagzag; James M Olson; Tom Curran; Cynthia Wetmore; Jaclyn A Biegel; Tomaso Poggio; Shayan Mukherjee; Ryan Rifkin; Andrea Califano; Gustavo Stolovitzky; David N Louis; Jill P Mesirov; Eric S Lander; Todd R Golub
Journal:  Nature       Date:  2002-01-24       Impact factor: 49.962

5.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

6.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.

Authors:  U Alon; N Barkai; D A Notterman; K Gish; S Ybarra; D Mack; A J Levine
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-08       Impact factor: 11.205

7.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling.

Authors:  A A Alizadeh; M B Eisen; R E Davis; C Ma; I S Lossos; A Rosenwald; J C Boldrick; H Sabet; T Tran; X Yu; J I Powell; L Yang; G E Marti; T Moore; J Hudson; L Lu; D B Lewis; R Tibshirani; G Sherlock; W C Chan; T C Greiner; D D Weisenburger; J O Armitage; R Warnke; R Levy; W Wilson; M R Grever; J C Byrd; D Botstein; P O Brown; L M Staudt
Journal:  Nature       Date:  2000-02-03       Impact factor: 49.962

8.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks.

Authors:  J Khan; J S Wei; M Ringnér; L H Saal; M Ladanyi; F Westermann; F Berthold; M Schwab; C R Antonescu; C Peterson; P S Meltzer
Journal:  Nat Med       Date:  2001-06       Impact factor: 53.440

9.  Gene expression correlates of clinical prostate cancer behavior.

Authors:  Dinesh Singh; Phillip G Febbo; Kenneth Ross; Donald G Jackson; Judith Manola; Christine Ladd; Pablo Tamayo; Andrew A Renshaw; Anthony V D'Amico; Jerome P Richie; Eric S Lander; Massimo Loda; Philip W Kantoff; Todd R Golub; William R Sellers
Journal:  Cancer Cell       Date:  2002-03       Impact factor: 31.743

10.  Gene selection and classification of microarray data using random forest.

Authors:  Ramón Díaz-Uriarte; Sara Alvarez de Andrés
Journal:  BMC Bioinformatics       Date:  2006-01-06       Impact factor: 3.169

View more
  8 in total

1.  Side effect prediction based on drug-induced gene expression profiles and random forest with iterative feature selection.

Authors:  Arzu Cakir; Melisa Tuncer; Hilal Taymaz-Nikerel; Ozlem Ulucan
Journal:  Pharmacogenomics J       Date:  2021-06-21       Impact factor: 3.550

2.  Application of unsupervised analysis techniques to lung cancer patient data.

Authors:  Chip M Lynch; Victor H van Berkel; Hermann B Frieboes
Journal:  PLoS One       Date:  2017-09-14       Impact factor: 3.240

3.  Combination of CRP and NLR: a better predictor of postoperative survival in patients with gastric cancer.

Authors:  Jing Guo; Shangxiang Chen; Yongming Chen; Shun Li; Dazhi Xu
Journal:  Cancer Manag Res       Date:  2018-02-14       Impact factor: 3.989

4.  Feature Extraction and Classification on Esophageal X-Ray Images of Xinjiang Kazak Nationality.

Authors:  Fang Yang; Murat Hamit; Chuan B Yan; Juan Yao; Abdugheni Kutluk; Xi M Kong; Sui X Zhang
Journal:  J Healthc Eng       Date:  2017-04-04       Impact factor: 2.682

5.  DETECT I & DETECT II: a study protocol for a prospective multicentre observational study to validate the UroMark assay for the detection of bladder cancer from urinary cells.

Authors:  Wei Shen Tan; Andrew Feber; Liqin Dong; Rachael Sarpong; Sheida Rezaee; Simon Rodney; Pramit Khetrapal; Patricia de Winter; Frelyn Ocampo; Rumana Jalil; Norman R Williams; Chris Brew-Graves; John D Kelly
Journal:  BMC Cancer       Date:  2017-11-15       Impact factor: 4.430

6.  Predicting prognosis of endometrioid endometrial adenocarcinoma on the basis of gene expression and clinical features using Random Forest.

Authors:  Fufen Yin; Xingyang Shao; Lijun Zhao; Xiaoping Li; Jingyi Zhou; Yuan Cheng; Xiangjun He; Shu Lei; Jiangeng Li; Jianliu Wang
Journal:  Oncol Lett       Date:  2019-06-20       Impact factor: 2.967

7.  Development and Validation of a New Multiparametric Random Survival Forest Predictive Model for Breast Cancer Recurrence with a Potential Benefit to Individual Outcomes.

Authors:  Huan Li; Ren-Bin Liu; Chen-Meng Long; Yuan Teng; Lin Cheng; Yu Liu
Journal:  Cancer Manag Res       Date:  2022-03-01       Impact factor: 3.989

8.  Early Transcriptome Signatures from Immunized Mouse Dendritic Cells Predict Late Vaccine-Induced T-Cell Responses.

Authors:  Nicolas Dérian; Bertrand Bellier; Hang Phuong Pham; Eliza Tsitoura; Dorothea Kazazi; Christophe Huret; Penelope Mavromara; David Klatzmann; Adrien Six
Journal:  PLoS Comput Biol       Date:  2016-03-21       Impact factor: 4.475

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.