Literature DB >> 17460773

Feature selection and molecular classification of cancer using genetic programming.

Jianjun Yu1, Jindan Yu, Arpit A Almal, Saravana M Dhanasekaran, Debashis Ghosh, William P Worzel, Arul M Chinnaiyan.   

Abstract

Despite important advances in microarray-based molecular classification of tumors, its application in clinical settings remains formidable. This is in part due to the limitation of current analysis programs in discovering robust biomarkers and developing classifiers with a practical set of genes. Genetic programming (GP) is a type of machine learning technique that uses evolutionary algorithm to simulate natural selection as well as population dynamics, hence leading to simple and comprehensible classifiers. Here we applied GP to cancer expression profiling data to select feature genes and build molecular classifiers by mathematical integration of these genes. Analysis of thousands of GP classifiers generated for a prostate cancer data set revealed repetitive use of a set of highly discriminative feature genes, many of which are known to be disease associated. GP classifiers often comprise five or less genes and successfully predict cancer types and subtypes. More importantly, GP classifiers generated in one study are able to predict samples from an independent study, which may have used different microarray platforms. In addition, GP yielded classification accuracy better than or similar to conventional classification methods. Furthermore, the mathematical expression of GP classifiers provides insights into relationships between classifier genes. Taken together, our results demonstrate that GP may be valuable for generating effective classifiers containing a practical set of genes for diagnostic/prognostic cancer classification.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17460773      PMCID: PMC1854845          DOI: 10.1593/neo.07121

Source DB:  PubMed          Journal:  Neoplasia        ISSN: 1476-5586            Impact factor:   5.715


  28 in total

1.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning.

Authors:  Margaret A Shipp; Ken N Ross; Pablo Tamayo; Andrew P Weng; Jeffery L Kutok; Ricardo C T Aguiar; Michelle Gaasenbeek; Michael Angelo; Michael Reich; Geraldine S Pinkus; Tane S Ray; Margaret A Koval; Kim W Last; Andrew Norton; T Andrew Lister; Jill Mesirov; Donna S Neuberg; Eric S Lander; Jon C Aster; Todd R Golub
Journal:  Nat Med       Date:  2002-01       Impact factor: 53.440

2.  Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method.

Authors:  L Li; C R Weinberg; T A Darden; L G Pedersen
Journal:  Bioinformatics       Date:  2001-12       Impact factor: 6.937

3.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

4.  Gene-expression profiles in hereditary breast cancer.

Authors:  I Hedenfalk; D Duggan; Y Chen; M Radmacher; M Bittner; R Simon; P Meltzer; B Gusterson; M Esteller; O P Kallioniemi; B Wilfond; A Borg; J Trent; M Raffeld; Z Yakhini; A Ben-Dor; E Dougherty; J Kononen; L Bubendorf; W Fehrle; S Pittaluga; S Gruvberger; N Loman; O Johannsson; H Olsson; G Sauter
Journal:  N Engl J Med       Date:  2001-02-22       Impact factor: 91.245

5.  MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia.

Authors:  Scott A Armstrong; Jane E Staunton; Lewis B Silverman; Rob Pieters; Monique L den Boer; Mark D Minden; Stephen E Sallan; Eric S Lander; Todd R Golub; Stanley J Korsmeyer
Journal:  Nat Genet       Date:  2001-12-03       Impact factor: 38.330

6.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling.

Authors:  A A Alizadeh; M B Eisen; R E Davis; C Ma; I S Lossos; A Rosenwald; J C Boldrick; H Sabet; T Tran; X Yu; J I Powell; L Yang; G E Marti; T Moore; J Hudson; L Lu; D B Lewis; R Tibshirani; G Sherlock; W C Chan; T C Greiner; D D Weisenburger; J O Armitage; R Warnke; R Levy; W Wilson; M R Grever; J C Byrd; D Botstein; P O Brown; L M Staudt
Journal:  Nature       Date:  2000-02-03       Impact factor: 49.962

7.  Delineation of prognostic biomarkers in prostate cancer.

Authors:  S M Dhanasekaran; T R Barrette; D Ghosh; R Shah; S Varambally; K Kurachi; K J Pienta; M A Rubin; A M Chinnaiyan
Journal:  Nature       Date:  2001-08-23       Impact factor: 49.962

8.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

Authors:  T R Golub; D K Slonim; P Tamayo; C Huard; M Gaasenbeek; J P Mesirov; H Coller; M L Loh; J R Downing; M A Caligiuri; C D Bloomfield; E S Lander
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

9.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks.

Authors:  J Khan; J S Wei; M Ringnér; L H Saal; M Ladanyi; F Westermann; F Berthold; M Schwab; C R Antonescu; C Peterson; P S Meltzer
Journal:  Nat Med       Date:  2001-06       Impact factor: 53.440

10.  Symbolic discriminant analysis of microarray data in autoimmune disease.

Authors:  Jason H Moore; Joel S Parker; Nancy J Olsen; Thomas M Aune
Journal:  Genet Epidemiol       Date:  2002-06       Impact factor: 2.135

View more
  16 in total

Review 1.  Gene expression profiling in patients with carcinoma of unknown primary site: from translational research to standard of care.

Authors:  John D Hainsworth; F Anthony Greco
Journal:  Virchows Arch       Date:  2014-02-01       Impact factor: 4.064

2.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION.

Authors:  Patrick Breheny; Jian Huang
Journal:  Ann Appl Stat       Date:  2011-01-01       Impact factor: 2.083

3.  The War on Cancer rages on.

Authors:  Alnawaz Rehemtulla
Journal:  Neoplasia       Date:  2009-12       Impact factor: 5.715

4.  Germline polymorphisms in the one-carbon metabolism pathway and DNA methylation in colorectal cancer.

Authors:  Aditi Hazra; Charles S Fuchs; Takako Kawasaki; Gregory J Kirkner; David J Hunter; Shuji Ogino
Journal:  Cancer Causes Control       Date:  2010-03       Impact factor: 2.506

5.  Neoplasia: the second decade.

Authors:  Alnawaz Rehemtulla
Journal:  Neoplasia       Date:  2008-12       Impact factor: 5.715

Review 6.  Applications of genetic programming in cancer research.

Authors:  William P Worzel; Jianjun Yu; Arpit A Almal; Arul M Chinnaiyan
Journal:  Int J Biochem Cell Biol       Date:  2008-10-02       Impact factor: 5.085

7.  Generation and external validation of a tumor-derived 5-gene prognostic signature for recurrence of lymph node-negative, invasive colorectal carcinoma.

Authors:  Peter F Lenehan; Lisa A Boardman; Douglas Riegert-Johnson; Giovanni De Petris; David W Fry; Jeanne Ohrnberger; Eugene R Heyman; Brigitte Gerard; Arpit A Almal; William P Worzel
Journal:  Cancer       Date:  2012-05-17       Impact factor: 6.860

8.  Comparative genome analysis of a large Dutch Legionella pneumophila strain collection identifies five markers highly correlated with clinical strains.

Authors:  Ed Yzerman; Jeroen W den Boer; Martien Caspers; Arpit Almal; Bill Worzel; Walter van der Meer; Roy Montijn; Frank Schuren
Journal:  BMC Genomics       Date:  2010-07-15       Impact factor: 3.969

9.  A comparison of machine learning techniques for survival prediction in breast cancer.

Authors:  Leonardo Vanneschi; Antonella Farinaccio; Giancarlo Mauri; Mauro Antoniotti; Paolo Provero; Mario Giacobini
Journal:  BioData Min       Date:  2011-05-11       Impact factor: 2.522

10.  Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data.

Authors:  Amalia Annest; Roger E Bumgarner; Adrian E Raftery; Ka Yee Yeung
Journal:  BMC Bioinformatics       Date:  2009-02-26       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.