Literature DB >> 15867433

Classification of a large microarray data set: algorithm comparison and analysis of drug signatures.

Georges Natsoulis1, Laurent El Ghaoui, Gert R G Lanckriet, Alexander M Tolley, Fabrice Leroy, Shane Dunlea, Barrett P Eynon, Cecelia I Pearson, Stuart Tugendreich, Kurt Jarnagin.   

Abstract

A large gene expression database has been produced that characterizes the gene expression and physiological effects of hundreds of approved and withdrawn drugs, toxicants, and biochemical standards in various organs of live rats. In order to derive useful biological knowledge from this large database, a variety of supervised classification algorithms were compared using a 597-microarray subset of the data. Our studies show that several types of linear classifiers based on Support Vector Machines (SVMs) and Logistic Regression can be used to derive readily interpretable drug signatures with high classification performance. Both methods can be tuned to produce classifiers of drug treatments in the form of short, weighted gene lists which upon analysis reveal that some of the signature genes have a positive contribution (act as "rewards" for the class-of-interest) while others have a negative contribution (act as "penalties") to the classification decision. The combination of reward and penalty genes enhances performance by keeping the number of false positive treatments low. The results of these algorithms are combined with feature selection techniques that further reduce the length of the drug signatures, an important step towards the development of useful diagnostic biomarkers and low-cost assays. Multiple signatures with no genes in common can be generated for the same classification end-point. Comparison of these gene lists identifies biological processes characteristic of a given class.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15867433      PMCID: PMC1088301          DOI: 10.1101/gr.2807605

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  19 in total

1.  Microarray analysis of hepatotoxins in vitro reveals a correlation between gene expression profiles and mechanisms of toxicity.

Authors:  J F Waring; R Ciurlionis; R A Jolly; M Heindel; R G Ulrich
Journal:  Toxicol Lett       Date:  2001-03-31       Impact factor: 4.372

Review 2.  Roles of PPARs in health and disease.

Authors:  S Kersten; B Desvergne; W Wahli
Journal:  Nature       Date:  2000-05-25       Impact factor: 49.962

3.  Identification of toxicologically predictive gene sets using cDNA microarrays.

Authors:  R S Thomas; D R Rank; S G Penn; G M Zastrow; K R Hayes; K Pande; E Glover; T Silander; M W Craven; J K Reddy; S B Jovanovich; C A Bradfield
Journal:  Mol Pharmacol       Date:  2001-12       Impact factor: 4.436

4.  Centralization: a new method for the normalization of gene expression data.

Authors:  A Zien; T Aigner; R Zimmer; T Lengauer
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

5.  Biomarker identification by feature wrappers.

Authors:  M Xiong; X Fang; J Zhao
Journal:  Genome Res       Date:  2001-11       Impact factor: 9.043

6.  A highly reproducible, linear, and automated sample preparation method for DNA microarrays.

Authors:  David R Dorris; Ramesh Ramakrishnan; Dionisios Trakas; Frank Dudzik; Richard Belval; Connie Zhao; Allen Nguyen; Marc Domanus; Abhijit Mazumder
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

7.  Prediction of clinical drug efficacy by classification of drug-induced genomic expression profiles in vitro.

Authors:  Erik C Gunther; David J Stone; Robert W Gerwien; Patricia Bento; Melvyn P Heyes
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-17       Impact factor: 11.205

8.  Development of a large-scale chemogenomics database to improve drug candidate selection and to understand mechanisms of chemical toxicity and action.

Authors:  Brigitte Ganter; Stuart Tugendreich; Cecelia I Pearson; Eser Ayanoglu; Susanne Baumhueter; Keith A Bostian; Lindsay Brady; Leslie J Browne; John T Calvin; Gwo-Jen Day; Naiomi Breckenridge; Shane Dunlea; Barrett P Eynon; L Mike Furness; Joe Ferng; Mark R Fielden; Susan Y Fujimoto; Li Gong; Christopher Hu; Radha Idury; Michael S B Judo; Kyle L Kolaja; May D Lee; Christopher McSorley; James M Minor; Ramesh V Nair; Georges Natsoulis; Peter Nguyen; Simone M Nicholson; Hang Pham; Alan H Roter; Dongxu Sun; Siqi Tan; Silke Thode; Alexander M Tolley; Antoaneta Vladimirova; Jian Yang; Zhiming Zhou; Kurt Jarnagin
Journal:  J Biotechnol       Date:  2005-09-29       Impact factor: 3.307

9.  Multiclass cancer diagnosis using tumor gene expression signatures.

Authors:  S Ramaswamy; P Tamayo; R Rifkin; S Mukherjee; C H Yeang; M Angelo; C Ladd; M Reich; E Latulippe; J P Mesirov; T Poggio; W Gerald; M Loda; E S Lander; T R Golub
Journal:  Proc Natl Acad Sci U S A       Date:  2001-12-11       Impact factor: 11.205

10.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

Authors:  T R Golub; D K Slonim; P Tamayo; C Huard; M Gaasenbeek; J P Mesirov; H Coller; M L Loh; J R Downing; M A Caligiuri; C D Bloomfield; E S Lander
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

View more
  24 in total

1.  A General Framework for Fast Co-clustering on Large Datasets Using Matrix Decomposition.

Authors:  Feng Pan; Xiang Zhang; Wei Wang
Journal:  Proc ACM SIGMOD Int Conf Manag Data       Date:  2008-04-25

2.  Simultaneous class discovery and classification of microarray data using spectral analysis.

Authors:  Peng Qiu; Sylvia K Plevritis
Journal:  J Comput Biol       Date:  2009-07       Impact factor: 1.479

3.  Predicting gene targets of perturbations via network-based filtering of mRNA expression compendia.

Authors:  Elissa J Cosgrove; Yingchun Zhou; Timothy S Gardner; Eric D Kolaczyk
Journal:  Bioinformatics       Date:  2008-09-08       Impact factor: 6.937

4.  Multiple-rule bias in the comparison of classification rules.

Authors:  Mohammadmahdi R Yousefi; Jianping Hua; Edward R Dougherty
Journal:  Bioinformatics       Date:  2011-05-05       Impact factor: 6.937

5.  Performance reproducibility index for classification.

Authors:  Mohammadmahdi R Yousefi; Edward R Dougherty
Journal:  Bioinformatics       Date:  2012-09-06       Impact factor: 6.937

6.  CRD: Fast Co-clustering on Large Datasets Utilizing Sampling-Based Matrix Decomposition.

Authors:  Feng Pan; Xiang Zhang; Wei Wang
Journal:  Proc Int Conf Data Eng       Date:  2008-04-07

7.  Analysis and computational dissection of molecular signature multiplicity.

Authors:  Alexander Statnikov; Constantin F Aliferis
Journal:  PLoS Comput Biol       Date:  2010-05-20       Impact factor: 4.475

8.  Indirect two-sided relative ranking: a robust similarity measure for gene expression data.

Authors:  Louis Licamele; Lise Getoor
Journal:  BMC Bioinformatics       Date:  2010-03-17       Impact factor: 3.169

9.  Functional analysis of multiple genomic signatures demonstrates that classification algorithms choose phenotype-related genes.

Authors:  W Shi; M Bessarabova; D Dosymbekov; Z Dezso; T Nikolskaya; M Dudoladova; T Serebryiskaya; A Bugrim; A Guryanov; R J Brennan; R Shah; J Dopazo; M Chen; Y Deng; T Shi; G Jurman; C Furlanello; R S Thomas; J C Corton; W Tong; L Shi; Y Nikolsky
Journal:  Pharmacogenomics J       Date:  2010-08       Impact factor: 3.550

10.  Algorithms for Discovery of Multiple Markov Boundaries.

Authors:  Alexander Statnikov; Nikita I Lytkin; Jan Lemeire; Constantin F Aliferis
Journal:  J Mach Learn Res       Date:  2013-02       Impact factor: 3.654

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.