Amrita Basu1, Ritwik Mitra2, Han Liu2, Stuart L Schreiber1, Paul A Clemons1. 1. Chemical Biology & Therapeutics Science Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA. 2. Operational Research and Financial Engineering, Princeton University, Princeton, NJ, USA.
Abstract
Motivation: In recent years there have been several efforts to generate sensitivity profiles of collections of genomically characterized cell lines to panels of candidate therapeutic compounds. These data provide the basis for the development of in silico models of sensitivity based on cellular, genetic, or expression biomarkers of cancer cells. However, a remaining challenge is an efficient way to identify accurate sets of biomarkers to validate. To address this challenge, we developed methodology using gene-expression profiles of human cancer cell lines to predict the responses of these cell lines to a panel of compounds. Results: We developed an iterative weighting scheme which, when applied to elastic net, a regularized regression method, significantly improves the overall accuracy of predictions, particularly in the highly sensitive response region. In addition to application of these methods to actual chemical sensitivity data, we investigated the effects of sample size, number of features, model sparsity, signal-to-noise ratio, and feature correlation on predictive performance using a simulation framework, particularly for situations where the number of covariates is much larger than sample size. While our method aims to be useful in therapeutic discovery and understanding of the basic mechanisms of action of drugs and their targets, it is generally applicable in any domain where predictions of extreme responses are of highest importance. Availability and implementation: The iterative and other weighting algorithms were implemented in R. The code is available at https://github.com/kiwtir/RWEN. The CTRP data are available at ftp://caftpd.nci.nih.gov/pub/OCG-DCC/CTD2/Broad/CTRPv2.1_2016_pub_NatChemBiol_12_109/ and the Sanger data at ftp://ftp.sanger.ac.uk/pub/project/cancerrxgene/releases/release-6.0/. Supplementary information: Supplementary data are available at Bioinformatics online.
Motivation: In recent years there have been several efforts to generate sensitivity profiles of collections of genomically characterized cell lines to panels of candidate therapeutic compounds. These data provide the basis for the development of in silico models of sensitivity based on cellular, genetic, or expression biomarkers of cancer cells. However, a remaining challenge is an efficient way to identify accurate sets of biomarkers to validate. To address this challenge, we developed methodology using gene-expression profiles of humancancer cell lines to predict the responses of these cell lines to a panel of compounds. Results: We developed an iterative weighting scheme which, when applied to elastic net, a regularized regression method, significantly improves the overall accuracy of predictions, particularly in the highly sensitive response region. In addition to application of these methods to actual chemical sensitivity data, we investigated the effects of sample size, number of features, model sparsity, signal-to-noise ratio, and feature correlation on predictive performance using a simulation framework, particularly for situations where the number of covariates is much larger than sample size. While our method aims to be useful in therapeutic discovery and understanding of the basic mechanisms of action of drugs and their targets, it is generally applicable in any domain where predictions of extreme responses are of highest importance. Availability and implementation: The iterative and other weighting algorithms were implemented in R. The code is available at https://github.com/kiwtir/RWEN. The CTRP data are available at ftp://caftpd.nci.nih.gov/pub/OCG-DCC/CTD2/Broad/CTRPv2.1_2016_pub_NatChemBiol_12_109/ and the Sanger data at ftp://ftp.sanger.ac.uk/pub/project/cancerrxgene/releases/release-6.0/. Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: Gregory Riddick; Hua Song; Susie Ahn; Jennifer Walling; Diego Borges-Rivera; Wei Zhang; Howard A Fine Journal: Bioinformatics Date: 2010-12-05 Impact factor: 6.937
Authors: Brinton Seashore-Ludlow; Matthew G Rees; Jaime H Cheah; Murat Cokol; Edmund V Price; Matthew E Coletti; Victor Jones; Nicole E Bodycombe; Christian K Soule; Joshua Gould; Benjamin Alexander; Ava Li; Philip Montgomery; Mathias J Wawer; Nurdan Kuru; Joanne D Kotz; C Suk-Yee Hon; Benito Munoz; Ted Liefeld; Vlado Dančík; Joshua A Bittker; Michelle Palmer; James E Bradner; Alykhan F Shamji; Paul A Clemons; Stuart L Schreiber Journal: Cancer Discov Date: 2015-10-19 Impact factor: 39.397
Authors: Francesco Iorio; Theo A Knijnenburg; Daniel J Vis; Graham R Bignell; Michael P Menden; Michael Schubert; Nanne Aben; Emanuel Gonçalves; Syd Barthorpe; Howard Lightfoot; Thomas Cokelaer; Patricia Greninger; Ewald van Dyk; Han Chang; Heshani de Silva; Holger Heyn; Xianming Deng; Regina K Egan; Qingsong Liu; Tatiana Mironenko; Xeni Mitropoulos; Laura Richardson; Jinhua Wang; Tinghu Zhang; Sebastian Moran; Sergi Sayols; Maryam Soleimani; David Tamborero; Nuria Lopez-Bigas; Petra Ross-Macdonald; Manel Esteller; Nathanael S Gray; Daniel A Haber; Michael R Stratton; Cyril H Benes; Lodewyk F A Wessels; Julio Saez-Rodriguez; Ultan McDermott; Mathew J Garnett Journal: Cell Date: 2016-07-07 Impact factor: 41.582
Authors: Artem Sokolov; Daniel E Carlin; Evan O Paull; Robert Baertsch; Joshua M Stuart Journal: PLoS Comput Biol Date: 2016-03-09 Impact factor: 4.475
Authors: Wanjuan Yang; Jorge Soares; Patricia Greninger; Elena J Edelman; Howard Lightfoot; Simon Forbes; Nidhi Bindal; Dave Beare; James A Smith; I Richard Thompson; Sridhar Ramaswamy; P Andrew Futreal; Daniel A Haber; Michael R Stratton; Cyril Benes; Ultan McDermott; Mathew J Garnett Journal: Nucleic Acids Res Date: 2012-11-23 Impact factor: 16.971
Authors: Matthew G Rees; Brinton Seashore-Ludlow; Jaime H Cheah; Drew J Adams; Edmund V Price; Shubhroz Gill; Sarah Javaid; Matthew E Coletti; Victor L Jones; Nicole E Bodycombe; Christian K Soule; Benjamin Alexander; Ava Li; Philip Montgomery; Joanne D Kotz; C Suk-Yee Hon; Benito Munoz; Ted Liefeld; Vlado Dančík; Daniel A Haber; Clary B Clish; Joshua A Bittker; Michelle Palmer; Bridget K Wagner; Paul A Clemons; Alykhan F Shamji; Stuart L Schreiber Journal: Nat Chem Biol Date: 2015-12-14 Impact factor: 15.040
Authors: Nicholas O'Grady; David L Gibbs; Kawther Abdilleh; Adam Asare; Smita Asare; Sara Venters; Lamorna Brown-Swigart; Gillian L Hirst; Denise Wolf; Christina Yau; Laura J van 't Veer; Laura Esserman; Amrita Basu Journal: JAMIA Open Date: 2021-06-03