Literature DB >> 34524425

A cross-study analysis of drug response prediction in cancer cell lines.

Fangfang Xia1, Jonathan Allen2, Prasanna Balaprakash1, Thomas Brettin1, Cristina Garcia-Cardona3, Austin Clyde1,4, Judith Cohn3, James Doroshow5, Xiaotian Duan4, Veronika Dubinkina6, Yvonne Evrard7, Ya Ju Fan2, Jason Gans3, Stewart He2, Pinyi Lu7, Sergei Maslov6, Alexander Partin1, Maulik Shukla1, Eric Stahlberg7, Justin M Wozniak1, Hyunseung Yoo1, George Zaki7, Yitan Zhu1, Rick Stevens1,4.   

Abstract

To enable personalized cancer treatment, machine learning models have been developed to predict drug response as a function of tumor and drug features. However, most algorithm development efforts have relied on cross-validation within a single study to assess model accuracy. While an essential first step, cross-validation within a biological data set typically provides an overly optimistic estimate of the prediction performance on independent test sets. To provide a more rigorous assessment of model generalizability between different studies, we use machine learning to analyze five publicly available cell line-based data sets: National Cancer Institute 60, ancer Therapeutics Response Portal (CTRP), Genomics of Drug Sensitivity in Cancer, Cancer Cell Line Encyclopedia and Genentech Cell Line Screening Initiative (gCSI). Based on observed experimental variability across studies, we explore estimates of prediction upper bounds. We report performance results of a variety of machine learning models, with a multitasking deep neural network achieving the best cross-study generalizability. By multiple measures, models trained on CTRP yield the most accurate predictions on the remaining testing data, and gCSI is the most predictable among the cell line data sets included in this study. With these experiments and further simulations on partial data, two lessons emerge: (1) differences in viability assays can limit model generalizability across studies and (2) drug diversity, more than tumor diversity, is crucial for raising model generalizability in preclinical screening.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Keywords:  deep learning; drug response prediction; drug sensitivity; precision oncology

Mesh:

Year:  2022        PMID: 34524425      PMCID: PMC8769697          DOI: 10.1093/bib/bbab356

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  58 in total

Review 1.  A review on machine learning principles for multi-view biological data integration.

Authors:  Yifeng Li; Fang-Xiang Wu; Alioune Ngom
Journal:  Brief Bioinform       Date:  2018-03-01       Impact factor: 11.622

2.  Consistency in drug response profiling.

Authors:  John Patrick Mpindi; Bhagwan Yadav; Päivi Östling; Prson Gautam; Disha Malani; Astrid Murumägi; Akira Hirasawa; Sara Kangaspeska; Krister Wennerberg; Olli Kallioniemi; Tero Aittokallio
Journal:  Nature       Date:  2016-11-30       Impact factor: 49.962

3.  Inconsistency in large pharmacogenomic studies.

Authors:  Benjamin Haibe-Kains; Nehme El-Hachem; Nicolai Juul Birkbak; Andrew C Jin; Andrew H Beck; Hugo J W L Aerts; John Quackenbush
Journal:  Nature       Date:  2013-11-27       Impact factor: 49.962

4.  Effect of normalization methods on the performance of supervised learning algorithms applied to HTSeq-FPKM-UQ data sets: 7SK RNA expression as a predictor of survival in patients with colon adenocarcinoma.

Authors:  Leili Shahriyari
Journal:  Brief Bioinform       Date:  2019-05-21       Impact factor: 11.622

5.  Deep learning for drug response prediction in cancer.

Authors:  Delora Baptista; Pedro G Ferreira; Miguel Rocha
Journal:  Brief Bioinform       Date:  2021-01-18       Impact factor: 11.622

6.  BindingDB in 2015: A public database for medicinal chemistry, computational chemistry and systems pharmacology.

Authors:  Michael K Gilson; Tiqing Liu; Michael Baitaluk; George Nicola; Linda Hwang; Jenny Chong
Journal:  Nucleic Acids Res       Date:  2015-10-19       Impact factor: 16.971

7.  A novel heterogeneous network-based method for drug response prediction in cancer cell lines.

Authors:  Fei Zhang; Minghui Wang; Jianing Xi; Jianghong Yang; Ao Li
Journal:  Sci Rep       Date:  2018-02-20       Impact factor: 4.379

8.  The cornucopia of meaningful leads: Applying deep adversarial autoencoders for new molecule development in oncology.

Authors:  Artur Kadurin; Alexander Aliper; Andrey Kazennov; Polina Mamoshina; Quentin Vanhaelen; Kuzma Khrabrov; Alex Zhavoronkov
Journal:  Oncotarget       Date:  2017-02-14

9.  Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen.

Authors:  Michael P Menden; Dennis Wang; Mike J Mason; Bence Szalai; Krishna C Bulusu; Yuanfang Guan; Thomas Yu; Jaewoo Kang; Minji Jeon; Russ Wolfinger; Tin Nguyen; Mikhail Zaslavskiy; In Sock Jang; Zara Ghazoui; Mehmet Eren Ahsen; Robert Vogel; Elias Chaibub Neto; Thea Norman; Eric K Y Tang; Mathew J Garnett; Giovanni Y Di Veroli; Stephen Fawell; Gustavo Stolovitzky; Justin Guinney; Jonathan R Dry; Julio Saez-Rodriguez
Journal:  Nat Commun       Date:  2019-06-17       Impact factor: 14.919

10.  Drug ranking using machine learning systematically predicts the efficacy of anti-cancer drugs.

Authors:  Henry Gerdes; Pedro Casado; Arran Dokal; Maruan Hijazi; Nosheen Akhtar; Ruth Osuntola; Vinothini Rajeeve; Jude Fitzgibbon; Jon Travers; David Britton; Shirin Khorsandi; Pedro R Cutillas
Journal:  Nat Commun       Date:  2021-03-25       Impact factor: 14.919

View more
  3 in total

Review 1.  Drug sensitivity prediction from cell line-based pharmacogenomics data: guidelines for developing machine learning models.

Authors:  Hossein Sharifi-Noghabi; Soheil Jahangiri-Tazehkand; Petr Smirnov; Casey Hon; Anthony Mammoliti; Sisira Kadambat Nair; Arvind Singh Mer; Martin Ester; Benjamin Haibe-Kains
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 13.994

2.  PharmacoDB 2.0: improving scalability and transparency of in vitro pharmacogenomics analysis.

Authors:  Nikta Feizi; Sisira Kadambat Nair; Petr Smirnov; Gangesh Beri; Christopher Eeles; Parinaz Nasr Esfahani; Minoru Nakano; Denis Tkachuk; Anthony Mammoliti; Evgeniya Gorobets; Arvind Singh Mer; Eva Lin; Yihong Yu; Scott Martin; Marc Hafner; Benjamin Haibe-Kains
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

3.  HPTMT Parallel Operators for High Performance Data Science and Data Engineering.

Authors:  Vibhatha Abeykoon; Supun Kamburugamuve; Chathura Widanage; Niranda Perera; Ahmet Uyar; Thejaka Amila Kanewala; Gregor von Laszewski; Geoffrey Fox
Journal:  Front Big Data       Date:  2022-02-07
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.