Khandakar Tanvir Ahmed1, Sunho Park2, Qibing Jiang1, Yunku Yeu2, TaeHyun Hwang2, Wei Zhang3. 1. Department of Computer Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL, 32816, USA. 2. Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, 9211 Euclid Ave, Cleveland, OH, 44106, USA. 3. Department of Computer Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL, 32816, USA. wzhang.cs@ucf.edu.
Abstract
BACKGROUND: Drug sensitivity prediction and drug responsive biomarker selection on high-throughput genomic data is a critical step in drug discovery. Many computational methods have been developed to serve this purpose including several deep neural network models. However, the modular relations among genomic features have been largely ignored in these methods. To overcome this limitation, the role of the gene co-expression network on drug sensitivity prediction is investigated in this study. METHODS: In this paper, we first introduce a network-based method to identify representative features for drug response prediction by using the gene co-expression network. Then, two graph-based neural network models are proposed and both models integrate gene network information directly into neural network for outcome prediction. Next, we present a large-scale comparative study among the proposed network-based methods, canonical prediction algorithms (i.e., Elastic Net, Random Forest, Partial Least Squares Regression, and Support Vector Regression), and deep neural network models for drug sensitivity prediction. All the source code and processed datasets in this study are available at https://github.com/compbiolabucf/drug-sensitivity-prediction . RESULTS: In the comparison of different feature selection methods and prediction methods on a non-small cell lung cancer (NSCLC) cell line RNA-seq gene expression dataset with 50 different drug treatments, we found that (1) the network-based feature selection method improves the prediction performance compared to Pearson correlation coefficients; (2) Random Forest outperforms all the other canonical prediction algorithms and deep neural network models; (3) the proposed graph-based neural network models show better prediction performance compared to deep neural network model; (4) the prediction performance is drug dependent and it may relate to the drug's mechanism of action. CONCLUSIONS: Network-based feature selection method and prediction models improve the performance of the drug response prediction. The relations between the genomic features are more robust and stable compared to the correlation between each individual genomic feature and the drug response in high dimension and low sample size genomic datasets.
BACKGROUND: Drug sensitivity prediction and drug responsive biomarker selection on high-throughput genomic data is a critical step in drug discovery. Many computational methods have been developed to serve this purpose including several deep neural network models. However, the modular relations among genomic features have been largely ignored in these methods. To overcome this limitation, the role of the gene co-expression network on drug sensitivity prediction is investigated in this study. METHODS: In this paper, we first introduce a network-based method to identify representative features for drug response prediction by using the gene co-expression network. Then, two graph-based neural network models are proposed and both models integrate gene network information directly into neural network for outcome prediction. Next, we present a large-scale comparative study among the proposed network-based methods, canonical prediction algorithms (i.e., Elastic Net, Random Forest, Partial Least Squares Regression, and Support Vector Regression), and deep neural network models for drug sensitivity prediction. All the source code and processed datasets in this study are available at https://github.com/compbiolabucf/drug-sensitivity-prediction . RESULTS: In the comparison of different feature selection methods and prediction methods on a non-small cell lung cancer (NSCLC) cell line RNA-seq gene expression dataset with 50 different drug treatments, we found that (1) the network-based feature selection method improves the prediction performance compared to Pearson correlation coefficients; (2) Random Forest outperforms all the other canonical prediction algorithms and deep neural network models; (3) the proposed graph-based neural network models show better prediction performance compared to deep neural network model; (4) the prediction performance is drug dependent and it may relate to the drug's mechanism of action. CONCLUSIONS: Network-based feature selection method and prediction models improve the performance of the drug response prediction. The relations between the genomic features are more robust and stable compared to the correlation between each individual genomic feature and the drug response in high dimension and low sample size genomic datasets.
Authors: Elizabeth A McMillan; Myung-Jeom Ryu; Caroline H Diep; Saurabh Mendiratta; Jean R Clemenceau; Rachel M Vaden; Ju-Hwa Kim; Takashi Motoyaji; Kyle R Covington; Michael Peyton; Kenneth Huffman; Xiaofeng Wu; Luc Girard; Yeojin Sung; Pei-Hsaun Chen; Prema L Mallipeddi; Joo Young Lee; Jordan Hanson; Sukesh Voruganti; Yunku Yu; Sunho Park; Jessica Sudderth; Christopher DeSevo; Donna M Muzny; HarshaVardhan Doddapaneni; Adi Gazdar; Richard A Gibbs; Tae-Hyun Hwang; John V Heymach; Ignacio Wistuba; Kevin R Coombes; Noelle S Williams; David A Wheeler; John B MacMillan; Ralph J Deberardinis; Michael G Roth; Bruce A Posner; John D Minna; Hyun Seok Kim; Michael A White Journal: Cell Date: 2018-04-19 Impact factor: 41.582
Authors: Kristina Preuer; Richard P I Lewis; Sepp Hochreiter; Andreas Bender; Krishna C Bulusu; Günter Klambauer Journal: Bioinformatics Date: 2018-05-01 Impact factor: 6.937
Authors: Fangfang Xia; Maulik Shukla; Thomas Brettin; Cristina Garcia-Cardona; Judith Cohn; Jonathan E Allen; Sergei Maslov; Susan L Holbeck; James H Doroshow; Yvonne A Evrard; Eric A Stahlberg; Rick L Stevens Journal: BMC Bioinformatics Date: 2018-12-21 Impact factor: 3.169
Authors: Wanjuan Yang; Jorge Soares; Patricia Greninger; Elena J Edelman; Howard Lightfoot; Simon Forbes; Nidhi Bindal; Dave Beare; James A Smith; I Richard Thompson; Sridhar Ramaswamy; P Andrew Futreal; Daniel A Haber; Michael R Stratton; Cyril Benes; Ultan McDermott; Mathew J Garnett Journal: Nucleic Acids Res Date: 2012-11-23 Impact factor: 16.971
Authors: Maryam Pouryahya; Jung Hun Oh; James C Mathews; Zehor Belkhatir; Caroline Moosmüller; Joseph O Deasy; Allen R Tannenbaum Journal: Int J Mol Sci Date: 2022-01-19 Impact factor: 6.208