| Literature DB >> 34890156 |
Shuangxia Ren1, Yifeng Tao, Ke Yu, Yifan Xue, Russell Schwartz, Xinghua Lu.
Abstract
Application of artificial intelligence (AI) in precision oncology typically involves predicting whether the cancer cells of a patient (previously unseen by AI models) will respond to any of a set of existing anticancer drugs, based on responses of previous training cell samples to those drugs. To expand the repertoire of anticancer drugs, AI has also been used to repurpose drugs that have not been tested in an anticancer setting, i.e., predicting the anticancer effects of a new drug on previously unseen cancer cells de novo. Here, we report a computational model that addresses both of the above tasks in a unified AI framework. Our model, referred to as deep learning-based graph regularized matrix factorization (DeepGRMF), integrates neural networks, graph models, and matrix-factorization techniques to utilize diverse information from drug chemical structures, their impact on cellular signaling systems, and cancer cell cellular states to predict cell response to drugs. DeepGRMF learns embeddings of drugs so that drugs sharing similar structures and mechanisms of action (MOAs) are closely related in the embedding space. Similarly, DeepGRMF also learns representation embeddings of cells such that cells sharing similar cellular states and drug responses are closely related. Evaluation of DeepGRMF and competing models on Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE) datasets show its superiority in prediction performance. Finally, we show that the model is capable of predicting effectiveness of a chemotherapy regimen on patient outcomes for the lung cancer patients in The Cancer Genome Atlas (TCGA) dataset*.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34890156 PMCID: PMC8691529
Source DB: PubMed Journal: Pac Symp Biocomput ISSN: 2335-6928
Fig. 1:Diagram of DeepGRMF model.
1) The orange dotted box shows the procedure of the first module: using a graph regularized matrix factorization to decompose the drug response matrix into the product of cell line factor matrix A and drug factor matrix B. 2) The blue dotted box shows the procedure of the second module: using two separate neural networks to learn the mapping functions. The neural network I is used to learn the mapping function for cell lines, which maps gene expression matrix C to cell line factor matrix A. The neural network II is used to learn the mapping function for drugs, which maps the drug embedding obtained by concatenating integrative drug embedding matrix D and pathway embedding matrix E to drug factor matrix B.
Fig. 2:A) The relationship between similarities of drug sensitivity and Euclidean distance of drug representations using chemical structure with/without the drug MOAs information. B) The relationship between similarities of drug sensitivity and Euclidean distance of drug representations using chemical structure, drug effect with/without pathway information.
Performance of different models to predict drug response of new cell lines to existing drugs.
| Per Cell Line | Per Drug | Micro | ||||||
|---|---|---|---|---|---|---|---|---|
| Train/Val Data | Test Data | Model | AUROC | AUPR | AUROC | AUPR | AUROC | AUPR |
| GDSC | GDSC | Lasso | 79.1 | 53.8 | 67.1 | 38.2 | 79.3 | 55.4 |
| DeepDSC | 80.0 | 54.8 | 67.7 | 38.8 | 79.9 | 56.4 | ||
| DeepGRMF |
|
|
|
|
|
| ||
|
| ||||||||
| GDSC | CCLE | Lasso | 79.2 | 67.5 | 66.2 | 38.2 | 74.1 | 50.5 |
| DeepDSC | 80.0 | 68.3 | 67.0 | 40.5 | 75.1 | 51.5 | ||
| DeepGRMF |
|
|
|
|
|
| ||
Performance of different models to predict drug response of existing cell lines to new drugs.
| Per Cell Line | Per Drug | Micro | ||||||
|---|---|---|---|---|---|---|---|---|
| Train/Val Data | Test Data | Model | AUROC | AUPR | AUROC | AUPR | AUROC | AUPR |
| GDSC | GDSC | DeepDSC | 58.6 | 33.0 | 64.5 | 35.3 | 65.4 | 37.7 |
| DeepGRMF |
|
|
|
|
|
| ||
Performance of different models to predict drug response of new cell lines to new drugs.
| Per Cell Line | Per Drug | Micro | ||||||
|---|---|---|---|---|---|---|---|---|
| Train/Val Data | Test Data | Model | AUROC | AUPR | AUROC | AUPR | AUROC | AUPR |
| GDSC | GDSC | DeepDSC | 58.2 | 31.6 | 55.6 | 28.5 | 59.8 | 31.9 |
| DeepGRMF |
|
|
|
|
|
| ||
|
| ||||||||
| GDSC | CCLE | DeepDSC | 49.1 | 49.4 | 58.5 | 38.1 | 55.1 | 32.2 |
| DeepGRMF |
|
|
|
|
|
| ||
Fig. 3:Kaplan-Meier curves of responder and non-responder group of lung cancer patients that took Cisplatin, Pemetrexed, Paclitaxel, and/or Vinorelbine for adjuvant therapy which these four drugs treated as existing drugs (A) or as new drugs (B).