| Literature DB >> 31510700 |
Hossein Sharifi-Noghabi1,2, Olga Zolotareva3, Colin C Collins2,4, Martin Ester1,2.
Abstract
MOTIVATION: Historically, gene expression has been shown to be the most informative data for drug response prediction. Recent evidence suggests that integrating additional omics can improve the prediction accuracy which raises the question of how to integrate the additional omics. Regardless of the integration strategy, clinical utility and translatability are crucial. Thus, we reasoned a multi-omics approach combined with clinical datasets would improve drug response prediction and clinical relevance.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31510700 PMCID: PMC6612815 DOI: 10.1093/bioinformatics/btz318
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Schematic overview of MOLI (A) pre-processing mutation, CNA and gene expression data. (B) Each encoding sub-network learns features for its omics data type and the learned features are concatenated into one representation. (C) MOLI cost function consists of a triplet loss and a classification loss, obtained from the classifier sub-network that uses the multi-omics representation to predict drug response
Fig. 2.(A) Using MOLI to make predictions for PDX/patient inputs during external validation. (B) Combining targeted drugs that target the same pathway or molecule to make a pan-drug training dataset for MOLI
List of the studied drugs from the used resources with multi-omics profiles available
| Drug | Type | Resource | Number of samples | Number of genes | Usage |
|---|---|---|---|---|---|
| Afatinib | Targeted | GDSC | 828 (NR: 678, RS: 150) | 13 | Training |
| Cetuximab | Targeted | GDSC | 856 (NR: 735, RS: 121) | 12 | Training |
| Cetuximab | Targeted | PDX | 60 (NR: 55, RS: 5) | 12 | External validation |
| Cisplatin | Chemotherapy | GDSC | 829 (NR: 752, RS: 77) | 15 | Training |
| Cisplatin | Chemotherapy | TCGA | 66 (NR: 6, RS: 60) | 15 | External validation |
| Docetaxel | Chemotherapy | GDSC | 829 (NR: 764, RS: 65) | 15 | Training |
| Docetaxel | Chemotherapy | TCGA | 16 (NR: 8, RS: 8) | 15 | External validation |
| Erlotinib | Targeted | GDSC | 362 (NR: 298, RS: 64) | 12 | Training |
| Erlotinib | Targeted | PDX | 21 (NR: 18, RS: 3) | 12 | External validation |
| Gefitinib | Targeted | GDSC | 825 (NR: 710, RS: 115) | 13 | Training |
| Gemcitabine | Chemotherapy | GDSC | 844 (NR: 790, RS: 54) | 12 | Training |
| Gemcitabine | Chemotherapy | PDX | 25 (NR: 18, RS: 7) | 12 | External validation |
| Gemcitabine | Chemotherapy | TCGA | 57 (NR: 36, RS: 21) | 15 | External validation |
| Lapatinib | Targeted | GDSC | 387 (NR: 326, RS: 61) | 13 | Training |
| Paclitaxel | Chemotherapy | GDSC | 389 (NR: 363, RS: 26) | 12 | Training |
| Paclitaxel | Chemotherapy | PDX | 43 (NR: 38, RS: 5) | 12 | External validation |
| Pan-drug | Targeted | GDSC | 3258 (NR: 2747, RS: 511) | 13 | Training |
Note: NR, non-responder; RS, responder.
Number of screened samples with all three omics data types available.
Number of genes in common between the train data and the external validation data for each drug.
Number of genes for the drug-specific experiments.
Number of genes for the pan-drug experiments.
Performance of different versions of MOLI compared to the baselines in terms of prediction AUC across two targeted therapeutics and five chemotherapy agents
| Method | PDX | PDX | PDX | PDX | TCGA | TCGA | TCGA | Input omics |
|---|---|---|---|---|---|---|---|---|
| Drug | Paclitaxel | Gemcitabine | Cetuximab | Erlotinib | Docetaxel | Cisplatin | Gemcitabine | |
|
| 0.52 | 0.59 |
|
| 0.59 | 0.62 | 0.53 | Expression |
| Early integration via NMF | 0.24 | 0.56 | 0.53 | 0.28 | 0.39 | 0.40 | 0.58 | Multi |
| Early integration via DNNs | NSC |
| NSC | NSC | 0.52 | NSC | 0.59 | Multi |
| Feed forward net | 0.68 | 0.48 | 0.43 | 0.37 |
| 0.44 |
| Expression |
| MOLI complete |
| 0.52 | 0.51 | 0.39 |
|
| 0.64 | Expression |
| MOLI with classifier | NSC | 0.55 | 0.46 | NSC | 0.58 | 0.6 |
| Multi |
| MOLI complete |
|
| 0.53 | 0.63 | 0.58 |
|
| Multi |
| MOLI complete Pan-drug | NA | NA |
|
| NA | NA | NA | Multi |
Note: NSC: no stable classifier during cross validation or final training. Boldface indicates the best method for the corresponding drug and italics indicates the second best method. NA: the corresponding drug is not for targeted therapy. Complete: when MOLI has both classification and the triplet losses; NMF, non-negative matrix factorization; DNNs, deep neural networks.