| Literature DB >> 35106508 |
Eugene F Douglass1,2, Robert J Allaway3, Bence Szalai4, Wenyu Wang5, Tingzhong Tian6, Adrià Fernández-Torras7, Ron Realubit1, Charles Karan1, Shuyu Zheng5, Alberto Pessia5, Ziaurrehman Tanoli5, Mohieddin Jafari5, Fangping Wan6, Shuya Li6, Yuanpeng Xiong8, Miquel Duran-Frigola7, Martino Bertoni7, Pau Badia-I-Mompel7, Lídia Mateo7, Oriol Guitart-Pla7, Verena Chung3, Jing Tang5, Jianyang Zeng6,9, Patrick Aloy7,10, Julio Saez-Rodriguez11, Justin Guinney3, Daniela S Gerhard12, Andrea Califano1,13,14,15,16.
Abstract
The Columbia Cancer Target Discovery and Development (CTD2) Center is developing PANACEA, a resource comprising dose-responses and RNA sequencing (RNA-seq) profiles of 25 cell lines perturbed with ∼400 clinical oncology drugs, to study a tumor-specific drug mechanism of action. Here, this resource serves as the basis for a DREAM Challenge assessing the accuracy and sensitivity of computational algorithms for de novo drug polypharmacology predictions. Dose-response and perturbational profiles for 32 kinase inhibitors are provided to 21 teams who are blind to the identity of the compounds. The teams are asked to predict high-affinity binding targets of each compound among ∼1,300 targets cataloged in DrugBank. The best performing methods leverage gene expression profile similarity analysis as well as deep-learning methodologies trained on individual datasets. This study lays the foundation for future integrative analyses of pharmacogenomic data, reconciliation of polypharmacology effects in different tumor contexts, and insights into network-based assessments of drug mechanisms of action.Entities:
Keywords: DREAM challenge; community challenge; pharmacogenomics; polypharmacology
Mesh:
Substances:
Year: 2022 PMID: 35106508 PMCID: PMC8784774 DOI: 10.1016/j.xcrm.2021.100492
Source DB: PubMed Journal: Cell Rep Med ISSN: 2666-3791
Figure 1Underlying data and structure of polypharmacology community challenge
(A) Drug mechanism can be divided into direct binding targets and downstream effectors.
(B) The PANACEA-database-given transcriptional profiles of cell lines perturbed by clinical oncology drugs.
(C) Kinome-binding profiles of 32 kinase inhibitors.
(D) Transcriptional Hallmark programs induced by 32 kinase inhibitors (this data represents the average of two technical replicates where the same cell line was perturbed and sequenced on 2 different days).
(E) Challenge structure: participants are given perturbed RNA-seq and dose response data and asked to predict protein targets.
(F) Challenge evaluation: participant predictions are evaluated based on the enrichment of <μM binders within each drug target prediction vector.
Number of additional datasets used by participants for training and algorithm class
| Team | SC1 | SC2 | No. of drug-AUC datasets | No. of drug-mRNA datasets | No. of drug-target datasets | Total training datasets | Algorithm class |
|---|---|---|---|---|---|---|---|
| Netphar | 12.6 | 70.9 | 6 | 1 | 4 | 11 | similarity |
| SBNB | 11.7 | 59.2 | 6 | 3 | 2 | 11 | similarity |
| Xielab | 13.8 | 50.3 | 6 | 2 | 1 | 9 | similarity |
| Atom | 17.4 | 49.3 | – | 2 | 4 | 6 | NN |
| DMIS_PDA | 13.8 | 35.2 | – | 2 | 1 | 3 | NN |
| Theragen | 15.1 | 17.3 | – | 2 | 1 | 3 | similarity |
| Signal | 6.3 | 6.1 | – | 1 | 2 | 4 | regression |
| TeamAxolotl | 6.2 | 1.1 | – | – | 2 | 3 | NN |
| AMbeRland | 3.3 | 1.1 | – | – | – | 0 | unsup. |
| SenthamizhaV | 7.4 | 0.9 | – | – | – | 0 | unsup. |
Drug sensitivity (AUC) databases include: NCI60, GDSC, CTRP, gCSI, CCLE, and other manually curated data.
Drug mRNA perturbation databases include: L1000-drugs, L1000-shRNA, and CREEDS.
Drug target datasets include: DrugBank, ChEMBL, KEGG, and MATADOR.
Figure 2The universe of training data used in this challenge
Drug-perturbation datasets can be divided into two major categories: technology-based and literature-based, each with distinct limitations.
Figure 3Comparison of DrugBank and kinome drug target definitions
(A) An affinity threshold of 1 μM within the kinome database successfully recovered almost 80% of the kinase targets within DrugBank.
(B) The kinome-defined drug targets appear to reveal a large number of new drug-targets (in red) in addition to the canonical drug targets (in black).
(C) Drug target pairs overlap across four drug target universes.
(D) Drug target pairs not detected in the kinome database used for PANACEA evaluation.
(E) Number of successful top 10 predictions for each drug and team across the different drug target universes.
Figure 4Different kinase pathways show distinct mRNA signatures when inhibited
(A and B) Across all models, tyrosine kinase (TK)-targeting drugs performed the best.
(C) Distribution of kinases profiled across the Human Kinome annotated by kinase group.
(D) Correlation of kinase-binding data with transcriptional program.
(E and F) KEGG pathway transformation of kinase space from (C) revealed pathway-specific transcriptional signatures
Figure 5Comparison of the two winning strategies: weighted similarity and neural networks
(A) Team Netphar (who won SC2) used a simple matrix manipulation procedure to predict drug targets.
(B) Team Atom (who won SC1) used a protein-sequence-trained neural network.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| AEE788 | SelleckChem | S1486 |
| Afatinib | SelleckChem | S1011 |
| AZD5363 | SelleckChem | S8019 |
| Bafetinib | SelleckChem | S1369 |
| Bosutinib | SelleckChem | S1014 |
| Cabozantinib | SelleckChem | S1119 |
| Cediranib | SelleckChem | S1017 |
| Crenolanib | SelleckChem | S2730 |
| Crizotinib | SelleckChem | S1068 |
| Dacomitinib | SelleckChem | S2727 |
| Dasatinib | SelleckChem | S1021 |
| Dovitinib | SelleckChem | S1018 |
| Foretinib | SelleckChem | S1111 |
| Gefitinib | SelleckChem | S1025 |
| Icotinib | SelleckChem | S2922 |
| Imatinib | SelleckChem | S2475 |
| KW2449 | SelleckChem | S2158 |
| Lapatinib | SelleckChem | S2111 |
| Linifanib | SelleckChem | S1003 |
| MGCD365 | SelleckChem | S1361 |
| MK2206 | SelleckChem | S1078 |
| Neratinib | SelleckChem | S2150 |
| Nilotinib | SelleckChem | S1033 |
| Osimertinib | SelleckChem | S7297 |
| Ponatinib | SelleckChem | S1490 |
| Quizartinib | SelleckChem | S1526 |
| Regorafenib | SelleckChem | S1178 |
| Sorafenib | SelleckChem | S1040 |
| Sunitinib | SelleckChem | S1042 |
| Tivantinib | SelleckChem | S2753 |
| Vandetanib | SelleckChem | S1046 |
| Varlitinib | SelleckChem | S2755 |
| CellTiter-Glo Luminescent Viability Assay | Promega | G7570 |
| PANACEA gene expression profiles. | This paper | GEO: GSE186341 |
| AsPC-1 | ATCC | ATCC Cat# CRL-1682; RRID:CVCL_0152 |
| DU 145 | ATCC | ATCC Cat# HTB-81; RRID:CVCL_0105 |
| EFO-21 | DSMZ | DSMZ Cat# ACC-235; RRID:CVCL_0029 |
| HCC1143 | ATCC | ATCC Cat# CRL-2321; RRID:CVCL_1245 |
| HF2597 | Henry Ford | N/A |
| HSTS | Broad | RRID:CVCL_L296 |
| KRJ1 | Califano Lab | RRID:CVCL_8886 |
| LNCaP | ATCC | ATCC Cat# CRL-1740; RRID:CVCL_1379 |
| NCI-H1793 | ATCC | ATCC Cat# CRL-5896; RRID:CVCL_1496 |
| PANC-1 | ATCC | ATCC Cat# CRL-1469; RRID:CVCL_0480 |
| U-87 MG | ATCC | ATCC Cat# HTB-14; RRID:CVCL_0022 |
| STAR aligner, 2.5.2b | Dobin et al. | |
| Limma 3.48.1 | Ritchie et al. | |
| DESeq2 | Love et al. | |
| ComBat | Johnson et al. | |
| Analysis code | This paper | |