| Literature DB >> 33281870 |
Minsik Oh1, Sungjoon Park1, Sangseon Lee2, Dohoon Lee3, Sangsoo Lim2, Dabin Jeong3, Kyuri Jo4, Inuk Jung5, Sun Kim2,3,6.
Abstract
Pharmacogenomics is the study of how genes affect a person's response to drugs. Thus, understanding the effect of drug at the molecular level can be helpful in both drug discovery and personalized medicine. Over the years, transcriptome data upon drug treatment has been collected and several databases compiled before drug treatment cancer cell multi-omics data with drug sensitivity (IC 50, AUC) or time-series transcriptomic data after drug treatment. However, analyzing transcriptome data upon drug treatment is challenging since more than 20,000 genes interact in complex ways. In addition, due to the difficulty of both time-series analysis and multi-omics integration, current methods can hardly perform analysis of databases with different data characteristics. One effective way is to interpret transcriptome data in terms of well-characterized biological pathways. Another way is to leverage state-of-the-art methods for multi-omics data integration. In this paper, we developed Drug Response analysis Integrating Multi-omics and time-series data (DRIM), an integrative multi-omics and time-series data analysis framework that identifies perturbed sub-pathways and regulation mechanisms upon drug treatment. The system takes drug name and cell line identification numbers or user's drug control/treat time-series gene expression data as input. Then, analysis of multi-omics data upon drug treatment is performed in two perspectives. For the multi-omics perspective analysis, IC 50-related multi-omics potential mediator genes are determined by embedding multi-omics data to gene-centric vector space using a tensor decomposition method and an autoencoder deep learning model. Then, perturbed pathway analysis of potential mediator genes is performed. For the time-series perspective analysis, time-varying perturbed sub-pathways upon drug treatment are constructed. Additionally, a network involving transcription factors (TFs), multi-omics potential mediator genes, and perturbed sub-pathways is constructed, and paths to perturbed pathways from TFs are determined by an influence maximization method. To demonstrate the utility of our system, we provide analysis results of sub-pathway regulatory mechanisms in breast cancer cell lines of different drug sensitivity. DRIM is available at: http://biohealth.snu.ac.kr/software/DRIM/.Entities:
Keywords: drug-response; multi-omics; perturbed pathway; pharmacogenomics; time-series; web-system
Year: 2020 PMID: 33281870 PMCID: PMC7689278 DOI: 10.3389/fgene.2020.564792
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Pharmacogenomics data analysis methods, their input, output, and algorithms.
| MOLI | Multi-omics data | Drug response (IC50) | Deep learning |
| DSPLMF | Multi-omics data, chemical structures | Drug response (IC50) | Logistic Matrix Factorization |
| CancerDAP | Multi-omics data | Sub-pathway signatures for drug response | Random forest, logistic regression |
| DryNetMC | Drug treatment time-series gene expression data | Clinically relevant genes | Differential network analysis |
Figure 1Phenotypic change of cell over time by drug. DRIM makes it possible to interpret drug response at molecular level by investigating perturbed sub-pathways.
Figure 2The systematic workflow of the system. Step 1 is for drug and cell line selection. Step 2 is for perturbed sub-pathway identification using expression propagation. Step 3 is for selecting multi-omics potential mediator genes by multi-omics embedding methods. Step 4 is for constructing time-bound network and determining regulatory path by influence maximization. Step 5 is to visualize the analysis result.
Figure 3Multi-omics potential mediator gene selection. (A) Multi-omics integration by tensor decomposition. (B) Multi-omics integration by autoencoder. (C) IC50-related feature selection using Lasso regression with embedded feature matrix. (D) Gene selection of tensor decomposition from selected features. (E) Gene selection of autoencoder from selected features.
Figure 4Multi-omics data analysis result before drug treatment. (A) Three tables are shown: cell line with IC50 table, multi-omics potential mediator genes with score table, and perturbed pathway with P-value table. (B) Perturbed pathway mapping to KEGG pathway. (C) An enriched pathway dot plot.
Figure 5Time-series gene expression data analysis result after drug treatment. (A) Selector to visualize network of cell line. (B) A perturbed sub-pathway table of cell line. (C) Visualized time varying network TF to perturbed sub-pathway. (D) Gene information window that contains time-series gene expression plot and multi-omics data before drug treatment.
Five breast cancer cell lines that are available multi-omics data before drug treatment with lapatinib sensitivity and time-series gene expression data after drug treatment.
| BT-549 | Basal B | 2.02 |
| T-47D | Luminal | 2.90 |
| MCF7 | Luminal | 3.04 |
| MDA-MB-468 | Basal A | 3.77 |
| MDA-MB-231 | Basal B | 6.50 |
Top 15 multi-omics potential mediator genes that are related to lapatinib sensitivity.
| ERBB3 | 8.01 |
| VEGFA | 6.11 |
| PGR | 5.96 |
| CDAN1 | 5.96 |
| ABCG2 | 5.83 |
| ESR1 | 5.71 |
| CASP8 | 5.64 |
| TP53 | 5.63 |
| MAP2K7 | 5.62 |
| CNTN4 | 5.53 |
| DCTN6 | 5.44 |
| CD274 | 5.39 |
| NF2 | 5.31 |
| CBL | 5.19 |
| E2F1 | 5.14 |
The P-value of PI3K-Akt signaling pathway.
| BT-549 | 1.6e-05 |
| T-47D | 6.07e-03 |
| MCF7 | 7.11e-04 |
| MDA-MB-468 | 4.94e-05 |
| MDA-MB-231 | 1.08e-03 |
Figure 6Differentially perturbed sub-pathway networks. (A) Regulatory path sharing VEGFA in BT-549, T-47D, and MDA-MB-468. (B) Regulatory path sharing CCND3, BCL2L1 in MDA-MB-231, T-47D. (C) Regulatory path sharing SHC1 in MDA-MB-231, MCF7.