| Literature DB >> 36081949 |
Rui Xuan Huang1, Damrongrat Siriwanna2, William C Cho3, Tsz Kin Wan1, Yan Rong Du4, Adam N Bennett2, Qian Echo He2, Jun Dong Liu2, Xiao Tai Huang5, Kei Hang Katie Chan1,2,6.
Abstract
Lung cancer is the leading cause of cancer deaths globally, and lung adenocarcinoma (LUAD) is the most common type of lung cancer. Gene dysregulation plays an essential role in the development of LUAD. Drug repositioning based on associations between drug target genes and LUAD target genes are useful to discover potential new drugs for the treatment of LUAD, while also reducing the monetary and time costs of new drug discovery and development. Here, we developed a pipeline based on machine learning to predict potential LUAD-related target genes through established graph attention networks (GATs). We then predicted potential drugs for the treatment of LUAD through gene coincidence-based and gene network distance-based methods. Using data from 535 LUAD tissue samples and 59 precancerous tissue samples from The Cancer Genome Atlas, 48,597 genes were identified and used for the prediction model building of the GAT. The GAT model achieved good predictive performance, with an area under the receiver operating characteristic curve of 0.90. 1,597 potential LUAD-related genes were identified from the GAT model. These LUAD-related genes were then used for drug repositioning. The gene overlap and network distance with the target genes were calculated for 3,070 drugs and 672 preclinical compounds approved by the US Food and Drug Administration. At which, bromoethylamine was predicted as a novel potential preclinical compound for the treatment of LUAD, and cimetidine and benzbromarone were predicted as potential therapeutic drugs for LUAD. The pipeline established in this study presents new approach for developing targeted therapies for LUAD.Entities:
Keywords: deep learning; drug repositioning; gene prediction; graph attention networks; lung adenocarcinoma; machine learning
Year: 2022 PMID: 36081949 PMCID: PMC9445420 DOI: 10.3389/fphar.2022.936758
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.988
FIGURE 1Lung adenocarcinoma target gene prediction–drug repositioning pipeline. (A) data collection and pre-processing; (B) graph attention network (GAT) model building and lung adenocarcinoma (LUAD)-related target gene prediction; (C) potential LUAD drug prediction. FDR, false discovery rate; TCGA, The Cancer Gene Atlas.
FIGURE 2Analysis of differentially expressed LUAD-related genes. FDR, false discovery rate.
FIGURE 3Areas under the receiver operating characteristic curves of the models. AUROC, area under the receiver operating characteristic curve; GAT, graph attention networks; GCN, graph convolutional network.
Model evaluation metrics: AUROC, precision, recall, and F1 score.
| Model name | AUROC | Precision | Recall | F1-score |
|---|---|---|---|---|
| GAT | 0.90 | 0.85 | 0.85 | 0.85 |
| GCN | 0.87 | 0.82 | 0.82 | 0.82 |
| Adaboost | 0.85 | 0.79 | 0.79 | 0.79 |
| XGBoost | 0.83 | 0.77 | 0.76 | 0.76 |
| Random forest | 0.81 | 0.77 | 0.74 | 0.74 |
| TabNet | 0.78 | 0.75 | 0.75 | 0.75 |
| CTD validation | 0.82 | 0.78 | 0.78 | 0.78 |
FIGURE 4Gene-gene interaction network trained by GAT model. Blue nodes: LUAD-related genes determined by gene enrichment analysis; red nodes: LUAD-related genes predicted by GAT model; grey nodes: genes with no obvious association with LUAD.
Potential drugs or preclinical compounds for lung adenocarcinoma.
| Drug/preclinical compound | Jaccard score | Z-score | Validation | Reference |
|---|---|---|---|---|
| Chlorpromazine | 0.059 | −2.371 |
| ( |
| Bromoethylamine | 0.057 | −2.476 | NA | - |
| Azathioprine | 0.054 | −3.848 | Reported | ( |
| Ethanol | 0.050 | −2.310 |
|
|
| Papaverine | 0.044 | −3.345 |
|
|
| Fluoxetine | 0.043 | −3.516 | Phase 2 |
|
| Cimetidine | 0.040 | −2.125 | NA | - |
| Benzbromarone | 0.039 | −2.325 | NA | - |
| Rotenone | 0.038 | −3.437 |
|
|
| Sulfasalazine | 0.038 | −2.663 |
|
|
Physical drug-target interaction pairs and IC50 values.
| Drugs | Target name | Gene symbol | IC50 (nM) | Cell line |
|---|---|---|---|---|
| Benzbromarone | Cytochrome P450 2C9 | CYP2C9 | 41.00 |
|
| Aldo-keto reductase family 1 member C1 | AKR1C1 | 48.00 | BAOEC cells | |
| Solute carrier family 22 member 6 | SLC22A6 | 4,600.00 |
| |
| Cimetidine | Histamine H2 receptor | HRH2 | 500.00 | U2OS cells |
| Bromoethylamine | Neuropeptide Y receptor type 1 | NPY1R | 0.06 | SK-N-MC cells |
| G protein-coupled receptor kinase 5 | GRK5 | 11.80 | In-silico study (3D-QSAR) | |
| Matrix metalloproteinase-9 | MMP9 | 26.00 | Kinetic study | |
| Anoctamin-1 | ANO1 | 156.50 | FRT, and U251 cells | |
| Protein farnesyltransferase chain B | FNTB | 180.00 | NIH3T3 cells | |
| Delta-type opioid receptor | OPRD1 | 362.00 | CHO cells | |
| DNA repair protein RAD51 homolog 1 | RAD51 | 370.00 | HEK293 cells | |
| ATP-binding cassette sub-family G member 2 | ABCG2 | 527.50 | H460/MX20 cells | |
| Thymidylate synthase | TYMS | 1,288.00 | In-silico study (3D-QSAR) | |
| Dihydrofolate reductase | DHFR | 5,010.00 | HL-60, Bel-7402, BGC823, KB, Hela, and SK-OV-3 cells |