| Literature DB >> 32425783 |
Wenying Yan1, Xingyi Liu1, Yibo Wang1, Shuqing Han1, Fan Wang1, Xin Liu1, Fei Xiao1, Guang Hu1,2.
Abstract
Pancreatic ductal adenocarcinoma (PDAC) is one of the leading causes of cancer-related death and has an extremely poor prognosis. Thus, identifying new disease-associated genes and targets for PDAC diagnosis and therapy is urgently needed. This requires investigations into the underlying molecular mechanisms of PDAC at both the systems and molecular levels. Herein, we developed a computational method of predicting cancer genes and anticancer drug targets that combined three independent expression microarray datasets of PDAC patients and protein-protein interaction data. First, Support Vector Machine-Recursive Feature Elimination was applied to the gene expression data to rank the differentially expressed genes (DEGs) between PDAC patients and controls. Then, protein-protein interaction networks were constructed based on the DEGs, and a new score comprising gene expression and network topological information was proposed to identify cancer genes. Finally, these genes were validated by "druggability" prediction, survival and common network analysis, and functional enrichment analysis. Furthermore, two integrins were screened to investigate their structures and dynamics as potential drug targets for PDAC. Collectively, 17 disease genes and some stroma-related pathways including extracellular matrix-receptor interactions were predicted to be potential drug targets and important pathways for treating PDAC. The protein-drug interactions and hinge sites predication of ITGAV and ITGA2 suggest potential drug binding residues in the Thigh domain. These findings provide new possibilities for targeted therapeutic interventions in PDAC, which may have further applications in other cancer types.Entities:
Keywords: drug targets; integrins; pancreatic ductal adenocarcinoma; protein-protein interactions; structural dynamics; support vector machine–recursive feature elimination
Year: 2020 PMID: 32425783 PMCID: PMC7204992 DOI: 10.3389/fphar.2020.00534
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Figure 1The computational pipeline proposed in this work included three steps. Overall, a machine learning method was used to identify DEGs in PDAC, which were then combined with two parameters of the PPI network to define a new score that predicted disease genes and drug targets in PDAC. All potential targets were then further verified by other bioinformatics analyses and investigated by a “druggability” analysis of structural and dynamic properties.
Information on the included GEO datasets.
| Accessions | Platforms | Samples | References |
|---|---|---|---|
| Affymetrix Human Genome U133 Plus 2.0 Array | 36 vs. 36 | ( | |
| Affymetrix Human Gene 1.0 ST Array | 45 vs. 45 | ( | |
| Affymetrix Human Genome U133 Plus 2.0 Array | 14 vs. 8 | ( |
Figure 2Differentially expressed genes (DEGs) between PDACs and normal tissues. (A–C) Volcano plot of −log10 (FDR) vs. log2 (fold change) of DEGs in the three datasets. (D) Venn diagram with the number of overlapping DEGs from the different datasets.
Figure 3Potential drug targets in the PPI network. The genes that were predicted by our pipeline are marked with red labels. The node size denotes the average RNs of the gene in two or three datasets.
Identified potential drug targets for PDAC.
List of prioritized protein targets with their drug target information and “druggability” features.
| Gene | RNs | Drug targets* | Drug(s)# | Disease(s)# | PDB | DS |
|---|---|---|---|---|---|---|
| 5.34 | Yes | XL784 | Solid tumor/cancer, Breast cancer | 6BE6 | 0.694 | |
| 4.79 | No | NA | NA | 1LQN | 0.839 | |
| 4.77 | Yes | Sucralfate, Tesevatinib, Alpha-Aminobutyric Acid, Cholecystokinin | Oral mucositis, Vulnerary | template: 5GJE | 0.968 | |
| 3.31 | No | NA | NA | template: 6BXJ | 0.545 | |
| 3.17 | No | NA | NA | template: 2Q4U | 0.677 | |
| 3.12 | Yes | Zinc, Zinc acetate, Zinc chloride | NA | 5TPT | 0.912 | |
| 3.08 | Yes | Abituzumab, | Colorectal cancer, Solid tumour/cancer | 3IJE | 0.663 | |
| 3.05 | Yes | Hyaluronic acid | NA | template: 4CSY | NA | |
| 3.03 | Yes | Thiodigalactoside, 1,4-Dithiothreitol, Mercaptoethanol, Artenimol | NA | 3W59 | NA | |
| 3.027 | No | NA | NA | Templates: 3K71, 4NEH, 3K6S | 0.672 | |
| 2.85 | Yes | Gadobenate Dimeglumine, Glycyrrhizic acid, Patent Blue, (365 drugs) | Hemophilia, Schizophrenia | 4BKE | 1.000 | |
| 2.82 | Yes | Pyruvic acid, | Pain, Renal cell carcinoma; | 6GG5 | 0.996 | |
| 2.54815 | No | NA | NA | template: 4DEQ | NA | |
| 2.52 | Yes | Zinc, Zinc acetate, Zinc chloride | NA | template: 5J2L | 0.503 | |
| 2.45 | Yes | Copper, Human calcitonin | NA | template: 4D1E | 0.673 | |
| 2.31 | No | NA | NA | 3MQB | 0.821 | |
| 2.0 | Yes | Amatuximab | Ovarian/Pancreatic cancer | 4F3F | 0.727 |
*”YES” means drug target, and “NO” means non-drug target; #”NA” means no drug and disease information, or no druggable pockets.
Figure 4Kaplan-Meier survival curves of overall survival from the human protein atlas datasets for potential drug targets divided by high (red) or low (green) expression level.
Figure 5Four modules were discovered within PPI networks. Genes that were predicted in at least two datasets are marked red, while genes that were predicted in only one dataset are marked blue.
Figure 6Top 10 enriched GO terms in biological processes, cellular components, and molecular functions.
Top 10 enriched KEGG pathways (integrins and collagens are marked in bold).
| KEGG term | Gene(s) | Count | Adjust p-value |
|---|---|---|---|
| 15 | 2.62E-11 | ||
| 20 | 2.67E-11 | ||
| 19 | 2.97E-10 | ||
| 23 | 4.12E-10 | ||
| 17 | 8.45E-10 | ||
| 9 | 1.25E-05 | ||
| 17 | 3.87E-05 | ||
| 9 | 1.03E-04 | ||
| 8 | 3.01E-04 | ||
| 8 | 3.01E-04 |
Figure 7Structures and dynamics of ITGAV and ITGA2. (A) The structure of ITGAV including the β-propeller, Thigh, Calf-1, and Calf-2 domains, and the most druggable pocket (purple), which is located along the outer side of the β-barrel. (B) The binding poses by docking Levothyroxine into the most druggable pocket of ITGAV. Levothyroxine and interacting residues are represented as colored sticks. (C) The structure of ITGA2 including the I-, β-propeller, Thigh, Calf-1, and Calf-2 domains, and the most druggable pocket (purple), which is located at the hole of the β-barrel; the binding pose with Levothyroxine and this pocket is shown in (D). (E) The shapes of first and second GNM modes of ITGAV. The minimum of the shapes indicate the hinge region, which corresponds to the structure in dark blue. Mode 1 predicts Asn455, Ser471, Arg553, and Gly594 within the Thigh domain are hinge sites (red arrows). (F) The shape of the first GNM mode of ITGA2, where the region of Phe681 to Ser737 within the Thigh domain was predicted to contain hinge sites (red circle).