| Literature DB >> 35053185 |
Suma L Sivan1, Vinod Chandra S Sukumara Pillai2.
Abstract
Network biology has become a key tool in unravelling the mechanisms of complex diseases. Detecting dys-regulated subnetworks from molecular networks is a task that needs efficient computational methods. In this work, we constructed an integrated network using gene interaction data as well as protein-protein interaction data of differentially expressed genes derived from the microarray gene expression data. We considered the level of differential expression as well as the topological weight of proteins in interaction network to quantify dys-regulation. Then, a nature-inspired Smell Detection Agent (SDA) optimisation algorithm is designed with multiple agents traversing through various paths in the network. Finally, the algorithm provides a maximum weighted module as the optimum dys-regulated subnetwork. The analysis is performed for samples of triple-negative breast cancer as well as colorectal cancer. Biological significance analysis of module genes is also done to validate the results. The breast cancer subnetwork is found to contain (i) valid biomarkers including PIK3CA, PTEN, BRCA1, AR and EGFR; (ii) validated drug targets TOP2A, CDK4, HDAC1, IL6, BRCA1, HSP90AA1 and AR; (iii) synergistic drug targets EGFR and BIRC5. Moreover, based on the weight values assigned to nodes in the subnetwork, PLK1, CTNNB1, IGF1, AURKA, PCNA, HSPA4 and GAPDH are proposed as drug targets for further studies. For colorectal cancer module, the analysis revealed the occurrence of approved drug targets TYMS, TOP1, BRAF and EGFR. Considering the higher weight values, HSP90AA1, CCNB1, AKT1 and CXCL8 are proposed as drug targets for experimentation. The derived subnetworks possess cancer-related pathways as well. The SDA-derived breast cancer subnetwork is compared with that of tools such as MCODE and Minimum Spanning Tree, and observed a higher enrichment (75%) of significant elements. Thus, the proposed nature-inspired algorithm is a novel approach to derive the optimum dys-regulated subnetwork from huge molecular network.Entities:
Keywords: breast cancer; colorectal cancer; differential expression; disease genes; drug target; smell detection agent optimisation; subnetwork; topological weight
Mesh:
Year: 2021 PMID: 35053185 PMCID: PMC8774275 DOI: 10.3390/biom12010037
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1Data Flow of the proposed approach for subnetwork detection. The Smell Detection Agent (SDA) optimisation algorithm is applied on the network created using gene interaction data and protein–protein interaction data.
Figure 2Correlation matrix generated from the Differentially Expressed (DE) gene set with rows and columns corresponding to the differentially expressed genes, and each cell holds the measure of the difference in correlation values across the samples.
Figure 3Graphical representation of the portion of the integrated network with final weights. Node weight comprises the differential weight of the gene g and the topological weight of the corresponding protein p. Edge weight comprises the correlation value of genes and the connectivity score of proteins in the PPI graph.
Figure 4Objective function based on varying values of parameter δ. The smell update coefficient takes the value 0.5 corresponding to the maximum objective function. Agent_count represents the number of agents used by the algorithm for finding separate paths. After performing multiple runs, final value is taken as 8.
Comparing performance of the proposed algorithm and Artificial Bee Colony (ABC) algorithm.
| SDA Algorithm | ||
|---|---|---|
| No. of Agents | Objective Value | Time (s) |
| 4 | 111.1 | 0.03 |
| 8 | 110.98 | 0.05 |
| 12 | 111.0 | 0.07 |
| 14 | 111.5 | 0.08 |
|
| ||
|
|
|
|
| 20 | 104.37 | 0.238 |
| 30 | 105.18 | 0.467 |
| 40 | 108.94 | 0.574 |
Figure 5Visual representation of SDA derived dys-regulated subnetwork for Triple Negative Breast Cancer (TNBC) with 60 nodes and 940 edges. The nodes correspond to the genes in the optimum path with optimum weight values. The varying weights in increasing order is represented as colour gradient between yellow and purple.
Figure 6Degree-based view of the TNBC module generated by Cytoscape showing higher connectivity among the nodes.
Module gene associations with diseases for TNBC gene set, verified with other methods.
| Molecules Approved/Undergoing Studies | Significance Observed | Genes in SDA Module Overlapped with Other Methods | Number of Overlapped Molecules |
|---|---|---|---|
| Biomarker | 7 | ||
| Drug targets under clinical validation/pre-clinical evaluation | 12 | ||
| Proposed targets | Chosen based on weights | ||
| Genes found in DisGeNET database | Disease associated genes | 32 | |
| Genes found in TNBCdb database | Disease associated genes | 45 |
Top five pathways identified by KEGG tool from the TNBC subnetwork.
| Pathway Description | Genes Present | |
|---|---|---|
| hsa04110: Cell cycle | 3.84 × 10−17 | |
| hsa05200: Pathways in cancer | 5.49 × 10−16 | |
| hsa04115: p53signalling pathway | 5.18 × 10−8 | |
| hsa04915: Estrogen signalling pathway | 1.35 × 10−5 | |
| hsa05202: Transcriptional mis regulation in cancer | 0.0022 |
Pathway enrichment analysis of genes found in the TNBC subnetwork was conducted. For a cut-off p-value < 0.05, 55 functionally relevant pathways were obtained, and five are shown here. The list of all pathways is given as Supplementary File S3.
Comparing enrichment of significant elements in the subnetwork.
| Method | Path Size | Disease Genes (%) | Drug Targets | Significant Pathways | Biomarkers |
|---|---|---|---|---|---|
| MCODE | 88 | 32 (36%) | 7 | 4 | 9 |
| MST | 58 | 37 (64%) | 10 | 2 | 7 |
| SDA | 60 | 45 (75%) | 10 | 7 | 12 |
Figure 7The optimum dys-regulated subnetwork generated by the SDA algorithm for CRC. The yellow nodes represent genes with low weights in the module and top-weighted nodes are purple.
Module gene associations with diseases for the CRC gene set.
| Gene Symbol Identified by Other Methods | Significance Observed/Method Used | Genes in SDA Module | No. of Overlapped Genes |
|---|---|---|---|
| Dense module/Cytoscape [ | 11 | ||
| Hub genes [ | 6 | ||
| Common onco genes and tumor suppressor genes [ | |||
| Drug targets [ | 5 | ||
| - | Proposed targets |
Relevance of genes found in the SDA-derived module for CRC data was assessed. By comparing with results by other tools, hub genes, dense module genes and drug targets were identified. Overall, 80% of genes in subnetwork was found to be validated with the compared techniques.
Pathways observed during analysis of CRC subnetwork genes. This table shows a few top pathways associated to cellular functions, signalling and cancer-related processing.
| Pathway Description | Genes Present | |
|---|---|---|
| hsa04110: Cell cycle | 1.89 × 10−14 | |
| hsa04115: p53 signaling pathway | 8.55 × 10−9 | |
| hsa03010: ribosome | 2.45 × 10−5 | |
| hsa05200: Pathways in cancer | 3.20 × 10−5 | |
| hsa05210: Colorectal cancer | 6.76 × 10−4 | |
| hsa04151: PI3K-Akt signaling pathway | 0.0065 | |
| hsa0401: MAPK signaling pathway | 0.0238 |
The pathway enrichment analysis by KEGG has returned 35 pathways for a cut-off p-value < 0.05. This table shows seven functionally relevant pathways comprising the top genes of the derived subnetwork. The list of all pathways is given as Supplementary File S3.