| Literature DB >> 35992221 |
Golnaz Taheri1,2, Mahnaz Habibi3.
Abstract
The World Health Organization (WHO) introduced "Coronavirus disease 19" or "COVID-19" as a novel coronavirus in March 2020. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) requires the fast discovery of effective treatments to fight this worldwide crisis. Artificial intelligence and bioinformatics analysis pipelines can assist with finding biomarkers, explanations, and cures. Artificial intelligence and machine learning methods provide powerful infrastructures for interpreting and understanding the available data. On the other hand, pathway enrichment analysis, as a dominant tool, could help researchers discover potential key targets present in biological pathways of host cells that are targeted by SARS-CoV-2. In this work, we propose a two-stage machine learning approach for pathway analysis. During the first stage, four informative gene sets that can represent important COVID-19 related pathways are selected. These "representative genes" are associated with the COVID-19 pathology. Then, two distinctive networks were constructed for COVID-19 related signaling and disease pathways. In the second stage, the pathways of each network are ranked with respect to some unsupervised scorning method based on our defined informative features. Finally, we present a comprehensive analysis of the top important pathways in both networks. Materials and implementations are available at: https://github.com/MahnazHabibi/Pathway.Entities:
Keywords: Coronavirus disease 2019; Machine learning; SARS-CoV-2; Unsupervised learning
Year: 2022 PMID: 35992221 PMCID: PMC9384336 DOI: 10.1016/j.asoc.2022.109510
Source DB: PubMed Journal: Appl Soft Comput ISSN: 1568-4946 Impact factor: 8.263
Fig. 1A schematic view of a biological pathway.
Fig. 2The workflow of the proposed methods.
Fig. 3Venn diagram of COVID-19 related genes in sets E, Cl, Co, respectively.
Fig. 4Percentage of shared genes in more than 2, 3, 4, 5 and 6 out of 6 underlying diseases.
The number of related pathways in four COVID-19 related selected gene sets.
| Co | Cl | T | E | S | |
|---|---|---|---|---|---|
| No. Signaling pathway | 4 | 15 | 4 | 22 | 39 |
| No. Disease pathway | 12 | 23 | 2 | 31 | 56 |
List of top 10 signaling pathways with respect to the highest Laplacian Score.
| Signaling pathway ID | No. genes | References |
|---|---|---|
| hsa04151:PI3K-Akt signaling pathway | 80 | |
| hsa04010:MAPK signaling pathway | 67 | |
| hsa04620:Toll-like receptor signaling pathway | 31 | |
| hsa04621:NOD-like receptor signaling pathway | 16 | |
| hsa04062:Chemokine signaling pathway | 10 | |
| hsa04310:Wnt signaling pathway | 18 | |
| hsa04152:AMPKs signaling pathway | 28 | |
| hsao4630:Jak-STAT signaling pathway | 30 | |
| hsa04668:TNF signaling pathway | 32 | |
| hsa04064:NF- | 26 |
List of top 10 disease pathways with respect to the highest Laplacian Score.
| Disease pathway ID | No. genes | Important genes |
|---|---|---|
| hsa05200:Pathways in cancer | 98 | P53, STAT3 |
| hsa05164:Influenza A | 56 | TNFRSF1A, IKBKB, JAK1 |
| hsa05166:HTLV-I infection | 49 | P53, MYC, JAK1 |
| hsa05152:Tuberculosis | 44 | TNFRSF1A, CASP3 |
| hsa04931:Insulin resistance | 34 | MTOR, IKBKB |
| hsa05169:Epstein-Barr virus infection | 30 | MYC, P53 |
| hsa05321:Inflammatory bowel disease (IBD) | 20 | IL2, STAT3 |
| hsa05323:Rheumatoid arthritis | 28 | CCL2, TNF, CCL5 |
| hsa05144:Malaria | 21 | TGFB1, CXCL8 |
| hsa05130:Pathogenic Escherichia coli infection | 10 | CD14, ACTB |
Fig. 5TLR signaling pathway cross-talks and immune deficiencies.
Fig. 6The PPI network associated with 98 genes are shared between pathways in cancer and COVID-19.
Fig. 7The PPI network associated with 56 genes are shared between Influenza A and COVID-19.
Fig. 8The PPI network associated with 49 genes are shared between the HTLV-I infection pathway and COVID-19.
Fig. 9Venn diagram of COVID-19 related genes in three highly scored disease pathways.
The list of shared genes between three disease pathways and their functions.
| Gene name | Function |
|---|---|
| NFKBIA | The NFKBIA helps keep NF- |
| PIK3CD | The PIK3CD provides instructions for an enzyme called phosphatidylinositol 3-kinase (PI3K). PI3K is found in white blood cells, including immune system B cells and T cells. These cells identify and attack foreign invaders like viruses and bacteria and prevent infection |
| JUN | The JUN encodes a protein that is highly similar to the viral protein. This gene interacts directly with precise target DNA sequences to control gene expression. JUN is mapped to a chromosomal region involved in both translocations and deletions in human malignancies |
| JAK1 | The JAK1 has a critical role in influencing the expression of genes that mediate inflammation. This gene is an important part of the (IL-6)/JAK1/STAT3 inflammation and immune response, and it could be an effective drug target for reducing cytokine storms |
| AKT1 | The AKT1 encodes one of the three members of the human AKT serine-threonine protein kinase family. AKT proteins control a large number of cellular functions like cell proliferation, metabolism, survival, and angiogenesis in both malignant and normal cells |
| GSK3B | The GSK3B encodes a protein that belongs to the glycogen synthase kinase subfamily. It is involved in energy metabolism, inflammation, ER-stress, mitochondrial dysfunction, and apoptotic pathways |
| IL6 | The IL6 encodes a cytokine that operates in inflammation and in the maturation of B cells. The function of IL6 is implicated in a wide range of inflammation-associated diseases. High levels of the encoded protein have been found in virus infections like COVID-19 |
| PIK3CG | The PIK3CG encodes a protein that is a class I catalytic subunit of PI3K. Since PI3K is found in the immune system, which identifies and attacks viruses and bacteria, it prevents infection. |
| NFKB1 | The NFKB is a transcription regulator that is activated by various cellular stimuli like cytokines, bacterial and viral products. It has a major role in the regulation of the early response to viral infection. Inappropriate activation of NFKB has been associated with several inflammatory diseases |
| IKBKB | The IKBKB encodes a protein that causes dissociation of the inhibitor and activation of NF- |
| PIK3CA | The PIK3CA provides instructions for PI3K. Since PI3K is found in the immune system, which identifies and attacks viruses and bacteria, it prevents infection |