| Literature DB >> 32995795 |
Mary E Dolan1, David P Hill1, Gaurab Mukherjee1, Monica S McAndrews1, Elissa J Chesler1, Judith A Blake1.
Abstract
The emergence of the SARS-CoV-2 virus and subsequent COVID-19 pandemic initiated intense research into the mechanisms of action for this virus. It was quickly noted that COVID-19 presents more seriously in conjunction with other human disease conditions such as hypertension, diabetes, and lung diseases. We conducted a bioinformatics analysis of COVID-19 comorbidity-associated gene sets, identifying genes and pathways shared among the comorbidities, and evaluated current knowledge about these genes and pathways as related to current information about SARS-CoV-2 infection. We performed our analysis using GeneWeaver (GW), Reactome, and several biomedical ontologies to represent and compare common COVID-19 comorbidities. Phenotypic analysis of shared genes revealed significant enrichment for immune system phenotypes and for cardiovascular-related phenotypes, which might point to alleles and phenotypes in mouse models that could be evaluated for clues to COVID-19 severity. Through pathway analysis, we identified enriched pathways shared by comorbidity datasets and datasets associated with SARS-CoV-2 infection.Entities:
Year: 2020 PMID: 32995795 PMCID: PMC7523125 DOI: 10.1101/2020.09.21.306720
Source DB: PubMed Journal: bioRxiv
SARS-CoV-2 (COVID-19) comorbidity-related gene sets
| Gene Set Identifier | Gene count | Gene Set Title | Gene Set Description/metadata |
|---|---|---|---|
| 219 | Kidney Disease | The Boolean Algebra tool was used to find the Union of 5 sets: GS380448, GS380449, GS380450, GS380451, GS380452. Kidney disease has been reported as a comorbidity factor in the disease (COVID-19) caused by SARS-CoV-2 infection: PMID:32247631, PMID:32233161, PMID:32232218. | |
| 267 | Hepatitis- | This gene set represents the genes in GS237945 queried on April 16, 2020 which was derived from the gene2mesh procedure. GS237945 was updated on 2019-01-07. Liver disease has been reported as a comorbidity factor in the disease (COVID-19) caused by SARS-CoV-2 infection: PMID: 32179124, PMID:32233161. | |
| 2550 | Cardiovascular Diseases- | This gene set represents the genes in GS235830 queried on April 17, 2020 which was derived from the gene2mesh procedure. GS235830 was updated on 2019-01-07. Cardiovascular disease has been reported as a comorbidity factor in the disease (COVID-19) caused by SARS-CoV-2 infection: PMID: 32179124, PMID:32267833, PMID:32232218, PMID:32217556. | |
| 1003 | Diabetes Mellitus, type 2- | This gene set represents the genes in GS242930 queried on April 16, 2020 which was derived from the gene2mesh procedure. GS242930 was updated on 2019-01-07. Diabetes has been reported as a comorbidity factor in the disease (COVID-19) caused by SARS-CoV-2 infection: PMID: 32179124, PMID:32233161, PMID:32267833, PMID:32232218, PMID:32217650, PMID:32345579. | |
| 1927 | Lung Diseases- | This gene set represents the genes in GS245016 queried on April 16, 2020 which was derived from the gene2mesh procedure. GS245016 was updated on 2019-01-07. Lung (pulmonological) disease has been reported as a comorbidity factor in the disease (COVID-19) caused by SARS-CoV-2 infection: PMID:32233161. |
SARS-CoV-2 (COVID-19) Gene Sets
| Gene Set Identifier | Gene Count | Gene Set Title | Gene Set Description/metadata |
|---|---|---|---|
| 119 | Up regulated angiogenesis and inflammation genes in lungs from patients who died from COVID-19 | This gene set describes genes that are up-regulated in lungs from patients who died from COVID-19. COVID-19 is the disease caused by SARS-CoV-2 virus. Note that this expression analysis includes only the angiogenesis-associated and inflammation-associated genes available on NanoString panels. The authors define up-regulated as those genes that show a (FDR) of <=0.05. These data are from the publication (angiogenesis) and supplementary (inflammation) materials associated with the publication. PMID:32437596 | |
| 102 | Up regulated genes in host transcriptional response to SARS-CoV-2 in Human adenocarcinomic alveolar basal epithelial (A549) cells | This gene set describes genes that are up-regulated by the host transcriptional response to SARS-CoV-2 infection in human adenocarcinomic alveolar basal epithelial (A549) cells. COVID-19 is the disease caused by SARS-CoV-2 virus. We define up-regulated as those genes that show a (log 2 fold change) of >=2. These data are from the | |
| 333 | Up regulated genes in host transcriptional response to SARS-CoV-2 in Human lung adenocarcinoma epithelial (Calu3) cells | This gene set describes genes that are up-regulated by the host transcriptional response to SARS-CoV-2 infection in human lung adenocarcinoma epithelial cells derived from pleural effusion (Calu3). COVID-19 is the disease caused by SARS-CoV-2 virus. We define up-regulated as those genes that show a (log 2 fold change) of >=2. These data are from the | |
| 14 | Up regulated genes in host transcriptional response to SARS-CoV-2 in normal human bronchial epithelium (NHBE) cells | This gene set describes genes that are up-regulated by the host transcriptional response to SARS-CoV-2 infection in normal human bronchial epithelium cells (NHBE). COVID-19 is the disease caused by SARS-CoV-2 virus. We define up-regulated as those genes that show a (log 2 fold change) of >=2. These data are from the | |
| 587 | Up regulated genes in post-mortem lung samples from COVID-19-positive patients | This gene set describes genes that are up-regulated in post-mortem lung samples from COVID-19-positive patients relative to biopsied healthy lung tissue from uninfected individuals. COVID-19 is the disease caused by SARS-CoV-2 virus. We define up-regulated as those genes that show a (log 2 fold change) of >=2. These data are from the | |
| 4 | Genes that are overexpressed in severe compared to mild cases of Covid-19 | People with severe cases of Covid-19 express these proteins at significantly higher levels than people with mild cases of Covid-19. Data from |
Genes shared by COVID-19 comorbidities.
This table shows the genes that were annotated to four out of five comorbidities that are associated with COVID-19 severity. Genes in bold are annotated to all five comorbidities.
Significantly enriched phenotype categories.
Top significantly enriched phenotype categories identified by VLAD analysis, showing how many of the eight genes shared among all five comorbidities are annotated to each phenotype category. T Cell related phenotypes included ‘increased CD-4 positive, alpha beta T cell number’ (p=3.9×10e-8) and ‘increased T-helper Cell number’ (p=1.6×10e-06). (Complete list of enriched phenotypes available supplemental table 1)
| Phenotype Category | Gene Count | P-value range |
|---|---|---|
| T Cell Related | 5 | 1.6×10e-06 to 3.9×10e-08 |
| Autoimmune Response | 5 | 1.7×10e-6 |
| Interferon gamma secretion | 4 | 3.7×10e-6 |
| Lung Inflammation | 4 | 1.7×10e-6 |
| Susceptibility to infection | 5 | 8.9×10e-7 |
| Blood Coagulation | 5 | 5.8×e10-8 |
| Cardiovascular physiology | 7 | 1.6×e10-6 |
Significantly enriched phenotype categories.
Top significantly enriched phenotype categories identified by VLAD analysis, showing the number of genes from the set of 107 mouse orthologs shared among four out of five comorbidities annotated to each phenotype category. (Complete list of enriched phenotypes available supplemental table 2)
| Phenotype Category | Gene Count | P-value range |
|---|---|---|
| Innate immunity | 49 | 3.1×10e-43 |
| Inflammatory response | 66 | 7.0×10e-43 |
| Leukocyte Physiology | 65 | 5.7×10e-42 |
| Cytokine levels | 43 | 1.9×10e-41 |
| Susceptibility to infection | 48 | 9.3×10e-40 |
| Mononuclear cell morphology | 68 | 1.1×10-e39 |
| Phagocyte morphology | 52 | 3.8×10-e39 |
| Leukocyte number | 69 | 2.0×10e-40 |
| Leukocyte physiology | 66 | 8.2×10e-39 |
| Injury response | 54 | 2.3×10e-49 |
| Blood vessel morphology | 60 | 1.4×10e-39 |
| Professional antigen presenting cell morphology | 50 | 4.5×10e-40 |
Figure 1.The top 25 most significantly enriched pathways involving the eight genes shared among all five comorbidities.
Pathways that are similar or directly related in the Reactome knowledgebase are color coded. Yellow: lipoprotein-related processes; peach: scavenger receptor pathways; blue: blood clotting; green; retinoid-related pathways; grey; signaling through STAT3.
Figure 2.The top 25 most significantly enriched pathways involving the 123 genes shared among four out of five comorbidities.
Pathways that are similar or directly related in the Reactome knowledgebase are color coded. Yellow: adaptive immune system pathways; blue: interleukin signaling pathways; green; interferon signaling pathways.
Comparison of pathway enrichment false discovery rates (FDR) for pathways with FDR <=0.05 (shown in light red) for at least one of the comorbidity sets and for at least one of the COVID-19 related gene sets:
comorbid8 column displays FDR for comorbidity set for all five comorbidities; comorbid123 for comorbidity set for four of the five comorbidities; A549 for up regulated genes in host transcriptional response to SARS-CoV-2 in human A549 lung alveolar cells; Calu3 for up regulated genes in Calu3 human lung adenocarcinoma epithelial cells; NHBE for up regulated genes in normal human bronchial epithelium (NHBE) cells; postmortem for up regulated genes in post-mortem lung samples from COVID-19-positive patients; PMID:32437596 for up regulated angiogenesis and inflammation genes in lungs from patients who died from COVID-19; PMID:32217835 for genes that are overexpressed in severe compared to mild cases of Covid-19. The score column gives the number of sets with FDR meeting our criteria. Shaded in yellow are the pathways significantly enriched for both the comorbidity sets and several COVID-19 sets based on highest scores. The full table displaying 28 pathways meeting our criteria is given in supplemental material. (Complete list of enriched pathways available supplemental table 4; enriched pathway details for each gene set available supplemental tables 5–12)
| pw_id | pw_name | comorbid8 | comorbid123 | A549 | Calu3 | NHBE | postmortem | PMID:32437596 | PMID:32217835 | score | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R-HSA-168256 | Immune System | 3.133E-02 | 5.773E-15 | 8.450E-02 | 1.832E-14 | 1.485E-06 | 2.376E-14 | 1.343E-14 | 1.259E-02 | 7 | ||||
| R-HSA-1280215 | Cytokine Signaling in Immune system | 2.870E-02 | 5.773E-15 | 9.169E-03 | 1.832E-14 | 1.422E-11 | 2.376E-14 | 1.343E-14 | 5.990E-04 | 8 | ||||
| R-HSA-449147 | Signaling by Interleukins | 3.473E-02 | 5.773E-15 | 3.453E-04 | 6.084E-14 | 1.093E-10 | 4.014E-09 | 1.343E-14 | 4.840E-05 | 8 | ||||
| R-HSA-6783783 | Interleukin-10 signaling | 7.900E-02 | 5.773E-15 | 3.953E-11 | 1.832E-14 | 1.144E-14 | 7.138E-11 | 1.343E-14 | 1.340E-05 | 7 | ||||
| R-HSA-6785807 | lnterleukin-4 and Interleukin-13 signaling | 1.153E-02 | 5.773E-15 | 5.224E-05 | 6.880E-09 | 1.334E-05 | 2.228E-03 | 1.343E-14 | 4.840E-05 | 8 | ||||
| R-HSA-6783589 | lnterleukin-6 family signaling | 3.473E-02 | 8.082E-02 | 2.852E-01 | 2.485E-01 | 6.745E-02 | 6.313E-01 | 2.274E-02 | 1.601E-02 | 3 | ||||
| R-HSA-1059683 | lnterleukin-6 signaling | 3.220E-02 | 3.448E-02 | 2.852E-01 | 2.526E-01 | 6.745E-02 | 6.313E-01 | 5.788E-03 | 3 | |||||
| R-HSA-913531 | Interferon Signaling | 5.346E-02 | 5.773E-15 | 4.843E-01 | 1.832E-14 | 1.829E-02 | 2.376E-14 | 9.975E-02 | 4 | |||||
| R-HSA-909733 | Interferon alpha/beta signaling | 5.773E-15 | 1.832E-14 | 1.982E-03 | 2.376E-14 | 7.299E-02 | 4 | |||||||
| R-HSA-877300 | Interferon gamma signaling | 3.473E-02 | 5.773E-15 | 2.852E-01 | 4.546E-13 | 1.970E-08 | 5.761E-01 | 4 | ||||||
| R-HSA-168249 | Innate Immune System | 3.662E-01 | 4.366E-02 | 2.852E-01 | 4.483E-01 | 8.976E-01 | 3.656E-12 | 4.336E-02 | 3 | |||||
| R-HSA-109582 | Hemostasis | 4.045E-02 | 3.400E-03 | 2.852E-01 | 9.355E-01 | 6.313E-01 | 2.762E-09 | 3 | ||||||
| R-HSA-76002 | Platelet activation, signaling and aggregation | 1.903E-02 | 3.484E-05 | 2.852E-01 | 8.186E-01 | 6.313E-01 | 2.125E-11 | 3 | ||||||
| R-HSA-76005 | Response to elevated platelet cytosolic Ca2+ | 3.133E-02 | 9.286E-07 | 2.852E-01 | 4.692E-01 | 6.313E-01 | 1.848E-09 | 3 | ||||||
| R-HSA-114608 | Platelet degranulation | 3.133E-02 | 4.837E-07 | 2.852E-01 | 6.213E-01 | 6.313E-01 | 1.059E-09 | 3 | ||||||
| R-HSA-76009 | Platelet Aggregation (Plug Formation) | 4.940E-02 | 2.912E-02 | 2.852E-01 | 8.100E-01 | 6.313E-01 | 1.544E-02 | 3 | ||||||
Figure 3.Phenotype Enrichment for the Eight Genes Shared by All Five Comorbidities.
Cells shaded in yellow indicate that there is a mouse model in the MGI resource that has been studied for the specific phenotype.