| Literature DB >> 35003208 |
Fatma Alqutami1, Abiola Senok1, Mahmood Hachim1,2.
Abstract
Background: To develop anti-viral drugs and vaccines, it is crucial to understand the molecular basis and pathology of COVID-19. An increase in research output is required to generate data and results at a faster rate, therefore bioinformatics plays a crucial role in COVID-19 research. There is an abundance of transcriptomic data from studies carried out on COVID-19, however, their use is limited by the confounding factors pertaining to each study. The reanalysis of all these datasets in a unified approach should help in understanding the molecular basis of COVID-19. This should allow for the identification of COVID-19 biomarkers expressed in patients and the presence of markers specific to disease severity and condition. Aim: In this study, we aim to use the multiple publicly available transcriptomic datasets retrieved from the Gene Expression Omnibus (GEO) database to identify consistently differential expressed genes in different tissues and clinical settings. Materials andEntities:
Keywords: COVID-19; SARS – CoV – 2; atlas; differentially expressed gene analysis; omics analyses
Year: 2021 PMID: 35003208 PMCID: PMC8727884 DOI: 10.3389/fgene.2021.755222
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Work flow chart for identifying datasets, R analysis and identifying common differential genes and pathway enrichment analysis (BioRender (2021). Created, 2021).
List of Data sets.
| GEO accession number | GEO study title | PMID | Source of samples | Number of samples |
|---|---|---|---|---|
| GSE151764 | Two distinct immunopathological profiles in lungs of lethal COVID-19 | 33033248 | Lung tissue | 56 |
| GSE163426 | COVID-19 ARDS is characterized by a dysregulated host response that differs from cytokine storm and is moderated by dexamethasone | 34446707 | Tracheal aspirates | 52 |
| GSE156063 | Upper airway gene expression differentiates COVID-19 from other acute respiratory illnesses and reveals suppression of innate immune responses by SARS-CoV-2 | 33203890 | Clinical naso/pharyngeal swabs | 234 |
| GSE149273 | RV infections in asthmatics increase ACE2 expression and stimulate cytokine pathways implicated in COVID19 | 32649217 | Nasal Tissue | 90 |
| GSE163151 | A diagnostic host response biosignature for COVID-19 from RNA profiling of nasal swabs and blood | 33536218 | Nasal Swabs Whole blood | 404 |
| GSE152641 | Transcriptomic Similarities and Differences in Host Response between SARS-CoV-2 and Other Viral Infection | 33437935 | Whole Blood | 86 |
| GSE151161 | Blocking of the CD80/86 axis as a therapeutic approach to prevent progression to more severe forms of COVID-19 | 34075090 | Whole Blood | 79 |
| GSE152418 | Systems biological assessment of immunity to severe and mild COVID-19 infections | 32788292 | PBMC | 34 |
| GSE157103 | Large-scale Multi-omic Analysis of COVID-19 Severity | 33096026 | Leukocytes from whole blood | 126 |
FIGURE 2Summary of the different types of analysis carried out (BioRender (2021). Created, 2021).
List of genes that are differently expressed in GSE151764.
| Condition | List of genes |
|---|---|
| COVID-19 | IFI6, OAS3, OAS1, CRTAM, TNFRSF9 |
| COVID-19 without any pre-existing lung conditions | CCR7 |
| COVID-19 with pre-existing lung conditions | IFIT3, GBP1, CRTAM, OAS1, IFI44L, IFI6, OAS3, IFIT2, CXCL11, MX1, ISG15, |
| IFIT1, CXCL10, OAS2, TNFSF18, TNFRSF9, LAG3, HLA-G, MS4A1, CD274, GZMK, NCR3, FCGR2B, IL10 | |
| COVID-19 in non-smokers | MS4A1, HLA-DOB, CCR7, CRTAM |
| COVID-19 in smokers | CRTAM, IFI44L, IFIT3, IFI6, OAS1, OAS3, NCR3, MX1, GBP1, IL10, IFIT2, TNFSF18, ISG15, IFIT1, CXCL11, CXCL10, TNFRSF9, HLA-DPA1 |
| COVID-19 without diabetes | MS4A1, CRTAM |
| COVID-19 with diabetes | IFI27, CXCL11, HLA-DOB, IFI6, OAS3, ISG15, IFIT2, MX1, OAS1, IFIT3, IFIT1, TNFSF14, CRTAM, IL10, IDO1, GBP1, IFI44L, CXCL10, IL1A, IL23A |
List of differentially expressed genes in GSE163426.
| Condition | List of genes |
|---|---|
| COVID-19 induced ARDS | HSPA14, GBP5, PATL2, FHOD3, PHF11 |
| Non-COVID-19 induced ARDS | ALPK3 |
| Control (Mechanical ventilation) | MEGF6, H2AW |
FIGURE 6(A) Differentially expressed genes in ARDS caused by COVID-19. And (B) enrichment of COVID-19 induced ARDS genes reveal that most genes are linked to protein regulation.
List of differentially expressed genes in GSE156063.
| Condition | List of genes |
|---|---|
| Other non-viral acute respiratory illness | IFI6, IFI44L, IFI44, BST2, EPSTI1, IFIT1, ISG15, USP18, IFIT3, CXCL11, OAS3, SIGLEC1, SERPING1, CXCL10, CMPK2, CXCL9, KLHDC7B, RSAD2, IFITM1, CCDC194, OASL, LAG3, IFIT2, UBD |
| Viral infections | CCL3, IL1RN, SERPINB9, PLEK, TAGAP, PSTPIP2, CCL4, SOCS1, SLAMF7, GBP2, CD274, SH2B2, PIK3AP1, IL1B, CCRL2, GSTA2, FGR, CCL3L3, IFITM2, ICAM1, HCAR3, P2RY14, GPR84, GBP5, CSF2RB, ANKRD66, APOBEC3A, TNFRSF10C, ACOD1, LILRB2, CD69, CD53, FPR2, CD300E, CCR1, HK3, CCL4L2, CREB5, SLC25A47, LILRA5, PDE4B, H2AC18, CSF3R, LCP2, TNFAIP6, G0S2, C5orf58, CASP5, CCL2, MAP6, FPR1, FCGR3B, IL18RAP, SAMSN1, DYSF, TNFRSF1B, GLT1D1, SEC14L3, IL27, BCL2A1, HRH2, AQP9, OSM, TREM1, CXCR1, CLEC4D, SIGLEC14, HSPA6, SCGB1A1, SIGLEC5, PROK2 |
FIGURE 7(A) Number of genes differentially expressed genes between the different types of upper airway infections. And the enriched terms for genes that are differentially expressed in (B) all types of respiratory diseases.
List of differentially expressed genes in GSE149273.
| Condition | List of genes |
|---|---|
| RVC-15 | GIMAP2 |
| RVA-16 | DHX58, STAT2, NMI, LAP3, SCO2, TOR1B, STARD5, ZNFX1, PSMB9, TRIM69, NUPR1, APOL2, C17orf67, NCOA7, HESX1, TNFRSF6B, PPM1J, BIRC3, STX11, CEACAM1, IL15RA, TLR2, IL19, OR52N4, PDGFRL, GBP2, MUC13, CXCL8, IL7R, PLAUR, RIMS2, BATF3, CXCL3, IL36G, CASP5, OR51B5, PLCL1, FXYD6, PDCD1LG2, CXCL2, CCL3, SLC26A4, LINC01208, PTGS2, TNFAIP6, DEFB4A, DEFB4B, CHRNA1, CCL2 |
| Control | MX2, CMPK2, IRF7, RSAD2, OAS3, DDX60L, MX1, IFIT1, USP18, TRIM25, IFIT5, STAT1, OAS2, OAS1, XAF1, IFIT2, DDX58, EPSTI1, PNPT1, IFIT3, OASL, SAMD9L, HERC5, LAMP3, HERC6, IFI35, RTP4, PARP12, PARP9, ISG15, UBE2L6, HSH2D, C19orf66, IFI44, TRANK1, IFIH1, SP110, PLSCR1, IFITM1, DDX60, PARP10, PARP14, TRIM21, ISG20, ETV7, NLRC5, SLC25A28, IFI44L, TRIM22, RNF213, SAMD9, NT5C3A, PML, GBP4, TAP1, TNFSF10, PLEKHA4, LMO2, GBP1, APOL6, EIF2AK2, HELZ2, CXCL10, BST2, TYMP, GMPR, IFITM3, BATF2, SECTM1, IFI6, APOBEC3G, WARS, AIM2, EXOC3L1, BCL2L14, LIFR, GBP5, SLC15A3, ZBP1, FBXO39, HSPB9, PIK3AP1, RASGRP3, MYH7, MLKL, APOBEC3A, GBP1P1, DUOX2, RUFY4, DUOXA2, FGD2, USP30-AS1, TNFSF13B, CCL5, HRASLS2, GRIP2, IFI27, RBM11, MAB21L2, CD34, HTR2B, SUSD3, SOCS1, CARD17, IL4I1, CD38, TAGAP, LRRN2, CD83, LAG3, SMTNL1, CXCL11, IDO1, CSAG3, CD7, TMEM171, 04-Sep, PLVAP, CSF3, KLHDC7B, NCF1, PTPRR, CD274, CCL20, IFNB1, CXorf49, CXorf49B, NCF1B, XXYLT1-AS2, TNIP3, LINC00890, OR52K2, ART3, RORB, SP140, THEMIS2, PAX5, CCL4, PCDH17, FRMD3, IL17C, LGALS17A, NCF1C, CD69, ZFPM2, TNF, ANGPTL1, IFNL3, EXOC3L4, IFNL1, CLCA3P, FGF2, CYP21A2, SLCO5A1, CACNA1I, SERPINB9P1, CXCL9, IFNL2, ATP10A, IL36A, BCL2A1, NOS2, C3AR1, IL6, TRIM31, WISP1 |
FIGURE 10Common DEGs between controls, COVID-19 patients, and patients from viral or nonviral acute respiratory illness from (A) nasopharyngeal swabs and (B) whole blood samples.
List of genes that are differentially expressed in GSE152418.
| Condition | List of genes |
|---|---|
| Moderate | OTOF, TNFRSF10C, HK3, AXL |
| Severe | SAMD14, CMTM5, PLXNB3, MYL9 |
| ICU | TGFB1I1, CMTM5, MYL9, SAMD14, CETP, TTC7B, NT5M, MSRB3, EGFL7, ARHGAP6, MMRN1, SPX, LNCAROD, CDC42BPA, XK, TREML1, SMOX, PLOD2, GSTM5, PLXNB3, MFSD2B, CAV2, CLU, ITGA2B, SCN1B, TTLL7, PROS1, SELP, PF4V1, PCSK6, ITGB5, NRGN, ABLIM3, SPTB, FAM20A, CALD1, MPIG6B, PCOLCE2, EGF, WFDC1, PTGER3, GP9, AC015912.3, MGLL, LCN2, PF4, AQP10, LY6G6F, TFPI, FSTL1, ELOVL7, ANKRD9, PEAR1, CTTN, BEX3, GP6, VWF, ANXA3, SPARC, LY6G6E, MAOB, LINC01151, ACRBP, CLEC1B, GP1BA, PPBP, ALOX12, LTBP1, PRKAR2B, ITGB3, PTPRF, CTDSPL, FAXDC2, MYLK, WASF3, VEGFC, MAP1A, EHD3, TNNC2, GAS2L1, PTGS1, MMP8, GNAZ, KREMEN1, AL162424.1, ARHGEF12, ITGA1, PCYT1B, MED12L, VSIG2, CLDN5, PGAM1P8, RETN, AL450468.2, ESAM, METTL7B, CREB3L1, HLX, TRIM7, JAM3, PRTFDC1, WASF1, MAFB, SH3BGRL2, ITGA7, F2RL3, VEPH1, WDFY3-AS2, TAL1, F13A1, AC090409.1, S100A8, SAPCD2, KAZN, GUCY1B1, LINC01503, SLC25A37, TLCD4, FAM20C, S100A9, S100A12, GJA4, HTRA1, KRT80, LHFPL6, LYVE1, TREML3P, AC215522.2, GRB10, PLPP3, SLC18A2, ASGR2, VSIG4, MGST1, TRAJ32, AC092490.1, ADAM9, STAB1, MCEMP1, THBS1, DYSF, ZNF185, NIBAN2, FLVCR2, PLBD1, AQP9, FPR2, PTAFR, DLC1, VNN3, CYP1B1-AS1, VNN1, ACSL1, SIRPA, SERINC2, CYP1B1 |
Number of differentially expressed genes in each analysis.
| First level: Control vs COVID-19 | First level: Control vs COVID-19 | First level: Control vs COVID-19 |
|---|---|---|
|
|
|
|
|
|
|
|
| Control ICU: 2 genes | Control Ventilation: 12 genes | Control non-ICU and non-ventilation: 491 genes |
| Control Non ICU: 308 genes | Control non-Ventilation: 179 genes | COVID-19 non-ICU and ventilation: 2 genes |
| COVID-19 ICU: 431 genes | COVID-19 Ventilation: 394 genes | COVID-19 non-ICU and non-ventilation: 11 genes |
| COVID-19 non-ICU: 34 genes | COVID-19 non-Ventilation: 13 genes | COVID-19 ICU and ventilation: 377 genes |
| — | — | COVID-19 ICU and non-ventilation: 1 gene |
FIGURE 3Intersecting the differential expressed genes of (A) COVID-19 with pre-existing conditions, no smoking history, and no diabetes. (B) COVID-19 with pre-existing conditions, diabetes, and a smoking history.
FIGURE 4Gene term enrichment analysis of the DEGs for COVID-19 in GSE151764 reveals that most genes are involved in the cytokine signaling of the immune system.
FIGURE 5Gene term enrichment against COVID-19 reference list for the DEGs in COVID-19 reveal that the reactome gene sets for cytokine signaling are involved in COVID-19 prognosis.
FIGURE 8Heatmap of enriched genes for COVID-19 related pathways in upper airway infections.
FIGURE 9(A) Gene term enrichment for the differential expression genes in RVA-16 and RVC-15 (B) Gene term enrichment for COVID-19 pathways.
List of genes that are differentially expressed in COVID-19 analyzed from GSE163151.
| COVID-19 (nasopharyngeal swabs) | IFI6, IFI44, IFIT3, IFIT2, AMH, EIF2AK2, GBP1, IFI27, KRT5, GBP5, CXCL10, HS1BP3, ADCK2, ZBP1, SNORA61, HSF2, RNVU1_8, COL5A1, GATA5, DCLK2 |
| COVID-19 (common between nasopharangeal swabs and whole blood) | AMH, KRT5, COL5A1, GATA5, DCLK2 |
FIGURE 11(A) From the COVID-19 DEGs, 58 genes are common between nasopharyngeal swabs and whole blood samples (B) five genes are common from the COVID-19 specific DEGs.
FIGURE 12Pathway enrichment for the 58 genes that are common in COVID-19 patients between nasopharyngeal swabs and whole blood samples.
List of differentially expressed genes in GSE152641.
| Upregulated in COVID-19 | PPP1R14A, H2BC6, RRM2, IFI27, LY6E, H2AC4, SERPING1, IL22RA2, H3C8, FI44, ACOT2, RFFL, PARP6, H3C12, SPDYC, CNTNAP5, CLEC2L, DNTTIP1, IFNL4, RFPL4AL1, MED19, CHEK2, LRIT2, C12orf65, C1QC, MLNR, DBX2, RAB9B, PDZD8, GABRQ, NCKIPSD, ZFC3H1, CAV1, PPARG, ITGA7, LINC00460, HP, MINK1, MMP8, ZNRF1, LINC01442, IGFBP2, UCHL1, PAX3, KDM2B, SIGLEC1, RUNDC1, GPBAR1, LYRM4, OTOF, SDC1, PPP1R3G, C16orf87, SWI5, SPTLC3, ADAMTS2, OLFM4, CCL8, PRTN3, DEFA3, ADARB2 |
| Down Regulated in COVID-19 | FCER1A, OR10R2, GTPBP2, ALOX15, PACSIN2, SIGIRR, CHST9, TMEM97, GPR15, NOG, FDCSP, GSTM1, TUBB2A, GATD3A, TNS1, DISC1, SMIM13, HBG2 |
FIGURE 13List of gene terms enrichment for the upregulated COVID-19 genes in GSE152641.
FIGURE 14Gene enrichment analysis of (A) regular pathways is showing that the most common pathway is that of hemostasis and exocytosis and are all linked to cell differentiation. While in (B) COVID-19 gene lists, they are followed with angiogenesis, phagocytosis, and several immune responses.
FIGURE 15(A) Intersecting differentially expressed genes in ICU patients from two different studies (GSE152418 & GSE157103) showed that there are nine genes commonly differentiated in ICU patients (B) and these genes are involved in ECM organization and immune response regulation when upon enrichment.
FIGURE 16(A) Number of genes that are common between COVID-19 patients regardless of use of ventilation or not (B) Common genes are mostly involved in GO biological processes for cell division.
FIGURE 17(A) enrichment analysis for the 387 ventilation only genes reveal most genes are involved in nuclear cell division. And (B) COVID-19 lists reveal that most genes are involved in the immune system.
List of pathways common between multiple datasets.
| Pathway | Number of datasets pathway was enriched in |
|---|---|
| R-HSA-909733 Interferon alpha/beta signaling | 2 |
| R-HSA-1280215 Cytokine Signaling in Immune system | 3 |
| GO:0009617 response to bacterium | 3 |
| GO:0002274 myeloid leukocyte activation | 2 |
| GO:0035455 response to interferon-alpha | 2 |
| GO:0045055 regulated exocytosis | 2 |
| GO:0050900 leukocyte migration | 2 |
| M5885 NABA MATRISOME ASSOCIATED | 2 |
List of COVID-19 pathways common between multiple datasets.
| Pathway | Number of datasets pathway was enriched in |
|---|---|
| GO:0002697 regulation of immune effector | 2 |
| GO:0001817 regulation of cytokine production | 3 |
| GO:0034341 response to interferon-gamma | 3 |
| R-HSA-1169410: Antiviral mechanism by INF-stimulated genes | 2 |
| GO:0009617 response to bacterium | 4 |
| R-HSA-1280215 Cytokine Signaling in Immune system | 3 |
| GO:0045088 regulation of innate immune response | 2 |
| GO:0035455 response to interferon-alpha | 2 |
| R-HSA-913531 Interferon Signaling | 2 |
| GO:0002683 negative regulation of immune system process | 2 |
| GO:0032103 positive regulation of response to external stimulus | 2 |
| GO:0050900 leukocyte migration | 2 |
| GO:0006898 receptor-mediated endocytosis | 2 |
| R-HSA-1474244: Extracellular matrix organization | 2 |