| Literature DB >> 34975833 |
Chiranjib Chakraborty1, Ashish Ranjan Sharma2, Manojit Bhattacharya3, Hatem Zayed4, Sang-Soo Lee2.
Abstract
The COVID-19 pandemic has created an urgent situation throughout the globe. Therefore, it is necessary to identify the differentially expressed genes (DEGs) in COVID-19 patients to understand disease pathogenesis and the genetic factor(s) responsible for inter-individual variability. The DEGs will help understand the disease's potential underlying molecular mechanisms and genetic characteristics, including the regulatory genes associated with immune response elements and protective immunity. This study aimed to determine the DEGs in mild and severe COVID-19 patients versus healthy controls. The Agilent-085982 Arraystar human lncRNA V5 microarray GEO dataset (GSE164805 dataset) was used for this study. We used statistical tools to identify the DEGs. Our 15 human samples dataset was divided into three groups: mild, severe COVID-19 patients and healthy control volunteers. We compared our result with three other published gene expression studies of COVID-19 patients. Along with significant DEGs, we developed an interactome map, a protein-protein interaction (PPI) pattern, a cluster analysis of the PPI network, and pathway enrichment analysis. We also performed the same analyses with the top-ranked genes from the three other COVID-19 gene expression studies. We also identified differentially expressed lncRNA genes and constructed protein-coding DEG-lncRNA co-expression networks. We attempted to identify the regulatory genes related to immune response elements and protective immunity. We prioritized the most significant 29 protein-coding DEGs. Our analyses showed that several DEGs were involved in forming interactome maps, PPI networks, and cluster formation, similar to the results obtained using data from the protein-coding genes from other investigations. Interestingly we found six lncRNAs (TALAM1, DLEU2, and UICLM CASC18, SNHG20, and GNAS) involved in the protein-coding DEG-lncRNA network; which might be served as potential biomarkers for COVID-19 patients. We also identified three regulatory genes from our study and 44 regulatory genes from the other investigations related to immune response elements and protective immunity. We were able to map the regulatory genes associated with immune elements and identify the virogenomic responses involved in protective immunity against SARS-CoV-2 infection during COVID-19 development.Entities:
Keywords: COVID-19; differentially expressed genes; interactome mapping; protective immunity; transcriptome profiling
Mesh:
Year: 2021 PMID: 34975833 PMCID: PMC8714830 DOI: 10.3389/fimmu.2021.724936
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Outline of the workflow and diverse sample types of our entire study. (A) A brief workflow of our bioinformatics study. (B) Different diverse sample types of our complete study.
The summary of 15 human subjects (control and COVID-19 patients) datasets and study characteristics.
| Group | Accession | Source name | Cell type | Disease | Gender | Age |
|---|---|---|---|---|---|---|
|
| GSM5019817 | PBMC, HC | peripheral blood mononuclear cells (PBMCs) | healthy | male | 62 |
|
| GSM5019818 | PBMC, HC | peripheral blood mononuclear cells (PBMCs) | healthy | male | 56 |
|
| GSM5019819 | PBMC, HC | peripheral blood mononuclear cells (PBMCs) | healthy | male | 54 |
|
| GSM5019820 | PBMC, HC | peripheral blood mononuclear cells (PBMCs) | healthy | male | 71 |
|
| GSM5019821 | PBMC, HC | peripheral blood mononuclear cells (PBMCs) | healthy | female | 56 |
|
| GSM5019822 | PBMC, mild patient | peripheral blood mononuclear cells (PBMCs) | COVID-19 | male | 55 |
|
| GSM5019823 | PBMC, mild patient | peripheral blood mononuclear cells (PBMCs) | COVID-19 | male | 44 |
|
| GSM5019824 | PBMC, mild patient | peripheral blood mononuclear cells (PBMCs) | COVID-19 | male | 51 |
|
| GSM5019825 | PBMC, mild patient | peripheral blood mononuclear cells (PBMCs) | COVID-19 | male | 54 |
|
| GSM5019826 | PBMC, mild patient | peripheral blood mononuclear cells (PBMCs) | COVID-19 | female | 53 |
|
| GSM5019827 | PBMC, severe patient | peripheral blood mononuclear cells (PBMCs) | COVID-19 | male | 54 |
|
| GSM5019828 | PBMC, severe patient | peripheral blood mononuclear cells (PBMCs) | COVID-19 | male | 52 |
|
| GSM5019829 | PBMC, severe patient | peripheral blood mononuclear cells (PBMCs) | COVID-19 | male | 73 |
|
| GSM5019830 | PBMC, severe patient | peripheral blood mononuclear cells (PBMCs) | COVID-19 | male | 51 |
|
| GSM5019831 | PBMC, severe patient | peripheral blood mononuclear cells (PBMCs) | COVID-19 | male | 60 |
List of top ranking expressed genes of COVID-19 patients from the different experiments.
| Sl. No | Group name | Sample type | Assay | Gene expression | Reference |
|---|---|---|---|---|---|
| 1. | Xiong et al., 2020 | Bronchoalveolar lavage fluid | RNA library construction, high-throughput RNA sequencing | CXCL1, CXCL2, CXCL6, CXCL8, IL 33, CXCL10/IP-10, CCL2/MCP-1,CCL3/MIP-1A, CCL4/MIP1B | ( |
| 2. | Xiong et al., 2020 | Peripheral blood mononuclear cells | RNA library construction, high- throughput RNA sequencing | CXCL10, TNFSF10, TIMP1, C5, IL18, AREG, NRG1, IL10, ADA2, HK1, GAT1, PGD, PLA2G15, CTSD, GAA, LAIR1 | ( |
| 3. | Ziegler et al., 2020 | Lung lobe, Nasal polyps, ethmoid sinus surgical tissue, Ileum | Single-cell RNA-sequencing | IFNGR2, TRIM27, NT5DC1, ARL6IP1, IFNAR1, TMPRSS2, ACE2, TRIM28, APOA1, FABP6, ENPEP, STAT1, IFI6, IFITM1, GBP2, FI35, XAF1 | ( |
| 4. | Jain et al., 2020 | Nasopharyngeal swabs | Shotgun transcriptome sequencing of RNA | CXCL5, CXCL12, CCL2, CCL4, CXCL10, IFIH1, IFI44, IFIT1, IL6, IL10, CSF2, TNFSF11, TNFRSF11B, IL18R1, BMP2, BMP7, PDGFA, IFIT1B, C4BPA, CCR6, CCR22, CCR25, IL3RA, IL11, IL19, IL21RA, SERPINE1, SERPINF2 | ( |
Figure 2Visualization of identified DEGs using volcano plots. (A) DEG volcano plot using the data of control vs. mild COVID-19 patients. (B) DEG volcano plot using the data of severe COVID-19 patients vs. control healthy volunteers. (C) DEG volcano plot using the data of mild COVID-19 patients vs. severe COVID-19 patients. In this figure, red dots denote upregulated DEGs and blue dots denote downregulated DEGs.
Figure 3Visualization of identified DEGs using MD. (A) DEGMD plot using the data of control vs. mild COVID-19 patients. (B) DEGMD plot using the data of severe COVID-19 patients vs. control healthy volunteers. (C) DEGMD plot using the data of mild COVID-19 patients vs. severe COVID-19 patients. In this figure, red dots denote upregulated DEGs, and blue dots denote downregulated DEGs.
Figure 4Different types of statistical plots were developed from our study. (A) UMAP plot visualizing samples related to each other. (B) Venn diagram shows the groups’ common genes. (C) The box plot shows the distribution of the selected samples’ values for this study. (D) The expression density plot shows the distribution of values of the DEGs of the three groups. (E) The adjusted p-value histogram represents the p-value in the experiment (the top DEGs). (F) The moderated t-statistic q-q plot shows our DEGs data sample against the theoretical quantiles a Student’s t distribution. (G) The mean-variance trend plot shows the mean-variance relationship of the gene expression data.
Significantly upregulated protein-coding genes DEGs from three experimental human groups of our dataset.
| Sl. No. | Gene name | P-value | F-value |
|---|---|---|---|
| 1. | CERKL | 1.95e-10 | 135 |
| 2. | EIF4G1/EIF3G/EIF3E | 1.94e-09 | 97.9 |
| 3. | RPL18A | 2.14e-10 | 133.3 |
| 4. | EXOSC2/EXOCS5 | 2.86e-09 | 92.7 |
| 5. | STRN4 | 5.94e-10 | 115.7 |
| 6. | RPL3L/RPL35/RPL1BA/RPL19 | 1.37e-10 | 141.8 |
| 7. | RPS3/RPS16 | 2.14e-10 | 133.3 |
| 8. | SMTN | 2.03e-09 | 97.3 |
| 9. | FGF1 | 3.73e-10 | 123.4 |
| 10. | PPP1R12A | 2.63e-09 | 93.8 |
| 11. | CNNM2 | 9.07e-10 | 109 |
| 12. | AP2M1 | 1.68e-09 | 100 |
| 13. | EDN1 | 4.12e-10 | 121.7 |
| 14. | ARHGEF1 | 2.21e-12 | 248.8 |
| 15. | DUS1L | 1.49e-09 | 101.7 |
| 16. | RBM5 | 3.73e-10 | 123.4 |
| 17. | MPHOSPH6 | 2.99e-09 | 92.2 |
| 18. | SKIV2L2 | 2.86e-09 | 92.7 |
Figure 5Significantly expressed genes from our experiments and other experiments. (A) The percentage of significantly expressed protein-coding genes and lncRNAs. (B) Total no. of top-ranking protein-coding genes from our study and other studies.
Figure 6The protein interactome map constructed using protein-coding DEGs from our study and other studies. (A) The protein interactome map with protein-coding genes from the 250 DEGs in our research. (B) The protein interactome map of the central cluster from our study was identified from (A). (C) The protein interactome map from Xiong et al. study where the samples were the bronchoalveolar lavage fluid. (D) The protein interactome map with the top-ranked genes of the Xiong et al. study from the PBMC samples. (E) The protein interactome map with data from the Ziegler et al. study where the samples were collected from the lung lobe, nasal polyps, ethmoid sinus surgical tissue, and ileum. (F) The protein interactome map with the top-ranked genes of the Jain et al. study that used nasopharyngeal swab sample.
Figure 7The PPI network constructed using protein-coding DEGs from our study and other studies. (A) The PPI network with significant protein-coding DEGs in our research. (B) The PPI network from Xiong et al. study where the samples were the bronchoalveolar lavage fluid. (C) The PPI network with the top-ranked genes of the Xiong et al. study from the PBMC samples. (D) The PPI network with data from the Ziegler et al. study and the gene expression data was collected from the lung lobe, nasal polyps, ethmoid sinus surgical tissue, and ileum. (E) The PPI network with the top-ranked protein-coding genes used a nasopharyngeal swab sample (the Jain et al. study).
Figure 8The enrichment cluster analysis of the PPI network using protein-coding DEGs from our study and other studies. (A) The enrichment cluster analysis of the PPI network with significant protein-coding DEGs in our research. (B) The enrichment cluster analysis of the PPI network from Xiong et al. study where the samples were the bronchoalveolar lavage fluid. (C) The enrichment cluster analysis of the PPI network with the Xiong et al. study’s top-ranked genes from the PBMC samples. (D) The enrichment cluster analysis of the PPI network with data from the Ziegler et al. study where the samples were collected from the lung lobe, nasal polyps, ethmoid sinus surgical tissue, and ileum. (E) The enrichment cluster analysis of the PPI network with the top-ranked protein-coding genes of the Jain et al. study that used nasopharyngeal swab sample.
Figure 9The functional pathway enrichment analysis in COVID categories using protein-coding DEGs from our study and other studies. (A) Functional pathway enrichment analysis in COVID-19 categories from the 250 DEGs of our study. (B) Functional pathway enrichment analysis in COVID-19 categories with the top-ranked genes of the Xiong et al. study that used bronchoalveolar lavage fluid sample. (C) Functional pathway enrichment analysis in COVID-19 categories with the top-ranked genes of the Xiong et al. study that used PBMC samples. (D) Functional pathway enrichment analysis in COVID-19 types with the Ziegler et al. study. (E) Functional pathway enrichment analysis in COVID-19 categories with the Jain et al. study.
Figure 10The functional pathway enrichment analysis in transcription factor targets using protein-coding DEGs from our study and other studies. (A) Functional pathway enrichment analysis in transcription factor targets from the 250 DEGs of our study. (B) Functional pathway enrichment analysis in transcription factor targets with the top-ranked genes of the Xiong et al. study that used bronchoalveolar lavage fluid sample. (C) Functional pathway enrichment analysis in transcription factor targets with the top-ranked genes of the Xiong et al. study that used PBMC samples. (D) Functional pathway enrichment analysis in transcription factor targets with the top-ranked genes of the Ziegler et al. study. (E) Functional pathway enrichment analysis in transcription factor targets with the top-ranked genes of the Jain et al. study.
Significantly upregulated lncRNA genes from DEGs between three experimental human groups of our dataset.
| Sl. No. | Gene name | P- value | F -value |
|---|---|---|---|
| 1. | LOC101929613 | 2.60e-15 | 612.8 |
| 2. | LOC105370401 | 6.88e-15 | 538.5 |
| 3. | TMEM252-DT | 2.91e-11 | 175.4 |
| 4. | HOXC13-AS | 3.12e-11 | 173.7 |
| 5. | SEMA3B-AS | 4.59e-11 | 164.8 |
| 6. | MIR210HG | 5.25e-11 | 161.8 |
| 7. | DLEU2 | 5.71e-11 | 159.9 |
| 8. | MEF2C-AS2 | 6.53e-11 | 157 |
| 9. | LINC01639 | 3.58e-10 | 124.1 |
| 10. | SNHG20 | 6.75e-10 | 113.6 |
| 11. | SLC25A48-AS1 | 3.86e-10 | 122.8 |
| 12. | TALAM1 | 3.88e-10 | 122.8 |
| 13. | LOC105370619 | 4.66e-10 | 119.6 |
| 14. | UICLM | 4.96e-10 | 118.6 |
| 15. | NRSN2-AS1 | 5.49e-10 | 116.9 |
| 16. | GNAS | 7.29e-10 | 112.4 |
| 17. | LINC02612 | 1.32e-09 | 103.5 |
| 18. | LINC02582 | 1.46e-09 | 102 |
| 19. | LINC02000 | 1.47e-09 | 101.9 |
| 20. | LINC01393 | 1.48e-09 | 101.7 |
| 21. | LINC01191 | 2.19e-09 | 96.3 |
| 22. | N4BP2L2 | 2.71e-09 | 93.4 |
| 23. | LINC01920 | 2.78e-09 | 93.1 |
| 24. | CASC18 | 3.14e-09 | 91.5 |
Figure 11The co-expression networks of the protein expression genes of DEG and lncRNA.
Figure 12The genes associated with the immune response elements and protective immunity from DEGs.
Annotated genes related to the activation of immune cells and components as well as for protective immunity function from our analysis and other experiments.
| Sl. No. | Gene name | Remark | Ref. |
|---|---|---|---|
|
| |||
| 19. | RPL18A | Activation role of T cell proliferation | ( |
| 20. | EDN1 | Involve in TLR4 responses | ( |
| 21. | ARHGEF1 | Antigen-specific antibody production/humoral immune response | ( |
|
| |||
| 1. | CXCL1 | Regulation of IL-1β level within tissue | ( |
| 2. | CXCL2 | Self-regulated neutrophil recruitment and function | ( |
| 3. | CXCL6 | Act as potent pro-inflammatory neutrophil chemoattractant and activator component | ( |
| 4. | CXCL8 | Synthesis of IL-8 and important role in systemic inflammatory response syndrome | ( |
| 5. | IL33 | Synthesis of intracellular IL-33 may play role in pro-inflammatory signaling | ( |
| 6. | CXCL10 | Regulator of the interferon response, specially attracts activated T lymphocytes | ( |
| 7. | MCP-1 | Activation and migration of leukocytes | ( |
| 8. | IP-10 | Secretion of cytokines | ( |
| 9. | CCL3 | Induction of antigen-specific T cell responses | ( |
| 10. | CCL4 | Activation of antigen-presenting cells and B cells | ( |
| 11. | TNFSF10 | Role in adaptive immune system | ( |
| 12. | TIMP1 | Stimulates the immune response in lung cells | ( |
| 13. | C5 | Protease function as membrane attack complex (MAC) | ( |
| 14. | IL18 | Regulating the T helper responses and stimulating interferon gamma production | ( |
| 15. | NRG1 | Regulatory role in neuroinflammation | ( |
| 16. | IL10 | Enhance the B cell survival, proliferation, and antibody production | ( |
| 17. | ADA2 | Regulation of immune cells (neutrophils, monocytes, NK cells and B cells) activation and survival | ( |
| 18. | GAT1 | Decreases T cell proliferation | ( |
| 19. | LAIR1 | Regulates the inhibition of NK cell–mediated cytotoxicity | ( |
|
| |||
| 1. | IFNGR2 | Regulation of NK cell activity and B cell function | ( |
| 2. | TRIM27 | Lowering the function of IFN and pathogen-recognition receptors | ( |
| 3. | TMPRSS2 | Neutralizing antibodies by protease activity | ( |
| 4. | TRIM28 | Regulation of IFN-β, IFN-γ and cytokine expression in infected lung cells | ( |
| 5. | APOA1 | Involved in inflammatory and immune response regulation | ( |
| 6. | STAT1 | Maturation, stability of cytotoxic and helper T cells | ( |
| 7. | IFI6 | It delays type I interferon-induced apoptosis in cells | ( |
| 8. | IFITM1 | Regulate the CD4+ T helper cell differentiation | ( |
| 9. | GBP2 | Innate immune functions against intracellular pathogens | ( |
| 10. | XAF1 | Helps in IFN-β-induced apoptosis | ( |
|
| |||
| 25. | CXCL5 | Encodes receptor protein to recruit neutrophils | ( |
| 26. | CXCL12 | Coded protein paly role in immune surveillance, inflammation response. | ( |
| 27. | CCL2 | Activation and migration of leukocytes | ( |
| 28. | CCL4 | Activation of antigen-presenting cells and B cells | ( |
| 29. | IFIH1 | Involved in immune response and antiviral activity | ( |
| 30. | IFIT1 | Encoded protein may inhibit viral replication and translational initiation | ( |
| 31. | IL6 | Encoded cytokines functions in inflammation and the maturation of B cells | ( |
| 32. | IL10 | It lowering the expression of Th1 cytokines, MHC class II Ags, and costimulatory effects on macrophages | ( |
| 33. | CSF2 | It controls the production, differentiation, and function of granulocytes and macrophages | ( |
| 34. | TNFSF11 | Regulation of T cell dependent immune response | ( |
| 35. | BMP2 | It regulate thymic T cell development, maintain TR cell | ( |
| 36. | C4BPA | It controls the activation of the complement cascade | ( |
| 37. | CCR6 | Regulate the migration and recruitment of dendritic and T cells | ( |
| 38. | IL11 | Stimulate the T-cell-dependent development of immunoglobulin producing B cells | ( |
| 39. | IL19 | Encoded cytokine induces the expression of IL6 and TNF-alpha and helps in inflammatory responses | ( |