| Literature DB >> 35601483 |
Dongyuan Wu1, Runzhi Zhang1, Susmita Datta1.
Abstract
Due to the COVID-19 pandemic, the global need for vaccines to prevent the disease is imperative. To date, several manufacturers have made efforts to develop vaccines against SARS-CoV-2. In spite of the success of developing many useful vaccines so far, it will be helpful for future vaccine designs, targetting long-term disease protection. For this, we need to know more details of the mechanism of T cell responses to SARS-CoV-2. In this study, we first detected pairwise differentially expressed genes among the healthy, mild, and severe COVID-19 groups of patients based on the expression of CD4+ T cells and CD8+ T cells, respectively. The CD4+ T cells dataset contains 6 mild COVID-19 patients, 8 severe COVID-19 patients, and 6 healthy donors, while the CD8+ T cells dataset has 15 mild COVID-19 patients, 22 severe COVID-19 patients, and 4 healthy donors. Furthermore, we utilized the deep learning algorithm to investigate the potential of differentially expressed genes in distinguishing different disease states. Finally, we built co-expression networks among those genes separately. For CD4+ T cells, we identified 6 modules for the healthy network, 4 modules for the mild network, and 1 module for the severe network; for CD8+ T cells, we detected 6 modules for the healthy network, 4 modules for the mild network, and 3 modules for the severe network. We also obtained hub genes for each module and evaluated the differential connectivity of each gene between pairs of networks constructed on different disease states. Summarizing the results, we find that the following genes TNF, CCL4, XCL1, and IFITM1 can be highly identified with SARS-CoV-2. It is interesting to see that IFITM1 has already been known to inhibit multiple infections with other enveloped viruses, including coronavirus. In addition, our networks show some specific patterns of connectivity among genes and some meaningful clusters related to COVID-19. The results might improve the insight of gene expression mechanisms associated with both CD4+ and CD8+ T cells, expand our understanding of COVID-19 and help develop vaccines with long-term protection.Entities:
Keywords: COVID-19; RNA-seq; T cell; deep learning; differential expression; functional annotation; gene co-expression network
Year: 2022 PMID: 35601483 PMCID: PMC9114762 DOI: 10.3389/fgene.2022.871164
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Workflow of the study. The analysis primarily focused on two main datasets, and two supplemental datasets were included to enrich the findings (A) Differential Expression analysis to obtain the differentially expressed (DE) genes among different disease states (B) Deep learning for validating the DE genes (C) Network analysis comparing the connectivity of genes in different disease states (D) Functional annotation.
FIGURE 2Venn diagrams for DE genes (A) Pairwise comparisons of the healthy, mild, and severe groups for CD4+ and CD8+ T cells (B) Comparison of the mild vs healthy group for CD4+ T cells, CD8+ T cells, and B cells.
FIGURE 3Volcano plots for DE genes. The thresholds of the absolute is 0.25 (A) Comparisons based on CD4+ T cells. DE genes DDX3Y and EIF1AY have been removed from the mild vs healthy plot and the severe vs healthy plot because of their infinite (B) Comparisons based on CD8+ T cells.
Results of the prediction of the selected MLP models. H, M, and S indicate the healthy, the mild and the severe COVID-19 groups, respectively.
| Cell Type | Comparison | Stacked accuracy | Stacked sensitivity | Stacked specificity | Stacked AUC |
|---|---|---|---|---|---|
| CD4+ T cells | M vs. H | 0.9066 | 0.9087 | 0.8944 | 0.9015 |
| S vs. H | 0.9783 | 0.9870 | 0.8100 | 0.8985 | |
| S vs. M | 0.8070 | 0.8397 | 0.7003 | 0.7700 | |
| CD8+ T cells | M vs. H | 0.9827 | 0.9904 | 0.9611 | 0.9757 |
| S vs. H | 0.9757 | 0.9866 | 0.8468 | 0.9167 | |
| S vs. M | 0.7595 | 0.8521 | 0.3701 | 0.6111 | |
| B cells | M vs. H | 0.9775 | 0.9857 | 0.9694 | 0.9775 |
FIGURE 4PIDC networks for CD4+ T cells. Node size and edge width represent the average expression level of the gene and the confidence of connectivity between a pair of genes in the network, separately. Red nodes are hub genes in the corresponding modules.
FIGURE 5PIDC networks for CD8+ T cells. Node size and edge width represent the average expression level of the gene and the confidence of connectivity between a pair of genes in the network, separately. Red nodes are hub genes in the corresponding modules.
Hub genes in each module of each network.
| Cell Type | Network | Module | Total number of genes | Hub gene |
|---|---|---|---|---|
| CD4+ T Cells | Healthy | Module 1 | 31 | RPS21 |
| Module 2 | 25 | CD40LG, IL2, NFKBID, TNF | ||
| Module 3 | 7 | IL4I1 | ||
| Module 4 | 15 | GBP5 | ||
| Module 5 | 13 | MT-CO1 | ||
| Module 6 | 5 | INSIG1 | ||
| Mild COVID-19 | Module 1 | 53 | CCL4ac, CCL5ac, GZMB | |
| Module 2 | 11 | GBP1 | ||
| Module 3 | 44 | RPS28ac, MTRNR2L12ac, RPL39ac | ||
| Module 4 | 7 | HLA-Eac, MIR4435-2HGac | ||
| Severe COVID-19 | Module 1 | 63 | RPL34 | |
| CD8+ T Cells | Healthy | Module 1 | 38 | XCL1ab, XCL2ab |
| Module 2 | 15 | MT-ND3ab | ||
| Module 3 | 18 | CD27ab | ||
| Module 4 | 17 | IFIT3ab, IFITM1 | ||
| Module 5 | 6 | ACTG1 | ||
| Module 6 | 23 | RPS25, RPS15A | ||
| Mild COVID-19 | Module 1 | 74 | CRTAM | |
| Module 2 | 15 | MT-CYBac, MT-ND4L | ||
| Module 3 | 15 | IFIT3 | ||
| Module 4 | 19 | RPS25, RPS28 | ||
| Severe COVID-19 | Module 1 | 79 | RPS15A | |
| Module 2 | 21 | IFITM1 | ||
| Module 3 | 5 | - |
Significant differential connection between the healthy network and the mild COVID-19 network.
Significant differential connection between the healthy network and the severe COVID-19 network.
Significant differential connection between the mild COVID-19 network and the severe COVID-19 network.
Significant differential connection in any pairwise comparison of three networks.
FIGURE 6The changes of GBP4 and GBP5 in gene expression over time from the bulk RNA-seq data. The FDR adjusted -values of the overall difference across three stages (using R package DESeq2) for GBP4 and GBP5 are 0.00156 and 0.00111, separately.
The main related GO biological process of each module of each network.
| Cell Type | Network | Module | Number of genes | Main related biological processa |
|---|---|---|---|---|
| CD4+ T Cells | Healthy | Module 1 | 31 | Viral transcription; Translational initiation |
| Module 2 | 25 | Cell-cell adhesion; Regulation of T cell and leukocyte activation; T cell proliferation | ||
| Module 3 | 7 | Regulation of fat cell, B cell, and myeloid cell differentiation | ||
| Module 4 | 15 | Response to IFN- | ||
| Module 6 | 5 | Mediation of IFN- | ||
| Mild COVID-19 | Module 1 | 53 | Leukocyte cell-cell adhesion; Cellular response to tumor necrosis factor; T cell proliferation | |
| Module 2 | 11 | Response to IFN- | ||
| Module 3 | 44 | Viral transcription; Translation initiation | ||
| Module 4 | 7 | IL-4 production; Immune response | ||
| Severe COVID-19 | Module 1 | 63 | Viral transcription; Translation initiation | |
| CD8+ T Cells | Healthy | Module 1 | 38 | Response to tumor necrosis factor and IFN- |
| Module 3 | 18 | Regulation of leukocyte cell-cell adhesion and T cell activation; Cell killing; Lymphocyte apoptotic process | ||
| Module 4 | 17 | Response to type I IFN and virus; Negative regulation of viral genome replication | ||
| Module 5 | 6 | Actin filament and synapse organization | ||
| Module 6 | 23 | Viral transcription; Translational initiation | ||
| Mild COVID-19 | Module 1 | 74 | Response to IFN- | |
| Module 3 | 15 | Response to type I IFN and virus; Negative regulation of viral genome replication | ||
| Module 4 | 19 | Viral transcription; Translational initiation | ||
| Severe COVID-19 | Module 1 | 79 | Translational initiation; Viral transcription; Cell killing | |
| Module 2 | 21 | Response to type I IFN and virus | ||
| Module 3 | 5 | DNA replication |
IFN: interferon; IL: interleukin.
The main KEGG pathway of each module of each network.
| Cell Type | Network | Module | Number of genes | Main KEGG pathway |
|---|---|---|---|---|
| CD4+ T Cells | Healthy | Module 1 | 31 | COVID-19 |
| Module 2 | 25 | Cytokine-cytokine receptor interaction | ||
| Module 3 | 7 | Rheumatoid arthritis | ||
| Module 4 | 15 | NOD-like receptor signaling pathway | ||
| Module 6 | 5 | PD-L1 expression and PD-1 checkpoint pathway in cancer | ||
| Mild COVID-19 | Module 1 | 53 | Cytokine-cytokine receptor interaction | |
| Module 2 | 11 | NOD-like receptor signaling pathway | ||
| Module 3 | 44 | COVID-19 | ||
| Module 4 | 7 | Central carbon metabolism in cancer | ||
| Severe COVID-19 | Module 1 | 63 | COVID-19; Cytokine-cytokine receptor interaction | |
| CD8+ T Cells | Healthy | Module 1 | 38 | Cytokine-cytokine receptor interaction |
| Module 3 | 18 | Natural killer cell mediated cytotoxicity | ||
| Module 4 | 17 | COVID-19; EBV infection | ||
| Module 5 | 6 | Regulation of actin cytoskeleton | ||
| Module 6 | 23 | COVID-19 | ||
| Mild COVID-19 | Module 1 | 74 | Cytokine-cytokine receptor interaction | |
| Module 3 | 15 | EBV infection | ||
| Module 4 | 19 | COVID-19 | ||
| Severe COVID-19 | Module 1 | 79 | COVID-19; Cytokine-cytokine receptor interaction | |
| Module 2 | 21 | EBV infection | ||
| Module 3 | 5 | DNA replication; Cell cycle |
NOD: Nucleotide-binding oligomerization domain; PD-L1: Programmed death ligand 1; PD-1: Programmed death 1; EBV: Epstein–Barr virus.