| Literature DB >> 35504488 |
Thomas Smith1, Mohammed A Rohaim1, Muhammad Munir2.
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is an emerging RNA virus causing COVID-19 disease, across the globe. SARS-CoV-2 infected patients may exhibit acute respiratory distress syndrome which can be compounded by endemic respiratory viruses and thus highlighting the need to understand the genetic bases of clinical outcome under multiple respiratory infections. In this study, 42 individual datasets and a multi-parametric based selected list of over 12,000 genes against five medically important respiratory viruses (SARS-CoV-2, SARS-CoV-1, influenza A, respiratory syncytial virus (RSV) and rhinovirus were collected and analysed in an attempt to understand differentially regulated gene patterns and to cast genetic markers of individual and multiple co-infections. While a certain cohort of virus-specific genes were regulated (negatively and positively), notably results revealed a greatest correlation among genes regulation by SARS-CoV-2 and RSV. Furthermore, out of analysed genes, the MAP2K5 and NFKBIL1 were specifically and highly upregulated in SARS-CoV-2 infection both in vivo or in vitro. The most conserved genetic signature was JAK2 gene as well as the constitutively downregulated ZNF219 gene. In contrast, several genes including GPBAR1 and SC5DL were specifically downregulated in SARS-CoV-2 datasets. Finally, we catalogued a set of genes that were conserved or differentially regulated across studied respiratory viruses. These finding provide foundational and genome-wide data to gauge the markers of respiratory viral infections individually and under co-infection. This work compares the virogenomic signatures among human respiratory viruses and provides valid targets for potential antiviral therapy.Entities:
Keywords: COVID-19; Influenza; RSV; Rhinovirus; SARS-CoV-1; SARS-CoV-2; Transcriptomics
Mesh:
Substances:
Year: 2022 PMID: 35504488 PMCID: PMC9054707 DOI: 10.1016/j.mcp.2022.101820
Source DB: PubMed Journal: Mol Cell Probes ISSN: 0890-8508 Impact factor: 3.285
Inclusion and Exclusion criteria used to select datasets to be included within this study.
| Inclusion Criteria | Exclusion Criteria |
|---|---|
| Transcriptomic Study | Sample is taken from another species other than human |
| Sample has been collected from infected humans with either SARS-CoV-2, Influenza A, SARS-CoV-1, RSV or Rhinovirus | Studies that include less than 10 up or down regulated genes |
| Sample is taken from humans | Studies that have modified genes |
| In vivo or in vitro study | Studies where subjects had been given a preventative drug or treatment |
Complete list of all datasets selected to be used within this study. The virus, type of sample and analysis method are also shown.
| Virus | Sample Type | Dataset ID | Analysis Method |
|---|---|---|---|
| SARS-CoV-2 | Nasopharyngeal swabs | GSE152075 | RNA sequencing |
| SARS-CoV-2 | PBMCs | GSE150728 | RNA sequencing |
| SARS-CoV-2 | Intestinal Organoids | GSE149312 | RNA sequencing |
| SARS-CoV-2 | NHBE | GSE147507 | RNA sequencing |
| SARS-CoV-2 | Autopsy samples | GSE150316 | RNA sequencing |
| SARS-CoV-2 | PBMCs and BALF | CRA002390 | RNA sequencing |
| SARS-CoV-2 | BALF | HRA000143 | RNA sequencing |
| SARS-CoV-2 | Upper Airway Samples | GSE156063 | RNA sequencing |
| SARS-CoV-2 | PBMCs | GSE152418 | RNA sequencing |
| SARS-CoV-2 | pHAE | GSE153970 | RNA sequencing |
| SARS-CoV-2 | Nasopharyngeal swabs | GSE154770 | RNA sequencing |
| SARS-CoV-2 | FFPE samples | GSE159787 | RNA sequencing |
| SARS-CoV-2 | Human Lung-only mice | GSE155286 | RNA sequencing |
| SARS-CoV-2 | nHTBE | GSE157526 | RNA sequencing |
| Influenza | PBMCs | GSE34205 | Microarray |
| Influenza | PBMCs | GSE6269 | Microarray |
| Influenza | Nasal Wash Samples | GSE68310 | Microarray |
| Influenza | Whole Blood | GSE38900 | Microarray |
| Influenza | Whole Blood | GSE90732 | Microarray |
| Influenza | Whole Blood | GSE61754 | Microarray |
| Influenza | NHBE | GSE147507 | RNA Sequencing |
| Influenza | Whole Blood | GSE21802 | Microarray |
| SARS-CoV-1 | PBMCs | GSE1739 | Microarray |
| SARS-CoV-1 | Intestinal Organoids | GSE149312 | RNA Sequencing |
| SARS-CoV-1 | NHBE | GSE147507 | RNA Sequencing |
| RSV | Whole Blood | GSE38900 | Microarray |
| RSV | PBMCs | GSE34205 | Microarray |
| RSV | NHBE | GSE147507 | RNA Sequencing |
| Rhinovirus | PBMCs | GSE53543 | Microarray |
| Rhinovirus | Whole Blood | GSE38900 | Microarray |
Fig. 1Scatter bar graphs of the Log-2-Fold Change of each gene for each dataset for A) SARS-CoV-2, B) Influenza, C) SARS-CoV-1, D) RSV and E) Rhinovirus. A horizontal line is also shown on each bar, which marks the average Log-2 fold change (Log-2FC) of the selected genes.
Top five up regulated genes shared across SARS-CoV-2, Influenza, SARS-CoV-1, RSV and Rhinovirus.
| Function | Location | ||||||
|---|---|---|---|---|---|---|---|
| SARS-CoV-2 | Influenza | SARS-CoV-1 | RSV | Rhinovirus | |||
| DDX60L | This gene encodes a member of the DExD/H-box helicase family of proteins, a subset of the super family 2 helicases. In addition to functions in RNA metabolism, members of this family are involved in anti-viral immunity and act as cytosolic sensors of viral nucleic acids | chr4 | 76.89471 | 81.07511 | 88.881 | 98.23783 | 98.44945 |
| JAK2 | This gene encodes a non-receptor tyrosine kinase that plays a central role in cytokine and growth factor signalling. This gene and the IL6/JAK2/STAT3 signalling pathway is a therapeutic target for the treatment of excessive inflammatory responses to viral infections. | chr9 | 65.71339 | 81.63137 | 94.10571 | 87.08943 | 97.70603 |
| IFI44 | Predicted to be involved in immune response. | chr1 | 75.19402 | 88.06365 | 68.81404 | 96.18025 | 97.14861 |
| FOXN2 | This gene encodes a forkhead domain binding protein and may function in the transcriptional regulation of the human T-cell leukemia virus long terminal repeat. | chr2 | 66.57874 | 89.72695 | 99.19854 | 82.8818 | 85.6627 |
| DDX60 | DX60 (DExD/H-Box Helicase 60) is a Protein Coding gene that functions as an antiviral factor and promotes RIG-I-like receptor-mediated signaling. | chr4 | 60.44368 | 69.17354 | 92.75642 | 99.07696 | 99.12914 |
Means of the total ranking score of every gene were used to calculate the top five up regulated genes.
Top five down regulated genes shared across all SARS-CoV-2, Influenza, SARS-CoV-1, RSV and Rhinovirus.
| FCGRT | This gene encodes a receptor that binds the Fc region of monomeric immunoglobulin G. This protein also binds immunoglobulin G to protect the antibody from degradation. Alternative splicing results in multiple transcript variants. | chr19 | −39.6042 | −82.1815 | −90.1054 | −93.0122 | −99.3856 |
| MFSD3 | MFSD3 (Major Facilitator Superfamily Domain Containing 3) is a Protein Coding gene. redicted to be involved in proton transmembrane transport. Predicted to be integral component of membrane. | chr8 | −42.4523 | −76.7588 | −93.4394 | −97.8397 | −94.4382 |
| IMPA2 | IMPA2 (Inositol Monophosphatase 2) is a Protein Coding gene. The encoded protein catalyzes the dephosphoylration of inositol monophosphate and plays an important role in phosphatidylinositol signaling. | chr18 | −42.7931 | −77.4724 | −97.8705 | −92.2407 | −97.9466 |
| CCDC106 | CCDC106 (Coiled-Coil Domain Containing 106) is a Protein Coding gene. romotes the degradation of p53/TP53 protein and inhibits its transactivity. | chr19 | −39.2854 | −89.8161 | −88.2233 | −98.4019 | −93.7914 |
| ZNF219 | ZNF219 (Zinc Finger Protein 219) is a Protein Coding gene. The encoded protein functions as a transcriptional repressor of the high mobility group nucleosome binding domain 1 protein, which is associated with transcriptionally active chromatin. | chr14 | −75.3585 | −85.8418 | −83.4911 | −72.589 | −98.0437 |
Means of the total ranking score of every gene were used to calculate the top five down regulated genes.
Ranking scores of up regulated genes related to SARS-CoV-2 compared to down regulated genes correlated with other respiratory viruses (Influenza, SARS-CoV-1, RSV and Rhinovirus).
| Gene | Fucntion | Location | Virus | ||||
|---|---|---|---|---|---|---|---|
| RGS11 | RGS11 (Regulator Of G Protein Signaling 11) is a Protein Coding gene. This protein inhibits signal transduction by increasing the GTPase activity of G protein alpha subunits, thereby driving them into their inactive GDP-bound form. | chr16 | 65.1003246 | −96.6809 | −93.256614 | −94.2356 | 0 |
| MAP2K5 | MAP2K5 (Mitogen-Activated Protein Kinase Kinase 5) is a Protein Coding gene. The protein encoded by this gene is a dual specificity protein kinase that belongs to the MAP kinase kinase family. This kinase specifically interacts with and activates MAPK7/ERK5. This kinase itself can be phosphorylated and activated by MAP3K3/MEKK3, as well as by atypical protein kinase C isoforms (aPKCs). The signal cascade mediated by this kinase is involved in growth factor stimulated cell proliferation and muscle cell differentiation. | chr15 | 47.014237 | −58.371 | −65.336632 | −55.5825 | −78.04365 |
| GPR162 | GPR162 (G Protein-Coupled Receptor 162) is a Protein Coding gene. This gene was identified upon genomic analysis of a gene-dense region at human chromosome 12p13. It appears to be mainly expressed in the brain; however, its function is not known. | chr12 | 17.8299687 | −96.1071 | −82.211228 | −99.2836 | −97.15441 |
| MSC | MSC (Musculin) is a Protein Coding gene. This protein is capable of inhibiting the transactivation capability of E47, an E2A protein, in mammalian cells. This gene is a downstream target of the B-cell receptor signal transduction pathway. | chr8 | 47.471529 | −72.2297 | −91.707894 | −20.3681 | −93.69442 |
| ZNF581 | ZNF581 (Zinc Finger Protein 581) is a Protein Coding gene. Predicted to enable DNA-binding transcription factor activity and RNA polymerase II | chr19 | 58.2545732 | −44.4351 | −86.534739 | −68.4118 | −73.32255 |
| RINL | RINL (Ras And Rab Interactor Like) is a Protein Coding gene. Predicted to enable guanyl-nucleotide exchange factor activity. Predicted to be involved in endocytosis. | chr19 | 42.9596735 | −57.4864 | −21.907937 | −96.4069 | −75.6831 |
| CENPBD1 | Predicted to enable DNA binding activity. | chr16 | 55.3289963 | −67.2357 | −42.449989 | −55.814 | −93.1447 |
| THBS3 | THBS3 (Thrombospondin 3) is a Protein Coding gene. The protein encoded by this gene belongs to the thrombospondin family. Thrombospondin family members are adhesive glycoproteins that mediate cell-to-cell and cell-to-matrix interactions. This protein forms a pentameric molecule linked by a single disulfide bond. This gene shares a common promoter with metaxin 1. | chr1 | 62.2955312 | −84.3552 | −29.791353 | −78.6179 | −56.05497 |
| CDA | CDA (Cytidine Deaminase) is a Protein Coding gene. This gene encodes an enzyme involved in pyrimidine salvaging. The encoded protein forms a homotetramer that catalyzes the irreversible hydrolytic deamination of cytidine and deoxycytidine to uridine and deoxyuridine, respectively. It is one of several deaminases responsible for maintaining the cellular pyrimidine pool. | chr1 | 45.317302 | −79.2657 | −51.989675 | −8.71817 | −99.35327 |
| LIMS2 | LIMS2 (LIM Zinc Finger Domain Containing 2) is a Protein Coding gene. This gene encodes a member of a small family of focal adhesion proteins which interacts with ILK (integrin-linked kinase), a protein which effects protein-protein interactions with the extraceullar matrix. The encoded protein has five LIM domains, each domain forming two zinc fingers, which permit interactions which regulate cell shape and migration. | chr2 | 46.6092978 | −76.5029 | −81.910088 | −75.113 | 0 |
| NFKBIL1 | NFKBIL1 (NFKB Inhibitor Like 1) is a Protein Coding gene. Involved in the regulation of innate immune response. Acts as negative regulator of Toll-like receptor and interferon-regulatory factor (IRF) signaling pathways. Contributes to the negative regulation of transcriptional activation of NF-kappa-B target genes in response to endogenous proinflammatory stimuli. | chr6 | 53.4792011 | −80.9981 | −78.780383 | −80.205 | 0 |
| DTX3 | DTX3 (Deltex E3 Ubiquitin Ligase 3) is a Protein Coding gene. Regulator of Notch signaling, a signaling pathway involved in cell-cell communications that regulates a broad spectrum of cell-fate determinations. | chr12 | 58.4436487 | −79.989 | −64.132071 | −46.0818 | −43.75101 |
Ranking scores of down regulated genes related to SARS-CoV-2 compared to up regulated genes correlated with other respiratory viruses (Influenza, SARS-CoV-1, RSV and Rhinovirus).
| Gene | Fucntion | Location | |||||
|---|---|---|---|---|---|---|---|
| GPBAR1 | GPBAR1 (G Protein-Coupled Bile Acid Receptor 1) is a Protein Coding gene. This gene encodes a member of the G protein-coupled receptor (GPCR) superfamily. This enzyme functions as a cell surface receptor for bile acids. Treatment of cells expressing this GPCR with bile acids induces the production of intracellular cAMP, activation of a MAP kinase signaling pathway, and internalization of the receptor. The receptor is implicated in the suppression of macrophage functions and regulation of energy homeostasis by bile acids. | chr2 | −44.515674 | 82.44926 | 41.1585675 | 83.27739 | 98.47069 |
| IDI1 | IDI1 encodes a peroxisomally-localized enzyme that catalyzes the interconversion of isopentenyl diphosphate (IPP) to its highly electrophilic isomer, dimethylallyl diphosphate (DMAPP), which are the substrates for the successive reaction that results in the synthesis of farnesyl diphosphate and, ultimately, cholesterol. | chr10 | −40.808037 | 93.10592 | 60.5356599 | 76.79214 | 63.57264 |
| SC5DL | This gene encodes an enzyme of cholesterol biosynthesis. The encoded protein catalyzes the conversion of lathosterol into 7-dehydrocholesterol. | chr11 | −71.53255 | 60.62026 | 74.15035 | 88.31216 | 77.25149 |
| FAM49B | Enables small GTPase binding activity. Involved in several processes, including cellular response to molecule of bacterial origin; negative regulation of small GTPase mediated signal transduction; and regulation of organelle organization. | chr8 | −44.603995 | 74.26559 | 59.9066653 | 58.04363 | 63.74257 |
| MYH15 | MYH15 (Myosin Heavy Chain 15) is a Protein Coding gene. Predicted to enable several functions, including ATP binding activity; actin filament binding activity; and calmodulin binding activity. Predicted to be involved in extraocular skeletal muscle development. | chr3 | −60.985801 | 96.50129 | 79.5576747 | 86.08247 | 0 |
| INA | INA (Internexin Neuronal Intermediate Filament Protein Alpha) is a Protein Coding gene. They may also play a role in intracellular transport to axons and dendrites. This gene is a member of the intermediate filament family and is involved in the morphogenesis of neurons. | chr10 | −62.020737 | 95.6463 | 74.2010754 | 77.4994 | 0 |
| KIAA1009 | CEP162 (Centrosomal Protein 162) is a Protein Coding gene. Involved in cilium assembly. | chr6 | −50.576886 | 63.53921 | 96.7637212 | 13.33014 | 58.41121 |
| SGOL2 | SGO2 (Shugoshin 2) is a Protein Coding gene. Predicted to be involved in homologous chromosome segregation; meiotic sister chromatid cohesion; and mitotic sister chromatid segregation. Predicted to act upstream of or within meiotic nuclear division; positive regulation of maintenance of meiotic sister chromatid cohesion, centromeric; and protein localization. Located in chromosome, centromeric region and nuclear body. | chr2 | −59.393092 | 88.30258 | 99.9289845 | −4.27642 | 52.46389 |
| BCAS2 | BCAS2 (BCAS2 Pre-MRNA Processing Factor) is a Protein Coding gene. Involved in mRNA splicing, via spliceosome. Located in centrosome and nuclear speck. Part of U2-type catalytic step 2 spliceosome. Colocalizes with DNA replication factor A complex. Implicated in breast cancer. | chr1 | −42.301185 | 44.86425 | 69.5343411 | 42.43587 | 35.70518 |
| PKD2 | PKD2 (Polycystin 2, Transient Receptor Potential Cation Channel) is a Protein Coding gene. The encoded protein is a multi-pass membrane protein that functions as a calcium permeable cation channel, and is involved in calcium transport and calcium signaling in renal epithelial cells. This protein interacts with polycystin 1, and they may be partners in a common signaling cascade involved in tubular morphogenesis. | chr4 | −45.37472 | 46.51688 | 62.7675763 | 29.76504 | 53.48343 |
Fig. 2Uncut (A) and Cut (B) ranking scores for each gene combining all datasets for each respiratory virus. Also, in this figure are scatter plots of ranking scores of all genes collected for each respiratory virus, using SARS-CoV-2 as the comparison. (C) Shows a comparison of Influenza and SARS-CoV-2, (D) between SARS-CoV-1 and SARS-CoV-2, (E) between RSV and SARS-CoV-2 and (F) between Rhinovirus and SARS-CoV-2.
Fig. 3Heatmap of DEGs for all respiratory viruses studied in this analysis.
Fig. 4(A) Standard deviation of all genes across all viruses. (B) Correlation matrix using data taken from the top 75% of genes. (C) KEGG pathway analysis by cluster. (D) T-SNE plot of all 12,000 genes.
Fig. 5Nature of differentially regulated genes. (A) Total number of upregulated and down regulated genes for each virus. (B) Venn diagrams representing the differentially regulated genes that are in common between each of the respiratory viruses.
Fig. 6Heatmaps specific to different pathways compiled by GAGE pathway analysis. (A) For Defence response to virus, (B) for cytokine response, (C) for regulation of cytokine production and (D) for positive regulation of innate immune response.
Fig. 7Regulation of different pathways by studied respiratory viruses. (A) Regulation of genes associated with the JAK-STAT signalling pathway. (B) Regulation of genes associated with cytokine-cytokine receptor interaction. (C) Ranking scores of the C8orf4 gene for each respiratory virus. (D) Genome map showing SARS-CoV-2 upregulated genes in red and downregulated genes in blue.