| Literature DB >> 35664076 |
Francisco J Enguita1,2, Ana Lúcia Leitão3, J Tyson McDonald4, Viktorija Zaksas1,5,6, Saswati Das1,7, Diego Galeano1,8, Deanne Taylor1,9,10, Eve Syrkin Wurtele1,11, Amanda Saravia-Butler1,12,13, Stephen B Baylin1,14, Robert Meller1,15, D Marshall Porterfield1,16, Douglas C Wallace1,9,10,17, Jonathan C Schisler1,18, Christopher E Mason1,19,20,21,22, Afshin Beheshti1,23,24.
Abstract
Rationale: Viral infections are complex processes based on an intricate network of molecular interactions. The infectious agent hijacks components of the cellular machinery for its profit, circumventing the natural defense mechanisms triggered by the infected cell. The successful completion of the replicative viral cycle within a cell depends on the function of viral components versus the cellular defenses. Non-coding RNAs (ncRNAs) are important cellular modulators, either promoting or preventing the progression of viral infections. Among these ncRNAs, the long non-coding RNA (lncRNA) family is especially relevant due to their intrinsic functional properties and ubiquitous biological roles. Specific lncRNAs have been recently characterized as modulators of the cellular response during infection of human host cells by single stranded RNA viruses. However, the role of host lncRNAs in the infection by human RNA coronaviruses such as SARS-CoV-2 remains uncharacterized.Entities:
Keywords: RNA-binding protein; SARS-CoV-2; long non-coding RNA; regulatory network
Mesh:
Substances:
Year: 2022 PMID: 35664076 PMCID: PMC9131284 DOI: 10.7150/thno.73268
Source DB: PubMed Journal: Theranostics ISSN: 1838-7640 Impact factor: 11.600
Figure 1SARS-CoV-2 infection is characterized by a gene expression pattern enriched in up-regulated mRNA and lncRNA transcripts that can be correlated with the viral load observed in patients. A, number of differentially expressed transcripts observed in patients with different SARS-CoV-2 viral loads (Low, Medium and High) and those infected with different respiratory viruses (Other) in comparison with the uninfected patients; B, number of the different families of up-regulated transcripts in SARS-CoV-2 patients and infected with other respiratory viruses in comparison with the control group; C, number of the different families of down-regulated transcripts in SARS-CoV-2 patients and infected with other respiratory viruses in comparison with the control group; D, Venn diagram representing the number of up-regulated lncRNA transcripts observed in each group of study referred to the uninfected control group; E, CIRCOS plot 45 showing the genomic location and fold changes of the differentially expressed coding transcripts in the group of SARS-CoV-2 patients infected with higher viral loads in comparison with the uninfected controls (red squares, up-regulated mRNAs; green squares, down-regulated mRNAs); F, CIRCOS plot 45 depicting the genomic locations and fold changes of the differentially expressed lncRNA transcripts in the group of SARS-CoV-2 patients infected with higher viral loads in comparison with the uninfected controls (red circles, up-regulated lncRNAs; green circles, down-regulated lncRNAs).
upregulated lncRNAs detected in nasopharyngeal samples from patients with high SARS-CoV-2 viral loads that have been functionally characterized in different cellular processes or pathologies.
| Symbol | ENSEMBL gene | Location | Comments | References |
|---|---|---|---|---|
| NRIR | ENSG00000225964 | chr2:6968685 | Negative regulator of interferon response. Experimental evidence linked this lncRNA to the control of the cellular immunity against viral infections. | |
| BISPR | ENSG00000282851 | chr19:17516495 | Interferon-stimulated positive regulator. This lncRNA belongs to a specific transcriptomic fingerprint developed in response to viral infections. | |
| LINC02068 | ENSG00000223387 | chr3:172278691 | This lncRNA has been described as a part of a molecular signature that predicts the outcome of endometrial cancer. |
|
| LINC01208 | ENSG00000223715 | chr3:176321936 | Member of a molecular biomarker signature determined in breast cancer that can independently predict the patient survival rate. |
|
| USP30-AS1 | ENSG00000256262 | chr12:109489846 | Antisense transcript to the USP30 gene. It has been characterized as an enhancer of cell proliferation in myeloid leukemia, colon, and cervical cancers. Its regulatory mechanisms involve the direct control of the expression of USP30 gene and the sponging of several miRNAs. | |
| U62317.2 | ENSG00000272666 | chr22: 50604217 | In bladder cancer, this lncRNA has been described as an important player in the regulation of the epithelial-to-mesenchymal transition, and directly related with the overall disease prognosis. |
|
| DLGAP1-AS5 | ENSG00000261520 | chr18:4264602 | Antisense transcript to the DLGAP1 gene. In gastric cancer, its overexpression has been related with an increased endogenous immune response against the tumor and a better prognosis. |
|
| FMR1-IT1 | ENSG00000236337 | chrX:147028461-147029103 | Internal transcript to FMR1 gene. In head and neck carcinomas, this lncRNAs has been described as an independent prognosis biomarker. |
|
| MIR155HG | ENSG00000234883 | chr21:26934457-26947480 | Host gene for miR-155. This lncRNAs has been characterized as an important regulatory factor of innate immune response against viral infections and inflammation. Its regulatory actions are exerted via direct transcriptional control, epigenetic activation, miRNA sponging and the production of micropeptides. | |
| C5orf56 | ENSG00000197536 | chr5:131746621-131811736 | Chromosome 5, open reading frame 56. Described in genetic associations with autoimmune diseases, but also as a prognosis factor of bladder cancer by its involvement in the regulation of the epithelial to mesenchymal transition. |
Figure 2Functional prediction analysis by ncFANs 2.0 algorithm 39 of the upregulated lncRNAs observed in SARS-CoV-2 patients with high viral loads. A, GO-term enrichment analysis performed with ncFANs and filtered by removal of the redundant terms with REVIGO 41. The filtered GO-terms are classified according to their two-dimensional arbitrary semantic space and represented by symbols with dimensions proportional to the LogSize, showing the most relevant GO-terms for the context of viral infections. B, pathway enrichment analysis by ncFANs using the KEGG database. C, molecular signature analysis by ncFANs using the MSigDB database.
Figure 3Expression levels of selected lncRNAs quantified by next-generation sequencing in nasal swabs from the working group of patients, and whole blood, obtained from the COVIDOME project database 74. LncRNA expression in nasal swabs distributed by groups of patients: A, NRIR; B, LINC02068; C, BISPR; D, USP30-AS1; E, MIR155HG and F, AL512306.2. LncRNA levels in whole blood in SARS-CoV-2 patients (Covid19) and non-infected controls (None): G, NRIR; H, BISPR; I, USP30-AS1 and J, AL512306.2. Statistical comparisons between sample groups were made by one-way ANOVA in the case of samples from nasal swabs and by the Student's t-test in the data from the COVIDOME project (****, p-value < 0.0001; ***, p-value < 0.001; **, p-value < 0.01; *, p-value < 0.05 and n.s, non-significant).
Figure 4Spearman's correlation analysis of the top 90 upregulated lncRNAs and the viral gene transcripts in the cohort of analyzed samples. A, Hierarchical clustered Spearman's correlation matrix for the overexpressed lncRNAs and the detected SARS-CoV-2 transcripts across all the samples analyzed by the BioCPR software 75. The SARS-CoV-2 mRNA transcripts are highlighted within boxes; B, correlation analysis between NRIR lncRNA and S-protein transcript; C, correlation analysis between BISPR lncRNA and S-protein transcript; and D, correlation analysis between MIR155HG lncRNA and S-protein transcript. The correlation coefficients showed in panels b, c and d correspond to the Spearman analysis and are significant in all cases with p-values < 0.0001.
Figure 5Functional analysis of the 50 top up-regulated lncRNAs by SARS-CoV-2 infection considering their validated interactions with RNA-binding proteins retrieved from ENCORI database 42. A, GO-term enrichment analysis for biological processes of the RNA-binding proteins that interact with the selected overexpressed lncRNAs in SARS-CoV-2 patients with high viral loads; B, pathway enrichment analysis using the KEGG database and considering the RNA-binding proteins that interact with the selected overexpressed lncRNAs in SARS-CoV-2 patients with high viral loads; C, interaction map between RNA-binding proteins and the 50 top overexpressed lncRNAs in SARS-CoV-2 patients with high viral loads. The number of interactions is depicted as circles with a diameter proportional to the number of RNA-binding sites in each lncRNA. The right-hand side panel represents the density of RBP binding sites per 1000 nucleotides in each lncRNA as extracted from the ENCORI database.
Figure 6lncRNA-centered regulatory network established in SARS-CoV-2 infection involving upregulated lncRNAs, RNA-binding proteins and the viral genome. A, regulatory network built by heterogeneous network data analysis with IHNLncSim algorithm 44, the RNA-binding proteins extracted from ENCORI database 42 and the recently described interactions between host proteins and the viral genome 32. Functional similarity among upregulated lncRNAs determined by IHNLncSim are represented by connecting continuous lines with thickness proportional to the value of the similarity coefficient value. Validated RNA-protein interactions from ENCORI database are represented by dashed grey lines. Characterized interactions between RNA-binding proteins and the SARS-CoV-2 genome are represented by dashed blue arrows. The size of the symbols representing lncRNAs (squares) and RNA-binding proteins (circles) are proportional to the number of established functional interactions. B, time course of protein expression from the selected RNA-binding proteins during SARS-CoV-2 in a cellular model, as described previously 25. Expression data from quantitative proteomics were retrieved from the PRIDE partner repository database.