| Literature DB >> 32326627 |
Ruth Barral-Arca1,2,3, Alberto Gómez-Carballa1,2,3, Miriam Cebey-López1,2,3, María José Currás-Tuala1,2,3, Sara Pischedda1,2,3, Sandra Viz-Lasheras1,2,3, Xabier Bello1,2,3, Federico Martinón-Torres3,4, Antonio Salas1,2,3.
Abstract
There is a growing interest in unraveling gene expression mechanisms leading to viral host invasion and infection progression. Current findings reveal that long non-coding RNAs (lncRNAs) are implicated in the regulation of the immune system by influencing gene expression through a wide range of mechanisms. By mining whole-transcriptome shotgun sequencing (RNA-seq) data using machine learning approaches, we detected two lncRNAs (ENSG00000254680 and ENSG00000273149) that are downregulated in a wide range of viral infections and different cell types, including blood monocluclear cells, umbilical vein endothelial cells, and dermal fibroblasts. The efficiency of these two lncRNAs was positively validated in different viral phenotypic scenarios. These two lncRNAs showed a strong downregulation in virus-infected patients when compared to healthy control transcriptomes, indicating that these biomarkers are promising targets for infection diagnosis. To the best of our knowledge, this is the very first study using host lncRNAs biomarkers for the diagnosis of human viral infections.Entities:
Keywords: RNA-seq; biomarkers; lncRNA; machine learning; virus
Year: 2020 PMID: 32326627 PMCID: PMC7215422 DOI: 10.3390/ijms21082748
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Classification performance based on the two lncRNA VS, considering different viral pathogens and studies. Box and whisker plots of VS values: (A) Discovery group (Spanish group); (B) Test group GSE69529; (C) External validation group GSE98859; (D) External validation group GSE121385; (E) External validation group GSE94551; and (F) External validation group PRJNA230906. For all box plots, the horizontal lines in boxes indicate median of the groups; the lower and upper sides of boxes interquartile ranges and the whiskers < 1 times the interquartile range. On the x-axis is the sample status and on the y-axis the VS calculated as log2 of the sum of counts of our 2-transcript diagnosis model. The acronyms stand for: RV (Rotavirus), VZV (Varicella Zoster Virus), diffuse-adhering Escherichia coli (DAEC), enteropathogenic Escherichia coli (EPEC), and enteroaggregative Escherichia coli (EAEC).
Figure 2Evaluation of VS test performance. AUC (Area Under the Curve) values are provided in Table 1. ROC curves for VS: (A) Discovery group (Spanish group); (B) Test group GSE69529; (C) External validation group GSE98859; (D) External validation group GSE121385; (E) External validation group GSE94551; and (F) External validation group PRJNA230906. For all figures, the horizontal lines in boxes indicate median of the groups; the lower and upper sides of boxes interquartile ranges and the whiskers < 1 times the interquartile range. On the x-axis is the sample status and on the y-axis the VS calculated as log2 of the sum of counts of our 2-transcript diagnosis model.
Area under the curve (AUC) values and viral score (VS) for different pathogens and studies. Figures in round brackets show the CI95% (calculated using 2000 bootstrap replicates). No expression (NE) means that at least one of the transcripts of the signature was not expressed. CC: Chinese group; SC: Spanish group; VZF: Varicella Zoster infected fibroblasts; MNC: mononuclear cells with dengue, EC: human umbilical vein endothelial cells enterovirus group, MC: Mexican group.
| Database | Comparison |
| AUC | Sensitivity | Specificity | VS | |
|---|---|---|---|---|---|---|---|
|
| |||||||
| CC | PRJNA230906 | virus vs. control | 4 | 1 (1–1) | 1 | 1 | 2.569 |
| MNC | GSE98859 | virus vs. control | 5 | 1 (1–1) | 1 | 1 | 4.939 |
| EC | GSE94551 | virus vs. control | 6 | 1 (1–1) | 1 | 1 | 4.043 |
| SC | PRJNA325575 | virus vs. control | 18 | 1 (1–1) | 1 | 1 | 7.535 |
| MC | GSE69529 | virus vs. control | 115 | 0.891 (0.833–0.950) | 0.857 | 0.838 | 8.350 |
| MC | GSE69529 | virus vs. bacteria | 220 | 0.731 (0.662–0.800) | 0.814 | 0.600 | 8.113 |
| MC | GSE69529 | bacteria vs. control | 175 | 0.707 (0.621–0.793) | 0.650 | 0.743 | 8.392 |
| VZF | GSE121385 | virus vs. control | 6 | 1 (1–1) | 1 | 1 | 15.015 |
|
| |||||||
| CC | PRJNA230906 | virus vs. control | 4 | 1 (1–1) | 1 | 1 | 1.980 |
| MNC | GSE98859 | virus vs. control | 5 | 1 (1–1) | 1 | 1 | 2.596 |
| EC | GSE94551 | virus vs. control | NE | NE | NE | NE | NE |
| SC | PRJNA325575 | virus vs. control | 18 | 1 (1–1) | 1 | 1 | 3.227 |
| MC | GSE69529 | virus vs. control | 115 | 0.857 (0.784–0.929) | 0.800 | 0.850 | 6.133 |
| MC | GSE69529 | virus vs. bacteria | 220 | 0.724 (0.651–0.797) | 0.855 | 0.525 | 4.101 |
| MC | GSE69529 | bacteria vs. control | 175 | 0.723 (0.623–0.824) | 0.790 | 0.714 | 6.883 |
| VZF | GSE121385 | virus vs. control | 6 | 1 (1–1) | 1 | 1 | 9.670 |
|
| |||||||
| CC | PRJNA230906 | virus vs. control | 4 | 1 (1–1) | 1 | 1 | 2.328 |
| MNC | GSE98859 | virus vs. control | 5 | 1 (1–1) | 1 | 1 | 4.899 |
| EC | GSE94551 | virus vs. control | NE | NE | NE | NE | NE |
| SC | PRJNA325575 | virus vs. control | 18 | 1 (1–1) | 1 | 1 | 11.640 |
| MC | GSE69529 | virus vs. control | 115 | 0.850 (0.774–0.927) | 0.800 | 0.863 | 8.247 |
| MC | GSE69529 | virus vs. bacteria | 220 | 0.753 (0.684–0.822) | 0.630 | 0.788 | 7.469 |
| MC | GSE69529 | bacteria vs. control | 175 | 0.681 (0.579–0.783) | 0.710 | 0.714 | 8.773 |
| VZF | GSE121385 | virus vs. control | 6 | 1 (1–1) | 1 | 1 | 9.679 |
|
| |||||||
| CC | PRJNA230906 | virus vs. control | 4 | 1 (1–1) | 1 | 1 | −2.650 |
| MNC | GSE98859 | virus vs. control | 5 | 0.625 (0.500–0.875) | 1 | 0.250 | −0.091 |
| EC | GSE94551 | virus vs. control | NE | NE | NE | NE | NE |
| SC | PRJNA325575 | virus vs. control | 18 | 1 (1–1) | 1 | 1 | 4.715 |
| MC | GSE69529 | virus vs. control | 115 | 0.829 (0.749–0.910) | 0.771 | 0.825 | −3.764 |
| MC | GSE69529 | virus vs. bacteria | 220 | 0.745 (0.674–0.816) | 0.813 | 0.638 | −4.834 |
| MC | GSE69529 | bacteria vs. control | 175 | 0.647 (0.544–0.749) | 0.748 | 0.629 | −2.924 |
| VZF | GSE121385 | virus vs. control | 6 | 1 (1–1) | 1 | 1 | −2.270 |
Figure 3Diagram of the study design. The acronyms stand for: Rotateq® (RV5); rotavirus (RV); diffuse-adhering Escherichia coli (DAEC), enteropathogenic Escherichia coli (EPEC) and enteroaggregative Escherichia coli (EAEC), and Varicella Zoster Virus (VZV).