| Literature DB >> 33017003 |
Yoshiaki Yasumizu1, Atsushi Hara2, Shimon Sakaguchi1, Naganari Ohkura1.
Abstract
SUMMARY: The possibility that RNA transcripts from clinical samples contain plenty of virus RNAs has not been pursued actively so far. We here developed a new tool for analyzing virus-transcribed mRNAs, not virus copy numbers, in the data of bulk and single-cell RNA-sequencing of human cells. Our pipeline, named VIRTUS (VIRal Transcript Usage Sensor), was able to detect 762 viruses including herpesviruses, retroviruses and even SARS-CoV-2 (COVID-19), and quantify their transcripts in the sequence data. This tool thus enabled simultaneously detecting infected cells, the composition of multiple viruses within the cell, and the endogenous host-gene expression profile of the cell. This bioinformatics method would be instrumental in addressing the possible effects of covertly infecting viruses on certain diseases and developing new treatments to target such viruses.Entities:
Mesh:
Year: 2021 PMID: 33017003 PMCID: PMC7745649 DOI: 10.1093/bioinformatics/btaa859
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.VIRTUS, a pipeline for analyzing multiple viruses, and its outputs from conventional RNA-seq data. (a) Viruses detected from peripheral blood leukocytes from Systemic Lupus Erythematosus (SLE) patients and healthy donors. The colors indicate the number of reads mapped on the viruses with log2 transformation. Only viruses mapped more than five reads in at least one sample were shown. In this dataset, the statistical power was insufficient to detect the differences (two-sided Mann-Whitney U test). (b) Top20 differentially expressed genes within EBV infected cells. The color shows the normalized expression. (c) Virus-mapped reads with splicing on the EBV genome visualized by The Integrative Genomics Viewer. (d) Mean expression of HSV-1 transcripts and the expression of a correlated host gene, RASD1, on UMAP plots. (e) Differentially expressed genes between HSV-1 infected and non-infected cells. The x-axis shows log2 fold change in the gene expression and the y-axis represents -log10 (P-value) calculated by DESeq2