Literature DB >> 34705532

Sequences of SARS-CoV-2 "Hybrids" with the Human Genome: Signs of Non-coding RNA?

Andrey Grigoriev1, James J Kelley1, Lingyu Guan1.   

Abstract

Entities:  

Keywords:  SARS-CoV-2; genome; non-coding RNA; small RNA

Mesh:

Substances:

Year:  2021        PMID: 34705532      PMCID: PMC8791249          DOI: 10.1128/JVI.01462-21

Source DB:  PubMed          Journal:  J Virol        ISSN: 0022-538X            Impact factor:   5.103


× No keyword cloud information.

LETTER

Controversial reports of human-virus chimeric reads (HVCRs) suggested a possible integration of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequences into human DNA (1–3). Its recent refutation (4) in this journal posited that HVCRs are likely artifacts of library construction (often involving rRNAs and tRNAs), but the subject continues to attract attention (5, 6). We previously analyzed human non-coding regulatory small RNA (sRNA), including microRNA, plus tRNA and rRNA fragments, paired with their putative targets in millions of chimeric reads (7–9) obtained in CLASH experiments (10). Applying our computational pipelines to the HVCR data sets, we observed many read ends matching human sRNA. Strikingly, they were not chimeras (we will still use “HVCRs” here), but occurrences of common short sequences in both genomes, human and SARS-CoV-2. This leads to multiple human-human and two virus-virus joined sequences, matching sRNA and misinterpreted as HVCRs. Most human–human cases contained sRNA paired with human transcripts. 1–100 nt away from the sRNA in the read, a short match to SARS-CoV-2 appeared within that transcript (Fig. 1A). Such short matches were always subsequences of longer human transcripts, making HVCR unlikely, as such events occurred in both SARS-CoV-2 and corresponding mock samples. Details of these non-viral chimeras are outside the scope of this letter. Our observation that many short virus sequences co-occur in human genome and may lead to false HVCRs is not surprising, because random 16-mers should appear once every ∼4.5 gigabases of a sequence with 50% GC.
FIG 1

Human–human and virus–virus “hybrid” cases, in genomic context of reads containing sRNA. “Virus match” (A) and “18S match” (B) are 16-mers common between human and SARS-CoV-2 genomes. (C) An IGV display of the N gene region, containing a match to hsa-mir-8066 (nucleotides hybridizing with the “seed” region are in red). In addition to the reads matching 18S at their ends, we detected many middle-match reads, traversing this part of the S gene (B, not checked in A). Four sequencing samples (from our reanalysis of BioProject PRJEB37886, SRA Study ERP121228), reveal 1–2 nt deletions within the seed match, supported with many non-duplicate Illumina reads. Reads without these deletions are not shown. Some reads contain additional substitutions (sample 3) or deletions (samples 2 and 4, green and blue notches, misaligned in read ends at the bottom of the display also, in fact, indicate a deletion). Other deletions or insertions in nearby reads (not shown) may restore the reading frame as in sample 2.

Human–human and virus–virus “hybrid” cases, in genomic context of reads containing sRNA. “Virus match” (A) and “18S match” (B) are 16-mers common between human and SARS-CoV-2 genomes. (C) An IGV display of the N gene region, containing a match to hsa-mir-8066 (nucleotides hybridizing with the “seed” region are in red). In addition to the reads matching 18S at their ends, we detected many middle-match reads, traversing this part of the S gene (B, not checked in A). Four sequencing samples (from our reanalysis of BioProject PRJEB37886, SRA Study ERP121228), reveal 1–2 nt deletions within the seed match, supported with many non-duplicate Illumina reads. Reads without these deletions are not shown. Some reads contain additional substitutions (sample 3) or deletions (samples 2 and 4, green and blue notches, misaligned in read ends at the bottom of the display also, in fact, indicate a deletion). Other deletions or insertions in nearby reads (not shown) may restore the reading frame as in sample 2. For virus–virus, the Spike gene harbored a 16-nt sequence (genome positions 22783–22798), a reverse complement of 18S rRNA, and the N gene (positions 28697–28712) contained a reverse complement of 16 nts fragment from the mature human microRNA, hsa-mir-8066. In the “hybrid” reads these 16-mers were always adjacent to the surrounding virus sequence. As such HVCRs only occurred in samples with SARS-CoV-2, not mock, they seem to be genuine SARS-CoV-2 reads (Fig. 1B), not chimeras. These cases may have interesting (although speculative) implications. Why Spike and N? Might these abundant SARS-CoV-2 subgenomic mRNAs somehow interact with the host sRNA-based regulation? We observed in multiple samples 1–2 nt deletions in numerous reads (3%–8% coverage in 28706–28711), corresponding to the hsa-mir-8066 “seed” region (Fig. 1C). Does this indicate “attempts” of potential evasion of hsa-mir-8066 by the virus? Perhaps it does because these deletions are frame-disrupting, but note how frequent are adjacent (linked by the same read) additional deletions/insertions, which in combination may restore the reading frame. And 18S rRNA? Earlier, we described numerous rRNA fragments (rRFs), potentially interacting within Argonaute complexes (and forming abundant chimeras) with human transcripts (7). One indicator of interactions was the presence of common motifs (reverse complements with rRFs) in the targets of the same rRF, suggesting a mechanism for post-transcriptional regulation. Coincidentally, one such interaction motif we reported would perfectly hybridize with the 16-nt stretch of Spike mRNA. While it is tempting to speculate about Spike regulation by the host rRF, we observed few changes in this gene region (unlike in Fig. 1C). Alternatively, this stretch of 18S might hybridize with Spike mRNA affecting its translation (“ribosome filter” hypothesis) (11).
  10 in total

Review 1.  The ribosome filter hypothesis.

Authors:  Vincent P Mauro; Gerald M Edelman
Journal:  Proc Natl Acad Sci U S A       Date:  2002-09-09       Impact factor: 11.205

2.  Inferring targeting modes of Argonaute-loaded tRNA fragments.

Authors:  Lingyu Guan; Spyros Karaiskos; Andrey Grigoriev
Journal:  RNA Biol       Date:  2019-10-15       Impact factor: 4.652

3.  Host-Virus Chimeric Events in SARS-CoV-2-Infected Cells Are Infrequent and Artifactual.

Authors:  Bingyu Yan; Srishti Chakravorty; Carmen Mirabelli; Christiane E Wobus; Behdad Afzali; Majid Kazemian; Luopin Wang; Jorge L Trujillo-Ochoa; Daniel Chauss; Dhaneshwar Kumar; Michail S Lionakis; Matthew R Olson
Journal:  J Virol       Date:  2021-07-12       Impact factor: 5.103

4.  Reverse-transcribed SARS-CoV-2 RNA can integrate into the genome of cultured human cells and can be expressed in patient-derived tissues.

Authors:  Liguo Zhang; Alexsia Richards; M Inmaculada Barrasa; Stephen H Hughes; Richard A Young; Rudolf Jaenisch
Journal:  Proc Natl Acad Sci U S A       Date:  2021-05-25       Impact factor: 11.205

5.  Mapping the human miRNA interactome by CLASH reveals frequent noncanonical binding.

Authors:  Aleksandra Helwak; Grzegorz Kudla; Tatiana Dudnakova; David Tollervey
Journal:  Cell       Date:  2013-04-25       Impact factor: 41.582

6.  Computational meta-analysis of ribosomal RNA fragments: potential targets and interaction mechanisms.

Authors:  Lingyu Guan; Andrey Grigoriev
Journal:  Nucleic Acids Res       Date:  2021-04-19       Impact factor: 16.971

7.  Exogenous Coronavirus Interacts With Endogenous Retrotransposon in Human Cells.

Authors:  Ying Yin; Xiao-Zhao Liu; Ximiao He; Li-Quan Zhou
Journal:  Front Cell Infect Microbiol       Date:  2021-02-25       Impact factor: 5.293

8.  SARS-CoV-2-Host Chimeric RNA-Sequencing Reads Do Not Necessarily Arise From Virus Integration Into the Host DNA.

Authors:  Anastasiya Kazachenka; George Kassiotis
Journal:  Front Microbiol       Date:  2021-06-02       Impact factor: 5.640

9.  Large-Scale Computational Discovery of Binding Motifs in tRNA Fragments.

Authors:  Lingyu Guan; Vincent Lam; Andrey Grigoriev
Journal:  Front Mol Biosci       Date:  2021-06-22
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.