| Literature DB >> 30838030 |
Xiaofeng Xu1, Haishuo Ji2,3, Xiufeng Jin2, Zhi Cheng2, Xue Yao4, Yanqiang Liu2, Qiang Zhao2, Tao Zhang2, Jishou Ruan5, Wenjun Bu2, Ze Chen1, Shan Gao2,3.
Abstract
In this study, we used pan RNA-seq analysis to reveal the ubiquitous existence of both 5' and 3' end small RNAs (5' and 3' sRNAs). 5' and 3' sRNAs alone can be used to annotate nuclear non-coding and mitochondrial genes at 1-bp resolution and identify new steady RNAs, which are usually transcribed from functional genes. Then, we provided a simple and cost effective way for the annotation of nuclear non-coding and mitochondrial genes and the identification of new steady RNAs, particularly long non-coding RNAs (lncRNAs). Using 5' and 3' sRNAs, the annotation of human mitochondrial was corrected and a novel ncRNA named non-coding mitochondrial RNA 1 (ncMT1) was reported for the first time in this study. We also found that most of human tRNA genes have downstream lncRNA genes as lncTRS-TGA1-1 and corrected the misunderstanding of them in previous studies. Using 5', 3', and intronic sRNAs, we reported for the first time that enzymatic double-stranded RNA (dsRNA) cleavage and RNA interference (RNAi) might be involved in the RNA degradation and gene expression regulation of U1 snRNA in human. We provided a different perspective on the regulation of gene expression in U1 snRNA. We also provided a novel view on cancer and virus-induced diseases, leading to find diagnostics or therapy targets from the ribonuclease III (RNase III) family and its related pathways. Our findings pave the way toward a rediscovery of dsRNA cleavage and RNAi, challenging classical theories.Entities:
Keywords: 3′ end; 5′ end; Pan RNA-seq; genome annotation; small RNA
Year: 2019 PMID: 30838030 PMCID: PMC6382676 DOI: 10.3389/fgene.2019.00105
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Definition of 5′ and 3′ sRNAs. (A) 5′ and 3′ sRNAs (in red color) are defined as sRNA-seq reads with lengths of 15∼50 bp, which are precisely aligned to the 5′ and 3′ ends of mature RNAs, respectively. The lengths of them vary progressively with 1-bp differences. This figure shows 5′ and 3′ sRNAs from a typical tRNA (in blue color). As for longer RNAs (e.g., snRNAs or rRNAs), there could be abundant sRNAs in the body. (B) 5-end format is defined to easily identify 5′ ends of mature RNAs using sRNA-seq data. (C) Human rRNA genes (RefSeq: NR_046235.1) were annotated using alignment results in the 5-end format. Among positions 7923, 7924, and 7925 with ratio1s (the 5th column) above 70%, the position 7925 with the highest ratio1 was determined as the 5′ end of 28S rRNA.
FIGURE 2Genome-alignment maps using sRNA-seq, Cage-seq, and GRO-seq. This figure shows the count distribution of all aligned reads on the reference rRNA sequence (RefSeq: NR_046235.1). These reads are from one sRNA-seq dataset (A) one CAGE-seq dataset (B) and one GRO-seq dataset (C) the description of these datasets can be seen in the Section “Materials and Methods.” The identified 5′ and 3′ ends of mature RNAs are marked by boxes.
Annotation of human rRNA genes with corrections.
| Gene | Start | End | Start∗ | End∗ | Length∗ |
|---|---|---|---|---|---|
| 18S rRNA | 3,655 | 5,523 | 3,655 | 5,523 | 1,869 |
| ITS1 | 5,524 | 6,600 | 5,524 | 6,595∗ | 1,072 |
| 5.8S rRNA | 6,601 | 6,757 | 6,596∗ | 6,756∗ | 161 |
| ITS2 | 6,758 | 7,924 | 6,757∗ | 7,924 | 1,168 |
| 28S rRNA | 7,925 | 12,994 | 7,925 | 12,993∗ | 5,069 |
FIGURE 3Corrected annotations of mitochondrial tRNAs. tRNATyr/tRNACys/tRNAAsn/tRNAAla is encoded by L-strand of human mitochondrial genome. tRNAAla is not showed, as it does not need correction. ncMT1 was first discovered in this study. (A) Mitochondrial tRNAs are annotated between two cleavage sites using the sRNA-seq based method, while they are annotated between two trimming sites and several nucleic acids in red color are missed using other existing methods. (B) The acceptor stem of a tRNA precursor is trimmed to contain a 1-bp overhang at the 3′ end. CCAs are post-transcriptionally added to the 3′ ends of tRNAs, one nucleotide at a time. A polyA tail is post-transcriptionally added to the 3′ end of ncMT1.
FIGURE 4SiRNA duplexes discovered from U1 snRNAs. (A) The count distribution of all aligned reads on the reference U1 snRNA (RefSeq: NR_004430.2). (B) The above is a mountain plot representation of the MFE structure, the thermodynamic ensemble of RNA structures and the centroid structure. The positional entropy for each position is showed below. (C) The secondary structure of U1 snRNA. (D) U1 over-expression in the HEK293 (human), SY5Y (human) and PC-12 (rat) cell lines were conducted by virus transfection. The qPCR results showed the relative expression levels of U1 in 12 groups. For each experiment, 12 groups of samples named control, ×1, ×2, ×3, ×4, ×5, ×6, ×7, ×8, ×9, ×10, and ×11 were transfected by 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, and 11 μL U1-packaged lentiviruses The control group used unprocessed samples.