| Literature DB >> 28621718 |
Xiaoran Niu1,2, Yu Sun3, Ze Chen4, Rugang Li5, Chellappan Padmanabhan6, Jishou Ruan7, Jan F Kreuze8, KaiShu Ling9, ZhangJun Fei10, Shan Gao11.
Abstract
Small interfering RNA (siRNA) duplexes are short (usually 21 to 24 bp) double-stranded RNAs (dsRNAs) with several overhanging nucleotides at both 5'- and 3'-ends. It has been found that siRNA duplexes bind the RNA-induced silencing complex (RISC) and cleave the sense strands with endonucleases. In this study, for the first time, we detected siRNA duplexes induced by plant viruses on a large scale using next-generation sequencing (NGS) data. In addition, we used the detected 21 nucleotide (nt) siRNA duplexes with 2 nt overhangs to construct a dataset for future data mining. The analytical results of the features in the detected siRNA duplexes were consistent with those from previous studies. The investigation of siRNA duplexes is useful for a better understanding of the RNA interference (RNAi) mechanism. It can also help to improve the virus detection based on the small RNA sequencing (sRNA-seq) technologies and to rationally design siRNAs for RNAi experiments.Entities:
Keywords: RNAi; plant virus; siRNA duplex; small RNA-seq; virus detection
Year: 2017 PMID: 28621718 PMCID: PMC5485527 DOI: 10.3390/genes8060163
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Viral sequences used in this study.
| ID | Description | Viral Read | Depth (bp) | Coverage | Proportion |
|---|---|---|---|---|---|
| JQ314457 | Pepino mosaic virus strain EU_CAHN8, complete genome | 8349 | 28.39 | 98.27% | 0.57% |
| JQ314458 | Pepino mosaic virus strain US1_CAHN8, complete genome | 11,651 | 39.28 | 99.50% | 0.89% |
| JQ314459 | Pepino mosaic virus strain EU_EF09_58, complete genome | 90,569 | 298.61 | 99.95% | 6.97% |
| JQ314460 | Pepino mosaic virus strain US1_EF09_58, complete genome | 21,374 | 71.95 | 99.84% | 0.51% |
| JQ314461 | Pepino mosaic virus strain EU_EF09_60, complete genome | 36,002 | 120.81 | 99.94% | 2.01% |
| JQ314462 | Pepino mosaic virus strain US1_EF09_60, complete genome | 47,776 | 160.46 | 99.92% | 4.60% |
| JQ314463 | Tomato necrotic stunt virus strain MX9354, complete genome | 207,553 | 439.85 | 100.00% | 12.72% |
| KT438549 | Southern tomato virus isolate CN-12, complete genome | 4508 | 27.93 | 99.10% | 0.64% |
| KT634055 | Southern tomato virus BD-13, complete genome | 2161 | 13.61 | 100.00% | 0.00% |
| KT810183 | Tomato mottle mosaic virus isolate NY-13, complete genome | 89,061 | 292.25 | 100.00% | 26.75% |
| KM504246 * | Tobacco streak virus isolate FL13-07 segment RNA1, complete sequence | 1,141,881 | 4515.53 | 100.00% | 21.77% |
| KM504247 * | Tobacco streak virus isolate FL13-07 segment RNA2, complete sequence | 635,836 | 3864.01 | 100.00% | 21.23% |
| KM504248 * | Tobacco streak virus isolate FL13-07 segment RNA3, complete sequence | 499,308 | 4252.48 | 100.00% | 21.67% |
| KR094068 * | Melon necrotic spot virus isolate ABCA13-01, complete genome | 969,376 | 4380.00 | 100.00% | 39.32% |
| KP772568 * | Cucumber green mottle mosaic virus isolate ABCA13-01, complete genome | 635,907 | 2113.00 | 100.00% | 33.50% |
| KP223323 * | Squash mosaic virus segment RNA-1, complete sequence | 1,340,402 | 4454.95 | 100.00% | 47.06% |
| KP223324 * | Squash mosaic virus segment RNA-2, complete sequence | 638,694 | 3808.92 | 100.00% | 48.22% |
ID are the GenBank accession numbers. Viral read represents the number of reads that can be aligned to this viral sequence using Bowtie software, allowing one mismatch. Depth (average depth) is calculated as the total number of nucleotides of the aligned reads divided by the read-covered positions on the reference genome. Coverage (genome coverage) represents the proportion of read-covered positions against the genome length. Proportion (21 nt duplex proportion) is calculated as the read count of 21 nt siRNA duplexes with 2 nt overhangs divided by the count of viral reads. * Seven sequences with higher average depths from four viruses were used to detect and analyze siRNA duplexes.
Figure 1Read-count distribution of small interfering RNA (siRNA) duplexes. (A) The read count of siRNA duplexes varies with the duplex length and the overhang length, using KP772568 as an example. (B) The median of read counts varies with the internal GC content using 21 nt siRNA duplexes with 2 nt overhangs from seven viral sequences.
Figure 2Distribution of viral reads along the reference genomes. The genome coverages of seven viral sequences calculated using aligned 21 nt siRNA duplexes with 2 nt overhangs are close to the genome coverages calculated using all aligned reads, using KP772568 as an example. The results of all 14 sequences can be seen in Supplementary File 1.
Figure 3Positive and negative-strands of siRNA duplexes. (A) This pattern shows symmetric read-count distribution of positive and negative strands in 21 nt siRNA duplexes with 2 nt overhangs, using KP223323 as an example. (B) This pattern shows read-count distribution biased to positive strands in 21 nt siRNA duplexes with 2 nt overhangs, using KP772568 as an example. The results of all seven sequences can be seen in Supplementary File 3.