| Literature DB >> 29733404 |
Pavel P Kuksa1,2, Alexandre Amlie-Wolf1,2,3, Živadin Katanic1,2, Otto Valladares1,2, Li-San Wang1,2,3,4, Yuk Yee Leung1,2.
Abstract
The introduction of new high-throughput small RNA sequencing protocols that generate large-scale genomics datasets along with increasing evidence of the significant regulatory roles of small non-coding RNAs (sncRNAs) have highlighted the urgent need for tools to analyze and interpret large amounts of small RNA sequencing data. However, it remains challenging to systematically and comprehensively discover and characterize sncRNA genes and specifically-processed sncRNA products from these datasets. To fill this gap, we present Small RNA-seq Portal for Analysis of sequencing expeRiments (SPAR), a user-friendly web server for interactive processing, analysis, annotation and visualization of small RNA sequencing data. SPAR supports sequencing data generated from various experimental protocols, including smRNA-seq, short total RNA sequencing, microRNA-seq, and single-cell small RNA-seq. Additionally, SPAR includes publicly available reference sncRNA datasets from our DASHR database and from ENCODE across 185 human tissues and cell types to produce highly informative small RNA annotations across all major small RNA types and other features such as co-localization with various genomic features, precursor transcript cleavage patterns, and conservation. SPAR allows the user to compare the input experiment against reference ENCODE/DASHR datasets. SPAR currently supports analyses of human (hg19, hg38) and mouse (mm10) sequencing data. SPAR is freely available at https://www.lisanwanglab.org/SPAR.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29733404 PMCID: PMC6030839 DOI: 10.1093/nar/gky330
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The SPAR workflow. Given a reference genome and input small RNA-seq dataset (custom or reference data), SPAR processes the small RNA-seq dataset and identifies sncRNA loci using unsupervised segmentation. sncRNA loci are grouped into the major small RNA classes or the novel unannotated category (total of 10 classes) and annotated with various genomic features. The SPAR results are summarized in interactive tables at the per locus and genome-wide level and are compared with reference ENCODE and DASHR tissues and cell lines. All results are available for download in a variety of formats.
Figure 2.SPAR inputs. SPAR can be used to interactively process and analyze small RNA sequencing datasets. The user can select the input dataset from one of the publicly available DASHR/ENCDODE datasets (‘Analyze public datasets’, left box) or provide their own custom data via web-accessible URL or direct upload (‘Analyze your own data’, right box). SPAR parameters can be specified under ‘Additional analysis options’. Genomic regions of interest can be provided by the user under ‘Specify regions of interest’ section.
Figure 3.SPAR outputs: (A) results can be downloaded easily as a single file (ZIP archive) or report (PDF). Individual files are available for download under ‘Download results’ section on the report page; (B) summary of the overall composition of the small RNA-seq data; (C) SPAR allows user to view, filter and download selected loci using the interactive sncRNA table; (D) SPAR provides overview of expression patterns across all loci in the small RNA sequencing data; (E) SPAR displays the variety of specific processing patterns across all loci in the small RNA sequencing data; (F) SPAR provides direct comparison of user data against ENCODE/DASHR reference tissues (RPM expression values).