| Literature DB >> 29444641 |
Raza-Ur Rahman1,2, Abhivyakti Gautam1, Jörn Bethune1,2, Abdul Sattar1,2, Maksims Fiosins1,2, Daniel Sumner Magruder1,2, Vincenzo Capece1, Orr Shomroni1, Stefan Bonn3,4,5.
Abstract
BACKGROUND: Small RNA molecules play important roles in many biological processes and their dysregulation or dysfunction can cause disease. The current method of choice for genome-wide sRNA expression profiling is deep sequencing.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29444641 PMCID: PMC5813365 DOI: 10.1186/s12859-018-2047-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
sRNA-seq web application comparison
| Feature | Oasis 2 | Oasis | omiRas | mirTools 2.0 | MAGI | Chimira | sRNAtoolbox |
|---|---|---|---|---|---|---|---|
| FASTQ compression | ✓ | ✓ | ✓ | ✓ | |||
| miRNA prediction | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
| miRNA modifications and edits | ✓ | ✓ | |||||
| Novel miRNA database | ✓ | ||||||
| Infection and cross-species analysis | ✓ | ✓ | |||||
| Non-model organism | ✓ | ✓ | |||||
| Differential expression | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Multivariate differential expression | ✓ | ✓ | ✓ | ||||
| Classification | ✓ | ✓ | |||||
| Novel miRNA target prediction | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| Pathway/GO analysis | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
| Batch job submission (API) | ✓ | ✓ | |||||
| Genome browser | ✓ |
Of note, this comparison does not include all available sRNA analysis web applications. It only considers the most recent web applications that we deemed most competitive and we do not compare to standalone software solutions that have to be locally installed
Fig. 1Detection of sRNAs in Oasis 2: The web application allows for the upload of raw or compressed FASTQ files to Oasis 2’s sRNA detection module. After pre-processing (adapter/barcode trimming and length filtering), reads are first aligned to target organism (TO) transcripts that are stored in Oasis-DB (Step 1), including known miRNAs, piRNAs, snoRNAs, snRNAs, rRNAs, and high-stringency predicted miRNAs and their families. Unmapped reads of Step1 are subsequently aligned to the TO’s genome (Step 2) to predict and subsequently store novel miRNAs in Oasis-DB. Unmapped reads from step 2 are mapped to bacterial, archaeal, and viral genomes using Kraken (Step 3) to detect potential pathogenic infections or contaminations. Finally, reads that could not be aligned in steps 1–3 are aligned to all non-target organism (NTO) miRNAs in miRBase (Step 4) to detect potentially orthologous or cross-species miRNAs. In case the user’s data does not correspond to one of the 14 supplied organisms, Oasis 2 aligns the reads only to NTO miRNAs (Step 4), supporting the detection of miRNA expression in any organism
Fig. 2Pathogen detection performance: To assess the performance of ‘pathogen detection module’, sRNA datasets with defined viral or bacterial infections were analyzed and the F-score (a), recall (b), and precision (c) of the pathogen predictions were measured for the top 10 reported organisms. Overall, the prediction of bacterial (M. abscessus) and viral (HIV, HHV4, HHV5, Gallid_herpesvirus_2) infections resulted in high F-scores, recall, and precision, especially when the top 5 predicted pathogen species are reported. In consequence, Oasis 2 currently reports the top five predicted pathogen species based on their read counts
Oasis 2 browser compatibility
| Browser | Version |
|---|---|
| Chrome | 61.0.3163.100, 62.0.3202.62 |
| Mozilla Firefox | 55.0.3, 56.0 (64-bit), 57.0 (64-bit) |
| Chromium | 62.0.3202.75 |
| Safari | 11.0.1 |
| Internet explorer | 11 |
Browsers that are used to test Oasis 2 functionalities
Runtime comparison of different sRNA-seq web applications
| Demo Dataset | Oasis 2 (total) 1 | Oasis (total)1 | MAGI | Chimira | omiRas | mirTools7 2.0 | sRNAtoolbox |
|---|---|---|---|---|---|---|---|
| AD | 8 h31m50s | 12h29m12s | NA2 | NA4 | NA5 | NA | NA |
| Psoriasis | 1h35m17s | 5h49m4s | 48h3 | 3h3m12s | NA6 | NA | NA |
| Renal Cancer | 31m43s | 1h8m41s | 8h3 | 47m11s | 9h31m | NA | NA |
1Run time estimate includes the data compression and decompression, the sRNA Detection, DE Analysis, and Classification. 2 We could not get MAGI to upload all AD files. Most probably it has a problem with the quality or format of one of the files. 3 These values were obtained from the MAGI website. 4 Chimira does not support the analysis of more than 25 files at a time, which prohibited us from getting runtime estimates for the AD dataset. 5 omiRas did not finish uploading files, which prohibited us from getting runtime estimates for the AD dataset. 6 omiRas http uploading error. 7 We cannot compare the runtime of mirTools 2.0 as maximum file size to upload is limited to 30 Mb. The sRNAtoolbox web application has been non-functional since 30/05/2017, which prohibited any runtime comparison (http://bioinfo2.ugr.es:8080/srnatoolbox/quick-start/)
Overlap of differentially expressed sRNAs using three datasets
| Statistic1 | Overlap2 | Validated overlap3 | FP overlap4 | |
|---|---|---|---|---|
| AD | Wilcoxon-Mann-Whitney | 60% | 75%(6/8)5 | 0% (0/2) |
| Psoriasis | Pearson’s chi-squared | 73% | 64% (7/11) | 0% (0/1) |
| Renal Cancer | edgeR [ | 76% | 80% (4/5) | NA |
| Schizophrenia | DESeq2 (Dejian et al., 2015) | 41% | 67%(2/3) | 0% (0/1) |
1Oasis 2 uses a negative binomial distribution as basis for its statistical evaluation of the differential expression. A very similar approach is taken by the edgeR package that has been used in the Renal Cancer study. The Psoriasis data was analyzed using a Pearson’s chi-squared test and the AD dataset was analyzed using the non-parametric Wilcoxon-Mann-Whitney test. Schizophrenia dataset used the same approach like Oasis 2. 2Overlap of differentially expressed miRNAs comparing Oasis 2’s results to published data. The percentage is calculated in reference to the shorter DE list. 3Overlap of differentially expressed miRNAs that have been validated independently in addition to the sRNA-seq experiment. 4False positive (FP) differentially expressed miRNAs detected by Oasis 2. 5Only known validated DE miRNAs are considered
Fig. 3Oasis 2′ (QC) outlier detection: To assess the QC of Oasis 2 and its biological relevance, sRNA Psoriasis data (demo dataset) was analyzed. PCA sample distances of psoriasis (green) and control (blue) is shown. (a) PCA of psoriasis and control samples showing a potentially mis-annotated (SRR330866_PP) and an outlier sample (SRR330860_PP). (b) PCA of psoriasis and control samples without misclassified/outlier samples. Removal of these two samples increased the number of significantly (adjusted p-value < 0.1) DE miRNAs from 195 to 256 cases and increased the AUC from 0.9 to 1 in the classification module, providing strong evidence for the utility of Oasis 2’ QC plots