| Literature DB >> 27167008 |
Eduardo Andrés-León1, Rocío Núñez-Torres1, Ana M Rojas1.
Abstract
Large-scale RNAseq has substantially changed the transcriptomics field, as it enables an unprecedented amount of high resolution data to be acquired. However, the analysis of these data still poses a challenge to the research community. Many tools have been developed to overcome this problem, and to facilitate the study of miRNA expression profiles and those of their target genes. While a few of these enable both kinds of analysis to be performed, they also present certain limitations in terms of their requirements and/or the restrictions on data uploading. To avoid these restraints, we have developed a suite that offers the identification of miRNA, mRNA and circRNAs that can be applied to any sequenced organism. Additionally, it enables differential expression, miRNA-mRNA target prediction and/or functional analysis. The miARma-Seq pipeline is presented as a stand-alone tool that is both easy to install and flexible in terms of its use, and that brings together well-established software in a single bundle. Our suite can analyze a large number of samples due to its multithread design. By testing miARma-Seq in validated datasets, we demonstrate here the benefits that can be gained from this tool by making it readily accessible to the research community.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27167008 PMCID: PMC4863143 DOI: 10.1038/srep25749
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1miARma-Seq pipeline workflow.
An overview of the modular design of the pipeline. Main modules are indicated by gray background. Output files are indicated by purple background.
Performance of miARma-Seq tool.
| Type | EXP | Sample | Scope | Reads | T | Q | PRE | ALIGN | SUMM |
|---|---|---|---|---|---|---|---|---|---|
| miRNA | GSE47602 | SRR873382 | D | 34686701 | 19 | 2 | 6 (Reaper) | 8 (Bowtie 1) | 3 (FeatureCounts) |
| miRNA | GSE47603 | SRR873383 | I, dN | 34686701 | 48 | 2 | <1 (mirDeep2) | 45 (mirDeep2 uses Bowtie 1 for the alignment) | |
| mRNA | GSE52778 | SRR1039508 | mD | 45871042 | 172 | 6 | 161 (TopHat/Bowtie 2) | 5 (FeatureCounts) | |
| circRNAs | GSE49321 | SRR1051292 | cD | 6767745 | 48 | 3 | 35 (BWA) | 10 (CIRI) | |
The analyses have been performed in an average computer with 8 GB of RAM memory, 1.7 GHz CPU, and 4 threads. EXP: Experiment identifier. SCOPE: D stands for “detection” of known miRNAS, ID indicates “identification”, dN indicates “de novo” prediction of miRNAS, mD indicates “detection” of mRNAS, cD indicates “detection” of circRNAS. Reads: Number of reads per sample. T: Total analyses time, Q: Quality analyses time (FastQC). PRE: Preprocessing time (Software used). ALIGN: Alignment time (Software used). SUMM: Summarization of read counts time (Software used).
*Time in minutes.
**The use of pre-processing stage will depend on the sequencing process.