| Literature DB >> 26434581 |
Jieming Shi1, Min Dong1, Lei Li1, Lin Liu1, Agustin Luz-Madrigal1,2, Panagiotis A Tsonis2, Katia Del Rio-Tsonis1, Chun Liang1,3.
Abstract
Being involved in many important biological processes, miRNAs can regulate gene expression by targeting mRNAs to facilitate their degradation or translational inhibition. Many miRNA sequencing studies reveal that miRNA variations such as isomiRs and "arm switching" are biologically relevant. However, existing standalone tools usually do not provide comprehensive, detailed information on miRNA variations. To deepen our understanding of miRNA variability, we developed a new standalone tool called "mirPRo" to quantify known miRNAs and predict novel miRNAs. Compared with the most widely used standalone program, miRDeep2, mirPRo offers several new functions including read cataloging based on genome annotation, optional seed region check, miRNA family expression quantification, isomiR identification and categorization, and "arm switching" detection. Our comparative data analyses using three datasets from mouse, human and chicken demonstrate that mirPRo is more accurate than miRDeep2 by avoiding over-counting of sequence reads and by implementing different approaches in adapter trimming, mapping and quantification. mirPRo is an open-source standalone program (https://sourceforge.net/projects/mirpro/).Entities:
Mesh:
Substances:
Year: 2015 PMID: 26434581 PMCID: PMC4592965 DOI: 10.1038/srep14617
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The core algorithms of mirPRo for exploration and quantification of miRNA variants.
(A) IsomiR identification. Human precursor miRNA “hsa-let-7a-1” is used for illustration, and the collapsed reads are not real data. mirPRo allows base errors (mismatch and indel) and soft clips in read mapping, and permits position check and optional seed region check in mature miRNA quantification. The isomiRs annotated by mirPRo include mature miRNA reads with mismatches, insertions, deletions, or a mixture, with 3′-end non-templated nucleotide addition, and with nucleotide shift (super or sub) at their 5′, 3′ or both ends. Mature miRNA variants: “5 (3) super (sub)” means the reads have 5 (3) end upstream (downstream) nucleotide shift in collapsed-read-to-hairpin mappings. The upper case “D” in the aligned sequence means deletion. The column “collapsed read” has the identifier (“XXX-YYY”) for collapsed reads, where “XXX” is a unique number and “YYY” is the read count. The column “M/I/D/N” represents the number of "mismatches/insertions/deletions/nucleotide N" in the alignment. For the two hairpin arms, most of the collapsed reads are mapped to the 5′ arm while few reads are mapped to the 3′ arm. (B) Arm switching detection. More reads are mapped to the 5′ arm of the precursor in treatment 1, while more reads are mapped to the 3′ arm in treatment 2. This indicates that two different mature miRNAs are generated from two different arms of the same precursor in two different treatments (e.g., different tissues).
Figure 2The comparative analyses of mature miRNAs differential expression profiles between miRDeep2 and mirPRo using three datasets.
The y-axis represents the number of mature miRNAs. In the x-axis, we have 8 different cases for comparative differential expression analyses of mature miRNAs detected by miRDeep2 and mirPRo. Each case is labeled as (A/B), where A stands for miRDeep2 result and B for mirPRo result, and both A and B can be one of these values: + (up-regulated significantly), - (down-regulated significantly) and Δ (no significant differential expression). For example, the case (+/+) means that a mature miRNAs is significantly up-regulated, reported by both miRDeep2 and mirPRo.