| Literature DB >> 31681943 |
Xiangfu Zhong1, Albert Pla1, Simon Rayner1,2.
Abstract
MOTIVATION: The existence of complex subpopulations of miRNA isoforms, or isomiRs, is well established. While many tools exist for investigating isomiR populations, they differ in how they characterize an isomiR, making it difficult to compare results across different tools. Thus, there is a need for a more comprehensive and systematic standard for defining isomiRs. Such a standard would allow investigation of isomiR population structure in progressively more refined sub-populations, permitting the identification of more subtle changes between conditions and leading to an improved understanding of the processes that generate these differences.Entities:
Year: 2019 PMID: 31681943 PMCID: PMC7703784 DOI: 10.1093/bioinformatics/btz806
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Schematic of isomiR nomenclature used in Jasmine. (a) Left hand rectangle. isomiRs are classified into progressively more refined levels. From left to right. Level 1: miRNAs are classified into mature miRNAs (identical start and end position, and read sequence as found in miRBase) and isoforms; Level 2: Isoforms are sub-classified as ‘polymorphic’ (poly), ‘same length’ (sl), ‘longer’ (lr) and ‘shorter’ (sr) isomiRs, where Same length isomiRs have the same length as parent mature miRNAs, but are shifted in the 5′ or 3′ direction. Level 3: IsomiRs are further categorized in terms of position and modification type; i.e. deletion, extension and substitution. 5 and 3: 5 and 3 prime-end respectively. e: extension; d: deletion; p: polymorphism in region overlapping mature miRNA; m: mixture. Right hand rectangle shows specific example for each case. (b) Summary of the different isomiR classification levels used in the analysis of the (Nogales-Cadenas, 2016) dataset. Jasmine generates count table in five different tab-separated files in tsv format, ‘Level 0’, ‘Level 1’, ‘Level 2’, ‘Level 3’ and ‘Level 4’. See Supplementary Table S4 for detailed explanations of the abbreviations used for each isomiR sub-classes. In Level 4, templated and non-templated information is provided, as well as matched SNVs. ‘N > M’ is used to describe nucleotide modifications in Level 4, where N = M means templated, N! = M means non-templated and N > D indicates deletion. Nucleotide modifications occurring in the seed region (2–8nt), are be tagged with ‘seed’ flag
Comparison of isomiR nomenclature between Jasmine and other selected tools
| Demo sequence | Methods | isomiR nomenclature |
|---|---|---|
| UAGCUUAUCAGAC UGAUGUUGA | Jasmine | hsa-miR-21-5p: mature |
| mirAligner | hsa-miR-21-5p.ref | |
| Binarized isomiR | hsa-miR-21-5p|0|0 | |
| isomiR-SEA | hsa-miR-21-5p mirna_exact | |
| miRNA-MATE | hsa-miR-21-5p canonical miRNA | |
| UGGCUUAUCAGAC UGAUGUUGA | Jasmine | hsa-miR-21-5p: poly |
| mirAligner | & hsa-miR-21-5p.mis | |
| Binarized isomiR | hsa-miR-21-5p|0|0 | |
| isomiR-SEA | not as isomiR | |
| miRNA-MATE | hsa-miR-21-5p substitution only isomiR | |
| GUAGCUUAUCAGA CUGAUGUUGA | Jasmine | hsa-miR-21-5p: lr-5e |
| mirAligner | not as isomiR | |
| Binarized isomiR | hsa-miR-21-5p|+1|0 | |
| isomiR-SEA | hsa-miR-21-5p iso_5p-only | |
| miRNA-MATE | hsa-miR-21-5p start-site only isomiR | |
| UAGCUUAUCAGAC UGAUGUUGA | Jasmine | hsa-miR-21-5p: lr-3e |
| mirAligner | hsa-miR-21-5p.ad.C | |
| Binarized isomiR | hsa-miR-21-5p|0|+1 | |
| isomiR-SEA | hsa-miR-21-5p iso_3p-only | |
| miRNA-MATE | hsa-miR-21-5p end-site only isomiR | |
| UAGCUUAUCAGAC UGAUGUUG | Jasmine | hsa-miR-21-5p: sr-3d |
| mirAligner | hsa-miR-21-5p.t3.a | |
| Binarized isomiR | hsa-miR-21-5p|0|-1 | |
| isomiR-SEA | hsa-miR-21-5p iso_3p-only | |
| miRNA-MATE | hsa-miR-21-5p end-site only isomiR | |
| UAGCUUAUCAGAC UGAUGUU | Jasmine | hsa-miR-21-5p: sr-3d |
| mirAligner | hsa-miR-21-5p.t3.ga | |
| Binarized isomiR | hsa-miR-21-5p|0|-2 | |
| isomiR-SEA | hsa-miR-21-5p iso_3p-only | |
| miRNA-MATE | hsa-miR-21-5p end-site only isomiR |