| Literature DB >> 26635395 |
Christina Backes1, Benjamin Meder2, Martin Hart3, Nicole Ludwig3, Petra Leidinger3, Britta Vogel2, Valentina Galata1, Patrick Roth4, Jennifer Menegatti5, Friedrich Grässer5, Klemens Ruprecht6, Mustafa Kahraman1, Thomas Grossmann1, Jan Haas2, Eckart Meese3, Andreas Keller7.
Abstract
Small non-coding RNAs play a key role in many physiological and pathological processes. Since 2004, miRNA sequences have been catalogued in miRBase, which is currently in its 21st version. We investigated sequence and structural features of miRNAs annotated in the miRBase and compared them between different versions of this reference database. We have identified that the two most recent releases (v20 and v21) are influenced by next-generation sequencing based miRNA predictions and show significant deviation from miRNAs discovered prior to the high-throughput profiling period. From the analysis of miRBase, we derived a set of key characteristics to predict new miRNAs and applied the implemented algorithm to evaluate novel blood-borne miRNA candidates. We carried out 705 individual whole miRNA sequencings of blood cells and collected a total of 9.7 billion reads. Using miRDeep2 we initially predicted 1452 potentially novel miRNAs. After excluding false positives, 518 candidates remained. These novel candidates were ranked according to their distance to the features in the early miRBase versions allowing for an easier selection of a subset of putative miRNAs for validation. Selected candidates were successfully validated by qRT-PCR and northern blotting. In addition, we implemented a web-server for ranking potential miRNA candidates, which is available at:www.ccb.uni-saarland.de/novomirank.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26635395 PMCID: PMC4824081 DOI: 10.1093/nar/gkv1335
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Overview of the numbers of human precursors with two annotated miRNAs and their first occurrence in miRBase
| miRBase version | Number new precursors | Set | Numbers in set |
|---|---|---|---|
| 1 | 88 | I | 185 |
| 2 | 50 | ||
| 3 | 34 | ||
| 4 | 13 | ||
| 5 | 12 | II | 96 |
| 6 | 3 | ||
| 7 | 81 | ||
| 8 | 52 | III | 125 |
| 9 | 8 | ||
| 10 | 35 | ||
| 11 | 30 | ||
| 12 | 8 | IV | 106 |
| 13 | 0 | ||
| 14 | 7 | ||
| 15 | 47 | ||
| 16 | 44 | ||
| 17 | 141 | V | 218 |
| 18 | 49 | ||
| 19 | 28 | ||
| 20 | 200 | VI | 202 |
| 21 | 2 |
Figure 1.For different features, the distribution across different miRBase versions is presented as Box-Whisker plots. The novel miRNAs discovered in our study are included in grey. The miRBase version sets are numbered according to Table 1.
Figure 2.Principal Component Analysis. The early miRBase versions in red (sets I + II, see also Table 1) fit well to each other and show a central cluster. The middle versions of the miRBase in blue (III + IV) still fit nicely to these initial miRNA precursors, while the newer versions (V + VI) and the miRDeep predicted precursors in green and grey scatter at the edge of the distribution. POV: proportion of variance.
Figure 3.Histogram blot of the absolute value of average z-scores from early versions of miRBase. With increasing version the distance from the initial miRNA precursors increases significantly.
Figure 4.Selected examples of secondary structures for miRNA precursors having a good score in our ranked list. Each panel presents one miRNA precursor along with the 5p- and 3p-miRNA in orange and blue. Additionally, the mature sequences and the overall distance (score) from the reference distribution (miRBase v1–7) is provided, as well as the summarized normalized base counts (per 10 million reads) over all samples for these precursors are illustrated.
Figure 5.Selected examples of secondary structures for miRNA precursors having a bad score in our ranked list. Each panel presents one miRNA precursor along with the 5p- and 3p-miRNA in orange and blue. Additionally, the mature sequences and the overall distance (score) from the reference distribution (miRBase v1–7) is provided, as well as the summarized normalized base counts (per 10 million reads) over all samples for these precursors are illustrated. Panel A shows the most divergent miRNA precursor according to our score. Panel B shows the miRNA precursor with overall highest length.
Figure 6.Validation of novel miRNAs by qRT-PCR and northern blots. Panel A shows amplification products of qRT-PCR in three RNA Pools (P1-P3) on Bioanalyzer DNA 1000 Chip, Panel B on conventional 3% agarose gels. Negative controls included a no template control for reverse transcription (NTRT), a RT reaction without enzyme (RT-) and a no template PCR control for each specific primer (NTC). As the used qRT-PCR system depends on poly-adenylation at the 3′ end of mature miRNAs followed by reverse transcription using an oligo-dT primer that includes a universal tag sequence for the qPCR, amplification products of mature miRNAs are ≈80–95 bps depending on the number of A′s added to the miRNA sequence. The ladder bands shown represent 50 and 100 bps. For 11 miRNAs specific bands at 80–90 bps could be detected. All PCR products were subcloned into pGEM and Sanger sequenced (see Supplementary Table S4) to verify specific amplification of novel miRNAs. Panels C and D show northern blots detecting mature miRs-1005–5p (C) and -3p (D) with sequence specific radio-labelled probes (left side) in HEK293T cells transfected with pSG5 vector with inserted mir-1005 precursor sequence. The right size of the novel mature miRNAs was confirmed by the stripping and rehybridization of both nylon membranes with specific radio-labelled probes of the high confident miR-20a-5p (right side). Loading control demonstrates equal RNA amounts in all lanes.
Experimentally validated novel miRNAs
| Novel miRNA name | Sequence | Performed experiments |
|---|---|---|
| miR-1005–5p | AGAGGCUGAAUUCCCAGUGAGU | qRT-PCR, Northern blot, Sanger |
| miR-1005–3p | AUUCGCUGGGAAUUCAGCCUCU | qRT-PCR, Northern blot, Sanger |
| miR-1037–5p | CAGGGAUCAGAUCUUAUGUUC | qRT-PCR, Sanger |
| miR-1037–3p | CUAGCAUAAGACUGAUCCUCC | qRT-PCR, Sanger |
| miR-1002–5p | AUCCCCAGGAAAAUGCCUUUU | qRT-PCR, Sanger |
| miR-1002–3p | AAAGGCUUUUUCCUUGGGGGUU | qRT-PCR, Sanger |
| miR-2317–3p | UAUCCUCCAGUAGACUAGGGAG | qRT-PCR, Sanger |
| miR-375–5p | UUAGCCAGAGUUCCUAAAGAAA | qRT-PCR, Sanger |
| miR-1570–3p | UAAAUUAUCUGAGCCCCAGGA | qRT-PCR, Sanger |
| miR-1331–5p | CCCAAACCUUGUCUGGACAUGG | qRT-PCR, Sanger |
| miR-608–5p | AAAGCAAAUGUUGGGUGAACGG | qRT-PCR, Sanger |