| Literature DB >> 25822230 |
Suvi Asikainen1, Liisa Heikkinen2, Juuso Juhila3, Frida Holm4, Jere Weltner3, Ras Trokovic3, Milla Mikkola3, Sanna Toivonen3, Diego Balboa3, Riina Lampela4, Katherine Icay3, Timo Tuuri3, Timo Otonkoski5, Garry Wong6, Outi Hovatta4.
Abstract
Small RNA molecules, including microRNAs (miRNAs), play critical roles in regulating pluripotency, proliferation and differentiation of embryonic stem cells. miRNA-offset RNAs (moRNAs) are similar in length to miRNAs, align to miRNA precursor (pre-miRNA) loci and are therefore believed to derive from processing of the pre-miRNA hairpin sequence. Recent next generation sequencing (NGS) studies have reported the presence of moRNAs in human neurons and cancer cells and in several tissues in mouse, including pluripotent stem cells. In order to gain additional knowledge about human moRNAs and their putative development-related expression, we applied NGS of small RNAs in human embryonic stem cells (hESCs) and fibroblasts. We found that certain moRNA isoforms are notably expressed in hESCs from loci coding for stem cell-selective or cancer-related miRNA clusters. In contrast, we observed only sparse moRNAs in fibroblasts. Consistent with earlier findings, most of the observed moRNAs derived from conserved loci and their expression did not appear to correlate with the expression of the adjacent miRNAs. We provide here the first report of moRNAs in hESCs, and their expression profile in comparison to fibroblasts. Moreover, we expand the repertoire of hESC miRNAs. These findings provide an expansion on the known repertoire of small non-coding RNA contents in hESCs.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25822230 PMCID: PMC4378994 DOI: 10.1371/journal.pone.0116668
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Read counts and percentages compared to raw data in total of seven small RNA-seq libraries during bioinformatics workflow from raw reads to miRNA and moRNA sequences.
| Reads | HS401 | HS181 | HFF-1 | H9 | EB | H1 | Spont. diff. |
|---|---|---|---|---|---|---|---|
| Original lib | 10.5M | 11.1M | 11.4M | 6.1M | 6.0M | 0.28M | 0.14M |
| Cleaned data | 6.1M (59%) | 8.4M (75%) | 9.3M (82%) | 4.8M (79%) | 4.5M (75%) | 0.22M (79%) | 0.11M (79%) |
| Genome mapping | 5.6M (53%) | 7.5M (68%) | 8.6M (75%) | 4.5M (74%) | 3.9M (65%) | 0.20M (72%) | 0.11M (76%) |
| miRNA hp mapping | 2.2M (21%) | 3.6M (32%) | 6.6M (58%) | 1.2M (19%) | 1.2M (20%) | 0.19M (68%) | 0.10M (71%) |
| moR-5p mapping | 3145 (0.030%) | 3469 (0.031%) | 983 (0.009%) | 1412 (0.023%) | 944 (0.016%) | 171 (0.061%) | 45 (0.032%) |
| moR-3p mapping | 1961 (0.019%) | 2806 (0.025%) | 41 (<0.001%) | 982 (0.016%) | 855 (0.014%) | 34 (0.015%) | 15 (0.011%) |
Reads were aligned to reference sequences by allowing two mismatches.
Lists of 15 most overexpressed miRNAs in hESC lines when compared to HFF-1 foreskin fibroblast by statistical computing using DEseq algorithm.
| miRBase name v18 | HS401 reads (rpm) | HS181 reads (rpm) | HFF-1 reads (rpm) | P value | Location |
|---|---|---|---|---|---|
| hsa-miR-302a-5p | 16380,51 | 23920,20 | 18,15 | 8,39E-31 | miR-302 cluster |
| hsa-miR-363-3p | 8482,25 | 6651,71 | 4,71 | 2,89E-30 | miR-106a-363 |
| hsa-miR-302d-3p | 5134,52 | 5504,53 | 3,30 | 1,98E-29 | miR-302 cluster |
| hsa-miR-302a-3p | 7597,99 | 6680,41 | 5,18 | 1,63E-29 | miR-302 cluster |
| hsa-miR-302b-3p | 6556,53 | 6090,26 | 4,83 | 5,23E-29 | miR-302 cluster |
| hsa-miR-302c-3p | 5399,63 | 4615,92 | 4,12 | 5,29E-28 | miR-302 cluster |
| hsa-miR-372 | 5522,03 | 1845,79 | 2,36 | 1,40E-27 | miR-371/372/373 cluster |
| hsa-miR-371a-5p | 6083,84 | 3410,09 | 4,95 | 9,90E-27 | miR-371/372/373 cluster |
| hsa-miR-200c-3p | 2240,66 | 2437,46 | 2,47 | 9,51E-25 | miR-200c/miR141 |
| hsa-miR-367-3p | 1111,78 | 997,75 | 0,35 | 3,35E-24 | miR-302 cluster |
| hsa-miR-20b-5p | 598,48 | 504,74 | 0,24 | 1,63E-20 | miR-106a-363 |
| hsa-miR-302b-5p | 539,25 | 809,27 | 0,71 | 4,26E-20 | miR-302 cluster |
| hsa-miR-20b-3p | 508,98 | 506,95 | 0,47 | 5,32E-19 | miR-106a-363 |
| hsa-miR-106a-5p | 434,71 | 389,66 | 0,24 | 8,18E-19 | miR-106a-363 |
| hsa-miR-302d-5p | 275,26 | 331,99 | 0,12 | 5,27E-18 | miR-302 cluster |
Normalized read count (RPM), P-value and the name of surrounding miRNA cluster (location) are shown for each miRNA. Relative expression analysis was made using reads mapping to the genome allowing 2 mismatches.
New mature miRNA sequences found from known miRNA hairpin precursors.
| Novel miRNA | Sequence | HS401 | HS181 | HFF-1 |
|---|---|---|---|---|
| hsa-miR-135a-2-3p | TGTAGGGATGGAAGCCATGAA | 152.30 | 121.56 | 0 |
| hsa-miR-137-5p | ACGGGTATTCTTGGGTGGATAA | 0.4 | 0.4 | 3.4 |
| hsa-miR-1912-5p | CTCATTGCATGGGCTGTGTATA | 1.9 | 1.2 | 0 |
| hsa-miR-498-3p | AAAGCACCTCCAGAGCTTGAAGC | 2.6 | 1.8 | 0 |
| hsa-miR-519a-2-5p/has-miR-520b-5p | CCTCTACAGGGAAGCGCTTTC | 1.5 | 1.8 | 0.1 |
| hsa-miR-520e-5p | CTCAAGATGGAAGCAGTTTCTG | 3.0 | 1.1 | 0 |
| hsa-miR-526a-1-3p | GAAAGCGCTTCCTTTTAGAGGA | 5.8 | 3.2 | 0 |
| hsa-miR-549a-5p | AGCTCATCCATAGTTGTCACTG | 0.2 | 0.6 | 6.1 |
The sequence and the normalized read count (RPM) of most abundant isomiR are shown in the table for each library.
Novel miRNA hairpins found, their dot-bracket—notation, genomic loci and conservation.
| hairpin | chr | strand | start | end | hairpin sequence | HS401 | HS181 | HFF1 | Cons. | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| name | 5p | 3p | 5p | 3p | 5p | 3p | ||||||
| hsa-mir-10522 | 5 | - | 17156011 | 17156121 |
| 14 | 0 | 9 | 0 | 0 | 0 | no |
| ....(((((((((((.((((.....................)))).)))))))))))...... | ||||||||||||
| hsa-mir-10523 | 5 | + | 172342066 | 172342178 |
| 25 | 0 | 50 | 0 | 8 | 0 | no |
| ..(((.((.....(((.(((.(((((((.(((((........))))))))))))))).)))......))))).... | ||||||||||||
| hsa-mir-9983 | 6 | + | 39042066 | 39042177 |
| 0 | 12 | 0 | 17 | 0 | 0 | yes |
| ((((((((((((((.(((((((...(((((......))).))...))))))))))))))))))))).. | ||||||||||||
| hsa-mir-10524 | 6 | - | 79248939 | 79249046 |
| 48 | 0 | 167 | 0 | 0 | 0 | no |
| ((((((.(((..(((((......)).)))..))).....)))))).. | ||||||||||||
| hsa-mir-10525 | 7 | + | 74010263 | 74010373 |
| 0 | 11 | 0 | 4 | 0 | 3 | no |
| .((((((((.((((.(((((((((......)))))))))..))))...))))))))... | ||||||||||||
| hsa-mir-10526 | 11 | - | 122022791 | 122022900 |
| 0 | 2 | 0 | 2 | 0 | 23 | no |
| (((((((((((((((((....((.......))...)))))))))))))).))). | ||||||||||||
| hsa-mir-10527 | 12 | + | 64217423 | 64217534 |
| 8 | 0 | 24 | 0 | 83 | 0 | yes |
| ..(((((..((.(((((...((((...............))))...))))).))..))))).... |
Mature miRNAs are indicated by caps and underlined, and the raw number of 5p and 3p reads found in each library are shown.
moRNA frequencies (moR count / millions of exactly mapped reads) in four different human hESC libraries, two day 15 libraries (EB and adherent, spontaneously differentiated) and HFF-1.
| moR name | most abundant isomoR sequence | HS401 | HS141 | H9 | H1 | HFF-1 | EB | spont.diff |
|---|---|---|---|---|---|---|---|---|
| hsa-moR-367-3p | TGGATTGTTAAGCCAATGACAGAA | 196.6 | 209.3 | 19.4 | - | 0.1 | 2.4 | - |
| hsa-moR-103a-2-3p | AAGAACCAAGAATGGGCTGC | 24.5 | 23.5 | 14.6 | - | 0.7 | 17.5 | - |
| hsa-moR-92a-1-3p | TGAGTTTGGTGGGGATTGTGACCAGAAGA | 15.8 | 19.9 | - | - | 0.1 | - | - |
| hsa-moR-16-2-3p | GTGTGACAGGGATACAGCAAC | 3.6 | 3.4 | 1.3 | - | - | - | - |
| hsa-moR-21-3p | TGACATTTTGGTATCTTTCA | 3.1 | 2 | 0.4 | - | - | - | - |
| hsa-moR-221-3p | AGGCTACCTGGAAACATGTTCTCC | 2.4 | 2.4 | - | - | - | - | - |
| hsa-moR-103a-1-3p | AGGCATTGAGACCTGTTCT | 2.2 | 1.6 | 1.3 | - | 0.1 | - | - |
| hsa-moR-20b-3p | ACTCTTGGATAACAAA | 1.7 | 2.1 | 0.9 | - | - | - | - |
| hsa-moR-421-5p | ATCCGGTGCACATTGTAGGC | 26.9 | 23 | 2.2 | - | 0.3 | - | - |
| hsa-moR-21-5p | TGTACCACCTTGTCGGG | 24.5 | 12.2 | 3.9 | - | 9.7 | 2.4 | 137.9 |
| hsa-moR-302a-5p | CAAGACTGGGCTCCCCACC | 14.9 | 15 | 31.4 | 70.9 | - | - | - |
| hsa-moR-363-5p | ATGATCTGTTTTGCTGTTG | 14.2 | 11.7 | 4.7 | 4.7 | 0.1 | 6.3 | - |
| hsa-moR-367-5p | CTTGGCTACAGGCCAT | 13.9 | 31.8 | 2.6 | - | - | 3.9 | - |
| hsa-moR-517a-5p | GAAGATCTCAGGCAGTGAC | 12.5 | 9.8 | 2.2 | - | - | 1.9 | - |
| hsa-moR-517c-5p | GAAGATCTCAGGCAGTGAC | 12.5 | 9.8 | - | - | - | 1.9 | - |
| hsa-moR-296-5p | AGGAGAAAGGACCCTTCCA | 10.3 | 10.8 | - | - | 0.3 | - | - |
| hsa-moR-17-5p | TGACCAGTCAGAATAATG | 9.6 | 3.8 | - | - | - | 1 | - |
| hsa-moR-20a-5p | ATGTGACAGCTTCTGTAGCAC | 9.4 | 8 | 0.9 | - | - | 1.9 | - |
| hsa-moR-33a-5p | CCTGGCGGGCAGCTGTG | 9.4 | 5.4 | 14.2 | - | 2.5 | 5.4 | 9.2 |
| hsa-moR-515-1-5p | AGAAGATCTCATGCAGTCA | 9.4 | 2.6 | 0.4 | - | - | - | - |
| hsa-moR-515-2-5p | AGAAGATCTCATGCAGTCA | 9.4 | 2.6 | 0.4 | - | - | - | - |
| hsa-moR-519e-5p | AGAAGATCTCATGCAGTCA | 9.4 | 2.6 | 0.4 | - | - | - | - |
| hsa-moR-1283-2-5p | AGAAGATCTCAAGCTGTGA | 6.7 | 6.7 | 32.3 | - | - | 9.7 | - |
| hsa-moR-525-5p | AGAAGATCTCAAGCTGTGA | 6.7 | 6.7 | 32.3 | - | - | 9.7 | - |
| hsa-moR-518a-1-5p | AGAAGATCTCAAGCTGTGA | 6.7 | 6.7 | 32.3 | - | - | 9.7 | - |
| hsa-moR-520d-5p | AGAAGATCTCAAGCTGTGA | 6.7 | 6.7 | 32.3 | - | - | 9.7 | - |
| hsa-moR-527-5p | AGAAGATCTCAAGCTGTGA | 6.7 | 6.7 | 32.3 | - | - | 9.7 | - |
| hsa-let-7f-2-5p | ACACTGGTGCTCTGTGGGA | 6.7 | 5.5 | 11.2 | 9.5 | 0.4 | 5.4 | 9.2 |
| hsa-moR-625-5p | TGGTAAGGGTAGAGGGATG | 6.7 | 4.7 | - | - | 0.1 | - | - |
| hsa-moR-103a-2-5p | CTGCGTCTTTGTGCTT | 6.5 | 3.1 | 4.3 | 9.5 | 1.6 | 1.5 | 46 |
| hsa-moR-125a-5p | ATGTTGCCAGTCTCTAGG | 5.8 | 3.8 | 0.4 | - | 1.3 | - | - |
| hsa-moR-302b-5p | TGTTGGGTGGGCTCCCTTCA | 5.3 | 4.2 | 8.6 | 113.5 | - | 2.4 | - |
| hsa-moR-4521-5p | GTGGGTTCGAATCCCATCCTCGTCGG | 4.3 | 3.1 | 23.2 | 4.7 | 1.3 | 17.5 | - |
| hsa-moR-221-5p | TGAACATCCAGGTCTGGGGCATGA | 4.3 | 1.3 | - | - | 0.3 | - | - |
| hsa-moR-516a-2-5p | AGAAGATCTCAGGTTGTGACC | 3.8 | 2.1 | 0.4 | - | - | 1 | - |
| hsa-moR-519a-1-5p | AGAAGACCTCAGGCTGTGAC | 3.4 | 1 | 1.3 | - | - | - | - |
| hsa-moR-324-5p | CTGAGCTGACTATGCCTCCC | 3.4 | 0.8 | - | - | 0.1 | 0.5 | - |
| hsa-moR-197-5p | GGAATCTGTGCTCTGGGGGCTGTGC | 2.6 | 2.3 | 2.6 | - | 0.4 | 2.4 | 9.2 |
| hsa-moR-524-5p | AGAAGATCTCATGCTGTGACC | 2.6 | 1.5 | 1.3 | - | - | - | - |
| hsa-moR-518f-5p | AGAAGATCTCATGCTGTGACC | 2.6 | 1.5 | 1.3 | - | - | - | - |
| hsa-moR-302d-5p | ATCTGTTAAGGGGCCCCCTC | 2.4 | 1.8 | 3 | 4.7 | - | - | - |
| hsa-moR-128-1-5p | GTTCCTGAGCTGTTGGAT | 2.2 | 3.4 | - | - | 0.1 | 0.5 | - |
| hsa-moR-876-5p | ACAAACTGTGAAGTGCTGTG | 2.2 | 3.1 | 1.7 | - | - | - | - |
| hsa-moR-519a-2-5p | AGAAGATCTCAGGCTGTG | 1.9 | 3.4 | 4.3 | - | - | 1 | - |
| hsa-moR-522-5p | AGAAGATCTCAGGCTGTG | 1.9 | 3.4 | 4.3 | - | - | 1 | - |
| hsa-mor-520a-5p | AGAAGATCTCAGGCTGTG | 1.9 | 3.4 | 4.3 | - | - | 1 | - |
| hsa-moR-521-1-5p | AGAAGATCTCAGGCTGTG | 1.9 | 3.4 | 4.3 | - | - | 1 | - |
| hsa-moR-18a-5p | ATGTTGAGTGCTTTTTGT | 1.9 | 2.8 | 0.9 | - | - | 0.5 | - |
| hsa-moR-106b-5p | CGCTCCAGCCCTGCCGGGGC | 1.9 | 1.6 | - | - | - | - | - |
| hsa-moR-370-5p | GGGGCACAAGACAGAGAAGC | 1.9 | 1.5 | 0.4 | - | 4.5 | 0.5 | - |
| hsa-moR-191-5p | GCCAACGGCTGGACAGCGGG | 0.7 | 1.6 | - | - | 0.3 | - | - |
| hsa-moR-92a-2-5p | ACTCATGCCCATTCATCCC | 0.7 | 1.6 | - | - | - | 0.5 | - |
| hsa-moR-214-5p | AGAGTTGTCATGTGTC | 0.2 | 0.7 | - | - | 1.5 | - | - |
| hsa-moR-27a-5p | CTGTGCCTGGCCTGAGGAGC | 0.2 | 0.3 | 0.4 | - | 1.5 | - | - |
Shown are moRNAs supported with min 10 reads in at least one of the libraries, no mismatches were allowed.
List of 15 most overexpressed moRNAs in hESC lines when compared to HFF-1 foreskin fibroblast by statistical computing using DEseq algorithm.
| name | HS401 reads (rpm) | HS181 reads (rpm) | HFF-1 reads (rpm) | P value | Location |
|---|---|---|---|---|---|
| hsa-moR-367-3p | 226,38 | 250,99 | 0,12 | 1,71E-95 | miR-302 cluster |
| hsa-moR-302a-5p | 32,34 | 34,08 | 0,12 | 9,46E-20 | miR-302 cluster |
| hsa-moR-103a-2-3p | 57,35 | 51,61 | 1,65 | 1,45E-18 | miR-103-2a/b cluster |
| hsa-moR-367-5p | 15,04 | 31,18 | 0 | 6,63E-18 | miR-302 cluster |
| hsa-moR-92a-1-3p | 24,44 | 26,77 | 0,12 | 1,50E-17 | miR-17-92 cluster |
| hsa-moR-421-5p | 29,52 | 27,60 | 0,35 | 1,67E-16 | miR-374b/421 cluster |
| hsa-moR-517a-5p | 21,25 | 15,59 | 0 | 2,76E-15 | C19MC |
| hsa-moR-517c-5p | 21,25 | 15,59 | 0 | 2,76E-15 | C19MC |
| hsa-moR-363-5p | 21,43 | 20,70 | 0,12 | 2,92E-15 | miR-106a-363 cluster |
| hsa-moR-296-5p | 19,37 | 18,77 | 0,24 | 9,68E-14 | miR-296/298 cluster |
| hsa-moR-302b-5p | 13,54 | 10,35 | 0 | 2,18E-12 | miR-302 cluster |
| hsa-moR-1283-2-5p | 9,78 | 10,35 | 0 | 1,50E-11 | C19MC |
| hsa-moR-525-5p | 9,78 | 10,35 | 0 | 1,50E-11 | C19MC |
| hsa-moR-20a-5p | 10,34 | 8,83 | 0 | 2,43E-11 | miR-17-92 cluster |
| hsa-moR-518a-1-5p | 6,96 | 9,52 | 0 | 5,87E-11 | C19MC |
Normalized read count (RPM), P-value and the name of surrounding miRNA cluster (location) are shown for each moRNA. Relative expression analysis was made using reads mapping to the genome allowing 2 mismatches.
Number of predicted miR-103a-3p and moR-103a-2-3p targets in the sets of genes down-regulated by both mimics, down-regulated by only miR-103a-3p and down-regulated by moR-103a-2-3p.
| Gene set | Down by both mimics | Down by miR only | Down by moR only |
|---|---|---|---|
| genes with annotated 3’UTR | 226 | 165 | 158 |
| miR seed match |
|
| 30 |
| moR ‘seed’ match | 29 | 9 |
|
| seed matches for both | 9 | 7 | 7 |
The analysis was made only for genes with an annotated 3’ UTR in GRCh37/hg19.
The ten most significantly down-regulated genes in miR-103a-3p transfection study.
| Rank miR | Gene name | FC miR | FC moR | miR seeds | moR seeds |
|---|---|---|---|---|---|
| 1 | TRIAP1 | -4.8 | ns | 1 | 0 |
| 2 | LY6E | -4.6 | -2.2 | 2 | 0 |
| 3 | ARL2 | -4.1 | ns | 3 | 0 |
| 4 | HCFC1R1 | -4 | -2.1 | 0 | 0 |
| 5 | BCAT2 | -3.7 | ns | 1 | 0 |
| 6 | SMPD1 | -3.6 | ns | 3 | 0 |
| 7 | C7orf55 | -3.6 | -3.5 | 2 | 1 |
| 8 | HIST1H2BD | -3.5 | ns | 0 | 0 |
| 9 | TMEM256 | -3.5 | -4.3 | 1 | 0 |
| 10 | ZNF428 | -3.4 | -2.1 | 1 | 0 |
Fold changes of the genes in both transfection studies, and the number of perfect 7mer seed matches found in their 3’ UTRs for both miR and moR are shown. (ns = expression of the gene is not significantly changed).
The ten most significantly down-regulated genes in moR-103a-2-3p transfection study.
| Rank moR | Gene name | FC miR | FC moR | miR seeds | moR seeds |
|---|---|---|---|---|---|
| 1 | ATPIF1 | -2.1 | -7.8 | 0 | 1 |
| 2 | RPL14 | -2.7 | -6.4 | 2 | 0 |
| 3 | LAGE3 | -3.2 | -6.2 | 0 | 1 |
| 4 | PLAC9 | -3.2 | -5.7 | 1 | 0 |
| 5 | CCL2 | ns | -5.4 | 0 | 0 |
| 6 | ZMAT2 | -2.1 | -5.3 | 1 | 2 |
| 7 | ADIRF | -2.8 | -5.1 | 0 | 0 |
| 8 | POLR2G | -2.2 | -5.1 | 2 | 2 |
| 9 | HSPE1 | -2.1 | -5.0 | 1 | 1 |
| 10 | MOXD1 | ns | -4.8 | 1 | 0 |
Fold changes of the genes in both transfection studies, and the number of perfect 7mer seed matches found in their 3’ UTRs for both miR and moR are shown. (ns = expression of the gene is not significantly changed).
p-values for the top 10 enriched GO terms for genes that were down-regulated with both mimics.
| GO term | Down by both mimics p-value | Down by moR only p-value | Down by miR only p-value |
|---|---|---|---|
| structural constituent of ribosome | 3.9E-34 | 5.5E-10 | 6.0E-5 |
| ribosome | 4.9E-33 | 3.1E-8 | 7.1E-4 |
| ribosomal subunit | 2.9E-32 | 5.9E-8 | 1.5E-4 |
| ribonucleoprotein complex | 1.7E-30 | 4.0E-11 | 2.3E-3 |
| translation | 1.8E-25 | 1.2E-7 | 1.7E-3 |
| mitochondrion | 2.9E-20 | 7.8E-7 | 6.3E-4 |
| mitochondrial part | 2.9E-20 | 7.0E-6 | 1.8E-2 |
| translational elongation | 7.5E-20 | 4.7E-8 | 1.9E-4 |
| cytosolic ribosome | 4.4E-18 | 1.1E-7 | 8.0E-4 |
| mitochondrial membrane part | 1.1E-16 | 4.4E-6 | - |
The p-values for these GO terms are shown also for genes down-regulated only with moR-103a-2-3p mimic, and genes down-regulated only with miR-103a-3p mimic.