| Literature DB >> 20224791 |
Chad J Creighton1, Ashley L Benham, Huifeng Zhu, Mahjabeen F Khan, Jeffrey G Reid, Ankur K Nagaraja, Michael D Fountain, Olivia Dziadek, Derek Han, Lang Ma, Jong Kim, Shannon M Hawkins, Matthew L Anderson, Martin M Matzuk, Preethi H Gunaratne.
Abstract
MicroRNAs (miRNAs) are small non-coding RNAs that mediate post-transcriptional gene silencing. Over 700 human miRNAs have currently been identified, many of which are mutated or de-regulated in diseases. Here we report the identification of novel miRNAs through deep sequencing the small RNAome (<30 nt) of over 100 tissues or cell lines derived from human female reproductive organs in both normal and disease states. These specimens include ovarian epithelium and ovarian cancer, endometrium and endometriomas, and uterine myometrium and uterine smooth muscle tumors. Sequence reads not aligning with known miRNAs were each mapped to the genome to extract flanking sequences. These extended sequence regions were folded in silico to identify RNA hairpins. Sequences demonstrating the ability to form a stem loop structure with low minimum free energy (<-25 kcal) and predicted Drosha and Dicer cut sites yielding a mature miRNA sequence matching the actual sequence were considered putative novel miRNAs. Additional confidence was achieved when putative novel hairpins assembled a collection of sequences highly similar to the putative mature miRNA but with heterogeneous 3'-ends. A confirmed novel miRNA fulfilled these criteria and had its "star" sequence in our collection. We found 7 distinct confirmed novel miRNAs, and 51 additional novel miRNAs that represented highly confident predictions but without detectable star sequences. Our novel miRNAs were detectable in multiple samples, but expressed at low levels and not specific to any one tissue or cell type. To date, this study represents the largest set of samples analyzed together to identify novel miRNAs.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20224791 PMCID: PMC2835764 DOI: 10.1371/journal.pone.0009637
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Reproductive tissues profiled.
| Description | Number of samples (103 total) |
| Normal ovarian surface epithelium (NOSE), cell culture | 4 |
| Primary ovarian cancer, serous, malignant | 8 |
| Primary cancer, ovarian, serous, borderline malignant | 1 |
| Primary ovarian cancer, endometrioid, malignant | 4 |
| Ovarian cancer cell culture, clear cell | 12 |
| Ovarian cancer cell culture, serous | 4 |
| Endometrioma | 10 |
| Endometrium, non-endometriosis | 10 |
| Endometrium, endometriosis | 3 |
| Uterine leiomyoma | 18 |
| Uterine myometrium | 18 |
| Uterine leiomyosarcoma | 9 |
| Uterine leiomyosarcoma, cell culture | 2 |
Figure 1Flow chart of the mapping of deep sequence reads to novel miRNAs.
Novel microRNAs identified.
| bcm-miR | UCSC Blat | Sequence | Position (st = strand) | Samples detected |
|
| ||||
| 8 | Intergenic |
| chr2:207682912–207683088 st− | 12 |
| 21-1 | Intron BC058547 |
| chr9:136881792–136881939 st− | 2 |
| 39 | LINE |
| chr6:120378000–120378122 st+ | 5 |
| 93 | Intergenic |
| chr3:48332858–48332955 st− | 1 |
| 132 | Intron OSBP2 |
| chr22:29457541–29457630 st+ | 7 |
| 190 | Intergenic |
| chrX:149146896–149146977 st+ | 4 |
| 244 | Intron EDA-A2 |
| chrX:69159432–69159498 st+ | 16 |
|
| ||||
| 3 | Intergenic |
| chr17:20107032–20107214 st− | 2 |
| 15-1 | Intergenic |
| chr16:15156208–15156360 st− | 6 |
| 15-2 | Intergenic |
| chr16:2125979–2126131 st− | 6 |
| 15-3 | Intergenic |
| chr16:16311259–16311411 st+ | 6 |
| 19 | Intron CRYGN |
| chr7:150761508–150761658 st− | 1 |
| 21-2 | Intron BC058547 |
| chr9:136881157–136881228 st− | 1 |
| 22 | Intron PHC2 |
| chr1:33570591–33570735 st− | 16 |
| 35 | Intergenic |
| chr12:122586909–122587034 st+ | 1 |
| 43 | Intergenic |
| chr22:34061633–34061751 st+ | 5 |
| 46 | CDH2 |
| chr15:91248611–91248727 st+ | 10 |
| 48 | Intergenic |
| chr5:92982149–92982263 st− | 1 |
| 53-1 | ROR2/LINE |
| chr9:93438354–93438464 st+ | 14 |
| 53-2 | LINE |
| chr9:93438367–93438448 st− | 14 |
| 60 | Intergenic |
| chr9:129492787–129492895 st− | 2 |
| 70 | Intergenic |
| chr5:170746265–170746369 st− | 2 |
| 72 | Intergenic |
| chrX:146139351–146139453 st− | 4 |
| 75-1 | Intergenic |
| chr12:68264769–68264870 st− | 4 |
| 92-1 | Intron WBSCR17 − |
| chr7:70410594–70410692 st− | 1 |
| 92-2 | Intron WBSCR17 + |
| chr7:70410596–70410690 st+ | 1 |
| 94 | Intergenic |
| chr2:69184307–69184404 st+ | 3 |
| 99 | Intron DMD |
| chrX:32511694–32511790 st− | 1 |
| 108 | Intergenic |
| chr11:61339205–61339300 st− | 1 |
| 111 | Intergenic |
| chr4:153629927–153630020 st− | 4 |
| 113 | Intergenic |
| chr1:245431892–245431985 st− | 1 |
| 118 | Intergenic |
| chr11:126363560–126363652 st− | 3 |
| 120 | Intergenic |
| chr1:26105440–26105532 st− | 7 |
| 122 | Intergenic |
| chr6:159105681–159105773 st− | 1 |
| 127 | Intergenic |
| chr8:78041555–78041645 st− | 1 |
| 135 | Intron SCHIP1 |
| chr3:160483129–160483217 st+ | 2 |
| 142-1 | Intron RP11-529I10.4 + |
| chr10:103351160–103351248 st+ | 23 |
| 142-2 | Intron RP11-529I10.4 − |
| chr10:103351160–103351248 st− | 23 |
| 150 | SINE |
| chr6:155216181–155216267 st+ | 7 |
| 158 | Intergenic |
| chr8:96154315–96154400 st− | 1 |
| 159 | Intron TRPC6 |
| chr11:100895761–100895846 st− | 1 |
| 160 | LINE |
| chr3:101165848–101165932 st− | 1 |
| 164 | Intergenic |
| chr16:14902866–14902949 st+ | 4 |
| 167 | Intergenic |
| chr12:103509541–103509624 st+ | 1 |
| 171 | LTR |
| chr3:79639727–79639809 st+ | 1 |
| 172 | Intergenic |
| chr1:62317042–62317124 st− | 8 |
| 189 | Intergenic |
| chr14:27172246–27172327 st− | 2 |
| 191 | Intergenic |
| chr10:58734245–58734325 st− | 1 |
| 192 | Simple repeat |
| chr17:72597094–72597174 st+ | 3 |
| 203 | Intergenic |
| chr4:9689335–9689412 st− | 5 |
| 204 | Intron AK094607 |
| chr1:98283407–98283484 st− | 17 |
| 210-1 | Intron FASTKD2 + |
| chr2:207356202–207356278 st+ | 14 |
| 210-2 | Intron FASTKD2 − |
| chr2:207356203–207356277 st− | 14 |
| 212 | Intergenic |
| chr6:36698191–36698267 st− | 1 |
| 215 | Intron COL5a2 |
| chr2:189706007–189706082 st− | 9 |
| 219 | Intergenic |
| chr9:18563304–18563377 st+ | 1 |
| 223 | Intergenic |
| chr2:134601163–134601236 st+ | 1 |
| 226-1 | Intron LONRF1 − |
| chr8:12629112–12629184 st− | 1 |
| 226-2 | Intron LONRF1 + |
| chr8:12629117–12629179 st+ | 1 |
| 230 | Intergenic |
| chr17:77032732–77032802 st− | 1 |
| 232 | LTR |
| chr9:111313576–111313646 st− | 2 |
| 245 | Intergenic |
| chr1:247087199–247087265 st+ | 5 |
| 261 | Intron ACBD6 |
| chr1:178674081–178674139 st− | 4 |
| 262 | ncRNA |
| chrX:153650065–153650123 st+ | 26 |
| 265 | Intergenic |
| chr22:29886048–29886105 st− | 49 |
| 268 | SINE |
| chr18:31768049–31768103 st− | 3 |
Figure 2Predicted secondary structures of the precursor hairpins for the seven confirmed novel miRNAs.
Mfe, predicted mean free energy level of the hairpin. Mature miRNA and miRNA star sequences—both detected by deep sequencing—are denoted (red and blue, respectively), as well as predicted Drosha cut points (red arrows). Asterisks indicate the small 3′ overhangs which are known to be typical for microRNAs.
Figure 3Relative abundance levels of novel miRNAs across samples.
(A) Total number of sequence reads (summed across all samples) for both known miRNAs (404 in all) and novel miRNAs (note the log base 10 scale). High confidence, novel miRNA was found to form part of a miRNA-like hairpin, yet the hairpin star sequence was not detected by the deep sequencing (51 in all); Confirmed, novel miRNA formed part of a miRNA-like hairpin, with the star sequence also being detected (seven in all). (B) Heat map representing the number of sequence reads mapping to each novel miRNA hairpin across the samples. (C) Detection of hsa-bcm-miR-15 by qPCR. RQ, Relative Quantification (HEK293 sample as reference). hsa-bcm-miR-15 was detected at ∼300 sequence reads in OVTOKO cells and ∼100 reads in HEK293.