| Literature DB >> 21346806 |
Seongho Ryu1, Natasha Joshi, Kevin McDonnell, Jongchan Woo, Hyejin Choi, Dingcheng Gao, William R McCombie, Vivek Mittal.
Abstract
MicroRNAs (miRNAs) are key regulators of gene expression and contribute to a variety of biological processes. Abnormal miRNA expression has been reported in various diseases including pathophysiology of breast cancer, where they regulate protumorigenic processes including vascular invasiveness, estrogen receptor status, chemotherapy resistance, invasion and metastasis. The miRBase sequence database, a public repository for newly discovered miRNAs, has grown rapidly with approximately >10,000 entries to date. Despite this rapid growth, many miRNAs have not yet been validated, and several others are yet to be identified. A lack of a full complement of miRNAs has imposed limitations on recognizing their important roles in cancer, including breast cancer. Using deep sequencing technology, we have identified 189 candidate novel microRNAs in human breast cancer cell lines with diverse tumorigenic potential. We further show that analysis of 500-nucleotide pri-microRNA secondary structure constitutes a reliable method to predict bona fide miRNAs as judged by experimental validation. Candidate novel breast cancer miRNAs with stem lengths of greater than 30 bp resulted in the generation of precursor and mature sequences in vivo. On the other hand, candidates with stem length less than 30 bp were less efficient in producing mature miRNA. This approach may be used to predict which candidate novel miRNA would qualify as bona fide miRNAs from deep sequencing data with approximately 90% accuracy.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21346806 PMCID: PMC3035615 DOI: 10.1371/journal.pone.0016403
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Analysis of deep sequencing data to identify novel miRNA.
(A) Flowchart depicting analysis pipeline. Numbers of sequencing reads at each step of the analysis are shown in parentheses. (B) Size distribution (bp) shown for sequence reads that matched known miRNAs (matched reads) in miRBase and reads that did not match (non-matched reads) for human breast cell lines MCF10A, MCF7 and MDA-231 cells.
Characterization of 189 candidate miRNAs.
| Category | Frequency |
| High confidence candidate miRNAs which has high frequency reads (>50) | 30 |
| Highly similar to known human miRNA (1∼2 mismatched) but located on different chromosome | 7 |
| No match with human miRNAs but highly similar to other species (0∼2 mismatched) | 4 |
| Presence of star sequence | 27 |
| Remaining | 137 |
A list of 30 high frequency novel candidate miRNAs in breast cancer cells.
| NAME | SEQUENCE | length | MCF10A | MCF7 | MDA | MIRBASE | STAR | LOCATION |
|
|
| 27 | 585 | 1347 | 1251 | Novel | No | chr11:13304324–13304433[−] |
|
|
| 18 | 1 | 0 | 1651 | Novel | No | chr9:73809165–73809274[−] |
|
|
| 31 | 257 | 433 | 1007 | Novel | No | chr13:98986564–98986673[−] |
|
|
| 22 | 0 | 0 | 57 | 1 mismatches with hsa-548d-5p | No | chr1: 81947441–81947550[+] |
| hsa-miR-B5 |
| 17 | 639 | 8 | 396 | Novel | No | chr1:1172913–1173022[−] |
|
|
| 22 | 364 | 528 | 215 | perfect match with mouse, dog, cow | Yes(22) | chr15:81221792–81221898[+] |
|
|
| 29 | 468 | 214 | 340 | close to hsa-miR-1303 (chr5) in diff loc | No | chr16:3297692–3297801[−] |
| hsa-miR-B8 |
| 23 | 20 | 28 | 15 | Novel | Yes(14) | chr6:133180097–133180198[+] |
|
|
| 20 | 169 | 111 | 157 | close to cow (bta-mir-2476) | No | chrY:586300–586319[+] |
|
|
| 25 | 209 | 46 | 92 | 1 mismatch with mmu-miR-1959 | No | chr6:28834104–28834213[−] |
| hsa-miR-B11 |
| 22 | 1 | 15 | 14 | Novel | Yes(5) | chr20:49502834–49502934[−] |
| hsa-miR-B12 |
| 17 | 0 | 0 | 78 | Novel | No | chr8:107080026–107080135[+] |
| hsa-miR-B13 |
| 20 | 49 | 29 | 206 | Novel | No | chr17:8031210–8031319[+] |
| hsa-miR-B14 |
| 25 | 153 | 10 | 29 | Novel | No | chr1:153915582–153915691[−] |
|
|
| 22 | 0 | 0 | 80 | Novel | No | chr20:43767137–43767246[−] |
| hsa-miR-B16 |
| 20 | 32 | 2 | 28 | Novel | No | chr22:40334774–40334883[−] |
| hsa-miR-B17 |
| 30 | 180 | 16 | 36 | Novel | No | chr17:38359009–38359118[−] |
| hsa-miR-B18 |
| 19 | 0 | 0 | 26 | Novel | chr7: 91108780–91108889[+] | |
|
|
| 20 | 8 | 0 | 133 | Novel | No | chr11:121527997–121528099[−] |
| hsa-miR-B20 |
| 17 | 19 | 22 | 125 | Novel | No | chr1:202811178–202811287[−] |
|
|
| 21 | 1 | 0 | 88 | close to mouse (mmu-miR-669) | No | chr8:129122276–129122385[+] |
| hsa-miR-B22 |
| 18 | 114 | 1 | 11 | Novel | No | chr7:73108306–73108415[−] |
|
|
| 17 | 0 | 0 | 48 | Novel | No | chr1:33878065–33878174[+] |
| hsa-miR-B24 |
| 26 | 29 | 12 | 51 | Novel | No | chr5:105917028–105917137[+] |
| hsa-miR-B25 |
| 20 | 33 | 42 | 36 | Novel | No | chr19:764562–764671[+] |
| hsa-miR-B26 |
| 18 | 59 | 21 | 3 | one mismatch with hsa-miR-1260 | No | chr11:95714237–95714346[+] |
|
|
| 22 | 8 | 0 | 50 | Novel | Yes(3) | chr11:121532093–121532199[−] |
|
|
| 22 | 68 | 21 | 8 | close to cow (bta-mir-2355) | Yes(14) | chr2:207682944–207683054[−] |
| hsa-miR-B29 |
| 23 | 7 | 11 | 42 | Novel | No | chr1:28778841–28778945[−] |
| hsa-miR-B30 |
| 23 | 13 | 47 | 15 | Novel | No | chr22:29457597–29457619[+] |
Candidate miRNAs in bold were used to evaluate miRNA expression using Northern blotting.
Figure 2Validation of candidate miRNAs by northern blot analysis and secondary structure prediction of precursor sequences.
(A) 20µg of RNA (enriched for small RNA fraction) was isolated from MDA-MB-231 cells and fractionated on a 7% polyacrylamide gel and hybridized to anti-sense oligo probes corresponding to miRNAs indicated above to detect processed 70-nt precursor and a 22-nt mature miRNA. In some cases, 500nt- precursors were cloned in lentiviral vectors to generate 293T cells stably expressing precursor miRNAs. RNA harvested from the transfected cells was used for northern blot analysis. A highly abundant known miRNA, hsa-miR-21 was used as a control. Decade marker was used as a size marker for small RNAs. (B) Secondary structures were predicted by RNA fold program using either a ∼75-nt precursor sequence or the 500-nt precursor sequence. The stem and loop structure of pre-miRNA are boxed. The location of mature miRNA sequence is denoted in red.
Figure 3Distribution of the length of stem in known and unknown miRNA sequence reads.
The pri-miRNA secondary structures in both known miRNAs and candidate miRNAs were predicted by RNAfold program. The secondary structures of pre-miRNA were located and extracted to measure the length of stem in both known miRNAs (A) and candidate miRNAs (B). Blue and red bars represent the distribution of longer stem length (cutoff >30) and short stem length, respectively. Read frequencies with stem length of >30 or <30 are indicated.
Figure 4Northern blot and Taqman QPCR analysis showing that novel candidate miRNAs with stem lengths of ≥30bp may comprise bona fide miRNAs.
(A) The expression of miRNA was examined by northern blotting using end-labeled oligonucleotide probes. A total of 10 candidate miRNAs were selected from Table 2 and their 500-nt precursors cloned in plasmids and overexpressed in 293T cells to detect processed 70-nt pre-miRNA and a mature miRNA. The numbers in parentheses represent the read frequency in MDA-MB-231 cells obtained from deep sequencing. The numbers at the bottom indicate the length of stem in individual candidate miRNAs. A synthetic 32P-labeled RNA marker was used as a size marker. Asterisk indicate miRNAs selected for QPCR validation. Taqman miRNA assay showed expression of candidate miRNAs from each group by Ct values (B) and fold change (C). RNU48 was used as positive control and no-template control (NTC) used as negative control. (D) Box plot graph showing correlation between validated miRNA and non-validated miRNA with their stem length. Mann-Whitney test showing that the two groups are statistically different (p<0.001).