| Literature DB >> 22306111 |
Matthias Hackl1, Vaibhav Jadhav, Tobias Jakobi, Oliver Rupp, Karina Brinkrolf, Alexander Goesmann, Alfred Pühler, Thomas Noll, Nicole Borth, Johannes Grillari.
Abstract
MicroRNAs (miRNAs) have recently entered Chinese hamster ovary (CHO) cell culture technology, due to their severe impact on the regulation of cellular phenotypes. Applications of miRNAs that are envisioned range from biomarkers of favorable phenotypes to cell engineering targets. These applications, however, require a profound knowledge of miRNA sequences and their genomic organization, which exceeds the currently available information of ~400 conserved mature CHO miRNA sequences. Based on these recently published sequences and two independent CHO-K1 genome assemblies, this publication describes the computational identification of CHO miRNA genomic loci. Using BLAST alignment, 415 previously reported CHO miRNAs were mapped to the reference genomes, and subsequently assigned to a distinct genomic miRNA locus. Sequences of the respective precursor-miRNAs were extracted from both reference genomes, folded in silico to verify correct structures and cross-compared. In the end, 212 genomic loci and pre-miRNA sequences representing 319 expressed mature miRNAs (approximately 50% of miRNAs represented matching pairs of 5' and 3' miRNAs) were submitted to the miRBase miRNA repository. As a proof-of-principle for the usability of the published genomic loci, four likely polycistronic miRNA cluster were chosen for PCR amplification using CHO-K1 and DHFR (-) genomic DNA. Overall, these data on the genomic context of miRNA expression in CHO will simplify the development of tools employing stable overexpression or deletion of miRNAs, allow the identification of miRNA promoters and improve detection methods such as microarrays. Copyright ÂEntities:
Mesh:
Substances:
Year: 2012 PMID: 22306111 PMCID: PMC3314935 DOI: 10.1016/j.jbiotec.2012.01.019
Source DB: PubMed Journal: J Biotechnol ISSN: 0168-1656 Impact factor: 3.307
Fig. 1Strategy for identification of CHO pre-miR sequences from genomic references. (a) Schematic outline of identification strategy. (b) Flow chart illustrating the sequence identification strategy in detail: all currently published CHO mature miRNA sequences were BLAST-aligned to two independent CHO-K1 genomic reference sequence assemblies (K1-P and K1-BB). BLAST results were filtered for alignments with zero mismatches (100% identity) and alignment lengths equal to the mature miRNA length (100% length). Additionally, miRNAs mapping to genomic repeat regions were removed. From the remaining genomic loci, the respective pre-miRNA sequences were extracted independently from both genomic references and cross-checked.
Genome references for identification of CHO pre-miR sequences.
| K1-P | K1-BB | |
|---|---|---|
| Genome size (Gbp) | 2.40 | 2.98 |
| Contigs | 109,151 | 11,400,490 |
| Average contig length | 21,986 | 261 |
| Median contig length | 503 | 124.5 |
| 95 | 17.1 |
Fig. 2BLAST alignment of mature miRNAs to two different genomic reference sequences. (a) Size-adjusted Venn diagram indicating that 318 mature miRNAs were aligned to both reference genomes, while 35 and 12 mature miRNAs could only be aligned to K1-P or K1-BB, respectively. (b) The cumulative fraction of BLAST-aligned miRNAs is plotted against the number of perfect genomic matches identified; for each miRNA; 16% and 28% of miRNAs could be perfectly aligned to two or more genomic locations in the K1-P (black) or K1-BB (gray) genomic reference sequence.
Fig. 3Characterization of CHO pre-miRNA sequences. (a) Scheme representing the strategy for pre-miRNA sequence extraction from a genomic locus mapped by either one or two mature miRNAs: (i) a buffer of 10 bases up- and downstream the mature miRNAs was taken in case both hairpin-arms were mapped. (ii) and (iii) For genomic positions aligned by a single miRNA a total pre-miRNA of 100 bases was extracted, starting 10 bases upstream a 5′ miRNA match or 10 bases downstream a 3′ match. (b) Distribution of CHO miRNA loci identified by alignment of either 5′ or 3′ mature miRNAs or both is shown. Venn overlap of miRNA genomic loci as identified independently in each CHO-K1 genomic reference sequence. (c) For pre-miRNA genomic loci mapped at both the 5′ and 3′ miRNA hairpin-arm, length of the pre-miRNA was calculated as the distance between the start of the 5′ miRNA alignment and the end of the 3′ miRNA alignment. Cumulative fraction of pre-miRNAs is plotted against the pre-miRNA length, showing that for most pre-miRs length ranged between 50 and 70 bases.
miR-17-92 pre-miRNA sequences.
| >cgr-mir-17_scaffold_gi|344163086|gb|JH001979.1|_REV |
| AGGATAATGTcaaagtgcttacagtgcaggtagTGATATGCACATCTactgcagtgcaggcacttgtggCATTATGGT |
| >cgr-mir-18a_scaffold_gi|344163086|gb|JH001979.1|_REV |
| CTTTTTGTTCtaaggtgcatctagtgcagatagTGAAGTAGACTAGCATCTactgccctaagtgctccttctggCATAAGAAG |
| >cgr-mir-19a_scaffold_gi|344163086|gb|JH001979.1|_REV |
| GCAGCCCTCTGTTAGTTTTGCATACTTGCACTACAAGAAGAATGCAGTtgtgcaaatctatgcaaaactgaTGGTGGCCT |
| >cgr-mir-19b_scaffold_gi|344163086|gb|JH001979.1|_REV |
| GTCTATGGTTagttttgcaggtttgcatccagcTGTATAATACTCTGCtgtgcaaatccatgcaaaactgaCTGTGGTGG |
| >cgr-mir-20a_scaffold_gi|344163086|gb|JH001979.1|_REV |
| TCTGTGGCACtaaagtgcttatagtgcaggtagTGTCCACTCATCTACTGCATTACGAGCACTTCCAGTGCTGCCAGCTGGAGAGCCCCAGCCTCGCTCG |
| >cgr-mir-92a_scaffold_gi|344163086|gb|JH001979.1|_REV |
| CTTTCTACACaggctgggatttgttgcaatgctGTGTTTCTCGATGGtattgcacttgtcccggcctgtTGAGTTTGG |
Lower case letters indicate mature miRNAs; upper case letters indicate 5′ and 3′ flanking regions as well as loop regions.
Fig. 4Sequence characterization of CHO pre-miRNAs. (a) Conservation CHO (cgr) mir-17-92 pre-miRNAs in respect to Mus musculus (mmu); *, sequence matches; - sequence deletions. (b) PCR amplification of miRNA clusters: PCR amplification of miRNA clusters using genomic DNA from CHO-K1 and CHO dhfr (-) cells. Lanes 1–4 and 6–9 showing specific amplification for miR-24-23a (1/6), miR-17-92a, miR-221-222 and miR-24-23b. Lanes 5 and 10 no template control PCR.
Number of aligned miRNAs, unique genomic loci and precursor-miRNA sequences.
| K1-P | K1-BB | |
|---|---|---|
| miRNAs mapped to genome (100% ID, 100% length) | 353 | 330 |
| miRNAs mapped to genomic repeat regions | 14 | 15 |
| miRNAs used for identification of genomic loci and pre-miRNAs | 339 | 315 |
| High confidence genomic miRNA loci | 206 | 196 |
| pre-miRNA sequences submitted to miRBase | 206 | 6 |
After removal of loci that give rise to incorrectly folded pre-miRs.
In total 212 pre-miRNA sequences submitted to miRBase.