| Literature DB >> 23171001 |
Jungeun Kim1, June Hyun Park, Chan Ju Lim, Jae Yun Lim, Jee-Youn Ryu, Bong-Woo Lee, Jae-Pil Choi, Woong Bom Kim, Ha Yeon Lee, Yourim Choi, Donghyun Kim, Cheol-Goo Hur, Sukweon Kim, Yoo-Sun Noh, Chanseok Shin, Suk-Yoon Kwon.
Abstract
BACKGROUND: Roses (Rosa sp.), which belong to the family Rosaceae, are the most economically important ornamental plants--making up 30% of the floriculture market. However, given high demand for roses, rose breeding programs are limited in molecular resources which can greatly enhance and speed breeding efforts. A better understanding of important genes that contribute to important floral development and desired phenotypes will lead to improved rose cultivars. For this study, we analyzed rose miRNAs and the rose flower transcriptome in order to generate a database to expound upon current knowledge regarding regulation of important floral characteristics. A rose genetic database will enable comprehensive analysis of gene expression and regulation via miRNA among different Rosa cultivars.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23171001 PMCID: PMC3527192 DOI: 10.1186/1471-2164-13-657
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of sequencing and assembly results for rose flowers
| Read No. | 151,906 | 132,974 | 107,816 | 115,707 |
| Read length (bp) | 34,317,306 | 29,145,021 | 23,643,473 | 26,107,242 |
| Avg. read length (bp) | 225.91 | 219.18 | 219.30 | 225.63 |
| Read No. | 142,421 | 116,397 | 95,875 | 109,067 |
| | (93.76 %) | (87.53 %) | (88.92 %) | (94.26 %) |
| Read length (bp) | 33,112,590 | 27,277,537 | 22,306,881 | 25,260,565 |
| Avg. read length (bp) | 232.50 | 234.35 | 232.67 | 231.61 |
| Clustered reads | 120,931 | 97,250 | 73,768 | 84,281 |
| Clustered range | 2-4,792 | 2-3,726 | 2-1,353 | 2-980 |
| Contigs (assembled by TGICL a) | 14,167 | 12,287 | 12,618 | 15,366 |
| Singlets b | 3,501 | 3,261 | 3,406 | 3,814 |
| Singlets c | 17,989 | 15,886 | 18,701 | 20,542 |
| Avg. contigs length (bp) | 397.60 | 441.52 | 416.45 | 369.33 |
a TGICL: TIGR Gene Index CLustering tools, b reads that are clustered but not assembled into contigs, c un-clustered reads by TGICL.
Abbreviations: V: ‘Vital’; M: ‘Maroussia’; S: ‘Sympathy’; H: ‘Haedang’.
Figure 1Scheme for construction of the Rose database. The schematic presentation shows the strategy of transcriptome and miRNA analysis. Transcript reads were assembled by TGICL [32] after pre-processing with SeqClean. Assembled sequences were annotated by searching homologs against non-redundant (NR) database and Arabidopsis (ver. 10). Domains and Gene Ontology (GO) were analyzed by Pfam. KEGG metabolic pathways and plant ontology (PO) were assigned by referring the description of top matched Arabidopsis genes. miRNAs were identified by applying modified miRDeep [24,29]. To search for known miRNAs, sRNA sequences were BLAST searched against currently known miRNAs in miRBase [86] databases.
Gene and unigene annotations for rose and related species
| 35,657 | 31,434 | 34,725 | 39,722 | 63,541 | 34,809 | 28,702 | 26,346 | |
| 18,414 | 19,214 | 20,264 | 20,349 | 51,525 | 24,764 | 26,163 | 26,346 | |
| 9,062 | 9,425 | 9,395 | 9,816 | | | | | |
| 9,352 | 9,699 | 10,869 | 10,533 | | | | | |
| 10,646 | 11,353 | 11,642 | 11,265 | 72,220 | 36,042 | 37,433 | 32,566 | |
| 2,262 | 2,315 | 2,383 | 2,369 | 3,311 | 3,333 | 3,297 | 3,305 | |
a The number of gene or unigene annotated by BLAST search against NR database with expect value (e-value) lower than 1e-10.
b Number of domain identified by implementing Pfam. Abbreviations: V: ‘Vital’; M: ‘Maroussia’; S: ‘Sympathy’; H: ‘Haedang’.
Figure 2Comparative analysis of gene families with genome sequenced Rosaceae (apple, strawberry, and peach) and grape. Distribution of orthologous and paralogous gene families, including Rosa and apple, strawberry, peach and grape, were analyzed by OrthoMCL [34]. A. A total of 294,936 sequences from the eight different organisms were clustered into 242,234 (82.13%) families and singletons using OrthoMCL. B. Numbers of families and genes (in parenthesis) presented in each organism combination are given in the individual sections. 4,881 families (with 75,314 genes) were common to all species analyzed in this study, 592 families (with 7,451 genes) were shared in Rosaceae group. 6,808 were clustered in rose specific group with a total of 1,484 families. The species-specific gene families denote genes clustered within singe organisms.
miRNA distribution
| Total sRNA No. | 7,440,941 | 11,814,817 | 9,210,732 | 11,368,773 | |
| Trimmed sRNAs | 7,339,459 | 11,646,177 | 9,049,517 | 11,201,170 | |
| Unique sRNAs | 2,231,008 | 2,520,957 | 2,590,637 | 2,955,175 | |
| | |||||
| tRNAs | 12,638 | 14,647 | 11,942 | 12,925 | |
| rRNAs | 50,020 | 70,718 | 49,317 | 56,084 | |
| other RNAs | 1,552 | 2,123 | 1,774 | 1,518 | |
| Masking | 2,166,798 | 2,433,469 | 2,527,604 | 2,884,648 | |
| 18nt to 26nt unique sRNAs after pre-processing | 2,097,062 | 2,338,032 | 2,439,306 | 2,824,442 | |
| 18nt to 26nt total sRNAs after pre-processing | 5,400,748 | 6,498,879 | 6,456,832 | 6,930,547 | |
| | | | | ||
| sRNA align to strawberry genome | 754,825 | 919,179 | 815,779 | 600,561 | |
| Total predicted miRNAs (Unique) | 183 (122) | 186 (125) | 186 (125) | 179 (120) | 192 (130) |
| Conserved miRNAs in miRBase (Unique) | 155 (101) | 155 (102) | 158 (104) | 154 (100) | 159 (105) |
| Novel miRNAs (Unique) | 28 (21) | 31 (23) | 28 (21) | 25 (20) | 33(25) |
| Found by homology search with miRBase a | 136 | 137 | 137 | 128 | 137 |
| Total miRNA groups (Major) | 78 (40) | 82 (42) | 80 (40) | 75 (41) | 84 (53) |
| Conserved miRNA groups (Major) | 63 (27) | 65 (28) | 65 (28) | 61 (28) | 65 (37) |
| Novel miRNA groups (Major) | 15 (13) | 17 (14) | 15 (12) | 14 (13) | 19 (16) |
a The number of miRNAs searched by BLAST against known miRNAs in the miRBase [86]. Abbreviations: V: ‘Vital’; M: ‘Maroussia’; S: ‘Sympathy’; H: ‘Haedang’.
Figure 3miRNA length and first nucleotide distribution Highly abundant sRNA tags were identified/filtered according to their read length and first nucleotide. Canonical miRNAs were 21 nt in length and began with a Uridine (U) [40].
Figure 4Venn diagram showing unique and shared miRNA families. A. Distribution of miRNA tags in the four rose cultivars (Vital, Maroussia, Sympathy, and Haedang). B. Distribution of miRNA tags in Rosaceae plants. The numbers of miRNA tags present in each organism combination were given in the individual sections. Venn diagram represents conservation and/or variation of miRNA tags in rose flowers by the presence or absence of miRNA tags among the four rose cultivars (A) or among Rosaceae species (B).
Representative targets of conserved miRNA in four roses
| miR156a c | UGACAGAAGAGAGUGAGCUC | 1,393 | 1,861 | 2,755 | 3,616 | H_CL12502Contig1 d | 2 | squamosa promoter-binding-like protein |
| M_CL3280Contig1 d | 2 | squamosa promoter-binding-like protein | ||||||
| S_FJZMSUQ02I0U8Q d | 2 | squamosa promoter-binding-like protein | ||||||
| | | | | | | V_CL1436Contig1 | 2 | squamosa promoter-binding-like protein |
| miR159 c | UUUGGAUUGAAGGGAGCUCUA | 327,436 | 271,208 | 264,412 | 187,579 | H_F09TDQC01DKJVB | 4 | U-box domain-containing protein |
| M_FJZMSUQ02JDPLW d | 3.5 | r2r3-myb transcription factor | ||||||
| S_FJZMSUQ02FP8OE | 4 | U-box domain-containing protein | ||||||
| | | | | | | V_F1XUE2F01EH5K8 d | 3.5 | r2r3-myb transcription factor |
| miR160 c | UGCCUGGCUCCCUGUAUGCCA | 682 | 739 | 1,739 | 783 | H_F1XUE2F01ECCC8 | 0.5 | putative auxin response factor ARF16 |
| M_FJZMSUQ02HS2DI | 0.5 | putative auxin response factor ARF16 | ||||||
| | | | | | | S_FJZMSUQ02JJQL0 d | 0.5 | putative auxin response factor ARF16 |
| miR164b c | UGGAGAAGCAGGGCACGUGCA | 1,618 | 1,166 | 753 | 735 | H_F1XUE2F01CXKY2 d | 3 | NAC domain protein |
| M_CL5581Contig1 | 2.5 | NAC domain protein | ||||||
| | | | | | | V_CL3217Contig1 | 2.5 | NAC domain protein |
| miR167c c | UGAAGCUGCCAGCAUGAUCUC | 19,155 | 21,149 | 28,997 | 17,491 | H_CL5998Contig1 | 4 | ARF domain class transcription factor |
| M_CL367Contig1 | 4 | ARF domain class transcription factor | ||||||
| S_CL788Contig1 | 4 | ARF domain class transcription factor | ||||||
| | | | | | | V_F1XUE2F01AFQWV | 4 | ARF domain class transcription factor |
| miR168 c | UCGCUUGGUGCAGGUCGGGAA | 40 | 145 | 33 | 98 | H_F09TDQC01CTD5O | 3.5 | argonaute protein group |
| | | | | | | M_CL4618Contig1 | 0.5 | argonaute protein group |
| miR172 c | GGAAUCUUGAUGAUGCUGCAG | 2,590 | 1,369 | 1,728 | 720 | H_CL8041Contig1 | 1 | AP2 domain class transcription factor |
| S_CL1813Contig1 d | 1 | AP2 domain class transcription factor | ||||||
| | | | | | | V_CL10861Contig1 | 1 | AP2 domain class transcription factor |
| miR319a c | UUGGACUGAAGGGAGCUCCC | 392 | 1,117 | 630 | 518 | H_F1XUE2F01EUODN | 4 | TCP domain class transcription factor |
| M_CL839Contig1 | 3.5 | TCP domain class transcription factor | ||||||
| S_CL7921Contig1 | 4 | TCP domain class transcription factor | ||||||
| | | | | | | V_F1XUE2F01EH5K8 | 3 | r2r3-myb transcription factor |
| miR394 c | UUGGCAUUCUGUCCACCUCC | 227 | 2,251 | 732 | 310 | H_F1XUE2F01DV50Z d | 1 | F-box family protein |
| M_CL9289Contig1 | 1 | F-box family protein | ||||||
| S_CL298Contig1 | 3.5 | conserved hypothetical protein | ||||||
| | | | | | | V_CL3647Contig1 | 3.5 | conserved hypothetical protein |
| miR396 c | UUCCACAGCUUUCUUGAACUG | 10,509 | 20,749 | 9,975 | 6,184 | M_CL458Contig1 | 4 | 3-dehydroquinate dehydrogenase |
| S_FJZMSUQ02FG68V | 3.5 | ATP binding protein | ||||||
| | | | | | | V_CL7932Contig1 | 3 | uncharacterized protein |
| miR482b c | UCUUUCCUAUUCCUCCCAUCCC | 2,150 | 5,734 | 4,173 | 6,112 | H_F09TDQC01C8RL9 | 4 | TIR-NBS-LRR resistance protein |
| S_FJZMSUQ02IMY8Q | 4 | TIR-NBS-LRR resistance protein | ||||||
| | | | | | | V_F1XUE2F01BNZWC | 3 | TIR-NBS-LRR resistance protein |
| miR530a | UGCAUUUGCACCUGCACCUCU | 231 | 205 | 88 | 237 | H_CL335Contig4 | 2.5 | BZIP domain class transcription factor |
| M_CL83Contig1 | 2.5 | BZIP domain class transcription factor | ||||||
| S_CL3168Contig1 | 2.5 | BZIP domain class transcription factor | ||||||
| | | | | | | V_CL1619Contig1 | 2.5 | BZIP domain class transcription factor |
| miR894 | CGUUUCACGUCAGGUUCACCA | 494 | 325 | 736 | 239 | H_CL672Contig1 | 3 | E3 ubiquitin-protein ligase |
| M_CL2017Contig1 | 3 | E3 ubiquitin-protein ligase | ||||||
| | | | | | | V_CL3434Contig1 | 3 | E3 ubiquitin-protein ligase |
| Ng3 e | UCUAAGAAACAUUCCUUGAUG | 186 | 391 | 155 | 77 | S_FJZMSUQ02GRX36 | 2 | Probable disease resistance protein |
| S_CL7967Contig1 | 2 | Probable disease resistance protein | ||||||
| | | | | | | V_CL9334Contig1 | 4 | Probable disease resistance protein |
| Ng11 e | GUGGAGUUCUGGGAAAGAAG | 51 | 8 | 2 | 7 | H_CL10098Contig1 | 3.5 | Matrix metalloprotease 1 |
| M_CL1633Contig2 | 4 | Acyltransferase-like protein | ||||||
| S_CL7901Contig1 | 3.5 | Metalloendoproteinase 1-like |
a numbers denote normalized expression value in each Rosa. b A single capital letter in front of the UniGene ID represent the abbreviations of Rosa. For example, “M_CL3280Contig1” is unigene in ‘Maroussia’. All abbreviations of tag (V, M, S, H) are ‘Vital’, ‘Maroussia’, ‘Sympathy’, and ‘Haedang’, respectively. c miRNAs identified in strawberry libraries which downloaded from GEO (Accession No. GSE34813). d target genes successfully validated using 5’RACE (Figure 5). e Novel miRNAs identified in this study.
Figure 5Validation of miRNA-directed cleavage using 5’ RLM-RACE. A-F. 5’ RLM-RACE PCR products terminating at a given position indicated above the each miRNA-target duplex with the frequency of clones.
miRNAs putatively targeting flavonoid biosynthetic pathway
| miR171 | UUUUUCUGAUUGAGCCGUGCC | 17 | 19 | 6 | S_CL1522Contig1 | 5.5 | Anthocyanin synthase (ANS) | |
| miR858b | UUCGUUGUCUGUUCGACCUGA | 39 | 39 | 19 | M_CL545Contig1 | 4 | HTH_MYB | |
| miR535 | UGACGAUGAGAGAGAGCACGC | 30,364 | 24,750 | S_CL160Contig1 | 6 | Flavone synthase (FNS) | ||
| miR166i | UGAAUGUCGUCUGGUUCGAAA | 68 | 51 | S_CL5764Contig1 | 6 | Flavonol synthase (FLS) | ||
| miR477b | ACUCUCCCUCAAGAGCUUCUAG | 12 | 50 | H_CL379Contig1 | 6 | Dihydroflavonol-4-reductase (DFR) | ||
| miR159e | UUUGGCUUGAAGGGAGCUCUA | 78 | 53 | 51 | V_F1XUE2F01CHQVJ | 6 | Cytochrome P450 | |
| | | | | | | S_CL2652Contig1 | 6 | Cytochrome P450 |
| miR172c | GUAGCAUCAUCAAGAUUCAC | 7 | 4 | 2 | H_CL3307Contig1 | 6 | Cytochrome P450 | |
| S_CL373Contig2 | 6 | Cinnamate 4-hydroxylase (C4H) | ||||||
| | | | | | | V_CL1167Contig1 | 6 | Cinnamate 4-hydroxylase (C4H) |
| miR399 | GGGCGUCUUUCCUUUGGCAGG | 16 | 0 | 1 | S_CL3909Contig1 | 6 | Cytochrome P450 | |
| miR396a | CACAGCUUUCUUGAACUU | 11 | 22 | 7 | H_CL735Contig1 | 5.5 | Chalcone and stilbene synthase | |
| H_F09TDQC01AFYKY | 5.5 | Chalcone synthase (CHS) | ||||||
| M_FJZMSUQ02GLZSQ | 6 | Short chain alcohol dehydrogenase | ||||||
| S_CL831Contig1 | 5.5 | Chalcone and stilbene synthase | ||||||
| V_CL2603Contig1 | 5.5 | Constitutive photomorphogenic DWARF (CPD) | ||||||
| V_CL3087Contig1 | 5 | Cytochrome P450 | ||||||
| | | | | | | V_F1XUE2F01B3HEY | 5 | Cinnamate 4-hydroxylase (C4H) |
| miR845e | ACCUGGCUCUGAUACCAAUUG | 84 | 63 | H_CL735Contig1 | 5.5 | Chalcone and stilbene synthase family protein | ||
| | | | | | | S_CL831Contig1 | 5.5 | Chalcone and stilbene synthase family protein |
| miR396e | UUCCACAGCUUUCUUGAACUU | 6,309 | 2,520 | H_CL11974Contig1 | 6 | Cytochrome P450 | ||
| miR319 | UGGACUGAAGGGAGCUCCC | 7 | 9 | 2 | M_FJZMSUQ02HT34C | 4.5 | Cytochrome P450 | |
| miR396b | CACAGCUUUCUUGAACUG | 20 | 39 | 22 | S_FJZMSUQ02HBX1V | 6 | Cytochrome P450 | |
| V_CL10716Contig1 | 6 | CPD (CONSTITUTIVE PHOTOMORPHOGENIC DWARF) | ||||||
| V_CL1787Contig1 | 6 | Flavone synthase (FNS) | ||||||
| V_CL2603Contig1 | 6 | CPD (CONSTITUTIVE PHOTOMORPHOGENIC DWARF) | ||||||
| V_F1XUE2F01B3HEY | 6 | C4H (CINNAMATE-4-HYDROXYLASE) |
The bold characters denote enriched miRNA tags with less than p<0.001 by analyzing Audic’s test [88]. Abbreviations: V: ‘Vital’; M: ‘Marcia’; S: ‘Sympathy’; H: ‘Haedang’.
Transcript profiles for colour-related genes
| | | | | | | | | |
| 28 | 15 | 31 | 28 | 66 | 35 | 25 | 30 | |
| 68 | 15 | 76 | 100 | | | | | |
| 2.4 | 1 | 2.4 | 3.6 | | | | | |
| | | | | | | | | |
| 31 | 12 | 40 | 38 | 159 | 55 | 34 | 63 | |
| 96 | 12 | 232 | 201 | | | | | |
| 3.1 | 1 | 5.8 | 5.2 |
Abbreviations: V: ‘Vital’; M: ‘Maroussia’; S: ‘Sympathy’; H: ‘Haedang’.