| Literature DB >> 32699274 |
Changhong Li1, Yongqi Zheng2, Ping Huang3.
Abstract
The rose is one of the most important ornamental woody plants because of its extensive use and high economic value. Herein, we sequenced a complete chloroplast genome of the miniature rose variety Rosa 'Margo Koster' and performed comparative analyses with sequences previously published for other species in the Rosaceae family. The chloroplast genome of Rosa 'Margo Koster', with a size of 157,395 bp, has a circular quadripartite structure typical of angiosperm chloroplast genomes and contains a total of 81 protein-coding genes, 30 tRNA genes and 4 rRNA genes. Conjunction regions in the chloroplast genome of Rosa 'Margo Koster' were verified and manually corrected by Sanger sequencing. Comparative genome analysis showed that the IR contraction and expansion events resulted in rps19 and ycf1 pseudogenes. The phylogenetic analysis within the Rosa genus showed that Rosa 'Margo Koster' is closer to Rosa odorata than to other Rosa species. Additionally, we identified and screened highly divergent sequences and cpSSRs and compared their power to discriminate rose varieties by Sanger sequencing and capillary electrophoresis. The results showed that 15 cpSSRs are polymorphic, but their discriminating power is only moderate among a set of rose varieties. However, more than 150 single nucleotide variations (SNVs) were discovered in the flanking region of cpSSRs, and the results indicated that these SNVs have a higher divergence and stronger power for profiling rose varieties. These findings suggest that nucleotide mutations in the chloroplast genome may be an effective and powerful tool for rose variety discrimination and DNA profiling. These molecular markers in the chloroplast genome sequence of Rosa spp. will facilitate population and phylogenetic studies and other related studies of this species.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32699274 PMCID: PMC7376030 DOI: 10.1038/s41598-020-68092-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Circular map of the cp genome for Rosa ‘Margo Koster’. The gray arrows show that genes inside the circle are transcribed clockwise, and genes outside the circle are transcribed counterclockwise. The innermost shaded areas inside the inner circle correspond to the GC content in the cp genome. Genes in different functional groups are color coded. The boundaries of four regions (IRa, IRb, LSC, SSC) are noted in the inner circle.
The gene content and functional classification in the cp genome of Rosa ‘Margo Koster’.
| Function | Genes |
|---|---|
| RNA transfer | trnH-GUG, trnK-UUU, trnQ-UUG, tRNA-Gly, trnG-UCC, trnR-UCU,trnD-GUC, trnY-GUA, trnE-UUC, trnT-GGU, trnM-CAU, trnS-UGA, trnfM-CAU, trnS-GCU, trnS-GGA, trnT-UGU, trnL-UAA,trnF-GAA,trnV-UAC, trnW-CCA, trnP-UGG, trnP-GGG, trnI-CAU, trnL-CAA, trnV-GAC, trnI-GAU*, trnA-UGC*, trnN-GUU, trnR-ACG, trnL-UAG |
| RNA ribosomal | rrn23, rrn16, rrn5, rrn4.5 |
| RNA polymerase | rpoC1*, rpoC2, rpoA, rpoB |
| Clpp, Matk, ORFs | Clpp**, matk, orf42, orf188 |
| Ribosomal proteins (SSU) | rps2, rps3, rps4, rps7, rps8, rps11, rps12*, rps14, rps15, rps16*, rps18, rps19 |
| Ribosomal proteins (LSU) | rpl2*, rpl2Ψ, rpl14, rpl16, rpl20, rpl22, rpl23, rpl32, rpl33, rpl36 |
| Hypothetical chloroplast reading frames (ycf) | ycf1, ycf1Ψ, ycf2, ycf3**, ycf4 |
| ATP synthase | atpE, atpB, atpA, atpF, atpH, atpI |
| Photosystem I | psaI, psaB, psaA, psaC, psaJ |
| Photosystem II | psbD, psbC, psbZ, psbT, psbH, psbK, psbI, psbJ, psbF, psbE, psbM, psbN, psbL, psbA, psbB |
| RubisCO large subunit | rbcL |
| Cytochrome complex | petN, petA, petL, petG, petB, petD |
| NADH dehydrogenase | ndhB*, ndhI, ndhK, ndhC, ndhF, ndhD, ndhG, ndhE, ndhA*, ndhH, ndhJ |
| Others | infA, accD, cemA, ccsA |
*Genes containing one intron; **genes containing two introns; Ψpseudogene.
Figure 2Neighbor-joining (NJ) and maximum likelihood (ML) trees for the Rosaceae family inferred from the complete cp genomes of 19 species from 5 subfamilies. Bootstrap values are indicated at the nodes. The length scale behind the tree indicates substitutions per site.
Figure 3Comparisons of sequence identity of cp genomes for five Rosa species. The vertical axis represents identity ranging from 50 to 100%. Each arrow indicates the annotated gene and its transcriptional direction. Genome regions are color coded as an exon, mRNA or tRNA, untranslated region (UTR) and conserved noncoding sequence (CNS).
Figure 4Comparison of the boundaries between LSC, SSC, and two IR regions among 19 cp genomes within the Rosaceae family and two species of basal angiosperms. The boundaries of five Rosa species are shown together at the top of the figure, and the differences in their boundaries are obvious. The number of base pairs (bp) represents the distance from the boundary to the end of the gene.
Twenty most divergent regions of the cp genome based on a comparison of Rosa species.
| No. | Region | Nucleotide diversity (π) | Total number of mutations (η) | Region length (bp) |
|---|---|---|---|---|
| 1 | 0.0094 | 41 | 2,381 | |
| 2 | 0.0100 | 10 | 420 | |
| 3 | 0.0101 | 27 | 1,220 | |
| 4 | 0.0101 | 4 | 201 | |
| 5 | 0.0103 | 12 | 510 | |
| 6 | 0.0109 | 5 | 247 | |
| 7 | 0.0110 | 7 | 292 | |
| 8 | 0.0113 | 13 | 469 | |
| 9 | 0.0120 | 21 | 883 | |
| 10 | 0.0121 | 20 | 827 | |
| 11 | 0.0124 | 35 | 1,314 | |
| 12 | 0.0126 | 32 | 1,240 | |
| 13 | 0.0192 | 18 | 450 | |
| 14 | 0.0244 | 11 | 312 | |
| 15 | 0.0244 | 21 | 631 | |
| 16 | 0.0320 | 23 | 400 | |
| 17 | 0.0323 | 12 | 223 | |
| 18 | 0.0368 | 9 | 214 | |
| 19 | 0.0492 | 62 | 529 | |
| 20 | 0.0524 | 18 | 183 |
The Primer information of 15 cpSSRs and nucleotide mutations of their flanking sequences.
| Locus Name | Forward primer | Reverse primer | Product size (bps) | Tm (°C) | Nucleotide diversity (π) | Total number of mutations (η) | Region length (bp) | Dja | Djb | Site |
|---|---|---|---|---|---|---|---|---|---|---|
| Rhcp2 | AACTCATCAACGGACTCTCCA | ATTAGTGCTTGATGCGGGAAA | 261 | 59.5 | 0.00193 | 3 | 205 | 0.4130 | 0.3446 | |
| Rhcp5 | GACTTGTGTTGGATTGGCACT | ACGGAACTTCGCCTTAACCAA | 289 | 60 | 0.02362 | 53 | 221 | 0.1231 | 0.9016 | |
| Rhcp6 | CATTCCTTCAGTTTGGAACCCA | TCTTGGTACTTGAAGAAGTGTGA | 337 | 58 | 0.06351 | 150 | 271 | 0.4544 | 0.9885 | |
| Rhcp11 | TGGAGTGAAAAGCGTCCATTG | AGCGCCTCTTATTCAAGTTATTCA | 167 | 59 | 0.04529 | 78 | 113 | 0.3111 | 0.5339 | |
| Rhcp13 | CGGATGGCCAATAACCCAAG | GAGGTATTTCGCAACTGGCCT | 311 | 59 | 0.00906 | 58 | 266 | 0.3700 | 0.5196 | |
| Rhcp14 | CGCACGTCGTAAACAAATCCA | AGCTTAGCCTGACGCAATGT | 331 | 60 | 0.05264 | 76 | 282 | 0.4642 | 0.8277 | |
| Rhcp16 | CCGGCTCCAGTAGTTACACC | TAACCGTCGAGGCGAAGTAG | 350 | 60 | 0.00891 | 34 | 296 | 0.2170 | 0.9495 | |
| Rhcp19 | CACATATTGCGCACTTCCCG | AGGGCCTCTTCGATGGGTAT | 246 | 60 | 0.04801 | 48 | 200 | 0.3534 | 0.9996 | |
| Rhcp22 | CGCTATCCGCCCAGGATAAT | CCTTGAGGTCACGGGTTCAA | 243 | 60 | 0.01792 | 24 | 197 | 0.3020 | 0.9727 | |
| Rhcp24 | CGGGGATACACGACAGAAGG | CACCTATTACAGAGATGGTGCGA | 190 | 60 | 0.02778 | 23 | 114 | 0.1043 | 0.9084 | |
| Rhcp29 | CGTGTAGAAACGTGTAGAAGGG | GAAACCATTGCAATTGCCGGA | 360 | 59 | 0.01671 | 48 | 322 | 0.4523 | 0.7457 | |
| Rhcp31 | AGCGAGTCAACCGCTAGAAC | GGAGAATGAACTCTGGGAAGGT | 296 | 60 | 0.03396 | 53 | 252 | 0.5046 | 0.9991 | |
| Rhcp34 | CTCCGAGTAAAGATCCGCCC | TGAAGTATCCAGGCTCCGTT | 308 | 59 | 0.03085 | 92 | 272 | 0.3925 | 0.7929 | |
| Rhcp38 | TGTGTATCTAGGGAATCGTCGC | CTGCCCCCGAGGGTCTATAA | 319 | 59 | 0.07443 | 81 | 269 | 0.9350 | 0.9930 | |
| Rhcp41 | AGCTCCTCGCGAATTAAACGA | TGGGAACGACAGAACCTGTG | 390 | 60 | 0.01791 | 39 | 352 | 0.6307 | 0.9790 |
‘a’ indicates the discriminating power based on the genotype of cpSSR; ‘b’ indicates the discriminating power based on the genotype of nucleotide mutations.
Figure 5The cpSSR genotype of rose varieties based on capillary electrophoresis and Sanger sequencing. (a)–(d) Variation in fragment size of cpSSRs and their repeat motif in different rose varieties.