| Literature DB >> 29907083 |
Weicong Qi1, Xi Chen2, Peihong Fang3, Shaochuan Shi3, Jingjing Li4, Xintong Liu3, Xiaoqian Cao3, Na Zhao3, Huiyuan Hao3, Yajie Li3, Yujie Han3, Zhao Zhang5.
Abstract
BACKGROUND: Rosa hybrida is a valuable ornamental, food and medicinal crop worldwide, but with relatively limited molecular marker resources, especially for flower-specific markers. In this study, we performed genomic and floral transcriptomic sequencing of modern rose. We obtained comprehensive nucleotide information, from which numerous potential simple sequence repeat (SSR) markers were identified but were found to have high rates of amplification failure and PCR product redundancy.Entities:
Keywords: Genome; Rose; Simple sequence repeat; Transcription factor; Transcriptome
Mesh:
Substances:
Year: 2018 PMID: 29907083 PMCID: PMC6003205 DOI: 10.1186/s12870-018-1322-5
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Summary of de novo assembly of the genome and transcriptome data
| Genome | Transcriptome | ||
|---|---|---|---|
| Contig | Scaffold | ||
| Raw data | 32.0G | 7.2G | |
| Number | 2,445,344 | 554,534 | 71,532 |
| N50 | 171 bp | 1681 bp | 1526 bp |
| Maximum Length | 9114 bp | 67,855 bp | 12,858 bp |
| Average Length | 180.2 bp | 1311.3 bp | 992 bp |
| Total Length | 0.44 Gb | 0.73 Gb | 0.71 Gb |
Fig. 1Kmer-17 depth distribution curve of the Rosa × hybrida genome sequence. A K-mer frequency distribution curve (k-mer = 17) using paired-ends with a 400 bp insert size is shown in the chart. The highest peak is located at a multiplicity of 9 (marked by the asterisk), and the estimated genome size based on the highest peak is 2.3 Gb
Fig. 2Motif distribution of the genomic and EST SSRs. Distribution of the top 10 motifs in genomic SSRs (a) and EST-SSRs (b). The most common motif in genomic and EST- SSR is the dinucleotide motif AG/CT. The X-axis represents the motif sequence, and the Y-axis represents the number of detected SSRs
Validation of the genomic and EST-SSRs developed de novo from rose in other Rosaceae species (Fragaria vesca, Prunus persica, Malus × domestica and Rubus occidentalis)
| Genomic SSR | EST-SSR | |
|---|---|---|
| Total | 124,591 | 2292 |
| Strawberry | 462 | 156 |
| Raspberry | 360 | 112 |
| Apple | 28 | 7 |
| Peach | 27 | 6 |
Fig. 3Distribution of rose genomic SSRs in the strawberry genome. Distribution of rose genomic SSRs in the genome of Fragaria vesca. The F. vesca genome contains seven chromosomes, in which a total of 475 rose genomic SSR markers were mapped onto 495 sites
Materials used for genetic diversity testing
| # | Name | Ploidy | # | Name | Ploidy |
|---|---|---|---|---|---|
| 1 | Variegata Di Bologna | 4× | 25 | Angela | 4× |
| 2 | First Blush | 4× | 26 | 2× | |
| 3 | Showtime | 4× | 27 | Baby Romantica | 4× |
| 4 | Xuelian | 4× | 28 | Samantha | 4× |
| 5 | Star Profusion | 4× | 29 |
| 2× |
| 6 | Cynthia | 4× | 30 | 2× | |
| 7 | Rose Gaujard | 4× | 31 | 2× | |
| 8 | Rose de Rescht | 4× | 32 |
| 2× |
| 9 | Green Ice | 4× | 33 | 2× | |
| 10 | Hudiequan | 4× | 34 |
| 2× |
| 11 | Cherry | 4× | 35 |
| 2× |
| 12 | 2× | 36 |
| 2× | |
| 13 | Sweet Pretty | 4× | 37 | 2× | |
| 14 | Geraldine | 4× | 38 |
| 4× |
| 15 | Golden Edge | 4× | 39 | 2× | |
| 16 | Qingge | 4× | 40 |
| 2× |
| 17 | Dainty Bess | 4× | 41 | 2× | |
| 18 | Beijinghong | 4× | 42 | 2× | |
| 19 | Xiang Yi | 4× | 43 | 2× | |
| 20 | Regensberg | 4× | 44 | 2× | |
| 21 | Xiang Fei | 4× | 45 |
| 2× |
| 22 | No. 26 | 4× | 46 | Camara | 4× |
| 23 | Elle | 4× | 47 | Red Naomi | 4× |
| 24 | Tian Mi De Meng | 4× | 48 | Black Magic | 4× |
Fig. 4Alleles detected by markers in diploid and tetraploid accessions. More markers detected multiple alleles in tetraploid rose than in diploid rose (a). The average number of alleles detected by a marker was higher in tetraploid rose than in diploid rose (b). The asterisk indicates significant differences (one-way ANOVA: p < 0.05) between the diploid and tetraploid accession
Fig. 5Tree plot of genetic diversity based on PCR results using 37 SSR markers. Tree plot derived from UPGMA cluster analysis using the NEI72 coefficients of SSR markers. The UPGMA (unweighted pair group method analysis) tree plot was generated using Mega 7.0 cluster analysis based on the genetic distances (calculated using NEI72 coefficients with NTSYS software) of 48 independent accessions. Blue circles = tetraploid rose accessions; red circles = diploid accession; purple circles = strawberry accession. The accession codes are marked, and the names can be found in Table 3. * = cultivar ‘Samantha’
Fig. 6Workflow used in the present study. High-throughput sequencing was performed to generate 32 Gb and 7.2 Gb of raw reads for the rose genome and flower transcriptome, respectively. After de novo assembly, a total of 1.17 GB of genomic scaffolds, as well as 71 Mb of EST contigs were obtained. Altogether, 163,680 SSR markers were developed for these genome scaffolds and 16,345 for transcriptomic contigs. Further validation showed that 124,591 genomic and 2762 transcriptomic SSR markers (total of 127,353) had unique annealing sites on the scaffolds; these markers were of better quality than the primary markers. Based on BLAST analysis, 5225 genomic SSR markers were associated with the transcriptome. Together with the EST SSRs, 7987 SSRs were develop that could potentially be used as functional markers for rose floral development