Noriko Nakamura1, Hideki Hirakawa2, Shusei Sato3, Shungo Otagaki4, Shogo Matsumoto4, Satoshi Tabata2, Yoshikazu Tanaka1.
Abstract
The draft genome sequence of a wild rose (Rosa multiflora Thunb.) was determined using the Illumina MiSeq and HiSeq platforms. The assembly comprised 83,189 scaffolds with a total length of 739,637,845 bp, close to the 711 Mbp genome size estimated by k-mer analysis. The scaffold N50 length was 90,830 bp, and the longest scaffold was 1,133,259 bp. The average GC content of the scaffolds was 38.9%. After gene prediction, 67,380 candidates exhibiting sequence homology to known genes and domains were extracted, including both complete and partial gene structures. This large number of genes for a diploid plant may reflect heterogeneity of the genome originating from self-incompatibility in R. multiflora. According to CEGMA analysis, 91.9% of the core eukaryotic genes were completely conserved and 98.0% were at least partially conserved in the scaffolds. Genes presumably involved in flower color, scent and flowering were assigned. The results of this study will serve as a valuable resource for fundamental and applied research in the rose, including breeding and phylogenetic study of cultivated roses.
Year: 2018 PMID: 29045613 PMCID: PMC5909451 DOI: 10.1093/dnares/dsx042
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1. R. multiflora used in this study (A). Total RNA was prepared from the petals of buds (B), leaves (C) and roots (D) for RNA-Seq analysis.
Genomic features of RMU_r2.0 and RMU_r2.0_cds
| | RMU_r2.0 (genome) | RMU_r2.0_cds (CDS) |
|---|---|---|
| Number of sequences | 83,189 | 67,380 |
| Total length (bases) | 739,637,845 | 66,058,172 |
| Average length (bases) | 8,891 | 980 |
| Max length (bases) | 1,133,259 | 20,538 |
| Min length (bases) | 501 | 149 |
| N50 length (bases) | 90,830 | 1,272 |
| G+C% | 38.9 | 45.9 |
| Repeat% | 56.4 | – |
| Complete genes | – | 54,893 |
| Partial genes | – | 12,487 |
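
For readers unfamiliar with the summary statistics in the table, the following is a minimal Python sketch of how N50, average length and G+C% are conventionally computed from a set of sequences. It is not the authors' pipeline: the function names (`n50`, `mean_length`, `gc_percent`) and the toy inputs are hypothetical and chosen only for illustration. The last lines check that the reported average lengths follow from the reported totals and sequence counts.

```python
def n50(lengths):
    """Return the N50: the length L of the sequence at which the cumulative
    length, summed from the longest sequence downward, first reaches at
    least half of the total length."""
    if not lengths:
        raise ValueError("lengths must be non-empty")
    half = sum(lengths) / 2
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half:
            return length


def mean_length(lengths):
    """Average sequence length in bases."""
    return sum(lengths) / len(lengths)


def gc_percent(seqs):
    """G+C percentage over all unambiguous (A, C, G, T) bases."""
    gc = 0
    acgt = 0
    for s in seqs:
        s = s.upper()
        gc += s.count("G") + s.count("C")
        acgt += sum(s.count(base) for base in "ACGT")
    return 100.0 * gc / acgt


# Toy data (hypothetical, not taken from the assembly).
toy_lengths = [10, 8, 6, 4, 2]
print(n50(toy_lengths))              # 8: 10 + 8 = 18 reaches half of 30
print(gc_percent(["GGCC", "ATAT"]))  # 50.0

# Consistency check against the table: total length / number of sequences.
print(round(739_637_845 / 83_189))   # 8891 (genome average length)
print(round(66_058_172 / 67_380))    # 980 (CDS average length)
```

On a real assembly, the lengths and sequences would be read from the scaffold FASTA file. Note that some tools compute G+C% over all bases including N, so values can differ slightly depending on that choice.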