| Literature DB >> 32426809 |
Kenta Shirasawa1, Hideki Hirakawa1, Nobuko Fukino2, Hiroyasu Kitashiba3, Sachiko Isobe1.
Abstract
AIM: The complex genome of a Japanese radish (Raphanus sativus) cultivar named 'Okute-Sakurajima' with an extremely large edible round root was analysed to explore its genomic characteristics. METHODS ANDEntities:
Keywords: chromosome-scale pseudomolecule sequences; genome sequence; long-read sequence technology; radish
Mesh:
Year: 2020 PMID: 32426809 PMCID: PMC7334891 DOI: 10.1093/dnares/dsaa010
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Genome size estimation for ‘Okute-Sakurajima’ with the distribution of the number of distinct k-mers (k = 17) with the given multiplicity values.
Statistics of the primary contig sequences of ‘Okute-Sakurajima’
| RSAskr_r1.0 | |
|---|---|
| Total contig size (bases) | 504,534,164 |
| Number of contigs | 1,437 |
| Contig N50 length (bases) | 1,247,688 |
| Longest contig size (bases) | 8,317,732 |
| Gap (%) | 0.0 |
| Complete BUSCOs | 94.1 |
| Single-copy BUSCOs | 71.0 |
| Duplicated BUSCOs | 23.1 |
| Fragmented BUSCOs | 2.0 |
| Missing BUSCOs | 3.9 |
| #Genes ( | 89,915 |
| #Genes (mapping) | 78,645 |
Genetic map length and number of SNPs for F2 radish populations
| LG | SNF2 | ASF2 | ||
|---|---|---|---|---|
| Number of SNPs | Map length (cM) | Number of SNPs | Map length (cM) | |
| Rs1 | 608 | 64.0 | 253 | 87.4 |
| Rs2 | 757 | 104.9 | 299 | 103.4 |
| Rs3 | 342 | 75.1 | 237 | 91.9 |
| Rs4 | 743 | 119.3 | 311 | 108.3 |
| Rs5 | 795 | 123.6 | 406 | 122.1 |
| Rs6 | 779 | 118.7 | 456 | 115.1 |
| Rs7 | 549 | 76.0 | 227 | 93.3 |
| Rs8 | 496 | 90.4 | 243 | 80.2 |
| Rs9 | 501 | 95.3 | 248 | 93.6 |
| Total | 5,570 | 867.2 | 2,680 | 895.3 |
The nine LGs were named Rs1 through Rs9 according to the nomenclature proposed by Shirasawa and Kitashiba.
An F2 mapping populations (n = 115) derived from a cross between an inbred line via self-pollination of radish cultivar ‘Shogoin Daikon’ and a line of R. sativus var. raphanistroides ‘Nohara 1’.
Another F2 population (n = 95) derived from a cross between ‘Aokubi S-h’ and ‘Sayatori 26704’ as reported in Kitashiba et al.
Statistics of the ‘Okute-Sakurajima’ pseudomolecule sequences, RSAskr_r1.0.pmol
| Chr | #Contigs | (%) | Contig size (bp) | (%) | #Genes | (%) |
|---|---|---|---|---|---|---|
| Rs1 | 25 | 1.7 | 27,719,058 | 5.5 | 5,426 | 6.0 |
| Rs2 | 40 | 2.8 | 42,944,316 | 8.5 | 8,295 | 9.2 |
| Rs3 | 30 | 2.1 | 31,410,669 | 6.2 | 5,923 | 6.6 |
| Rs4 | 35 | 2.4 | 56,498,296 | 11.2 | 10,843 | 12.1 |
| Rs5 | 34 | 2.4 | 42,357,306 | 8.4 | 8,477 | 9.4 |
| Rs6 | 41 | 2.9 | 53,940,652 | 10.7 | 10,403 | 11.6 |
| Rs7 | 17 | 1.2 | 28,108,325 | 5.6 | 5,545 | 6.2 |
| Rs8 | 30 | 2.1 | 28,319,830 | 5.6 | 5,474 | 6.1 |
| Rs9 | 41 | 2.9 | 38,520,653 | 7.6 | 7,143 | 7.9 |
|
|
|
|
|
|
|
|
| Rs0 | 1,144 | 79.6 | 154,715,059 | 30.7 | 22,386 | 24.9 |
| Total | 1,437 | 100.0 | 504,534,164 | 100.0 | 89,915 | 100.0 |
The nine pseudomolecule sequences were named Rs1 through Rs9 according to the nomenclature proposed by Shirasawa and Kitashiba. The remaining unassigned sequences were grouped into Rs0.
Bold indicates the subtotal values of Rs1 to Rs9.
Figure 2Sequence similarity of the ‘Okute-Sakurajima’ genome with three other species. Dots indicate sequence similarity of the ‘Okute-Sakurajima’ genome (RSAskr_r1.0) on the vertical axis versus those of R. sativus (Rs_1.0), A. thaliana (TAIR10), and B. rapa (IVFCAASv1) on the horizontal axis. Arrows indicate major regions on Rs1 to Rs7 presented in only RSAskr_r1.0.pmol.
Figure 3Numbers of genes predicted with the same method from the two radish genome sequences, RSAskr_r1.0 and RSA_r1.0.
Repetitive sequences in the ‘Okute-Sakurajima’ sequences, RSAskr_r1.0
| Repeat type | Number of elements | Length occupied (bp) | % |
|---|---|---|---|
| SINEs | 6,031 | 1,113,643 | 0.2 |
| LINEs | 18,132 | 12,792,260 | 2.5 |
| LTR elements | 67,265 | 144,417,931 | 28.6 |
| DNA elements | 90,424 | 37,170,618 | 7.4 |
| Unclassified | 127,034 | 46,043,450 | 9.1 |
| Small RNA | 7,177 | 8,492,545 | 1.7 |
| Satellites | 2,506 | 561,848 | 0.1 |
| Simple repeats | 112,754 | 5,427,620 | 1.1 |
| Low complexity | 25,293 | 1,305,261 | 0.3 |
Percentage of sequence length of RSAskr_r1.0 (504,676,864 bp).
Figure 4Distribution of repeat element lengths. Lengths of repeat elements were indicated by box plots. A horizontal dash line indicates the read N50 of sequence reads of ‘Okute-Sakurajima’.
Figure 5Distributions of genes and repeat elements over the ‘Okute-Sakurajima’ genome. Densities of genes and repeat elements are shown by bar plots in green and blue, respectively. Genome regions presented in only RSAskr_r1.0.pmol are represented by red boxes, which positions are corresponding to those indicated with arrows in Fig. 2.