| Literature DB >> 24848699 |
Hiroyasu Kitashiba1, Feng Li1, Hideki Hirakawa2, Takahiro Kawanabe1, Zhongwei Zou1, Yoichi Hasegawa1, Kaoru Tonosaki1, Sachiko Shirasawa1, Aki Fukushima1, Shuji Yokoi3, Yoshihito Takahata3, Tomohiro Kakizaki4, Masahiko Ishida4, Shunsuke Okamoto5, Koji Sakamoto5, Kenta Shirasawa2, Satoshi Tabata2, Takeshi Nishio6.
Abstract
Radish (Raphanus sativus L., n = 9) is one of the major vegetables in Asia. Since the genomes of Brassica and related species including radish underwent genome rearrangement, it is quite difficult to perform functional analysis based on the reported genomic sequence of Brassica rapa. Therefore, we performed genome sequencing of radish. Short reads of genomic sequences of 191.1 Gb were obtained by next-generation sequencing (NGS) for a radish inbred line, and 76,592 scaffolds of ≥ 300 bp were constructed along with the bacterial artificial chromosome-end sequences. Finally, the whole draft genomic sequence of 402 Mb spanning 75.9% of the estimated genomic size and containing 61,572 predicted genes was obtained. Subsequently, 221 single nucleotide polymorphism markers and 768 PCR-RFLP markers were used together with the 746 markers produced in our previous study for the construction of a linkage map. The map was combined further with another radish linkage map constructed mainly with expressed sequence tag-simple sequence repeat markers into a high-density integrated map of 1,166 cM with 2,553 DNA markers. A total of 1,345 scaffolds were assigned to the linkage map, spanning 116.0 Mb. Bulked PCR products amplified by 2,880 primer pairs were sequenced by NGS, and SNPs in eight inbred lines were identified.Entities:
Keywords: draft sequence; high-density genetic map; radish
Mesh:
Substances:
Year: 2014 PMID: 24848699 PMCID: PMC4195494 DOI: 10.1093/dnares/dsu014
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Statistics of RSA_r1.0
| Number of scaffolds (≥300 bp) | 76,592 |
| Total length (Mb) | 402 |
| Average size (bp) | 5,253 |
| Maximum size (bp) | 831,256 |
| N50 (bp) | 46,262 |
| GC content (%) | 34.9 |
Statistics of genes predicted by Augustus 2.7
| Total | Intrinsic and partial | |
|---|---|---|
| Number of sequences | 80,521 | 61,572 |
| Total length (bp) | 78,403,816 | 53,787,862 |
| Average length (bp) | 974 | 874 |
| Max length (bp) | 16,173 | 16,173 |
| N50 length (bp) | 1,470 | 1,266 |
Figure 1.Venn diagram showing unique and shared gene families in R. sativus, A. thaliana, Brassica rapa, B. oleracea, and R. raphanistrum. Numbers in the individual sections represent the number of clusters. The number below the species name marks the total number of genes used as an input for the software. Genome data sets were used in R. sativus (RSA_r1.0), A. thaliana (TAIR10), and B. rapa (ver. 1.2), and EST-unigene data sets were used in B. oleracea and R. raphanistrum.
Figure 2.Comparison of KOG (Clusters of Orthologous Groups of proteins) classifications between R. sativus, B. rapa, A. thaliana, R. raphanistrum, and B. oleracea.
Figure 3.Dot-plot view of SRs of R. sativus (horizontal) and B. rapa (vertical) genomes. All genomic sequences of predicted genes in the pseudomolecules of scaffolds assigned to the R. sativus linkage map and those in the B. rapa genome were compared with each other, using nucleotide BLAST. The genes of B. rapa with the lowest E-value, which meets <1E−100, were regarded as the syntenic homologues, and the dots were plotted on the chart (see Materials and Methods). The genetic distances between the scaffolds were converted to physical distances based on the ratio of total length of linkage map and genome size of R. sativus. Axes represent the concatenation of all chromosomes for the corresponding genomes. Gridlines indicate the boundaries between chromosomes.