Jingyin Yu, Meixia Zhao, Xiaowu Wang, Chaobo Tong, Shunmou Huang, Sadia Tehrim, Yumei Liu, Wei Hua, Shengyi Liu.
Abstract
BACKGROUND: Brassica oleracea is a morphologically diverse species in the family Brassicaceae and contains a group of nutrition-rich vegetable crops, including common heading cabbage, cauliflower, broccoli, kohlrabi, kale, and Brussels sprouts. This diversity, together with its phylogenetic membership in a group of three diploid and three tetraploid species and the recent availability of genome sequences within Brassica, provides an unprecedented opportunity to study intra- and inter-species divergence and evolution in this species and its close relatives.

DESCRIPTION: We have developed a comprehensive database, Bolbase, which provides access to the B. oleracea genome data and comparative genomics information. The whole genome of B. oleracea is available, including nine fully assembled chromosomes and 1,848 scaffolds, with 45,758 predicted genes, 13,382 transposable elements, and 3,581 non-coding RNAs. Comparative genomics information is available, including syntenic regions among B. oleracea, Brassica rapa, and Arabidopsis thaliana; synonymous (Ks) and non-synonymous (Ka) substitution rates between orthologous gene pairs; gene families or clusters; and differences in quantity, category, and distribution of transposable elements on chromosomes. Bolbase provides search and data mining tools, including a keyword search, a local BLAST server, and a customized GBrowse tool, which can be used to extract annotations of genome components, identify similar sequences, and visualize syntenic regions among species. Users can download all genomic data and explore comparative genomics in a highly visual setting.
Year: 2013 PMID: 24079801 PMCID: PMC3849793 DOI: 10.1186/1471-2164-14-664
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1. Schematic illustration of the Bolbase sitemap.
Comparison of predicted protein-coding genes in Brassica oleracea, Brassica rapa, and Arabidopsis thaliana
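| Species | Number of genes | Average transcript length (bp) | Average CDS length (bp) | Average exons per gene | Average exon length (bp) | Average intron length (bp) |
| --- | --- | --- | --- | --- | --- | --- |
| B. oleracea | 45,758 | 1,761 | 1,037 | 4.55 | 228 | 204 |
| B. rapa | 41,174 | 2,014 | 1,171 | 5.03 | 232 | 210 |
| A. thaliana | 27,379 | 2,176 | 1,215 | 5.38 | 237 | 235 |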
B. rapa genome V1.0 gene sets downloaded from BRAD (http://brassicadb.org/brad/).
A. thaliana genome TAIR9 representative gene sets downloaded from TAIR (http://www.arabidopsis.org/).
Figure 2. Annotation of predicted protein-coding genes in the B. oleracea genome. A. basic information; B. protein sequence features; C. gene clusters, including orthologous groups and tandem duplicated arrays; D. syntenic analysis, including orthologous genes, syntenic regions, and triplicated blocks in B. rapa and A. thaliana; E. the orthologous genes of Bol007288 in A. thaliana (AT5G06860 and AT3G12090); F. the orthologous genes of Bol007288 in B. rapa (Bra038699 and Bra000594).
Figure 3. Syntenic regions of chromosome C01 and the A. thaliana genome. As an example, B. oleracea chromosome C01, which contains 55 syntenic regions, was compared to the genome of A. thaliana. The hyperlinks under 'Region' or 'Mapped Region' visually present the syntenic relationship between the two genomes. The hyperlinks under 'Detail' retrieve the orthologous gene pairs in the syntenic regions and calculate their Ka/Ks values and divergence times.
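The Ka/Ks values and divergence times mentioned in the caption can be illustrated with a minimal sketch. It assumes the widely used molecular-clock relation T = Ks / (2r) and a synonymous substitution rate of r = 1.5e-8 per site per year, a value commonly cited for Brassicaceae; the rate Bolbase itself applies is not stated here. The function names and the example Ka and Ks values are hypothetical.

```python
from typing import Optional

# Assumed rate: substitutions per synonymous site per year.
# Commonly cited for Brassicaceae; not confirmed as the value Bolbase uses.
ASSUMED_RATE = 1.5e-8


def ka_ks_ratio(ka: float, ks: float) -> Optional[float]:
    """Return Ka/Ks for one orthologous gene pair, or None when Ks is zero."""
    if ks == 0:
        return None  # the ratio is undefined without synonymous changes
    return ka / ks


def divergence_time_mya(ks: float, rate: float = ASSUMED_RATE) -> float:
    """Estimate divergence time in million years as T = Ks / (2 * rate)."""
    if ks < 0 or rate <= 0:
        raise ValueError("ks must be non-negative and rate must be positive")
    return ks / (2 * rate) / 1e6


# Hypothetical values for a single gene pair, for illustration only.
example_ka, example_ks = 0.05, 0.25
print(ka_ks_ratio(example_ka, example_ks))
print(divergence_time_mya(example_ks))
```

A Ka/Ks ratio below 1 is usually read as purifying selection, and the factor of 2 in the time estimate reflects substitutions accumulating along both lineages since the split.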
Syntenic regions on pseudomolecular chromosomes in Brassica oleracea, Brassica rapa, and Arabidopsis thaliana
| B. oleracea | A. thaliana Chr1 | Chr2 | Chr3 | Chr4 | Chr5 | B. rapa A01 | A02 | A03 | A04 | A05 | A06 | A07 | A08 | A09 | A10 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| C01 | 6 | 10 | 12 | 15 | 12 | 20 | 3 | 18 | 7 | 11 | 14 | 4 | 16 | 7 | 7 |
| C02 | 14 | 4 | 8 | 9 | 19 | 5 | 21 | 8 | 1 | 7 | 12 | 15 | 6 | 13 | 8 |
| C03 | 12 | 13 | 26 | 25 | 16 | 25 | 12 | 28 | 10 | 23 | 17 | 6 | 12 | 17 | 15 |
| C04 | 9 | 22 | 20 | 3 | 6 | 3 | 1 | 14 | 30 | 18 | 3 | 13 | 3 | 15 | 3 |
| C05 | 28 | 5 | 9 | 2 | 4 | 9 | 4 | 10 | 4 | 20 | 14 | 7 | 10 | 13 | 5 |
| C06 | 14 | 14 | 9 | 15 | 17 | 15 | 9 | 15 | 3 | 3 | 14 | 12 | 15 | 31 | 7 |
| C07 | 35 | 4 | 16 | 4 | 1 | 3 | 9 | 6 | 9 | 14 | 21 | 30 | 13 | 11 | 2 |
| C08 | 31 | 11 | 8 | 8 | 3 | 10 | 1 | 9 | 8 | 7 | 25 | 17 | 22 | 21 | 4 |
| C09 | 4 | 4 | 9 | 13 | 29 | 10 | 24 | 21 | – | 4 | 9 | 3 | 1 | 16 | 13 |
'–': no syntenic region between the corresponding chromosomes.
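The counts above can be cross-checked against the Figure 3 caption, which states that C01 contains 55 syntenic regions relative to A. thaliana. The sketch below assumes that the first row of counts belongs to C01 and that its first five columns are the five A. thaliana chromosomes, with the remaining ten being B. rapa A01 to A10. This layout is inferred from the data rather than stated in the source, and the variable names are illustrative.

```python
# First row of the syntenic-region table, copied as given.
c01_row = [6, 10, 12, 15, 12, 20, 3, 18, 7, 11, 14, 4, 16, 7, 7]

# Assumption: the leading five columns are A. thaliana Chr1 to Chr5,
# and the trailing ten are B. rapa A01 to A10.
N_ATH_CHROMOSOMES = 5

ath_total = sum(c01_row[:N_ATH_CHROMOSOMES])
bra_total = sum(c01_row[N_ATH_CHROMOSOMES:])

print(ath_total)  # 55, matching the count quoted in the Figure 3 caption
print(bra_total)  # 107
```

Under the same assumed layout, the largest B. rapa count in this row falls in the first B. rapa column (20), which would correspond to A01.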