| Literature DB >> 35448733 |
Zhidong Li1, Ying Li1, Tongkun Liu1, Changwei Zhang1, Dong Xiao1, Xilin Hou1.
Abstract
The availability of a high-quality genome sequence of Brassica campestris ssp. chinensis NHCC001 has paved the way for deep mining of genome data. We used the B. campestris NHCC001 draft genome to develop a comprehensive database, known as the non-heading Chinese cabbage database, which provides access to the B. campestris NHCC001 genome data. The database provides 127,347 SSR, from which 382,041 pairs of primers were designed. NHCCDB contains information on 105,360 genes, which were further classified into 63 transcription factor families. Furthermore, NHCCDB provides eight kinds of tools for biological or sequencing data analyses, including sequence alignment tools, functional genomics tools, comparative genomics tools, motif analysis tools, genome browser, primer design, and SSR analysis tools. In addition, eight kinds of graphs, including a box plot, Venn diagram, corrplot, Q-Q plot, Manhattan plot, seqLogo, volcano plot, and a heatmap, can be generated rapidly using NHCCDB. We have incorporated a search system for efficient mining of transcription factors and genes, along with an embedded data submit function in NHCCDB. We believe that the NHCCDB database will be a useful platform for non-heading Chinese cabbage research and breeding.Entities:
Keywords: Brassica campestris; genome database; non-heading Chinese cabbage; whole-genome sequence
Year: 2022 PMID: 35448733 PMCID: PMC9029197 DOI: 10.3390/plants11081005
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Flow diagram showing the designing and construction of NHCCDB.
Figure 2Screenshot of a species page for Brassica campestris (syn. Brassica rapa) ssp. chinensis.
Figure 3Browse module. (a) The syntenic gene in B. campestris NHCC001 and Arabidopsis thaliana. (b). Brassica campestris ssp. chinensis NHCC001 ARF (Auxin response factor) family. (c) Browsing of the SSR dataset of B. campestris NHCC001. (d) Gene function annotation datasets.
Figure 4GO and KEGG enrichment and JBrowse tools in NHCCDB. (a) Visualization of GO and KEGG enrichment results with bubble charts. (b) Visualization of GO and KEGG enrichment results with bar charts. (c) Visualization of GO and KEGG enrichment results with emapplots. (d) Visualization of GO and KEGG enrichment results with network planning. (e) Visualization of GO and KEGG enrichment results with heatmaps. (f) Genome browser view of processed genome data.
Figure 5A total of eight kinds of graphs can be generated using NHCCDB. (a) Box plot. (b) Corrplot. (c) Venn diagram. (d) Q-Q plot. (e) Heatmap. (f) Manhattan plot. (g) Volcano plot. (h) SeqLogo.
Figure 6Search module. (a) Batch Query tool search results. (b) B. campestris NHCC001 gene function annotation search result. (c) Using SSR (CCG)6 as an example to show SSR search results. (d) Syntenic genes search results.