| Literature DB >> 31592084 |
Min Tang1,2, Ying Lu3, Zhongmin Xiong1,2, Ming Chen1,2, Yufang Qin1,2.
Abstract
With the release of the draft genome of the grass carp, researches on the grass carp from the genetic level and the further molecular mechanisms of economically valuable physiological behaviors have gained great attention. In this paper, we integrated a large number of genomic, genetic and some other data resources and established a web-based grass carp genomic visualization database (GCGVD). To view these data more effectively, we visualized grass carp and zebrafish gene collinearity and genetic linkage map using Scalable Vector Graphics (SVG) format in the browser, and genomic annotations by JBrowse. Furthermore, we carried out some preliminary study on a whole-genome alternative splicing (AS)of the grass carp. The RNA-seq reads of 15 samples were aligned to the reference genome of the grass carp by Bowtie2 software. RNA-seq reads of each sample and density map of reads were also exhibited in JBrowse. Additionally, we designed a universal grass carp genome annotation data model to improve the retrieval speed and scalability. Compared with the published database GCGD previously, we newly added the visualization of some more genomic annotations, conserved domain and RNA-seq reads aligned to the reference genome. GCGVD can be accessed at http://122.112.216.104. © The author(s).Entities:
Keywords: Conserved domain; Database; Grass carp; RNA-seq
Mesh:
Year: 2019 PMID: 31592084 PMCID: PMC6775296 DOI: 10.7150/ijbs.32860
Source DB: PubMed Journal: Int J Biol Sci ISSN: 1449-2288 Impact factor: 6.580
Figure 1The pipeline of visualizing the genetic and genomic data of the grass carp.
Figure 2The grass carp genome annotation data model.
Figure 4Gene collinearity between the grass carp and the zebrafish. A, For each synteny diagram, the grass carp scaffold (left) is represented by blue blocks (for example, CI00000003). Each block represents a grass carp gene. The zebrafish chromosome (right) is represented by red blocks (for example, DR02). Each block represents one or several zebrafish genes. Homologous genes are connected by blue lines. B, Details of some homologous genes between zebrafish and grass carp are presented in a tabular format.
Figure 5All available data of genome annotations rendered in JBrowse. A, Selected data appears graphically on the right main window by selecting the options in the left frame. B, Detailed annotation of protein-coding gene will be displayed after clicking on the gene structure. Detailed annotation of other data can also be rendered selectively as described operation above.
Figure 6The conserved domain under a gene. Each line with arrow represents a conserved domain. For example, CI00000000:08634004. 08675794 is shown.
Figure 7The gene expression in 6 tissues. For example, RPKM of gene CI01000000_00034855_00045035 is shown.
Figure 8RNA-seq reads of grass carp. A, RNA-seq reads (Sample id 'ERR472713') were mapped to the reference genome. B, Density map of reads per 50bp. Here we captured the density map of 4 samples.
The comparison of the two grass carp genome databases.
| Comparative aspects | GCGVD | GCGD | The advantage of GCGVD |
|---|---|---|---|
| The main data in the database | Functional annotations, conserved domain, SNP, non-coding RNA genes, repeat annotation, RPKM of six tissues, gene collinearity, genetic map, RNA-seq reads, etc. | Functional annotations, gene collinearity, genetic map, SSRs, three transcriptomic datasets, etc. | GCGVD added some annotation data not available in GCGD. |
| Data management and storage | Designed a general annotation data model to store and manage data. | Ordinary storage. | GCGVD has higher retrieval speed and scalability in overlapping regions. |