| Literature DB >> 31836898 |
Zhenglin Zhu1, Zhufen Guan1, Gexin Liu1, Yawang Wang1,2, Ze Zhang1.
Abstract
Although the domestic silkworm (Bombyx mori) is an important model and economic animal, there is a lack of comprehensive database for this organism. Here, we developed the silkworm genome informatics database (SGID). It aims to bring together all silkworm-related biological data and provide an interactive platform for gene inquiry and analysis. The function annotation in SGID is thorough and covers 98% of the silkworm genes. The annotation details include function description, Gene Ontology, Kyoto Encyclopedia of Genes and Genomes pathway, subcellular location, transmembrane topology, protein secondary/tertiary structure, homologous group and transcription factor. SGID provides genome-scale visualization of population genetics test results based on high-depth resequencing data of 158 silkworm samples. It also provides interactive analysis tools of transcriptomic and epigenomic data from 79 NCBI BioProjects. SGID will be extremely useful to silkworm research in the future.Entities:
Mesh:
Year: 2019 PMID: 31836898 PMCID: PMC6911161 DOI: 10.1093/database/baz134
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
A summary of the data in SGID
| Item | Description | Cov. | Previous |
|---|---|---|---|
| Genome | High-quality assembly of the silkworm genome in chromosome level ( | The same as Kawamoto et al. and Mita et al. ( | |
| Gene models | 16 880 in total, with 12 752 correlated with old gene models ( | The same as Kawamoto et al. and Mita et al. ( | |
| Gene function annotation | 15 594 genes are of function annotations | 92.4% | 80.3% in Kawamoto et al. and Mita et al. ( |
| 4937 gene feature annotations | 22.0% | NA | |
| 201 309 annotations from InterproScan | 93.9% | NA | |
| 8730 distinctive GO lists | 86.9% | 54.2% in Kawamoto et al. and Mita et al. ( | |
| 16 028 correlated KEGG Gene IDs | 96.4% | NA | |
| 138 KEGG pathways | 16.5% | NA | |
| 16 320 correlated Entrez IDs | 96.7% | NA | |
| Biophysics and chemistry | 2487 EC numbers | 13.8% | NA |
| 329 biophysicochemical properties | 0.6% | NA | |
| 2445 catalytic activity annotations | 12.0% | NA | |
| 1743 cofactor information annotations | 7.5% | NA | |
| Topology | 20 378 subcellular localization annotations | 99.9% | NA |
| 2878 genes with transmembrane regions | 17.0% | NA | |
| 1960 genes with signal peptides | 11.6% | NA | |
| Proteomics and protein structure | 12 394 real peptides from experiments validated 2999 protein coding genes | 17.8% | NA |
| 9844 genes significantly correlated PDB protein structures | 58.3% | NA | |
| 1 730 892 correlated EMBL IDs | 92.3% | NA | |
| 17 762 correlated Gene3D IDs | 57.2% | NA | |
| 112 275 correlated Interpro IDs | 86.9% | NA | |
| 6257 CDD annotations | 29.0% | NA | |
| TFs | 704 items | 4.2% | NA |
| Repeat elements | 571 401 segments, with 28 519 DNA transposons, 190 316 LINE, 13763 LTR and 179 435 SINE | In accordance with Osanai-Futahashi et al. ( | |
| Transcriptomics | 306 samples from 41 projects | NA | |
| Epigenomics | 187 samples from 38 projects | NA | |
| Populations genetics | Sliding widow analysis results based on 158 silkworm genomes | NA |
‘Cov.’ denotes the coverage of genes with corresponding annotations in total genes. ‘Previous’ denotes the comparison of SGID and previous work. ‘NA’ denotes that related data is not available in previously built silkworm databases, such as SilkBase, Ensembl Silkworm and SilkDB.
Figure 1The distribution of silkworm genes in different cellular subunits.
Figure 2A snapshot of the genome browser of SGID with KWMTBOMO08141 in the center. Below are tracks of population genetics test results.
Figure 3Search engines and analysis tools in SGID. Lines with arrows pointed out a general analysis flow in SGID.