| Literature DB >> 35670730 |
Munetaka Kawamoto1,2, Takashi Kiuchi1, Susumu Katsuma1.
Abstract
We introduce SilkBase as an integrated database for transcriptomic and genomic resources of the domesticated silkworm Bombyx mori and related species. SilkBase is the oldest B. mori database that was originally established as the expressed sequence tag database since 1999. Here, we upgraded the database by including the datasets of the newly assembled B. mori complete genome sequence, predicted gene models, bacterial artificial chromosome (BAC)-end and fosmid-end sequences, complementary DNA (cDNA) reads from 69 libraries, RNA-seq data from 10 libraries, PIWI-interacting RNAs (piRNAs) from 13 libraries, ChIP-seq data of 9 histone modifications and HP1 proteins and transcriptome and/or genome data of four B. mori-related species, i.e. Bombyx mandarina, Trilocha varians, Ernolatia moorei and Samia ricini. Our new integrated genome browser easily provides a snapshot of tissue- and stage-specific gene expression, alternative splicing, production of piRNAs and histone modifications at the gene locus of interest. Moreover, SilkBase is useful for performing comparative studies among five closely related lepidopteran insects. Database URL: https://silkbase.ab.a.u-tokyo.ac.jp.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35670730 PMCID: PMC9216573 DOI: 10.1093/database/baac040
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 4.462
A summary of the datasets installed in SilkBase
| Category | Library | Description | References |
|---|---|---|---|
|
| |||
| Genome | Chromosome-level genome assembly | 28 chromosomes and 668 scaffolds | ( |
| Fosmid library | 274 342 fosmid clones | ||
| BAC libraries | 137 753 BAC clones from three libraries | ( | |
| Hypothetically reconstructed genome | 782 316 in total from 18 strains | ||
| Re-sequenced genome DNA libraries | 1 672 128 940 reads from 18 strains | ||
| Old scaffold library (2008) | 43 462 scaffolds | ( | |
| Gene | Gene model | 16 880 genes | ( |
| Transcript level, protein family membership, domains and repeats, detail signature matches, residue annotation, GO term prediction and description against nr | |||
| Gene model (2008) | 14 623 genes | ( | |
| Position on genome, GO term prediction, Uniref and Orthologs | |||
| Geneset A | 16 823 genes | ( | |
| Position on genome | |||
| Transcriptome | Assembled RNA-seq libraries | 1 062 486 in total from 10 tissues/stage | ( |
| Position on genome, GO term prediction, description against nr and transcript level | |||
| ORF from assembled RNA-seq libraries | 529 531 in total from 10 tissues/stage | ( | |
| Position on genome, GO term prediction and description against nr | |||
| RNA-seq libraries | Reads from 10 tissues/stage | ( | |
| The complete sequences of the FL-cDNA clone libraries | 11 833 clones | ( | |
| Position on genome | |||
| cDNA libraries | 461 119 in total from 69 libraries | ( | |
| Position on genome, GO term prediction, Uniref and Orthologs | |||
| SAGE libraries | 82 227 clones | ( | |
| MPSS libraries | 44 872 clones | ||
| piRNA libraries | 83 984 212 reads in total from 13 libraries | ( | |
| Transposon libraries | 121 of well-annotated transposons and 1690 of transposons | ( | |
| Epigenome | ChIP-seq libraries | 526 234 147 reads from 16 libraries | ( |
| Peak calling | |||
|
| |||
| Genome | Scaffold-level genome assembly | 66 797 scaffolds | |
| Fosmid libraries | 153 216 clones | ||
| Hypothetically reconstructed genome | 86 924 in total from two strains | ||
| Re-sequenced genome DNA libraries | 185 322 718 reads from two strains | ||
| Transcriptome | Assembled RNA-seq libraries | 141 139 in total from three tissues | |
| GO term prediction, description against nr and transcript level | |||
| ORF from assembled RNA-seq libraries | 73 790 in total from three tissues | ||
| GO term prediction, Description against nr | |||
|
| |||
| Transcriptome | Assembled RNA-seq libraries | 106 248 in total from three tissues | |
| GO term prediction, description against nr and transcript level | |||
| ORF from assembled RNA-seq libraries | 41 707 in total from three tissues | ||
| GO term prediction and description against nr | |||
|
| |||
| Transcriptome | Assembled RNA-seq libraries | 38 954 assembly | |
| GO term prediction, description against nr and transcript level | |||
| ORF from assembled RNA-seq libraries | 15 068 ORFs | ||
| GO term prediction and description against nr | |||
|
| |||
| Genome | Scaffold-level genome assembly | 155 scaffolds | ( |
| Transcriptome | Assembled RNA-seq libraries | 171 159 in total from three tissues | |
| GO term prediction, description against nr and transcript level | |||
| ORF from assembled RNA-seq libraries | 78 839 in total from three tissues | ||
| GO term prediction and description against nr | |||
| cDNA | 20 320 clones from two libraries | ( | |
| GO term prediction, Uniref and Orthologs | |||
This database.
This database (Sequenced by National Bio Resource Project).
Figure 1.Overview of the Web user interfaces. The path lines show the user pathways between the Web user interfaces.
Figure 2.Integrated genome browser. (A) Overview of the genome browser. Genomic, transcriptomic and epigenomic information is displayed on a single screen. (B) Transcriptional status of BmSuc1 in different tissues. (C) A piRNA-producing locus in the Masc gene. (D) Histone modifications around the KWMTBOMO06377 gene.
Figure 3.piRNA- and ChIP-seq mapping tool. The vertical axis indicates the nucleotide position of the query sequence, and the horizontal axis indicates the depth of coverage. The upper graph shows the mapping results on the sense strand, whereas the lower graph shows those on the antisense strand (piRNA mapping tool only). Information on nucleotide sequence, start point, end point, strand and the depth of coverage is displayed below the graph. (A) piRNA production in ovary from the transposon Kabuki. The detailed information of a certain peak (indicated by oval) is as follows: the piRNA sequence is ‘cggattcactgccatcggaaaatcaga’, and 108 reads are mapped from 2231 to 2257. (B) An example of the piRNA production status in the Fem locus. The depth of coverage is 935 on the sense strand and 1 on the antisense strand. (C) Mapping of the HP1a-binding sites around the KWMTBOMO02692 gene. The depth of ChIP-seq coverage on both strands is merged and displayed on a single graph (ChIP-seq mapping tool).
Figure 4.Comparative genomic analysis of B. mori-related species. (A) The phylogenetic relationship of B. mori and related species. The host plants are different among the three groups (indicated by rectangles). (B) An example of the identification of B. mori Masc homologs from four species.