| Literature DB >> 24434032 |
Weiqi Tang1, Liying Yu, Weiyi He, Guang Yang, Fushi Ke, Simon W Baxter, Shijun You, Carl J Douglas, Minsheng You.
Abstract
The diamondback moth Genome Database (DBM-DB) is a central online repository for storing and integrating genomic data of diamondback moth (DBM), Plutella xylostella (L.). It provides comprehensive search tools and downloadable datasets for scientists to study comparative genomics, biological interpretation and gene annotation of this insect pest. DBM-DB contains assembled transcriptome datasets from multiple DBM strains and developmental stages, and the annotated genome of P. xylostella (version 2). We have also integrated publically available ESTs from NCBI and a putative gene set from a second DBM genome (KONAGbase) to enable users to compare different gene models. DBM-DB was developed with the capacity to incorporate future data resources, and will serve as a long-term and open-access database that can be conveniently used for research on the biology, distribution and evolution of DBM. This resource aims to help reduce the impact DBM has on agriculture using genomic and molecular tools. Database URL: http://iae.fafu.edu.cn/DBM/Entities:
Mesh:
Year: 2014 PMID: 24434032 PMCID: PMC3893660 DOI: 10.1093/database/bat087
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Summary of the DBM genome and transcriptome datasets in DBM-DB (version 1.2)
| Data set | Number | Percentage |
|---|---|---|
| Genome dataset | ||
| Assembly scaffold (version 2) | 1819 | |
| Official Gene Set (version 1) | 18 071 | |
| Gene annotation | ||
| SwissProt | 12 631 | 69.90 |
| TrEMBL | 14 844 | 82.14 |
| InterPro | 12 877 | 71.26 |
| KEGG | 10 390 | 57.50 |
| GO | 10 745 | 59.46 |
| Total annotated genes | 15 195 | 84.08 |
| Transcriptome dataset | ||
| Total unigenes | 171 262 | |
| Functionally annotated unigenes | 38 255 | 22.34 |
| OGSv1 with RPKM value >1 | 16 150 | 89.37 |
aPercentage of the official gene set
Statistics of the DBM genome assembly (version 2) in DBM-DB and KONAGAbase
| Data set | Total number | GC % | Size (Mb) | N50 (bp) | Max (bp) |
|---|---|---|---|---|---|
| DBM-DB: DBM susceptible (Fuzhou-S) strain ( | |||||
| Scaffolds | 1819 | 38.34% | 394 | 737 182 | 3 493 687 |
| Contigs (>1 kbp) | 31 979 | 38.34% | 334 | 18 785 | 202 508 |
| KONAGAbase: Bt-toxin susuceptible strain ( | |||||
| Contigs (>1 kbp) | 88 530 | 38.3% | 186 | 2273 | 24 960 |
| Degenerate contigs | 246 44 | 38.8% | 148 | 643 | 12 183 |
| Singleton | 106 455 | 42.0% | 31 | 399 | 727 |
Figure 1.Flowchart of the DBM gene prediction. Software or approaches used for the prediction are shown in dashed boxes.
Statistics of unigenes/ESTs in DBM-DB and KONAGAbase
| Data set | Total number | GC % | Size (Mb) | Mean (bp) | Max (bp) | |
|---|---|---|---|---|---|---|
| DBM-DB | Egg | 70 234 | 45.3% | 31.4 | 447 | 14 617 |
| Larva | 69 008 | 47.9% | 24.8 | 436 | 20 840 | |
| Pupa | 73 194 | 45.3% | 32.5 | 444 | 20 840 | |
| Adult | 55 943 | 45.2% | 30.1 | 443 | 14 617 | |
| CRS | 54 869 | 51.8% | 24.9 | 454 | 20 804 | |
| RFS | 58 565 | 48.8% | 27.4 | 468 | 20 840 | |
| Total | 171 262 | 43.0% | 93.6 | 547 | 23 074 | |
| KONAGAbase | NCBI | 1722 | 50.1 | 1.1 | 645 | 16 113 |
| Midgut | 12 406 | 49.2 | 6.0 | 480 | 879 | |
| Egg | 6904 | 42.6 | 3.1 | 446 | 855 | |
| Testis | 16 308 | 44.2 | 9.7 | 446 | 880 | |
| Larvae (4th) | 147 370 | 42.8 | 66.5 | 451 | 11 311 | |
| Total | 84 570 | 43.0 | 47.8 | 564 | 16 249 | |
aSamples at four basic developmental stages of transcriptomes in DBM-DB are from DBM susceptible (Fuzhou-S) strain. CRF and RFS represent the chlorpyrifos- and fipronil-resistant strains, respectively. The total unigenes is the result of clustering and redundancy filtering of six transcriptomes.
bKONAGAbase unigenes were assembled the EST/mRNA sequences from NCBI, the ESTs from midgut, egg and testes, and the RNA-seq contigs of the fourth instar DBM larvae.
Statistics of mapping unigenes from Fuzhou-S, PXS and ESTs (NCBI) to DBM reference genomes Fuzhou-S and PXS
| Data set | Fuzhou-S unigenes | PXS genes | ESTs |
|---|---|---|---|
| Total number of sequences in dataset | 38 255 | 32 800 | 36 907 |
| Fuzhou-S genome | 31 825 (83.19%) | 26 074 (79.49%) | 29 187 (79.08%) |
| Fuzhou-S fosmids (>5 Kb) | 3482 (54.15%) | 4022 (59.80%) | 3732 (48.34%) |
| Total Fuzhou-S | 35 307 (92.29%) | 30 096 (91.75%) | 32 919 (89.19%) |
| PXS genome | 28 064 (73.36%) | 24 882 (75.86%) | 28 970 (78.49%) |
aFosmid sequences were generated by sequencing 100 800 single colonies to achieve 10× coverage of the genome.
Figure 2.Schematic structure of the DBM-DB. Components including gene information, Overview, Search, BLAST and Gbrowse, and their orientations are presented. Arrows represent the relationships between different components.
Figure 3.Architecture of the DBM-DB.