| Literature DB >> 24602877 |
Hansheng Zhao1, Zhenhua Peng, Benhua Fei, Lubin Li, Tao Hu, Zhimin Gao, Zehui Jiang.
Abstract
Bamboo, as one of the most important non-timber forest products and fastest-growing plants in the world, represents the only major lineage of grasses that is native to forests. Recent success on the first high-quality draft genome sequence of moso bamboo (Phyllostachys edulis) provides new insights on bamboo genetics and evolution. To further extend our understanding on bamboo genome and facilitate future studies on the basis of previous achievements, here we have developed BambooGDB, a bamboo genome database with functional annotation and analysis platform. The de novo sequencing data, together with the full-length complementary DNA and RNA-seq data of moso bamboo composed the main contents of this database. Based on these sequence data, a comprehensively functional annotation for bamboo genome was made. Besides, an analytical platform composed of comparative genomic analysis, protein-protein interactions network, pathway analysis and visualization of genomic data was also constructed. As discovery tools to understand and identify biological mechanisms of bamboo, the platform can be used as a systematic framework for helping and designing experiments for further validation. Moreover, diverse and powerful search tools and a convenient browser were incorporated to facilitate the navigation of these data. As far as we know, this is the first genome database for bamboo. Through integrating high-throughput sequencing data, a full functional annotation and several analysis modules, BambooGDB aims to provide worldwide researchers with a central genomic resource and an extensible analysis platform for bamboo genome. BambooGDB is freely available at http://www.bamboogdb.org/. Database URL: http://www.bamboogdb.org.Entities:
Mesh:
Year: 2014 PMID: 24602877 PMCID: PMC3944406 DOI: 10.1093/database/bau006
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
BambooGDB data content and statistics as of 12 October 2013
| Data set | Data type | Data statistics |
|---|---|---|
| Basic data content | DNA/Protein | |
| Genes | 31 987 | |
| Expressed genes (FPKM | 28 576 | |
| MicroRNA target genes | 161 | |
| Proteins | 31 987 | |
| RNA | ||
| tRNAs | 1167 | |
| MicroRNAs | 86 | |
| Variant | ||
| Heterozygous SNPs | 2 009 487 | |
| Annotation | DNA/Protein | |
| Pfam-A accessions | 21 645 | |
| COGs | 14 049 | |
| InterPro accessions | 66 567 | |
| PANTHER | 38 868 | |
| EC | 752 | |
| Ortholog groups | 9856 | |
| Structure feature | ||
| Conserved domain models | 852 248 | |
| Conserved sites | 36 731 | |
| Pathway/Network | ||
| GO | 37 188 | |
| KO | 3714 | |
| Metabolic pathway | ||
| Proteins | 3946 | |
| Pathway maps | 191 | |
| PPI | ||
| Proteins | 2202 | |
| Interactions | 34 169 | |
| Comparative genomics | ||
| Best | 16 383 | |
| Best rice hits | 21 849 |
aFPKM: Fragments per kilobase of transcript per million mapped reads.
bHeterozygous SNPs: heterozygous single nucleotide polymorphisms.
cCOGs: Clusters of orthologous group.
dPANTHER: Protein ANalysis THrough Evolutionary Relationships (one classification system of protein).
eEC: Enzyme commission.
fGO: Gene ontology.
gKO: KEGG orthology.
Figure 1.Screenshot showing interrelation of data and tools housed in BambooGDB. Users access the data through search and browse function. In addition, all data and tools incorporated in BambooGDB are cross-linked.
Figure 2.One example of the application of BambooGDB for browsing and searching information for bamboo research.
Figure 3.Another example of the application of BambooGDB in deriving data-based new hypothesis for bamboo research. (a) An application example for studying on bamboo special characteristics; (b) An application example for studying on bamboo flowering mechanism and protein-protein interactions.