| Literature DB >> 27307138 |
Deepika Singh1, Hasnahana Chetia1, Debajyoti Kabiraj1, Swagata Sharma1, Anil Kumar2, Pragya Sharma3, Manab Deka3, Utpal Bora4.
Abstract
Recent progress in the field of sequencing and analysis has led to a tremendous spike in data and the development of data science tools. One of the outcomes of this scientific progress is development of numerous databases which are gaining popularity in all disciplines of biology including sericulture. As economically important organism, silkworms are studied extensively for their numerous applications in the field of textiles, biomaterials, biomimetics, etc. Similarly, host plants, pests, pathogens, etc. are also being probed to understand the seri-resources more efficiently. These studies have led to the generation of numerous seri-related databases which are extremely helpful for the scientific community. In this article, we have reviewed all the available online resources on silkworm and its related organisms, including databases as well as informative websites. We have studied their basic features and impact on research through citation count analysis, finally discussing the role of emerging sequencing and analysis technologies in the field of seri-data science. As an outcome of this review, a web portal named SeriPort, has been created which will act as an index for the various sericulture-related databases and web resources available in cyberspace.Database URL: http://www.seriport.in/.Entities:
Mesh:
Year: 2016 PMID: 27307138 PMCID: PMC4909305 DOI: 10.1093/database/baw086
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Number of publications on seri-related databases from the year 2003 to 2015* where (*) represents 2015-continued year.
Figure 2.Timeline of the existing seri-databases from the year 2003 to 2015# generated using respective publication in the literature and database creation year from websites, where (#) represents 2015-continued year; (*) indicates database first published in 1999 and its updated versions considered from period 2003–2015; (**) indicates the same database with updated information.
Figure 3.Schematic representation of Seri-databases classified into four categories- Silkworm Databases-20 No.s, Host Plant Databases-23 No.s, Pest and Pathogen Databases- 01 No., Combined Databases-17 No.s.(Abbreviations- SilkDB: Silkworm Knowledgebase, EST DB: Expressed Sequence Tag Database, BmMDB: Bombyx mori Microarray Database, SilkTransDB: Silkworm Transcriptome Database, SilkSatDb: Silkworm Microsatellite Database, DBMP: Database of Bombyx mutant photographs, BmTEdb: Transposable elements database for B. mori, SilkProt: Annotated protein database of silkworm, SilkPPI: Silkworm Protein–Protein Interaction database, SilkTF: Silkworm Transcription Factor Database, SGRDB: Silkworm Gene Resources database, iPathDB: Insect Pathway Database, MorusDB: Morus Genome Database, MulSatDB: Mulberry Microsatellite Database, CastorDB: A comprehensive knowledgebase database for R. communis, Papaya-DB: Papaya Genomic Resources Online, CPR-DB: Papaya Repeat Database, CGDB: Cassava Genome Database, CCDB: Chinese Cassava Genome Database, HOSTS: a Database of the World's Lepidopteran Host plants, PlantGDB: Resources for Comparative Plant Genomics, ChromDB: The Chromatin Database, PlantTFDB: Plant Transcription Factor Database, SilkPathDB: Silkworm Pathogen Database, BOLD: Barcode of Life Data System, DBIF: Database of Insects and their Food Plants, DBMW: Database of Butterflies and Moth of the World, CNIDB: Common Names of Insects Databases, BAMONA: Butterflies and moths of North America, EOL: Encyclopedia of Life, ITIS: Integrated taxonomic information system, SRDB: Spatio-temporal database of Silk Road, SFSDB: Silk Fabric Specification Database, miRNEST: An integrative microRNA resource, miRBase: The microRNA database, MEROPS: the peptidase database).
Figure 4.Percentage distribution of various features across seri-databases.
Comparative features of Seri-databases (Data Search on 11 March 2016)
| Name | Website design & implementation | Database download enabled | Public data submission | Browse | Public query submission/search | Data input | Cross-referenced | Associated with online analytical tools | Help | User registration | Search visibility (Google) | References | Comments |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Last Updated | |||||||||||||
| Silkworm databases | |||||||||||||
| Silkworm genome databases | |||||||||||||
| SilkDB (v2.0) | -MySQL-Navigated by GBrowse | ✓ (via ftp) | ✗ | ✓ (GBrowse, SCB tool) | ✓ (Quick Search, etc.) | ✗ | ✓ | ✓ (SilkMap, BLAST, Wego, Clustal W, Cap3, BL2SEQ, EMBOSS, WISE2) | ✓ | ✗ | Fair | ( | |
| (2009) | |||||||||||||
| Kaikobase | -Written in Javascript- | ✓ (Through | ✓ (can upload and share custom tracks in GBrowse) | ✓ (GBrowse, UTGB and GeneViewer) | ✓ (Keyword and Position Search, Scaffold Sequence Search) | ✗ | ✗ | ✓ (KAIKOBLAST, KAIKOGAAS, GBrowse) | ✓ | ✗ | Fair | ( | |
| 07/06/2013 | |||||||||||||
| Silkworm gene expression databases | |||||||||||||
| SilkBase | NA | ✓ (cDNA and EST for | ✗ | ✓ (Libraries tab) | ✓ (Different Search Bar- Keyword, gene model, genome position, ESTs, etc.) | ✓ | ✓ | ✓ (All variations of BLAST) | ✗ | ✗ | Fair | ( | |
| 16/03/2015 | |||||||||||||
| WildSilkbase | MySQL and PHP, Apache web server, Fedora Linux system | ✓ (EST Sequence data, EST annotation data) | ✗ | ✓ | ✓ (3 searches-Keyword Search, Homolog Finder and SSR Finder) | ✗ | ✗ | ✓ (BLAST, Homolog Finder, SSR Finder) | ✓ | ✗ | Fair | ( | |
| Butterfly Base | ✓ (FASTA) | ✓ | ✓ (Genome Browse Under Development) | ✓ (Text search, BLAST search, etc.) | ✓ | ✗ | ✓ (NCBI-BLASTALL, PSI-BLAST and WU-BLAST-driven MS-BLAST, prot4EST) | ✓ | ✗ | Poor | ( | ||
| BmMDB | MySQL, | ✗ | ✗ | ✓ (Browse Raw data, Browse Tissue specific genes) | ✓ (Search by Probe ID, Search by BLAST) | ✗ | ✓ | ✓(BLAST) | ✗ | ✗ | Fair | ( | |
| SilkTransDB | Gbrowse | ✓ (PNG, SVG, FASTA, GFF) | ✓ (can upload file in Gbrowse option) | ✓ Gbrowse | ✓ (BLAST Search) | ✗ | ✓ | ✓ (BLAST-blastn, tblastn, tblastx) | ✓ | ✗ | Fair | ( | |
Microsatellite database | |||||||||||||
| SilkSatDB | MySQL, PHP, Apache web server | ✗ | ✗ | ✓ | ✓ | ✗ | ✗ | ✓ (SSRF, AutoPrimer) | ✗ | ✗ | Fair | ( | |
| 13/10/2004 | |||||||||||||
| InSatDB | MySQL, PHP, Apache web server | ✓ (.csv files) | ✗ | ✓ | ✓ (multi-option query sheet) | ✗ | ✗ | ✓ (Primer3) | ✓(Tutorial) | ✗ | Fair | ( | |
Silkworm mutant databases | |||||||||||||
| DBMP | NA | ✗ (Photographs of mutants can be accessed) | ✗ | ✓ | ✗ | ✗ | ✗ | ✗(BLAST associated with full length cDNA DB but currently not functional) | ✗ | ✗ | Fair | No publication | |
| ABURA-KO | NA | ✗ (Photographs of mutants can be accessed) | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | Fair | No Publication | |
| 07/03/2012 | |||||||||||||
| Bombyx Trap Database | Integrated with Kaikobase | ✗ (Photographs of strains can be accessed) | ✗ | ✓ | ✓ (Word search and Pictorial search options) | ✗ | ✓ | ✗ | ✓ | ✗ | Fair | No Publication for database but there is a reference publication ( | |
| 28/02/2011 | |||||||||||||
Transposable elements databases | |||||||||||||
| BmTEdb | NA | ✓ (FASTA) | ✗ | ✓ (Browse BmTEdb) | ✓ (Keyword search) | ✗ | ✗ | ✓ (BLAST, HMMER, GetORF) | ✓ | ✗ | Fair | ( | |
| 07/27/2013 | |||||||||||||
Silkworm protein databases | |||||||||||||
| Kaiko2DDB | - | ✗ (2D-PAGE Images can be retrieved) | ✗ | ✓ | ✓ (simple search queries, combined fileds) | ✗ | ✗ | ✗ | ✗ | ✗ | Fair | ( | |
| SilkProt | NA | ✗ | ✗ | ✗ | ✓ (Submit Query) | ✗ | ✗ | ✗ | ✗ | ✗ | Poor | No Publication | |
| SilkPPI | NA | ✗ | ✗ | ✓ | ✓ | ✗ | ✓ | ✗ | ✓ | ✗ | Poor | No publication for DB But the data available is referred in Ref ( | |
| SilkTF | NA | ✗ | ✗ | ✓ (Browse by id) | ✓ (Search by Sequence ID, Domain) | ✗ | ✓ | ✓(BLAST) | ✓ | ✗ | Fair | No Publication | |
| Silkworm genetic resource database | |||||||||||||
| SGRDB | MYSQL, JAVA, Oracle relational database management system (RDBMS) | – | – | – | – | – | – | – | – | – | Poor | ( | |
| Silkworm Base | NA | ✗ | ✓ (Publication) | ✓ | ✓ (3 Search options-strain, gene & references) | ✓ | ✓ | ✗ | ✗ | ✓ | Fair | No publication for DB | |
| 27/04/2015 | |||||||||||||
Insect pathway databases | |||||||||||||
| iPathDB | HTML, PHP, CSS and JavaScript, Apache HTTP server, Linux operating system (Redhat 5.6, Raleigh, NC, USA) | ✓ (via FTP, Data Sorted by species, Phylogenetic tree, Software: iPathCons, raw data) | ✓ | ✓ | ✓ (Search by species name, pathway ID or name, Pathway Search) | ✓ | ✓ | ✓ (iPathCons) | ✓ | ✗ | Fair | ( | |
Silkworm host plant databases | |||||||||||||
Mulberry databases | |||||||||||||
| MorusDB | ✓ (via FTP-FTP Downloads, File Browser Downloads) | ✗ | ✓ GBrowse | ✓ (Search, Fetch data) | ✓ | ✓ | ✓ (BLAST, WEGO, Browse GO, Search GO, Genome browser) | ✓ | ✗ | Fair | ( | ||
| MorusDB v1.0 released (29/08/2014) | |||||||||||||
| MulSatDB | MySQL relational database, Apache 2.2 web server, Linux system, PHP, HTML and JavaScript | ✗ | ✓ | ✓ | ✓ (Search markers-whole genome, EST) | ✓ | ✓ | ✓ (CMap, Primer3Plus, BLAST) | ✗ | ✗ | Fair | ( | |
| 05/02/2014 | |||||||||||||
| Castor databases | |||||||||||||
| Castor Database | PHP | ✗ | ✗ | ✓ | ✓ | ✗ | ✗ | ✗ | ✗ | ✗ | Poor | No publication | |
| 04/2014 | |||||||||||||
| Other web resources on Castor | |||||||||||||
| JCVI Castor Bean Genome DB | NA | ✓ (via FTP) | ✗ | ✗Gbrowse (Error) | ✓ (sequence BLAST search) | ✗ | ✗ | ✓ (BLAST-blastn, blastp, blastx, tblastn, tblastx) | ✗ | ✗ | Fair | No Publication | |
| CastorDB | MySQL, Perl API, Perl, Java, CGI, HTML, Javascript | ✓ (text, jnlp file) | ✗ | ✓ | ✓ (simple, advanced and similarity search) | ✗ | ✗ | ✓ (BLAST) | ✓ | ✗ | Poor | ( | |
| Papaya databases | |||||||||||||
| Papaya-DB | NA | ✗ | ✗ | ✓ (Gbrow-se is not function-al) | ✓ (Search not-functional) | ✗ | ✓ | ✗ | ✗ | ✗ | Fair | Draft genome published in 2008 ( | |
| CPR-DB | Hosted via ftp | ✓ (via FTP-TE, TR, protein sequences can be downloaded) | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | Fair | ( | |
| Jatropha databases | |||||||||||||
| Jatropha genome DB | NA | ✓ (via FTP) | ✗ | ✓ | ✓ (Keyword Search, Similarity search) | ✓ | ✗ | ✓ (BLASTN, BLASTP, TBLASTN) | ✗ | ✗ | Fair | ( | |
| Cassava databases | |||||||||||||
| CGDB | NA | ✓ (BAC End Sequences, Cassava ESTs, Assembled Cassava ESTs, SNPs from physical map and genes)- Right click on dataset name, & select “ | ✗ | ✓ (Gbrow se is not-function-al) | ✗ | ✗ | ✗ | ✓ (BLAST-blast, blastn, blastp, tblastn, tblastx) | ✗ | ✗ | Fair | No Publication | |
| CCDB | NA | ✓ (FASTA, GFF) | ✗ | ✓ (Genome Browser) | ✗ | ✗ | ✗ | ✓ (BLAST) | ✓ (Under construction) | ✓ (Not functional) | Fair | No Publication | |
| 01/08/2011 | |||||||||||||
| Cassavabase | ✓ (via FTP) | ✓ | ✓ GenomeBrowser (Jbrowse) | ✓ (Multi- optional search) | ✓ | ✓ | ✓ ( | ✓ | ✓ | Fair | ( | ||
| 21/02/2016 | |||||||||||||
| Quercus (Oak) databases | |||||||||||||
| Quercus portal | NA | ✓ | ✓ | ✓ | ✓ (Integrated search-Global search) | ✓ | ✓ | ✓ (BLAST) | ✓ (Overall navigation guide need for easy access) | ✓ | Fair | – | |
| 19/05/2015 | |||||||||||||
| Other generalized plant databases (Not specific for only silkworm host plants but contains some of their information) | |||||||||||||
| HOSTS | NA | ✗ | ✗ | ✓ | ✓(Search & Drill down search-Lepidoptera, Hostplant) | ✗ | ✗ | ✗ | ✓ | ✗ | Fair | ( | |
| PlantGDB | MySQL-PHP-Perl-Apache | ✓ (via ftp, FASTA format, GenBank, GFF3 or EMBL format, bzip2 files and MySQL tables) | ✓ (Users can submit annotations) | ✓ (Genome browser) | ✓ (Sequence ID, Sequence Search) | ✗ | ✓ | ✓ ( | ✓ | ✓ | Fair | ( | |
| 23/07/2012 | |||||||||||||
| Phytozome | LAMPJ stack (Linux, Apache, MySQL, PHP/Perl and Java) | ✓ (via the JGI Genome Portal after log in- OBO format, HTML table and tab delimited text) | ✓ | ✓ (Gbrowse) | ✓ (Keyword Search) | ✓ | ✗ | ✓ (JBrowse, BLAST, BLAT, PhytoMine, BioMart) | ✓ | ✓ | Fair | ( | |
| 23/02/2016 | |||||||||||||
| PLAZA | NA | ✓ (via ftp-directory-different formats-.csv, .tfa), | ✗ | ✓ (AnnoJ and Genome View) | ✓ (Multi-optional search) | ✓ | ✗ | ✓ (BLAST and various tools to explore/identify gene families, etc.) | ✓ | ✓ | Fair | ( | |
| 26/06/2015 | |||||||||||||
| ChromDB | MySQL, HTML, Mason, (Perl, HTML), UNIX | ✗ | ✗ | ✓ GBrowse | ✓ (Quick Search & Advanced Search) | ✓ | ✗ | ✓ (BLAST, GMOD, GBrowse) | ✓ | ✗ | Fair | ( | |
| PlantTFDB | MySQL | ✓ (FASTA, etc.) | ✗ | ✓ (Browse by Species, families) | ✓ (Quick Search & Advanced Search) | ✗ | ✓ | ✓ (BLAST) | ✓ | ✗ | Fair | ( | |
| 23/08/2013 | |||||||||||||
| PLANTS Database | NA | ✓ (uncompressed ASCII text) | ✗ | ✓ | ✓ (Name Search, State Search, Advanced Search) | ✓ | ✗ | ✓ (Crop Nutrient Tool, etc.) | ✓ | ✗ | Fair | ( | |
| 19/10/2015 | |||||||||||||
| Pest and pathogen databases | |||||||||||||
| SilkPathDB | NA | ✓ (FileServer- PNG, SVG, FASTA, GFF, | ✓ | ✓ (Browse GO) | ✓ (Keywords, sequence IDs, Locations) | ✓ | ✓ | ✓ (WEGO, BLAST, EuSecPred, ProSecPred) | ✓ | ✗ | Fair | – | |
| 08/07/2015 | |||||||||||||
| Combined databases | |||||||||||||
| Barcode database | |||||||||||||
| BOLD | PostgreSql, Java, C ++, PHP, Linux system | ✓ ( | ✓(trace files from ABI sequencers) | ✓ (Taxonomy Browser) | ✓ (Keyword search) | ✓ | ✓ | ✓ (Distribution Map Analysis, Taxon ID Tree, Distance Summary, sequence composition tools etc.) | ✓ | ✓ | Fair | ( | |
| Taxonomy/distribution related databases | |||||||||||||
| DBIF | NA | ✗ | ✗ | ✓ | ✓ 3 options -(‘Search for …’-invertebrates, host plant, source) | ✗ | ✗ | ✗ | ✓ | ✗ | Fair | No publication for the DB but related publications are available | |
| DBMW | NA | ✓ (from Aim & Scope Page) | ✓ | ✓ (catalogue) | ✓ (catalogue, biblio-graphy, images) | ✗ | ✓ | ✗ | ✓ | ✗ | Fair | No Publication | |
| 06/2013 | |||||||||||||
| CNIDB | NA | ✓ (pdf) | ✓ | ✓ | ✓ (Common Names of Insects and Related Organisms) | ✓ | ✗ | ✗ | ✗ | ✓ | Fair | No publication | |
| 02/03/2016 | |||||||||||||
| BAMONA | NA | ✓ (Images are available) | ✓ | ✓ (Browse All Species) | ✓ (Search for Species Profiles-Under Image gallery section) | ✓ | ✗ | ✓ (Identification Tools) | ✓ | ✓ | Fair | No publication | |
| 08/03/2016 | |||||||||||||
| EPPO Global DB | NA | ✓ (Documents-pdf, Image-jpg) | ✓ | ✓ | ✓ (Advanced Search) | ✓ | ✗ | ✓ (Fast text and Batch processing tools) | ✓ | ✓ | Fair | EPPO Global Database (2015) | |
| 01/2016 | |||||||||||||
| EOL | NA | ✓ | ✓ | ✓ | ✓ (EOL Search) | ✓ | ✗ | ✗ | ✓ | ✓ | Fair | ( | |
| 23/02/2016 | |||||||||||||
| ITIS | MySQL, PostgreSql | ✓ (ITIS Downloads-Informix 7, MS SQL Server, MySQL bulk load, MySQL by table, PostgreSql, SQLite).csv format | ✓ | ✓ | ✓ (Quick Search, advanced search) | ✓ | ✗ | ✓ (Compare Taxonomy/Nomenclature, The Taxonomic Workbench tool) | ✓ (Guidelines provided) | ✗ | Fair | No publication | |
| 01/03/2016 | |||||||||||||
| INPN | NA | ✓ | ✓ | ✓ | ✓ (Search data on program, species & habitat) | ✓ | ✗ | ✓ (Data & Tools option) | ✗ | ✓ | Fair | No publication | |
| 09/03/2016 | |||||||||||||
| Pheromone database | |||||||||||||
| Pherobase | NA | ✗ | ✓ | ✓ (Browse data in various categories) | ✓ (Search Bar) | ✓ | ✓ | ✓ (Kovats Calculator, Formula Generator) | ✓ | ✓ | Fair | ( | |
| Silk databases | |||||||||||||
| Biomat_dBase | PHP, HTML/CSS, Javascript, Red Hat Enterprise Linux 4 | – | – | – | – | – | – | – | – | – | Poor | ( | |
| SRDB | Oracle-MySQL-PHP associated with technologies like GIS, RS, Flex and C# | – | – | – | – | – | – | – | – | – | Poor | ( | No link available |
| SFSDB | SQL server 2000, Visual Basic.NET | – | – | – | – | – | – | – | – | – | Poor | ( | No link available |
| Other web resources | |||||||||||||
| miRNEST v2.0 | HTML, CSS, PHP 5.2.11 and MySQL 4.0.31 | ✓(.gz) | ✓ | ✓ | ✓ | ✓(version 2.0 is currently available) | ✓ | ✓ (HuntMi) | ✗ | ✗ | Fair | ( | Integrated database available at |
| miRBase | MySQL | ✓ (via ftp)File formats-FASTA sequences, GFF genome coordinates and MySQL database dumps | ✓ | ✓ (Browse miRBase by species) | ✓ (Search miRBase) | ✓ | ✓ | ✓ (BLASTN, SSEARCH) | ✓ | ✗ | Fair | ( | |
| 03/07/2014 | |||||||||||||
| MEROPS | MySQL, Distributed annotation system (DAS) server | ✓(After log in) | ✓ | ✓ | ✓ (Searches) | ✓ | ✓ | ✓ (BLAST) | ✓ (in the form of PPT) | ✓ | Fair | ( | |
| 06/07/2015 | |||||||||||||
Technologies for data generation in sericulture field
| Sl. No. | Sericultural research area | Technologies | References |
|---|---|---|---|
| 1 | Genomics | BAC-end sequencing filter, hybridization, fingerprinting, WGS, Transgenesis technology, comparative genomics, Linkage mapping, Sanger sequencing, Roche 454 Genome Sequencing, Pyrosequencing technology (454 GS-FLX), Combination of Illumina and 454, Illumina short-read sequencing, de novo seq, exome seq, targeted seq, Microarray based and genome wide association studies (GWAS) | ( |
| 2 | Proteomics | SDS-PAGE, Tandem MS (Tandem Mass Spectroscopy), MALDI MS, two-dimensional gel electrophoresis (2-DE), protein microarray | ( |
| 3 | Transcriptomics | High-throughput RNA sequencing technology (RNA-Seq), Comparative transcriptome analysesTotal RNA and mRNA sequencing, Targeted RNA Sequencing, Small RNA and Non-coding RNA Sequencing, Serial analysis of gene expression (SAGE) | ( |
| 4 | Metabolomics | MS –based system with GC (Gas chromatography) and LC (Liquid Chromatography) for initial separation, NMR analysis of crude extracts and its direct examination by MS | ( |
| 5 | Epigenomics | Methylation sequenicng, Illumina high-throughput bisulfite sequencing (MethylC-Seq), ChiP Sequencing, Ribosome Profiling, Pyrosequencing technology | ( |
| 6 | Metagenomics | Metagenomics: Amplicon seq (16S rRNA), Shotgun Sequencing | ( |
| 7 | Metatrancriptomics | Metatrancriptomics: Functional study of microbial populations Illumina RNA-Seq | ( |
| 8 | Genetic Manipulation | Genetic technologies- Transposon based or genome-editing technologies | ( |