| Literature DB >> 32071553 |
Jie Li1, Zhuo Chen1, Yadong Wang1.
Abstract
Many bacterial-related databases are developed to meet the researchers' needs of analysis and search for a number of bacterial information. However, these databases have different data resources, construction methods, data formats, and analysis tools. It's difficult for researchers to select appropriate databases and analysis tools to promote their researches. In the paper, we compared the contents, construction methods, data sources, update frequency, scope and scale of data, analysis tools, and features of nine famous bacterial databases: CARD, EffectiveDB, MBGD, MPD, PATRCI, PHI-base, VFDB, gcMeta and SILVA, and help researchers to better make better use of these databases. In addition, we also hope this review can help researchers develop a more comprehensive database and better tools to meet the needs of researchers. © The author(s).Entities:
Keywords: analysis tool; bacterial database; data resource; database construction method
Mesh:
Year: 2020 PMID: 32071553 PMCID: PMC7019132 DOI: 10.7150/ijbs.39289
Source DB: PubMed Journal: Int J Biol Sci ISSN: 1449-2288 Impact factor: 6.580
Figure 1CARD function diagram
Figure 2EffectiveDB function diagram
Figure 3MBGD function diagram
Figure 4MPD function diagram.
Figure 5PATRIC function diagram.
Figure 6PHI-base function diagram.
Figure 7VFDB function diagram.
Figure 8gcMeta function diagram.
Figure 9SILVA function diagram.
Figure 10Data source matrix of nine bacteria databases.
Figure 11Data construction method matrix of nine bacteria databases.
Database basic function comparison
| Online analyze tool | Help | Browse | Search | Download | ||
|---|---|---|---|---|---|---|
| PATRIC | √(19) | √ | √ | √ | √ | {5} |
| EffectiveDB | √(1) | √ | √ | √ | √ | {5} |
| MPD | √(1) | √ | √ | √ | √ | {5} |
| VFDB | √(2) | √ | √ | √ | √ | {5} |
| PHI-base | √(1) | √ | √ | √ | √ | {5} |
| CARD | √(2) | √ | √ | √ | {4} | |
| MBGD | √ | √ | √ | √ | {4} | |
| gcMeta | √ | √ | √ | √ | {4} | |
| SILVA | √(4) | √ | √ | √ | √ | {5} |
Summary of bacteria databases
| Database | Database description | Scope and scale | Microbiome data | Application/Tools available | Main data sources | Update frequency | Data construction methods |
|---|---|---|---|---|---|---|---|
| CARD | Bacterial antibiotic resistance-related gene, protein and mutation database | 2678 reference sequences, involving 82 pathogens | No | Web interface, prevalence, resistomes & variants, BLAST, RGI, and RGI software | NCBI | Monthly | Text mining and database integration |
| EffectiveDB | Bacterial secreted proteins and secreted systems database | 1699 bacterial genomes and their secreted proteins | No | Web interface, Prediction of secreted proteins and protein secretion systems, and EffectiveT3 software | NCBI, Simap, EggNOG | Quarterly | Generate by tools and database integration |
| MBGD | Microbial orthologous comparative genomic database | 6318 genomes including 5861 bacteria, 254 archaea and 203 eukaryotes | No | Web interface, DomCluster, DomRefine, CGAT, and CoreAligner | NCBI, KEGG, PROSITE, DDBJ | Semiannual | Generate by tools and database integration |
| MPD | Bacterial genome and metagenomic database | 41935 genomes of bacterial strains and 28950 metagenomic data from human and environmental samples | Yes | Web interface, and ANItools | NCBI, EMBL, Imicrobe, Meta-hit,MG-RAST, HMP, Ensembl | Yearly | Users upload and database integration |
| PATRIC | Comprehensive database of bacteria | 236968 bacteria genomes, 3512 archaea genomes, 4719 virus genomes, and 10 eukaryotic hosts genomes | Yes | Web interface, genome assembly, genome annotation, comprehensive genome analysis, BLAST, similar genome finder, variation analysis, Tn-Seq analysis, phylogenetic tree building, genome alignment, metagenomic read mapping, taxonomic classification, metagenomic binning, expression data import, RNA-Seq analysis, protein family sorter, proteome comparison, comparative pathway, model reconstruction, and ID mapper | NCBI, KEGG, CARD, SEED, ResFinder, ArrarExpress, IntAct, BIND, DIP, MINT, BioGRID, PDB | Monthly | User upload and database integration |
| PHI-base | Pathogen and host interaction database | 12467 pathogen-host interaction data with 266 pathogens and 199 hosts | No | Web interface, and PHI-BLAST | NCBI, Uploaded by users, EMBL, and web of science | Semiannual | Text mining, user upload, and databases integration |
| VFDB | Virulence factor database | 32 genus of bacteria and 575 virulence factors in the core data set | No | Web interface, BLAST, and VFanalyzer | NCBI | Irregular updates | Text mining, generate by tools, and database integration |
| gcMeta | a Global Catalogue of Metagenomics platform | 126602 samples, 153271 sequences, 146696 experiments and 77682 reports | Yes | Web interface, SRAtoolkit, ART, plRS, Bbtools, fastQC, cutadapt, Trimmomatic, fastp, dustmasker, DRISEE, Musket, SOAPec, LoRDEC, proovread, Quiver, FLASH, SOAPdenovo2, SPAdes, MetaVelvet, ALLPATH-LG, Meta-IDBA, MEGAHIT, RayMeta, CANU, CAP3, SSPACE, OPERA, QUAST, REAPR, CheckM, BUSCO, cufflinks, StringTie, Cuffdiff, Sailfish, Kallisto, DESeq2, Ballgown, Trinity, Oases, SOAPdenovo-Trans, PILER-CR, minced, tRNAscan SE, RNAmmer, Prodigal, Glimmer, GeneMark, FragGeneScan, XSTREAM, RepeatMasker, PRISM, LUMPY, Prokka, DFAST, InterProScan, PfamScan, QIIME(1,2), LEfSe, PICRUSt, MetaCV, k-SLAM, Kaiju, Centrifuge, DUDes, mOTU, StrainEst, Mash, sourmash, MetaPhlAn2, HUMAnN2, CONCOCT, MaxBin2, MetaBAT2, AbundanceBin, VirFinder, VirHostMatcher, orthoANI, CD-hit, MUMmer, BWA, Bowtie2, samtools, BLAST, BLAT, diamond, STAR, Tophat2, hisat2, blasr, R_plot_heatmap, R_plot_point, R_plot_PcoA, R_plot_barplot_fromtable | MG-RAST, EBI metagenomics and HMP and ongoing research projects such as CAS-CMI | Monthly | User upload, and databases integration |
| SILVA | High quality ribosomal RNA database | 9470435 SSU Parc, 1312673 LSU Parc, 4945070 SSU Ref, 357845 LSU Ref, 659046 SSU Ref NR 99 and 123524 LSU Ref NR 99. | No | SILVAngs, SILVA Alignment, Classification and Tree (ACT) Service, SILVA Tree Viewer and ARB. | EMBL-EBI/ENA | Yearly | Generate by tools, and database integration |