| Literature DB >> 34718743 |
Lorenz Christian Reimer1, Joaquim Sardà Carbasse1, Julia Koblitz1, Christian Ebeling1, Adam Podstawka1, Jörg Overmann1.
Abstract
The bacterial metadatabase BacDive (https://bacdive.dsmz.de) has developed into a leading database for standardized prokaryotic data on strain level. With its current release (07/2021) the database offers information for 82 892 bacterial and archaeal strains covering taxonomy, morphology, cultivation, metabolism, origin, and sequence information within 1048 data fields. By integrating high-quality data from additional culture collections as well as detailed information from species descriptions, the amount of data provided has increased by 30% over the past three years. A newly developed query builder tool in the advanced search now allows complex database queries. Thereby bacterial strains can be systematically searched based on combinations of their attributes, e.g. growth and metabolic features for biotechnological applications or to identify gaps in the present knowledge about bacteria. A new interactive dashboard provides a statistic overview over the most important data fields. Additional new features are improved genomic sequence data, integrated NCBI TaxIDs and links to BacMedia, the new sister database on cultivation media. To improve the findability and interpretation of data through search engines, data in BacDive are annotated with bioschemas.org terms.Entities:
Mesh:
Year: 2022 PMID: 34718743 PMCID: PMC8728306 DOI: 10.1093/nar/gkab961
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Updated features in the strain detail view. (A) Infobox comprising main strain identifier, including BacDive ID, culture collection numbers and NCBI taxonomy ID. (B) Short summary of the most important information on the strain. (C) Species navigation bar, to switch between strains that belong to the same species. (D, E) Gallery and detail view for multimedia content, e.g. electron microscopy images. (F) Metabolic pathway information integrated from EnzymeDetector, displayed in a list sorted descending by the coverage of detected enzymes. (G) Table with genomic sequence data retrieved from NCBI GenBank, PATRIC and JGI.
Figure 2.Selected statistics from the dashboard (https://bacdive.dsmz.de/dashboard) providing an overview over BacDive data within 49 graphs. (A) Eight key values covering total strains and species, cell size, optimum cultivation temperature, pH, incubation time, optimum salt condition, and count of 16S sequence accession numbers. (B) Cultivation data showing distribution of growth temperatures for 39 701 strains. (C) Morphology data showing distribution of cell length for 5217 strains. (D) Metabolism data showing a ranking of 706 utilized metabolites ordered descending by strain count. (E) Location data showing the number of strains that were isolated in each country.