| Literature DB >> 28776041 |
Albert Barberán1, Hildamarie Caceres Velazquez2, Stuart Jones2, Noah Fierer3,4.
Abstract
Cultivation in the laboratory is essential for understanding the phenotypic characteristics and environmental preferences of bacteria. However, basic phenotypic information is not readily accessible. Here, we compiled phenotypic and environmental tolerance information for >5,000 bacterial strains described in the International Journal of Systematic and Evolutionary Microbiology (IJSEM) with all information made publicly available in an updatable database. Although the data span 23 different bacterial phyla, most entries described aerobic, mesophilic, neutrophilic strains from Proteobacteria (mainly Alpha- and Gammaproteobacteria), Actinobacteria, Firmicutes, and Bacteroidetes isolated from soils, marine habitats, and plants. Most of the routinely measured traits tended to show a significant phylogenetic signal, although this signal was weak for environmental preferences. We demonstrated how this database could be used to link genomic attributes to differences in pH and salinity optima. We found that adaptations to high salinity or high-pH conditions are related to cell surface transporter genes, along with previously uncharacterized genes that might play a role in regulating environmental tolerances. Together, this work highlights the utility of this database for associating bacterial taxonomy, phylogeny, or specific genes to measured phenotypic traits and emphasizes the need for more comprehensive and consistent measurements of traits across a broader diversity of bacteria. IMPORTANCE Cultivation in the laboratory is key for understanding the phenotypic characteristics, growth requirements, metabolism, and environmental preferences of bacteria. However, oftentimes, phenotypic information is not easily accessible. Here, we compiled phenotypic and environmental tolerance information for >5,000 bacterial strains described in the International Journal of Systematic and Evolutionary Microbiology (IJSEM). We demonstrate how this database can be used to link bacterial taxonomy, phylogeny, or specific genes to measured phenotypic traits and environmental preferences. The phenotypic database can be freely accessed (https://doi.org/10.6084/m9.figshare.4272392), and we have included instructions for researchers interested in adding new entries or curating existing ones.Entities:
Keywords: pH; phenotypes; phylogeny; salinity; traits
Year: 2017 PMID: 28776041 PMCID: PMC5541158 DOI: 10.1128/mSphere.00237-17
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
Information compiled from the International Journal of Systematic and Evolutionary Microbiology (IJSEM) publications
| Category | Components |
|---|---|
| Ancillary data | Yr of publication, article digital object identifier (doi), taxonomic nomenclature, culture collection code |
| Morphology/phenotype | Gram stain status, cell length, cell width, cell shape, cell aggregation, motility, spore and pigment formation |
| Metabolism | General metabolism, sole carbon substrate use, BIOLOG information available |
| Environmental preferences | Habitat of isolation; oxygen requirement; range and optimum for pH, temp, and salt |
| Sequence data | GC content, 16S rRNA accession no., genome accession no. |
FIG 1 Taxonomic distribution (A) and habitat distribution (B) of the >4,000 bacterial strains present in the phenotype database. The inset in panel A shows the strain representation of the major proteobacterial subgroups in the database. Note that in panel B the habitat is the environment from which each strain was originally isolated (if reported) and may not accurately reflect where those strains may be most abundant.
FIG 2 Distribution of selected traits across the >4,000 strains in the most recent version of the database, including cell shape (A), spore formation (B), motility (C), oxygen requirements (D), temperature optimum (E), and pH optimum (F). The number of strains with information for a particular trait is indicated in parentheses.
Phylogenetic signal of bacterial traits
| Trait | Type | Phylogenetic signal |
|---|---|---|
| Spore | Categorical | |
| Pigment | Categorical | |
| Shape (rod) | Categorical | |
| Shape (coccus) | Categorical | |
| Aggregation (chain) | Categorical | |
| Gram stain | Categorical | |
| Flagella | Categorical | |
| Aerobe | Categorical | |
| Anaerobe | Categorical | |
| Temp preference | Continuous | |
| pH preference | Continuous | 0.006 |
| Salinity preference | Continuous | 0.023 |
−D + 1 for categorical, Blomberg’s K for continuous.
Values in bold are significant (P < 0.05).
FIG 3 Phylogenetic signal of selected traits: presence of pigment (A), spore formation (B), pH optima (C), and temperature optima (D). For categorical variables (A and B), the red columns indicate presence. For continuous variables (C and D), the red columns indicate the reported value.
Putative genomic markers associated with pH and salinity optima
| KO ID | Optimum | Description | Sign of coefficient | TCDB |
|---|---|---|---|---|
| Both | K+-transporting ATPase ATPase A chain | − | Yes | |
| Both | K+-transporting ATPase ATPase B chain | − | Yes | |
| Both | K+-transporting ATPase ATPase C chain | − | Yes | |
| Both | Alanine or glycine:cation symporter, AGCS family | + | Yes | |
| Both | Trk system potassium uptake protein | + | Yes | |
| Both | Cation:H+ antiporter | + | Yes | |
| Both | Putative membrane protein | + | No | |
| pH | Membrane fusion protein, multidrug efflux system | − | Yes | |
| pH | MFS transporter, DHA2 family, multidrug resistance protein | − | Yes | |
| pH | Kumamolisin | − | No | |
| pH | Membrane fusion protein, multidrug efflux system | − | Yes | |
| pH | Squalene-hopene/tetraprenyl-beta-curcumene cyclase | − | Yes | |
| pH | Molybdate/tungstate transport system substrate-binding protein | − | Yes | |
| pH | Molybdate/tungstate transport system permease protein | − | Yes | |
| pH | Cation/acetate symporter | + | Yes | |
| pH | Choline/glycine/proline betaine transport protein | + | Yes | |
| pH | Putative glutathione | + | No | |
| pH | + | No | ||
| pH | + | No | ||
| pH | Uncharacterized protein | + | No | |
| pH | Magnesium transporter | + | Yes | |
| pH | Multicomponent Na+:H+ antiporter subunit A | + | Yes | |
| pH | Multicomponent Na+:H+ antiporter subunit C | + | Yes | |
| pH | Multicomponent Na+:H+ antiporter subunit D | + | Yes | |
| pH | Multicomponent Na+:H+ antiporter subunit E | + | Yes | |
| pH | Multicomponent Na+:H+ antiporter subunit F | + | Yes | |
| pH | Multicomponent Na+:H+ antiporter subunit G | + | Yes | |
| pH | Solute carrier family 34 (sodium-dependent phosphate cotransporter) | + | Yes | |
| pH | Solute carrier family 13 (sodium-dependent dicarboxylate transporter), member 2/3/5 | + | Yes | |
| pH | Betaine/carnitine transporter, BCCT family | + | Yes | |
| pH | Neurotransmitter:Na+ symporter, NSS family | + | Yes | |
| pH | Voltage-gated sodium channel | + | Yes | |
| pH | Putative acetyltransferase | + | No | |
| Salinity | Membrane-associated protein | − | Yes | |
| Salinity | MFS transporter, fosmidomycin resistance protein | − | Yes | |
| Salinity | Two-component system, OmpR family, sensor histidine kinase KdpD | − | No | |
| Salinity | KUP system potassium uptake protein | − | Yes | |
| Salinity | Putative hemolysin | − | No | |
| Salinity | Cytochrome | + | No | |
| Salinity | UPF0271 protein | + | No |
KO ID, entry in KEGG ortholog (KO) database.
TCDB indicates whether the enriched KO was included in the Transporter Classification Database.
Abbreviations: AGCS, alanine or glycine cation symporter; MFS, major facilitator superfamily; BCCT, betaine carnitine choline transporter; NSS, neurotransmitter sodium symporter; KUP, K uptake permease.