| Literature DB >> 30273414 |
Karen G Lloyd1, Andrew D Steen2, Joshua Ladau3, Junqi Yin4, Lonnie Crosby4.
Abstract
To describe a microbe's physiology, including its metabolism, environmental roles, and growth characteristics, it must be grown in a laboratory culture. Unfortunately, many phylogenetically novel groups have never been cultured, so their physiologies have only been inferred from genomics and environmental characteristics. Although the diversity, or number of different taxonomic groups, of uncultured clades has been studied well, their global abundances, or numbers of cells in any given environment, have not been assessed. We quantified the degree of similarity of 16S rRNA gene sequences from diverse environments in publicly available metagenome and metatranscriptome databases, which we show have far less of the culture bias present in primer-amplified 16S rRNA gene surveys, to those of their nearest cultured relatives. Whether normalized to scaffold read depths or not, the highest abundances of metagenomic 16S rRNA gene sequences belong to phylogenetically novel uncultured groups in seawater, freshwater, terrestrial subsurface, soil, hypersaline environments, marine sediment, hot springs, hydrothermal vents, nonhuman hosts, snow, and bioreactors (22% to 87% uncultured genera to classes and 0% to 64% uncultured phyla). The exceptions were human and human-associated environments, which were dominated by cultured genera (45% to 97%). We estimate that uncultured genera and phyla could comprise 7.3 × 1029 (81%) and 2.2 × 1029 (25%) of microbial cells, respectively. Uncultured phyla were overrepresented in metatranscriptomes relative to metagenomes (46% to 84% of sequences in a given environment), suggesting that they are viable. Therefore, uncultured microbes, often from deeply phylogenetically divergent groups, dominate nonhuman environments on Earth, and their undiscovered physiologies may matter for Earth systems. IMPORTANCE In the past few decades, it has become apparent that most of the microbial diversity on Earth has never been characterized in laboratory cultures. We show that these unknown microbes, sometimes called "microbial dark matter," are numerically dominant in all major environments on Earth, with the exception of the human body, where most of the microbes have been cultured. We also estimate that about one-quarter of the population of microbial cells on Earth belong to phyla with no cultured relatives, suggesting that these never-before-studied organisms may be important for ecosystem functions. Author Video: An author video summary of this article is available.Entities:
Keywords: environmental microbiology; phylogeny; uncultured microbes
Year: 2018 PMID: 30273414 PMCID: PMC6156271 DOI: 10.1128/mSystems.00055-18
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1Fractions of 16S rRNA genes from bacteria (top panel) and archaea (bottom panel) in public databases from primer-amplified metagenomes (with and without read depths) and metatranscriptomes at different percent identities with their closest cultured relative. Vertical dashed lines represent estimated cutoff levels for different taxonomic levels of novelty relative to all cultures (indicated at the top of the panel) (9). Primer-amplified bacterial sequences showed 30% to up to 100% similarity to their closest cultured relative but were removed for clarity.
FIG 2Proportion of 16S rRNA gene sequences in each category of phylogenetic novelty relative to cultures for each environment, by amplicons, metagenomes (without scaffold read depth), and metatranscriptomes. Closed circles represent primer-amplified amplicons, open circles represent metagenomes, and crosses represent transcriptomes. Total numbers of sequences and studies are listed in Table S2.
FIG 3Proportion of 16S rRNA gene sequences by scaffold read depth averaged across all metagenomes. Each single data point represents the abundance of reads in that similarity bin from a single metagenome. Rows represent different similarity bins.
Metagenome-based estimates of global microbial cell abundances from uncultured archaea and bacteria, based on 16S rRNA gene sequence read depths
| Environment | No. (%) of microbial cells × 1026 | |||
|---|---|---|---|---|
| Total | Cultured | Uncultured | Uncultured | |
| Marine sediment ( | 2,900 | 390 (13) | 1,921 (66) | 590 (20) |
| Soil ( | 2,560 | 454 (18) | 1,268 (50) | 839 (33) |
| Terrestrial subsurface ( | 2,500 | 702 (28) | 1,211 (48) | 587 (23) |
| Seawater ( | 1,010 | 143 (14) | 640 (63) | 229 (23) |
| Freshwater ( | 1.3 | 0.1 (11) | 0.8 (64) | 0.3 (25) |
| Plant hosts ( | 1 | 0.5 (49) | 0.4 (37) | 0.1 (14) |
| Animal hosts ( | 0.2 | 0.1 (49) | 0.1 (37) | 0.0 (14) |
| Total | 8,974 | 1,689 (19) | 5,050 (56) | 2,245 (25) |
Environments with fewer microbial cells were excluded.
Cutoff values represent the upper 95% confidence interval of the median 16S rRNA gene identity for each taxonomic level (12).
Metatranscriptome-based estimates of global microbial cell abundances from uncultured archaea and bacteria, based on 16S rRNA gene sequence numbers
| Environment (reference) | No. (%) of microbial cells × 1026 | |||
|---|---|---|---|---|
| Total | Cultured | Uncultured | Uncultured | |
| Marine sediment ( | NA | NA | NA | NA |
| Soil ( | 2,560 | 49 (2) | 758 (30) | 1,753 (69) |
| Terrestrial subsurface ( | 2,500 | 45 (2) | 597 (24) | 1,858 (74) |
| Seawater ( | 1,010 | 36 (4) | 389 (38) | 587 (58) |
| Freshwater ( | 1.3 | 0.0 (3) | 0.5 (40) | 0.7 (56) |
| Plant hosts ( | 1 | 0.2 (18) | 0.3 (33) | 0.5 (49) |
| Animal hosts ( | 0.2 | 0.0 (18) | 0.1 (33) | 0.1 (49) |
| Total | 6,074 | 129 (2) | 1,744 (29) | 4,200 (69) |
Environments with fewer microbial cells were excluded.
Cutoff values represent the upper 95% confidence interval of the median 16S rRNA gene identity for each taxonomic level (12). NA, not applicable (too few metatranscriptome data are available from the indicated environment to be included).
FIG 4Rank abundance plots by taxonomic genus assignments for metagenomic data (top three rows) and metatranscriptomic data (bottom two rows). Listed in each box are the top 10 most abundant genera for that environment in the format of phylum_lowest identified taxonomic group, with asterisks (*) denoting archaea. Data are colored for uncultured phyla (teal), uncultured class to genus (pink) or cultured phyla (tan). Taxonomic-based genera that had sequences from multiple phylogeny-based percent identity bins were labeled with the color of the bin with the most sequences.