| Literature DB >> 35880883 |
Xiao Chu1,2, Xiaojun Wang1,3, Lok Shan Cheung4, Xiaoyuan Feng1,3, Put Ang5, Shing Yip Lee1, Sean A Crowe4,6,7, Haiwei Luo1,2,3.
Abstract
Globally dominant marine bacterioplankton lineages are often limited in metabolic versatility, owing to their extensive genome reductions, and thus cannot take advantage of transient nutrient patches. It is therefore perplexing how the nutrient-poor bulk seawater sustains the pelagic streamlined lineages, each containing numerous populations. Here, we sequenced the genomes of 33 isolates of the recently discovered CHUG lineage (~2.6 Mbp), which have some of the smallest genomes in the globally abundant Roseobacter group (commonly over 4 Mbp). These genome-reduced bacteria were isolated from a transient habitat: seawater surrounding the brown alga, Sargassum hemiphyllum. Population genomic analyses showed that: (i) these isolates, despite sharing identical 16S rRNA genes, were differentiated into several genetically isolated populations through successive speciation events; (ii) only the first speciation event led to the genetic separation of both core and accessory genomes; and (iii) populations resulting from this event are differentiated at many loci involved in carbon utilization and oxygen respiration, corroborated by BiOLOG phenotype microarray assays and oxygen uptake kinetics experiments, respectively. These differentiated traits match well with the dynamic nature of the macroalgal seawater, in which the quantity and quality of carbon sources and the concentration of oxygen likely vary spatially and temporally, though other habitats, like fresh organic aggregates, cannot be ruled out. Our study implies that transient habitats in the overall nutrient-poor ocean can shape the microdiversity and population structure of genome-reduced bacterioplankton lineages. IMPORTANCE Prokaryotic species, defined with operational thresholds, such as 95% of the whole-genome average nucleotide identity (ANI) or 98.7% similarity of the 16S rRNA gene sequences, commonly contain extensive fine-grained diversity in both the core genome and the accessory genome. However, the ways in which this genomic microdiversity and its associated phenotypic microdiversity are organized and structured is poorly understood, which disconnects microbial diversity and ecosystem functioning. Population genomic approaches that allow this question to be addressed are commonly applied to cultured species because linkages between different loci are necessary but are missing from metagenome-assembled genomes. In the past, these approaches were only applied to easily cultivable bacteria and archaea, which, nevertheless, are often not representative of natural communities. Here, we focus on the recently discovered cluster, CHUG, which are representative in marine bacterioplankton communities and possess some of the smallest genomes in the globally dominant marine Roseobacter group. Despite being over 95% ANI and identical in the 16S rRNA gene, the 33 CHUG genomes we analyzed have undergone multiple speciation events, with the first split event predominantly structuring the genomic diversity. The observed pattern of genomic microdiversity correlates with CHUG members' differential utilization of carbon sources and differential ability to explore low-oxygen niches. The available data are consistent with the idea that brown algae may be home to CHUG, though other habitats, such as fresh organic aggregates, are also possible.Entities:
Keywords: CHUG; Roseobacter; Sargassum; microdiversity; population genomics; streamlined genomes
Mesh:
Substances:
Year: 2022 PMID: 35880883 PMCID: PMC9426536 DOI: 10.1128/mbio.00571-22
Source DB: PubMed Journal: mBio Impact factor: 7.786
FIG 1Population differentiation of the 33 CHUG isolates from a single 50 mL seawater sample surrounding a brown alga. (A) Genetically isolated populations from M1 to M5 as well as the two subpopulations, M5S1 and M5S2, within M5 that were defined by PopCOGenT are mapped to the maximum likelihood phylogenomic tree constructed by IQ-TREE, with the root position determined by the MAD algorithm. Solid circles in the phylogeny indicate nodes with bootstrap values of 100% in the 1,000 bootstrapped replicates. Three split events discussed in the main paper are marked with arrows. (B to D) The distribution of the FST values across 1,788 core genes between the lineages, resulting from each of the three split events. Gene families showing FST > 0.25 with statistical significance (P < 0.05) are marked in red, and others are marked in gray. For the dot plots (top), genes are ordered along the closed genome of HKCCA1288. For the histogram (bottom), the count represents the number of gene families. (E) The dendrogram is constructed based on gene presence and absence using the complete linkage method implemented in the R package, “pheatmap”.
FIG 2The pangenome of the 33 CHUG isolates. All of the orthologous gene families identified by Roary are positioned according to the closed genome, HKCCA1288. From the inner to the outer circle: (1 to 33) the genomes of the 33 CHUG isolates, arranged in line with their phylogenomic tree, which is shown in the top-left corner. Genomic islands, prophages, insertion sequences, and pseudogenes are marked with different colors and are mapped to each genome; (34) M1M2-specific genes, M3M4M5-specific genes, and core genes showing unusually large d values are marked with different colors and are placed according to the coordinates of the HKCCA1288 genome; (35) M3M4-specific genes and M5-specific genes; (36) M5S1-specific genes and M5S2-specific genes. Gene families falling in the functional categories, “energy production and conversion”, “amino acid transport and metabolism”, and “carbohydrate transport and metabolism”, are each attached with a gene name and framed in a box with a background color corresponding to a functional category. Each box is connected with a line that is colored according to whether the gene family is part of the population-specific accessory genes or the d outlier core genes.
Organic compounds that support genetic or phenotypic differentiation between M1M2 and M3M4M5
| Compounds | M1M2-specific | M3M4M5-specific | Different Roary families with same COG annotation | BiOLOG assay | Macroalgae-derived | |
|---|---|---|---|---|---|---|
| Hydroxypyruvate | Yes | - | ||||
| N-acetylglucosamine | Yes | No growth | ||||
| D-fructose | Yes | Differential growth | Brown, red, and green ( | |||
| Myo-inositol | Yes | Yes | No growth | Brown, red, and green ( | ||
| Glycerol-3-phosphate | Yes | Yes | No growth | Red ( | ||
| D-arabinose | Yes | Equal growth | ||||
| Maltose | Yes | No growth | Green ( | |||
| D-glycerate | Yes | - | ||||
| Tartronate semialdehyde | Yes | - | ||||
| β-hydroxybutyric acid | Yes | Differential growth | Brown ( | |||
| D-xylose | Yes | Equal growth | Brown, red, and green ( | |||
| D-galacturonate | Yes | Equal growth | ||||
| D-glucuronate | Yes | No growth except one replicate of the M2 representative | Brown ( | |||
| Methyl D-Lactate | Yes | No growth except one replicate of the M2 representative | ||||
| Pyruvate | Yes | Equal growth | ||||
| D-gluconate | Yes | No growth | ||||
| Ketogluconate | Yes | Equal growth | ||||
| 2-keto-3-deoxy-L-fuconate | Yes | - | ||||
| L-fucose | Yes | Differential growth | Brown ( | |||
| Acetic acid | Yes | Differential growth | Red ( | |||
| Glycolic acid | Differential growth | Brown ( | ||||
| Sodium formate | Differential growth | |||||
| Mono-methyl succinate | Differential growth | |||||
| D-arabitol | Differential growth | |||||
| D-fucose | Differential growth | |||||
| γ-hydroxybutyric Acid | Differential growth |
Compounds potentially made by macroalgae are marked in the last column.
The “M1M2-specific” and “M3M4M5-specific” gene families are exclusively present in M1M2 and M3M4M5, respectively.
The “d outlier” gene families are shared by all members of M1M2 and M3M4M5 but were subjected to novel allelic replacement.
Gene families linked to “different Roary gene families with the same COG annotation” each consist of two families identified by Roary but are annotated with the same function. The gene locus, COG id, gene names, and functions of the involved gene families can be found in Data Set S1j.
The differential growth is between M1 and the remaining populations. In other cases, differential growth was observed between M1M2 and M3M4M5.
FIG 3(A) The heat map of the respiration values of a few representative strains of the populations defined by PopCOGenT at 96 h, measured by BiOLOG. The clustering is constructed using all 190 carbon sources provided by the two microplates, PM01 and PM02, but only those 85 substrates that showed a respiration value over 10 OmniLog units in at least one strain are displayed. Substrates differentially utilized by M1M2 and M3M4M5 are framed with green boxes. (B) The respiration curves of the representative strains fed with the framed substrates in (A).
FIG 4Oxygen uptake measurements. (A) Oxygen consumption of two CHUG isolates (HKCCA1288 and HKCCA1065) from M1M2, two CHUG isolates (HKCCA1006 and HKCCA1086) from M3M4M5, and Ruegeria pomeroyi DSS-3 under microaerobiosis. (B) The relationship between oxygen concentration and oxygen uptake rates at an oxygen level of <2 μmol/L. The solid line is a model fitted based on the Michaelis-Menten equation, and the nearby dotted line shows the standard deviation estimated by this equation. The error bars of the filled circles represent the standard deviations of three replicates. (C) The Hanes-Woolf plot of oxygen uptake as a function of oxygen concentration, ranging from 100 nmol/L to 1 μmol/L. The error bars of the filled circles represent the standard deviations of three replicates. (D) The estimated specific affinity of oxygen for the four CHUG isolates and R. pomeroyi DSS-3.