| Literature DB >> 35453798 |
Lukas Malfertheiner1, Clara Martínez-Pérez1,2, Zihao Zhao1, Gerhard J Herndl1,3, Federico Baltar1.
Abstract
The bacterial SAR324 cluster is ubiquitous and abundant in the ocean, especially around hydrothermal vents and in the deep sea, where it can account for up to 30% of the whole bacterial community. According to a new taxonomy generated using multiple universal protein-coding genes (instead of the previously used 16S rRNA single gene marker), the former Deltaproteobacteria cluster SAR324 has been classified since 2018 as its own phylum. Yet, very little is known about its phylogeny and metabolic potential. We downloaded all publicly available SAR324 genomes (65) from all natural environments and reconstructed 18 new genomes using publicly available oceanic metagenomic data and unpublished data from the waters underneath the Ross Ice Shelf. We calculated a global SAR324 phylogenetic tree and identified six clusters (namely 1A, 1B, 2A, 2B, 2C and 2D) within this clade. Genome annotation and metatranscriptome read mapping showed that SAR324 clades possess a flexible array of genes suited for survival in various environments. Clades 2A and 2C are mostly present in the surface mesopelagic layers of global oceans, while clade 2D dominates in deeper regions. Our results show that SAR324 has a very versatile and broad metabolic potential, including many heterotrophic, but also autotrophic pathways. While one surface water associated clade (2A) seems to use proteorhodopsin to gain energy from solar radiation, some deep-sea genomes from clade 2D contain the complete Calvin-Benson-Bassham cycle gene repertoire to fix carbon. This, in addition to a variety of other genes and pathways for both oxic (e.g., dimethylsulfoniopropionate degradation) and anoxic (e.g., dissimilatory sulfate reduction, anaerobic benzoate degradation) conditions, can help explain the ubiquitous presence of SAR324 in aquatic habitats.Entities:
Keywords: comparative genomics; extreme environments; metabolism; metagenomics; microbial ecology; nutrient cycling; phylogeny
Year: 2022 PMID: 35453798 PMCID: PMC9031357 DOI: 10.3390/biology11040599
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Figure 1Relative sequence abundance of individual taxa of the microbial community in different oceanic basins based on annotated miTags. Each single bar on the x-axis represents a unique sample; they are ordered according to their depth layers from surface (left side) to bathypelagic (right side) and according to their geographic location within each depth layer.
Figure 2Phylogenetic maximum likelihood tree with de-replicated SAR324 genomes as well as genomes belonging to the phylum Myxococcota as an outgroup. Additionally, genomes are annotated with their origin based on the information available in public databases and their phylogeny based on the GTDB. More information regarding the sample location can be found in Supplementary Table S1. The tree is based on 120 bacterial marker genes and calculated with IQtree. We assigned names and colors to the different SAR24 clades based on their phylogeny and environment. The x-axis shows the phylogenetic distance scale.
Genome properties of all the genomes which passed our quality filtering and de-replication (see Material and Methods for details). All genomes are ordered according to their completeness values. * An asterisk next to a genome name indicates a bin containing more than one copy of the genome. # In the columns is an abbreviation for “Numbers of”. The origin of genomes which did not pass our quality filters is shown in Supplementary Table S1.
| ID | Environment | Completeness | Contamination | Genome Size (Mbp) | GC % | # Contigs | N50 Contigs (KBP) | Coding Density % |
|---|---|---|---|---|---|---|---|---|
| GCA_001469005 | Red Sea water column Station 34–depth 10 m | 89.09 | 0.0 | 3.49 | 47.1 | 312 | 16.27 | 90 |
| GCA_001627675 | Red Sea water column Station 34–depth 25 m | 72.17 | 2.94 | 2.54 | 47.46 | 440 | 6.54 | 91 |
| GCA_001627845 | Red Sea water column Station 192–depth 500 m | 89.54 | 0.0 | 3.17 | 42.89 | 90 | 58.32 | 89 |
| GCA_001781945 | Rifle well CD01 at 16ft depth; 0.1 μm filter at time point D, USA: Rifle, CO | 91.96 | 1.68 | 3.12 | 56.51 | 84 | 65.77 | 91 |
| GCA_001783695 | Rifle well CD01 at 16ft depth; 0.1 μm filter at time point D, USA: Rifle, CO | 91.83 | 1.68 | 3.14 | 49.59 | 149 | 37.4 | 91 |
| GCA_002082305 | hydrothermal plumes on the South Mid-Atlantic Ridge | 90.35 | 0.0 | 2.8 | 42.3 | 148 | 26.08 | 89 |
| GCA_002313905 | marine | 81.27 | 0.84 | 3.15 | 55.43 | 595 | 8.63 | 88 |
| GCA_002327995 | marine | 87.04 | 0.84 | 4.04 | 45.73 | 470 | 14.94 | 86 |
| GCA_002401295 | Atlantic Ocean: North Pond, oxic subseafloor aquifer | 69.22 | 2.95 | 3.08 | 41.71 | 182 | 23.04 | 87 |
| GCA_002683655 | Chile-Peru Current Coastal Province, 5–1000 m (TARA) | 80.16 | 2.33 | 3.15 | 46.19 | 109 | 42.34 | 89 |
| GCA_002685535 * | Chile-Peru Current Coastal Province, 5–1000 m (TARA) | 87.85 | 2.75 | 7.29 | 46.09 | 245 | 51.04 | 77 |
| GCA_002689755 | Mediterranean Sea, 5–1000 m (TARA) | 65.12 | 0.85 | 2.05 | 46.5 | 161 | 12.88 | 92 |
| GCA_002690525 | Mediterranean Sea, 5–1000 m (TARA) | 80.24 | 1.68 | 3.12 | 44.81 | 187 | 18.62 | 88 |
| GCA_002704555 | Mediterranean Sea, 5–1000 m (TARA) | 69.42 | 1.05 | 1.97 | 38.89 | 114 | 19.51 | 89 |
| GCA_002726945 | North Pacific Ocean, 5–1000 m (TARA) | 89.88 | 2.2 | 2.6 | 57.34 | 156 | 18.45 | 89 |
| GCA_002753255 * | Australia: Punkally Creek Sediment | 78.03 | 0.96 | 5.06 | 41.83 | 530 | 14.91 | 90 |
| GCA_003506525 | marine | 72.8 | 0.95 | 3.48 | 44.67 | 720 | 7.44 | 83 |
| GCA_003519185 | Neamphius huxleyi metagenome | 68.15 | 0.0 | 2.13 | 40.87 | 1465 | 2.25 | 73 |
| GCA_003541985 | marine | 67.36 | 0.84 | 2.48 | 44.62 | 592 | 5.66 | 83 |
| 14_54 | Red Sea brine pool | 87.34 | 0.17 | 3.45 | 43.43 | 374 | 12.52 | 87 |
| RIS_MetaBAT_11 | Ross Ice Shelf Antarctica | 79.08 | 0.89 | 2.47 | 57.09 | 297 | 10.42 | 92 |
| RIS_MetaBAT_3 | Ross Ice Shelf Antarctica | 81.27 | 1.01 | 1.93 | 40.12 | 285 | 8.09 | 90 |
| malaspina001 | Malaspina Deep Sea Samples | 91.15 | 2.69 | 2.61 | 42.46 | 284 | 20.73 | 89 |
| malaspina003 | Malaspina Deep Sea Samples | 91.26 | 0.63 | 2.77 | 42.98 | 222 | 14.14 | 90 |
| sample_7_b1 | 45° N, 178° E, Deep Sea | 65.19 | 1.58 | 1.8 | 40.91 | 1138 | 1.57 | 88 |
Figure 3Presence/absence of selected genes or traits. The number in parenthesis indicates how many genes were used to define the respective trait (for more information regarding which genes were used, see Supplementary Table S2). Red color indicated the gene/trait is not present (contains <33% of the genes encoding the respective enzyme/pathway), yellow indicated partially present (33–66%) and blue present (>66%). On the bottom, genomes are additionally annotated with their respective environment.
Figure 4Bar plots showing the percentage of metatranscriptomic reads that mapped to the respective SAR324 clades relative to the total number of reads present in each sample on the x-axis for Surface (A) and Mesopelagic (B). The largest number of SAR324 mapped reads in the surface layer belong to clades 2A and 2C, while 2D appears to be the dominant clade in the mesopelagic region.