Literature DB >> 25670777

Genomes of planktonic Acidimicrobiales: widening horizons for marine Actinobacteria by metagenomics.

Carolina Megumi Mizuno1, Francisco Rodriguez-Valera2, Rohit Ghai1.   

Abstract

UNLABELLED: The genomes of four novel marine Actinobacteria have been assembled from large metagenomic data sets derived from the Mediterranean deep chlorophyll maximum (DCM). These are the first marine representatives belonging to the order Acidimicrobiales and only the second group of planktonic marine Actinobacteria to be described. Their streamlined genomes and photoheterotrophic lifestyle suggest that they are planktonic, free-living microbes. A novel rhodopsin clade, acidirhodopsins, related to freshwater actinorhodopsins, was found in these organisms. Their genomes suggest a capacity to assimilate C2 compounds, some using the glyoxylate bypass and others with the ethylmalonyl-coenzyme A (CoA) pathway. They are also able to derive energy from dimethylsulfopropionate (DMSP), sulfonate, and carbon monoxide oxidation, all commonly available in the marine habitat. These organisms appear to be prevalent in the deep photic zone at or around the DCM. The presence of sister clades to the marine Acidimicrobiales in freshwater aquatic habitats provides a new example of marine-freshwater transitions with potential evolutionary insights. IMPORTANCE: Despite several studies showing the importance and abundance of planktonic Actinobacteria in the marine habitat, a representative genome was only recently described. In order to expand the genomic repertoire of marine Actinobacteria, we describe here the first Acidimicrobidae genomes of marine origin and provide insights about their ecology. They display metabolic versatility in the acquisition of carbon and appear capable of utilizing diverse sources of energy. One of the genomes harbors a new kind of rhodopsin related to the actinorhodopsin clade of freshwater origin that is widespread in the oceans. Our data also support their preference to inhabit the deep chlorophyll maximum and the deep photic zone. This work contributes to the perception of marine actinobacterial groups as important players in the marine environment with distinct and important contributions to nutrient cycling in the oceans.
Copyright © 2015 Mizuno et al.

Entities:  

Mesh:

Year:  2015        PMID: 25670777      PMCID: PMC4337565          DOI: 10.1128/mBio.02083-14

Source DB:  PubMed          Journal:  mBio            Impact factor:   7.867


INTRODUCTION

The bottleneck of pure culture has revealed only a partial glimpse of the actinobacterial world (1, 2). In recent years, a vast uncultured world of genuinely planktonic Actinobacteria has been discovered. These newly discovered microbes are in complete contrast to their better-studied soil counterparts in their size and genomic features. With the advent of 16S rRNA cloning, it became possible to interrogate microbial community composition without the bias of culture, and one of the first such surveys in the marine habitat (3) identified nearly identical sequences from high-GC gram-positive organisms from the Pacific and Atlantic oceans. Phylogenetic analyses of these sequences also suggested that they represented a deep-branching clade, quite different from known organisms. It was not until 1997 that full-length 16S rRNA sequences became available for these microbes (OM1 from Cape Hatteras, North Atlantic coast, and SAR432 from the Sargasso Sea) and enabled a robust taxonomic placement, reinforcing their novelty (4). A subsequent follow-up study, with additional sequences (including Actinobacteria of terrestrial origin), established that microbes harboring these sequences were sufficiently different to warrant a new subclass within the phylum Actinobacteria (5). This group was referred to as the marine actinobacterial clade (MAC). Interestingly, in that work, the authors stated that the GC content of the 16S rRNA gene for these marine sequences was slightly lower (51.9 to 53.1%) than the typical values of other Actinobacteria (55 to 64%), providing the first indication of the lower GC content that characterizes planktonic Actinobacteria (2, 6). More information about the MAC was later revealed by studies on samples from the Bermuda Atlantic Time-series (BATS) location in the Sargasso Sea. In 2005, Morris and collaborators used terminal restriction fragment length polymorphism (T-RFLP), among other techniques, to examine spatial and temporal patterns across a decade of sampling (7). This study showed for the first time that, like SAR11, marine Actinobacteria (represented at this time by the MAC) increased in abundance following convective overturn at a depth of 200 m, suggesting postmixing blooming. Another recent study has also provided substantial evidence of the association of the MAC with the deep chlorophyll maximum (DCM) (8), the zone of maximum phytoplankton concentration in marine stratified water columns. Recently, by sequencing and assembly of metagenomic fosmids, the nearly complete genome of members of the MAC group has been obtained (6). The reconstructed genome was remarkably small (estimated to be <1 Mb) and had very low genomic GC content, only 32%. Both values are the lowest for any Actinobacteria described until then. The estimated cell volume was only 0.006 to 0.024 µm3, making them smaller even than “Candidatus Pelagibacter ubique” and the smallest free-living cells described until then. A new taxon (subclass “Candidatus Actinomarinidae,” order “Candidatus Actinomarinales”) was proposed for the organisms belonging to this clade, and the organism itself was referred to as “Candidatus Actinomarina minuta.” All previously recovered sequences, including BDA1-5, OM1, SAR432, and D92-32, described as belonging to the MAC are comprised within “Ca. Actinomarinidae.” Two incomplete single-cell amplified genomes (SAGs) that are closely related to “Ca. Actinomarina minuta” (SAG D07 and SAG M09) have also been published recently (9), adding further representatives of this new subclass. The existence of another different marine actinobacterial group was also detected by 16S rRNA analysis of clone libraries and metagenomic data sets (10) that indicated the presence of members of the Acidimicrobiales, an order with very few representative isolates (11). Subsequently, in an extensive study analyzing seasonal and vertical changes in the BATS (12), sequences related to the acidimicrobial microbe “Ca. Microthrix parvicella” were found. These “marine Microthrix” were restricted to the DCM in particular, blooming occasionally during summer stratification. Their population dynamics appeared similar to that of the SAR11 1b ecotype. Therefore, at this point, two taxonomically distinct planktonic actinobacterial groups are known to be present in the marine habitat. Both appear to be associated with the zone of maximal photosynthetic production, but genomic information is only available for one of them, the “Ca. Actinomarinales.” The deep chlorophyll maximum is the site of maximal phytoplanktonic density in the stratified marine water column. It is located permanently at the depth of ~100 m in tropical oceans and at a variable depth (~50 to 100 m) during summer in temperate latitudes. As such, it is one of the largest habitats on earth. In the present work, using assembly from deep metagenomic sequencing from the DCM of the Mediterranean, we describe four nearly complete genomes of marine Actinobacteria belonging to the order Acidimicrobiales. These reconstructed genomes are essentially composites, each likely originating from several coexisting clonal lineages, and provide insights into the lifestyle of this diverse and important group of microbes, complementing the genomic information obtained before about the “Ca. Actinomarinales.”

RESULTS AND DISCUSSION

Identification of Acidimicrobiales contigs and genome reconstruction.

A broad overview of the phylogenetic composition of the unassembled data set was obtained by retrieving 16S rRNA fragments (ca. 41,000) from the raw reads. A comparison of the community structure determined this way with those of two other DCM data sets (one of them a 454 run [13]) from the same site is shown in Fig. S1 in the supplemental material. Specifically, the contributions of Actinobacteria in these samples appear to be rather similar (1.6 to 3%). Since our target for this work was the marine Acidimicrobiales from the DCM, we specifically selected long contigs (>10 kb) in which the majority of genes gave best hits to organisms of the subclass Acidimicrobidae. This allows a simple but stringent selection method toward identifying the target group. A total of 160 contigs with a combined length of 6.6 Mb (the longest contig was 431 kb) were thus identified as putative Acidimicrobiales genomic fragments. Using GC content, coverage, and principal component analysis of tetranucleotide frequencies (see Materials and Methods), 151 contigs could be further classified into four groups. Each of these groups likely comes from cells that belong to a single species, although, as is always the case with metagenomic assemblies, the clonal purity of the fragments cannot be guaranteed. For the sake of simplicity, we will use the term “genome” to refer to these composites and we will refer to them as the MedAcidi group. Some summary statistics about these genomes are provided in Table 1. Three genomes had GC contents of between 40 and 45% and were predicted to have maximum sizes in the range of 1.7 to 2 Mb, while the fourth had slightly higher values (50% GC content and a 2.33-Mb genome). With one exception (MedAcidi-G2A), the genomes were predicted to be nearly 90% complete.
TABLE 1

Summary statistics of the reconstructed genomes

GenomeNo. of contigsGC%Total length (Mb)Completeness (%)Estimated size (Mb)
MedAcidi-G14337–441.6884–911.85–2
MedAcidi-G2A4841–461.3771–791.73–1.93
MedAcidi-G2B2844–461.4488–951.52–1.64
MedAcidi-G33250–522.190–962.19–2.33
Summary statistics of the reconstructed genomes Three of the genomes had contigs that contained 16S rRNA. Figure 1a shows the reconstructed phylogeny of the 16S rRNA of our genomes together with several reference genomes. They were more related to each other and to other uncultured microbes than to any of the cultured and sequenced representatives of this subclass (e.g., members of “Ca. Microthrix” and Ilumatobacter). Related 16S rRNA sequences came from diverse marine environments around the world, including the Pacific and the Atlantic oceans, suggesting a widespread distribution. We also used a concatenate of 106 proteins that are conserved between these and 150 actinobacterial genomes for a more robust phylogenetic affiliation and, also, to be able to place MedAcidi-G2A, which did not have a 16S rRNA in the contigs. The resulting tree confirms the Acidimicrobiales affiliation of these genomes (Fig. 1b). In both trees, the nearest neighbors were Ilumatobacter coccineum (a beach isolate) and the recently described genome acAcidi, from a freshwater reservoir (14). Their similarity with the MedAcidi genomes was rather low (average nucleotide identity, ~60%), suggesting that these belong to different genera. Therefore, our reconstructed genomes provide the first genomic information about genuine marine and planktonic Acidimicrobiales.
FIG 1 

Phylogeny of the MedAcidi group of marine Actinobacteria. (a) Maximum-likelihood phylogeny of 16S rRNA of the assembled MedAcidi genomes and related sequences from clone libraries. Sequences from the order Actinomycetales are used to root the tree. The 16S rRNAs from the MedAcidi groups are highlighted by colored boxes. (b) Maximum-likelihood phylogeny using a concatenation of 106 conserved proteins: the assembled genomes of the MedAcidi group are highlighted by colored boxes. Bootstrap values (%) are indicated at each node. Reference organisms shown in both trees are connected by dashed lines.

Phylogeny of the MedAcidi group of marine Actinobacteria. (a) Maximum-likelihood phylogeny of 16S rRNA of the assembled MedAcidi genomes and related sequences from clone libraries. Sequences from the order Actinomycetales are used to root the tree. The 16S rRNAs from the MedAcidi groups are highlighted by colored boxes. (b) Maximum-likelihood phylogeny using a concatenation of 106 conserved proteins: the assembled genomes of the MedAcidi group are highlighted by colored boxes. Bootstrap values (%) are indicated at each node. Reference organisms shown in both trees are connected by dashed lines.

Lifestyle insights.

The sample from which these microbes were assembled (75-m depth in offshore Mediterranean waters) already indicates that they come from an oligotrophic habitat (15). However, even in oligotrophic open ocean waters, some microbes have opportunistic copiotrophic lifestyles, taking advantage of microheterogeneity (particulate organic matter, for example) or sporadic nutrient inputs (such as upwelling or deep mixing). In the case of the MedAcidi genomic fragments described here, several lines of evidence support a real planktonic and oligotrophic lifestyle. Perhaps the most prominent feature of oligotrophic marine planktonic free-living microbes is that they possess very streamlined genomes (9, 16). In addition to the reduction in the number of genes, they also display a characteristic reduction in the length of intergenic spacers. Along these lines, planktonic Actinobacteria, including the MedAcidi genomes, display strikingly short intergenic spacers (see Fig. S2 in the supplemental material), in the range of 3 bp (“Ca. Actinomarina minuta”) to 32 bp for the freshwater actinobacterium acMicro-1 (14). The smallest median intergenic spacer sizes in these new genomes were found to be in the range of 14 to 23 bp (MedAcidi-G2B and MedAcidi-G3, respectively). Another indication of streamlined genomes is a reduction in regulatory sigma factors. For example, the streamlined genome of “Ca. Pelagibacter ubique” encodes only two sigma factors (17). Only two sigma factors were found in the genomes of MedAcidi-G1 and MedAcidi-G2B, four in MedAcidi-G2A, and a maximum of five in MedAcidi-G3. Only one sigma factor (sigma-70) was found in the “Ca. Actinomarina minuta” genome. However, this is not yet definite, as the genome is estimated to be ~70% complete. No sigma factors were found in the SAG genomes of “Ca. Actinomarinales” either. In contrast, the model pathogenic actinobacterium Mycobacterium tuberculosis H37Rv encodes 13, implying a much more complex response to the environment. Another feature of truly planktonic microbes is a lack of any apparatus for motility, and indeed, no flagellar genes were found in any of these new Acidimicrobiales genomes, suggesting that they are nonmotile.

Acidirhodopsins: a new rhodopsin clade shared between marine and freshwater environments.

Photoheterotrophy is a common feature of both planktonic marine and freshwater Actinobacteria (6, 14, 18, 19). Of the four genomes described here, one (MedAcidi-G1) was found to harbor a rhodopsin-coding gene, which showed the highest sequence similarity (82%) to a recently described rhodopsin from an Acidimicrobiales bacterium (uncultured actinobacterium acAcidi) from freshwater (14). Both sequences appear to be related to actinorhodopsins (Fig. 2). Searches against the global ocean sampling (GOS) database using the rhodopsin of MedAcidi-G1 revealed several hits to sequences from diverse samples originating from the Sargasso Sea, Indian Ocean, and Galapagos Islands. Moreover, these sequences reciprocally gave the best BLAST hits to the freshwater sequence, as did the rhodopsin of MedAcidi-G1. In cases where adjacent genes were found on the GOS reads, they also gave the best hits to the same freshwater Acidimicrobiales bacterium. Although actinorhodopsins are known to be widespread in freshwater Actinobacteria, they have not been identified in the marine environment (19, 20). Moreover, the recently described “Ca. Actinomarina minuta” harbors rhodopsin sequences that form a distinct clade (MACrhodopsins) in the rhodopsin phylogeny (6). Though sequence similarity suggested a close relationship to freshwater actinorhodopsins, phylogenetic analysis indicates that these new sequences form a sister clade to known actinorhodopsins, xanthorhodopsins, and eukaryal rhodopsins. The new cluster is sufficiently different from other known branches that we propose a new name, “acidirhodopsins,” for this branch of the rhodopsin tree (Fig. 2). Analysis of the critical amino acids of these sequences suggests they absorb in the green region of the spectrum (20). This is only the second example of a rhodopsin that has close representatives in marine and freshwater habitats, the other being the similar rhodopsins of marine alphaproteobacterial SAR11 and the freshwater LD12 clades. The other three MedAcidi genomes did not contain rhodopsins or any gene that suggested the presence of one, although this might be due to the incompleteness of the genomes. All four genomes encoded photolyases and UvrABC systems for correcting UV light-induced DNA damage, which are typical for microbes inhabiting the photic zone. This, together with the rhodopsin found in MedAcidi-G1 and the sample from which they were obtained, suggests that they inhabit the photic zone (see below).
FIG 2 

Rhodopsin phylogeny. A maximum-likelihood tree of all known types of rhodopsins is shown. Sequences belonging to the new clade of acidirhodopsins are highlighted. Sequences originating from marine and freshwater habitats are indicated by colored squares. Bootstrap values (%) are shown on the nodes.

Rhodopsin phylogeny. A maximum-likelihood tree of all known types of rhodopsins is shown. Sequences belonging to the new clade of acidirhodopsins are highlighted. Sequences originating from marine and freshwater habitats are indicated by colored squares. Bootstrap values (%) are shown on the nodes.

Carbon and energy metabolism.

The genomes provided solid evidence for the MedAcidi being heterotrophs (or in the case of MedAcidi-G1, a photoheterotroph). All of the genomes encoded transporters for organic matter, e.g., glycerol, sugars, amino acids, and peptides (Fig. 3 and Fig. 4; see Fig. S3 and S4 in the supplemental material). Most of the steps for both glycolysis and the TCA cycle were found in all of the genomes, suggesting that these pathways are essentially complete (see Data Set S1 in the supplemental material). Apart from using these typical carbon and energy sources, all genomes appeared to be able to oxidize even- and odd-carbon number fatty acids as well (to acetyl-CoA and propionyl-CoA, respectively). In the absence of complex carbon sources like glucose, the glyoxylate bypass allows the utilization of C2 carbon units from acetate (21) and the acetyl-CoA generated by the oxidation of fatty acids. Both of the critical enzymes of the glyoxylate pathway (isocitrate lyase and malate synthase) were found in MedAcidi-G1, MedAcidi-G2A, and MedAcidi-G2B, showing that they can use C2 compounds. However, the MedAcidi-G3 genome did not encode the genes for the glyoxylate cycle. The recently described ethylmalonyl pathway has an equivalent role in C2 assimilation (22). Nearly all of the enzymes of this pathway were found in MedAcidi-G3 (including the marker enzyme crotonyl-CoA carboxylase/reductase) (Fig. S5). A common C2 compound in the marine environment is glycolate, a highly labile compound that is a byproduct of photorespiration (23). All of the MedAcidi genomes contained glycolate oxidase genes, suggesting the ability to use this compound.
FIG 3

Metabolic overview of the marine actinobacterium MedAcidi-G1. Major pathways are indicated in boxes. DMSP, dimethylsulfopropionate; MMPA, methylmercaptopropionate; THF, tetrahydrofolate; DHAP, dihydroxyacetone phosphate.

FIG 4

Metabolic overview of the marine actinobacterium MedAcidi-G3. Major pathways are indicated in boxes. Some reactions that have not been found in this genome but are found in the other MedAcidi genomes are shown in grey. DMSP, dimethylsulfopropionate; MMPA, methylmercaptopropionate; THF, tetrahydrofolate; ggt, gamma-glutamyl transpeptidase; DHAP, dihydroxyacetone phosphate.

Metabolic overview of the marine actinobacterium MedAcidi-G1. Major pathways are indicated in boxes. DMSP, dimethylsulfopropionate; MMPA, methylmercaptopropionate; THF, tetrahydrofolate; DHAP, dihydroxyacetone phosphate. Metabolic overview of the marine actinobacterium MedAcidi-G3. Major pathways are indicated in boxes. Some reactions that have not been found in this genome but are found in the other MedAcidi genomes are shown in grey. DMSP, dimethylsulfopropionate; MMPA, methylmercaptopropionate; THF, tetrahydrofolate; ggt, gamma-glutamyl transpeptidase; DHAP, dihydroxyacetone phosphate. The complete tetrahydrofolate (THF)-linked pathway was detected in the MedAcidi-G1 genome, and parts of it were found in the others, suggesting that these microbes might be capable of producing energy using dimethylsulfopropionate (DMSP). The first key enzyme, dimethylsulfoniopropionate-dependent demethylase (DmdA), which removes a methyl group from DMSP and yields methylmercaptopropionate (MMPA), was found in all MedAcidi genomes (24, 25). DMSP is a commonly occurring osmolyte in the marine habitat, particularly in marine algae (26, 27). Microbial degradation of DMSP can lead to the release of dimethylsulfide (DMS), a highly volatile compound that is released to the atmosphere in large amounts (28). It has been suggested that nearly a third of the surface ocean microbes are capable of performing this transformation (24), and several can incorporate the DMSP-derived sulfur into amino acids (methionine) (29). It has been shown that “Ca. Pelagibacter ubique” is able to oxidize MMPA-derived methyl groups via this pathway to produce energy (30). Another common energy source for marine microbes is carbon monoxide. Carbon monoxide dehydrogenase (CODH) genes were found in all of the MedAcidi genomes. Carbon monoxide produced by photolysis of dissolved organic matter (31, 32) is readily available in the marine habitat. Moreover, evidence from genomic and metagenomic analyses is also increasingly revealing the widespread distribution of aerobic carbon monoxide dehydrogenases in the seas (33–36). Two types of these heterotrimeric enzymes are recognized, with the largest subunit (CoxL) possessing characteristic catalytic sites. Form I CODH enzymes, originally described in Oligotropha carboxydovorans, are fast CO oxidizers, while form II enzymes (e.g., those found in Bradyrhizobium, Mesorhizobium, and Sinorhizobium species) are several times slower (10 to 1,000 times). A total of 19 candidate CoxL genes were found in the MedAcidi genomes, although only 8 of them had the catalytic-site pattern of the form II CODH. However, all the genomes encoded at least one potentially active form II enzyme. No form I CODH catalytic site pattern was identified, making them slow CO oxidizers. A CODH has also been found in acAcidi. However, whether or not all form II CODH genes actually are responsible for CO oxidation has not been conclusively demonstrated. For example, in the case of Roseobacter species, even though form II CODH genes were readily identifiable, CO oxidation was not observed (37). The MedAcidi-G1 genome encoded a transporter for alkanesulfonates and, also, an alkanesulfonate monooxygenase (also found in MedAcidi-G2B). The alkanesulfonate monooxygenase is used to split the C-S bond in the sulfonates and release formaldehyde and sulfite. Sulfonates are organosulfur compounds that are widely distributed in nature (38–41), and several bacteria are known to use sulfonates as a source of sulfur (42). One widespread form of organosulfonates is methanesulfonate, produced from the chemical oxidation of biogenic DMS in the atmosphere (39). While the sulfite may undergo spontaneous or enzymatic oxidation to sulfate, some organisms, e.g., methylotrophs, can utilize the formaldehyde as a carbon and energy source via the serine pathway, but this pathway was not found in the MedAcidi genomes. In any case, intracellular formaldehyde must be detoxified. In Actinobacteria, formaldehyde can spontaneously combine with the Actinobacteria-specific thiol mycothiol to form S-hydroxymethylmycothiol, which is the substrate for the mycothiol-dependent formaldehyde dehydrogenase. The gene coding for this enzyme was found in MedAcidi-G1 and MedAcidi-G2B. The use of mycothiol (a pseudodisaccharide) as a cofactor in place of the universal tripeptide thiol glutathione is characteristic of this particular enzyme (43). This enzyme converts S-hydroxymethylmycothiol to S-formylmycothiol, and in a second step, a thiol esterase yields formate and the original mycothiol (44). Additional oxidation of formate by formate dehydrogenases yields CO2 and reducing equivalents in the form of NADH. Formate dehydrogenase can also be linked to the THF-linked oxidation pathway (see above), and genes coding for this enzyme were found in all four genomes.

Nitrogen metabolism.

We found ammonium transporters in all of the genomes except MedAcidi-G2B (its absence in this genome could again reflect incompleteness). Additionally, transporters for glutamine and glutamate/aspartate were also found in MedAcidi-G1 and MedAcidi-G3. All genomes encoded more general amino acid, dipeptide, and oligopeptide transporters (Fig. 3 and 4; see Fig. S3 and S4 in the supplemental material). We also found genes coding for the hydrolysis of nitriles, which can be a potential source of both carbon and nitrogen, in three of these genomes. Nitriles are naturally occurring compounds produced by plants and are also released into the environment by human industrial activity, e.g., petrochemical refineries, mining, etc. (45). Nitrile-hydrolyzing bacteria have been sought after not only because of their ability to degrade these environmentally hazardous cyanide-containing compounds but also because of the successful commercial application of nitrile-hydrolyzing enzymes as biocatalysts for the commercial production of a variety of organic compounds, e.g., acrylamides, carboxylic acids, etc. (46). Actually, Actinobacteria with such metabolic capacities have been reported from terrestrial and deep sea sediments (47). Though there are several described pathways for nitrile degradation (45), we found two in these actinobacterial genomes, both of which yield carboxylate and ammonium as end products, which can be used as carbon and nitrogen sources, respectively. In the first pathway, found in MedAcidi-G1 and MedAcidi-G2B, a single-step hydrolysis using nitrilases yields these final compounds directly. In the second, found in MedAcidi-G3, a two-step hydrolysis is predicted, where a nitrile hydratase first transforms nitriles to amides, which are then processed by amidases to yield the same end products.

Distribution of marine actinobacterial groups.

Fragment recruitment analysis of genomes provides the most accurate estimate of the presence and abundance of the corresponding microbes in different habitats. However, to perform these analyses, large sequence data sets are required and, as yet, very few are available, particularly of depth profiles in the ocean. We have three metagenomes (samples taken in 2007 at a depth of 50 m, in 2012 at 75 m, and in 2013 at 55 m), all from the depth of the DCM at the same site from which the genomes were assembled. The recruitment results are shown in Fig. 5 and indicate that all four microbes are more abundant in the sample of origin (2012), as should be expected since long contigs are easier to assemble in a sample in which they are very prevalent. However, their presence in the other high-coverage data set (2013, 55 m) attests to their permanence in this habitat. They were barely detectable in the smaller 454 data set from 2007. Moreover, these genomes also recruited very little from the BATS (48) and the Hawaii Ocean Time-series (HOTS) (49) data sets (Fig. 5). In comparison, “Ca. Actinomarina minuta,” represented by 43 assembled fosmids (6) and two SAGs (9), recruited consistently across these data sets. When similar analyses was performed using the Global Ocean Sampling (GOS) data set (50), similar results were found, where the genomes of “Ca. Actinomarinales” recruited much more than the marine Acidimicrobiales (see Fig. S6 in the supplemental material). However, recruitments across the depth profile of HOTS (Fig. S7) indicate that the marine Acidimicrobiales are represented much more at the deep chlorophyll maximum (DCM) than at the surface.
FIG 5

Metagenomic fragment recruitment. (a) Relative abundances of genomes of marine Acidimicrobiales and “Ca. Actinomarinales” across two data sets from the Mediterranean deep chlorophyll maximum, Hawaii Ocean Time-series (HOTS), Bermuda Atlantic Time-series (BATS), and the Red Sea. Data are expressed as RPKG (reads recruited per Kb of genome per Gb of metagenome). (b) Recruitment plots of two representative Acidimicrobiales genomes (MedAcidi-G1 and MedAcidi-G3) and one “Ca. Actinomarinales” genome (“Ca. Actinomarina minuta”) compared to two metagenomes (MedDCM-2012 and MedDCM-2013) are shown. The dashed horizontal line indicates 95% nucleotide sequence identity. (c) Relative abundances of 16S rRNA sequences of marine Acidimicrobiales and “Ca. Actinomarinales” across the same data sets. All data sets that are from a deep chlorophyll maximum are marked with a green circle. Data are expressed as hits/Gb of metagenome.

Metagenomic fragment recruitment. (a) Relative abundances of genomes of marine Acidimicrobiales and “Ca. Actinomarinales” across two data sets from the Mediterranean deep chlorophyll maximum, Hawaii Ocean Time-series (HOTS), Bermuda Atlantic Time-series (BATS), and the Red Sea. Data are expressed as RPKG (reads recruited per Kb of genome per Gb of metagenome). (b) Recruitment plots of two representative Acidimicrobiales genomes (MedAcidi-G1 and MedAcidi-G3) and one “Ca. Actinomarinales” genome (“Ca. Actinomarina minuta”) compared to two metagenomes (MedDCM-2012 and MedDCM-2013) are shown. The dashed horizontal line indicates 95% nucleotide sequence identity. (c) Relative abundances of 16S rRNA sequences of marine Acidimicrobiales and “Ca. Actinomarinales” across the same data sets. All data sets that are from a deep chlorophyll maximum are marked with a green circle. Data are expressed as hits/Gb of metagenome. Given the scarcity of fragments recruited by the genomes, we have also used several 16S rRNA sequences from each group to assess their relative levels of abundance across the marine habitat (see Materials and Methods) (Fig. 5c; see Fig. S8 in the supplemental material). This method allows a much better assessment of their presence in low-coverage data sets, although it is much less reliable. The results again show a wide variation among different data sets but indicate that, not only are the marine Acidimicrobiales detectable at DCM depths but they may also be found at greater depths (e.g., HOTS 500 m and 4,000 m). Low but nearly equal abundances were found for both groups in additional metagenomic data sets from the deep Sea of Marmara at 1,000 m (51) and the Puerto Rico Trench at 4,000 m (52). Results from the GOS data set (Fig. S8) and from the Red Sea (53) also indicate, similar to the results from genome recruitments, that “Ca. Actinomarinales” representatives are relatively more abundant than Acidimicrobiales. On the other hand, marine Acidimicrobiales 16S rRNA sequences were detectable in the polar latitudes, both in the Antarctic (54) and the Arctic (55), while “Ca. Actinomarinales” appeared to be restricted to temperate and tropical latitudes (Fig. S8), suggesting that at least some members of this new group might be psychrotolerant. The partial MedAcidi genomes described here provide the first glimpses into the physiology and lifestyle of the marine acidimicrobia, hitherto known only from cloned 16S rRNA sequences and denominated the “marine Microthrix” group (12). They belong to representatives of the second major phylogenetic group of planktonic marine Actinobacteria to be described at this level, after a similar description became available for the “Ca. Actinomarinales” (6), formerly known as the “marine actinobacterial clade” (MAC). Both groups have streamlined genomes and are heterotrophs, sometimes supplemented by phototrophic energy derived from rhodopsins. They seem to have free-living bona fide planktonic cells (as opposed to particle associated) and are well adapted to the oligotrophic marine waters, using organic matter that is predicted to be common in this habitat. Genome recruitment indicated that these microbes appear to be more abundant in the deeper photic zone than at the surface and are probably more prevalent around DCM depths (50 to 120 m, also favored by the “Ca. Actinomarinales”). These depths are very productive in the stratified ocean and are free from the damaging UV light of shallower depths. Recent developments in the field of freshwater Actinobacteria have revealed the presence of related acidimicrobial groups (14). This situation is reminiscent of those of the marine “Ca. Pelagibacterales” and the freshwater LD12 groups. In both cases, the microbes are distantly related and also share phylogenetically related rhodopsins, suggesting that the acquisition of photoheterotrophy predates the marine-to-fresh water transition. The “Ca. Actinomarinales” do not seem to have sister clades in freshwater.

MATERIALS AND METHODS

Sampling, sequencing, assembly, and annotation.

Sampling was done on 20 July 2012 from the DCM depth (75 m) at a distance of ~20 nautical miles off the coast of Alicante, Spain (38°4′6.64″N, 0°13′55.18″E). The deep chlorophyll maximum was determined from a water column fluorescence profile by using a Seabird SBE 19 multiprobe profiler. Two hundred liters of seawater was filtered through a series of 20-µm, 5-µm, and 0.22-µm polycarbonate filters (Millipore). DNA from the 0.22-µm fraction was extracted as described previously (56). Sequencing was performed using the Illumina HiSeq 2000 PE (Macrogen, South Korea). Two libraries of different insert sizes, 300 bp and 3 kb, were created. In total, 11.9 Gb (135 million reads) and 5.3 Gb (62 million reads) of data were obtained for the two libraries, respectively. This data set is referred to as the MedDCM-2012 data set in this work. Both data sets (197 million reads) were assembled together using the IDBA assembler (57). The second metagenome used here, referred to as MedDCM-2013, was sampled from the same site on 6 September 2013 and processed in identical fashion. Only the unassembled reads from this data set are used in this work. Protein-coding genes in the assembled contigs were predicted using Prodigal in metagenomic mode (58), and tRNAs were predicted using tRNAscan-SE (59). Ribosomal rRNA genes were identified using ssu-align (60) and meta_rna (61). Additional annotation of genes was done by comparing against the NCBI NR, COG (62), and TIGRfam (63) databases. In addition, genomes were also annotated using the RAST server (64). Additional local BLAST searches against the latest NCBI-NR database were performed whenever necessary.

16S rRNA classification.

A nonredundant version of the RDP database (65) was prepared by clustering the ca. 2.3 million 16S rRNA sequences into ~800,000 sequences at the 90% identity level using UCLUST (66). Comparisons against this representative 16S rRNA database were used to identify candidate 16S rRNA sequences in the complete Illumina data sets. A sequence matching this database at an e value of <1e−5 was considered a potential 16S sequence. Candidate 16S rRNA sequences were further examined using ssu-align, which uses hidden Markov models (HMMs) to separate these into archaeal, bacterial, and eukaryal 16S/18S rRNA or non-16S rRNA sequences (60). These bona fide sequences were finally compared to the complete RDP database and classified into a high-level taxon if the sequence identity was ≥80% and the alignment length was ≥90 bp. For 454 sequences, the alignment length was relaxed to 100 bp. Sequences failing these thresholds were discarded. We only used contigs of >10 kb for all analyses. A contig was considered to originate from the order Acidimicrobiales if the majority of genes gave best blast hits to this order. Further clusters were made using taxonomy, principal component analysis of tetranucleotide frequencies, GC content, and coverage in both the metagenomes as described previously (67–69). Tetranucleotide frequencies were computed using the wordfreq program in the EMBOSS package (70). Principal component analysis was performed using the FactoMineR package in R (71).

Genomic phylogenetic trees and genome size estimation.

A set of 97 conserved proteins found in a set of 155 complete actinobacterial genomes were used to estimate genome completeness for these genomes. For whole-genome phylogeny, a smaller set of representative actinobacterial genomes was chosen in order to maximize the number of common proteins, though all available genomes from the order Acidimicrobiales were included. Using the COG database (62), 106 conserved proteins were found in these five genomes and the other reference genomes. COG assignments were made using an e value of <1e−5, >80% query coverage, and >30% identity. These proteins were concatenated and used to create the whole-genome phylogeny of these new microbes. The alignment was performed using Kalign (72) and trimmed using trimAL (73). A maximum-likelihood tree was constructed with FastTree2 (74), using a JTT+CAT model, a gamma approximation, and 100 bootstrap replicates.

Single gene trees.

All 16S sequences were trimmed using ssu-align (60), and multiple sequence alignments were created using MUSCLE (75). 16S rRNA phylogenetic trees were constructed using FastTree2 (74) with a GTR+CAT model, a gamma approximation, and 100 bootstrap replicates. The sequences of rhodopsins were also aligned using MUSCLE, and a maximum-likelihood tree was constructed with FastTree2 (74), using a JTT model, a gamma approximation, and 100 bootstrap replicates.

Metagenomic recruitment.

Recruitments were performed using BLASTN (76), and a hit was considered only when it was at least 50 nucleotides long, with an identity of >95% and with an e value of ≤1e−5. These hits were used to compute the RPKG (reads recruited per kilobase of genome per gigabase of metagenome) values, which provide a normalized value that is comparable across various metagenomes. For 16S rRNA abundance comparison of the two groups, 13 sequences from the marine Acidimicrobiales group and 7 from “Ca. Actinomarinales” were compared to the 16S rRNA sequences in the metagenomic data sets mentioned here. Hits were assigned to each group if they had more than 95% nucleotide sequence identity (species-level cutoffs) and alignment lengths of >100 bp. The number of unique hits for each group was normalized by dividing by the data set size.

Data accession numbers.

The metagenomic data have been submitted to NCBI SRA and are accessible under the BioProject identifier PRJNA257723. The assembled genome sequences have been deposited to DDBJ/EMBL/GenBank and can be accessed using the accession numbers JUEM00000000, JUEN00000000, JUEO00000000, and JUEP00000000. Excel file containing important pathways and transporters found in the MedAcidi genomes. Download Data Set S1, XLSX file, 0.02 MB 16S rRNA classification of the Mediterranean deep chlorophyll maximum data sets. y axis indicates percentage of reads attributed to each taxon. Color key is at top right. Download Figure S1, PDF file, 1.6 MB (a) Genome size versus GC content. (b) Median intergenic spacer size versus GC content. 1386 prokaryotic genomes are shown here. The marine actinobacterial genomes assembled in this work and another marine actinobacterial genome (“Ca. Actinomarina minuta”) are labeled in white boxes. Genomes of freshwater Actinobacteria are labeled. All other genomes are shown in grey. Download Figure S2, PDF file, 1.8 MB Metabolic overview of the marine Actinobacteria member MedAcidi-G2A. Reactions that have not been found but are likely to be present are shown in grey. DMSP, dimethylsulfopropionate; MMPA, methylmercaptopropionate; THF, tetrahydrofolate. Download Figure S3, PDF file, 1.5 MB Metabolic overview of the marine Actinobacteria member MedAcidi-G2B. Reactions/enzymes that have not been found but are likely to be present are shown in grey. DMSP, dimethylsulfopropionate; MMPA, methylmercaptopropionate; THF, tetrahydrofolate. Download Figure S4, PDF file, 1.6 MB The ethylmalonyl pathway in the genome of MedAcidi-G3. Enzymes that were found in the genome are shown in blue boxes, and those not found are colored in grey. Download Figure S5, PDF file, 1.4 MB Metagenomic fragment recruitment across the Global Ocean Sampling (GOS) data set and a transect across the Southern Ocean. The two groups of marine Actinobacteria, the “Ca. Actinomarinales” (3 representatives), and the marine Acidimicrobiales (4 representatives) are shown. y axis indicates RPKG (reads recruited per kb per Gb of metagenome). Color key is at top right. Download Figure S6, PDF file, 1.4 MB Metagenomic fragment recruitment along the depth profile of the Hawaii Ocean Time-series data set. The two groups of marine Actinobacteria, the “Ca. Actinomarinales” (3 representatives), and the marine Acidimicrobiales (4 representatives) are shown. y axis indicates depth (in meters), and x axis indicates RPKG (reads recruited per kb per Gb of metagenome). M. parvicella and Ilumatobacter sp. strain Y16-304 are non-marine representatives of Acidimicrobidae and are shown for reference. Prochlorococcus marinus NATL1A is a genuine deep chlorophyll maximum inhabitant and is shown as a reference for “Ca. Actinomarinales” genomes. Color key is at top right. Download Figure S7, PDF file, 1.4 MB Relative abundances of 16S rRNA sequences belonging to marine Acidimicrobiales and “Ca. Actinomarinales” in metagenomic data sets. (a) Global Ocean Sampling data set. Open ocean data sets and coastal data sets are shown separately. (b) Southern Ocean (Antarctic) transect, an Arctic metagenome, and two metagenomes from the deep ocean: Marmara (1,000 m) and Puerto Rico Trench (4,000 m). Color keys are at top right. Download Figure S8, PDF file, 2.7 MB
  62 in total

Review 1.  A guide to the natural history of freshwater lake bacteria.

Authors:  Ryan J Newton; Stuart E Jones; Alexander Eiler; Katherine D McMahon; Stefan Bertilsson
Journal:  Microbiol Mol Biol Rev       Date:  2011-03       Impact factor: 11.056

2.  Role for urea in nitrification by polar marine Archaea.

Authors:  Laura Alonso-Sáez; Alison S Waller; Daniel R Mende; Kevin Bakker; Hanna Farnelid; Patricia L Yager; Connie Lovejoy; Jean-Éric Tremblay; Marianne Potvin; Friederike Heinrich; Marta Estrada; Lasse Riemann; Peer Bork; Carlos Pedrós-Alió; Stefan Bertilsson
Journal:  Proc Natl Acad Sci U S A       Date:  2012-10-01       Impact factor: 11.205

Review 3.  Implications of streamlining theory for microbial ecology.

Authors:  Stephen J Giovannoni; J Cameron Thrash; Ben Temperton
Journal:  ISME J       Date:  2014-04-17       Impact factor: 10.302

4.  Nitrile hydrolysing activities of deep-sea and terrestrial mycolate actinomycetes.

Authors:  Pedro F B Brandão; Alan T Bull
Journal:  Antonie Van Leeuwenhoek       Date:  2003       Impact factor: 2.271

5.  Metabolic potential of a single cell belonging to one of the most abundant lineages in freshwater bacterioplankton.

Authors:  Sarahi L Garcia; Katherine D McMahon; Manuel Martinez-Garcia; Abhishek Srivastava; Alexander Sczyrba; Ramunas Stepanauskas; Hans-Peter Grossart; Tanja Woyke; Falk Warnecke
Journal:  ISME J       Date:  2012-07-19       Impact factor: 10.302

6.  One carbon metabolism in SAR11 pelagic marine bacteria.

Authors:  Jing Sun; Laura Steindler; J Cameron Thrash; Kimberly H Halsey; Daniel P Smith; Amy E Carter; Zachary C Landry; Stephen J Giovannoni
Journal:  PLoS One       Date:  2011-08-23       Impact factor: 3.240

7.  trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.

Authors:  Salvador Capella-Gutiérrez; José M Silla-Martínez; Toni Gabaldón
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

8.  Identification of ribosomal RNA genes in metagenomic fragments.

Authors:  Ying Huang; Paul Gilna; Weizhong Li
Journal:  Bioinformatics       Date:  2009-04-03       Impact factor: 6.937

9.  The RAST Server: rapid annotations using subsystems technology.

Authors:  Ramy K Aziz; Daniela Bartels; Aaron A Best; Matthew DeJongh; Terrence Disz; Robert A Edwards; Kevin Formsma; Svetlana Gerdes; Elizabeth M Glass; Michael Kubal; Folker Meyer; Gary J Olsen; Robert Olson; Andrei L Osterman; Ross A Overbeek; Leslie K McNeil; Daniel Paarmann; Tobias Paczian; Bruce Parrello; Gordon D Pusch; Claudia Reich; Rick Stevens; Olga Vassieva; Veronika Vonstein; Andreas Wilke; Olga Zagnitko
Journal:  BMC Genomics       Date:  2008-02-08       Impact factor: 3.969

10.  Metagenomics of the deep Mediterranean, a warm bathypelagic habitat.

Authors:  Ana-Belen Martín-Cuadrado; Purificación López-García; Juan-Carlos Alba; David Moreira; Luis Monticelli; Axel Strittmatter; Gerhard Gottschalk; Francisco Rodríguez-Valera
Journal:  PLoS One       Date:  2007-09-19       Impact factor: 3.240

View more
  26 in total

1.  Genome Reconstruction from Metagenomic Data Sets Reveals Novel Microbes in the Brackish Waters of the Caspian Sea.

Authors:  Maliheh Mehrshad; Mohammad Ali Amoozegar; Rohit Ghai; Seyed Abolhassan Shahzadeh Fazeli; Francisco Rodriguez-Valera
Journal:  Appl Environ Microbiol       Date:  2016-01-04       Impact factor: 4.792

Review 2.  Survey of (Meta)genomic Approaches for Understanding Microbial Community Dynamics.

Authors:  Anukriti Sharma; Rup Lal
Journal:  Indian J Microbiol       Date:  2016-11-11       Impact factor: 2.461

3.  In-depth Spatiotemporal Characterization of Planktonic Archaeal and Bacterial Communities in North and South San Francisco Bay.

Authors:  Anna N Rasmussen; Julian Damashek; Emiley A Eloe-Fadrosh; Christopher A Francis
Journal:  Microb Ecol       Date:  2020-11-05       Impact factor: 4.552

4.  Genomes of Novel Microbial Lineages Assembled from the Sub-Ice Waters of Lake Baikal.

Authors:  Pedro J Cabello-Yeves; Tamara I Zemskaya; Riccardo Rosselli; Felipe H Coutinho; Alexandra S Zakharenko; Vadim V Blinov; Francisco Rodriguez-Valera
Journal:  Appl Environ Microbiol       Date:  2017-12-15       Impact factor: 4.792

5.  Genome-centric resolution of novel microbial lineages in an excavated Centrosaurus dinosaur fossil bone from the Late Cretaceous of North America.

Authors:  Renxing Liang; Maggie C Y Lau; Evan T Saitta; Zachary K Garvin; Tullis C Onstott
Journal:  Environ Microbiome       Date:  2020-03-19

6.  Genomic and Transcriptomic Evidence for Carbohydrate Consumption among Microorganisms in a Cold Seep Brine Pool.

Authors:  Weipeng Zhang; Wei Ding; Bo Yang; Renmao Tian; Shuo Gu; Haiwei Luo; Pei-Yuan Qian
Journal:  Front Microbiol       Date:  2016-11-15       Impact factor: 5.640

7.  Genome diversity of marine phages recovered from Mediterranean metagenomes: Size matters.

Authors:  Mario López-Pérez; Jose M Haro-Moreno; Rafael Gonzalez-Serrano; Marcos Parras-Moltó; Francisco Rodriguez-Valera
Journal:  PLoS Genet       Date:  2017-09-25       Impact factor: 5.917

Review 8.  From cultured to uncultured genome sequences: metagenomics and modeling microbial ecosystems.

Authors:  Daniel R Garza; Bas E Dutilh
Journal:  Cell Mol Life Sci       Date:  2015-08-09       Impact factor: 9.261

9.  Marine microbial communities of the Great Barrier Reef lagoon are influenced by riverine floodwaters and seasonal weather events.

Authors:  Florent E Angly; Candice Heath; Thomas C Morgan; Hemerson Tonin; Virginia Rich; Britta Schaffelke; David G Bourne; Gene W Tyson
Journal:  PeerJ       Date:  2016-01-05       Impact factor: 2.984

10.  Not All Particles Are Equal: The Selective Enrichment of Particle-Associated Bacteria from the Mediterranean Sea.

Authors:  Mario López-Pérez; Nikole E Kimes; Jose M Haro-Moreno; Francisco Rodriguez-Valera
Journal:  Front Microbiol       Date:  2016-06-22       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.