Cyanobacterial harmful algal blooms (cyanoHABs) appear to be increasing in frequency on a global scale. The Cyanobacteria in blooms can produce toxic secondary metabolites that make freshwater dangerous for drinking and recreation. To characterize microbial activities in a cyanoHAB, transcripts from a eutrophic freshwater reservoir in Singapore were sequenced for six samples collected over one day-night period. Transcripts from the Cyanobacterium Microcystis dominated all samples and were accompanied by at least 533 genera primarily from the Cyanobacteria, Proteobacteria, Bacteroidetes and Actinobacteria. Within the Microcystis population, abundant transcripts were from genes for buoyancy, photosynthesis and synthesis of the toxin microviridin, suggesting that these are necessary for competitive dominance in the Reservoir. During the day, Microcystis transcripts were enriched in photosynthesis and energy metabolism while at night enriched pathways included DNA replication and repair and toxin biosynthesis. Microcystis was the dominant source of transcripts from polyketide and non-ribosomal peptide synthase (PKS and NRPS, respectively) gene clusters. Unexpectedly, expression of all PKS/NRPS gene clusters, including for the toxins microcystin and aeruginosin, occurred throughout the day-night cycle. The most highly expressed PKS/NRPS gene cluster from Microcystis is not associated with any known product. The four most abundant phyla in the reservoir were enriched in different functions, including photosynthesis (Cyanobacteria), breakdown of complex organic molecules (Proteobacteria), glycan metabolism (Bacteroidetes) and breakdown of plant carbohydrates, such as cellobiose (Actinobacteria). These results provide the first estimate of secondary metabolite gene expression, functional partitioning and functional interplay in a freshwater cyanoHAB.
Cyanobacterial harmful algal blooms (cyanoHABs) appear to be increasing in frequency on a global scale. The Cyanobacteria in blooms can produce toxic secondary metabolites that make freshwater dangerous for drinking and recreation. To characterize microbial activities in a cyanoHAB, transcripts from a eutrophic freshwater reservoir in Singapore were sequenced for six samples collected over one day-night period. Transcripts from the Cyanobacterium Microcystis dominated all samples and were accompanied by at least 533 genera primarily from the Cyanobacteria, Proteobacteria, Bacteroidetes and Actinobacteria. Within the Microcystis population, abundant transcripts were from genes for buoyancy, photosynthesis and synthesis of the toxin microviridin, suggesting that these are necessary for competitive dominance in the Reservoir. During the day, Microcystis transcripts were enriched in photosynthesis and energy metabolism while at night enriched pathways included DNA replication and repair and toxin biosynthesis. Microcystis was the dominant source of transcripts from polyketide and non-ribosomal peptide synthase (PKS and NRPS, respectively) gene clusters. Unexpectedly, expression of all PKS/NRPS gene clusters, including for the toxins microcystin and aeruginosin, occurred throughout the day-night cycle. The most highly expressed PKS/NRPS gene cluster from Microcystis is not associated with any known product. The four most abundant phyla in the reservoir were enriched in different functions, including photosynthesis (Cyanobacteria), breakdown of complex organic molecules (Proteobacteria), glycan metabolism (Bacteroidetes) and breakdown of plant carbohydrates, such as cellobiose (Actinobacteria). These results provide the first estimate of secondary metabolite gene expression, functional partitioning and functional interplay in a freshwater cyanoHAB.
The frequency of cyanobacterial harmful algal blooms (cyanoHABs) appears to be increasing on a global scale and is linked to nutrient pollution and changing regional climates (Sellner ; Paerl ). Anoxic dead zones, toxin production and foul odors that can require years to abate make CyanoHABs unsafe and undesirable (Paerl ; Pan ). Cyanobacterial blooms usually contain one prominent bloom-forming primary producer and a diverse mix of other photosynthetic and non-photosynthetic microorganisms that contribute to nutrient and energy cycling (Litchman ). The cyanoHAB agent in the Kranji Reservoir, Singapore is Microcystis aeruginosa. This reservoir is a tropical, well-mixed, shallow eutrophic impoundment that experiences persistent cyanoHABs with detectable levels of microcystin toxin (Te and Gin, 2011). In the present work, the goal is to characterize the dynamics and variability of gene expression over a day/night cycle in the Kranji Reservoir for M. aeruginosa and other major taxa to gain insight into toxin synthesis and partitioning of functions for nutrients and energy cycling that affect water quality.Functional and taxonomic annotation of expressed genes (mRNA) recovered from complex microbial communities (that is, metatranscriptomics) has enabled breakthrough insights into resource utilization and niche partitioning in microbial ecosystems (for example, Gifford ; Ottesen ; Vila-Costa ). To date, comparatively few metatranscriptomic studies have been carried out in freshwater environments, and community gene expression in a freshwater cyanoHAB remains undescribed. Expression of toxin genes within cyanoHABs is of particular interest because of their effect on health. Cyanotoxins such as microcystin, anatoxin-a, aeruginosin, cylindrospermopsin and microviridin have been linked to symptoms from skin irritation to liver failure and death (Carmichael, 1992; Pouria ; Carmichael, 2001; Ehrenreich ; Stewart ; Rounge ; Dittmann ). Although toxin production in HAB-forming cyanobacteria has been studied in detail using cultured isolates, little is known about the ecological role of these toxins, and the expression of toxin genes within blooms has not been measured. Through analysis of gene expression in the bloom community, it can be determined which toxin synthesis pathways are highly expressed and whether expression is variable as may be expected in response to transient environmental cues or consistent as may be expected for functions central to the physiology of the organism.In this study, bacterioplankton from Kranji Reservoir were sampled over a 22-h period, and transcripts were sequenced. The distributions of expressed genes from different taxonomic groups in the bloom community were annotated based on function and analyzed to identify resource partitioning and functional interplay. In addition, with regards to the Microcystis population four question were addressed: (1) Which transcripts are most abundant? (2) Which transcripts change from day to night? (3) Are there differences in expression of genes from the core and flexible genome in Microcystis? and (4) what toxin-related biosynthetic pathways are detected, and how do their expression change over time? This is the first study to examine gene expression in a complex bacterioplankton community dominated by Microcystis over a diurnal cycle.
Materials and methods
Six grab samples were collected from Kranji Reservoir at 10-20 cm beneath a visible algal surface scum on 14 and 15 January 2010 at 1700, 2000, 0600, 0800, 1300 and 1500 hours followed by filtration onto a 0.22-μm Sterivex membrane and preservation in RNAlater (Invitrogen, Carlsbad, CA, USA; Wang, 2011). Environmental metadata for temperature, nutrient concentrations and dissolved microcystin toxin concentrations were measured for each sample. Detailed methods for sample processing and analysis are provided as an online supplement.Briefly, RNA was extracted from filters and prepared for sequencing using a series of kits according to the manufacturers' protocols. The Trizol reagent (Invitrogen) was used for RNA extraction, followed by depletion of tRNA, rRNA and eukaryotic mRNA using Ambion kits MegaClear, MicrobeEnrich and MicrobeExpress, respectively, followed by treatment with mRNA-only (Epicentre, Madison, WI, USA). RNA was amplified using MessageAmpII (Ambion, Austin, TX, USA), followed by double-stranded cDNA synthesis (Superscript, Invitrogen) and multiplexing for sequencing in a single lane of an Illumina GAII using Illumina (San Diego, CA, USA) adaptors ligated to custom barcodes. Sequence data have been submitted to the GenBank databases under accession PRJNA238448. Quantitative PCR (qPCR) was used to quantify cDNA transcripts using published primers and conditions (Nonneman and Zimba, 2002; Furukawa ; Shao ). Ribosomal RNA transcripts were identified by blastn (<1e-20, at >60% sequence identity and >50 bp alignment) to the SILVA database and classified in MG-RAST. Blastx against NR (bit-score>40) was used to taxonomically and functionally classify mRNA in MEGAN using the KEGG hierarchy. Expression of Microcystis genes was measured by recruiting reads to a M. aeruginosa pan-genome (>90 bp alignment; 90% identity) constructed from two publicly available genomes followed by calculation of RPKM (reads per kilobase per million mapped reds) in CLC Genomics Workbench to generate transcript profiles. Sample transcript profiles were log2 transformed, normalized and compared by Principal Component Analysis and Hierarchical Clustering in Multiexperiment Viewer (Saeed et al., 2006). Differentially expressed genes were identified through the Student's t-test and with a multiple comparison test Significance Analysis of Microarrays (SAM) implemented in Multiexperiment Viewer. The differential distribution of KEGG pathways in Microcystis among different samples and among the four most abundant bacterial phyla was determined using a non-parametric Gene Set Enrichment Analysis (GSEA). Evidence for community polyketide and non-ribosomal peptide synthase (PKS and NRPS, respectively) expression was obtained by identifying ketosynthase (KS) and condensation (C) domains with the online tool NaPDoS (Ziemert ).
Results
Sampling and sequencing
Six metatranscriptomes generated from the Kranji Reservoir contained 0.49 Gbp resulting in between 711 948 and 944 045 reads per sample. In all, 7–31% of these sequences represented rRNA as identified by blastn to the SILVA SSU database. Out of the remaining reads, 48–75% were assumed to be mRNA due to their match with proteins in the non-redundant protein database (Supplementary Table S1 and Supplementary Figure S1) and are hereafter referred to as ‘annotated mRNA'. The main environmental variables among collected metadata were light and temperature while nutrients and microcystin toxin did not vary significantly throughout the study period (Supplementary Figure S2).
Community structure of active bacterioplankton
The phylum Cyanobacteria accounted for the majority of all annotated mRNA and SSU rRNA reads (69% and 76%, respectively) followed by Proteobacteria (4% and 5%), Bacteroidetes (2% and 1%) and Actinobacteria (2% and 0.4%, Supplementary Figure S1). Taxonomic assignment of recovered SSU rRNA sequences revealed 533 co-existing bacterial genera in addition to microbial eukaryotes and grazers (for example, Daphnia, Rotifers) where Microcystis transcripts accounted for 63% of SSU rRNA and 39% of annotated mRNA (Supplementary Database S1 and S2). Additional observed genera each represented <0.5% of annotated mRNA transcripts, with the exception of the cyanobacterial genus Nostoc (1.2%).
Environmental expression of a M. aeruginosa pan-genome
M. aeruginosa transcripts from the six metatranscriptomes were identified by comparison to a Microcystis pan-genome constructed from strains PCC7806 and NIES-843 containing 8132 genes corresponding to core (in both strains) or flexible (in one strain) genes (Figure 1a). RPKM values for the Microcystis genes recA, psbA, prx, mcyA and mcyB were significantly correlated (Pearson's R>0.87, P<0.01) to qPCR values that were normalized to the stably expressed housekeeping gene grpE (Shao ) (Supplementary Table S2). In contrast, RPKM values were not well correlated with qPCR quantification of the same targets per ng of total cDNA consistent with the nature of RPKMs as a measure of relative expression within a genome rather than the absolute magnitude. Core genes are more highly represented and abundant among transcripts than flexible genes (Figure 1). All but a single core hypothetical gene is represented in the transcriptomes, while 93% of the 2902 flexible genes from M. aeruginosa NIES-843 and 76% of the 1868 flexible genes from M. aeruginosa PCC7806 were detected in at least one of the samples (Figure 1b).
Figure 1
Recruitment of transcripts from the Kranji Reservoir to a Microcystis aeruginosa pan-genome based on strains NIES-843 and PCC7806. (a) Venn diagram of the core (shared) and flexible (strain specific) genes that define the pan-genome. (b) Pie charts indicate the prevalence of transcripts from core or flexible genomes among the six metatranscriptomes. (c) Box plots present the median and percentile distribution in increments of 10% for M. aeruginosa transcript abundance (RPKM) for the six samples. Sample names Krb1-6 correspond with the collection time shown below the name.
The most highly represented transcripts in the Kranji Reservoir Microcystis population stem from genes encoding photosynthesis, genetic information processing, gas vesicle synthesis, stress response, DNA repair, microviridin production and hypotheticals (Figure 2). GvpA (gas vesicle protein A) is the most highly represented transcript at all times except at 0800 hours when PsbA (photosystem II P680 reaction center D protein) transcripts are most abundant (Figure 2). Most highly expressed hypothetical genes are located next to gas vesicle (gvp) genes, photosynthetic pigment genes or mobile genetic elements (Figure 2) and may have related functions.
Figure 2
Highly represented transcripts from the M. aeruginosa pan-genome listed in descending order of average RPKM value. Heat map scale is the log2 of RPKM values. The 25 most abundant transcripts from each of the six samples correspond to 53 different genes. Gene numbers are given followed by a letter to indicate membership in the core or flexible genome (C=core, N=flexible NIES-843P=flexible PCC7806) and functions are given for PS=Photosynthesis, GV=Gas Vesicles, MGE=Mobile Genetic Element, GIP=Genetic Information Processing, aaM=amino acid metabolism and Chap=Chaperones. Hypothetical genes annotated based on gene neighborhoods or published studies (that is, microviridin, Ziemert ) are marked with asterisk (*).
Diel changes in Microcystis transcript abundance
Principal Component Analysis of transcript abundance (RPKM) from the Microcystis pan-genome revealed a partitioning of samples into two major groups that coincided with collection during the daytime ‘day' (0800, 1300 and 1500 hours) and early evening/night ‘night' (1700, 2000, 0600 hours) (Figure 3). Hierarchical Clustering supported this partitioning (data not shown). Screening by SAM analysis and the Student's t-test revealed 77 genes with significantly higher transcript abundance in day samples, including genes involved in energy metabolism, photosynthesis, oxidative phosphorylation, genetic information and processing, environmental information processing and metabolism of carbohydrates, amino acids (aa), nucleotides, cofactors and vitamins, amino sugars, nucleotide sugars and purines (SAM median false detection rate (FDR)=0.012 and t-test P<0.01; Supplementary Figure S3). Genes enriched at night participated in genetic information processing, metabolism of cofactors and vitamins, DNA replication and repair, mismatch repair and microcystin production (mcyA and mcyD), although differential expression of these single genes did not reach statistical significance after correction for multiple hypotheses or by SAM analysis (Supplementary Figure S4).
Figure 3
Principal component analysis of M. aeruginosa transcriptome profiles. The first two principal components shown account for 48% of the total variance of the data.
To further explore changes in Microcystis activity across the samples, genes were sorted into metabolic pathways based on the KEGG hierarchy and secondary metabolite gene clusters. Non-parametric GSEA revealed that KEGG pathways for photosynthesis and protein biosynthesis were upregulated during the day while multiple pathways for DNA synthesis and repair along with nitrogen metabolism, butanoate metabolism, aminoacyl-tRNA biosynthesis, porphyrin and chlorophyll metabolism and synthesis of the toxins microcystin and aeruginosin were enriched at night (P<0.001; FDR<0.25; Table 1).
Table 1
Gene set pathways in M. aeruginosa showing significant changes in transcript representation between day and night
Minimum number of genes per gene set=5, 75 gene sets used in analysis.
Data shown for pathways enriched with NOM P-value<0.05 and FDR<0.25, based on 1000 perturbations.
Functional diversity and partitioning in the bacterioplankton community
Although Microcystis genes are dominant (45-63%) among annotated bacterial transcripts, the remaining proportion of bacterial transcripts originated from a diverse assemblage where an additional 29-34% had taxonomic assignments to four phyla: Cyanobacteria (excluding Microcystis), Proteobacteria, Actinobacteria, and Bacteroidetes (Supplementary Figure S1C). Functional gene expression based on the abundance of reads that recruit to KEGG categories was compared among the four majority phyla. Overall, 34% of the reads from the top four phyla could be assigned to a KEGG pathway. At the highest level of the KEGG function hierarchy, metabolism is most highly represented (54–71%), followed by genetic information processing (14–34%), environmental information processing (6–13%) and cellular processes (2–5%). GSEA identified 26 KEGG pathways significantly enriched in the Proteobacteria relative to the other top phyla (P<0.05, FDR<0.10), including motility/chemotaxis, secretion and metabolism of carbohydrates and amino acids and breakdown of several types of xenobiotics (Supplementary Table S3). Cyanobacteria are enriched in four KEGG pathways related to photosynthesis and vitamin metabolism. Bacteroidetes are enriched in pathways for glycosidases and glycan biosynthesis as well as degradation pathways shared with lysosomes. No enriched pathways were observed for Actinobacteria likely due to low statistical power from low sequence coverage.
Diversity and function of transporters and two-component systems
Expression of transporters and two-component systems within the major cyanoHAB taxa provides insight into their sensing and utilization of environmental resources and respective ecological niches. In the Kranji Reservoir, there is evidence for expression of 82 different ABC transporters and 94 different two-component systems the most abundant of which include the sensing and uptake of nutrients (all major Phyla), chemotaxis and cell motility through flagellar assembly (Proteobacteria), mechanisms for cell-cell interaction, including pilus formation (Cyanobacteria and Proteobacteria) and secretion systems (Proteobacteria) and efflux of unknown chemicals (all major Phyla) (Figures 4 and 5, Supplementary Figures S5-S7, Supplementary Tables S4, S6 and S7).
Figure 4
ABC transporters. Bar graph showing the number of reads matching ABC transporters in the top four phyla on the x axis. The y axis contains a description of the typical transported substrates followed by the names of the genes that make up the transporter based on KEGG annotation. See Supplementary Table S6 for KEGG identification numbers. Only transporters with at least 20 reads are shown. Black lines delineate the KEGG transporter categories: (1) saccharides and polyols, (2) phosphate and amino acids, (3) peptide and nickel, (4) mineral and organic ions, (5) metallic cations, iron-siderophore and vitamin B12 transport, (6) lipoprotein releasing, (7) lipooligosaccharide, (8) antibiotics, (9) heme, (10) cell division and (11) eukaryotic types.
Figure 5
Two-component systems. The number of reads matching two-component systems in the top four phyla are shown on the x axis. The y axis contains a description of the function followed by the gene names for the system based on KEGG annotation. The top five most common systems for each Phylum are shown.
Among the major phyla, high and variable proportions of transcripts for sensing and uptake of nitrogen and phosphorous from both inorganic and organic sources suggests resource partitioning and active recycling of nutrients within the bloom. In Cyanobacteria, the phosphate uptake system PstABCS is the most abundantly expressed ABC transporter. In contrast, uptake of nitrogen-rich peptides and amino acids are the top functions of ABC transport transcripts from Proteobacteria, Bacteroidetes and Actinobacteria (Figure 4), and sensing and responding to nitrogen availability is the top function among two-component system transcripts in Proteobacteria (GlnL/GlnG) (Figure 5). ABC transport of urea appears to be uniquely associated with Cyanobacteria, while utilization of organicphosphorous is evidenced by expression of the phosphonate transporter (Phn) in Proteobacteria and, to a much lesser extent, in Cyanobacteria, suggesting a likely role for heterotrophic Proteobacteria in recycling reduced organicphosphorous.The intensity of nutrient limitation in the bloom may vary as a function of time of day. Samples collected at 1300 and 1500 hours revealed elevated expression of nutrient transporters in the Microcystis population relative to samples collected during the evening/night and morning (Supplementary Figure S8), pointing to the depletion of intracellular concentrations from sustained photosynthesis. At 1300 hours, maximal expression is evident for transporters of urea and ammonium as well as GlnB that regulates ammonia assimilation (Supplementary Figure S8). At both 1300 and 1500 hours, expression is elevated for the phosphate ABC transporter and alkaline phosphatase, which is responsible for cleaving phosphate groups off organic molecules.Uptake of inorganic carbon is dominated by Cyanobacteria, corresponding to all annotated ABC transporter transcripts for bicarbonate. Uptake of organiccarbon reveals potential for niche overlap and specialization. All phyla express transporters for uptake of peptides and branched chain amino acids. Proteobacteria are responsible for the majority of sorbitol and mannitol ABC transporters, and transcripts for the multi-component PhosphoTransferase System for neutral sugars were enriched in the Proteobacteria by GSEA (Supplementary Table S3 and S4). ABC transporters for cellobiose, xylose and alpha glucosides are almost exclusively annotated as actinobacterial, while in Bacteroidetes the peptide/nickel ABC transporter is represented by the most transcripts, far exceeding transcripts for uptake of other carbon substrates.
Analysis of secondary metabolites
To gain insight into the balance between synthesis and biodegradation of metabolites that negatively impact water quality, the metatranscriptomes were screened for secondary metabolite biosynthesis pathways and for biodegradation pathways of the toxin microcystin. The presence of KS and C domains, which were used as a proxy for identifying expressed NRPS and PKS pathways, were predominantly cyanobacterial (Figures 6a and b). A total of 73 different KS domains (1629 reads) and 82 different C domains (924 reads) from PKS and NRPS gene clusters, respectively, were identified that appear to originate from Cyanobacteria based on blastx. Of those cyanobacterial KS domains, 34 share >90% aa sequence identity with recruited reads including 25 KS domains from Microcystis strains, which strongly suggest that the corresponding PKS/NRPS clusters are expressed in the Reservoir. In contrast, only 8 of the 129 non-cyanobacterial KS domains have >90% sequence identity with recruited reads, and none match PKS or NRPS pathways with known products.
Figure 6
Taxonomic affiliation of Ketosynthase (KS) and condensation (C) domain transcripts matching the NaPDoS database of representative pathways linked to a known PKS or NRPS product.
The presence of C and KS domains and RPKM data indicates that all of the seven PKS and NRPS gene clusters in the Microcystis pan-genome were expressed in the reservoir (Table 2). Three of the seven clusters are known to produce the compounds microcystin (cluster 1), cyanopeptolin (cluster 8) and aeruginosin (cluster 9) and are in both reference genomes. Cluster 4 from NIES-843 has no known product, yet appears to be the most highly transcribed PKS/NRPS cluster in Kranji Reservoir in all samples (Table 2). Of the PKS/NRPS clusters in the two reference genomes, mean transcript RPKM from aeruginosin and microcystin clusters appear to vary significantly between the day and night (P=0.028 and P=0.017, respectively) with higher levels at night (Figures 7a and b) in agreement with GSEA analysis (Table 1) and qPCR (Supplementary Table S2). Eight additional PKS/NRPS gene clusters from other M. aeruginosa strains, as well as Planktothrix, Acaryochloris and Cyanothece, were observed in the Kranji Reservoir transcripts (Table 2) and shared >90% (aa) sequence identity to cyanobacterial domains in non-redundant protein database (Table 2).
Table 2
Cyanobacterial natural product gene clusters detected in Kranji reservoir
Cluster no.
Pathway type
Pathway name
Reference strain
GI nos.
No. of genes in cluster
No. of genes shared
KS/C domains in pathway
Total KS/C reads
Reference
1
PKS/NRPS
Microcystin
M. aeruginosa PCC7806
6007547–6007556
12
10
4/6
238/249
Tillett et al., 2000
2
PKS
Lectin like
M. aeruginosa M228
7288161–7288163
3
NA
1/0
12/0
Jimbo et al., 2000
3
PKS/NRPS
psm3
M. aeruginosa K-139
150445937–150445949
13
NA
1/4
12/11
Nishizawa et al., 2007
4
PKS/NRPS/FAS
None
M. aeruginosa NIES-843
166365520–166365529
10
1
3/2
857/163
Shulse and Allen, 2011
5
PKS
NIES-843 enediyne
M. aeruginosa NIES-843
166364025
1
1
1/0
168/0
Frangeul et al., 2008
6
PKS
PKS iterative
M. aeruginosa PCC7806
159028726–159028735
10
1
1/0
100/0
Frangeul et al., 2008
7
PKS
PKS1/PKS 3
M. aeruginosa PCC7806
159029973–159029981
9
3
3/0
105/0
Frangeul et al., 2008
8
NRPS
Cyanopeptolin
M. aeruginosa NIES-843
166368741–166368744
4
4
0/6
0/106
Frangeul et al, 2008
9
NRPS
Aeruginosin
M. aeruginosa PCC7806
159030043–159030054
13
10
0/4
0/161
Frangeul et al., 2008
10
NRPS
Micropeptin
M. aeruginosa K-139
223717788–223717791
4
NA
0/7
0/133
Nishizawa et al., 2011
11
NRPS
Oscillapeptin
Planktothrix rubescens NIVA-CYA 98
258380691–258380695
5
NA
0/4
0/24
Blom et al., 2003
12
NRPS
Anabaenopeptin
Planktothrix agardhii NIVA-CYA 126/8
157804708–157804711
4
NA
0/4
0/38
Christiansen et al., 2011
13
PKS/NRPS
Microginin
Planktothrix rubescens NIVA-CYA 98
258380635–258380636
2
NA
1/3
0/6
Rounge et al., 2009
14
NRPS
None
Acaryochloris marina MBIC11017
158339498–158339499
2
NA
0/2
0/2
Swingley et al., 2008
15
NRPS
None
Cyanothece sp. PCC8802
257059705
1
NA
0/3
0/8
Welsh et al., 2008
Note: The newly identified PKS and NRPS gene clusters (MIC2 and MIC3) from recently sequenced Microcystis reference strains (Humbert ) were not detected in Kranji Reservoir samples. MIC1 from Humbert is defined here as Cluster 4.
Figure 7
Expression of secondary metabolite gene clusters, (a) clusters with a known product, (b) clusters that are not linked to a specific product. The error bars indicate 2 s.ds. from the mean. The P-values are for the test to reject null hypothesis that there is no difference between the mean night and day transcript abundance.
Ribosomally produced secondary metabolites were also observed in Kranji Reservoir, including the toxin microviridin and the odor-causing compound geosmin. The precursor peptide for microviridin was among the most highly expressed genes of Microcystis (Figure 2), whereas the precursor peptide of a second ribosomally synthesized toxin microcyclamide (Ziemert ) was undetected. Five transcripts were recovered that matched the Nostoc geoA gene for geosmin synthase while none were recovered for genes for the synthesis of the odor compound MIB-2.Biodegradation pathways for most secondary metabolites are currently unknown. Betaproteobacterial genera recently implicated in microcystin biodegradation (Mou ) were observed among sequenced transcripts (Supplementary Database 1). No significant sequence similarity was observed for the mrlAB-mediated microcystin biodegradation pathway (Bourne ; Mou ).
Discussion
During the collection period, a green scum was apparent in the Kranji Reservoir, which has previously been identified as dominated by Microcystis cells. This study confirms the dominance and shows that approximately half of the bacterial transcripts are from Microcystis, exceeding transcripts from other cyanobacterial populations by at least 20-fold and non-cyanobacterial genera by at least 100-fold and enabling inference of trends at the population level for Microcystis and at the Phylum level for other taxa.
Genomic diversity of M. aeruginosa in Kranji Reservoir
Transcripts from a large majority of genes in the M. aeruginosa core and flexible genome were identified in the Kranji Reservoir. M. aeruginosa is a highly characterized cyanoHAB agent with high phenotypic and genetic diversity (Briand ; Yoshida ; van Gremberghe ; Steffen ; Humbert ), including a variable distribution of toxin genes among different strains (Frangeul ). Observation of flexible genes in Kranji Reservoir, Singapore from two M. aeruginosa strains isolated over broad geographic and temporal scales (that is, NIES-843 isolated in Japan in 1997 (Tanabe ) and PCC7806 from the Netherlands in 1972 (Kappers, 1984)) is consistent with other studies documenting the global dispersal and recombination of genomes within this species (Tanabe ; van Gremberghe ; Humbert ). Recently, 10 additional M. aeruginosa genomes were sequenced, refining the M. aeruginosa core genome from the 3362 genes considered in this study to a set of 2462 genes and extending the flexible genome by more than a factor of two (Humbert ). Compared with this extended pan-genome, the majority of Kranji Reservoir Microcystis transcripts shared highest homology with strain NIES-843 included in the two-strain pan-genome (Supplementary Table S5). Interestingly, Lake Erie also showed higher recruitment of metagenome reads to strain NIES-843 over PCC7806 (Steffen ). Flexible genes had lower transcript abundance than core genes, likely reflecting the existence of additional Microcystis genomes in the Kranji Reservoir with a common core and novel flexible genes. It is likely that further sequencing will reveal additional diversity, and work is currently in progress to characterize the genomic diversity of Microcystis in the Kranji Reservoir.
Highly expressed genes in M. aeruginosa and partitioning of functions between day and night
M. aeruginosa gene expression has been previously characterized under controlled experimental conditions (Straub ; Harke and Gobler, 2013), and comparison with environmental populations (this study) highlights both similarities and differences potentially associated with environmental complexity. Strain PCC7806 examined by microarray exhibited common trends with the present study where the most highly expressed genes were for energy metabolism (photosynthesis) and were enriched during the day together with transcripts for carbon fixation, stress response and ribosome production (Straub ). At night, the microarray study of PCC7806 revealed enrichment of the tricarboxylic acid-branched pathway, glycogen degradation, the oxidative pentose phosphate pathway and ammonium uptake (Straub ). With the exception of ammonium uptake (Supplementary Figure S8), these observations are consistent with qualitative trends from individual genes in the present data set that do not reach statistical significance. In addition, in the reservoir population, there was significant nighttime enrichment of multiple additional pathways (Table 1) including DNA repair. Elevated transcription of DNA repair genes at night is also evident in marine Prochlorococcus (ProPortal Search, May 2013, (Kelly )). Low-light periods provide a reprieve from stresses associated with ultraviolet exposure and oxygenic photosynthesis. Nighttime repair of DNA damage incurred during the day may improve the overall fitness of environmental populations. In the reservoir, Microcystis transcripts for gas vesicle proteins (GvpA and GvpC) were expressed at a similar level as energy metabolism (photosynthesis) genes, indicating the dual importance of competition for surface colonization and light-harvesting for the success of bloom-forming Microcystis populations. Other genes highly expressed within the reservoir Microcystis population include the pilin subunit pilA and the two-component system regulator for Type IV pili that mediate cell-cell contact, twitching motility and have been implicated in lateral gene transfer in environmental populations (Figure 5) (Nakasugi and Neilan, 2005). Transcripts for stress-responsive genes for chaperon HspA and DnaK and oxidative stress mitigating peroxiredoxin were also highly represented in the reservoir population and may help the cells acclimate to dynamic environmental conditions.Although previous experimental work has indicated that Kranji reservoir is phosphorous limited (Gin ), the extent and nature of nutrient limitation may vary on a daily timescale. A recent study of M. aeruginosa clone LE-3 demonstrates increased expression of nitrogen or phosphorous uptake systems during experimentally induced limitation of the respective nutrients in lake water (Harke and Gobler, 2013). In the reservoir's Microcystis population, the highest transcript abundance for nitrogen uptake (urea and ammonium) and phosphate uptake were observed at 1300 and 15 hours and may reflect nutrient limitation induced by elevated growth rates and competition for resources during high light and temperatures midday (Supplementary Figure S2).
Secondary metabolites and toxins
Secondary metabolite compounds are small molecules that can act as toxins, antibiotics, anti-cancer agents, surfactants, metal chelators and contribute to off-flavors and odors in freshwater (Di Lorenzo ; Oliynyk ; Fenical ; Haft ). Although the ecological function of many of these compounds remains unclear, recent studies have shed light on some potential roles within the context of a bloom community. The most highly expressed secondary metabolite gene from M. aeruginosa in Kranji Reservoir corresponds to the ribosomally produced serine-protease inhibitor microviridin (Ziemert ), with maximal expression at 1700 and 2000 hours (Figure 2). A microviridin variant (type J) has been shown to be toxic to freshwater Daphnia; a grazer of Microcystis (Rohrlack , 2004), and production of toxins such as microviridin may represent an antipredatory strategy in bloom-forming Cyanobacteria. Other cyanotoxins such as microcystin can also be toxic to Daphnia (Rohrlack ) although grazers adapted to high concentrations demonstrate resistance (Chislock ). Microcystin may also function to enhance the aggregation of Microcystis cells as often occurs during bloom formation (Gan ) and to provide a fitness advantage to Microcystis under oxidative stress (Zilliges ). Despite transcript evidence that the microcystin biosynthetic operon was expressed throughout the day-night cycle, concentrations of dissolved microcystin toxin were measured near the limit of detection in this study (Supplementary Figure S2), potentially reflecting low synthesis activity, biodegradation (Bourne ; Mou ) and/or conversion of microcystin to insoluble conjugates that are not detected by the enzyme-linked immunosorbent assay-based method used in this study (Meissner ).The most highly expressed PKS/NRPS gene cluster was cluster 4 (Figure 6), part of the flexible genome of strain NIES-843. Although the product of cluster 4 has not been identified, bioinformatics analysis indicates that it is likely a hybrid polyketide/fatty acid product (Shulse and Allen, 2011). Interestingly, cluster 4 is related to a cluster previously identified in Anabaena sp. PCC7120 (Johnson ) that has a role in heterocyst formation (Mella-Herrera ), which is not a documented trait in Microcystis species (Watanabe, 1996).The abundance of toxin-producing Microcystis in the reservoir (Te and Gin, 2011) represents a rare opportunity to examine temporal dynamics of expression of PKS/NRPS genes, which typically remains undetected without specific primers (Foerstner ; Kaluzhnaya ). Transcripts for all PKS/NRPS genes from Microcystis strainsNIES-843 and PCC7806 were observed in the Kranji Reservoir throughout the day/night cycle, suggesting that they are continually expressed and thus are likely central to the physiology of the cells rather than produced in response to transient environmental cues. Enrichment of transcripts from PKS/NRPS gene clusters for microcystin and aeruginosin was apparent at night after normalization of transcript abundance to a housekeeping gene (qPCR) or to total gene expression (RPKM) (Table 1, Figure 6). Previous studies report higher transcript abundance of toxin genes during the day (Straub ), which agrees with our findings for all qPCR targets quantified per ng cDNA in the samples (including microcystin biosynthesis genes mcyA and mcyB). However, normalization removes trends due to changes in baseline cellular activity such as growth and suggests that cells allocate a greater portion of their energy budget to toxin synthesis at night. This pattern was also observed for the toxin aeruginosin. We consider several hypotheses to explain this variation. First, the null hypothesis that these differences are due to chance alone was rejected (P<0.05) for GSEA; however, the low statistical power afforded by six samples precludes robust rejection as evidenced by a FDR of 18% (Table 1). Although the present study represents a level of replication on par with, or exceeding, recent metatranscriptome sequencing efforts (Ottesen ; Vila-Costa ), we identify the need to expand replication in future works. Alternately, enrichment of toxin gene transcripts at night may be due to heterogeneity in the distribution of Microcystis cells containing toxin genes, resulting in an increased proportion of toxigenic cells in samples collected at night. Finally, if enrichment is due to increases in toxin gene expression per cell at night, this may be an adaptation to ecological factors, such as enhanced nighttime grazing pressure. Indeed Daphnia grazers have been documented to carry out a diel vertical migration, rising to surface waters at dusk (Haupt ). Elevated expression of genes for the antipredatory toxin microviridin in the 1700- and 2000- hour samples lends additional support to this hypothesis.
Partitioning of functions between phyla
The functions identified among metatranscripts show a phylum level partitioning lending to an apparent interplay within the HAB community. Based on ABC transporter profiles, Cyanobacteria have a major role in the uptake of phosphate, urea, neutral and branched chain amino acids and bicarbonate, likely to satisfy growth requirements during photosynthesis. At the same time, cells within the dominant cyanobacterial genus Microcystis produce gas vesicles that allow colonization of the upper water column and surface and secondary metabolites, including toxins with antipredatory activity such as microviridin and microcystin. Efficient competition for dissolved resources coupled with competition for space and defense against grazing likely contribute to the environmental fitness and competitive dominance of M. aeruginosa within the HAB community.The profile of transporter substrates in the co-existing Proteobacteria largely overlap with both Cyanobacteria and Actinobacteria, with highest transcription levels observed for uptake of branched chain amino acids and including transcripts for uptake of organicphosphorous (phosphonate) compounds. Enrichment of proteobacterial genes for motility and chemotaxis suggest enhanced access to ephemeral nutrient patches, which may evolve during cell lysis and bloom decay, and expression of pili and contact-based secretion systems (Type III, Type VI) indicate cell-cell contact and molecular exchange with other members of the bloom community. Actinobacterial ABC transporter profiles reveal a diverse array of substrates and suggest specialization for the photosynthetically derived carbohydrates (cellobiose, xylose, maltose and alpha glucosides) that are not substrates for highly expressed transporters in other phyla. Bacteroidetes ABC transporter profiles indicate peptides are a major source of organiccarbon while transcripts were enriched in pathways for hydrolysis and formation of glycosidic bonds, suggesting that they have a role in metabolism of complex sugars and dissolved proteins within the bloom community. Patterns in resource acquisition, utilization and secondary metabolite production are implicated as factors that support a bloom community where primary producers (Cyanobacteria) take up nutrients, inorganic carbon and monomers for primary production while producing a suite of small molecules with antipredatory activity while consumers (heterotrophic members of the Proteobacteria, Actinobacteria and Bacteroidetes) utilize photosynthate released from primary producers by exudation or cell lysis, thereby recycling the fixed energy and nutrients.
Conclusions and future directions
Water quality is an emergent property of a complex ecosystem composed of multiple interacting microbial populations. To understand the persistence of blooms and production of secondary metabolites that mediate water quality, the dynamics within the community must be considered. Several important limitations come with studying gene expression in a complex microbial community. We have used sequenced transcripts as a proxy for microbial community gene expression based on the assumption that biases introduced by collection, handling and sequencing are the same across samples and thus do not significantly impact conclusions based on comparison between samples. However, biases in database annotation are a serious concern for interpretation of sequence diversity. At the present time, genomes from freshwater microbes are under-represented in databases, limiting our ability to assign function and taxonomic affiliation to transcripts. Efforts are currently underway to sequence genomes from cultured isolates from the major phylogenetic groups of Kranji Reservoir and help address questions regarding the population genomics of Microcystis and the genomic diversity of co-occurring taxa. By integrating knowledge from isolates and model organisms with expression data from a complex ecosystem, we can gain a more realistic picture of the intricate microbial world.
Authors: Rodrigo A Mella-Herrera; M Ramona Neunuebel; Krithika Kumar; Sushanta K Saha; James W Golden Journal: J Bacteriol Date: 2011-02-11 Impact factor: 3.490
Authors: Ineke van Gremberghe; Frederik Leliaert; Joachim Mergeay; Pieter Vanormelingen; Katleen Van der Gucht; Ann-Eline Debeer; Gissell Lacerot; Luc De Meester; Wim Vyverman Journal: PLoS One Date: 2011-05-05 Impact factor: 3.240
Authors: Yvonne Zilliges; Jan-Christoph Kehr; Sven Meissner; Keishi Ishida; Stefan Mikkat; Martin Hagemann; Aaron Kaplan; Thomas Börner; Elke Dittmann Journal: PLoS One Date: 2011-03-18 Impact factor: 3.240
Authors: Jason Nicholas Woodhouse; Andrew Stephen Kinsela; Richard Nicholas Collins; Lee Chester Bowling; Gordon L Honeyman; Jon K Holliday; Brett Anthony Neilan Journal: ISME J Date: 2015-12-04 Impact factor: 10.302
Authors: A Katharina Makower; J Merijn Schuurmans; Detlef Groth; Yvonne Zilliges; Hans C P Matthijs; Elke Dittmann Journal: Appl Environ Microbiol Date: 2014-11-07 Impact factor: 4.792
Authors: Jia Y Har; Tim Helbig; Ju H Lim; Samodha C Fernando; Adam M Reitzel; Kevin Penn; Janelle R Thompson Journal: Front Microbiol Date: 2015-09-02 Impact factor: 5.640
Authors: Luisa W Hugerth; John Larsson; Johannes Alneberg; Markus V Lindh; Catherine Legrand; Jarone Pinhassi; Anders F Andersson Journal: Genome Biol Date: 2015-12-14 Impact factor: 13.583