| Literature DB >> 23555932 |
Björn Voss1, Henk Bolhuis, David P Fewer, Matthias Kopf, Fred Möke, Fabian Haas, Rehab El-Shehawy, Paul Hayes, Birgitta Bergman, Kaarina Sivonen, Elke Dittmann, Dave J Scanlan, Martin Hagemann, Lucas J Stal, Wolfgang R Hess.
Abstract
Nodularia spumigena is a filamentous diazotrophic cyanobacterium that dominates the annual late summer cyanobacterial blooms in the Baltic Sea. But N. spumigena also is common in brackish water bodies worldwide, suggesting special adaptation allowing it to thrive at moderate salinities. A draft genome analysis of N. spumigena sp. CCY9414 yielded a single scaffold of 5,462,271 nucleotides in length on which genes for 5,294 proteins were annotated. A subsequent strand-specific transcriptome analysis identified more than 6,000 putative transcriptional start sites (TSS). Orphan TSSs located in intergenic regions led us to predict 764 non-coding RNAs, among them 70 copies of a possible retrotransposon and several potential RNA regulators, some of which are also present in other N2-fixing cyanobacteria. Approximately 4% of the total coding capacity is devoted to the production of secondary metabolites, among them the potent hepatotoxin nodularin, the linear spumigin and the cyclic nodulapeptin. The transcriptional complexity associated with genes involved in nitrogen fixation and heterocyst differentiation is considerably smaller compared to other Nostocales. In contrast, sophisticated systems exist for the uptake and assimilation of iron and phosphorus compounds, for the synthesis of compatible solutes, and for the formation of gas vesicles, required for the active control of buoyancy. Hence, the annotation and interpretation of this sequence provides a vast array of clues into the genomic underpinnings of the physiology of this cyanobacterium and indicates in particular a competitive edge of N. spumigena in nutrient-limited brackish water ecosystems.Entities:
Mesh:
Year: 2013 PMID: 23555932 PMCID: PMC3610870 DOI: 10.1371/journal.pone.0060224
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1General features of N. spumigena CCY9414.
A. Photomicrograph of N. spumigena CCY9414 trichomes. The arrows point to heterocysts. The vertical bar corresponds to 40 µm. B. Phylogenetic position of N. spumigena CCY9414 (boxed) within the cyanobacterial phyum, based on its two 16S rRNA sequences (labeled a and b). The two sub-clades within the Nostocales, clade I and clade II, are indicated. Species for which a total genome sequence is publicly available, are in blue. The sequence of Chlorobium tepidum TLS served as outgroup. The numbers at nodes refer to bootstrap support values (1000 repetitions) if >60%. The phylogenetic tree was generated using the Minimum Evolution method within MEGA5 [158]. The optimal tree with the sum of branch length = 0.85445647 is shown. The tree is drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree and are given in the number of base substitutions per site. The multiple sequence alignment was shortened to a total of 1407 positions in the final dataset to include also 16S rRNA sequences from species without a genome sequence.
General genome and annotation information.
|
| |
| Genome Length | 5,462,271 |
| Scaffolds | 1 |
| Avg. Contig Size | 26,308 |
| Genomic GC% | 42.00% |
| Genes | 5,294 |
| Coding% | 80.00% |
| tRNA Count | 48 |
| rRNA Count | 8 |
| Introns | 1 |
| sRNAs | 4 |
| Inteins | 2 |
| Sigma factors | 8 |
| Total number of TSS | 6,519 |
| Number of gTSS | 1,628 |
| Number of aTSS | 2,084 |
| Number of iTSS | 2,043 |
| Number of nTSS | 764 |
For information on non-ubiquitious sRNAs, see Table 3.
Selected non-coding RNA elements mentioned in the text or ubiquitious among bacteria.
| Gene | Product | From | To | S | Comment | Reference |
|
| tmRNA | 3524428 | 3524830 | + |
| |
|
| 6S RNA | 3750385 | 3750114 | - | single TSS located in |
|
|
| scRNA, 10S RNA | 2912155 | 2912257 | + | ||
|
| RNAse P RNA subunit | 5025615 | 5025994 | + | ||
|
| Yfr2 | 460195 | 460090 | - |
| |
|
| NsiR3 | 2888944 | 2888814 | - | NtcA binding site conserved |
|
The respective gene name is given together with the sRNA product, the location, orientation (S; +, forward strand; -, reverse strand), comments and references.
Figure 2Classification of the predicted N. spumigena CCY9414 proteome.
A. Comparison of all predicted proteins of N. spumigena (N_spumi) against the proteomes of other well-studied Nostocales, Nostoc punctiforme sp. PCC 73102 (N_punct), Anabaena variabilis sp. ATCC 29413 (A_var) and Anabaena PCC 7120 (N_7120) based on MCL clustering of BLASTp results (minimum e-value: 10-8). The numbers refer to the number of protein clusters in each category, the numbers in brackets to the total number of individual proteins. B. Taxonomic top hits for the 1,098 N. spumigena CCY9414 singletons from part A (Table S3) visualized by MEGAN.
The 50 gTSS of protein-coding genes associated with the highest number of reads.
| Position | S | Reads | ID | Gene | Annotation |
| 512315 | + | 4058097 |
|
| photosystem II protein D1 |
| 1911567 | + | 420026 |
|
| RNA-binding protein |
| 5407423 | - | 404779 |
|
| RNA-binding protein |
| 818076 | + | 254679 |
|
| SSU ribosomal protein S16p |
| 963628 | + | 194603 |
|
| hypothetical protein |
| 4901855 | + | 186690 |
|
| LSU ribosomal protein L32p |
| 3168087 | + | 168603 |
|
| SSU ribosomal protein S14p (S29e) |
| 429606 | + | 161747 |
|
| unknown protein |
| 2033905 | + | 140335 |
|
| Translation initiation factor 3, TSS2 |
| 1997768 | + | 135635 |
|
| putative sodium-dependent bicarbonate transporter |
| 25246 | + | 108746 |
|
| hypothetical protein |
| 125437 | + | 74766 |
|
| unknown protein |
| 818072 | + | 69294 |
|
| SSU ribosomal protein S16p |
| 3595360 | + | 67550 |
|
| photosystem II protein D1 (PsbA) |
| 2100481 | − | 66076 |
|
| hypothetical protein |
| 490455 | + | 61106 |
|
| Photosystem II protein |
| 1301909 | − | 58894 |
|
| Serine hydroxymethyltransferase (EC 2.1.2.1) |
| 3327951 | + | 54540 |
|
| DnaJ-class molecular chaperone |
| 4211981 | − | 54170 |
|
| CAB/ELIP/HLIP superfamily |
| 4066960 | + | 52955 |
|
| Possible carbon dioxide concentrating mechanism protein CcmK |
| 3743395 | − | 51014 |
|
| Branched-chain amino acid permeases, DUF4079 |
| 2477247 | + | 48670 |
|
| Phycobilisome rod-core linker polypeptide, phycocyanin-associated |
| 2126456 | + | 46218 |
|
| Acyl carrier protein |
| 3427650 | − | 45498 |
|
| hypothetical protein |
| 4477663 | − | 45304 |
|
| Phosphoribosylformylglycinamidine synthase, PurS subunit (EC 6.3.5.3) |
| 4875128 | − | 44715 |
|
| Peroxiredoxin |
| 3279390 | + | 43459 |
|
| Light-independent protochlorophyllide reductase iron-sulfur ATP-binding protein ChlL |
| 308174 | − | 39899 |
|
| hypothetical protein |
| 4424601 | + | 36350 |
|
| Putative subunit N of NAD(P)H:quinone oxidoreductase |
| 3660323 | −- | 35393 |
|
| Peptidase M23B precursor |
| 2945315 | − | 34180 |
|
| FIG00871618: hypothetical protein |
| 2963323 | + | 33480 |
|
| Thioredoxin |
| 2609225 | − | 33225 |
|
| Ribulose bisphosphate carboxylase large chain (EC 4.1.1.39) |
| 948959 | − | 32821 |
|
| LSU ribosomal protein L19p |
| 2961668 | − | 32694 |
|
| Transposase, OrfB family |
| 1827328 | + | 32133 |
|
| Fructose-1,6-bisphosphatase, GlpX type (EC 3.1.3.11)/Sedoheptulose-1,7-bisphosphatase (EC 3.1.3.37) |
| 1563165 | − | 32001 |
|
| putative regulator of phycobilisome association C |
| 4656026 | − | 31933 |
|
| Geranylgeranyl reductase (EC 1.3.1.83) |
| 1811000 | + | 30834 |
|
| DNA-directed RNA polymerase omega subunit |
| 4154749 | − | 30790 |
|
| hypothetical protein |
| 2033866 | + | 29710 |
|
| Translation initiation factor 3, TSS1 |
| 2898416 | + | 29643 |
|
| hypothetical protein |
| 3831169 | − | 29380 |
|
| hypothetical protein |
| 4611031 | + | 29275 |
|
| LSU ribosomal protein L35p |
| 175701 | + | 29188 |
|
| hypothetical protein |
| 4907404 | + | 29084 |
|
| Protein CP12, regulation of Calvin cycle |
| 4319669 | - | 29012 |
|
| hypothetical protein |
| 3648306 | − | 28409 |
|
| S-adenosylmethionine synthetase (EC 2.5.1.6) |
| 472034 | + | 27553 |
|
| Ubiquinone biosynthesis monooxygenase UbiB |
| 3264182 | − | 25978 |
|
| hypothetical protein |
For each gTSS, the position with respect to the forward strand, the orientation (S), the number of reads, the gene ID, gene name (if known) and gene annotation is given. The gTSS are ordered according to the number of reads.
Selected proteins of heterocyst differentiation, pattern formation and nitrogen assimilation in N. spumigena CCY9414.
| Category/Protein |
|
| ||
| NsORF | TSS | Identity | e-value | |
|
| ||||
| NtcA§ |
| 254230R | 99% | e-121 |
| HanA, HupB |
| - | - | - |
| HetR§ |
| 1713443R | 91% | e-162 |
| HetF |
| 2206991F | 61% | 0 |
| HetC |
| - | - | - |
| HetL | - | - | - | - |
| HetP (alr2818)§ |
| - | 69% | 2e-046 |
| NrrA§* |
| 1819958R | 92% | 1e-134 |
| HetZ* |
| 4037960F | 90% | 1e-173 |
|
| ||||
| PatS |
| - | 82% | 0.011 |
| PatA* |
| - | 61% | 1e-130 |
| PatB |
| - | 81% | 0 |
| PatN*§ |
| 1277587F, 1277705F | 71% | 5e-084 |
| HetN |
| - | - | - |
|
| ||||
| GlnA§* |
| 1656268F, 1656364F, 1656438F | 88% | 0 |
| GifA |
| 1658460R | 79% | 3e-025 |
| NblA§ |
| 4524908F | 92% | 1e-029 |
The protein names are given, followed by the ORF ID in N. spumigena CCY9414 (NsORF), the position of the TSS (F, forward or R, reverse strand), the% ID and e-value in a pairwise alignment with the orthologs from Anabaena PCC 7120. Only amino acid identities ≥60% were considered; (-) not detected; *gene is associated with an antisense RNA in in N. spumigena CCY9414; §gene is associated with multiple TSS in Anabaena PCC 7120.
Proteins related to the uptake of iron in N. spumigena CCY9414 identified on the basis of gene clusters present in the genome of Anabaena PCC 7120.
| ORF | NsORF | Annotation | % ID | Reference |
|
| ||||
|
|
| Iron(III) dicitrate transport system, periplasmatic Iron(III) dicitrate transport system, periplasmatic | 36, 32 | |
|
|
| TonB-dependent receptor; Outer membrane Ferrichrome receptor | 50, 25 | |
|
|
| TonB-dependent receptor; Outer membrane Ferrichrome-iron receptor, Ferrichrome-iron receptor | 47, 38, 29 | |
|
| ||||
|
|
| TonB-dependent receptor; Outer membrane Ferrichrome-iron receptor, Ferrichrome-iron receptor | 46, 27, 25 | |
|
|
| iron(III) dicitrate-binding periplasmic protein | 38, 38 | |
|
|
| Ferrichrome-iron receptor | 45, 38 | |
|
|
| Iron(III) dicitrate transport system, periplasmic proteins, Ferric hydroxamate ABC transporter | 37, 32, 37 | |
|
|
| Iron(III) dicitrate transport system, periplasmic proteins | ||
|
| ||||
|
|
| TonB-dependent receptor; Outer membrane, Ferrichrome-iron receptor, Ferrichrome-iron receptor | 47, 37, 24 |
|
|
|
| Acetylornithine aminotransferase | 29 |
|
|
|
| Cysteine desulfurase | 26 |
|
|
| ||||
|
|
|
| ||
|
|
| Ferric hydroxamate ABC transporter, ABC-type Fe3+-siderophore transport system, ATP-binding | 89, 50 |
|
|
|
| Ferric hydroxamate ABC transporter Iron(III) dicitrate transport system, periplasmic binding | 83,30 |
|
|
|
| Ferric hydroxamate ABC transporter, ABC-type Fe3+-siderophore transport system, permeases, Nsp11950 and 11940 probably 1 ORF | 85, 37, 48 |
|
|
| ||||
|
|
| TonB-dependent receptor; Outer membrane Ferrichrome-iron receptor, Ferrichrome-iron receptor | 78, 27, 24 | |
|
|
| hypothetical protein | 47 | |
|
|
| Iron(III) dicitrate transport system, periplasmic binding protein | 80, 36 |
|
|
|
| ABC-type Fe3+-siderophore transport system, ATP-binding Ferric hydroxamate ABC transporter | 71, 52 |
|
|
|
| ABC-type Fe3+-siderophore transport system, permeases, Ferric hydroxamate ABC transporter | 67, 36, 43 |
|
|
|
| ABC-type Fe3+-siderophore transport system, permeases, Ferric hydroxamate ABC transporter | 61, 37, 34 |
|
|
| ||||
|
|
| Transcriptional regulator, AraC family | 38 | |
|
|
| Ferrichrome-iron receptor, Ferrichrome-iron receptor, TonB-dependent receptor | 50, 40, 41 | |
|
|
| hypothetical protein | 47 | |
|
|
| Iron(III) dicitrate transport system, periplasmic binding proteins, Ferric hydroxamate ABC transporter | 28, 27, 37 | |
|
|
| Transcriptional regulator, AraC family | 38 | |
|
|
| Ferrichrome-iron receptor, Ferrichrome-iron receptor, TonB-dependent receptor | 46, 38,26 | |
|
|
| Iron(III) dicitrate transport system, binding periplasmic proteins | 38, 35 | |
|
| ||||
|
|
| Transcriptional regulator, AraC family | 40 | |
|
|
| Ferrichrome-iron receptor, Ferrichrome-iron receptor, TonB-dependent receptor | 49, 38, 27 | |
|
|
| Iron(III) dicitrate transport system, periplasmic binding proteins | 29, 26 | |
|
| ||||
|
|
| ABC-type Fe3+-siderophore transport system, permease proteins | 42, 37 |
|
|
|
| ABC-type Fe3+-siderophore transport system, ATP-binding, Ferric hydroxamate ABC transporter | 46, 44 |
|
|
|
| TonB-dependent receptor, Ferrichrome-iron receptor | 25, 22 |
|
|
|
| Iron(III) dicitrate transport system, periplasmic binding protein | 24 |
|
|
| ||||
|
|
| hypothetical protein | 38 |
|
|
|
| Iron(III) dicitrate transport system, periplasmic binding protein | 23 |
|
|
|
| ABC-type Fe3+-siderophore transport system, permease proteins, Ferric hydroxamate ABC transport | 32, 28, 31 |
|
|
|
| ABC-type Fe3 +-siderophore transport system, ATP-binding | 33 |
|
|
| ||||
|
|
| Calcium-dependent protease precursor | 32 |
|
|
|
| Ferric iron-binding periplasmic proteins of ABC transporter | 57, 49 |
|
|
|
| Ferric iron ABC transporter, permease protein | 64 |
|
|
|
| Iron(III)-transport ATP-binding protein | 60 |
|
The first row contains the ORF ID, annotation and gene name (if available) of the respective Anabaena protein (according to the published sequence [163] in Genbank file NC_003272.1), followed by the ORF ID in N. spumigena CCY9414 (NsORF), the detailed annotation, the% ID in a pairwise alignment and the reference, if available.
Figure 3Analysis of loci encoding proteins of the CP43/IsiA/Pcb family.
A. Organization of the chromosomal region harboring the isiA and psbC-like genes (psbC-lk1-3) of N. spumigena and the separate psbDC operon. The PsaL–coding domain in psbC-lk2 (nsp37500) is highlighted in orange. B. Phylogenetic analysis of CP43, IsiA and related chlorophyll-binding proteins from N. spumigena and of selected other cyanobacteria was inferred using the Minimum Evolution method. The optimal tree with the sum of branch length = 3,97009738 is shown. The percentage of replicate trees in which the associated taxa clustered in the bootstrap test (1000 replicates) are shown next to the branches. The tree is drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree. The evolutionary distances were computed using the Poisson correction method and are in the units of the number of amino acid substitutions per site. All positions containing gaps and missing data were eliminated from the dataset (complete deletion option). There were a total of 279 positions in the final dataset. C. Transcriptional organization around the isiA, isiB and psbC-like gene cluster. There are three mapped TSS in the region displayed in Fig. 3A, all associated with or close to the 5′ end of nsp37510. TSS are indicated by blue arrows and the number of cDNA reads associated with them are given as approximation for their activity. One gTSS gives rise to the 83 nt long 5′ UTR upstream of nsp37510 (blue) and the gene or operon mRNA. An antisense RNA originates from a single aTSS in the opposite direction (purple). The third TSS is a putative nTSS driving the transcription of an ncRNA in the nsp37510- nsp37520 intergenic spacer. Except for the nsp37510 5′ UTR, all TSS displayed are drawn with a 100 nt-long box that corresponded to the maximum read length in the dRNAseq approach.
Complement of P- and arsenate-related gene orthologs in N. spumigena CCY9914.
| NsORF, gene name | Annotation | comment | Reference |
|
| |||
|
| low affinity P permease | ||
|
| low affinity P permease | ||
|
| freshwater | similar to | |
|
| periplasmic P binding protein PstS | similar to |
|
|
| PstC component of high affinity ABC P transporter |
| |
|
| PstA component of high affinity ABC P transporter |
| |
|
| PstB component of high affinity ABC P transporter ATP-binding protein component |
| |
|
| PstB component of high affinity ABC P transporter ATP-binding protein component |
| |
|
| periplasmic P binding protein PstS | similar to |
|
|
| PstC component of high affinity ABC P transporter |
| |
|
| PstA component of high affinity ABC P transporter |
| |
|
| PstB component of high affinity ABC P transporter ATP-binding protein component |
| |
|
| |||
|
| PhnF component of a C-P lyase | ||
|
| PhnG component of a C-P lyase | ||
|
| PhnH component of a C-P lyase | ||
|
| PhnI component of a C-P lyase | ||
|
| PhnJ component of a C-P lyase | ||
|
| PhnK component of a C-P lyase | ||
|
| PhnL component of a C-P lyase | ||
|
| PhnM component of a C-P lyase | ||
|
| hypothetical protein in | ||
|
| hypothetical protein in | ||
|
| PhnD component of phosphonate ABC transporter phosphate-binding periplasmic component | ||
|
|
| ||
|
| PhnE Phosphonate ABC transporter permease protein | ||
|
|
| ||
|
|
| ||
|
| PhnD1 Phosphonate ABC transporter phosphate-binding periplasmic component | ||
|
| PhnE1 Phosphonate ABC transporter permease protein | ||
|
| PhnH | truncated version, translationally coupled to | |
|
| PhnD2 Phosphonate ABC transporter phosphate-binding periplasmic component | ||
|
|
| ||
|
| PhnE4 Phosphonate ABC transporter permease protein | ||
|
| |||
|
|
| ||
|
|
| ||
|
|
| ||
|
| phosphite dehydrogenase | ||
|
|
| consistent with the operon structure of the characterised |
|
|
| |||
|
| glycerol-3-phosphate ATP-binding protein component | no other components of the | |
|
| |||
|
| PhoH family protein | ||
|
| |||
|
| polyphosphate kinase | ||
|
| inorganic pyrophosphatase | ||
|
| exopolyphosphatase | ||
|
| |||
|
| glycerophophoryl diester phosphodiesterase | contains also a potential phytase domain | |
|
| atypical alkaline phosphatase | akin to those present in several cyanobacteria | |
|
| DedA-like phosphatase | ||
|
| alkaline phosphatase | ||
|
| PhoX-like phosphatase | ||
|
| putative PhoX phosphatase | ||
|
| COG4246 superfamily | sometimes annotated as a phytase | |
|
| Metallophophoesterase | ||
|
| Metallophosphoesterase | ||
|
| predicted phosphatase | ||
|
| metal dependent PHP family phosphoesterase | ||
|
| acid phosphatase | ||
|
| metallophosphoesterase (GlpQ-like) | ||
|
| PhoD-like phosphatase | ||
|
| |||
|
| ArsA | ||
|
| ArsA | ||
|
| ArsA | ||
|
| regulator of arsenate resistance | ||
|
| SphX periplasmic P binding component of P ABC transporter | ||
|
| glyceraldehyde-3-phosphate dehydrogenase | ||
|
| major facilitator superfamily permease | ||
|
| Acr3 (ArsB) Arsenical-resistance protein ACR3 | ||
|
| ArsH Arsenic-resistance protein | ||
|
| ArsC-family protein | ArsC similarity not obvious | |
|
| |||
|
| HAD-superfamily hydrolase | ||
|
| HAD-superfamily hydrolase | ||
|
| HAD-superfamily hydrolase | ||
|
| glycoside hydrolase/HAD-superfamily hydrolase | ||
|
| |||
|
| PhoB (SphR) Response regulator | ||
|
| PhoR (SphS) sensor kinase | ||
|
| PhoU putative negative regulator of the Pi regulon | ||
Figure 4Gene clusters for secondary metabolite biosyntheses in N. spumigena CCY9414.
The assignment of the gene products to non-ribosomal peptide synthetases (NRPS) or polyketide synthases (PKS) is indicated by red and green colour, respectively. Genes encoding putative tailoring proteins are indicated in black. The classification of PKS into iterative and modular PKS is shown in the subtitle of each gene cluster. For characterized gene clusters product names were included in the subtitles. Related gene clusters if present in the database are shown with their gene annotations and strain names above each gene cluster. Substrate specificities as predicted by http://www.nii.res.in/nrps-pks.html are shown underneath each NRPS or PKS gene containing substrate activating domains. Question marks indicate domains with unclear substrate specificities. The numbers in the second line below the gene clusters relate to the gene numbers in N. spumigena CCY9414.