Literature DB >> 26380634

High quality draft genome sequence of Flavobacterium rivuli type strain WB 3.3-2(T) (DSM 21788(T)), a valuable source of polysaccharide decomposing enzymes.

Richard L Hahnke1, Erko Stackebrandt1, Jan P Meier-Kolthoff1, Brian J Tindall1, Sixing Huang1, Manfred Rohde2, Alla Lapidus3, James Han4, Stephan Trong4, Matthew Haynes4, T B K Reddy4, Marcel Huntemann4, Amrita Pati4, Natalia N Ivanova4, Konstantinos Mavromatis4, Victor Markowitz5, Tanja Woyke4, Markus Göker1, Nikos C Kyrpides6, Hans-Peter Klenk7.   

Abstract

Flavobacterium rivuli Ali et al. 2009 emend. Dong et al. 2013 is one of about 100 species in the genus Flavobacterium (family Flavobacteriacae, phylum Bacteroidetes) with a validly published name, and has been isolated from the spring of a hard water rivulet in Northern Germany. Including all type strains of the genus Myroides and Flavobacterium into the 16S rRNA gene sequence phylogeny revealed a clustering of members of the genus Myroides as a monophyletic group within the genus Flavobacterium. Furthermore, F. rivuli WB 3.3-2(T) and its next relatives seem more closely related to the genus Myroides than to the type species of the genus Flavobacterium, F. aquatile. The 4,489,248 bp long genome with its 3,391 protein-coding and 65 RNA genes is part of the G enomic E ncyclopedia of B acteria and A rchaea project. The genome of F. rivuli has almost as many genes encoding carbohydrate active enzymes (151 CAZymes) as genes encoding peptidases (177). Peptidases comprised mostly metallo (M) and serine (S) peptidases. Among CAZymes, 30 glycoside hydrolase families, 10 glycosyl transferase families, 7 carbohydrate binding module families and 7 carbohydrate esterase families were identified. Furthermore, we found four polysaccharide utilization loci (PUL) and one large CAZy rich gene cluster that might enable strain WB 3.3-2(T) to decompose plant and algae derived polysaccharides. Based on these results we propose F. rivuli as an interesting candidate for further physiological studies and the role of Bacteroidetes in the decomposition of complex polymers in the environment.

Entities:  

Keywords:  Aerobic; Bacteroidetes; Carbohydrate active enzyme; Flavobacteriaceae; GEBA-KMG I; Gram-negative; Hard water rivulet; Myroides; Non-motile; Polysaccharide utilization loci

Year:  2015        PMID: 26380634      PMCID: PMC4572689          DOI: 10.1186/s40793-015-0032-y

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Strain WB 3.3-2T (=DSM 21788T = CIP 109865T) is the type strain of [1, 22]. The genus , the type genus [12, 36] of the family [13], was proposed in the first edition of Bergey’s Manual of Determinative Bacteriology in 1923 [10]. have been isolated from soil, freshwater, marine and saline environments [13]. However, members of the Cytophaga/Flavobacteria group have been found with greater abundances in rivers and oceans [39], which was attributed to their important role in the decomposition of algal-derived organic matter [24, 39, 70]. WB 3.3-2T has been isolated from a hardwater rivulet in the Harz Mountains, Germany [17]. Therefore, we selected the freshwater strain WB 3.3-2T as a candidate for comparing its polysaccharide decomposition potential with the one of marine . Here we present the set of carbohydrate active enzymes, polysaccharide utilization loci and peptidases of strain WB 3.3-2T, together with a summary of its present classification, the set of known phenotypic features and a description of the permanent draft genome sequencing and annotation derived from a culture of strain DSM 21788T.

Organism information

Classification and features

The draft genome of DSM 21788T (ARKJ00000000) has one full-length 16S rRNA gene sequence (Q765_20790, 1415 bp) and one partial 16S rRNA gene sequence (Q765_20790, 594 bp) which were both 100 % identical with the sequence from the original species description (AM934661, NR_115084) [1]. BLAST search revealed the presence of a closely related strain CH1-10 (JX971542, 98.4 %) from a mushroom, two closely related (98.5 %) clone sequences from floor dust (FM872607, FM872591) [69], and two clone sequences from human skin (HM274288, HM269957, 98.2 %). The next related species was subsaxonensis WB 4.1-42T [1], whereas other affiliations are poorly supported (Fig. 1). In contrast to the original affiliation with the genus , WB 3.3-2T belongs to a group of species which seem more closely related to the genus [71] than to the type species of , [10, 15, 29] (Fig. 2). However, the backbone of the 16S rRNA gene phylogenetic tree is essentially unresolved. A summary of the classification and general features of F. rivuli WB 3.3-2T is shown in Table 1. Cells of strain WB 3.3-2T are Gram-negative, aerobic to microaerobic, non-motile (flagella are absent) and non-gliding, catalase- and oxidase-positive 0.4–0.6 × 1.5–2.0 μm rods which produce extracellular polymeric substances (EPS) (Fig. 3). Colonies are pearl-white on R2A and CY agars and yellow on TSA and NA agars. Flexirubin pigments are absent. Sparse growth occurs between 4 and 8 °C and no growth was observed above 29 °C; the growth optimum is between 16 and 24 °C. Growth occurs between pH 6.4 and 7.8 (optimum 7.0) and at NaCl concentrations between 0 and 2 % (w/v) with an optimum at 1 % (w/v). Nitrate reduction is negative. The strain hydrolyses aesculin, cellobiose, glycogen, starch, Tween 40 and Tween 80, but not alginate, caseine, cellulose, chitin, DNA and pectin. The tests for β-galactosidase and acid phosphatase are strongly positive. Other physiological properties are available for the API ZYM and API 20NE systems (bioMérieux) and the GN MicroPlate system (Biolog) substrate panels [1]. Maltose and other carbohydrates are assimilated. Properties that can be used for the differentiation from the closely related type strain of are, according to the substrates provided by the GN MicroPlate, positive utilization of acetic acid, α-d-lactose, trehalose and Tween 40, and lack of utilization of l-alanine, l-fucose, α-ketobutyric acid, dl-lactic acid, methyl ß-d-glucoside, l-ornithine, l-rhamnose and l-serine.
Fig. 1

Phylogenetic tree of the genus Flavobacterium and its most closely related genus Capnocytophaga. The tree was inferred from 1,254 aligned characters of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion as previously described [34]. The sequences were aligned using poa [45] and the resulting alignment restricted to its conserved part using Gblocks [20]. The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates (left) and from 1,000 maximum-parsimony bootstrap replicates (right) if larger than 60 % [34]. Acccession numbers of 16S rRNA gene sequences are listed in Acccession numbers of 16S rRNA gene sequences are listed in Additional file 1: suppl. Table 6

Fig. 2

Histogram showing the distribution of pairwise SSU similarities of the type species Flavobacterium aquatile with respect to all other 119 strains in the dataset. Except the genus Myroides, all genera are clearly segregated from each other. Pairwise SSU similarities were calculated using the recommended approach described in [55]. Bars are colored according to genus affiliation. The figure was visualized using the ggplot package [72] for the R statistical framework [63]. Acccession numbers of 16S rRNA gene sequences are listed in suppl. Table 6

Fig. 3

Scanning electron micrograph of F. rivuli WB 3.3-2T (DSM 21788T) showing expression of extracellular polymeric substances, EPS (arrows)

Phylogenetic tree of the genus Flavobacterium and its most closely related genus Capnocytophaga. The tree was inferred from 1,254 aligned characters of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion as previously described [34]. The sequences were aligned using poa [45] and the resulting alignment restricted to its conserved part using Gblocks [20]. The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates (left) and from 1,000 maximum-parsimony bootstrap replicates (right) if larger than 60 % [34]. Acccession numbers of 16S rRNA gene sequences are listed in Acccession numbers of 16S rRNA gene sequences are listed in Additional file 1: suppl. Table 6 Histogram showing the distribution of pairwise SSU similarities of the type species Flavobacterium aquatile with respect to all other 119 strains in the dataset. Except the genus Myroides, all genera are clearly segregated from each other. Pairwise SSU similarities were calculated using the recommended approach described in [55]. Bars are colored according to genus affiliation. The figure was visualized using the ggplot package [72] for the R statistical framework [63]. Acccession numbers of 16S rRNA gene sequences are listed in suppl. Table 6 Scanning electron micrograph of F. rivuli WB 3.3-2T (DSM 21788T) showing expression of extracellular polymeric substances, EPS (arrows)

Chemotaxonomic data

Major fatty acids (>5 % of total) are i-C15:0, ai-C15:0, C16:0, C16:0 3-OH, iso-C17 : 0 3-OH and, as main component, summed feature C16 : 1 ω7c and/or iso-C15 : 0 2-OH [1]. Although the original publication indicates that “summed feature 3” is present (C16 : 1 ω7c and/or iso-C15 : 0 2-OH) and is generally explained as “summed features are groups of two or three fatty acids that cannot be separated by GLC using the MIDI System” this is a misrepresentation of information provided by MIDI Inc as well as a failure to further inspect the final results. Re-examination of the original data held in the DSMZ indicates that a single peak is present with an ECL of 15.819, coinciding with the ECL of C16 :1 ω7c in the MIDI Sherlock TSBA40 peak naming table, indicating that C16:1 ω7c is present and iso-C15:0 2-OH is absent. While these differences may appear trivial this information can be linked back to the enzymes (their encoding genes) and biosynthetic pathways leading to the synthesis of these two very different fatty acids as has been pointed out previously by [57, 58]. No data are available on respiratory quinone, peptidoglycan, polar lipid, polyamine and whole-cell sugar composition. The DNA G + C content of the type strain was previously determined as 40.4 mol% [1].

The genera Flavobacterium and Myroides

Figures 1 and 2 give an overview of the phylogenetic relationships of members of the genus based on the comparison of 16S rRNA gene sequences (see list in Additional file 1: Table S1). In addition members of the genus are included and members of the genus and are used as outgroups. Members of the genera and form a monophyletic group, but the division of that monophyletic group to produce a monophyletic group including all members of the genus does not result in members of the genus forming a monophyletic group. In such cases the genus may be divided into several monophyletic groups or the group representing members of the genus and may be described as being paraphyletic. If a genus is to be composed of species that constitute a monophyletic group then the present data suggest at least two alternatives. If one retains the genus as a monophyletic group then the division of the genus into several monophyletic groups may need closer investigation, potentially resulting in the creation of several new genera. Alternatively, the fact that a monophyletic group is recovered that includes members of both the genera and may be indicative of the inclusion of members of both taxa in a single genus, where the genus name Bergey et al. [10] has priority over the genus name Myroides Vancanneyt et al. [44, 71]. The type species of the genus , (Stutzer [68]) Vancanneyt et al. [71] was originally named Stutzer [68], i.e. the two names are homotypic synonyms. The lowest 16S rRNA gene sequence pairwise similarity values between the type strain of the type species of the genus , and other type strains of species considered to be members of the genus is 92-93 %, close to the 16S rRNA gene sequence pairwise similarity value of 92 % to the type strain of the type species of , M. odoratum.

Genome sequencing information

Genome project history

DSM 21788T was selected for sequencing on the basis of its phylogenetic position [35, 40], and is part of Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes project [43], a follow-up of the Genomic Encyclopedia of Bacteria and Archaea pilot project [74], which aims at increasing the sequencing coverage of key reference microbial genomes and to generate a large genomic basis for the discovery of genes encoding novel enzymes [61]. KMG-I is the first of the production phases of the “Genomic Encyclopedia of Bacteria and Archaea: sequencing a myriad of type strains initiative a [42] and a Genomic Standards Consortium project [27]. The genome project is deposited in the Genomes OnLine Database [59] and the permanent draft genome sequence is deposited in GenBank. Sequencing, finishing and annotation were performed by the DOE Joint Genome Institute (JGI) using state-of-the-art sequencing technology [49]. A summary of the project information is shown in Table 2.
Table 2

Genome sequencing project information

MIGS IDPropertyTerm
MIGS-31.1Sequencing qualityLevel 2: High-Quality Draft
MIGS-28.1Libraries methodIllumina Std shotgun library
MIGS-28.2Reads count14,972,538 sequencing reads
MIGS-29Sequencing methodIllumina HiSeq 2000,
MIGS-31.2Fold coverage124.1x
MIGS-30Assembly methodVelvet v. 1.1.04; ALLPATHS v. r41043
MIGS-32Gene calling methodProdigal, GenePRIMP, IMG-ER
NCBI project ID182404
Genbank IDARKJ00000000
Genbank Date of Release16-SEP-2013
IMG object ID2519103183
GOLD IDGi11501
MIGS-13Source Material IdentifierDSM 21788
Project relevanceTree of Life, GEBA-KMG
Classification and general features of F. rivuli WB 3.3-2T in accordance with the MIGS recommendations [26], as developed by [25], List of Prokaryotic names with Standing in Nomenclature [23] and the Names for Life database [31] Evidence codes - IDA, Inferred from Direct Assay (first time in publication), TAS traceable author statement (i.e., a direct report exists in the literature), NAS, non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [5] Genome sequencing project information

Growth Conditions and genomic DNA preparation

A culture of DSM 21788T was grown aerobically in DSMZ medium 830 [4] at 20 °C. Genomic DNA was isolated using Jetflex Genomic DNA Purification Kit (GENOMED 600100) following the standard protocol provided by the manufacturer but modified by an incubation time of 60 min. the incubation on ice overnight on a shaker, the use of additional 50 μl proteinase K, and the addition of 200 μl protein precipitation buffer. DNA is available from DSMZ through the DNA Bank Network [32].

Genome sequencing and assembly

The draft genome of DSM 21788T was generated using the Illumina technology [9]. An Illumina Std. shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 14,972,538 reads totaling 2,245.9 Mbp (Table 3). All general aspects of library construction and sequencing performed at the JGI can be found at [21]. All raw sequence data were passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, Copeland A, Han J, DUK. Unpublished). Following steps were performed for assembly: (1) filtered reads were assembled using Velvet [77], (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim [46], (3) Sequence reads were assembled with simulated read pairs using Allpaths–LG [33]. Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: –very clean yes –export- Filtered yes –min contig lgth 500 –scaffolding no –cov cutoff 10) 2) wgsim (–e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs: PHRED 64 = 1 PLOIDY = 1 FRAG COVERAGE = 125 JUMP COVERAGE = 25 LONG JUMP COV = 50, RunAllpathsLG: THREADS = 8 RUN = std shredpairs TARGETS = standard VAPI WARN ONLY = True OVERWRITE = True). The final draft assembly contained 26 contigs in 23 scaffolds, with three contigs shorter than the threshold used to generate Table 3. The total size of the genome is 4.5 Mbp and the final assembly is based on 560.1 Mbp of data, which provides a 124.1x average coverage of the genome.
Table 3

Genome statistics

AttributeNumber% of Total
DNA, total number of bases4489248100.0
DNA coding number of bases398139988.7
DNA G + C number of bases177775839.6
DNA scaffolds23100.0
Genes total number4056100.0
Protein coding genes399198.4
RNA genes651.6
rRNA genes80.2
5S rRNA50.1
16S rRNA1<0.1
23S rRNA20.1
tRNA genes481.2
Other RNA genes90.2
Protein coding genes with function prediction284270.1
without function prediction114928.3
Protein coding genes with COGs257063.4
Protein coding genes with Pfam292472.1
Protein coding genes coding signal peptides65416.1
Protein coding genes coding transmembrane proteins90622.3
CRISPR repeats0
Genome statistics

Genome annotation

Genes were identified using Prodigal [37] as part of the DOE-JGI genome annotation pipeline [49], followed by manual curation using the JGI GenePRIMP pipeline [60]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) non-redundant database, UniProt, TIGR-Fam, Pfam, PRIAM, KEGG, COG, and InterPro database. These data sources were combined to assert a product description for each predicted protein. Additional gene prediction analysis and functional annotation was performed within the Integrated Microbial Genomes-Expert Review (IMG-ER) platform [48].

Genome properties

The assembly of the draft genome sequence consists of 23 scaffolds amounting to 4,489,248 bp. The G + C content is 39.6 % (Table 3) which is similar to the G + C content determined by Ali et al. [1] and is within the acceptable range for a microbial species [56]. Of the 4,056 genes predicted, 3,991 were protein-coding genes, and 65 RNAs. The majority of the protein-coding genes (70.1 %) were assigned a putative function while the remaining ones were annotated as hypothetical proteins. The distribution of genes into COGs functional categories is presented in Table 4.
Table 4

Number of genes associated with the general COG functional categories

CodeValue% ageDescription
J1525.4Translation, ribosomal structure and biogenesis
A10.1RNA processing and modification
K2097.4Transcription
L1384.9Replication, recombination and repair
B10.1Chromatin structure and dynamics
D210.8Cell cycle control, cell division, chromosome partitioning
Y00.0Nuclear structure
V491.7Defense mechanisms
T1776.3Signal transduction mechanisms
M2378.4Cell wall/membrane/envelope biogenesis
N100.4Cell motility
Z00.0Cytoskeleton
W00.0Extracellular structures
U501.8Intracellular trafficking, secretion, and vesicular transport
O1113.9Posttranslational modification, protein turnover, chaperones
C1525.4Energy production and conversion
G2017.2Carbohydrate transport and metabolism
E1997.1Amino acid transport and metabolism
F672.4Nucleotide transport and metabolism
H1244.4Coenzyme transport and metabolism
I1093.9Lipid transport and metabolism
P1334.7Inorganic ion transport and metabolism
Q481.7Secondary metabolites biosynthesis, transport and catabolism
R36012.8General function prediction only
S2619.3Function unknown
-148636.6Not in COGs
Number of genes associated with the general COG functional categories

Insights from the genome sequence

Comparative genomics

Here we present a brief comparative genomics analysis of DSM 21788T with a selection of its closest phylogenetic neighbour (according to Fig. 1), [1] (NZ_AUGP00000000), other potentially closely related species such as [66] (NZ_AUDM00000000) and [30] (NZ_JRLV00000000), as well as the genome of the type species of the genus , [10, 15, 29] (NZ_JRHH00000000). The genomes of these five sequenced type strains differ significantly in their size: 4.49 Mbp (see above), 3.8 Mbp, 3.19 Mbp, 4.63 Mbp and 3.49 Mbp. Since these genome sequences have not been sequenced completely yet, the final values might change slightly in future analyses based on complete genome sequences. An estimate of the overall similarity between and the other strains in this data set was generated with the Genome-to-Genome Distance Calculator (2.0) [6, 7, 53]. It calculates intergenomic distances by comparing two respective genomes to obtain HSPs (high-scoring segment pairs) and, afterwards, infers distances via a set of formulas (1, HSP length/total length; 2, identities/HSP length; 3, identities/total length). The GGDC also reports model-based DDH estimates (digital DDH or dDDH) along with their confidence intervals [53]. Since formula 2 is robust against the use of incomplete genome sequences (see above), it is especially suited for this type of analysis. The result of this comparison is shown in Table 5 and yields dDDH of below 22 % throughout, which underlines the expected status of distinct species, as inferred from the 16S rRNA sequence similarities. Consequently, with 21.3 % dDDH has the highest similarity to , whereas has the lowest similarity of 18.4 % dDDH. The comparison of with and reached the lowest value (2 %) regarding the average genome length covered with HSPs. This value was slightly increased (7 %) between and and clearly higher (31 %) with respect to , the closest related species according to Fig. 1. The identity within the HSPs was 77 % on average, whereas the identity over the whole genome was 24 % regarding the comparison of with , and, was even below 10 % regarding the remaining comparisons (Table 5).
Table 5

Pairwise comparison of F. rivuli with F. filum, F. subsaxonicum, F. beibuense and F. aquatile using the GGDC (Genome-to-Genome Distance Calculator). Digital DDH (dDDH) and the respective confidence intervals (C.I.) are specified for GGDC’s recommended formula 2

F. rivuli versus% dDDH% C.I. dDDHHSP length/total length [%]Identities/HSP length [%]Identities/total length [%]
F. aquatile 18.42.52761
F. beibuense 18.72.67766
F. filum 19.02.52771
F. subsaxonicum 21.32.9317924
Pairwise comparison of F. rivuli with F. filum, F. subsaxonicum, F. beibuense and F. aquatile using the GGDC (Genome-to-Genome Distance Calculator). Digital DDH (dDDH) and the respective confidence intervals (C.I.) are specified for GGDC’s recommended formula 2

Gliding motility

The gliding motility machinery among is composed of adhesion-like proteins, an ATP-binding cassette transporter, the PorS secretion system, and additional proteins, as described by McBride and Zhu [51]. In the genome of all genes necessary for gliding motility were identified (Table 6). However, adhesin-like proteins comparable to the ones of UW101 were not found, and gliding motility of was not observed previously [1].
Table 6

Gliding motility-related genes in strain DSM 21788T compared to genes in Flavobacterium strains studied by McBride and Zhu [51]

F. rivuli DSM 21788T F. psychrophilum JIP02/86T F. johnsoniae ATCC 17061T
Locus tag prefixF565_ RS01FPFjoh_
Gliding motility++
Adhesin-like
remA19590808
remB21171657
sprB00160979
ATP-binding cassette transporter
gldA0527002521516
gldF0076010892722
gldG0076510902721
Additional protein required for gliding
gldBa 1339020691793
gldC1338520681794
gldDa 1886516631540
gldE1886013581539
gldHa 1051500240890
gldJa 1184513891557
Peptidoprolyl isomerase (Flavobacteriia, protein folding)
gldI0818018922369
PorS secretion system (secretion of RemA/RemB and SprA/SprB)
gldKa 1860519731853
gldLa 1860019721854
gldMa 1859519711855
gldNa 1859019701856
sprAa 0606521211653
sprEa 1915024671051
sprTa 0547503261466

aessential gliding motility genes after McBride and Zhu [51]

Gliding motility-related genes in strain DSM 21788T compared to genes in Flavobacterium strains studied by McBride and Zhu [51] aessential gliding motility genes after McBride and Zhu [51]

Peptidases

The MEROPS [64] annotation was carried out by searching the sequences against MEROPS 9.10 (access date: 2014.10.16, version: pepunit.lib). processes 177 peptidases the majority of which were 59 metallo (M) and 89 serine (S) peptidases (Table 7 and Additional file 1: Table S2). Furthermore, the genome contained 22 I39, two I87 and one I71 simple peptidase inhibitors (Table 7 and Additional file 1: Table S3).
Table 7

Peptidases and simple peptidase inhibitors in the genome of strain DSM 21788T

Peptidase familyM01M03M12M13M14M16M19M20M23
Counts622283158
Peptidase familyM24M28M38M41M42M43M48M50M61
Counts226111111
Peptidase familyM75M79M90M96
Counts2112
Peptidase familyS01S08S09S11S12S14S16S24S26
Counts2531163351
Peptidase familyS33S41S46S49S51S54S66
Counts16631232
Peptidase familyC01C25C26C40C44C56C82
Counts1183441
Peptidase familyN11T02U32U73A08A28
Counts112111
Inhibitor familyI39I71I87
Counts2212
Peptidases and simple peptidase inhibitors in the genome of strain DSM 21788T

Carbohydrate active enzymes

The CAZyme annotation was a combination of RAPSearch2 search [75, 78] and HMMER scanning [28]. The RAPSearch2 database was created from the protein sequences listed at the CAZy website [18, 47] (access date: 2014.09.18) while the profile HMMs were downloaded from dbCAN [76] (version: dbCAN-fam-HMMs.txt.v3). The outputs of these two program runs were compared and only their intersections were kept (i.e., loci confirmed by both methods). In case of conflicting family assignments, the RAPSearch2 results were preferred. Overall, in its genome DSM 21788T possess a variety of carbohydrate active enzymes including 94 glycoside hydrolases (GH) belonging to 31 families, 11 carbohydrate binding modules (CBM) belonging to 7 families, 13 carbohydrate esterases (CE) belonging to 8 families, one polysaccharide lyase of family 11 (PL11) and 37 glycosyl transferases belonging to 11 families (Table 8 and Additional file 1: Table S4). The carbohydrate esterases CE2, CE6, CE7, CE12 might act as carboxylic-ester hydrolases (EC 3.1.1.-) and the carbohydrate esterases CE11, CE14 as linear amides (EC 3.5.1.-). The genome of strain DSM 21788T comprised a set of four GH5 and three GH51, for the potential hydrolysis of various cellulose or xylan polysaccharides. The absence of GH50, GH86 (agarose hydrolysis), GH18, GH19, GH20 (chitin hydrolysis) and a gene for alginate lyase (EC 4.2.2.3) corroborate the results of Ali et al. [1] that can not hydrolyze agarose, chitin and alginate, respectively. is equipped with one GH1, five GH5 and three GH30 as potential β-glucosidases and was shown to utilize cellobiose (d-Glc-β(1 → 4)-d-Glc) but not cellulose [1]. Gentobiose (d-Glc-β(1 → 6)-d-Glc) utilization and β-galactosidase activity was shown for [1] which has one GH1, fifteen GH2, eleven GH3 and one GH42 encoded in its genome. Starch was hydrolyzed by [1] presumably by enzyme activity of the four GH13 (α-amylase) and trehalose [1] by four GH13, one GH37, one GH65 (trehalase). The products of starch hydrolysis, maltose and d-glucose, can be utilized by [1]. Melibiose (d-Gal-α(1 → 6)-d-Glc) was metabolized by and α-galactosidase activity was confirmed [1], which might be mediated by the two GH27, two GH36 and five GH97.
Table 8

Carbohydrate active enzymes (CAZy) in the genome of strain DSM 21788T

CAZy familyGH1GH2GH3GH5GH13GH16GH23
Counts115114422
CAZy familyGH25GH27GH28GH29GH30GH31GH36
Counts1252352
CAZy familyGH37GH39GH42GH43GH51GH65GH73
Counts13111311
CAZy familyGH78GH88GH92GH95GH97GH105GH106
Counts1123542
CAZy familyGH127GH130GHa
Counts123
CAZy familyGT2GT4GT5GT9GT19GT20GT28
Counts131013111
CAZy familyGT30GT41GT51GTa
Counts1141
CAZy familyCBM2CBM10CBM13CBM32CBM35CBM50CBM57
Counts1111241
CAZy familyCE2CE4CE6CE7CE11CE12CE14
Counts1111132
CAZy familyCEa PL11
Counts31

agenes attributed to an enzyme class, but not to a family

Carbohydrate active enzymes (CAZy) in the genome of strain DSM 21788T agenes attributed to an enzyme class, but not to a family

Polysaccharide utilization loci

Members of flavobacteria were frequently found in aquatic habitats and play a pivotal role in the remineralization of complex organic matter [24, 39]. The coincidence of (i) a preference for polymeric substrates [39], (ii) the occurrence during algal blooms [62, 70] and (iii) the organization of genes involved in polysaccharide decomposition in polysaccharide utilization loci (PUL) [16, 67], suggests a specialization of members towards the utilization of complex organic matter. In DSM 21788T four PULs were identified consisting of a TonB-dependent receptor, a SusD-like protein and a series of carbohydrate active enzymes (Figs. 4, 5, 6 and 7). The synteny between the identified PULs and 40 currently available genomes were investigated using MultiGeneBlast [52]. Figure 4 shows one of the PULs being conserved between some strains from the genera , , and . Kabisch et al. [38] showed that proteins of the same PUL in ‘’ KT0803 were specifically expressed when grown on laminarin. The second PUL comprised of three glycosyl transferases, two GH5 and GH43 was found also in DSM 15936T and UW101 [50], but with an additional GH2 (Fig. 5). Two further PULs comprised combinations of GH2, GH3, GH31, GH97 and other glycoside hydrolases and were only partially identical with PULs of other members (Fig. 6a and b). These PULs potentially enable to decompose hemicellulose or xylose.
Fig. 4

Synteny between a potentially laminarin-specific PUL of F. rivuli DSM 21788T and other Flavobacteriaceae. Open circles indicate genes which were specifically expressed by ‘Gramella forsetii’ KT0803 when grown on laminarin, as shown by Kabisch et al. [38]. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. A description of glycoside hydrolase families (GH) can be seen at the CAZy homepage [18, 47]

Fig. 5

Synteny between a PUL of F. rivuli DSM 21788T, F. denitrificans DSM 15936T and F. johnsoniae UW101T. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. A description of glycoside hydrolase families (GH) and glycoside transferase families (GT) can be seen at the CAZy homepage [18, 47]

Fig. 6

Two PUL of F. rivuli DSM 21788T with low synteny (a, b) to PUL of other Flavobacterium members, potentially mediating the decomposition of hemicellulose or xylose. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. A description of glycoside hydrolase families (GH) can be seen at the CAZy homepage [18, 47]

Fig. 7

Polygalacturonate decomposition potential in F. rivuli DSM 21788T. a The potentially polygalacturonate specific PUL was found exclusively in F. rivuli DSM 21788T. b Genes for the catabolism of d-galactopyranuronate are colocalized in a gene cluster syntenic between Flavobacterium members. c Enzymes of the pectate decomposition and catabolism pathway. Bold blue numbers indicate the position of enzymes in the pectate catabolism pathway c and their corresponding genes in the gene clusters a, b. Genes in gray encode for hypothetical proteins. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. Investigation of pectin degradation pathway was done using the MetaCyc homepage [19]. A description of glycoside hydrolase families (GH) can be seen at the CAZy homepage [18, 47]

Synteny between a potentially laminarin-specific PUL of F. rivuli DSM 21788T and other Flavobacteriaceae. Open circles indicate genes which were specifically expressed by ‘Gramella forsetii’ KT0803 when grown on laminarin, as shown by Kabisch et al. [38]. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. A description of glycoside hydrolase families (GH) can be seen at the CAZy homepage [18, 47] Synteny between a PUL of F. rivuli DSM 21788T, F. denitrificans DSM 15936T and F. johnsoniae UW101T. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. A description of glycoside hydrolase families (GH) and glycoside transferase families (GT) can be seen at the CAZy homepage [18, 47] Two PUL of F. rivuli DSM 21788T with low synteny (a, b) to PUL of other Flavobacterium members, potentially mediating the decomposition of hemicellulose or xylose. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. A description of glycoside hydrolase families (GH) can be seen at the CAZy homepage [18, 47] Polygalacturonate decomposition potential in F. rivuli DSM 21788T. a The potentially polygalacturonate specific PUL was found exclusively in F. rivuli DSM 21788T. b Genes for the catabolism of d-galactopyranuronate are colocalized in a gene cluster syntenic between Flavobacterium members. c Enzymes of the pectate decomposition and catabolism pathway. Bold blue numbers indicate the position of enzymes in the pectate catabolism pathway c and their corresponding genes in the gene clusters a, b. Genes in gray encode for hypothetical proteins. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. Investigation of pectin degradation pathway was done using the MetaCyc homepage [19]. A description of glycoside hydrolase families (GH) can be seen at the CAZy homepage [18, 47] In addition to the PULs, DSM 21788T had one large operon-like structure comprising a set of 11 glycoside hydrolases, 3 carbohydrate esterases, one polysaccharide lyase (Fig. 7a), notably three GH28s (exo-poly-α-d-galacturonosidase) and a PL11 (digalacturonate lyase) for the decomposition of a pectate-like polysaccharide (polygalacturonate). Acetyl groups may be split of by CE7 (acetyl xylan esterase) and CE12 (rhamnogalacturonan acetylesterase). Interestingly, this operon additionally includes an altronate hydrolase and an oxidoreductase, which are part of the d-galactopyranuronate catabolic pathway (Fig. 7c), as well as two transporters, an aldose epimerase, a dehydrogenase and a kinase, which may mediate the catabolism of side-chain saccharides such as d-xylose, d-mannose and d-arabinose. In other species, genes of the d-galactopyranuronate catabolic pathway are all co-located in loci which are syntenic with a gene cluster in (Fig. 7b). However, the gene cluster in did not contain the altronate hydrolase and oxidoreductase. Conclusively, the absence of the two genes of the d-galactopyranuronate catabolic pathway, and thus the ability to utilize polygalacturonate, was possibly compensated by the large CAZy-rich gene cluster.

Conclusion

The high-quality draft genome sequence of the Gram-negative, non-motile WB 3.3-2T (=DSM 21788T) isolated from a spring of a hard water rivulet provided new insights into the polysaccharide-decomposition potential of freshwater . belongs to a group of deep-branching species within the genus that might be more closely related to the genus than to the type species of ,. The present data points towards an unsatisfactory taxonomy irrespective of which interpretation one follows and is largely a result of publishing new species in the genus without taking into consideration a wider range of species in that genus or including members of the genus as well as publishing new species within the genus without taking a larger number of species from the genus into consideration (including the type species). At the same time all evaluations are primarily based on “phylogenetic data” (i.e., gene sequence data) and genera are often poorly delineated. At first glance it does not appear that this approach will resolve this issue. Bernardet et al. [12] mentioned the clustering of among other species in groups or possible new genera which have 16S rRNA gene sequence identities below 93 % with the type species of the genus. However, the potentially new genera could not be delineated because different procedures or culture conditions were used to describe common features [12]. The problem of an essentially unresolved backbone in the 16S rRNA gene sequence phylogeny of the (see above) will most probably be overcome in the near future with the foreseeable increase of publicly available draft genome sequences from large scale projects such as GEBA, which will enable us to infer whole genome sequence based phylogenies with a significantly higher statistical support for the branching topology using genome-based inference methods [54]. The genome of strain WB 3.3-2T (DSM 21788T) comprised 4.48 Mbp on 23 scaffolds and was sequenced as part of the enomicncyclopedia ofacteria andrchaea project. The genome encoded for a great variety of 151 carbohydrate active enzymes and 177 peptidases. The four identified polysaccharide-utilization loci may enable strain WB 3.3-2T to decompose laminarin, hemicellulose and xylose. One gene cluster was identified that may enable strain WB 3.3-2T to decompose pectate-like polysaccharides. This genome in combination with other genomes of the will give further insights into the evolution and genetic potential of bacteria succeeding in substrate-related niches during polysaccharide decomposition in marine and freshwater habitats.
Table 1

Classification and general features of F. rivuli WB 3.3-2T in accordance with the MIGS recommendations [26], as developed by [25], List of Prokaryotic names with Standing in Nomenclature [23] and the Names for Life database [31]

MIGS IDPropertyTermEvidence code
Current classificationDomain Bacteria TAS [73]
Phylum Bacteroidetes TAS [2, 41]
Class Flavobacteriia TAS [3, 11]
Order Flavobacteriales TAS [14, 65]
Family Flavobacteriaceae TAS [13, 65]
Genus Flavobacterium TAS [12, 36]
Species Flavobacterium rivuli TAS [1]
type strain WB 3.3-2T TAS [1]
Gram-stainnegativeTAS [1]
Cell shaperod-shapedTAS [1]
MotilitynonmotileTAS [1]
Sporulationnon-spore formingTAS [13]
Temperature rangemesophilic (4–29 °C)TAS [1]
Optimum temperature16–24 °CTAS [1]
pH range; Optimum6.4–7.8, 7TAS [1]
Carbon sourceCarbohydrates, peptidesTAS [1]
MIGS-6Habitatfresh waterTAS [1, 17]
MIGS-6.3Salinity0–2 %TAS [1]
MIGS-22Oxygen requirementobligate aerobeTAS [1]
MIGS-15Biotic relationshipfree-livingTAS [1, 17]
MIGS-14Pathogenicitynot reportedNAS
MIGS-4Geographic locationHarz Mountains, North GermanyTAS [1, 17]
MIGS-5Sample collection time9 June 2005TAS [1, 17]
MIGS-4.1Latitude51.758065TAS [1, 17]
MIGS-4.2Longitude10.11638TAS [1, 17]
MIGS-4.4Altitude273 mTAS [17]

Evidence codes - IDA, Inferred from Direct Assay (first time in publication), TAS traceable author statement (i.e., a direct report exists in the literature), NAS, non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [5]

  54 in total

1.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis.

Authors:  J Castresana
Journal:  Mol Biol Evol       Date:  2000-04       Impact factor: 16.240

2.  GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes.

Authors:  Amrita Pati; Natalia N Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Sean D Hooper; Athanasios Lykidis; Nikos C Kyrpides
Journal:  Nat Methods       Date:  2010-05-02       Impact factor: 28.547

3.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

4.  Substrate-controlled succession of marine bacterioplankton populations induced by a phytoplankton bloom.

Authors:  Hanno Teeling; Bernhard M Fuchs; Dörte Becher; Christine Klockow; Antje Gardebrecht; Christin M Bennke; Mariette Kassabgy; Sixing Huang; Alexander J Mann; Jost Waldmann; Marc Weber; Anna Klindworth; Andreas Otto; Jana Lange; Jörg Bernhardt; Christine Reinsch; Michael Hecker; Jörg Peplies; Frank D Bockelmann; Ulrich Callies; Gunnar Gerdts; Antje Wichels; Karen H Wiltshire; Frank Oliver Glöckner; Thomas Schweder; Rudolf Amann
Journal:  Science       Date:  2012-05-04       Impact factor: 47.728

5.  List of Bacterial Names with Standing in Nomenclature: a folder available on the Internet.

Authors:  J P Euzéby
Journal:  Int J Syst Bacteriol       Date:  1997-04

6.  Flavobacterium hauense sp. nov., isolated from soil and emended descriptions of Flavobacterium subsaxonicum, Flavobacterium beibuense and Flavobacterium rivuli.

Authors:  Kun Dong; Biao Xu; Fengqiu Zhu; Gejiao Wang
Journal:  Int J Syst Evol Microbiol       Date:  2013-03-01       Impact factor: 2.747

7.  Standard operating procedure for calculating genome-to-genome distances based on high-scoring segment pairs.

Authors:  Alexander F Auch; Hans-Peter Klenk; Markus Göker
Journal:  Stand Genomic Sci       Date:  2010-01-28

8.  RAPSearch: a fast protein similarity search tool for short reads.

Authors:  Yuzhen Ye; Jeong-Hyeon Choi; Haixu Tang
Journal:  BMC Bioinformatics       Date:  2011-05-15       Impact factor: 3.307

9.  dbCAN: a web resource for automated carbohydrate-active enzyme annotation.

Authors:  Yanbin Yin; Xizeng Mao; Jincai Yang; Xin Chen; Fenglou Mao; Ying Xu
Journal:  Nucleic Acids Res       Date:  2012-05-29       Impact factor: 16.971

10.  Genomic Encyclopedia of Type Strains, Phase I: The one thousand microbial genomes (KMG-I) project.

Authors:  Nikos C Kyrpides; Tanja Woyke; Jonathan A Eisen; George Garrity; Timothy G Lilburn; Brian J Beck; William B Whitman; Phil Hugenholtz; Hans-Peter Klenk
Journal:  Stand Genomic Sci       Date:  2013-12-17
View more
  8 in total

1.  Comparative genome analysis of Bacillus velezensis reveals a potential for degrading lignocellulosic biomass.

Authors:  Long Chen; Wei Gu; Hai-Yan Xu; Gui-Lian Yang; Xiao-Feng Shan; Guang Chen; Yuan-Huan Kang; Chun-Feng Wang; Ai-Dong Qian
Journal:  3 Biotech       Date:  2018-05-11       Impact factor: 2.406

2.  Analysis of 1,000 Type-Strain Genomes Improves Taxonomic Classification of Bacteroidetes.

Authors:  Marina García-López; Jan P Meier-Kolthoff; Brian J Tindall; Sabine Gronow; Tanja Woyke; Nikos C Kyrpides; Richard L Hahnke; Markus Göker
Journal:  Front Microbiol       Date:  2019-09-23       Impact factor: 5.640

3.  Changes in bacterial diversity of activated sludge exposed to titanium dioxide nanoparticles.

Authors:  Pabel Cervantes-Avilés; César Augusto Caretta; Elcia Margareth Souza Brito; Pierre Bertin; Germán Cuevas-Rodríguez; Robert Duran
Journal:  Biodegradation       Date:  2021-04-03       Impact factor: 3.909

4.  Comparing polysaccharide decomposition between the type strains Gramella echinicola KMM 6050(T) (DSM 19838(T)) and Gramella portivictoriae UST040801-001(T) (DSM 23547(T)), and emended description of Gramella echinicola Nedashkovskaya et al. 2005 emend. Shahina et al. 2014 and Gramella portivictoriae Lau et al. 2005.

Authors:  Irina Panschin; Sixing Huang; Jan P Meier-Kolthoff; Brian J Tindall; Manfred Rohde; Susanne Verbarg; Alla Lapidus; James Han; Stephan Trong; Matthew Haynes; T B K Reddy; Marcel Huntemann; Amrita Pati; Natalia N Ivanova; Konstantinos Mavromatis; Victor Markowitz; Tanja Woyke; Markus Göker; Hans-Peter Klenk; Nikos C Kyrpides; Richard L Hahnke
Journal:  Stand Genomic Sci       Date:  2016-06-03

5.  Ancient acquisition of "alginate utilization loci" by human gut microbiota.

Authors:  Sophie Mathieu; Mélanie Touvrey-Loiodice; Laurent Poulet; Sophie Drouillard; Renaud Vincentelli; Bernard Henrissat; Gudmund Skjåk-Bræk; William Helbert
Journal:  Sci Rep       Date:  2018-05-23       Impact factor: 4.379

6.  High-quality draft genome sequence of Flavobacterium suncheonense GH29-5(T) (DSM 17707(T)) isolated from greenhouse soil in South Korea, and emended description of Flavobacterium suncheonense GH29-5(T).

Authors:  Nisreen Tashkandy; Sari Sabban; Mohammad Fakieh; Jan P Meier-Kolthoff; Sixing Huang; Brian J Tindall; Manfred Rohde; Mohammed N Baeshen; Nabih A Baeshen; Alla Lapidus; Alex Copeland; Manoj Pillay; T B K Reddy; Marcel Huntemann; Amrita Pati; Natalia Ivanova; Victor Markowitz; Tanja Woyke; Markus Göker; Hans-Peter Klenk; Nikos C Kyrpides; Richard L Hahnke
Journal:  Stand Genomic Sci       Date:  2016-06-16

7.  High-quality draft genome sequence of Aquidulcibacter paucihalophilus TH1-2T isolated from cyanobacterial aggregates in a eutrophic lake.

Authors:  Haiyuan Cai; Yonghui Zeng
Journal:  Stand Genomic Sci       Date:  2017-12-02

8.  Comparative Genome Analyses Reveal the Genomic Traits and Host Plant Adaptations of Flavobacterium akiainvivens IK-1T.

Authors:  Xuehua Wan
Journal:  Int J Mol Sci       Date:  2019-10-03       Impact factor: 5.923

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.