| Literature DB >> 26380634 |
Richard L Hahnke1, Erko Stackebrandt1, Jan P Meier-Kolthoff1, Brian J Tindall1, Sixing Huang1, Manfred Rohde2, Alla Lapidus3, James Han4, Stephan Trong4, Matthew Haynes4, T B K Reddy4, Marcel Huntemann4, Amrita Pati4, Natalia N Ivanova4, Konstantinos Mavromatis4, Victor Markowitz5, Tanja Woyke4, Markus Göker1, Nikos C Kyrpides6, Hans-Peter Klenk7.
Abstract
Flavobacterium rivuli Ali et al. 2009 emend. Dong et al. 2013 is one of about 100 species in the genus Flavobacterium (family Flavobacteriacae, phylum Bacteroidetes) with a validly published name, and has been isolated from the spring of a hard water rivulet in Northern Germany. Including all type strains of the genus Myroides and Flavobacterium into the 16S rRNA gene sequence phylogeny revealed a clustering of members of the genus Myroides as a monophyletic group within the genus Flavobacterium. Furthermore, F. rivuli WB 3.3-2(T) and its next relatives seem more closely related to the genus Myroides than to the type species of the genus Flavobacterium, F. aquatile. The 4,489,248 bp long genome with its 3,391 protein-coding and 65 RNA genes is part of the G enomic E ncyclopedia of B acteria and A rchaea project. The genome of F. rivuli has almost as many genes encoding carbohydrate active enzymes (151 CAZymes) as genes encoding peptidases (177). Peptidases comprised mostly metallo (M) and serine (S) peptidases. Among CAZymes, 30 glycoside hydrolase families, 10 glycosyl transferase families, 7 carbohydrate binding module families and 7 carbohydrate esterase families were identified. Furthermore, we found four polysaccharide utilization loci (PUL) and one large CAZy rich gene cluster that might enable strain WB 3.3-2(T) to decompose plant and algae derived polysaccharides. Based on these results we propose F. rivuli as an interesting candidate for further physiological studies and the role of Bacteroidetes in the decomposition of complex polymers in the environment.Entities:
Keywords: Aerobic; Bacteroidetes; Carbohydrate active enzyme; Flavobacteriaceae; GEBA-KMG I; Gram-negative; Hard water rivulet; Myroides; Non-motile; Polysaccharide utilization loci
Year: 2015 PMID: 26380634 PMCID: PMC4572689 DOI: 10.1186/s40793-015-0032-y
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phylogenetic tree of the genus Flavobacterium and its most closely related genus Capnocytophaga. The tree was inferred from 1,254 aligned characters of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion as previously described [34]. The sequences were aligned using poa [45] and the resulting alignment restricted to its conserved part using Gblocks [20]. The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates (left) and from 1,000 maximum-parsimony bootstrap replicates (right) if larger than 60 % [34]. Acccession numbers of 16S rRNA gene sequences are listed in Acccession numbers of 16S rRNA gene sequences are listed in Additional file 1: suppl. Table 6
Fig. 2Histogram showing the distribution of pairwise SSU similarities of the type species Flavobacterium aquatile with respect to all other 119 strains in the dataset. Except the genus Myroides, all genera are clearly segregated from each other. Pairwise SSU similarities were calculated using the recommended approach described in [55]. Bars are colored according to genus affiliation. The figure was visualized using the ggplot package [72] for the R statistical framework [63]. Acccession numbers of 16S rRNA gene sequences are listed in suppl. Table 6
Fig. 3Scanning electron micrograph of F. rivuli WB 3.3-2T (DSM 21788T) showing expression of extracellular polymeric substances, EPS (arrows)
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31.1 | Sequencing quality | Level 2: High-Quality Draft |
| MIGS-28.1 | Libraries method | Illumina Std shotgun library |
| MIGS-28.2 | Reads count | 14,972,538 sequencing reads |
| MIGS-29 | Sequencing method | Illumina HiSeq 2000, |
| MIGS-31.2 | Fold coverage | 124.1x |
| MIGS-30 | Assembly method | Velvet v. 1.1.04; ALLPATHS v. r41043 |
| MIGS-32 | Gene calling method | Prodigal, GenePRIMP, IMG-ER |
| NCBI project ID | 182404 | |
| Genbank ID | ARKJ00000000 | |
| Genbank Date of Release | 16-SEP-2013 | |
| IMG object ID | 2519103183 | |
| GOLD ID | Gi11501 | |
| MIGS-13 | Source Material Identifier | DSM 21788 |
| Project relevance | Tree of Life, GEBA-KMG |
Genome statistics
| Attribute | Number | % of Total |
|---|---|---|
| DNA, total number of bases | 4489248 | 100.0 |
| DNA coding number of bases | 3981399 | 88.7 |
| DNA G + C number of bases | 1777758 | 39.6 |
| DNA scaffolds | 23 | 100.0 |
| Genes total number | 4056 | 100.0 |
| Protein coding genes | 3991 | 98.4 |
| RNA genes | 65 | 1.6 |
| rRNA genes | 8 | 0.2 |
| 5S rRNA | 5 | 0.1 |
| 16S rRNA | 1 | <0.1 |
| 23S rRNA | 2 | 0.1 |
| tRNA genes | 48 | 1.2 |
| Other RNA genes | 9 | 0.2 |
| Protein coding genes with function prediction | 2842 | 70.1 |
| without function prediction | 1149 | 28.3 |
| Protein coding genes with COGs | 2570 | 63.4 |
| Protein coding genes with Pfam | 2924 | 72.1 |
| Protein coding genes coding signal peptides | 654 | 16.1 |
| Protein coding genes coding transmembrane proteins | 906 | 22.3 |
| CRISPR repeats | 0 |
Number of genes associated with the general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 152 | 5.4 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.1 | RNA processing and modification |
| K | 209 | 7.4 | Transcription |
| L | 138 | 4.9 | Replication, recombination and repair |
| B | 1 | 0.1 | Chromatin structure and dynamics |
| D | 21 | 0.8 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 49 | 1.7 | Defense mechanisms |
| T | 177 | 6.3 | Signal transduction mechanisms |
| M | 237 | 8.4 | Cell wall/membrane/envelope biogenesis |
| N | 10 | 0.4 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 50 | 1.8 | Intracellular trafficking, secretion, and vesicular transport |
| O | 111 | 3.9 | Posttranslational modification, protein turnover, chaperones |
| C | 152 | 5.4 | Energy production and conversion |
| G | 201 | 7.2 | Carbohydrate transport and metabolism |
| E | 199 | 7.1 | Amino acid transport and metabolism |
| F | 67 | 2.4 | Nucleotide transport and metabolism |
| H | 124 | 4.4 | Coenzyme transport and metabolism |
| I | 109 | 3.9 | Lipid transport and metabolism |
| P | 133 | 4.7 | Inorganic ion transport and metabolism |
| Q | 48 | 1.7 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 360 | 12.8 | General function prediction only |
| S | 261 | 9.3 | Function unknown |
| - | 1486 | 36.6 | Not in COGs |
Pairwise comparison of F. rivuli with F. filum, F. subsaxonicum, F. beibuense and F. aquatile using the GGDC (Genome-to-Genome Distance Calculator). Digital DDH (dDDH) and the respective confidence intervals (C.I.) are specified for GGDC’s recommended formula 2
|
| % dDDH | % C.I. dDDH | HSP length/total length [%] | Identities/HSP length [%] | Identities/total length [%] |
|---|---|---|---|---|---|
|
| 18.4 | 2.5 | 2 | 76 | 1 |
|
| 18.7 | 2.6 | 7 | 76 | 6 |
|
| 19.0 | 2.5 | 2 | 77 | 1 |
|
| 21.3 | 2.9 | 31 | 79 | 24 |
Gliding motility-related genes in strain DSM 21788T compared to genes in Flavobacterium strains studied by McBride and Zhu [51]
|
|
|
| ||
|---|---|---|---|---|
| Locus tag prefix | F565_ RS01 | FP | Fjoh_ | |
| Gliding motility | – | + | + | |
| Adhesin-like | ||||
| remA | – | 1959 | 0808 | |
| remB | – | 2117 | 1657 | |
| sprB | – | 0016 | 0979 | |
| ATP-binding cassette transporter | ||||
| gldA | 05270 | 0252 | 1516 | |
| gldF | 00760 | 1089 | 2722 | |
| gldG | 00765 | 1090 | 2721 | |
| Additional protein required for gliding | ||||
| gldBa | 13390 | 2069 | 1793 | |
| gldC | 13385 | 2068 | 1794 | |
| gldDa | 18865 | 1663 | 1540 | |
| gldE | 18860 | 1358 | 1539 | |
| gldHa | 10515 | 0024 | 0890 | |
| gldJa | 11845 | 1389 | 1557 | |
| Peptidoprolyl isomerase ( | ||||
| gldI | 08180 | 1892 | 2369 | |
| PorS secretion system (secretion of RemA/RemB and SprA/SprB) | ||||
| gldKa | 18605 | 1973 | 1853 | |
| gldLa | 18600 | 1972 | 1854 | |
| gldMa | 18595 | 1971 | 1855 | |
| gldNa | 18590 | 1970 | 1856 | |
| sprAa | 06065 | 2121 | 1653 | |
| sprEa | 19150 | 2467 | 1051 | |
| sprTa | 05475 | 0326 | 1466 | |
aessential gliding motility genes after McBride and Zhu [51]
Peptidases and simple peptidase inhibitors in the genome of strain DSM 21788T
| Peptidase family | M01 | M03 | M12 | M13 | M14 | M16 | M19 | M20 | M23 |
|---|---|---|---|---|---|---|---|---|---|
| Counts | 6 | 2 | 2 | 2 | 8 | 3 | 1 | 5 | 8 |
| Peptidase family | M24 | M28 | M38 | M41 | M42 | M43 | M48 | M50 | M61 |
| Counts | 2 | 2 | 6 | 1 | 1 | 1 | 1 | 1 | 1 |
| Peptidase family | M75 | M79 | M90 | M96 | |||||
| Counts | 2 | 1 | 1 | 2 | |||||
| Peptidase family | S01 | S08 | S09 | S11 | S12 | S14 | S16 | S24 | S26 |
| Counts | 2 | 5 | 31 | 1 | 6 | 3 | 3 | 5 | 1 |
| Peptidase family | S33 | S41 | S46 | S49 | S51 | S54 | S66 | ||
| Counts | 16 | 6 | 3 | 1 | 2 | 3 | 2 | ||
| Peptidase family | C01 | C25 | C26 | C40 | C44 | C56 | C82 | ||
| Counts | 1 | 1 | 8 | 3 | 4 | 4 | 1 | ||
| Peptidase family | N11 | T02 | U32 | U73 | A08 | A28 | |||
| Counts | 1 | 1 | 2 | 1 | 1 | 1 | |||
| Inhibitor family | I39 | I71 | I87 | ||||||
| Counts | 22 | 1 | 2 |
Carbohydrate active enzymes (CAZy) in the genome of strain DSM 21788T
| CAZy family | GH1 | GH2 | GH3 | GH5 | GH13 | GH16 | GH23 |
|---|---|---|---|---|---|---|---|
| Counts | 1 | 15 | 11 | 4 | 4 | 2 | 2 |
| CAZy family | GH25 | GH27 | GH28 | GH29 | GH30 | GH31 | GH36 |
| Counts | 1 | 2 | 5 | 2 | 3 | 5 | 2 |
| CAZy family | GH37 | GH39 | GH42 | GH43 | GH51 | GH65 | GH73 |
| Counts | 1 | 3 | 1 | 11 | 3 | 1 | 1 |
| CAZy family | GH78 | GH88 | GH92 | GH95 | GH97 | GH105 | GH106 |
| Counts | 1 | 1 | 2 | 3 | 5 | 4 | 2 |
| CAZy family | GH127 | GH130 | GHa | ||||
| Counts | 1 | 2 | 3 | ||||
| CAZy family | GT2 | GT4 | GT5 | GT9 | GT19 | GT20 | GT28 |
| Counts | 13 | 10 | 1 | 3 | 1 | 1 | 1 |
| CAZy family | GT30 | GT41 | GT51 | GTa | |||
| Counts | 1 | 1 | 4 | 1 | |||
| CAZy family | CBM2 | CBM10 | CBM13 | CBM32 | CBM35 | CBM50 | CBM57 |
| Counts | 1 | 1 | 1 | 1 | 2 | 4 | 1 |
| CAZy family | CE2 | CE4 | CE6 | CE7 | CE11 | CE12 | CE14 |
| Counts | 1 | 1 | 1 | 1 | 1 | 3 | 2 |
| CAZy family | CEa | PL11 | |||||
| Counts | 3 | 1 |
agenes attributed to an enzyme class, but not to a family
Fig. 4Synteny between a potentially laminarin-specific PUL of F. rivuli DSM 21788T and other Flavobacteriaceae. Open circles indicate genes which were specifically expressed by ‘Gramella forsetii’ KT0803 when grown on laminarin, as shown by Kabisch et al. [38]. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. A description of glycoside hydrolase families (GH) can be seen at the CAZy homepage [18, 47]
Fig. 5Synteny between a PUL of F. rivuli DSM 21788T, F. denitrificans DSM 15936T and F. johnsoniae UW101T. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. A description of glycoside hydrolase families (GH) and glycoside transferase families (GT) can be seen at the CAZy homepage [18, 47]
Fig. 6Two PUL of F. rivuli DSM 21788T with low synteny (a, b) to PUL of other Flavobacterium members, potentially mediating the decomposition of hemicellulose or xylose. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. A description of glycoside hydrolase families (GH) can be seen at the CAZy homepage [18, 47]
Fig. 7Polygalacturonate decomposition potential in F. rivuli DSM 21788T. a The potentially polygalacturonate specific PUL was found exclusively in F. rivuli DSM 21788T. b Genes for the catabolism of d-galactopyranuronate are colocalized in a gene cluster syntenic between Flavobacterium members. c Enzymes of the pectate decomposition and catabolism pathway. Bold blue numbers indicate the position of enzymes in the pectate catabolism pathway c and their corresponding genes in the gene clusters a, b. Genes in gray encode for hypothetical proteins. Locus tags are given below both the first and last gene of the loci. Accession numbers in brackets are GenBank accession numbers of the corresponding contig. Investigation of syntenic loci was done using MultiGeneBlast [52]. Investigation of pectin degradation pathway was done using the MetaCyc homepage [19]. A description of glycoside hydrolase families (GH) can be seen at the CAZy homepage [18, 47]
Classification and general features of F. rivuli WB 3.3-2T in accordance with the MIGS recommendations [26], as developed by [25], List of Prokaryotic names with Standing in Nomenclature [23] and the Names for Life database [31]
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| type strain WB 3.3-2T | TAS [ | ||
| Gram-stain | negative | TAS [ | |
| Cell shape | rod-shaped | TAS [ | |
| Motility | nonmotile | TAS [ | |
| Sporulation | non-spore forming | TAS [ | |
| Temperature range | mesophilic (4–29 °C) | TAS [ | |
| Optimum temperature | 16–24 °C | TAS [ | |
| pH range; Optimum | 6.4–7.8, 7 | TAS [ | |
| Carbon source | Carbohydrates, peptides | TAS [ | |
| MIGS-6 | Habitat | fresh water | TAS [ |
| MIGS-6.3 | Salinity | 0–2 % | TAS [ |
| MIGS-22 | Oxygen requirement | obligate aerobe | TAS [ |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | not reported | NAS |
| MIGS-4 | Geographic location | Harz Mountains, North Germany | TAS [ |
| MIGS-5 | Sample collection time | 9 June 2005 | TAS [ |
| MIGS-4.1 | Latitude | 51.758065 | TAS [ |
| MIGS-4.2 | Longitude | 10.11638 | TAS [ |
| MIGS-4.4 | Altitude | 273 m | TAS [ |
Evidence codes - IDA, Inferred from Direct Assay (first time in publication), TAS traceable author statement (i.e., a direct report exists in the literature), NAS, non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [5]