Literature DB >> 27313837

High-quality draft genome sequence of Flavobacterium suncheonense GH29-5(T) (DSM 17707(T)) isolated from greenhouse soil in South Korea, and emended description of Flavobacterium suncheonense GH29-5(T).

Nisreen Tashkandy1, Sari Sabban1, Mohammad Fakieh1, Jan P Meier-Kolthoff2, Sixing Huang2, Brian J Tindall2, Manfred Rohde3, Mohammed N Baeshen4, Nabih A Baeshen4, Alla Lapidus5, Alex Copeland6, Manoj Pillay7, T B K Reddy6, Marcel Huntemann6, Amrita Pati6, Natalia Ivanova6, Victor Markowitz7, Tanja Woyke6, Markus Göker2, Hans-Peter Klenk8, Nikos C Kyrpides9, Richard L Hahnke2.   

Abstract

Flavobacterium suncheonense is a member of the family Flavobacteriaceae in the phylum Bacteroidetes. Strain GH29-5(T) (DSM 17707(T)) was isolated from greenhouse soil in Suncheon, South Korea. F. suncheonense GH29-5(T) is part of the G enomic E ncyclopedia of B acteria and A rchaea project. The 2,880,663 bp long draft genome consists of 54 scaffolds with 2739 protein-coding genes and 82 RNA genes. The genome of strain GH29-5(T) has 117 genes encoding peptidases but a small number of genes encoding carbohydrate active enzymes (51 CAZymes). Metallo and serine peptidases were found most frequently. Among CAZymes, eight glycoside hydrolase families, nine glycosyl transferase families, two carbohydrate binding module families and four carbohydrate esterase families were identified. Suprisingly, polysaccharides utilization loci (PULs) were not found in strain GH29-5(T). Based on the coherent physiological and genomic characteristics we suggest that F. suncheonense GH29-5(T) feeds rather on proteins than saccharides and lipids.

Entities:  

Keywords:  Aerobic; Bacteroidetes; Carbohydrate active enzyme; Flavobacteriaceae; GEBA; GGDC; Gliding motility; Greenhouse soil; KMG-1; Polysaccharide utilization loci; Tree of Life

Year:  2016        PMID: 27313837      PMCID: PMC4910214          DOI: 10.1186/s40793-016-0159-5

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

/ have been frequently observed in aquatic and soil habitats [1-3] and play a major role in polysaccharide decomposition [2, 4, 5]. Type strains of the genus have been isolated from many different habitats such as fresh water, sea ice and soil, and some strains are pathogenic to humans and animals [2, 6]. Strain GH29-5T (= DSM 17707 = CIP 109901 = KACC 11423) is the type strain of [2, 7], which belongs to [8]. GH29-5T was isolated from greenhouse soil in Korea [10]. UW101T, a well studied model organism, was as well isolated from soil [11, 12] and harbors a considerable number of CAZymes and PULs [13]. Thus, an investigation of the genome of strain GH29-5T will give further insights into the variety of CAZymes and the polysaccharide decomposition potential of this microrganism. Here we present the set of carbohydrate active enzymes, polysaccharide utilization loci and peptidases of GH29-5T, together with a set of phenotypic features and the description and annotation of the high-quality draft genome sequence from a culture of DSM 17707.

Organism information

Classification and features

The sequence of the single 16S rRNA gene copy in the genome is identical with the previously published 16S rRNA gene sequence (DQ222428). Figure 1 shows the phylogenetic neighborhood of GH29-5T inferred from a tree of 16S rRNA gene sequence, as previously described [14]. The next related type species are R2A-7T (EU521691), DK69T (JN790956), JC2902T (JQ928688) and S13T (DQ021903) with less than 95.9 % 16S rRNA gene identity. The 16S rRNA gene sequence of strain GH29-5T has an identity of only 93.9 % with DSM 1132 (AM230485).
Fig. 1

Phylogenetic tree of the genus Flavobacterium and its most closely related genus Capnocytophaga. Modified from Hahnke et al. [68]. In short: the tree was inferred from 1254 aligned characters of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion. The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1000 ML bootstrap replicates (left) and from 1000 maximum-parsimony bootstrap replicates (right) if larger than 60 %

Phylogenetic tree of the genus Flavobacterium and its most closely related genus Capnocytophaga. Modified from Hahnke et al. [68]. In short: the tree was inferred from 1254 aligned characters of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion. The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1000 ML bootstrap replicates (left) and from 1000 maximum-parsimony bootstrap replicates (right) if larger than 60 % The 16S rRNA gene sequence of GH29-5T was compared with the Greengenes database [15]. Considering the best 100 hits, 99 sequences belonged to and one sequence to sp. (X85210). Among the most frequent keywords within the labels of environmental samples were 40.4 % marine habitats (such as marine sediment, deep sea, seawater, whale fall, diatom/phytoplankton bloom, Sargasso Sea, sponge, sea urchin, bacterioplankton), 12.3 % soil habitats (such as rhizosphere, grassland, compost), 11.6 % freshwater habitats (such as lake, riverine sediment, groundwater), 8.9 % cold environments (such as Antarctic/Artic seawater, lake ice or sediment), but also 2.7 % wastewater habitats. Interestingly, environmental 16S rRNA gene sequences with 99 % sequence identity with GH29-5T were clones from wetland of France (KC432449) [16] and an enrichment culture of heterotrophic soil bacteria from the Netherlands (JQ855723), and with 98 % sequence identity to a soil isolate from Taiwan (DQ239767). As described for [17], GH29-5T stains are Gram-negative (Table 1). The colonies are convex, round and yellow, but flexirubin-type pigments are absent and gliding motility was not observed [10]. The strain is positive for the catalase and oxidase tests [10], as are most members of the genus [6]. Cells divide by binary fission, possess appandages and occur either as single rod shaped cells, with 0.3 μm in width and 1.5–2.5 μm in length, or as filaments (Fig. 2).
Table 1

Classification and general features of F. suncheonense GH29-5T in accordance with the MIGS recommendations [59], as developed by [60], List of Prokaryotic names with Standing in Nomenclature [61] and the Names for Life database [62]

MIGS IDPropertyTermEvidence code
Current classificationDomain: Bacteria TAS [12]
Phylum: Bacteroidetes TAS [63, 64]
Class: ‘Flavobacteriia’ TAS [65, 66]
Order: Flavobacteriales TAS [9, 67]
Family: Flavobacteriaceae TAS [8, 9]
Genus: Flavobacterium TAS [6, 68]
Species: Flavobacterium suncheonense TAS [10]
Type strain: GH29-5T TAS [10]
Gram-stainNegativeTAS [10]
Cell shaperod-shapedTAS [10]
MotilityNonmotileTAS [10]
Sporulationnon-spore formingNAS
Temperature rangemesophilic (15–37 °C)TAS [10]
Optimum temperature16–24 °CTAS [10]
pH range; Optimum6–8,TAS [10]
Carbon sourceCarbohydrates, peptidesTAS [10]
Energy sourcechemoheterotrophTAS [10]
MIGS-6Habitatgreenhouse soilTAS [10]
MIGS-Salinity0–1 % NaCl, 0 % NaClTAS [10]
MIGS-22Oxygen requirementaerobeTAS [10]
MIGS-15Biotic relationshipfree-livingTAS [10]
MIGS-14PathogenicityunknownTAS [69]
Biosafety level1TAS [69]
MIGS-4Geographic locationSuncheon City, South KoreaTAS [10]
MIGS-5Sample collection2005TAS [10]
MIGS-Latitude34.954TAS [10]
MIGS-4.2Longitude127.483TAS [10]
MIGS-4.4Altitudenot reportedTAS [10]

Evidence codes are from the Gene Ontology project [18]

Evidence codes - IDA inferred from direct assay (first time in publication); TAS traceable author statement (i.e., a direct report exists in the literature); NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence)

Fig. 2

Scanning-electron micrograph of F. suncheonense GH29-5T (DSM 17707T) showing appendages 50–80 nm in diameter and 0.5–8 μm in length (arrows)

Classification and general features of F. suncheonense GH29-5T in accordance with the MIGS recommendations [59], as developed by [60], List of Prokaryotic names with Standing in Nomenclature [61] and the Names for Life database [62] Evidence codes are from the Gene Ontology project [18] Evidence codes - IDA inferred from direct assay (first time in publication); TAS traceable author statement (i.e., a direct report exists in the literature); NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence) Scanning-electron micrograph of F. suncheonense GH29-5T (DSM 17707T) showing appendages 50–80 nm in diameter and 0.5–8 μm in length (arrows) GH29-5T grows between 15 °C and 37 °C, pH 6 and 8 and in media with up to 1 % NaCl [10], with optimal growth at pH 7.0 and without NaCl [7]. Strain GH29-5T decomposes gelatin and casein, but not starch, carboxymethyl cellulose, agar, alginate, pectin, chitin, aesculin and DNA [10]. Strain GH29-5T produces H2S and neither reduces nitrate nor produces indole or ferments glucose [10]. Moreover, strain GH29-5T does not utilize arabinose, mannose, N-acetyl-D-glucosamine, maltose, gluconate, caprate, adipate, malate, citrate and phenylacetate [19]. Strain GH29-5T possesses alkaline phosphatase, esterase C4, esterase lipase C8, leucine arylamidase, valine arylamidase, acid phosphatase, naphthol-AS-BI-phosphohydrolase and N-acetyl-β-glucosaminidase, but has no lipase C14, cystine arylamidase, trypsin, α-chymotrypsin, α-galactosidase, β-galactosidase, β-glucuronidase, α-glucosidase, β-glucosidase, α-mannosidase, α-fucosidase and urease activity [10].

Chemotaxonomic data

The major cellular fatty acids are iso-C15 : 0 (29.9 %), iso-C17 : 0 3-OH (17.7 %), iso-C15 : 1 G (12.0 %) and iso-C15 : 0 3-OH (11.1 %) and MK-6 is the sole quinone [10], as common in [6]. Besides phosphatidylethanol-amine, several unidentified lipids, aminolipids and aminophospholipids were observed in strain GH29-5T [7]. The DNA G + C content was reported to be 39.0 mol % [10].

Genome sequencing information

Genome project history

This strain was selected for sequencing on the basis of its phylogenetic position [20, 21], and is part of Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project [22], a follow-up of the Genomic Encyclopedia of and Archaea (GEBA) pilot project [23], which aims at sequencing key reference microbial genomes and generating a large genomic basis for the discovery of genes encoding novel enzymes [24]. KMG-I is the part of the “Genomic Encyclopedia of and Archaea: sequencing a myriad of type strains initiative” [25] and a Genomic Standards Consortium project [26]. The genome project is deposited in the Genomes OnLine Database [27] and the permanent draft genome sequence is deposited in GenBank. Sequencing, finishing and annotation were performed by the DOE-JGI using state-of-the-art sequencing technology [28]. A summary of the project information is shown in Table 2.
Table 2

Project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityLevel 2: High-Quality Draft
MIGS-28Libraries usedIllumina Std shotgun library
MIGS 29Sequencing platformsIllumina, Illumina HiSeq 2000, Illumina HiSeq 2500
MIGS 31.2Fold coverage115.3x
MIGS 30AssemblersVelvet v. 1.1.04; ALLPATHS v. r41043
MIGS 32Gene calling methodProdigal, GenePRIMP, IMG-ER
Locus TagG498
Genbank IDAUCZ00000000
GenBank Date of Release12-DEC-2013
GOLD IDGp0013510
BIOPROJECTPRJNA185581
MIGS 13Source Material IdentifierDSM 17707
Project relevanceTree of Life, GEBA-KMG
Project information

Growth conditions and genomic DNA preparation

A culture of GH29-5T (DSM 17707) was grown aerobically in DSMZ medium 830 (R2A Medium) [29] at 28 °C. Genomic DNA was isolated using a Jetflex Genomic DNA Purification Kit (GENOMED 600100) following the standard protocol provided by the manufacturer. DNA is available from the DSMZ through the DNA Bank Network [30].

Genome sequencing and assembly

The draft genome of strain GH29-5T was generated using the Illumina technology [31]. An Illumina Std. shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 9,392,462 reads totaling 1408.9 Mbp (Table 3). All general aspects of library construction and sequencing performed at the DOE-JGI can be found at [32]. All raw sequence data were passed through DUK, a filtering program developed at DOE-JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, Copeland A, Han J: DUK. unpublished 2011). The following steps were performed for assembly: (1) filtered reads were assembled using Velvet [33], (2) 1–3 Kbp simulated paired-end reads were created from Velvet contigs using wgsim [34], (3) Sequence reads were assembled with simulated read pairs using Allpaths–LG [35]. Parameters for assembly steps were: 1) Velvet (“velveth 63 -shortPaired” and “velvetg -very_clean yes -exportFiltered yes -min_contig_lgth 500 -scaffolding no -cov_cutoff 10”), (2) wgsim (“wgsim -e 0–1 100–2 100 -r 0 -R 0 -X 0”) (3) Allpaths–LG (“PrepareAllpathsInputs: PHRED_64 = 1 PLOIDY = 1 FRAG_COVERAGE = 125 JUMP_COVERAGE = 25 LONG_JUMP_COV = 50” and “RunAllpathsLG THREADS = 8 RUN = std shredpairs TARGETS = standard VAPI_WARN_ONLY = True OVERWRITE = True”). The final draft assembly contained 57 contigs in 54 scaffolds. The total size of the genome is 2.9 Mbp and the final assembly is based on 331.3 Mbp of data, which provides a 114.2x average coverage of the genome.
Table 3

Genome statistics

AttributeValue% of Total
Genome size (bp)2,880,663100.0
DNA coding (bp)2,622,75191.1
DNA G + C (bp)1,165,57540.5
DNA scaffolds54
Total genes2821100.0
Protein coding genes273997.1
RNA genes822.9
Pseudo genes00.0
Genes in internal clusters1254.43
Genes with function prediction191667.92
Genes assigned to COGs143951.01
Genes with Pfam domains202071.61
Genes with signal peptides34812.34
Genes with transmembrane helices63122.37
CRISPR repeats0
Genome statistics

Genome annotation

Genes were identified using Prodigal [36] as part of the DOE-JGI genome annotation pipeline [37], followed by manual curation using the DOE-JGI GenePRIMP pipeline [38]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information non-redundant database, UniProt, TIGR-Fam, Pfam, PRIAM, KEGG, COG, and InterPro database. These data sources were combined to assert a product description for each predicted protein. Additional gene prediction analysis and functional annotation was performed within the IMG-ER platform [39].

Genome properties

The assembly of the draft genome sequence consists of 54 scaffolds amounting to 2,880,663 bp. The G + C content is 40.5 % (Table 3) which is 1.5 % higher than previously reported by Kim et al. [10] and thus shows a difference that surpasses the maximal range among strains belonging to the same species [40]. Of the 2821 genes predicted, 2739 were protein-coding genes, and 82 RNAs. The majority of the protein-coding genes (69.2 %) were assigned a putative function while the remaining ones were annotated as hypothetical proteins. The distribution of genes into COG functional categories is presented in Table 4.
Table 4

Number of genes associated with the general COG functional categories

CodeValue% ageDescription
J17811.5Translation, ribosomal structure and biogenesis
ARNA processing and modification
K835.3Transcription
L764.9Replication, recombination and repair
B10.1Chromatin structure and dynamics
D241.5Cell cycle control, cell division, chromosome partitioning
YNuclear structure
V442.8Defense mechanisms
T533.4Signal transduction mechanisms
M16510.6Cell wall/membrane/envelope biogenesis
N100.6Cell motility
ZCytoskeleton
WExtracellular structures
U151.0Intracellular trafficking, and secretion
O936.0Posttranslational modification, protein turnover, chaperones
C845.4Energy production and conversion
G513.1Carbohydrate transport and metabolism
E1097.1Amino acid transport and metabolism
F624.0Nucleotide transport and metabolism
H996.4Coenzyme transport and metabolism
I775.0Lipid transport and metabolism
P744.8Inorganic ion transport and metabolism
Q291.9Secondary metabolites biosynthesis, transport and catabolism
R1318.4General function prediction only
S835.3Function unknown
138249.0Not in COGs
Number of genes associated with the general COG functional categories

Insights from the genome sequence

Comparative genomics

We conducted a comparative genomics analysis of (AUCZ00000000) with a selection of closely related (according to 16S rRNA gene sequence similarities) type strains, i.e., (AVCS00000000), (AVBI00000000), (AVFO00000000) and (CP003222) and the type species (JRHH00000000). The genome sizes of the five type strains were 3.1 Mbp on average with the biggest difference of 0.5 Mbp between the genomes of and , on the one hand, and , on the other hand. Genome sizes were 3.1 Mbp (), 3.2 Mbp (), 3.4 Mbp (), 2.9 Mbp () and 2.9 Mbp (). However, since these genomes have not yet been sequenced completely, their sizes might slightly change in the future. An estimate of the overall similarity between and the five reference strains was conducted using the Genome-to-Genome Distance Calculator (GGDC 2.0) [41, 42]. It reports model-based DDH estimates (digital DDH or dDDH) along with their confidence intervals [42], which allow for genome-basted species delineation and genome-based subspecies delineation. The recommended distance formula 2 is robust against the use of incomplete genome sequences and is thus especially suited for this dataset. The result of this comparison is shown in Table 5 and yields dDDH of below 22 % throughout, which confirms the expected status of distinct species. Furthermore, the G + C content was calculated from the genome sequences of the above strains and their pairwise differences were assessed with respect to . Differences were 2.4 % (), 2.8 % (), 1 % (), 9.1 % () and 8.3 % (). These differences confirm the status of distinct species, because, if computed from genome sequences, these differences can only vary up to 1 % within species [40].
Table 5

Pairwise comparison using the GGDC (Genome-to-Genome Distance Calculator) of F. suncheonense with a selection of currently available Flavobacterium genomes, F. enshiense (AVCS00000000), F. cauense (AVBI00000000), F. saliperosum (AVFO00000000) and F. columnare (CP003222), plus the type species F. aquatile (JRHH00000000)

F. suncheonenseversus% dDDH% C.I. dDDHHSP length/total length [%]Identities HSP/length [%]Identities/total length [%]
F. aquatile 18.72.64763
F. cauense 21.23.0457936
F. columnare 20.92.64793
F. enshiense 20.22.9297823
F. saliperosum 21.03.0417933

Digital DDH values (dDDH) and the respective confidence intervals (C.I.) are specified for GGDC's recommended formula 2. The columns “HSP length / total length [%]”, “identities / HSP length [%]” and “identities / total length [%]” list similarities as calculated from the intergenomic distances, which were also reported by the GGDC (Formulae 1–3)

Pairwise comparison using the GGDC (Genome-to-Genome Distance Calculator) of F. suncheonense with a selection of currently available Flavobacterium genomes, F. enshiense (AVCS00000000), F. cauense (AVBI00000000), F. saliperosum (AVFO00000000) and F. columnare (CP003222), plus the type species F. aquatile (JRHH00000000) Digital DDH values (dDDH) and the respective confidence intervals (C.I.) are specified for GGDC's recommended formula 2. The columns “HSP length / total length [%]”, “identities / HSP length [%]” and “identities / total length [%]” list similarities as calculated from the intergenomic distances, which were also reported by the GGDC (Formulae 1–3)

Gliding motility

McBride and Zhu [43] described the diversity of genes involved in gliding motility among members of phylum . The machinery for gliding motility is composed of adhesin-like proteins, the type IX secretion system, and additional proteins [43]. Even though strain GH29-5T was never observed to glide [10], all necessary genes for gliding motility were identified in its genome (Table 6).
Table 6

Gliding motility-related genes in strain GH29-5T compared to genes in Flavobacterium strains studied by McBride and Zhu [43]

F. suncheonense GH29-5T F. rivuli DSM 21788T F. johnsoniae ATCC 17061T
locus tag prefixG498_RS01F565_ RS01Fjoh_
Gliding motility+
Adhesin-like
remA 007160808
remB 018031657
sprB +b 0979
ATP-binding cassette transporter
gldA 02505052701516
gldF 02374007602722
gldG 02375007652721
Additional proteins
gldB a 00808133901793
gldC 00807133851794
gldD a 01936188651540
gldE 00405188601539
gldH a 02655105150890
gldJ a 00438118451557
Peptidoprolyl isomerase (‘Flavobacteriia’, protein folding)
gldI 01009081802369
Type IX secretion system (secretion of RemA/RemB)
gldK a 00758186051853
gldL a 00757186001854
gldM a 00756185951855
gldN a 00755185901856
sprA a 01807060651653
sprE a 02154191501051
sprT a 02545054751466

aessential gliding motility genes after McBride and Zhu [43]

bpartial gene sequences, located at the beginning of AUCZ00000022 and at the end of AUCZ00000002

Gliding motility-related genes in strain GH29-5T compared to genes in Flavobacterium strains studied by McBride and Zhu [43] aessential gliding motility genes after McBride and Zhu [43] bpartial gene sequences, located at the beginning of AUCZ00000022 and at the end of AUCZ00000002

Carbohydrate active enzymes and peptidases

Cottrell and Kirchman [44] showed that members of the Cytophaga-Flavobacteria group preferentially consume polysaccharides and proteins rather than amino acids. This phenotypic feature was attributed by Fernández-Gómez et al. [4] to higher numbers of peptidases and additionally higher numbers of glycoside hydrolases and carbohydrate-binding modules in the genomes of compared to other bacteria. GH29-5T was isolated from greenhouse soil, hydrolyzes casein and gelatin, but did not utilize any of the tested saccharides [10, 19]. Therefore, we compared the predicted CDS against the CAZyme [45, 46] and dbCAN [47] database. The CAZyme annotation (Additional file 1, Table S1) was a combination of RAPSearch2 search [48, 49] and HMMER scanning [50] as described in Hahnke et al. [14]. The genome of strain GH29-5T comprised a small number of carbohydrate active enzymes (49) including 36 glycosyl transferases, nine glycoside hydrolases, four carbohydrate binding modules and six carbohydrate esterases (Table 7). Furthermore, sulfatases were suggested as important enzymes for the metabolic potential of to degrade sulfated algae polysaccharides such as carrageenan, agarans and fucans. Only, three sulfatases were identified in the genome of strain GH29-5T (Additional file 1, Table S2).
Table 7

Carbohydrate active enzymes (CAZy) in the genome of strain GH29-5T

CAZy familyGH2GH3GH20GH23GH25GH73GH92
Counts1112111
CAZy familyGHa CBM50CBMa
Counts131
CAZy familyGT2GT4GT5GT9GT19GT28GT30
Counts141112111
CAZy familyGT51GT56
Counts41
CAZy familyCE4CE11CE14CEa AA1AAa
Counts212111

agenes attributed to an enzyme class, but not to a family

Carbohydrate active enzymes (CAZy) in the genome of strain GH29-5T agenes attributed to an enzyme class, but not to a family

Polysaccharide utilization loci

CAZymes of that are suggested to be involved in polysaccharide decomposition are frequently observed to be organized in gene clusters. Such polysaccharides-utilization loci (PULs) consist of a TonB-dependent receptor, a SusD-like protein and carbohydrate active enzymes [51, 52]. In strain GH29-5T five TonB-dependent transporters were identified of which G498_00119, G498_01595, G498_02575 were associated to siderophores and G498_00706, G498_00915 were associated with a SusD-like protein. The gene cluster up-stream of the TonB-dependent transporter G498_00706 comprised five hypothetical proteins.

Peptidases

The MEROPS annotation was carried out by searching the sequences against the MEROPS 9.10 database [53] (access date: 2014.10.16, version: pepunit.lib) as described in Hahnke et al. [14]. The genome of strain GH29-5T comprised 117 identified peptidase genes (or homologues), mostly serine peptidases (S, 50), metallo peptidases (M, 50) and cysteine peptidases (C, 14) (Table 8, Additional file 1: Tables S3 and S4). Hence, the low number of carbohydrate active enzymes and the high number of peptidases in the genome of strain GH29-5T reflects its currently known substrate range being proteins rather than saccharides.
Table 8

Peptidases and simple peptidase inhibitors in the genome of strain GH29-5T

PeptidaseM01M03M12M13M14M16M20M23M24
Counts412252362
PeptidaseM28M36M38M41M42M43M48M50M61
Counts314112111
PeptidaseM79M90
Counts11
PeptidaseS01S06S08S09S12S14S16S24S26
Counts1131651311
PeptidaseS33S41S46S49S51S54S66
Counts6321141
PeptidaseC01C25C26C40C44C45C56
Counts1152311
PeptidaseN11T02U32U73A08A28
Counts114111
InhibitorI39I87
Counts41
Peptidases and simple peptidase inhibitors in the genome of strain GH29-5T

Conclusions

The genome of GH29-5T contains a relaltively low number of carbohydrate active enzymes in contrast to genomes of other such as [54], [14], [55], [4, 56], ‘’ [57] and [17]. This is surpising, since greenhouse soil might be a rich source of plant litter. McBride et al. [13] described the genome features of UW101T, a bacterium that was as well isolated from soil [11, 58]. Both the genomes of UW101T and GH29-5T have an almost equal number of 31 and 39 peptidases per Mbp, respectively. The genomes, however, differ remarkably in the number of CAZymes, with 47 genes per Mbp in the genome of UW101T and only 18 genes per Mbp in the genome of GH29-5T. Thus, this small set of CAZymes contributes only little to a pool of enzymes, which might be essential for a to feed on soil components. A systematic collection of genome sequences, such as GEBA [23] and KMG-1 [22], will provide the scientific community with the possibility for a systematic discovery of genes encoding for novel enzymes [24] and support microbial taxonomy. In addition, genome sequences also provide further taxonomically useful information such as the G + C content [40], which, as seen in this report might significantly differ from the values determined with traditional methods. Based on the observed large difference in the DNA G + C content and the additional information on cell morphology obtained in this study, an emended description of is proposed.

Emended description of GH29-5T Kim et al. 2006 emend. Dong et al. 2013

The description of is as given by Kim et al. [10] and Dong et al. [7], with the following modifications: the DNA G + C content is 40.5 mol%, and amendments: possesses appendages of 50–80 nm in diameter and 0.5–8 μm in length.
  53 in total

1.  Natural assemblages of marine proteobacteria and members of the Cytophaga-Flavobacter cluster consuming low- and high-molecular-weight dissolved organic matter.

Authors:  M T Cottrell; D L Kirchman
Journal:  Appl Environ Microbiol       Date:  2000-04       Impact factor: 4.792

2.  GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes.

Authors:  Amrita Pati; Natalia N Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Sean D Hooper; Athanasios Lykidis; Nikos C Kyrpides
Journal:  Nat Methods       Date:  2010-05-02       Impact factor: 28.547

3.  Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB.

Authors:  T Z DeSantis; P Hugenholtz; N Larsen; M Rojas; E L Brodie; K Keller; T Huber; D Dalevi; P Hu; G L Andersen
Journal:  Appl Environ Microbiol       Date:  2006-07       Impact factor: 4.792

4.  The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.

Authors:  T B K Reddy; Alex D Thomas; Dimitri Stamatis; Jon Bertsch; Michelle Isbandi; Jakob Jansson; Jyothi Mallajosyula; Ioanna Pagani; Elizabeth A Lobos; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 16.971

5.  Complete genome sequence of the fish pathogen Flavobacterium branchiophilum.

Authors:  Marie Touchon; Paul Barbier; Jean-François Bernardet; Valentin Loux; Benoit Vacherie; Valérie Barbe; Eduardo P C Rocha; Eric Duchaud
Journal:  Appl Environ Microbiol       Date:  2011-09-16       Impact factor: 4.792

6.  Characterization of the first alginolytic operons in a marine bacterium: from their emergence in marine Flavobacteriia to their independent transfers to marine Proteobacteria and human gut Bacteroides.

Authors:  François Thomas; Tristan Barbeyron; Thierry Tonon; Sabine Génicot; Mirjam Czjzek; Gurvan Michel
Journal:  Environ Microbiol       Date:  2012-04-19       Impact factor: 5.491

7.  Standard operating procedure for calculating genome-to-genome distances based on high-scoring segment pairs.

Authors:  Alexander F Auch; Hans-Peter Klenk; Markus Göker
Journal:  Stand Genomic Sci       Date:  2010-01-28

8.  RAPSearch: a fast protein similarity search tool for short reads.

Authors:  Yuzhen Ye; Jeong-Hyeon Choi; Haixu Tang
Journal:  BMC Bioinformatics       Date:  2011-05-15       Impact factor: 3.307

9.  dbCAN: a web resource for automated carbohydrate-active enzyme annotation.

Authors:  Yanbin Yin; Xizeng Mao; Jincai Yang; Xin Chen; Fenglou Mao; Ying Xu
Journal:  Nucleic Acids Res       Date:  2012-05-29       Impact factor: 16.971

10.  Genomic Encyclopedia of Type Strains, Phase I: The one thousand microbial genomes (KMG-I) project.

Authors:  Nikos C Kyrpides; Tanja Woyke; Jonathan A Eisen; George Garrity; Timothy G Lilburn; Brian J Beck; William B Whitman; Phil Hugenholtz; Hans-Peter Klenk
Journal:  Stand Genomic Sci       Date:  2013-12-17
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.