Literature DB >> 26380636

High quality draft genome sequence of Bacteroides barnesiae type strain BL2(T) (DSM 18169(T)) from chicken caecum.

Mitsuo Sakamoto1, Alla L Lapidus2, James Han3, Stephan Trong3, Matthew Haynes3, T B K Reddy3, Natalia Mikhailova3, Marcel Huntemann3, Amrita Pati3, Natalia N Ivanova3, Rüdiger Pukall4, Victor M Markowitz5, Tanja Woyke3, Hans-Peter Klenk4, Nikos C Kyrpides6, Moriya Ohkuma1.   

Abstract

Bacteroides barnesiae Lan et al. 2006 is a species of the genus Bacteroides, which belongs to the family Bacteroidaceae. Strain BL2(T) is of interest because it was isolated from the gut of a chicken and the growing awareness that the anaerobic microbiota of the caecum is of benefit for the host and may impact poultry farming. The 3,621,509 bp long genome with its 3,059 protein-coding and 97 RNA genes is a part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project.

Entities:  

Keywords:  Bacteroidaceae; Cecum; Gram-negative; Non-motile; Poultry; Rod-shaped; Strictly anaerobic

Year:  2015        PMID: 26380636      PMCID: PMC4572637          DOI: 10.1186/s40793-015-0045-6

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Strain BL2T (= DSM 18169 = CCUG 54636 = JCM 13652) is the type strain of which belongs to the genus [1]. The species epithet is derived from the name of Ella M. Barnes, a British microbiologist, who has contributed much to our knowledge of intestinal bacteriology and anaerobic bacteriology in general. strain BL2T was isolated from caecum of a healthy chicken. Four other strains belonging to the same species have been isolated from the same source [1]. The genus represents one of the predominant anaerobic genera found in chicken caecum [2-4]. species are thought to play a fundamental role in the breakdown of complex molecules (such as polysaccharides) into simpler compounds that are used by the animal host as well as the microorganisms themselves [5, 6], in the utilization of nitrogenous substances and in the biotransformation of bile acids and other steroids [7]. They also play a role as beneficent protectors of the gut against pathogenic microorganisms [8]. Here we present a summary classification and set of features for strain BL2T, together with the description of the complete genomic sequencing and annotation.

Organism information

Classification and features

A 1301 bp long contig contained the most complete 16S rRNA gene copy in the draft genome. This partial gene differed by 7 nucleotides (0.5 %) from the 16S rRNA reference sequence (AB253726) generated for the original description of [1]. Such a difference is not unusual when comparing original sequences from the time organisms were initially described with sequences of type strain genomes sequenced in the KMG project [9], a problem that was only partially resolved in the sequencing orphan species initiative (SOS) [10]. A representative 16S rRNA gene sequence of strain BL2T was compared with GenBank using NCBI BLAST. The single most frequent genus found was . The highest-scoring environmental sequences (up to 99.8 % sequence identity), including HQ784912 (‘gastrointestinal specimens clone ELU0102-T240-S-NI_000093’), were all from a study on gastrointestinal specimens linked to inflammatory bowel diseases phenotype in human ileum [11] and indicate that close relatives of strain BL2T and representatives of are also relevant to human health. Fig. 1 shows the phylogenetic position of in a 16S rRNA gene sequence-based tree.
Fig. 1

Phylogenetic tree based on the 16S rRNA gene sequences showing the relationship of Bacteroides barnesiae strain BL2T among the genus Bacteroides . The tree was constructed by the neighbor-joining method. Numbers at nodes indicate the percentage bootstrap values of 1000 replicates. Bars, 0.01 substitutions per nucleotide position. Accession numbers are given for each strain

Phylogenetic tree based on the 16S rRNA gene sequences showing the relationship of Bacteroides barnesiae strain BL2T among the genus Bacteroides . The tree was constructed by the neighbor-joining method. Numbers at nodes indicate the percentage bootstrap values of 1000 replicates. Bars, 0.01 substitutions per nucleotide position. Accession numbers are given for each strain The cells of are pleomorphic rods (0.5-1.4 × 0.8-10.6 μm) (Fig. 2). The cells are usually arranged singly or in pairs [1]. is a Gram-negative, non-sporeforming bacterium (Table 1) that is described as non-motile, with only seven genes associated with motility having been found in the genome (see below). The optimum temperature for growth of strain BL2T is 37 °C. is a strictly anaerobic chemoorganotroph and is able to ferment glucose, lactose, sucrose, maltose, salicin, xylose, cellobiose, mannose and raffinose [1]. The organism hydrolyzes esculin but does not liquefy gelatin, and neither reduces nitrate nor produces indole from tryptophan [1]. does not utilize mannitol, arabinose, glycerol, melezitose, sorbitol, rhamnose or trehalose [1]. Growth is possible in the presence of bile [1]. Major fermentation products from broth (1 % peptone, 1 % yeast extract, and 1 % glucose each (w/v)) are acetic acid and succinic acid, whereas isovaleric acid is produced in small amounts [1]. shows activity for α-galactosidase, β-galactosidase, α-glucosidase, β-glucosidase, α-arabinosidase, N-acetyl-β-glucosaminidase, α-fucosidase, alkaline phosphatase, leucyl glycine arylamidase, alanine arylamidase and glutamyl glutamic acid arylamidase but no activity urease, catalase, arginine dihydrolase, β-galactosidase 6-phosphate, β-glucuronidase, glutamic acid decarboxylase and arginine, proline, phenylalanine, leucine, pyroglutamic acid, tyrosine, glycine, histidine and serine arylamidase [1].
Fig. 2

Light microscope image of strain BL2T

Table 1

Classification and general features of Bacteroides barnesiae strain BL2T in accordance with the MIGS recommendations [33] published by the Genome Standards Consortium [34] and the NamesforLife database [35]

MIGS IDPropertyTermEvidence code
Current classificationDomain Bacteria TAS [36]
Phylum Bacteroidetes TAS [37, 38]
Class Bacteroidia TAS [38, 39]
Order Bacteroidales TAS [38, 40]
Family Bacteroidaceae TAS [41, 42]
Genus Bacteroides TAS [42, 43]
Species Bacteroides barnesiae TAS [1]
Strain BL2T TAS [1]
Gram stainNegativeTAS [1]
Cell shapePleomorphic rodsTAS [1]
MotilityNon-motileTAS [1]
SporulationNon-sporulatingTAS [1]
Temperature rangeMesophilicTAS [1]
Optimum temperature37 °CTAS [1]
pH range; OptimumNot reported
Carbon sourceMono- and polysaccharidesTAS [1]
Energy metabolismChemoorganotrophTAS [1]
MIGS-6HabitatChickenTAS [1]
MIGS-6.3SalinityNot reported
MIGS-22Oxygen requirementStrictly anaerobicTAS [1]
MIGS-15Biotic relationshipFree-livingTAS [1]
MIGS-14PathogenicityNoneNAS
Biosafety level1NAS
MIGS-23IsolationChicken caecumTAS [1]
MIGS-4Geographic locationJapanTAS [1]
MIGS-5Sample collection timeNot reported
MIGS-4.1LatitudeNot reported
MIGS-4.2LongitudeNot reported
MIGS-4.3DepthNot reported
MIGS-4.4AltitudeNot reported

Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [44]

Light microscope image of strain BL2T Classification and general features of Bacteroides barnesiae strain BL2T in accordance with the MIGS recommendations [33] published by the Genome Standards Consortium [34] and the NamesforLife database [35] Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [44] strain BL2T contains menaquinones MK-10 (58 %) and MK-11 (34 %) as principal respiratory quinones, small amounts of MK-8, MK-9 and MK-12 (2 % each) are found as minor components [1]. The major fatty acids found were anteiso-C15:0 (32 %), iso-C15:0 (15 %), 3-hydroxy C16:0 (10 %) and C16:0 (10 %). Fatty acids C14:0 (4 %), C15:0 (2 %), C18:1ω9c (4 %), C18:2ω6,9c (2 %) and 3-hydroxy iso-C17:0 (7 %) were found in minor amounts [1]. Chemotaxonomic features are in line with known features from other representatives of the genus [1].

Genome sequencing information

Genome project history

The organism was selected for sequencing on the basis of its phylogenetic position [12-14]. Sequencing of strain BL2T is part of Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes project [9] which aims not only to increase the sequencing coverage of key reference microbial genomes [15], but also to generate a large genomic basis for the discovery of genes encoding novel enzymes [16]. The genome project is deposited in the Genomes OnLine Database [17] and the permanent draft genome sequence is deposited in GenBank. Sequencing, finishing and annotation were performed by the DOE Joint Genome Institute using state of the art sequencing technology [18]. A summary of the project information is shown in Table 2.
Table 2

Genome sequencing project information

MIGS IDPropertyTerm
MIGS-31Finishing qualityLevel 2: High-Quality Draft
MIGS-28Libraries usedIllumina Std. shotgun library
MIGS-29Sequencing platformsIllumina HiSeq 2000
MIGS-31.2Fold coverage122.7 ×
MIGS-30AssemblersVelvet v. 1.1.04; ALLPATHS v. r41043
MIGS-32Gene calling methodProdigal
Locus TagC510
Genbank IDARGC00000000
Genbank Date of Release16-SEP-2013
GOLD IDGi11191
BIOPROJECTPRJN174979
MIGS-13Source Material IdentifierDSM 18169
Project relevanceTree of Life, GEBA-KMG
Genome sequencing project information

Growth conditions and genomic DNA preparation

strain BL2T, DSM 18169, was grown anaerobically in DSMZ medium 429 (Columbia Blood Agar) at 37 °C [19]. DNA was isolated from 0.5-1 g of cell paste using JetFlex genomic DNA purification (GENOMED) following the standard protocol as recommended by the manufacturer with and additional protease K (50 μl; 21 mg/ml) digest for 60 min. at 58 °C followed by addition of 200 μl Protein Precipitation Buffer after protein precipitation and overnight incubation on ice. DNA is available through the DNA Bank Network [20].

Genome sequencing and assembly

The permanent draft genome of strain BL2T was generated using Illumina technology [18, 21]. An Illumina Standard shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 11,109,700 reads totaling 1,666.5 Mb. All general aspects of library construction and sequencing performed at the DOE-JGI can be found at [22]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts [23]. Following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet [24], (2) 1–3 kb simulated paired end reads were created from Velvet Contigs using wgsim [25], (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r41043) [26]. Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: −very clean yes –export- Filtered yes –min contig lgth 500 –scaffolding no –cov cutoff 10) 2) wgsim (−e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs: PHRED 64 = 1 PLOIDY = 1 FRAG COVERAGE = 125 JUMP COVERAGE = 25 LONG JUMP COV = 50, RunAllpathsLG: THREADS = 8 RUN = std shredpairs TARGETS = standard VAPI WARN ONLY = True OVERWRITE = True). The final draft assembly contained 47 contigs in 43 scaffolds. The total size of the genome is 3.6 Mb and the final assembly is based on 443.6 Mb of Illumina data, which provides an average 122.7 × coverage of the genome.

Genome annotation

Genes were identified using Prodigal [27] as part of the DOE-JGI genome annotation pipeline [28, 29], following by a round of manual curation using the JGI GenePRIMP pipeline [30]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information non-redundant database, UniProt, TIGR-Fam, Pfam, PRIAM, KEGG, COG, and InterPro database. These data sources were combined to assert a product description for each predicted protein. Additional gene prediction analysis and functional annotation was performed within the Integrated Microbial Genomes-Expert Review platform [31].

Genome properties

The assembly of the draft genome sequence consists of 43 scaffolds amounting to 3,621,509 bp, and the G + C content is 46.8 % (Table 3). Of the 3,156 genes predicted, 3,059 were protein-coding genes, and 97 RNAs. The majority of the protein-coding genes (71.7 %) were assigned a putative function while the remaining ones were annotated as hypothetical proteins. The distribution of genes into COGs functional categories is presented in Table 4.
Table 3

Genome statistics

AttributeValue% of total
Genome size (bp)3,621,509100.00
DNA coding region (bp)3,241,16389.50
DNA G + C content (bp)1,696,15046.84
DNA scaffolds43
Total genes3,156100.00
Protein coding genes3,05996.93
RNA genes973.07
Genes with function prediction2,26371.70
Genes assigned to COGs1,66852.85
Genes with Pfam domains2,43177.03
Genes with signal peptides44514.10
Genes with transmembrane helices71122.53
CRISPR repeats7
Table 4

Number of genes associated with the general COG functional categories

CodeValue% ageDescription
J1448.03Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K1075.96Transcription
L1267.02Replication, recombination and repair
B00.00Chromatin structure and dynamics
D201.11Cell cycle control, cell division, chromosome partitioning
Y00.00Nuclear structure
V623.46Defense mechanisms
T603.34Signal transduction mechanisms
M1427.72Cell wall/membrane/envelope biogenesis
N40.22Cell motility
Z00.00Cytoskeleton
W00.00Extracellular structures
U472.62Intracellular trafficking, secretion, and vesicular transport
O603.34Posttranslational modification, protein turnover, chaperones
C1035.74Energy production and conversion
G1407.30Carbohydrate transport and metabolism
E1387.69Amino acid transport and metabolism
F643.57Nucleotide transport and metabolism
H905.02Coenzyme transport and metabolism
I482.68Lipid transport and metabolism
P975.41Inorganic ion transport and metabolism
Q191.06Secondary metabolites biosynthesis, transport and catabolism
R21912.21General function prediction only
S1045.80Function unknown
-1,48847.15Not in COGs
Genome statistics Number of genes associated with the general COG functional categories

Insights from the genome sequence

strain BL2T, strain BL78T and strain C35T were isolated from the cecum of the same healthy chicken [1]. The GGDC (Genome-to-Genome Distance Calculator) web server (GGDC 2.0) [32] was used for the estimation of the overall similarity between the three genomes. The comparison of with and revealed that 11.1 % and 5.2 %, respectively, of the average of the genome lengths are covered with HSPs (high-scoring segment pairs). The identity within the HSPs was 83.6 % and 84.6 %, respectively, whereas the identity over the whole genome was 9.3 % and 4.4 %, respectively. The comparison of with revealed that 5.4 % of the genome is covered with HSPs, with an identity within in the HSPs of 84.1 % and an identity over the whole genome of 4.6 %. According to these calculations the similarity between and is higher than the similarity between and as well as the similarity between and . The genome size of (3.6 Mb) is significantly smaller than those of (4.3 Mb) and (4.9 Mb).

Conclusions

strain BL2T genome consists of a single chromosome of 3.6 Mb predicted to encode 3,156 genes. Strain BL2T has a relatively small genome in comparison to other sequenced species isolated from the same chicken (4.3-4.9 Mb). These differences of genome size may be the results of adaptation in this niche. Further study will be necessary for elucidation of this idea.
  30 in total

1.  Solexa Ltd.

Authors:  Simon Bennett
Journal:  Pharmacogenomics       Date:  2004-06       Impact factor: 2.533

2.  GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes.

Authors:  Amrita Pati; Natalia N Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Sean D Hooper; Athanasios Lykidis; Nikos C Kyrpides
Journal:  Nat Methods       Date:  2010-05-02       Impact factor: 28.547

Review 3.  En route to a genome-based classification of Archaea and Bacteria?

Authors:  H-P Klenk; M Göker
Journal:  Syst Appl Microbiol       Date:  2010-04-20       Impact factor: 4.022

4.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

5.  Isolation, culture characteristics, and identification of anaerobic bacteria from the chicken cecum.

Authors:  J P Salanitro; I G Fairchilds; Y D Zgornicki
Journal:  Appl Microbiol       Date:  1974-04

6.  A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea.

Authors:  Dongying Wu; Philip Hugenholtz; Konstantinos Mavromatis; Rüdiger Pukall; Eileen Dalin; Natalia N Ivanova; Victor Kunin; Lynne Goodwin; Martin Wu; Brian J Tindall; Sean D Hooper; Amrita Pati; Athanasios Lykidis; Stefan Spring; Iain J Anderson; Patrik D'haeseleer; Adam Zemla; Mitchell Singer; Alla Lapidus; Matt Nolan; Alex Copeland; Cliff Han; Feng Chen; Jan-Fang Cheng; Susan Lucas; Cheryl Kerfeld; Elke Lang; Sabine Gronow; Patrick Chain; David Bruce; Edward M Rubin; Nikos C Kyrpides; Hans-Peter Klenk; Jonathan A Eisen
Journal:  Nature       Date:  2009-12-24       Impact factor: 49.962

7.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

8.  The Genomic Standards Consortium.

Authors:  Dawn Field; Linda Amaral-Zettler; Guy Cochrane; James R Cole; Peter Dawyndt; George M Garrity; Jack Gilbert; Frank Oliver Glöckner; Lynette Hirschman; Ilene Karsch-Mizrachi; Hans-Peter Klenk; Rob Knight; Renzo Kottmann; Nikos Kyrpides; Folker Meyer; Inigo San Gil; Susanna-Assunta Sansone; Lynn M Schriml; Peter Sterk; Tatiana Tatusova; David W Ussery; Owen White; John Wooley
Journal:  PLoS Biol       Date:  2011-06-21       Impact factor: 8.029

9.  Genomic Encyclopedia of Type Strains, Phase I: The one thousand microbial genomes (KMG-I) project.

Authors:  Nikos C Kyrpides; Tanja Woyke; Jonathan A Eisen; George Garrity; Timothy G Lilburn; Brian J Beck; William B Whitman; Phil Hugenholtz; Hans-Peter Klenk
Journal:  Stand Genomic Sci       Date:  2013-12-17

10.  Improving microbial genome annotations in an integrated database context.

Authors:  I-Min A Chen; Victor M Markowitz; Ken Chu; Iain Anderson; Konstantinos Mavromatis; Nikos C Kyrpides; Natalia N Ivanova
Journal:  PLoS One       Date:  2013-02-12       Impact factor: 3.240

View more
  1 in total

1.  High-quality draft genome sequence of Gracilimonas tropica CL-CB462(T) (DSM 19535(T)), isolated from a Synechococcus culture.

Authors:  Dong Han Choi; Chisang Ahn; Gwang Il Jang; Alla Lapidus; James Han; T B K Reddy; Marcel Huntemann; Amrita Pati; Natalia Ivanova; Victor Markowitz; Manfred Rohde; Brian Tindall; Markus Göker; Tanja Woyke; Hans-Peter Klenk; Nikos C Kyrpides; Byung Cheol Cho
Journal:  Stand Genomic Sci       Date:  2015-11-11
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.