| Literature DB >> 21475586 |
Amrita Pati, Sabine Gronow, Ahmet Zeytun, Alla Lapidus, Matt Nolan, Nancy Hammon, Shweta Deshpande, Jan-Fang Cheng, Roxane Tapia, Cliff Han, Lynne Goodwin, Sam Pitluck, Konstantinos Liolios, Ioanna Pagani, Natalia Ivanova, Konstantinos Mavromatis, Amy Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, Yun-Juan Chang, Cynthia D Jeffries, John C Detter, Evelyne Brambilla, Manfred Rohde, Markus Göker, Tanja Woyke, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk, Susan Lucas.
Abstract
Bacteroides helcogenes Benno et al. 1983 is of interest because of its isolated phylogenetic location and, although it has been found in pig feces and is known to be pathogenic for pigs, occurrence of this bacterium is rare and it does not cause significant damage in intensive animal husbandry. The genome of B. helcogenes P 36-108(T) is already the fifth completed and published type strain genome from the genus Bacteroides in the family Bacteroidaceae. The 3,998,906 bp long genome with its 3,353 protein-coding and 83 RNA genes consists of one circular chromosome and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: Bacteroidaceae; GEBA; Gram-negative; animal pathogen; chemoorganotrophic; mesophilic; nonmotile; pig abscess; strictly anaerobic
Year: 2011 PMID: 21475586 PMCID: PMC3072090 DOI: 10.4056/sigs.1513795
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of B. helcogenes relative to those type strains within the genus that appeared within a monophyletic Bacteroides main clade in preliminary analyses. Note that several of the Bacteroides type strain 16S rRNA sequences (from B. cellulosolvens, B. galacturonicus, B. pectinophilus, B. vulgatus) did not cluster together with this clade (data not shown, but see [5]) and were omitted from the main phylogenetic inference analysis. The same holds for the sequence from Anaerorhabdus furcosa (GU585668; also Bacteroidaceae). Other Bacteroides species lacked a sufficiently long 16S rRNA sequence and also had to be omitted (B. coagulans, B. xylanolyticus). The tree was inferred from 1,414 aligned characters [6,7] of the 16S rRNA gene sequence under the maximum likelihood criterion [8] and rooted with the type strain of the family 'Prevotellaceae'. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 1,000 bootstrap replicates [9] if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [10] are shown in blue, published genomes [11] and Prevotella melaninogenica released Genbank accession CP002122 in bold.
Figure 2Scanning electron micrograph of B. helcogenes P 36-108T
Classification and general features of B. helcogenes P 36-108T according to the MIGS recommendations [12].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class ' | TAS [ | ||
| Order ' | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain P 36-108 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | rod-shaped, single or in pairs | TAS [ | |
| Motility | non-motile | TAS [ | |
| Sporulation | none | TAS [ | |
| Temperature range | mesophile | TAS [ | |
| Optimum temperature | 37°C | TAS [ | |
| Salinity | normal | TAS [ | |
| MIGS-22 | Oxygen requirement | strictly anaerobic | TAS [ |
| Carbon source | carbohydrates | TAS [ | |
| Energy source | chemoorganotroph | TAS [ | |
| MIGS-6 | Habitat | host | TAS [ |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | animal pathogen | TAS [ |
| Biosafety level | 2 | TAS [ | |
| Isolation | | TAS [ | |
| MIGS-4 | Geographic location | Japan | TAS [ |
| MIGS-5 | Sample collection time | 1974 | TAS [ |
| MIGS-4.1 | Latitude | not reported | NAS |
| MIGS-4.2 | Longitude | not reported | NAS |
| MIGS-4.3 | Depth | not reported | NAS |
| MIGS-4.4 | Altitude | not reported | NAS |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [25]. If the evidence code is IDA, then the property was directly observed by one of the authors or an expert mentioned in the acknowledgements.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three genomic libraries: |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 56.3 × Illumina; 36.7 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3-PreRelease-10-21-2009-gcc-4.1.2-threads, |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | CP002352 | |
| Genbank Date of Release | January 18, 2011 | |
| GOLD ID | Gc01593 | |
| NCBI project ID | 41913 | |
| Database: IMG-GEBA | 2503538016 | |
| MIGS-13 | Source material identifier | DSM 20613 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 3,998,906 | 100.00% |
| DNA coding region (bp) | 3,583,947 | 89.62% |
| DNA G+C content (bp) | 1,788,209 | 44.72% |
| Number of replicons | 1 | 100.00% |
| Extrachromosomal elements | 0 | |
| Total genes | 3,436 | 100.00% |
| RNA genes | 83 | 2.42% |
| rRNA operons | 5 | |
| Protein-coding genes | 3,353 | 97.58% |
| Pseudo genes | 109 | 3.17% |
| Genes with function prediction | 2,215 | 64.46% |
| Genes in paralog clusters | 454 | 13.21% |
| Genes assigned to COGs | 2103 | 61.20% |
| Genes assigned Pfam domains | 2360 | 68.68% |
| Genes with signal peptides | 980 | 28.52% |
| Genes with transmembrane helices | 798 | 23.22% |
| CRISPR repeats | 1 |
Figure 3Graphical circular map of the chromosome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 147 | 6.5 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0 | RNA processing and modification |
| K | 157 | 6.9 | Transcription |
| L | 125 | 5.5 | Replication, recombination and repair |
| B | 0 | 0 | Chromatin structure and dynamics |
| D | 20 | 0.9 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0 | Nuclear structure |
| V | 67 | 2.9 | Defense mechanisms |
| T | 125 | 5.5 | Signal transduction mechanisms |
| M | 245 | 10.8 | Cell wall/membrane/envelope biogenesis |
| N | 5 | 0.2 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 48 | 2.1 | Intracellular trafficking, secretion, and vesicular transport |
| O | 66 | 2.9 | Posttranslational modification, protein turnover, chaperones |
| C | 120 | 5.3 | Energy production and conversion |
| G | 185 | 8.1 | Carbohydrate transport and metabolism |
| E | 149 | 6.5 | Amino acid transport and metabolism |
| F | 67 | 2.9 | Nucleotide transport and metabolism |
| H | 120 | 5.3 | Coenzyme transport and metabolism |
| I | 64 | 2.8 | Lipid transport and metabolism |
| P | 161 | 7.6 | Inorganic ion transport and metabolism |
| Q | 20 | 0.9 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 266 | 11.7 | General function prediction only |
| S | 122 | 5.4 | Function unknown |
| - | 1,333 | 38.8 | Not in COGs |