| Literature DB >> 24976892 |
Namita Khanna1, Ananta Kumar Ghosh1, Marcel Huntemann2, Shweta Deshpande2, James Han2, Amy Chen2, Nikos Kyrpides2, Kostas Mavrommatis2, Ernest Szeto2, Victor Markowitz2, Natalia Ivanova2, Ioanna Pagani2, Amrita Pati2, Sam Pitluck2, Matt Nolan2, Tanja Woyke2, Hazuki Teshima3, Olga Chertkov3, Hajnalka Daligault3, Karen Davenport3, Wei Gu3, Christine Munk3, Xiaojing Zhang3, David Bruce3, Chris Detter3, Yan Xu3, Beverly Quintana3, Krista Reitenga3, Yulia Kunde3, Lance Green3, Tracy Erkkila3, Cliff Han3, Evelyne-Marie Brambilla4, Elke Lang4, Hans-Peter Klenk4, Lynne Goodwin3, Patrick Chain3, Debabrata Das1.
Abstract
Enterobacter sp. IIT-BT 08 belongs to Phylum: Proteobacteria, Class: Gammaproteobacteria, Order: Enterobacteriales, Family: Enterobacteriaceae. The organism was isolated from the leaves of a local plant near the Kharagpur railway station, Kharagpur, West Bengal, India. It has been extensively studied for fermentative hydrogen production because of its high hydrogen yield. For further enhancement of hydrogen production by strain development, complete genome sequence analysis was carried out. Sequence analysis revealed that the genome was linear, 4.67 Mbp long and had a GC content of 56.01%. The genome properties encode 4,393 protein-coding and 179 RNA genes. Additionally, a putative pathway of hydrogen production was suggested based on the presence of formate hydrogen lyase complex and other related genes identified in the genome. Thus, in the present study we describe the specific properties of the organism and the generation, annotation and analysis of its genome sequence as well as discuss the putative pathway of hydrogen production by this organism.Entities:
Keywords: Enterobacter sp. IIT-BT 08; biohydrogen; facultative anaerobe; genome sequence
Year: 2013 PMID: 24976892 PMCID: PMC4062630 DOI: 10.4056/sigs.4348035
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Enterobacter sp. IIT-BT 08 according to the MIGS recommendation [8]
| | | | |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| | Order | TAS [ | |
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | [ | ||
| Type strain IIT-BT 08 | |||
| Gram stain | Negative | NAS | |
| Cell shape | Rod shaped | IDA | |
| Motility | motile via peritrichous flagella | IDA | |
| Sporulation | Non-sporulating | IDA | |
| Temperature range | 25 – 40 °C | TAS [ | |
| Optimum temperature | 37 °C | TAS [ | |
| Carbon source | carbohydrates | TAS [ | |
| Energy metabolism | Chemoorganotrophic | IDA | |
| Terminal electron receptor | Oxygen | IDA | |
| MIGS-6 | Habitat | Plant leaves | TAS [ |
| MIGS-6.3 | Salinity | Not studied | |
| MIGS-22 | Oxygen | Facultative anaerobe; grows well under oxic and anoxic conditions | TAS [ |
| MIGS-15 | Biotic relationship | Free-living | IDA |
| MIGS-14 | Pathogenicity | None | |
| MIGS-4 | Geographic location | Kharagpur in the district of West Midnapur, West Bengal, India. | TAS [ |
| MIGS-5 | Sample collection time | July, 1999 | |
| MIGS-4.1 | Latitude | 22 02’ 30” | |
| MIGS-4.2 | Longitude | 87 11’ 0” | |
| MIGS-4.3 | Depth | NA | |
| MIGS-4.4 | Altitude | NA |
a) MIGS: The minimum information about a genome sequence
b) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [21].
Figure 1Phylogenetic tree high-lighting the position of “Enterobacter sp. IIT-BT 08 (•)” relative to other type and non-type strains within the Enterobacteriaceae. Strains shown are those within the Enterobacteriaceae having corresponding NCBI genome project ids. The tree was constructed using Mega4 software. The tree based on Jukes–Cantor distance was constructed using neighbor-joining algorithm with 1,000 bootstrapping. Acetobacterium woodii strain DSM 1030 (⬥) and Desulfocaldus sp. (■) was considered as the out group. The scale bar represents 0.1 substitutions per nucleotide position. Numbers at the nodes are the bootstrap values.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | IGHT and IGGH |
| MIGS-29 | Sequencing platforms | Illumina, 454 |
| MIGS-31.2 | Fold coverage | 50× |
| MIGS-30 | Assemblers | Velvet v. 1.1.05, ALLPATHS v. 39750, Phrap v. 4.24 |
| MIGS-32 | Gene calling method | Gene Prodigal |
| Genome Database release | September 6th, 2012 | |
| NCBI ID | 1070842 | |
| Genbank Date of Release | Not determined | |
| GOLD ID | Gi12106 | |
| Project relevance | Biohydrogen production |
Figure 2Graphical linear map of the genome. From left to right: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Nucleotide content and gene count levels of the genome
| Value | % of totala | |
|---|---|---|
| Genome size (bp) | 4,672,040 | 100.00 |
| DNA coding region (bp) | 4220,082 | 90.33 |
| DNA G+C content (bp) | 2,616273 | 56.01 |
| Total genes | 4,532 | 100.00 |
| RNA genes | 179 | 3.95 |
| Protein-coding genes | 4,393 | 96.05 |
| Genes in paralog clusters | 1,665 | 36.74 |
| Genes assigned to COGs | 3,780 | 83.41 |
| Genes assigned Pfam domains | 3,949 | 87.14 |
| Genes assigned TIGRfam domains | 1,715 | 37.84 |
| Genes with signal peptides | 1,603 | 35.37 |
| Genes with transmembrane helices | 1,101 | 24.29 |
| Pseudo Genesb | 43 | 0.95 |
a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
b) Pseudogenes may also be counted as protein coding or RNA genes, so is not additive under total gene count.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 191 | 4.4 | Translation |
| A | 1 | 0.0 | RNA processing and modification |
| K | 372 | 8.5 | Transcription |
| L | 153 | 3.5 | Replication, recombination and repair |
| B | 0 | 0.0 | Chromatin structure and dynamics |
| D | 32 | 0.7 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 56 | 1.3 | Defense mechanisms |
| T | 220 | 5.1 | Signal transduction mechanisms |
| M | 249 | 5.7 | Cell wall/membrane biogenesis |
| N | 148 | 3.4 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 155 | 3.6 | Intracellular trafficking and secretion |
| O | 146 | 3.4 | Posttranslational modification, protein turnover, chaperones |
| C | 244 | 5.6 | Energy production and conversion |
| G | 396 | 9.1 | Carbohydrate transport and metabolism |
| E | 402 | 9.2 | Amino acid transport and metabolism |
| F | 83 | 1.9 | Nucleotide transport and metabolism |
| H | 165 | 3.8 | Coenzyme transport and metabolism |
| I | 105 | 2.4 | Lipid transport and metabolism |
| P | 246 | 5.7 | Inorganic ion transport and metabolism |
| Q | 80 | 1.8 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 446 | 10.2 | General function prediction only |
| S | 375 | 8.6 | Function unknown |
| - | 88 | 2.0 | Not in COGs |
aThe total is based on the total number of protein coding genes in the entire annotated genome
Preliminary genes involved in the hydrogen production pathway according to the MIGS recommendations [8]
| | | | |
|---|---|---|---|
| FdhD | EntIITBT8DRAFT_3548 | 29.9 | formate dehydrogenase family accessory protein (protease) |
| FdhE | EntIITBT8DRAFT_3551 | 89.4 | formate dehydrogenase, alpha subunit, proteobacterial-type [EC:1.2.1.2 ] |
| FdhF | EntIITBT8DRAFT_1016 | 79.3 | formate dehydrogenase, alpha subunit, archaeal-type (EC:1.2.1.2) |
| HycA | EntIITBT8DRAFT_2685 | 17.5 | Transcriptional repressor of hyc and hyp operons |
| HycD | EntIITBT8DRAFT_2682 | 33.0 | Formate hydrogenlyase subunit 4 |
| HycE | EntIITBT8DRAFT_2681 | 64.8 | Ni, Fe-hydrogenase III large subunit |
| HycG | EntIITBT8DRAFT_2679 | 28.0 | Ni,Fe-hydrogenase III small subunit |
| HycH | EntIITBT8DRAFT_2678 | 15.2 | Formate hydrogenlyase maturation protein |
| HycI | EntIITBT8DRAFT_2677 | 16.2 | hydrogenase maturation protease [EC:3.4.23.51] |
| ? | EntIITBT8DRAFT_2511 | 23.0 | Formate hydrogenlyase subunit 6/NADH:ubiquinone oxidoreductase subunit (chain I) |
| ? | EntIITBT8DRAFT_2683 | 63.5 | Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit |
| HycB?? | EntIITBT8DRAFT_2684 | 21.7 | Fe-S-cluster-containing hydrogenase components 2 |
| HypA | EntIITBT8DRAFT_2686 | 13.1 | hydrogenase nickel insertion protein |
| HypB | EntIITBT8DRAFT_2687 | 31.1 | hydrogenase accessory protein |
| HypC/HupF | EntIITBT8DRAFT_2688 | 9.65 | hydrogenase assembly chaperone HypC/HupF |
| HypD | EntIITBT8DRAFT_2689 | 41.1 | hydrogenase expression/formation protein HypD |
| HypE | EntIITBT8DRAFT_2690 | 35.3 | hydrogenase expression/formation protein HypE |
| HypF | EntIITBT8DRAFT_2672 | 80.5 | [NiFe] hydrogenase maturation protein HypF |
| HoxN/HupN/NixA family | EntIITBT8DRAFT_2671 | 36.7 | high-affinity nickel-transporter |
Figure 3Putative mechanism of hydrogen production by Enterobacter sp. IIT-BT 08 based on the genes identified in the genome. Figure is adapted from [33].