| Literature DB >> 25197457 |
Jodi L Humann1, Mark Wildung2, Derek Pouchnik2, Austin A Bates3, Jennifer C Drew4, Ursula N Zipperer4, Eric W Triplett4, Dorrie Main1, Brenda K Schroeder3.
Abstract
The Enterobacter cloacae complex is genetically very diverse. The increasing number of complete genomic sequences of E. cloacae is helping to determine the exact relationship among members of the complex. E. cloacae P101 is an endophyte of switchgrass (Panicum virgatum) and is closely related to other E. cloacae strains isolated from plants. The P101 genome consists of a 5,369,929 bp chromosome. The chromosome has 5,164 protein-coding regions, 100 tRNA sequences, and 8 rRNA operons.Entities:
Year: 2014 PMID: 25197457 PMCID: PMC4149030 DOI: 10.4056/sigs.4808608
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of P101 according to MIGS recommendations [21]
| MIGS ID | Property | Term | Evidence Code |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Current classification | Order | TAS [ | |
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain P101 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | rod | TAS [ | |
| Motility | motile via peritrichous flagella | TAS [ | |
| Sporulation | non-sporulating | TAS [ | |
| Temperature range | mesophilic, 25-40°C | TAS [ | |
| Optimum temperature | 30-37°C | TAS [ | |
| Salinity | not reported | ||
| MIGS-22 | Oxygen requirement | facultative anaerobe | TAS [ |
| Carbon source | carbohydrates | TAS [ | |
| Energy source | chemoorganotroph | TAS [ | |
| MIGS-6 | Habitat | soil, switchgrass | TAS [ |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | pathogenic on onion bulb | IDA |
| Biosafety level | 2 | ||
| Isolation | Isolated from switchgrass | TAS [ | |
| MIGS-4 | Geographic location | Wisconsin, USA | TAS [ |
| MIGS-5 | Sample collection time | not reported |
Evidence codes – IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [38]. If the evidence code is IDA, then the property was directly observed for a live isolate by one of the authors, or an expert mentioned in the acknowledgements.
Figure 1Phylogenetic tree of 16S rRNA sequences from sp. with genome sequences. strains grouped separately into a clade from other species using Bayesian phylogenetic analyses of the 16S rRNA region. Analyses were implemented in MRBAYES [39] and the Bayesian Information Criterion (BIC), DT-ModSel [40] was used to determine the nucleotide substitution model best suited for the dataset. To ensure that the average split frequency between runs was less than 1%, the Markov chain Monte Carlo search included two runs with four chains each for 10,000,000 generations. served as the outgroup for the analysis. Numbers in parentheses behind the bacterial names correspond to the GenBank accession numbers for the genome sequences. The scale bar indicates the number of substitutions/site.
P101 Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-29 | Sequencing platform | PacBio RS |
| MIGS-31 | Finishing quality | Finished |
| MIGS-31.2 | Fold coverage | 130× |
| MIGS-30 | Assembler | HGAP [ |
| MIGS-32 | Gene calling method | NCBI Prokaryotic Genome Annotation Pipeline [ |
| GenBank ID | CP006580 | |
| GenBank date of release | December 31, 2013 | |
| Project relevance | Plant-microbe interactions |
P101 Genome Statistics
| Attribute | Value | % of totala |
|---|---|---|
| Genome size (bp) | 5,369,929 | 100% |
| DNA coding region (bp) | 4,773,116 | 88.89% |
| DNA G+C content (bp) | 2,920,174 | 54.38% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genesb | 5,289 | 100% |
| tRNA genes | 100 | 1.89% |
| rRNA operons | 8 | |
| Protein-coding regions | 5,164 | 97.64% |
| Pseudo genes | 29 | 0.55% |
| Genes with function prediction | 4,419 | 83.55% |
| Genes in paralog clusters | 3,903 | 73.79% |
| Genes assigned to COGs | 4,086 | 77.25% |
| Genes assigned Pfam domains | 4,474 | 84.59% |
| Genes with signal peptides | 498 | 9.42% |
| Genes with transmembrane helices | 1,213 | 22.93% |
| CRISPR repeats | 2 |
a The total is based on either the total number of base pairs or the total number of genes in the genome
b Includes the tRNA genes and pseudogenes
Number of genes associated with the general COG functional categories
| Code | Value | %agea | Description |
|---|---|---|---|
| J | 195 | 3.8 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.0 | RNA processing and modification |
| K | 432 | 8.4 | Transcription |
| L | 209 | 4.0 | Replication, recombination and repair |
| B | 0 | 0.0 | Chromatin structure and dynamics |
| D | 35 | 0.7 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 56 | 1.1 | Defense mechanisms |
| T | 240 | 4.6 | Signal transduction mechanisms |
| M | 259 | 5.0 | Cell wall/membrane/envelope biogenesis |
| N | 159 | 3.1 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 141 | 2.7 | Intracellular trafficking, secretion, and vesicular transport |
| O | 145 | 2.8 | Posttranslational modification, protein turnover, chaperones |
| C | 234 | 4.5 | Energy production and conversion |
| G | 445 | 8.6 | Carbohydrate transport and metabolism |
| E | 385 | 7.5 | Amino acid transport and metabolism |
| F | 88 | 1.7 | Nucleotide transport and metabolism |
| H | 167 | 3.2 | Coenzyme transport and metabolism |
| I | 121 | 2.3 | Lipid transport and metabolism |
| P | 234 | 4.5 | Inorganic ion transport and metabolism |
| Q | 90 | 1.7 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 515 | 10.0 | General function prediction only |
| S | 428 | 8.3 | Function unknown |
| - | 343 | 11.3 | Not in COGs |
a The total is based on the total number of protein coding genes in the entire annotated genome