| Literature DB >> 27699151 |
Farzaneh Moghadam1, M B Couger1, Breeanna Russ1, Randi Ramsey1, Radwa A Hanafy1, Connie Budd1, Donald P French2, Wouter D Hoff1, Noha Youssef1.
Abstract
The genus Pantoea is a predominant member of host-associated microbiome. We here report on the genomic analysis of Pantoea eucrina strain Russ that was isolated from a trashcan at Oklahoma State University, Stillwater, OK. The draft genome of Pantoea eucrina strain Russ consists of 3,939,877 bp of DNA with 3704 protein-coding genes and 134 RNA genes. This is the first report of a genome sequence of a member of Pantoea eucrina. Genomic analysis revealed metabolic versatility with genes involved in the metabolism and transport of all amino acids as well as glucose, fructose, mannose, xylose, arabinose and galactose, suggesting the organism is a versatile heterotroph. The genome also encodes an extensive secretory machinery including types I, II, III, IV, and Vb secretion systems, and several genes for pili production including the new usher/chaperone system (pfam 05,229). The implications of these systems for opportunistic pathogenesis are discussed.Entities:
Keywords: Detailed annotation; Draft genome sequence; Metabolic versatility; Pantoea eucrina; Pili; Secretion systems; Student Initiated Microbial Discovery (SIMD) project
Year: 2016 PMID: 27699151 PMCID: PMC5037212 DOI: 10.1016/j.gdata.2016.09.006
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Project information.
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Draft |
| MIGS-28 | Libraries used | Illumina 2X300 paired end chemistry |
| MIGS 29 | Sequencing platforms | Illumina Miseq |
| MIGS 31.2 | Fold coverage | 300x |
| MIGS 30 | Assemblers | Velvet 2.0 |
| MIGS 32 | Gene calling method | Prodigal, IMG-ER |
| Genbank ID | MAYN00000000 | |
| GenBank date of release | July 2016 | |
| GOLD ID | Gp0126758 | |
| BIOPROJECT | PRJNA327384 | |
| MIGS 13 | Project relevance | Environmental |
Fig. 1Negative stain TEM micrograph of Pantoea eucrina Russ.
Classification and general features of Pantoea eucrina Russ [25].
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Classification | Domain Bacteria | TAS | |
| Phylum Proteobacteria | TAS | ||
| Class Gammaproteobacteria | TAS | ||
| Order Enterobacteriales | TAS | ||
| Family Enterobacteriaceae | TAS | ||
| Genus | TAS | ||
| Species | TAS | ||
| Strain: Russ | |||
| Gram stain | Negative | TAS | |
| Cell shape | Rod | TAS | |
| Motility | Motile | TAS | |
| Sporulation | Non-spore forming | TAS | |
| Temperature range | Mesophile | TAS | |
| Optimum temperature | 28 °C | TAS | |
| pH range; optimum | Unknown | ||
| Carbon source | TAS | ||
| MIGS-6 | Habitat | Trashcan | IDA |
| MIGS-6.3 | Salinity | Growth in TSA (0.5%) | IDA |
| MIGS-22 | Oxygen requirement | Facultative anaerobe | TAS |
| MIGS-15 | Biotic relationship | Free-living | IDA |
| MIGS-14 | Pathogenicity | Unknown | |
| MIGS-4 | Geographic location | Stillwater, Oklahoma | IDA |
| MIGS-5 | Sample collection | March 2015 | IDA |
| MIGS-4.1 | Latitude | 36.1157 | IDA |
| MIGS-4.2 | Longitude | − 97.0586 | IDA |
| MIGS-4.4 | Altitude | 1 M | IDA |
Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [27].
Fig. 2A maximum likelihood phylogenetic tree constructed using multiple sequence alignments of 16S rRNA genes. “Pantoea eucrina Russ” sequence is shown in bold. Reference sequences are also shown and Genbank accession numbers are given in parentheses. The tree was obtained under “K2 + G” model with a variable site γ shape parameter of 0.05. “Escherichia coli” was used as the out-group. Bootstrap values, in percent, are based on 100 replicates and are shown for branches with > 50% bootstrap support. Multiple sequence alignment, model selection, and maximum likelihood analysis were carried out in MEGA [28].
Genome statistics.
| Attribute | Value | % of total |
|---|---|---|
| Genome size (bp) | 3,939,877 | 100% |
| DNA coding (bp) | 3,459,667 | 87.81% |
| DNA G + C (bp) | 2,205,503 | 55.98% |
| DNA scaffolds | 8 | 100% |
| Total genes | 3838 | 100% |
| Protein coding genes | 3704 | 96.51% |
| RNA genes | 134 | 3.49% |
| Pseudo genes | 0 | |
| Genes in internal clusters | 829 | 21.60% |
| Genes with function prediction | 3079 | 80.22% |
| Genes assigned to COGs | 2865 | 74.65% |
| Genes with Pfam domains | 3267 | 85.12% |
| Genes with signal peptides | 312 | 8.10% |
| Genes with transmembrane helices | 853 | 22.23% |
| CRISPR repeats | 0 |
Number of genes associated with general COG functional categories.
| Code | Value | % age | Description |
|---|---|---|---|
| J | 242 | 7.5% | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.03% | RNA processing and modification |
| K | 258 | 8% | Transcription |
| L | 124 | 3.84% | Replication, recombination and repair |
| B | 0 | 0% | Chromatin structure and dynamics |
| D | 44 | 1.36% | Cell cycle control, cell division, chromosome partitioning |
| V | 67 | 2.08% | Defense mechanisms |
| T | 186 | 5.77% | Signal transduction mechanisms |
| M | 233 | 7.19% | Cell wall/membrane biogenesis |
| N | 77 | 2.39% | Cell motility |
| U | 39 | 1.21% | Intracellular trafficking and secretion |
| O | 120 | 3.72% | Posttranslational modification, protein turnover, chaperones |
| C | 176 | 5.46% | Energy production and conversion |
| G | 312 | 9.67% | Carbohydrate transport and metabolism |
| E | 315 | 9.76% | Amino acid transport and metabolism |
| F | 95 | 2.94% | Nucleotide transport and metabolism |
| H | 177 | 5.49% | Coenzyme transport and metabolism |
| I | 97 | 3.01% | Lipid transport and metabolism |
| P | 198 | 6.14% | Inorganic ion transport and metabolism |
| Q | 43 | 1.33% | Secondary metabolites biosynthesis, transport and catabolism |
| R | 232 | 7.19% | General function prediction only |
| S | 165 | 5.11% | Function unknown |
| – | 1640 | Not in COGs |
The total is based on the total number of protein coding genes in the genome.
Fig. 3Comparative genomics of Pantoea eucrina strain Russ and 21 closely related genomes. (A) KEGG profile clustering of the genomes compared in this study. (B) PCA biplot of the genomic features and COG category distribution in the genomes compared. Genomes are represented by stars, where the strain name is depicted. Arrows represent genomic features or COG categories used for comparison. The arrow directions follow the maximal abundance, and their lengths are proportional to the maximal rate of change between genomes. The first two components explained 75% of variation.