| Literature DB >> 26566420 |
Aliya D Suleimanova1, Anna A Toymentseva2, Eugenia A Boulygina3, Sergey V Kazakov4, Ayslu M Mardanova1, Nelly P Balaban1, Margarita R Sharipova1.
Abstract
Strain 3.5.1 was isolated from soils of the Republic of Tatarstan, Russia, on the basis of presence of a high phytate-degrading activity. Strains with such activities attract special interest because of its potential use as feed additives and natural manures. Strain 3.5.1 harbors a 99 % 16S rRNA nucleotide sequence similarity to different Pantoea species (P. vagans, P. ananatis, P. agglomerans, P. anthophila and Pantoea sp.) and exhibits unique biochemical properties that do not allow strain identification up to species. Moreover, the strain 3.5.1 shows a low ANI and MALDI-TOF Mass Spectrometry scores. Thus, it is likely that the strain 3.5.1 represents a new Pantoea species. Here, we present the genome sequence of Pantoea sp. strain 3.5.1. The 4,964,649 bp draft genome consists of 23 contigs with 4,556 protein-coding and 143 RNA genes. Genome sequencing and annotation revealed two phytase genes and putative regulatory genes controlling its activity.Entities:
Keywords: 454; Genome; Ion Torrent; Pantoea; Phytase; Strain 3.5.1
Year: 2015 PMID: 26566420 PMCID: PMC4642748 DOI: 10.1186/s40793-015-0093-y
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1General characteristics of the strain 3.5.1. a Screening for phytate-hydrolyzing activity on PSM solid medium. Strain 3.5.1 is indicated by red arrow. b Phase contrast micrograph of the strain 3.5.1. c and d Scanning electron micrographs of the strain 3.5.1 (Carl Zeiss, Merlin)
Classification and general features of Pantoea sp. 3.5.1. in accordance with the MIGS recommendations [42] as published by the Genome Standards Consortium [43]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain: 3.5.1 | IDA | ||
| Gram stain | Negative | IDA | |
| Cell shape | Rod-shaped | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Non-spore forming | IDA | |
| Temperature range | Mesophilic | IDA | |
| Optimum temperature | +37 °C | IDA | |
| pH range; Optimum | 3.5–7; 5 | IDA | |
| Carbon source | D-glucose, lactose, maltose, mannite | IDA | |
| Energy source | Chemoorganotroph | NAS | |
| MIGS-6 | Habitat | Soil | IDA |
| MIGS-6.3 | Salinity | Not tested | |
| MIGS-22 | Oxygen requirement | Facultative aerobic | IDA |
| MIGS-15 | Biotic relationship | Free living | IDA |
| MIGS-14 | Pathogenicity | Opportunistic pathogen | NAS |
| MIGS-4 | Geographic location | Agerze village, Aznakaevo district, Republic of Tatarstan, Russia | IDA |
| MIGS-5 | Sample collection | September 2010 | IDA |
| MIGS-4.1 | Latitude | 54°83´ | IDA |
| MIGS-4.2 | Longitude | 53°00´ | IDA |
| MIGS-4.3 | Depth | 25 cm | IDA |
| MIGS-4.4 | Altitude | 233 m | IDA |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [53]
Fig. 2Maximum likelihood phylogenetic tree highlighting the position of the strain 3.5.1 relative to other species within the genus Pantoea based on 16S rRNA gene sequences. Bootstrap consensus tree were inferred from 1,000 replicates. Two E. coli strains were used as outgroup. The scale bar, 0,005 substitutions per nucleotide position. The phylogenetic tree was obtained by MEGA 6 software [44]. The corresponding GenBank accession numbers for 16S rRNA sequences are: NR_074740 (P. ananatis AJ13355), FJ611814 (P. ananatis LMG 20103), FJ611845 (P. ananatis LMG 5342), NC_017554 (P. ananatis PA13), NZ_JMRT02000019 (the strain 3.5.1), NZ_KK403348 (P. agglomerans Eh318), NR_102966 (P. vagans C9-1), JXXL01000005 (P. anthophila 11–2), FJ611810 (Pantoea sp. Sc1), AJ311838 (P. stewartii subsp. stewartii DC283), KJ830125 (P. stewartii M009), NR_119256 (P. stewartii subsp. indologenes LMG 2632), JX861128 (Pantoea sp. IMH), NZ_CP009454 (P. rwandensis ND04), NC_014837 (Pantoea sp. At-9b), NR_102804 (E. coli K-12 substr. MG1655), NZ_CP007592.1 (E. coli O157:H16 strain Santai)
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | High quality draft |
| MIGS-28 | Libraries used | Two single-end libraries of 200 bp and 600 bp |
| MIGS 29 | Sequencing platforms | Ion Torrent PGM and 454 GS Junior |
| MIGS 31.2 | Fold coverage | 32× |
| MIGS 30 | Assemblers | SPAdes 3.5.0 |
| MIGS 32 | Gene calling method | GeneMark, RAST |
| Locus Tag | EP46 | |
| Genbank ID | JMRT00000000.2 | |
| GenBank Date of Release | April 16, 2015 | |
| GOLD ID | Gp0114842 | |
| BIOPROJECT | PRJNA246264 | |
| MIGS 13 | Source Material Identifier | 3.5.1 |
| Project relevance | Phytase producer bacterium, Agricultural |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 4,964,649 | 100.00 |
| DNA coding (bp) | 4,306,583 | 86.74 |
| DNA G + C (bp) | 2,768,589 | 55.77 |
| DNA scaffolds | 23 | 100.00 |
| Total genes | 4,699 | 100.00 |
| Protein coding genes | 4,556 | 96.96 |
| RNA genes | 143 | 3.04 |
| Pseudo genes | 135 | 2.87 |
| Genes in internal clusters | NA | |
| Genes with function prediction | 3,921 | 83.44 |
| Genes assigned to COGs | 3,507 | 74.63 |
| Genes with Pfam domains | 4,059 | 86.38 |
| Genes with signal peptides | 425 | 9.04 |
| Genes with transmembrane helices | 1,075 | 22.88 |
| CRISPR repeats | 1 | - |
Number of genes associated with general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 250 | 6.28 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.03 | RNA processing and modification |
| K | 366 | 9.19 | Transcription |
| L | 127 | 3.19 | Replication, recombination and repair |
| B | - | - | Chromatin structure and dynamics |
| D | 45 | 1.13 | Cell cycle control, Cell division, chromosome partitioning |
| V | 85 | 2.13 | Defense mechanisms |
| T | 186 | 4.67 | Signal transduction mechanisms |
| M | 250 | 6.28 | Cell wall/membrane biogenesis |
| N | 106 | 2.66 | Cell motility |
| U | 48 | 1.21 | Intracellular trafficking and secretion |
| O | 136 | 3.42 | Posttranslational modification, protein turnover, chaperones |
| C | 212 | 5.32 | Energy production and conversion |
| G | 427 | 10.72 | Carbohydrate transport and metabolism |
| E | 395 | 9.92 | Amino acid transport and metabolism |
| F | 101 | 2.54 | Nucleotide transport and metabolism |
| H | 209 | 5.25 | Coenzyme transport and metabolism |
| I | 131 | 3.29 | Lipid transport and metabolism |
| P | 252 | 6.33 | Inorganic ion transport and metabolism |
| Q | 80 | 2.01 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 342 | 8.59 | General function prediction only |
| S | 204 | 5.12 | Function unknown |
| - | 1192 | 25.37 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Fig. 3Circular representation of the genome comparison between the Pantoea sp. 3.5.1 strain and P. vagans C9-1. The inner ring shows GC content (in black) and GC skew (in purple/green). Color intensity within the circle represents the levels of nucleotide homology (from 70 % to 100 %). The external ring shows CDS of two detected phytase genes and the candidate regulatory genes