| Literature DB >> 24501629 |
Saraswoti Neupane1, Lynne A Goodwin2, Nils Högberg1, Nikos C Kyrpides3, Sadhna Alström1, David Bruce2, Beverly Quintana2, Christine Munk2, Hajnalka Daligault2, Hazuki Teshima2, Karen Davenport2, Krista Reitenga2, Lance Green2, Patrick Chain2, Tracy Erkkila2, Wei Gu2, Xiaojing Zhang2, Yan Xu2, Yulia Kunde2, Olga Chertkov2, James Han3, Cliff Han2, John C Detter2, Natalia Ivanova3, Amrita Pati3, Amy Chen3, Ernest Szeto3, Kostas Mavromatis3, Marcel Huntemann3, Matt Nolan3, Sam Pitluck3, Shweta Deshpande3, Victor Markowitz3, Ioanna Pagani3, Hans-Peter Klenk4, Tanja Woyke3, Roger D Finlay1.
Abstract
Serratia proteamaculans S4 (previously Serratia sp. S4), isolated from the rhizosphere of wild Equisetum sp., has the ability to stimulate plant growth and to suppress the growth of several soil-borne fungal pathogens of economically important crops. Here we present the non-contiguous, finished genome sequence of S. proteamaculans S4, which consists of a 5,324,944 bp circular chromosome and a 129,797 bp circular plasmid. The chromosome contains 5,008 predicted genes while the plasmid comprises 134 predicted genes. In total, 4,993 genes are assigned as protein-coding genes. The genome consists of 22 rRNA genes, 82 tRNA genes and 58 pseudogenes. This genome is a part of the project "Genomics of four rapeseed plant growth-promoting bacteria with antagonistic effect on plant pathogens" awarded through the 2010 DOE-JGI's Community Sequencing Program.Entities:
Keywords: Facultative aerobe; agriculture; chemoorganotrophic; gram-negative; mesophilic; motile; non-sporulating
Year: 2013 PMID: 24501629 PMCID: PMC3910699 DOI: 10.4056/sigs.4027757
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of S4 in relation to other type and non-type strains within the family . The tree is based on 1,489 characters of the 16S rRNA gene sequence aligned in ClustalW2 [6] under the default settings. The tree was constructed using MEGA5 software [7] under the Maximum likelihood criterion and the tree was rooted with (a member of the family ). The branches are scaled according to the expected number of substitutions per site. The numbers above the branches are support values from 1,000 bootstrap replicates if larger than 60% [8]. All lineages with genome sequences are registered in GOLD [9].
Figure 2Scanning electron micrograph of S4
Classification and general features of S4 according to the MIGS recommendations [12]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain S4 | IDA | ||
| Gram stain | Negative | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Non-sporulating | IDA | |
| Temperature range | 4 – 40 °C | IDA | |
| Optimum temperature | 28 °C | IDA | |
| Carbon source | Glucose, sucrose, succinate, mannitol, arabinose, sorbitol, inositol | IDA | |
| Energy source | Chemoorganotrophic | IDA | |
| MIGS-6 | Habitat | Wild | IDA |
| MIGS-6.3 | Salinity | Medium | IDA |
| MIGS-22 | Oxygen | Facultative | IDA |
| MIGS-15 | Biotic relationship | Plant associated | IDA |
| MIGS-14 | Pathogenicity | None | NAS |
| MIGS-4 | Geographic location | Uppsala, Sweden | NAS |
| MIGS-5 | Sample collection time | 1980 | NAS |
| MIGS-4.1 | Latitude – | 59 | NAS |
| MIGS-4.2 | Longitude | 17 | NAS |
| MIGS-4.3 | Depth | 0.1 m | NAS |
| MIGS-4.4 | Altitude | 58 - 63 m | NAS |
a) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [25]. If the evidence code is IDA, then the property should have been directly observed, for the purpose of this specific publication, for a live isolate by one of the authors, or an expert or reputable institution mentioned in the acknowledgements.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Non-contiguous Finished |
| MIGS-28 | Libraries used | Three libraries: one 454 standard library, one paired end 454 library (10 kb insert size) |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Fold coverage | 767.4 × Illumina, 8.7 × pyrosequencing |
| MIGS-30 | Assemblers | Velvet version 1.1.05, Newbler version 2.6, phrap version SPS – 4.24 |
| MIGS-32 | Gene calling method | Prodigal (1.4), GenePRIMP |
| NCBI project ID | 61833 | |
| NCBI taxon ID | 768491 | |
| IMG object ID | 2508501071 | |
| GOLD ID | Gi08429 | |
| Project relevance | Biocontrol, Agriculture |
Genome statistics
| Value | % of totala | |
|---|---|---|
| Genome size (bp) | 5,454,741 | 100.00 |
| DNA coding region (bp) | 4,825,361 | 88.46 |
| DNA G+C content (bp) | 2,999,404 | 54.99 |
| Total genes | 5,142 | 100.00 |
| RNA genes | 149 | 2.90 |
| rRNA operons | 7 | |
| Protein-coding genes | 4,993 | 97.10 |
| Pseudogenes | 58 | 1.13 |
| Genes in paralog clusters | 2,759 | 53.66 |
| Genes assigned to COGs | 4,247 | 82.59 |
| Genes with signal peptides | 1,154 | 22.44 |
| Genes with transmembrane helices | 1,236 | 24.04 |
a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 201 | 4.18 | Translation |
| A | 1 | 0.02 | RNA processing and modification |
| K | 452 | 9.41 | Transcription |
| L | 158 | 3.29 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 37 | 0.77 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 57 | 1.19 | Defense mechanisms |
| T | 198 | 4.12 | Signal transduction mechanisms |
| M | 256 | 5.33 | Cell wall/membrane biogenesis |
| N | 142 | 2.96 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structures |
| U | 166 | 3.46 | Intracellular trafficking and secretion |
| O | 153 | 3.18 | Posttranslational modification, protein turnover, chaperones |
| C | 275 | 5.72 | Energy production and conversion |
| G | 427 | 8.89 | Carbohydrate transport and metabolism |
| E | 487 | 10.14 | Amino acid transport and metabolism |
| F | 109 | 2.27 | Nucleotide transport and metabolism |
| H | 179 | 3.73 | Coenzyme transport and metabolism |
| I | 139 | 2.89 | Lipid transport and metabolism |
| P | 287 | 5.97 | Inorganic ion transport and metabolism |
| Q | 122 | 2.54 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 549 | 11.43 | General function prediction only |
| S | 408 | 8.49 | Function unknown |
| - | 895 | 17.41 | Not in COGs |
a) The total is based on the total number of protein coding genes in the annotated genome.
Figure 3aGraphical circular map of the chromosome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs blue, rRNAs red, other RNAs black), GC content, GC skew.
Figure 3bGraphical circular map of the plasmid. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs blue, rRNAs red, other RNAs black), GC content, GC skew.