| Literature DB >> 26380632 |
Ron Yates1, John Howieson2, Sofie E De Meyer2, Rui Tian2, Rekha Seshadri3, Amrita Pati3, Tanja Woyke3, Victor Markowitz4, Natalia Ivanova3, Nikos Kyrpides5, Angelo Loi6, Brad Nutt6, Giovanni Garau7, Leonardo Sulas8, Wayne Reeve2.
Abstract
Rhizobium sullae strain WSM1592 is an aerobic, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen (N2) fixing root nodule formed on the short-lived perennial legume Hedysarum coronarium (also known as Sulla coronaria or Sulla). WSM1592 was isolated from a nodule recovered from H. coronarium roots located in Ottava, bordering Sassari, Sardinia in 1995. WSM1592 is highly effective at fixing nitrogen with H. coronarium, and is currently the commercial Sulla inoculant strain in Australia. Here we describe the features of R. sullae strain WSM1592, together with genome sequence information and its annotation. The 7,530,820 bp high-quality permanent draft genome is arranged into 118 scaffolds of 118 contigs containing 7.453 protein-coding genes and 73 RNA-only encoding genes. This rhizobial genome is sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.Entities:
Keywords: Alphaproteobacteria; GEBA-RNB; Nitrogen fixation; Rhizobia; Root-nodule bacteria
Year: 2015 PMID: 26380632 PMCID: PMC4572446 DOI: 10.1186/s40793-015-0020-2
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Images of Rhizobium sullae strain WSM1592 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right)
Fig. 2Phylogenetic tree highlighting the position of R. sullae strain WSM1592 (shown in blue print) relative to other type and non-type rhizobia strains using a 901 bp internal region of the 16S rRNA gene. Bradyrhizobium elkanii ATCC 49852T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [33]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [18] have the GOLD ID mentioned after the strain number and represented in bold, otherwise the NCBI accession number is provided. Finished genomes are designated with an asterisk
Classification and general features of R. sullae strain WSM1592 [16, 34]
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Classification | Domain Bacteria | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| (Type) | IDA | ||
| Gram stain | Negative | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Non-sporulating | NAS | |
| Temperature range | Not reported | ||
| Optimum temperature | 28 °C | NAS | |
| pH range; Optimum | Not reported | ||
| Carbon source | Not reported | ||
| MIGS-6 | Habitat | Soil, root nodule, on host | IDA |
| MIGS-6.3 | Salinity | Non-halophile | NAS |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Free living, symbiotic | IDA |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| MIGS-4 | Geographic location | Sassari, Italy | IDA |
| MIGS-5 | Soil collection date | 20 May 1995 | IDA |
| MIGS-4.1 | Latitude | 8.465 | IDA |
| MIGS-4.2 | Longitude | 40.777 | IDA |
| MIGS-4.4 | Altitude | 83 m | IDA |
Evidence codes – IDA Inferred from Direct Assay; TAS Traceable Author Statement (i.e., a direct report exists in the literature); NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [42]
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality permanent draft |
| MIGS-28 | Libraries used | Illumina Std PE (2x150bps) |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Fold coverage | 877x |
| MIGS-30 | Assemblers | Velvet 1.1.04; Allpaths-LG r39750 |
| MIGS-32 | Gene calling methods | Prodigal 1.4 |
| Locus Tag | A3C1 | |
| Genbank ID | ATZB00000000 | |
| Genbank Date of Release | December 12, 2013 | |
| GOLD ID | Gp0010240 | |
| BIOPROJECT | PRJNA165333 | |
| MIGS-13 | Source Material Identifier | WSM1592 |
| Project relevance | Symbiotic N2 fixation, agriculture |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 7,530,820 | 100.00 |
| DNA coding (bp) | 6,571,312 | 87.26 |
| DNA G + C (bp) | 4,508,646 | 59.87 |
| DNA scaffolds | 118 | |
| Total genes | 7,526 | 100.00 |
| Protein coding genes | 7,453 | 99.03 |
| RNA genes | 73 | 0.97 |
| Pseudo genes | 0 | 0 |
| Genes in internal clusters | 538 | 7.15 |
| Genes with function prediction | 5,902 | 78.42 |
| Genes assigned to COGs | 5,148 | 68.40 |
| Genes with Pfam domains | 6,174 | 82.04 |
| Genes with signal peptides | 659 | 8.76 |
| Genes with transmembrane helices | 1,699 | 22.58 |
| CRISPR repeats | 0 | 0 |
Number of genes associated with the general COG functional categories
| Code | Value | % age | COG Category |
|---|---|---|---|
| J | 186 | 3.25 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | RNA processing and modification |
| K | 554 | 9.67 | Transcription |
| L | 158 | 2.76 | Replication, recombination and repair |
| B | 2 | 0.03 | Chromatin structure and dynamics |
| D | 36 | 0.63 | Cell cycle control, Cell division, chromosome partitioning |
| V | 65 | 1.13 | Defense mechanisms |
| T | 211 | 3.68 | Signal transduction mechanisms |
| M | 293 | 5.11 | Cell wall/membrane/envelope biogenesis |
| N | 68 | 1.19 | Cell motility |
| U | 112 | 1.95 | Intracellular trafficking, secretion, and vesicular transport |
| O | 167 | 2.91 | Posttranslational modification, protein turnover, chaperones |
| C | 327 | 5.71 | Energy production and conversion |
| G | 614 | 10.71 | Carbohydrate transport and metabolism |
| E | 684 | 11.94 | Amino acid transport and metabolism |
| F | 107 | 1.87 | Nucleotide transport and metabolism |
| H | 174 | 3.04 | Coenzyme transport and metabolism |
| I | 196 | 3.42 | Lipid transport and metabolism |
| P | 318 | 5.55 | Inorganic ion transport and metabolism |
| Q | 138 | 2.41 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 743 | 12.96 | General function prediction only |
| S | 578 | 10.09 | Function unknown |
| - | 2378 | 31.60 | Not in COGS |
The total is based on the total number of protein coding genes in the genome