| Literature DB >> 21304679 |
Wayne Reeve, Graham O'Hara, Patrick Chain, Julie Ardley, Lambert Bräu, Kemanthi Nandesena, Ravi Tiwari, Stephanie Malfatti, Hajnalka Kiss, Alla Lapidus, Alex Copeland, Matt Nolan, Miriam Land, Natalia Ivanova, Konstantinos Mavromatis, Victor Markowitz, Nikos Kyrpides, Vanessa Melino, Matthew Denton, Ron Yates, John Howieson.
Abstract
Rhizobium leguminosarum bv trifolii is the effective nitrogen fixing microsymbiont of a diverse range of annual and perennial Trifolium (clover) species. Strain WSM2304 is an aerobic, motile, non-spore forming, Gram-negative rod, isolated from Trifolium polymorphum in Uruguay in 1998. This microsymbiont predominated in the perennial grasslands of Glencoe Research Station, in Uruguay, to competitively nodulate its host, and fix atmospheric nitrogen. Here we describe the basic features of WSM2304, together with the complete genome sequence, and annotation. This is the first completed genome sequence for a nitrogen fixing microsymbiont of a clover species from the American center of origin. We reveal that its genome size is 6,872,702 bp encoding 6,643 protein-coding genes and 62 RNA only encoding genes. This multipartite genome was found to contain 5 distinct replicons; a chromosome of size 4,537,948 bp and four circular plasmids of size 1,266,105 bp, 501,946 bp, 308,747 bp and 257,956 bp.Entities:
Keywords: Alphaproteobacteria; Gram-negative rod; aerobic; microsymbiont; nitrogen fixation; non-pathogenic; root-nodule bacteria
Year: 2010 PMID: 21304679 PMCID: PMC3035254 DOI: 10.4056/sigs.44642
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of R. leguminosarum bv trifolii WSM2304 in accordance with the MIGS recommendations [8].
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain WSM2304 | |||
| Gram stain | negative | TAS [ | |
| Cell shape | rod | TAS [ | |
| Motility | motile | TAS [ | |
| Sporulation | non-sporulating | TAS [ | |
| Temperature range | mesophile | TAS [ | |
| Optimum temperature | 28°C | TAS [ | |
| Salinity | unknown | TAS [ | |
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| Carbon source | glucose, mannitol | TAS [ | |
| Energy source | chemoheterotroph | TAS [ | |
| MIGS-6 | Habitat | Soil, root nodule, host | TAS [ |
| MIGS-15 | Biotic relationship | Free living, Symbiotic | TAS [ |
| MIGS-14 | Pathogenicity | none | TAS [ |
| Biosafety level | 1 | TAS [ | |
| Isolation | TAS [ | ||
| MIGS-4 | Geographic location | Glencoe Research Station, | TAS [ |
| MIGS-5 | Sample collection time | December 1st, 1998 | TAS [ |
| MIGS-4.1 MIGS-4.2 | Latitude – Longitude | -56 | TAS [ |
| MIGS-4.3 | Depth | 5cm soil depth | NAS [ |
| MIGS-4.4 | Altitude | 130m | TAS [ |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [23]. If the evidence code is IDA, then the property was directly observed for a living isolate by one of the authors or an expert mentioned in the acknowledgements
Figure 1Images of R. leguminosarum bv trifolii strain WSM2304 using scanning (A) and transmission electron microscopy (B). The appearance of colony morphology on solid media (C).
Figure 2Phylogenetic tree showing the relationships of R. leguminosarum bv trifolii strain WSM2304 with the type strains of Rhizobiaceae based on aligned sequences of the 16S rRNA gene (1,440 bp internal region). All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 3.1 [25]. Kimura two-parameter distances were derived from the aligned sequences [26] and a bootstrap analysis [27] as performed with 500 replicates in order to construct a consensus unrooted tree using the neighbor-joining method [28] for each gene alignment separately. The genera in this tree include Bradyrhizobium (B.), Mesorhizobium (M), Rhizobium (R); Ensifer (Sinorhizobium) (S). Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [22] are in bold red print. Published genomes are designated with an asterisk.
Genome sequencing project information for R. leguminosarum bv trifolii WSM2304.
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Four genomic libraries: three Sanger libraries;1-2 kb pTH1522, 6-8 kb pMCL200, fosmid pcc1Fos and one 454 pyrosequencing standard library |
| MIGS-29 | Sequencing platforms | ABI3730xl, 454 GS FLX |
| MIGS-31.2 | Sequencing coverage | 21.3 x Sanger; 10.1 x Pyrosequencing |
| MIGS-30 | Assemblers | Newbler version 1.1.02.15, Phrap |
| MIGS-32 | Gene calling method | Prodigal |
| Genbank ID | CP001191 (Chromosome)a | |
| Genbank Date of Release | 16-OCTOBER-2008 | |
| GOLD ID | Gc00870f | |
| NCBI project ID | 20179 | |
| Database: IMG | 643348569 g | |
| Project relevance | Symbiotic nitrogen fixation, agriculture |
a http://www.ncbi.nlm.nih.gov/nuccore/209533368
b http://www.ncbi.nlm.nih.gov/nuccore/209537694
c http://www.ncbi.nlm.nih.gov/nuccore/209538856
d http://www.ncbi.nlm.nih.gov/nuccore/209539307
e http://www.ncbi.nlm.nih.gov/nuccore/209539531
f http://genomesonline.org/GOLD_CARDS/Gc00870.html
g http://img.jgi.doe.gov/cgi-bin/pub/main.cgi?section=TaxonDetail&page=taxonDetail&taxon_oid=64173348569
Production sequence for the finished genome of R. leguminosarum bv trifolii WSM2304, JGI project 4024175
| pMCL200 | FHOO | 7.0 ± 0.9 | 74,398 | 66.6 | 51.6 |
| pcc1Fos | FHTU | 36 ± 3.4 | 15,776 | 11.8 | 7.8 |
| pTH1522 | FNNZ | 1.8 ± 0.3 | 79,386 | 68.6 | 53.5 |
| 454-std | FHTW | NA | 719,338 | 69.9 | NA |
Genome Statistics for R. leguminosarum bv trifolii WSM2304.
| Genome size (bp) | 6,872,702 | 100.00% |
| DNA coding region (bp) | 6,053,973 | 88.09% |
| DNA G+C content (bp) | 4,204,577 | 61.18% |
| Number of replicons | 5 | 100.00% |
| Extrachromosomal elements | 4 | 80.00% |
| Total genes | 6,643 | 100.00% |
| RNA coding genes | 62 | 0.93% |
| rRNA operons | 3 | |
| Protein-coding genes | 6,581 | 99.07% |
| Pseudo genes | 166 | 2.49% |
| Genes with function prediction | 4,812 | 72.44% |
| Genes in paralog clusters | 4,104 | 61.78% |
| Genes assigned to COGs | 5,105 | 76.85% |
| Genes assigned Pfam domains | 5,149 | 77.51% |
| Genes with signal peptides | 2,247 | 33.83% |
| Genes with transmembrane helices | 1,495 | 22.50% |
| CRISPR repeats | 0 |
Figure 3Graphical circular map of the chromosome of R. leguminosarum bv trifolii WSM2304. From outside to the center: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew. Chromosome is not drawn to scale relative to the plasmids in Figure 4.
Figure 4Graphical circular map of the plasmids of R. leguminosarum bv trifolii WSM2304. From outside to the center: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew. Plasmids pRLG201, pRLG202, pRLG203 and pRLG204 are not drawn to scale relative to each other or to the chromosome in Figure 3.
The number of predicted protein-coding genes of R. leguminosarum bv trifolii WSM2304 associated with the 21 general COG functional categories.
| | ||||
|---|---|---|---|---|
| J | 194 | 2.95 | Translation, ribosomal structure and biogenesis | |
| A | 0 | 0.00 | RNA processing and modification | |
| K | 558 | 8.48 | Transcription | |
| L | 164 | 2.49 | Replication, recombination and repair | |
| B | 2 | 0.03 | Chromatin structure and dynamics | |
| D | 38 | 0.58 | Cell cycle control, mitosis and meiosis | |
| Y | 0 | 0.00 | Nuclear structure | |
| V | 66 | 1.00 | Defense mechanisms | |
| T | 317 | 4.82 | Signal transduction mechanisms | |
| M | 309 | 4.70 | Cell wall/membrane biogenesis | |
| N | 91 | 1.38 | Cell motility | |
| Z | 0 | 0.00 | Cytoskeleton | |
| W | 0 | 0.00 | Extracellular structures | |
| U | 88 | 1.34 | Intracellular trafficking and secretion | |
| O | 165 | 2.51 | Posttranslational modification, protein turnover, chaperones | |
| C | 314 | 4.77 | Energy production and conversion | |
| G | 599 | 9.10 | Carbohydrate transport and metabolism | |
| E | 687 | 10.44 | Amino acid transport and metabolism | |
| F | 109 | 1.66 | Nucleotide transport and metabolism | |
| H | 180 | 2.74 | Coenzyme transport and metabolism | |
| I | 238 | 3.62 | Lipid transport and metabolism | |
| P | 278 | 4.22 | Inorganic ion transport and metabolism | |
| Q | 156 | 2.37 | Secondary metabolites biosynthesis, transport and catabolism | |
| R | 710 | 10.79 | General function prediction only | |
| S | 532 | 8.08 | Function unknown | |
| - | 1,476 | 22.43 | Not in COGs | |