| Literature DB >> 21304680 |
Wayne Reeve, Patrick Chain, Graham O'Hara, Julie Ardley, Kemanthi Nandesena, Lambert Bräu, Ravi Tiwari, Stephanie Malfatti, Hajnalka Kiss, Alla Lapidus, Alex Copeland, Matt Nolan, Miriam Land, Loren Hauser, Yun-Juan Chang, Natalia Ivanova, Konstantinos Mavromatis, Victor Markowitz, Nikos Kyrpides, Margaret Gollagher, Ron Yates, Michael Dilworth, John Howieson.
Abstract
Ensifer (Sinorhizobium) medicae is an effective nitrogen fixing microsymbiont of a diverse range of annual Medicago (medic) species. Strain WSM419 is an aerobic, motile, non-spore forming, Gram-negative rod isolated from a M. murex root nodule collected in Sardinia, Italy in 1981. WSM419 was manufactured commercially in Australia as an inoculant for annual medics during 1985 to 1993 due to its nitrogen fixation, saprophytic competence and acid tolerance properties. Here we describe the basic features of this organism, together with the complete genome sequence, and annotation. This is the first report of a complete genome sequence for a microsymbiont of the group of annual medic species adapted to acid soils. We reveal that its genome size is 6,817,576 bp encoding 6,518 protein-coding genes and 81 RNA only encoding genes. The genome contains a chromosome of size 3,781,904 bp and 3 plasmids of size 1,570,951 bp, 1,245,408 bp and 219,313 bp. The smallest plasmid is a feature unique to this medic microsymbiont.Entities:
Keywords: Alphaproteobacteria; Gram-negative rod; aerobic; microsymbiont; nitrogen fixation; non-pathogenic; root-nodule bacteria
Year: 2010 PMID: 21304680 PMCID: PMC3035259 DOI: 10.4056/sigs.43526
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of E. medicae WSM419 according to the MIGS recommendations [16].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| strain WSM419 | |||
| Gram stain | negative | TAS [ | |
| Cell shape | rod | TAS [ | |
| Motility | motile | TAS [ | |
| Sporulation | non-sporulating | TAS [ | |
| Temperature range | mesophile | TAS [ | |
| Optimum temperature | 28°C | TAS [ | |
| Salinity | unknown | ||
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| Carbon source | galactose, arabinose, glutamate | TAS [ | |
| Energy source | chemoorganotroph | TAS [ | |
| MIGS-6 | Habitat | Soil, root nodule, host | TAS [ |
| MIGS-15 | Biotic relationship | Free living or symbiotic | TAS [ |
| MIGS-14 | Pathogenicity | none | TAS [ |
| Biosafety level | 1 | TAS [ | |
| Isolation | | TAS [ | |
| MIGS-4 | Geographic location | Forestry Station 7 km south | TAS [ |
| MIGS-5 | Nodule collection date | May 1st, 1981 | TAS [ |
| MIGS-4.1 | Longitude | 9.101915 | TAS [ |
| MIGS-4.3 | Depth | <10 cm | TAS [ |
| MIGS-4.4 | Altitude | 350m | TAS [ |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32]. If the evidence code is IDA, then the property was directly observed for a living isolate by one of the authors or an expert mentioned in the acknowledgements.
Figure 1Unique colony morphology (Left) and scanning (Center) and transmission (Right) electron micrographs of E. medicae strain WSM419.
Figure 2Phylogenetic tree showing the relationships of E. medicae strain WSM419 to type strains in the Rhizobiaceae based on aligned sequences of the 16S rRNA gene (1,440 bp internal region). All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 3.1 [34]. Kimura two-parameter distances were derived from the aligned sequences [35] and a bootstrap analysis [36] as performed with 500 replicates in order to construct a consensus unrooted tree using the neighbor-joining method [37] for each gene alignment separately. Genera in this tree include Bradyrhizobium (B); Mesorhizobium (M); Rhizobium (R); Ensifer (Sinorhizobium) (S). Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [31] are in bold red print. Published genomes are designated with an asterisk.
Genome sequencing project information of E. medicae WSM419.
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Four Sanger libraries – 3 kb |
| MIGS-29 | Sequencing platforms | ABI3730xl; MegaBACE4500 |
| MIGS-31.2 | Sequencing coverage | ~13× Sanger |
| MIGS-30 | Assemblers | PHRED/PHRAP/CONSED |
| MIGS-32 | Gene calling method | Critica, Generation and Glimmer |
| Genbank ID | CP000738 (Chromosome)a | |
| Genbank Date of Release | June 29, 2007 | |
| GOLD ID | Gc00590e | |
| NCBI project ID | 16304 | |
| Database: IMG | 640753051ff | |
| Project relevance | Symbiotic nitrogen fixation, agriculture |
a http://www.ncbi.nlm.nih.gov/nuccore/150026743
b http://www.ncbi.nlm.nih.gov/nuccore/150030273
c http://www.ncbi.nlm.nih.gov/nuccore/150031715
d http://www.ncbi.nlm.nih.gov/nuccore/150032810
e http://genomesonline.org/GOLD_CARDS/Gc00590.html
f http://img.jgi.doe.gov/cgi-bin/pub/main.cgi?page=taxonDetail&taxon_oid=640753051
Production sequence data for the E. medicae WSM419 genome project (JGI project 4001622).
| | | | |||
|---|---|---|---|---|---|
| BICH | pMCL200 | 5.9 | 37,091 | 36.3 | 25.7 |
| BICG | pUC18c | 2.6 | 33,520 | 36.8 | 26.1 |
| BICI | pCC1Fos | 38.8 | 13,929 | 13.9 | 8.9 |
| FAUT | pTH1522 | 2.1 | 7,376 | 6.4 | 5.4 |
| 91,916 | 93.4 | 66.1 |
Genome Statistics for E. medicae WSM419.
| | ||
|---|---|---|
| Genome size (bp) | 6,817,576 | 100.00 |
| DNA coding region (bp) | 6,001,805 | 88.03 |
| DNA G+C content (bp) | 4,168,935 | 61.15 |
| Number of replicons | 4 | 100.00 |
| Extrachromosomal elements | 3 | 75.00 |
| Total genes | 6,599 | 100.00 |
| RNA genes | 81 | 1.23 |
| rRNA operons | 3 | |
| Protein-coding genes | 6,518 | 98.77 |
| Pseudo genes | 305 | 4.62 |
| Genes with function prediction | 4,646 | 70.40 |
| Genes in paralog clusters | 4,138 | 62.71 |
| Genes assigned to COGs | 4,999 | 75.75 |
| Genes assigned Pfam domains | 5,051 | 76.54 |
| Genes with signal peptides | 2,170 | 32.88 |
| Genes with transmembrane helices | 1,481 | 22.44 |
| CRISPR repeats | 0 |
Figure 3Graphical circular map of the chromosome and plasmids of E. medicae WSM419. From outside to the center: Genes on forward strand (color by COG categories as denoted in the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew. The replicons are not drawn to scale.
Number of protein encoding genes of E. medicae WSM419 associated with the 21 general COG functional categories.
| | | ||
|---|---|---|---|
| J | 182 | 2.79 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | RNA processing and modification |
| K | 501 | 7.69 | Transcription |
| L | 250 | 3.84 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 36 | 0.55 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 56 | 0.86 | Defense mechanisms |
| T | 247 | 3.79 | Signal transduction mechanisms |
| M | 287 | 4.40 | Cell wall/membrane biogenesis |
| N | 66 | 1.01 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 1 | 0.02 | Extracellular structures |
| U | 106 | 1.63 | Intracellular trafficking and secretion |
| O | 178 | 2.73 | Posttranslational modification, protein turnover, chaperones |
| C | 336 | 5.15 | Energy production and conversion |
| G | 582 | 8.93 | Carbohydrate transport and metabolism |
| E | 622 | 9.54 | Amino acid transport and metabolism |
| F | 109 | 1.67 | Nucleotide transport and metabolism |
| H | 196 | 3.01 | Coenzyme transport and metabolism |
| I | 209 | 3.21 | Lipid transport and metabolism |
| P | 296 | 4.54 | Inorganic ion transport and metabolism |
| Q | 159 | 2.44 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 687 | 10.54 | General function prediction only |
| S | 528 | 8.10 | Function unknown |
| - | 1,519 | 23.30 | Not in COGs |