| Literature DB >> 24976897 |
Jason Terpolilli1, Giovanni Garau2, Yvette Hill1, Rui Tian1, John Howieson1, Lambert Bräu3, Lynne Goodwin4, James Han5, Konstantinos Liolios5, Marcel Huntemann5, Amrita Pati5, Tanja Woyke5, Konstantinos Mavromatis6, Victor Markowitz6, Natalia Ivanova5, Nikos Kyrpides5, Wayne Reeve1.
Abstract
Ensifer medicae WSM1369 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of Medicago. WSM1369 was isolated in 1993 from a nodule recovered from the roots of Medicago sphaerocarpos growing at San Pietro di Rudas, near Aggius in Sardinia (Italy). WSM1369 is an effective microsymbiont of the annual forage legumes M. polymorpha and M. sphaerocarpos. Here we describe the features of E. medicae WSM1369, together with genome sequence information and its annotation. The 6,402,557 bp standard draft genome is arranged into 307 scaffolds of 307 contigs containing 6,656 protein-coding genes and 79 RNA-only encoding genes. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.Entities:
Keywords: Alphaproteobacteria; nitrogen fixation; rhizobia; root-nodule bacteria
Year: 2013 PMID: 24976897 PMCID: PMC4062641 DOI: 10.4056/sigs.4838624
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Images of WSM1369 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on half strength lupin agar (Right).
Classification and general features of WSM1369 according to the MIGS recommendations [18]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain WSM1369 | TAS [ | ||
| Gram stain | Negative | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Non-sporulating | NAS | |
| Temperature range | Mesophile | NAS | |
| Optimum temperature | 28°C | IDA | |
| Salinity | Non-halophile | NAS | |
| MIGS-22 | Oxygen requirement | Aerobic | TAS [ |
| Carbon source | Varied | NAS | |
| Energy source | Chemoorganotroph | NAS | |
| MIGS-6 | Habitat | Soil, root nodule, on host | NAS |
| MIGS-15 | Biotic relationship | Free living, symbiotic | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | Root nodule | TAS [ | |
| MIGS-4 | Geographic location | Sardinia, Italy | TAS [ |
| MIGS-5 | Soil collection date | 28 April 1993 | IDA |
| MIGS-4.1 | Longitude | 9.019167 | IDA |
| MIGS-4.3 | Depth | 0-10 cm | IDA |
| MIGS-4.4 | Altitude | Not recorded | IDA |
Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [30].
Figure 2Phylogenetic tree showing the relationship of WSM1369 (shown in bold print) to other spp. in the order based on aligned sequences of the 16S rRNA gene (1,290 bp internal region). All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5 [31]. The tree was built using the Maximum-Likelihood method with the General Time Reversible model [32]. Bootstrap analysis [33] with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Brackets after the strain name contain a DNA database accession number and/or a GOLD ID (beginning with the prefix G) for a sequencing project registered in GOLD [34]. Published genomes are indicated with an asterisk.
Genome sequencing project information for WSM1369
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Standard draft |
| MIGS-28 | Libraries used | One Illumina fragment library |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Sequencing coverage | Illumina: 321× |
| MIGS-30 | Assemblers | Velvet version 1.1.04; Allpaths-LG version r39750 |
| MIGS-32 | Gene calling methods | Prodigal 1.4 |
| GenBank | AQUS00000000 | |
| GenBank release date | August 28, 2013 | |
| GOLD ID | Gi08907 | |
| NCBI project ID | 165337 | |
| Database: IMG | 2513237156 | |
| Project relevance | Symbiotic N2 fixation, agriculture |
Genome Statistics for WSM1369
| | | |
|---|---|---|
| Genome size (bp) | 6,402,557 | 100.00 |
| DNA coding region (bp) | 5,536,774 | 86.48 |
| DNA G+C content (bp) | 3,913,921 | 61.13 |
| Number of scaffolds | 307 | |
| Number of contigs | 307 | |
| Total gene | 6,735 | 100.00 |
| RNA genes | 79 | 1.17 |
| rRNA operons | 1 | 0.01 |
| Protein-coding genes | 6,656 | 98.83 |
| Genes with function prediction | 4,993 | 74.14 |
| Genes assigned to COGs | 4,988 | 74.06 |
| Genes assigned Pfam domains | 5,185 | 76.99 |
| Genes with signal peptides | 508 | 7.54 |
| Genes coding transmembrane proteins | 1,424 | 21.14 |
| CRISPR repeats | 0 |
Figure 3Graphical map of the genome of WSM1369 showing the seven largest scaffolds. From bottom to the top of each scaffold: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew.
Number of protein coding genes of WSM1369 associated with the general COG functional categories.
| Value | | | |
|---|---|---|---|
| J | 193 | 3.48 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | RNA processing and modification |
| K | 486 | 8.77 | Transcription |
| L | 275 | 4.96 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 40 | 0.72 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 54 | 0.97 | Defense mechanisms |
| T | 241 | 4.35 | Signal transduction mechanisms |
| M | 267 | 4.82 | Cell wall/membrane biogenesis |
| N | 77 | 1.39 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 1 | 0.02 | Extracellular structures |
| U | 124 | 2.24 | Intracellular trafficking and secretion |
| O | 184 | 3.32 | Posttranslational modification, protein turnover, chaperones |
| C | 308 | 5.56 | Energy production conversion |
| G | 510 | 9.21 | Carbohydrate transport and metabolism |
| E | 613 | 11.06 | Amino acid transport metabolism |
| F | 108 | 1.95 | Nucleotide transport and metabolism |
| H | 196 | 3.54 | Coenzyme transport and metabolism |
| I | 193 | 3.48 | Lipid transport and metabolism |
| P | 280 | 5.05 | Inorganic ion transport and metabolism |
| Q | 158 | 2.85 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 662 | 11.95 | General function prediction only |
| S | 569 | 10.27 | Function unknown |
| - | 1,747 | 25.94 | Not in COGS |