Literature DB >> 24976888

Genome sequence of Ensifer meliloti strain WSM1022; a highly effective microsymbiont of the model legume Medicago truncatula A17.

Jason Terpolilli1, Yvette Hill1, Rui Tian1, John Howieson1, Lambert Bräu2, Lynne Goodwin3, James Han4, Konstantinos Liolios4, Marcel Huntemann4, Amrita Pati5, Tanja Woyke4, Konstantinos Mavromatis5, Victor Markowitz5, Natalia Ivanova3, Nikos Kyrpides3, Wayne Reeve1.   

Abstract

Ensifer meliloti WSM1022 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of Medicago. WSM1022 was isolated in 1987 from a nodule recovered from the roots of the annual Medicago orbicularis growing on the Cyclades Island of Naxos in Greece. WSM1022 is highly effective at fixing nitrogen with M. truncatula and other annual species such as M. tornata and M. littoralis and is also highly effective with the perennial M. sativa (alfalfa or lucerne). In common with other characterized E. meliloti strains, WSM1022 will nodulate but fixes poorly with M. polymorpha and M. sphaerocarpos and does not nodulate M. murex. Here we describe the features of E. meliloti WSM1022, together with genome sequence information and its annotation. The 6,649,661 bp high-quality-draft genome is arranged into 121 scaffolds of 125 contigs containing 6,323 protein-coding genes and 75 RNA-only encoding genes, and is one of 100 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

Entities:  

Keywords:  Alphaproteobacteria; nitrogen fixation; rhizobia; root-nodule bacteria

Year:  2013        PMID: 24976888      PMCID: PMC4062635          DOI: 10.4056/sigs.4608286

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

An available source of nitrogen (N) is essential to life on Earth. Although the atmosphere consists of approximately 80% N, the overwhelming proportion of this is present in the form of dinitrogen (N2) which is biologically inaccessible to the vast majority of higher organisms. Only a subset of microbes has the necessary molecular machinery to make atmospheric N2 bioavailable by enzymatically reducing N2 to NH3. The fact that plant growth is most commonly limited by the availability of N may have provided the selective pressure for a wide range of plant genera, most of which are legumes, to evolve a symbiotic relationship with these N2-fixing microbes. These microsymbionts, collectively termed root nodule bacteria, receive a carbon source from the plant and in return supply the host with biologically fixed N. When these symbiotic interactions are optimally harnessed in agriculture, all the N-requirements of the host can be met, without the need to apply industrially synthesized N-based fertilizers, thereby increasing both the economic and environmental sustainability of the farming system [1]. Forage and fodder legumes play an integral role in sustainable farming practice, providing feed for stock while also enriching soil with bioavailable N. Worldwide, there are approximately 110 million ha of forage and fodder legumes under production [2], of which members of the Medicago genus comprise a considerable component. Two bacterial species, and are known to nodulate and fix N2 with Medicago spp. [3], although they differ in their symbiotic properties on some Medicago hosts. Specifically, while can nodulate and fix N2 with M. murex, M. arabica and M. polymorpha, does not nodulate M. murex, does not fix with M. polymorpha and fixes N2 very poorly with M. arabica [4-6]. strain WSM1022 was isolated in 1987 from a nodule collected from the annual M. orbicularis growing on the Cyclades Island of Naxos in Greece. WSM1022 is a highly effective microsymbiont of Medicago, forming efficient N2-fixing associations with the annual species M. littoralis and M. tornata [7]. In common with WSM419 [8], WSM1022 also fixes approximately twice as much N2 as 1021 on the model legume M. truncatula A17 [7]. However, unlike WSM419, WSM1022 is also highly effective with the perennial M. sativa (alfalfa or lucerne) [7]. Therefore, WSM1022 is a broadly effective microsymbiont of Medicago spp. and as such represents a unique tool for the molecular analysis of effective N2 fixation with fully sequenced macro-and microsymbionts. Here we present a summary classification and a set of general features for strain WSM1022 together with a description of its genome sequence and annotation.

Classification and features

WSM1022 is a motile, Gram-negative rod (Figure 1 Left and Center) in the order of the class . It is fast growing, forming colonies within 3-4 days when grown on half strength Lupin Agar (½LA) [9], tryptone-yeast extract agar (TY) [10] or a modified yeast-mannitol agar (YMA) [11] at 28°C. Colonies on ½LA are white-opaque, slightly domed and moderately mucoid with smooth margins (Figure 1Right).
Figure 1

Images of WSM1022 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on a solid medium (Right).

Images of WSM1022 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on a solid medium (Right). Minimum Information about the Genome Sequence (MIGS) is provided in Table 1. Figure 2 shows the phylogenetic neighborhood of WSM1022 in a 16S rRNA sequence based tree. This strain shares 99.92% and 99.61% sequence identity (over 1290 bp) to the 16S rRNA of the fully sequenced 1021 [29] and WSM419 [8] strains, respectively.
Table 1

Classification and general features of 12]

MIGS ID     Property      Term     Evidence code
     Current classification      Domain Bacteria     TAS [13]
      Phylum Proteobacteria     TAS [14]
      Class Alphaproteobacteria     TAS [15,16]
      Order Rhizobiales     TAS [16,17]
      Family Rhizobiaceae     TAS [18,19]
      Genus Ensifer     TAS [20-22]
      Species Ensifer meliloti     TAS [21]
     Gram stain      Negative     IDA
     Cell shape      Rod     IDA
     Motility      Motile     IDA
     Sporulation      Non-sporulating     NAS
     Temperature range      Mesophile     NAS
     Optimum temperature      28°C     NAS
     Salinity      Non-halophile     NAS
MIGS-22     Oxygen requirement      Aerobic     TAS [7]
     Carbon source      Varied     NAS
     Energy source      Chemoorganotroph     NAS
MIGS-6     Habitat      Soil, root nodule, on host     TAS [7]
MIGS-15     Biotic relationship      Free living, symbiotic     TAS [7]
MIGS-14     Pathogenicity      Non-pathogenic     NAS
     Biosafety level      1     TAS [23]
     Isolation      Root nodule     TAS [11]
MIGS-4     Geographic location      Naxos, Greece     TAS [11]
MIGS-5     Soil collection date      28 April 1987     IDA
MIGS-4.1     Longitude      37.107772     IDA
MIGS-4.2     Latitude      25.387841
MIGS-4.3     Depth      0-10cm
MIGS-4.4     Altitude      Not recorded

Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [24].

Figure 2

Phylogenetic tree showing the relationship of WSM1022 (shown in bold print) to other spp. in the order based on aligned sequences of the 16S rRNA gene (1,290 bp internal region). All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5 [25]. The tree was built using the Maximum-Likelihood method with the General Time Reversible model [26]. Bootstrap analysis [27] with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Brackets after the strain name contain a DNA database accession number and/or a GOLD ID (beginning with the prefix G) for a sequencing project registered in GOLD [28]. Published genomes are indicated with an asterisk.

Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [24]. Phylogenetic tree showing the relationship of WSM1022 (shown in bold print) to other spp. in the order based on aligned sequences of the 16S rRNA gene (1,290 bp internal region). All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5 [25]. The tree was built using the Maximum-Likelihood method with the General Time Reversible model [26]. Bootstrap analysis [27] with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Brackets after the strain name contain a DNA database accession number and/or a GOLD ID (beginning with the prefix G) for a sequencing project registered in GOLD [28]. Published genomes are indicated with an asterisk.

Symbiotaxonomy

strain WSM1022 was isolated in 1987 from a nodule collected from the annual M. orbicularis growing on the Cyclades Island of Naxos in Greece. The site of collection was a gentle slope and the soil a sandy-loam texture of pH 7.5-8.0. forms nodules (Nod+) and fixes N2 (Fix+) on a range of annual Medicago spp. as well as the perennial M. sativa (Table 2). In common with other characterized strains, WSM1022 does not nodulate M. murex, does not fix N2 with M. polymorpha and M. arabica [4,5] and is a poorly effective microsymbiont of M. sphaerocarpos [11]. However, WSM1022 is broadly effective with the alkaline soil-adapted annuals M. littoralis and M. tornata as well as the widely grown perennial forage legume M. sativa. In addition, WSM1022 is also a highly effective microsymbiont for the model legume M. truncatula A17.
Table 2

Nodulation and N2 fixation properties of WSM1022 on selected Medicago spp. Data compiled from [7,11]†

Species Name     Cultivar or Accession     Growth     Habit     Nodulation     N2 fixation     Comment
M. truncatula     A17     Annual     Nod+     Fix+     Highly effective
M. truncatula     Jemalong     Annual     Nod+     Fix+     Highly effective
M. truncatula     Caliph     Annual     Nod+     Fix+     Highly effective
M. littoralis     Harbinger     Annual     Nod+     Fix+     Highly effective
M. tornata     Tornafield     Annual     Nod+     Fix+     Highly effective
M. sphaerocarpos     Orion     Annual     Nod+     Fix+     Poorly effective
M. arabica     SA36043     Annual     Nod+     Fix-     No fixation
M. polymorpha     Santiago     Annual     Nod+     Fix-     No fixation
M. murex     Zodiac     Annual     Nod-     Fix-     No nodulation
M. sativa     Sceptre     Perennial     Nod+     Fix+     Highly effective

†Note that ‘+’ and ‘-’ denote presence or absence, respectively, of nodulation (Nod) or N2 fixation (Fix).

Note that ‘+’ and ‘-’ denote presence or absence, respectively, of nodulation (Nod) or N2 fixation (Fix).

Genome sequencing and annotation

Genome project history

This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Community Sequencing Program at the U.S. Department of Energy, Joint Genome Institute (JGI) for projects of relevance to agency missions. The genome project is deposited in the Genomes OnLine Database [28] and an improved-high-quality-draft genome sequence in IMG. Sequencing, finishing and annotation were performed by the JGI. A summary of the project information is shown in Table 3.
Table 3

Genome sequencing project information for WSM1022.

MIGS ID    Property    Term
MIGS-31    Finishing quality    Improved high-quality draft
MIGS-28    Libraries used    1× Illumina library
MIGS-29    Sequencing platforms    Illumina HiSeq 2000
MIGS-31.2    Sequencing coverage    Illumina: 275×
MIGS-30    Assemblers    Velvet version 1.1.04; Allpaths-LG version r42328
MIGS-32    Gene calling methods    Prodigal 1.4, GenePRIMP
    GOLD ID    Gi08916
    NCBI project ID    78233
    Database: IMG    2510065057
    Project relevance    Symbiotic N2 fixation, agriculture

Growth conditions and DNA isolation

WSM1022 was cultured to mid logarithmic phase in 60 ml of TY rich medium [30] on a gyratory shaker at 28°C. DNA was isolated from the cells using a CTAB (Cetyl trimethyl ammonium bromide) bacterial genomic DNA isolation method [31].

Genome sequencing and assembly

The genome of WSM1022 was sequenced at the Joint Genome Institute (JGI) using Illumina technology [32]. An Illumina standard shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 12,082,430 reads totaling 1812.4 Mbp. All general aspects of library construction and sequencing performed at the JGI can be found at the JGI website [31]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun, L., Copeland, A. and Han, J., unpublished). The following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet [33] (version 1.1.04), (2) 1–3 kb simulated paired end reads were created from Velvet contigs using wgsim (https://github.com/lh3/wgsim), (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG [34] (version r42328). Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: –veryclean yes –exportFiltered yes –mincontiglgth 500 –scaffolding no–covcutoff 10) 2) wgsim (–e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs:PHRED64=1 PLOIDY=1 FRAGCOVERAGE=125 JUMPCOVERAGE=25 LONGJUMPCOV=50, RunAllpath-sLG: THREADS=8 RUN=stdshredpairs TARGETS=standard VAPIWARNONLY=True OVERWRITE=True). The final draft assembly contained 125 contigs in 121 scaffolds. The total size of the genome is 6.6 Mb and the final assembly is based on 1,812.4 Mbp of Illumina data, which provides an average 275× coverage of the genome.

Genome annotation

Genes were identified using Prodigal [35] as part of the DOE-JGI annotation pipeline [36]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) nonredundant database, UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and InterPro databases. The tRNAScanSE tool [37] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [38]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL (http://infernal.janelia.org). Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes (IMG-ER) platform [39].

Genome properties

The genome is 6,649,661 nucleotides with 62.16% GC content (Table 4) and comprised of 121 scaffolds (Figure 3) of 125 contigs. From a total of 6,398 genes, 6,323 were protein encoding and 75 RNA only encoding genes. The majority of genes (80.78%) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COGs functional categories is presented in Table 5.
Table 4

Genome Statistics for WSM1022

Attribute    Value    % of Total
Genome size (bp)    6,649,661    100.00
DNA coding region (bp)    5,733,017    86.22
DNA G+C content (bp)    4,133,661    62.16
Number of scaffolds    121
Number of contigs    125
Total gene    6,398    100.00
RNA genes    75    1.17
rRNA operons    1    0.02
Protein-coding genes    6,323    98.83
Genes with function prediction    5,168    80.78
Genes assigned to COGs    5,147    80.45
Genes assigned Pfam domains    5,331    83.32
Genes with signal peptides    563    8.80
Genes with transmembrane helices    1,437    22.93
CRISPR repeats    0
Figure 3

Graphical map of the genome of WSM1022 showing the seven largest scaffolds. From bottom to the top of each scaffold: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew.

Table 5

Number of protein coding genes of WSM1022 associated with the general COG functional categories.

Code     Value      % age      COG Category
J     194      3.38      Translation, ribosomal structure and biogenesis
A     0      0.00      RNA processing and modification
K     497      8.65      Transcription
L     196      3.41      Replication, recombination and repair
B     1      0.02      Chromatin structure and dynamics
D     38      0.66      Cell cycle control, mitosis and meiosis
Y     0      0.00      Nuclear structure
V     61      1.06      Defence mechanisms
T     235      4.09      Signal transduction mechanisms
M     301      5.24      Cell wall/membrane biogenesis
N     71      1.24      Cell motility
Z     0      0.00      Cytoskeleton
W     1      0.02      Extracellular structures
U     113      1.97      Intracellular trafficking and secretion
O     177      3.08      Posttranslational modification, protein turnover, chaperones
C     357      6.21      Energy production conversion
G     606      10.54      Carbohydrate transport and metabolism
E     623      10.84      Amino acid transport metabolism
F     109      1.90      Nucleotide transport and metabolism
H     200      3.48      Coenzyme transport and metabolism
I     207      3.60      Lipid transport and metabolism
P     312      5.43      Inorganic ion transport and metabolism
Q     158      2.75      Secondary metabolite biosynthesis, transport and catabolism
R     708      12.32      General function prediction only
S     583      10.14      Function unknown
-     1,251      19.55      Not in COGS
Total     5,748      -      -
Graphical map of the genome of WSM1022 showing the seven largest scaffolds. From bottom to the top of each scaffold: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew.
  22 in total

1.  Solexa Ltd.

Authors:  Simon Bennett
Journal:  Pharmacogenomics       Date:  2004-06       Impact factor: 2.533

2.  High-quality draft assemblies of mammalian genomes from massively parallel sequence data.

Authors:  Sante Gnerre; Iain Maccallum; Dariusz Przybylski; Filipe J Ribeiro; Joshua N Burton; Bruce J Walker; Ted Sharpe; Giles Hall; Terrance P Shea; Sean Sykes; Aaron M Berlin; Daniel Aird; Maura Costello; Riza Daza; Louise Williams; Robert Nicol; Andreas Gnirke; Chad Nusbaum; Eric S Lander; David B Jaffe
Journal:  Proc Natl Acad Sci U S A       Date:  2010-12-27       Impact factor: 11.205

3.  List of new names and new combinations previously effectively, but not validly, published.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2006-01       Impact factor: 2.747

4.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

5.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

6.  Constructs for insertional mutagenesis, transcriptional signal localization and gene regulation studies in root nodule and other bacteria.

Authors:  Wayne G Reeve; Ravi P Tiwari; Penelope S Worsley; Michael J Dilworth; Andrew R Glenn; John G Howieson
Journal:  Microbiology       Date:  1999-06       Impact factor: 2.777

7.  Rapid identification of Medicago nodulating strains by using two oligonucleotide probes complementary to 16S rDNA sequences.

Authors:  S Rome; J C Cleyet-Marel; L A Materon; P Normand; B Brunel
Journal:  Can J Microbiol       Date:  1997-09       Impact factor: 2.419

8.  Prodigal: prokaryotic gene recognition and translation initiation site identification.

Authors:  Doug Hyatt; Gwo-Liang Chen; Philip F Locascio; Miriam L Land; Frank W Larimer; Loren J Hauser
Journal:  BMC Bioinformatics       Date:  2010-03-08       Impact factor: 3.169

9.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

10.  The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata.

Authors:  Konstantinos Liolios; Konstantinos Mavromatis; Nektarios Tavernarakis; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2007-11-02       Impact factor: 16.971

View more
  5 in total

1.  Genome sequence of Ensifer medicae Di28; an effective N2-fixing microsymbiont of Medicago murex and M. polymorpha.

Authors:  Giovanni Garau; Jason Terpolilli; Yvette Hill; Rui Tian; John Howieson; Lambert Bräu; Lynne Goodwin; James Han; Tbk Reddy; Marcel Huntemann; Amrita Pati; Tanja Woyke; Konstantinos Mavromatis; Victor Markowitz; Natalia Ivanova; Nikos Kyrpides; Wayne Reeve
Journal:  Stand Genomic Sci       Date:  2014-12-08

2.  A Chloroplast-Localized Rubredoxin Family Protein Gene from Puccinellia tenuiflora (PutRUB) Increases NaCl and NaHCO₃ Tolerance by Decreasing H₂O₂ Accumulation.

Authors:  Ying Li; Panpan Liu; Tetsuo Takano; Shenkui Liu
Journal:  Int J Mol Sci       Date:  2016-05-30       Impact factor: 5.923

3.  Priority effects alter interaction outcomes in a legume-rhizobium mutualism.

Authors:  Julia A Boyle; Anna K Simonsen; Megan E Frederickson; John R Stinchcombe
Journal:  Proc Biol Sci       Date:  2021-03-10       Impact factor: 5.349

4.  Genome sequence of Ensifer medicae strain WSM1369; an effective microsymbiont of the annual legume Medicago sphaerocarpos.

Authors:  Jason Terpolilli; Giovanni Garau; Yvette Hill; Rui Tian; John Howieson; Lambert Bräu; Lynne Goodwin; James Han; Konstantinos Liolios; Marcel Huntemann; Amrita Pati; Tanja Woyke; Konstantinos Mavromatis; Victor Markowitz; Natalia Ivanova; Nikos Kyrpides; Wayne Reeve
Journal:  Stand Genomic Sci       Date:  2013-12-17

5.  Comparative Genomics across Three Ensifer Species Using a New Complete Genome Sequence of the Medicago Symbiont Sinorhizobium (Ensifer) meliloti WSM1022.

Authors:  Laura Baxter; Proyash Roy; Emma Picot; Jess Watts; Alex Jones; Helen Wilkinson; Patrick Schäfer; Miriam Gifford; Beatriz Lagunas
Journal:  Microorganisms       Date:  2021-11-25
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.