Literature DB >> 26664655

High-quality permanent draft genome sequence of Ensifer medicae strain WSM244, a microsymbiont isolated from Medicago polymorpha growing in alkaline soil.

Julie Ardley1, Rui Tian1, Graham O'Hara1, Rekha Seshadri2, T B K Reddy2, Amrita Pati2, Tanja Woyke2, Victor Markowitz3, Natalia Ivanova2, Nikos Kyrpides4, John Howieson1, Wayne Reeve1.   

Abstract

Ensifer medicae WSM244 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of Medicago species. WSM244 was isolated in 1979 from a nodule recovered from the roots of the annual Medicago polymorpha L. growing in alkaline soil (pH 8.0) in Tel Afer, Iraq. WSM244 is the only acid-sensitive E. medicae strain that has been sequenced to date. It is effective at fixing nitrogen with M. polymorpha L., as well as with more alkaline-adapted Medicago spp. such as M. littoralis Loisel., M. scutellata (L.) Mill., M. tornata (L.) Mill. and M. truncatula Gaertn. This strain is also effective with the perennial M. sativa L. Here we describe the features of E. medicae WSM244, together with genome sequence information and its annotation. The 6,650,282 bp high-quality permanent draft genome is arranged into 91 scaffolds of 91 contigs containing 6,427 protein-coding genes and 68 RNA-only encoding genes, and is one of the rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal.

Entities:  

Keywords:  Alphaproteobacteria; Ensifer; GEBA-RNB; Nitrogen fixation; Root-nodule bacteria; Symbiosis

Year:  2015        PMID: 26664655      PMCID: PMC4674904          DOI: 10.1186/s40793-015-0119-5

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Root nodule bacteria that fix atmospheric nitrogen in association with annual and perennial pasture legumes have important roles in agriculture. Some of the most important associations in temperate and Mediterranean regions are the (1) - Medicago symbioses that produce nutritious feed for animals. is a genus within tribe Trifolieae, which is included in the “temperate herbaceous papilionoid” Inverted Repeat Lacking Clade (IRLC) legumes [1, 2]. Species of Medicago are amongst the most extensively grown forage and pasture plants and have been cultivated ever since L. (alfalfa, or lucerne) was first domesticated in the Near East and/or Central Asia in about 5000 BC. In addition to perennial M. sativa L., annual medic species used widely in agriculture include M. tornata (L.) Mill. (disc medic), the model legume M. truncatula Gaertn. (barrel medic) and M. littoralis Loisel. (strand medic), together with more recently commercialised species such as M. polymorpha L. (burr medic) and M. murex Willd. (murex medic) [3]. Medicago spp. are symbiotically specific: nearly all studied species are nodulated by strains of rhizobia belonging to either or the closely related species [4, 5]. can be distinguished from by its ability to nodulate and fix nitrogen with M. polymorpha L. [5]. WSM244 was isolated in 1979 from a root nodule of M. polymorpha L. growing on alkaline soil (pH 8.0) near Tel Afer, Iraq [6]. This strain was superior in N2-fixation on a range of medics (M sativa L., M truncatula Gaertn., M. tornata L., M. polymorpha L., M. littoralis Loisel., M scutellata (L.) Mill.) in glasshouse tests in Australia and field trials in Iraq in 1980, and was recommended for development as an inoculant in Iraq (D. Chatel, pers com.). WSM244 has also been used in trials aimed at developing acid-tolerant inoculant strains for pasture medics, as the acid-sensitive nature of the microsymbiont is a constraint to the growth and persistence of Medicago spp. in agricultural regions with moderately acidic soils [7]. When field tested in an acidic soil (pH 5.0 CaCl2) in Western Australia, WSM244 survived at the site of inoculation for two years, but unlike several more acid tolerant strains it did not demonstrate saprophytic competence and was unable to colonize the soil [8]. This characteristic of WSM244 as an acid-soil sensitive strain correlates with its acid sensitive profile for growth in laboratory media and an inability to maintain a neutral intracellular pH when exposed to pH 6.0 or less [9]. This is in contrast to other strains, which typically are the dominant microsymbiont partners of annual medics growing on acid soils, in contrast to the more acid-sensitive , which preferentially associates with alkaline-soil-adapted Medicago spp. [10]. The pH response phenotype of WSM244 is in marked contrast to the sequenced acid tolerant strain WSM419 [11]. Sequencing the genome of WSM244 and comparing its attributes with an acid-tolerant strain such as WSM419 would provide a means of establishing the molecular determinants required for adaptation to acid soils. This strain was therefore selected as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) sequencing project [12]. Here we present a summary classification and a set of general features for strain WSM244, together with a description of its genome sequence and annotation.

Organism information

Classification and features

WSM244 is a motile, Gram-negative rod (Fig. 1 Left and Center) in the order of the class . It is fast growing, forming colonies within 3–4 days when grown on half strength Lupin Agar [13], tryptone-yeast extract agar [14] or a modified yeast-mannitol agar [15] at 28 °C. Colonies on ½LA are white-opaque, slightly domed and moderately mucoid with smooth margins (Fig. 1 Right).
Fig. 1

Images of Ensifer medicae WSM244 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right)

Images of Ensifer medicae WSM244 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right) Figure 2 shows the phylogenetic relationship of WSM244 in a 16S rRNA sequence based tree. This strain is the most phylogenetically related to WSM419 and LMG 6133 based on the 16S rRNA gene alignment, with sequence identities of 100 % and 99.71 %, respectively, as determined using the EzTaxon-e database, which contains the sequences of validly published type strains [16]. Minimum Information about the Genome Sequence for WSM244 is provided in Table 1 and Additional file 1: Table S1.
Fig. 2

Phylogenetic tree showing the relationship of Ensifer medicae WSM244 (shown in bold blue print) to other type and non-type strains in the Ensifer genus and to other root nodule bacteria species in the order Rhizobiales, based on aligned sequences of the 16S rRNA gene (1,283 bp internal region). (The species name “Sinorhizobium chiapanecum” has not been validly published.) Azorhizobium caulinodans ORS 571T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 6 [37]. The tree was built using the Maximum-Likelihood method with the General Time Reversible model [38]. Bootstrap analysis [39] with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [17] are in bold font and the GOLD ID is provided after the GenBank accession number. Finished genomes are indicated with an asterisk

Table 1

Classification and general features of Ensifer medicae WSM244 in accordance with the MIGS recommendations [40] published by the Genome Standards Consortium [41]

MIGS IDPropertyTermEvidence codea
ClassificationDomain BacteriaTAS [42]
Phylum Proteobacteria TAS [43, 44]
Class Alphaproteobacteria TAS [45, 46]
Order Rhizobiales TAS [47]
Family Rhizobiaceae TAS [48]
Genus Ensifer TAS [4951]
Species Ensifer medicae TAS [5]
Strain: WSM244TAS [6]
Gram stainNegativeIDA
Cell shapeRodIDA
MotilityMotileIDA
SporulationNon-sporulatingNAS
Temperature range10–40 °CIDA
Optimum temperature25–30 °CIDA
pH range; Optimum6–10; 6.5–8TAS [9]
Carbon sourceArabinose, galactose, mannitol, tryptoneTAS [9]
MIGS-6HabitatSoil; root nodule on host (Medicago polymorpha L.)TAS [8]
MIGS-6.3Salinity0.89–2.0 % (w/v)NAS
MIGS-22Oxygen requirementAerobicTAS [8]
MIGS-15Biotic relationshipFree living, symbioticTAS [8]
MIGS-14PathogenicityBiosafety level 1TAS [52]
MIGS-4Geographic locationTel Afer, IraqTAS [6]
MIGS-5Sample collection1979TAS [6]
MIGS-4.1Latitude36.3833TAS [6]
MIGS-4.2Longitude42.4500TAS [6]
MIGS-4.3Depth0–10 cmNAS
MIGS-4.4Altitude400 mTAS [6]

aEvidence codes—IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [53] (http://geneontology.org/page/guide-go-evidence-codes)

Phylogenetic tree showing the relationship of Ensifer medicae WSM244 (shown in bold blue print) to other type and non-type strains in the Ensifer genus and to other root nodule bacteria species in the order Rhizobiales, based on aligned sequences of the 16S rRNA gene (1,283 bp internal region). (The species name “Sinorhizobium chiapanecum” has not been validly published.) Azorhizobium caulinodans ORS 571T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 6 [37]. The tree was built using the Maximum-Likelihood method with the General Time Reversible model [38]. Bootstrap analysis [39] with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [17] are in bold font and the GOLD ID is provided after the GenBank accession number. Finished genomes are indicated with an asterisk Classification and general features of Ensifer medicae WSM244 in accordance with the MIGS recommendations [40] published by the Genome Standards Consortium [41] aEvidence codes—IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [53] (http://geneontology.org/page/guide-go-evidence-codes)

Symbiotaxonomy

WSM244 nodulates and is effective for nitrogen fixation with M. littoralis Loisel., M sativa L., M. tornata (L.) Mill. [3], M. murex Willd., M. polymorpha L., M truncatula Gaertn. [8] and M scutellata (L.) Mill. (D. Chatel per com). WSM244 nodulates and is partially effective for nitrogen fixation with M. rotata Boiss. and M. rugosa Desr., but does not nodulate M. blancheana Boiss. (D. Chatel per com). The symbiotic characteristics of WSM244 on a range of selected hosts are summarised in Additional file 2: Table S2.

Genome sequencing information

Genome project history

This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Genomic Encyclopedia of Bacteria and Archaea, The Root Nodulating Bacteria chapter project at the U.S. Department of Energy, Joint Genome Institute. The genome project is deposited in the Genomes OnLine Database [17] and a high-quality permanent draft genome sequence is deposited in IMG [18]. Sequencing, finishing and annotation were performed by the JGI [19]. A summary of the project information is shown in Table 2.
Table 2

Genome sequencing project information for E. medicae WSM244

MIGS IDPropertyTerm
MIGS-31Finishing qualityHigh-quality draft
MIGS-28Libraries usedIllumina Standard shotgun library
MIGS-29Sequencing platformsIllumina HiSeq 2000
MIGS-31.2Fold coverage677x Illumina
MIGS-30AssemblersVelvet version 1.1.04; ALLPATHS v. r41043
MIGS-32Gene calling methodsProdigal 1.4
Locus TagA3C7 (http://www.ncbi.nlm.nih.gov/bioproject/?term=A3C7)
Genbank IDATTR00000000
Genbank Date of ReleaseJuly 9 2013
GOLD IDGp0010265 (https://gold.jgi-psf.org/project?id=10265)
BIOPROJECT882
MIGS-13Source Material IdentifierWSM244
Project relevanceSymbiotic N2 fixation, agriculture
Genome sequencing project information for E. medicae WSM244

Growth conditions and genomic DNA preparation

WSM244 was grown on TY solid medium [14] for three days, then a single colony was selected and used to inoculate 5 ml TY broth medium. The culture was grown for 48 h on a gyratory shaker (200 rpm) at 28 °C. Subsequently 1 ml was used to inoculate 60 ml TY broth medium and grown on a gyratory shaker (200 rpm) at 28 °C until OD 0.6 was reached. DNA was isolated from 60 ml of cells using a CTAB bacterial genomic DNA isolation method (http://jgi.doe.gov/collaborate-with-jgi/pmo-overview/protocols-sample-preparation-information/). Final concentration of the DNA was 0.5 mg ml−1.

Genome sequencing and assembly

The draft genome of WSM244 was generated at the DOE Joint genome Institute (JGI) using the Illumina technology [20]. An Illumina Std shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 22,576,268 reads totaling 3,386.4 Mbp. All general aspects of library construction and sequencing performed at the JGI can be found at the JGI website. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts ((Mingkun L, Copeland A, Han J. unpublished) . The following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet (version 1.1.04) [21], (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim (https://github.com/lh3/wgsim), (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r41043) [22]. Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: −very clean yes –export- Filtered yes –min contig lgth 500 –scaffolding no –cov cutoff 10) 2) wgsim (−e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs: PHRED 64 = 1 PLOIDY = 1 FRAG COVERAGE = 125 JUMP COVERAGE = 25 LONG JUMP COV = 50, RunAllpathsLG: THREADS = 8 RUN = std shredpairs TARGETS = standard VAPI WARN ONLY = True OVERWRITE = True) . The final draft assembly contained 91 contigs in 91 scaffolds. The total size of the genome is 6.7 Mbp and the final assembly is based on 789.1 Mbp of Illumina data, which provides an average 118.7x coverage of the genome.

Genome annotation

Genes were identified using Prodigal [23], as part of the DOE-JGI genome annotation pipeline [24, 25]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information nonredundant database, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool [26] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [27]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [28]. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes (IMG) platform [29] developed by the Joint Genome Institute, Walnut Creek, CA, USA [30].

Genome properties

The genome is 6,650,282 nucleotides with 61.21 % GC content (Table 3) and comprised of 91 scaffolds of 91 contigs. From a total of 6,495 genes, 6,427 were protein encoding and 68 RNA only encoding genes. The majority of protein-coding genes (79.34 %) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COGs functional categories is presented in Table 4.
Table 3

Genome statistics for Ensifer medicae WSM244

AttributeValue% of Total
Genome size (bp)6,650,282100.00
DNA coding (bp)5,800,63987.22
DNA G + C (bp)4,070,65961.21
DNA scaffolds91100.00
Total genes6,495100.00
Protein coding genes6,42798.95
RNA genes681.05
Pseudo genes40.06
Genes in internal clusters2,88944.48
Genes with function prediction5,15379.34
Genes assigned to COGs4,56770.32
Genes with Pfam domains5,31181.77
Genes with signal peptides5368.25
Genes with transmembrane helices1,46722.59
CRISPR repeats0-
Table 4

Number of genes of Ensifer medicae WSM244 associated with general COG functional categories

CodeValue% age of total (4,567)Description
J2204.23Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K4498.62Transcription
L1202.30Replication, recombination and repair
B10.02Chromatin structure and dynamics
D290.57Cell cycle control, cell division, chromosome partitioning
Y00.00Nuclear structure
V1001.92Defense mechanisms
T2094.01Signal transduction mechanisms
M2655.09Cell wall/membrane/envelope biogenesis
N711.36Cell motility
Z00.00Cytoskeleton
W240.46Extracellular structures
U681.31Intracellular trafficking, secretion, and vesicular transport
O1873.59Posttranslational modification, protein turnover, chaperones
C3386.49Energy production and conversion
G57411.02Carbohydrate transport and metabolism
E60211.56Amino acid transport and metabolism
F1252.40Nucleotide transport and metabolism
H2254.32Coenzyme transport and metabolism
I2204.23Lipid transport and metabolism
P2865.49Inorganic ion transport and metabolism
Q1603.07Secondary metabolite biosynthesis, transport and catabolism
R55210.60General function prediction only
S3236.20Function unknown
X470.90Mobilome: prophages, transposons
-192829.68Not in COGS
Genome statistics for Ensifer medicae WSM244 Number of genes of Ensifer medicae WSM244 associated with general COG functional categories

Insights from the genome sequence

WSM244 is one of six strains of and one of 30 or -nodulating strains that have been sequenced and whose genomes have been deposited in the IMG database. The genome of WSM244 falls within the expected size range of 6.4–7.2 Mbp for . As observed in other genomes, WSM244 possesses a large number of genes assigned to COG functional categories for: transport and metabolism of amino acids (12.15 %), carbohydrates (11.17 %), inorganic ions (5.3 %), lipids (3.91 %) and coenzymes (3.32 %), transcription (8.63 %) and signal transduction (3.66 %). The WSM244 genome contains only four pseudo genes, the numbers of which are highly variable in sequenced strains and can be as high as 485 (WSM4191). All six strains share high ANI values of 99.18–99.67 %, which is consistent with the low levels of genetic diversity found in populations [31]. The six strains share 5,425 core orthologous genes. WSM244 contains 202 unique genes, including those found in clusters encoding a putative polyketide synthase, phage proteins and a sulfonate transport system. Around 72 % of these unique genes encode hypothetical proteins. Strain WSM244 is particularly interesting, as it lacks the acid tolerance of other strains. The genome of this strain does contain orthologs of acid response or acid tolerance genes that were initially discovered in WSM419. These genes include actA (lnt), actP, actR, actS, phrR, lpiA and acvB [32-35]. WSM244 also contains the tcsA-tcrA-fsrR- regulatory gene cluster which is required for the low-pH-activation of lpiA and acvB in WSM419 [36]. This finding is in direct contrast to the absence of fsrR, tcsA and tcrA in the the acid-sensitive strain 1021. This suggests that either there may be differences in pH responsive gene expression in the WSM244 background, or that acid tolerant strains possess other candidate genes that are required for low pH adaptation and have not yet been identified.

Conclusions

WSM244 is of particular interest as it was isolated from M. polymorpha growing in alkaline soil and it lacks the acid tolerance of strains isolated from medics growing in acid Sardinian and Greek soils [9]. WSM244 is the only acid-sensitive strain that has been sequenced to date. Analysis of its sequenced genome and comparison with other sequenced and genomes will yield new insights into the molecular basis of acid tolerance in rhizobia and into the ecology and biogeography of the Ensifer-Medicago symbiosis.
  35 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  ActP controls copper homeostasis in Rhizobium leguminosarum bv. viciae and Sinorhizobium meliloti preventing low pH-induced copper toxicity.

Authors:  Wayne G Reeve; Ravi P Tiwari; Neetin B Kale; Michael J Dilworth; Andrew R Glenn
Journal:  Mol Microbiol       Date:  2002-02       Impact factor: 3.501

3.  Solexa Ltd.

Authors:  Simon Bennett
Journal:  Pharmacogenomics       Date:  2004-06       Impact factor: 2.533

4.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

5.  List of new names and new combinations previously effectively, but not validly, published.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2006-01       Impact factor: 2.747

6.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

7.  Maintenance of Intracellular pH and Acid Tolerance in Rhizobium meliloti.

Authors:  Graham W O'hara; Thomas J Goss; Michael J Dilworth; Andrew R Glenn
Journal:  Appl Environ Microbiol       Date:  1989-08       Impact factor: 4.792

8.  The Sinorhizobium medicae WSM419 lpiA gene is transcriptionally activated by FsrR and required to enhance survival in lethal acid conditions.

Authors:  Wayne G Reeve; Lambert Bräu; Joanne Castelli; Giovanni Garau; Christian Sohlenkamp; Otto Geiger; Michael J Dilworth; Andrew R Glenn; John G Howieson; Ravi P Tiwari
Journal:  Microbiology       Date:  2006-10       Impact factor: 2.777

9.  Medicago-Sinorhizobium symbiotic specificity evolution and the geographic expansion of Medicago.

Authors:  G Béna; A Lyet; T Huguet; I Olivieri
Journal:  J Evol Biol       Date:  2005-11       Impact factor: 2.411

10.  SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB.

Authors:  Elmar Pruesse; Christian Quast; Katrin Knittel; Bernhard M Fuchs; Wolfgang Ludwig; Jörg Peplies; Frank Oliver Glöckner
Journal:  Nucleic Acids Res       Date:  2007-10-18       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.