Literature DB >> 26380038

High-quality permanent draft genome sequence of Ensifer meliloti strain 4H41, an effective salt- and drought-tolerant microsymbiont of Phaseolus vulgaris.

Ridha Mhamdi1, Julie Ardley2, Rui Tian2, Rekha Seshadri3, T B K Reddy3, Amrita Pati3, Tanja Woyke3, Victor Markowitz4, Natalia Ivanova3, Nikos Kyrpides5, Wayne Reeve2.   

Abstract

Ensifer meliloti 4H41 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of common bean (Phaseolus vulgaris). Strain 4H41 was isolated in 2002 from root nodules of P. vulgaris grown in South Tunisia from the oasis of Rjim-Maatoug. Strain 4H41 is salt- and drought-tolerant and highly effective at fixing nitrogen with P. vulgaris. Here we describe the features of E. meliloti 4H41, together with genome sequence information and its annotation. The 6,795,637 bp high-quality permanent draft genome is arranged into 47 scaffolds of 47 contigs containing 6,350 protein-coding genes and 72 RNA-only encoding genes, and is one of the rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal.

Entities:  

Keywords:  Alphaproteobacteria; Ensifer; Nitrogen fixation; Phaseolus vulgaris; Root-nodule bacteria

Year:  2015        PMID: 26380038      PMCID: PMC4571125          DOI: 10.1186/s40793-015-0005-1

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Common bean () represents a very valuable source of proteins for low-income populations in Latin America and Africa [1]. However, this legume is considered to be a poor nitrogen-fixing pulse in comparison to other grain legumes [2]. This problem is generally attributed to the ineffectiveness of the native rhizobia, which is typically linked to the nodulation promiscuity of [3-6] or to adverse abiotic conditions [7,8]. Salinity and drought are considered to be the major abiotic constraints that affect legumes in Tunisia and other countries. The selection of superior strains of rhizobia capable of assuring optimal nitrogen fixation under these adverse conditions is of high interest. Attention has therefore been directed to the isolation and characterization of rhizobial strains from various marginal areas that are subject to adverse climatic and edaphic conditions. In this context, (formerly ‘ ’) strain 4H41 was isolated from root nodules of common bean grown in sandy, slightly alkaline soil from the oasis of Rjim-Maatoug in South Tunisia [9]. is classically considered to be a specific microsymbiont of the genera , Melilotus and [10], however, recent studies have identified strains of that effectively nodulate or several other legume species in northern Africa, South Africa and the Canary Islands [11-14]. Strain 4H41 induced nitrogen-fixing nodules on but failed to nodulate Medicago spp. The phylogenetic analysis of nifH and nodC genes showed that strain 4H41 should be classified in a novel symbiovar (sv. mediterranense) [15]. The symbiovar mediterranense has also been used to describe and Ensifer americanum strains that can nodulate and fix nitrogen with , species of Mexican (now reclassified as Vachellia and Senegalia [16]) and [17]. Strain 4H41 was able to grow in 4.4% NaCl (750 mM), while the commercial inoculant CIAT899 did not grow in salt concentrations higher than 1.8% [9]. Inoculationwater deficiency showed that strain 4H41 was more competitive and more effective than strain experiments under CIAT899 [7]. In field trials, inoculated with strain 4H41 showed a significant increase in nodule number, shoot dry weight and grain yield even in non-irrigated fields. Under these conditions of water deficiency, nodulation by indigenous rhizobia was totally absent. However, when common bean was grown in adequately irrigated soil samples from these fields, numerous nodules could be observed, suggesting that, in contrast to 4H41, the native rhizobia were not tolerant of water deficiency [7]. Because of its effectiveness and high salt tolerance, strain 4H41 is considered to be an elite candidate for inoculant formulation in order to promote cultivation of common bean under salt and drought constraints. This strain has therefore been selected as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) sequencing project [18]. Here we present a summary classification and a set of general features for strain 4H41, together with a description of its genome sequence and annotation.

Organism information

Classification and features

4H41 is a motile, Gram-negative strain in the order of the class . The rod shaped form (Figure 1 Left and Center) has dimensions of approximately 0.25-0.5 μm in width and 0.75-1.0 μm in length. It is fast growing, forming colonies within 3–4 days when grown on half strength Lupin Agar (½LA) [19], tryptone-yeast extract agar (TY) [20] or a modified yeast-mannitol agar (YMA) [21] at 28°C. Colonies on ½LA are white-opaque, slightly domed and moderately mucoid with smooth margins (Figure 1 Right).
Figure 1

Images of Ensifer meliloti 4H41 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right).

Images of Ensifer meliloti 4H41 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right). Figure 2 shows the phylogenetic relationship of 4H41 in a 16S rRNA sequence based tree. This strain is the most similar to LMG 6133 and ORS 1407 based on the 16S rRNA gene alignment with sequence identities of 99.85% and 99.63%, respectively, as determined using the EzTaxon-e server [22]. Minimum Information about the Genome Sequence (MIGS) for 4H41 is provided in Table 1 and Additional file 1: Table S1.
Figure 2

Phylogenetic tree showing the relationship of Ensifer meliloti 4H41 (shown in bold blue print) to Ensifer spp. and other root nodule bacteria species in the order Rhizobiales, based on aligned sequences of the 16S rRNA gene (1,240 bp internal region). (The species name “Sinorhizobium chiapanecum” has not been validly published.) Azorhizobium caulinodans ORS 571T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 6 [45]. The tree was built using the Maximum-Likelihood method with the General Time Reversible model [46]. Bootstrap analysis [47] with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [23] are in bold font and the GOLD ID is provided after the GenBank accession number, where this is available. Finished genomes are indicated with an asterisk.

Table 1

Classification and general features of 4H41 [48,49]

MIGS ID Property Term Evidence code a
ClassificationDomain Bacteria TAS [50]
Phylum Proteobacteria TAS [51,52]
Class Alphaproteobacteria TAS [51,53]
Order Rhizobiales TAS [51,54]
Family Rhizobiaceae TAS [51,55]
Genus Ensifer TAS [56,57]
Species Ensifer meliloti TAS [56]
Strain: 4H41
Gram stainNegativeIDA
Cell shapeRodIDA
MotilityMotileIDA
SporulationNon-sporulatingNAS
Temperature range10-40°CTAS [56]
Optimum temperature28°CTAS [56]
pH range; Optimum5-9.5; 6.5-8TAS [56]
Carbon sourceMannitolTAS [9]
MIGS-6HabitatSoil; root nodule on host (Phaseolus vulgaris)TAS [9]
MIGS-6.3Salinity0.4-4.4% (w/v)TAS [9]
MIGS-22Oxygen requirementAerobicNAS [9]
MIGS-15Biotic relationshipFree living, symbioticTAS [9]
MIGS-14PathogenicityNon-pathogenNAS
MIGS-4Geographic locationRjim Maatoug, TunisiaTAS [9]
MIGS-5Sample collection date2002TAS [9]
MIGS-4.1 MIGS-4.2Longitude7.99TAS [9]
Latitude33.3245TAS [9]
MIGS-4.3Depth0-10 cmNAS
MIGS-4.4Altitude40 mTAS [9]

Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [58,59].

Phylogenetic tree showing the relationship of Ensifer meliloti 4H41 (shown in bold blue print) to Ensifer spp. and other root nodule bacteria species in the order Rhizobiales, based on aligned sequences of the 16S rRNA gene (1,240 bp internal region). (The species name “Sinorhizobium chiapanecum” has not been validly published.) Azorhizobium caulinodans ORS 571T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 6 [45]. The tree was built using the Maximum-Likelihood method with the General Time Reversible model [46]. Bootstrap analysis [47] with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [23] are in bold font and the GOLD ID is provided after the GenBank accession number, where this is available. Finished genomes are indicated with an asterisk. Classification and general features of 4H41 [48,49] Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [58,59].

Symbiotaxonomy

strain 4H41 is highly effective for nitrogen fixation with , but is unable to nodulate several legume species that have previously been identified as hosts [14]. The symbiotic characteristics of strain 4H41 on a range of selected phylogenetically diverse hosts are provided in Table 2.
Table 2

Nodulation and N fixation properties of 4H41 on various hosts

Legume Species Legume Tribe Nod * Fix Comment
Argyrolobium uniflorum GenisteaeNod-Fix-
Genista saharae GenisteaeNod-Fix-
Medicago ciliaris TrifolieaeNod-Fix-
Medicago laciniata TrifolieaeNod-Fix-
Medicago sativa TrifolieaeNod-Fix-
Medicago truncatula TrifolieaeNod-Fix-
Phaseolus vulgaris PhaseoleaeNod+ Fix+ Highly effective
Retama raetam GenisteaeNod-Fix-

‘+’ and ‘-’ denote presence or absence, respectively, of nodulation (Nod) or N2 fixation (Fix).

Nodulation and N fixation properties of 4H41 on various hosts ‘+’ and ‘-’ denote presence or absence, respectively, of nodulation (Nod) or N2 fixation (Fix).

Genome sequencing information

Genome project history

This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Genomic Encyclopedia of Bacteria and Archaea, The Root Nodulating Bacteria chapter (GEBA-RNB) project at the U.S. Department of Energy, Joint Genome Institute (JGI). The genome project is deposited in the Genomes OnLine Database [23] and a high-quality permanent draft genome sequence is deposited in IMG [24]. Sequencing, finishing and annotation were performed by the JGI [25]. A summary of the project information is shown in Table 3.
Table 3

Genome sequencing project information for 4H41

MIGS ID Property Term
MIGS-31Finishing qualityHigh-quality permanent draft
MIGS-28Libraries usedIllumina Standard shotgun library
MIGS-29Sequencing platformsIllumina HiSeq 2000
MIGS-31.2Fold coverage122.2× Illumina
MIGS-30AssemblersVelvet version 1.1.04; Allpaths-LG version r41043
MIGS-32Gene calling methodsProdigal 1.4
Locus TagB075 [60]
GenBank IDAQWP00000000
GenBank Date of ReleaseApr 20 2013
GOLD IDGp0010268 [60]
BIOPROJECT169747
MIGS-13Source Material Identifier4H41, WSM4555
Project relevanceSymbiotic N2 fixation, agriculture
Genome sequencing project information for 4H41

Growth conditions and genomic DNA preparation

4H41 was cultured to mid logarithmic phase in 60 ml of TY rich media [26] on a gyratory shaker at 28°C. DNA was isolated from the cells using a CTAB (Cetyl trimethyl ammonium bromide) bacterial genomic DNA isolation method [27].

Genome sequencing and assembly

The draft genome of 4H41 was generated at the DOE Joint Genome Institute (JGI) using the Illumina technology [28]. An Illumina standard shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 17,481,364 reads totaling 2,622.2 Mbp. All general aspects of library construction and sequencing performed at the JGI can be found on the JGI website [29]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts [30]. The following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet (version 1.1.04) [31], (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim [32], (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r41043) [33]. Parameters for assembly steps were: 1) Velvet (velveth: 63 -shortPaired and velvetg: −very_clean yes –export-Filtered yes –min_contig_lgth 500 –scaffolding no –cov_cutoff 10) 2) wgsim (−e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs: PHRED_64 = 1 PLOIDY = 1 FRAG_COVERAGE = 125 JUMP_COVERAGE = 25 LONG_JUMP_COV = 50, RunAllpathsLG: THREADS = 8 RUN = std_shredpairs TARGETS = standard VAPI_WARN_ONLY = True OVERWRITE = True). The final draft assembly contained 47 contigs in 47 scaffolds. The total size of the genome is 6.8 Mbp and the final assembly is based on 830.5 Mbp of Illumina data, which provides an average 122.2x coverage of the genome.

Genome annotation

Genes were identified using Prodigal [34] as part of the DOE-JGI genome annotation pipeline [35,36]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) nonredundant database, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool [37] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [38]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [39]. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes Expert Review (IMG-ER) [40] developed by the Joint Genome Institute, Walnut Creek, CA, USA.

Genome properties

The genome is 6,795,637 nucleotides with 62.01% GC content (Table 4) and comprised of 47 scaffolds of 47 contigs. From a total of 6,422 genes, 6,350 were protein encoding and 72 RNA only encoding genes. The majority of protein-coding genes (82.01%) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COGs functional categories is presented in Table 5.
Table 4

Genome statistics for 4H41

Attribute Value % of Total
Genome size (bp)6,795,637100.00
DNA coding (bp)5,911,16386.98
DNA G + C (bp)4,213,72962.01
DNA scaffolds47100.00
Total genes6,422100.00
Protein coding genes6,35098.88
RNA genes721.12
Pseudo genes10.02
Genes in biosynthetic clusters3996.21
Genes with function prediction5,26782.01
Genes assigned to COGs4,71573.42
Genes assigned Pfam domains5,43584.63
Genes with signal peptides5538.61
Genes with transmembrane helices1,42622.20
CRISPR repeats0-
Table 5

Number of genes of meliloti 4H41 associated with general COG functional categories

Code Value % of total (5,383) Description
J2194.07Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K4658.64Transcription
L1342.49Replication, recombination and repair
B10.02Chromatin structure and dynamics
D390.72Cell cycle control, cell division, chromosome partitioning
V1071.99Defense mechanisms
T2153.99Signal transduction mechanisms
M2895.37Cell wall/membrane biogenesis
N671.24Cell motility
W300.56Extracellular structures
U831.54Intracellular trafficking and secretion
O2013.73Posttranslational modification, protein turnover, chaperones
C3336.19Energy production and conversion
G59010.96Carbohydrate transport and metabolism
E62511.61Amino acid transport and metabolism
F1122.08Nucleotide transport and metabolism
H2434.51Coenzyme transport and metabolism
I2364.38Lipid transport and metabolism
P2955.48Inorganic ion transport and metabolism
Q1683.12Secondary metabolites biosynthesis, transport and catabolism
R54610.14General function prediction only
S3376.26Function unknown
X480.89Mobilome: prophages, transposons
-1,70726.58Not in COGS
Genome statistics for 4H41 Number of genes of meliloti 4H41 associated with general COG functional categories

Conclusion

Based on the 16S rRNA gene alignment, 4H41 is most closely related to LMG 6133, a Medicago microsymbiont [41] and ORS 1407, which effectively nodulates [42]. In contrast to these two strains, 4H41 is unable to nodulate either of these hosts. Strain 4H41 is one of 27 strains of with sequenced genomes deposited in the IMG database. Of these, 4H41 and strain GVPV12 [12] are the only two strains that have been isolated from, and are able to nodulate and fix nitrogen with, . As the other sequenced strains are microsymbionts of Medicago spp., 4H41 is therefore a useful strain for comparing the molecular determinants of symbiosis in rhizobia with similar chromosomal backgrounds but which nodulate different legume hosts. The genome size of the strains ranges from 6.6 – 8.9 Mbp; at 6.80 Mbp, the 4H41 genome is at the lower end of this range. It contains one pseudo gene, the numbers of which are highly variable in the sequenced genomes and can be up to 444 (AK83). In common with the other genomes, 4H41 possesses a large number of genes assigned to COG functional categories for transport and metabolism of amino acids (12.22%), carbohydrates (11.03%), inorganic ions (5.3%), lipids (3.97%) and coenzymes (3.59%), and involved in transcription (8.78%), and signal transduction (3.58%). Genome analysis has revealed three distinct nodA genes, two coding for NodA proteins composed of 196 amino acids while the third encodes a NodA protein of 141 amino acids that lacks a 55 amino acid segment at the C-terminus. All three nodA copies are harboured within a symbiotic region of the genome and have highest sequence identity at the protein level with the common bean-nodulating strains GR64 [43] and GVPV12. Three distinct nodA genes are also found in the commercial inoculant strains CIAT 899 and sp. PRF 81 [44]. 4H41 is salt- and drought-tolerant and highly effective for nitrogen fixation with , and as such is a valuable inoculant strain. Analysis of its sequenced genome and comparison with the genomes of other sequenced and with RNB that nodulate the common bean will yield new insights into the molecular basis of salt- and drought-tolerance in rhizobia and into the molecular determinants of symbiotic specificity and nitrogen fixation in the important pulse legume .
  36 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  Solexa Ltd.

Authors:  Simon Bennett
Journal:  Pharmacogenomics       Date:  2004-06       Impact factor: 2.533

3.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

4.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

5.  Influence of the size of indigenous rhizobial populations on establishment and symbiotic performance of introduced rhizobia on field-grown legumes.

Authors:  J E Thies; P W Singleton; B B Bohlool
Journal:  Appl Environ Microbiol       Date:  1991-01       Impact factor: 4.792

6.  Salt-tolerant rhizobia isolated from a Tunisian oasis that are highly effective for symbiotic N2-fixation with Phaseolus vulgaris constitute a novel biovar (bv. mediterranense) of Sinorhizobium meliloti.

Authors:  Bacem Mnasri; Moncef Mrabet; Gisèle Laguerre; Mohamed Elarbi Aouani; Ridha Mhamdi
Journal:  Arch Microbiol       Date:  2006-09-22       Impact factor: 2.552

7.  Constructs for insertional mutagenesis, transcriptional signal localization and gene regulation studies in root nodule and other bacteria.

Authors:  Wayne G Reeve; Ravi P Tiwari; Penelope S Worsley; Michael J Dilworth; Andrew R Glenn; John G Howieson
Journal:  Microbiology       Date:  1999-06       Impact factor: 2.777

8.  The genus name Sinorhizobium Chen et al. 1988 is a later synonym of Ensifer Casida 1982 and is not conserved over the latter genus name, and the species name 'Sinorhizobium adhaerens' is not validly published. Opinion 84.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2008-08       Impact factor: 2.747

9.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

10.  SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB.

Authors:  Elmar Pruesse; Christian Quast; Katrin Knittel; Bernhard M Fuchs; Wolfgang Ludwig; Jörg Peplies; Frank Oliver Glöckner
Journal:  Nucleic Acids Res       Date:  2007-10-18       Impact factor: 16.971

View more
  2 in total

1.  Characterization of rhizobia isolates obtained from nodules of wild genotypes of common bean.

Authors:  Aline Assis Cardoso; Michel de Paula Andraus; Tereza Cristina de Oliveira Borba; Claudia Cristina Garcia Martin-Didonet; Enderson Petrônio de Brito Ferreira
Journal:  Braz J Microbiol       Date:  2016-10-04       Impact factor: 2.476

2.  Genetic Diversity and Characterization of Symbiotic Bacteria Isolated from Endemic Phaseolus Cultivars Located in Contrasting Agroecosystems in Venezuela.

Authors:  María Daniela Artigas Ramírez; Mingrelia España; Hitoshi Sekimoto; Shin Okazaki; Tadashi Yokoyama; Naoko Ohkama-Ohtsu
Journal:  Microbes Environ       Date:  2021       Impact factor: 2.912

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.