Literature DB >> 26380632

High-quality permanent draft genome sequence of Rhizobium sullae strain WSM1592; a Hedysarum coronarium microsymbiont from Sassari, Italy.

Ron Yates1, John Howieson2, Sofie E De Meyer2, Rui Tian2, Rekha Seshadri3, Amrita Pati3, Tanja Woyke3, Victor Markowitz4, Natalia Ivanova3, Nikos Kyrpides5, Angelo Loi6, Brad Nutt6, Giovanni Garau7, Leonardo Sulas8, Wayne Reeve2.   

Abstract

Rhizobium sullae strain WSM1592 is an aerobic, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen (N2) fixing root nodule formed on the short-lived perennial legume Hedysarum coronarium (also known as Sulla coronaria or Sulla). WSM1592 was isolated from a nodule recovered from H. coronarium roots located in Ottava, bordering Sassari, Sardinia in 1995. WSM1592 is highly effective at fixing nitrogen with H. coronarium, and is currently the commercial Sulla inoculant strain in Australia. Here we describe the features of R. sullae strain WSM1592, together with genome sequence information and its annotation. The 7,530,820 bp high-quality permanent draft genome is arranged into 118 scaffolds of 118 contigs containing 7.453 protein-coding genes and 73 RNA-only encoding genes. This rhizobial genome is sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

Entities:  

Keywords:  Alphaproteobacteria; GEBA-RNB; Nitrogen fixation; Rhizobia; Root-nodule bacteria

Year:  2015        PMID: 26380632      PMCID: PMC4572446          DOI: 10.1186/s40793-015-0020-2

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The accessibility and supply of nitrogen fertilizer is an ever-increasing challenge that world agriculture faces [1]. Despite the fact the Earth’s atmosphere consists of approximately 78 % dinitrogen, it is in a form that must be converted before it can be utilised by plants [2]. Conversion of N2 can be achieved by the chemical synthesis of natural gas but these methods can be considered unsustainable because of the use of exhaustible and costly fossil fuel resources [3]. In addition, the manufacturing process not only increases the greenhouse gas emissions but also field N fertiliser application have been directly linked to contaminating and leading to detrimental effects in ecosystems and waterways. Alternatively, a more sustainable and environmentally friendly process of acquiring N is through the biological process of N fixation by diazotrophs [2]. Most biological fixation in world agriculture is provided from the process of symbiotic nitrogen fixation, which occurs following the successful formation of an effective symbiosis by leguminous plants and bacterial microsymbionts [4]. The productive efficiencies of SNF in agricultural areas rely on considerable efforts by researchers and producers in matching suitable legume hosts with their compatible microsymbionts [5]. Some agricultural areas farm with indigenous legumes, while others embark on introducing exotic legumes and their compatible microsymbionts from different geographical locations that are edaphically and climatically suited to their own [4]. In Australia for instance, selection programs have enabled the domestication of new Mediterranean legume species and their microsymbionts [6]. One such grazing legume species commercially introduced into Australian and New Zealand agriculture includes the Papilionoid legume (also known as or Sulla). Sulla is a deep-rooted, short-lived perennial pasture legume that is grown throughout Mediterranean countries where it is fed green, used for silage or as hay [7]. It is noted that the microsymbionts of Sulla display a high level of specificity for nodulation and nitrogen fixation [8]. However, when effectively nodulated Sulla plants have the ability to biologically fix large amounts of nitrogen for increased paddock fertility [9]. strain WSM1592 is the current Australian commercial inoculant for Sulla after replacing strain CC1335 in 2006. This strain has also been deposited in the Western Australian Soil Microbiology collection and is available for research. WSM1592 was isolated in 1995 from a nodule collected from a Sulla plant sampled on a roadside in calcareous loamy sand near the Ottava agriculture research farm, east of Sassari in Sardinia, Italy. The location has a Mediterranean climate with a long-term mean seasonal rainfall of 547 mm. Here we present a preliminary description of the general features for strain WSM1592 together with its genome sequence and annotation.

Organism information

Classification and features

strain WSM1592 is a motile, Gram-negative rod (Fig. 1 Left and Center) in the order of the class . It is fast growing, forming colonies within 3–4 days when grown on half strength Lupin Agar (½LA) [10], tryptone-yeast extract agar (TY) [11] or a modified yeast-mannitol agar (YMA) [12] at 28 °C. Colonies on ½LA are white-opaque, slightly domed and moderately mucoid with smooth margins (Fig. 1 Right).
Fig. 1

Images of Rhizobium sullae strain WSM1592 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right)

Images of Rhizobium sullae strain WSM1592 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right) Figure 2 shows the phylogenetic relationship of strain WSM1592 in a 16S rRNA gene sequence based tree. This strain is phylogenetically the most related to IS 123T, USDA 2370 and Rhizobium phaseoliATCC 14482 with sequence identities to the WSM1592 16S rRNA gene sequence of 100 %, 99.84 % and 99.84 %, respectively, as determined using the EzTaxon-e server [13]. IS 123T was isolated from a root nodule discovered in Southern Spain [14]. In contrast, USDA 2370 was isolated from an effective nodule of and is also able to nodulate and [15]. ATCC 14482 was originally isolated from nodules of Phaseolus vulgaris and has been shown to nodulate , but not [15].
Fig. 2

Phylogenetic tree highlighting the position of R. sullae strain WSM1592 (shown in blue print) relative to other type and non-type rhizobia strains using a 901 bp internal region of the 16S rRNA gene. Bradyrhizobium elkanii ATCC 49852T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [33]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [18] have the GOLD ID mentioned after the strain number and represented in bold, otherwise the NCBI accession number is provided. Finished genomes are designated with an asterisk

Phylogenetic tree highlighting the position of R. sullae strain WSM1592 (shown in blue print) relative to other type and non-type rhizobia strains using a 901 bp internal region of the 16S rRNA gene. Bradyrhizobium elkanii ATCC 49852T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [33]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [18] have the GOLD ID mentioned after the strain number and represented in bold, otherwise the NCBI accession number is provided. Finished genomes are designated with an asterisk Minimum Information about the Genome Sequence [16] of WSM1592 is provided in Table 1 and Additional file 1: Table S1.
Table 1

Classification and general features of R. sullae strain WSM1592 [16, 34]

MIGS IDPropertyTermEvidence code
ClassificationDomain BacteriaTAS [35]
Phylum Proteobacteria TAS [36, 37]
Class Alphaproteobacteria TAS [38]
Order Rhizobiales TAS [39]
Family Rhizobiaceae TAS [40]
Genus Rhizobium TAS [41]
Species Rhizobium sullae TAS [14]
(Type) strain WSM1592 IDA
Gram stainNegativeIDA
Cell shapeRodIDA
MotilityMotileIDA
SporulationNon-sporulatingNAS
Temperature rangeNot reported
Optimum temperature28 °CNAS
pH range; OptimumNot reported
Carbon sourceNot reported
MIGS-6HabitatSoil, root nodule, on hostIDA
MIGS-6.3SalinityNon-halophileNAS
MIGS-22Oxygen requirementAerobicIDA
MIGS-15Biotic relationshipFree living, symbioticIDA
MIGS-14PathogenicityNon-pathogenicNAS
MIGS-4Geographic locationSassari, ItalyIDA
MIGS-5Soil collection date20 May 1995IDA
MIGS-4.1Latitude8.465IDA
MIGS-4.2Longitude40.777IDA
MIGS-4.4Altitude83 mIDA

Evidence codes – IDA Inferred from Direct Assay; TAS Traceable Author Statement (i.e., a direct report exists in the literature); NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [42]

Classification and general features of R. sullae strain WSM1592 [16, 34] Evidence codes – IDA Inferred from Direct Assay; TAS Traceable Author Statement (i.e., a direct report exists in the literature); NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [42]

Symbiotaxonomy

is a short-lived perennial pasture legume native to the Mediterranean basin and throughout the Hedysarum genus there is a large degree of specificity in symbiotic compatibility within this region [8]. WSM1592 nodulates (Nod+) and fixes nitrogen effectively (Fix+) with . However, inoculation of H. spinosissimum, H. flexuosum and H. carnosum with WSM1592 results in mostly Nod- but always Fix-.

Genome sequencing information

Genome project history

This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Genomic Encyclopedia of Bacteria and Archaea, The Root Nodulating Bacteria chapter project at the U.S. Department of Energy, Joint Genome Institute [17]. The genome project is deposited in the Genomes OnLine Database [18] and the high-quality permanent draft genome sequence in IMG [19]. Sequencing, finishing and annotation were performed by the JGI using state of the art sequencing technology [20]. A summary of the project information is shown in Table 2.
Table 2

Project information

MIGS IDPropertyTerm
MIGS-31Finishing qualityHigh-quality permanent draft
MIGS-28Libraries usedIllumina Std PE (2x150bps)
MIGS-29Sequencing platformsIllumina HiSeq 2000
MIGS-31.2Fold coverage877x
MIGS-30AssemblersVelvet 1.1.04; Allpaths-LG r39750
MIGS-32Gene calling methodsProdigal 1.4
Locus TagA3C1
Genbank IDATZB00000000
Genbank Date of ReleaseDecember 12, 2013
GOLD IDGp0010240
BIOPROJECTPRJNA165333
MIGS-13Source Material IdentifierWSM1592
Project relevanceSymbiotic N2 fixation, agriculture
Project information

Growth conditions and genomic DNA preparation

WSM1592 was cultured to mid logarithmic phase in 60 ml of TY rich media [11] on a gyratory shaker at 28 °C. DNA was isolated from the cells using a CTAB (Cetyl trimethyl ammonium bromide) bacterial genomic DNA isolation method [21].

Genome sequencing and assembly

The draft genome of strain WSM1592 was generated at the DOE Joint Genome Institute using state of the art technology [20]. An Illumina Std shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 29,255,624 reads. All general aspects of library construction and sequencing performed at the JGI can be found at the JGI’s web site [22]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, Copeland A, Han J. unpublished). Following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet (version 1.1.04) [23] (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim [24] (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r39750) [25]. Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: −very_clean yes –exportFiltered yes –min_contig_lgth 500 –scaffolding no–cov_cutoff 10) 2) wgsim (−e 0 –1 76 –2 76 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs: PHRED_64 = 1 PLOIDY = 1 FRAG_COVERAGE = 125 JUMP_COVERAGE = 25 LONG_JUMP_COV = 50, RunAllpathsLG: THREADS = 8 RUN = std_shredpairs TARGETS = standard VAPI_WARN_ONLY = True OVERWRITE = True). The final draft assembly contained 118 contigs in 118 scaffolds. The total size of the genome is 7.5 Mbp and the final assembly is based on 2,498,075,850 bp of Illumina data, which provides an average of 877× coverage of the genome.

Genome annotation

Genes were identified using Prodigal [26], as part of the DOE-JGI genome annotation pipeline [27, 28]. The predicted CDSs were translated and used to search the National Centre for Biotechnology Information non-redundant database, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool [29] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [30]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [31]. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes-Expert Review system [32] developed by the Joint Genome Institute, Walnut Creek, CA, USA.

Genome properties

The genome is 7,530,820 nucleotides 59.87 % GC content (Table 3 and comprised of 118 scaffolds of 118 contigs. From a total of 7,526 genes, 7,453 were protein encoding and 73 RNA only encoding genes. The majority of genes (78.42 %) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COG functional categories is presented in Table 4.
Table 3

Genome statistics

AttributeValue% of Total
Genome size (bp)7,530,820100.00
DNA coding (bp)6,571,31287.26
DNA G + C (bp)4,508,64659.87
DNA scaffolds118
Total genes7,526100.00
Protein coding genes7,45399.03
RNA genes730.97
Pseudo genes00
Genes in internal clusters5387.15
Genes with function prediction5,90278.42
Genes assigned to COGs5,14868.40
Genes with Pfam domains6,17482.04
Genes with signal peptides6598.76
Genes with transmembrane helices1,69922.58
CRISPR repeats00
Table 4

Number of genes associated with the general COG functional categories

CodeValue% ageCOG Category
J1863.25Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K5549.67Transcription
L1582.76Replication, recombination and repair
B20.03Chromatin structure and dynamics
D360.63Cell cycle control, Cell division, chromosome partitioning
V651.13Defense mechanisms
T2113.68Signal transduction mechanisms
M2935.11Cell wall/membrane/envelope biogenesis
N681.19Cell motility
U1121.95Intracellular trafficking, secretion, and vesicular transport
O1672.91Posttranslational modification, protein turnover, chaperones
C3275.71Energy production and conversion
G61410.71Carbohydrate transport and metabolism
E68411.94Amino acid transport and metabolism
F1071.87Nucleotide transport and metabolism
H1743.04Coenzyme transport and metabolism
I1963.42Lipid transport and metabolism
P3185.55Inorganic ion transport and metabolism
Q1382.41Secondary metabolite biosynthesis, transport and catabolism
R74312.96General function prediction only
S57810.09Function unknown
-237831.60Not in COGS

The total is based on the total number of protein coding genes in the genome

Genome statistics Number of genes associated with the general COG functional categories The total is based on the total number of protein coding genes in the genome

Conclusions

WSM1592 was isolated from a root nodule of (also known as ). Phylogenetic analysis revealed that WSM1592 is the most closely related to IS 123T, which was also isolated from Hedysarum coronarium growing in Southern Spain. The genome of WSM1592 is the first to be described for a strain of and is 7.5 Mbp, with a GC content of 59.87 %. As expected this genome contains the nitrogenase-RXN MetaCyc pathway characterized by the multiprotein nitrogenase complex and has been shown to fix effectively with . The genome attributes of WSM1592 will be important for the characterisation of the genetic determinants required for the establishment of an effective symbiosis with .
  23 in total

Review 1.  Symbiotic nitrogen fixation and phosphorus acquisition. Plant nutrition in a world of declining renewable resources.

Authors:  C P Vance
Journal:  Plant Physiol       Date:  2001-10       Impact factor: 8.340

2.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

3.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

4.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

5.  The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.

Authors:  T B K Reddy; Alex D Thomas; Dimitri Stamatis; Jon Bertsch; Michelle Isbandi; Jakob Jansson; Jyothi Mallajosyula; Ioanna Pagani; Elizabeth A Lobos; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 16.971

6.  Revision of the taxonomic status of the species Rhizobium leguminosarum (Frank 1879) Frank 1889AL, Rhizobium phaseoli Dangeard 1926AL and Rhizobium trifolii Dangeard 1926AL. R. trifolii is a later synonym of R. leguminosarum. Reclassification of the strain R. leguminosarum DSM 30132 (=NCIMB 11478) as Rhizobium pisi sp. nov.

Authors:  Martha Helena Ramírez-Bahena; Paula García-Fraile; Alvaro Peix; Angel Valverde; Raúl Rivas; José M Igual; Pedro F Mateos; Eustoquio Martínez-Molina; Encarna Velázquez
Journal:  Int J Syst Evol Microbiol       Date:  2008-11       Impact factor: 2.747

7.  Rhizobium sullae sp. nov. (formerly 'Rhizobium hedysari'), the root-nodule microsymbiont of Hedysarum coronarium L.

Authors:  Andrea Squartini; Paolo Struffi; Heidi Döring; Sonja Selenska-Pobell; Elisabetta Tola; Alessio Giacomini; Elena Vendramin; Encarna Velázquez; Pedro F Mateos; Eustoquio Martínez-Molina; Frank B Dazzo; Sergio Casella; Marco P Nuti
Journal:  Int J Syst Evol Microbiol       Date:  2002-07       Impact factor: 2.747

8.  The Genomic Standards Consortium.

Authors:  Dawn Field; Linda Amaral-Zettler; Guy Cochrane; James R Cole; Peter Dawyndt; George M Garrity; Jack Gilbert; Frank Oliver Glöckner; Lynette Hirschman; Ilene Karsch-Mizrachi; Hans-Peter Klenk; Rob Knight; Renzo Kottmann; Nikos Kyrpides; Folker Meyer; Inigo San Gil; Susanna-Assunta Sansone; Lynn M Schriml; Peter Sterk; Tatiana Tatusova; David W Ussery; Owen White; John Wooley
Journal:  PLoS Biol       Date:  2011-06-21       Impact factor: 8.029

9.  A Genomic Encyclopedia of the Root Nodule Bacteria: assessing genetic diversity through a systematic biogeographic survey.

Authors:  Wayne Reeve; Julie Ardley; Rui Tian; Leila Eshragi; Je Won Yoon; Pinyaruk Ngamwisetkun; Rekha Seshadri; Natalia N Ivanova; Nikos C Kyrpides
Journal:  Stand Genomic Sci       Date:  2015-02-09

10.  Improving microbial genome annotations in an integrated database context.

Authors:  I-Min A Chen; Victor M Markowitz; Ken Chu; Iain Anderson; Konstantinos Mavromatis; Nikos C Kyrpides; Natalia N Ivanova
Journal:  PLoS One       Date:  2013-02-12       Impact factor: 3.240

View more
  2 in total

1.  Genome Sequence of Rhizobium sullae HCNT1 Isolated from Hedysarum coronarium Nodules and Featuring Peculiar Denitrification Phenotypes.

Authors:  B de Diego-Diaz; L Treu; S Campanaro; V da Silva Duarte; M Basaglia; L Favaro; S Casella; A Squartini
Journal:  Genome Announc       Date:  2018-01-25

2.  Draft Genome Sequence of the Nitrogen-Fixing Rhizobium sullae Type Strain IS123T Focusing on the Key Genes for Symbiosis with its Host Hedysarum coronarium L.

Authors:  Gaurav Sablok; Riccardo Rosselli; Torsten Seeman; Robin van Velzen; Elisa Polone; Alessio Giacomini; Nicola La Porta; Rene Geurts; Rosella Muresu; Andrea Squartini
Journal:  Front Microbiol       Date:  2017-07-26       Impact factor: 5.640

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.