Literature DB >> 26221417

High-quality permanent draft genome sequence of Rhizobium leguminosarum bv. viciae strain GB30; an effective microsymbiont of Pisum sativum growing in Poland.

Andrzej Mazur1, Sofie E De Meyer2, Rui Tian2, Jerzy Wielbo1, Kamil Zebracki1, Rekha Seshadri3, Tbk Reddy3, Victor Markowitz4, Natalia N Ivanova3, Amrita Pati3, Tanja Woyke3, Nikos C Kyrpides5, Wayne Reeve2.   

Abstract

Rhizobium leguminosarum bv. viciae GB30 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of Pisum sativum. GB30 was isolated in Poland from a nodule recovered from the roots of Pisum sativum growing at Janow. GB30 is also an effective microsymbiont of the annual forage legumes vetch and pea. Here we describe the features of R. leguminosarum bv. viciae strain GB30, together with sequence and annotation. The 7,468,464 bp high-quality permanent draft genome is arranged in 78 scaffolds of 78 contigs containing 7,227 protein-coding genes and 75 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.

Entities:  

Keywords:  Alphaproteobacteria; GEBA-RNB; Nitrogen fixation; Rhizobia; Root-nodule bacteria

Year:  2015        PMID: 26221417      PMCID: PMC4517663          DOI: 10.1186/s40793-015-0029-6

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The most efficient biological nitrogen fixation occurs when bacterial microsymbionts (rhizobia) form an effective symbiotic association with legume host plants. Legumes can develop these interactions with many different species of rhizobia belonging mainly to the , including , ,,, and [1, 2]. The genus contains at the time of writing 71 species, and within a species there may be distinct symbiovars [3]. Within the species , there are three distinct symbiovars [4, 5] including bv. phaseoli that forms nodules with , bv. trifolii that forms nodules with clover () and bv. viciae that forms nodules on vetch, pea and lentil (,, Pisum and ). In the nod genes that define these distinct host specificities are mostly located on the symbiotic plasmid, which has generically been designated pSym. The genomes of strains are usually large and complex containing, in addition to pSym, a chromosomal replicon and extra-chromosomal low-copy-number replicons characterized by the presence of repABC replication systems [6-8]. Recent studies have revealed that substantial divergence can occur in this genome organization and in the metabolic versatility of isolates [5, 9–12]. Kumar et al. [5] demonstrated that the diversity of within a local population of nodule isolates was 10 times higher than that found for . It was noted that the abundance of a particular genotype within the population can vary significantly and adaptation to the edaphic environment is a sought after trait particularly for the development of inoculants [13, 14]. bv. viciae GB30 was isolated as the most abundant nodule inhabitant (>42 %) of cv. Ramrod plants cultivated at a field site in Janow, Poland [10]. In contrast to other abundant isolates, GB30 formed nodules and fixed nitrogen with both P. sativum and (cv. Wista). Preliminary investigation into the genome architecture using Eckhardt analysis has revealed that GB30 contained a multipartite genome consisting of six replicons with one chromosome and five plasmids [10]. The genome of this strain could therefore provide important insights into the mechanisms required by effective microsymbionts to adapt to a particular edaphic environment. Here, we present a set of general features for bv. viciae GB30 together with the description of the complete genome sequence and annotation.

Organism information

Classification and features

bv. viciae strain GB30 is a motile, Gram-negative rod in the order of the class . The rod-shaped form varies in size with dimensions of 0.8-1 μm in width and 2.3-2.5 μm in length (Fig. 1 Left and Center). It is fast growing, forming colonies within 3–4 days when grown on half strength Lupin Agar (½LA) [15] at 28 °C. Colonies on ½LA are white-opaque, slightly domed and moderately mucoid with smooth margins (Fig. 1 Right).
Fig. 1

Images of Rhizobium leguminosarum bv. viciae strain GB30 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on ½LA solid media (Right)

Images of Rhizobium leguminosarum bv. viciae strain GB30 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on ½LA solid media (Right) Figure 2 shows the phylogenetic relationship of bv. viciae GB30 in a 16S rRNA gene sequence based tree. This strain is phylogenetically most related to FB206T and R602spT based on the 16S rRNA gene alignment with sequence identities of 100 %, as determined using the EzTaxon-e server [16]. FB206T was isolated from effective root nodules in Tunisia [17], whereas R602spT was isolated from effective root nodules in France [18]. Sequence similarity was also investigated with strains from the GEBA-RNB project [12] and GB30 was found to be closely related to bv. trifolii WSM1689 with 100 % 16S rRNA gene sequence identity. bv. trifolii WSM1689 is a highly effective microsymbiont of the perennial clover and has been shown to have a remarkable narrow host range [19]. Minimum Information about the Genome Sequence (MIGS) is provided in Table 1 and Additional file 1: Table S1.
Fig. 2

Phylogenetic tree highlighting the position of Rhizobium leguminosarum bv. viciae GB30 (shown in blue print) relative to other type and non-type strains in the Rhizobium genus using a 901 bp internal region of the 16S rRNA gene. Bradyrhizobium elkanii ATCC 49852T was used as outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [36]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [20] are shown in bold and have the GOLD ID mentioned after the strain number, otherwise the NCBI accession number has been provided. Finished genomes are designated with an asterisk

Table 1

Classification and general features of Rhizobium leguminosarum bv. viciae strain GB30 in accordance with the MIGS recommendations [37] published by the Genome Standards Consortium [38].

MIGS IDPropertyTermEvidence code
Domain Bacteria TAS [39]
Phylum Proteobacteria TAS [40, 41]
Class Alphaproteobacteria TAS [42, 43]
ClassificationOrder Rhizobiales TAS [44]
Family Rhizobiaceae TAS [45]
Genus Rhizobium TAS [46]
Species Rhizobium leguminosarum TAS [4749]
Gram stainNegativeIDA
Cell shapeRodIDA
MotilityMotileIDA
SporulationNon-sporulatingNAS
Temperature rangeMesophileNAS
Optimum temperature28 °CTAS [9]
pH range; OptimumNot reported
Carbon sourceNot reported
MIGS-6HabitatSoil, root nodule, on hostTAS [9]
MIGS-6.3SalinityNon-halophileNAS
MIGS-22Oxygen requirementAerobicTAS [49]
MIGS-15Biotic relationshipFree living, symbioticTAS [10]
MIGS-14PathogenicityNon-pathogenicTAS [50]
MIGS-4Geographic locationJanow, near Lublin, eastern PolandTAS [10]
MIGS-5Sample collectionBetween May and June, 2008TAS [10]
MIGS-4.1Latitude51.387638TAS [10]
MIGS-4.2Longitude22.369194TAS [10]
MIGS-4.3Altitude185 mIDA

Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [51].

Phylogenetic tree highlighting the position of Rhizobium leguminosarum bv. viciae GB30 (shown in blue print) relative to other type and non-type strains in the Rhizobium genus using a 901 bp internal region of the 16S rRNA gene. Bradyrhizobium elkanii ATCC 49852T was used as outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [36]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [20] are shown in bold and have the GOLD ID mentioned after the strain number, otherwise the NCBI accession number has been provided. Finished genomes are designated with an asterisk Classification and general features of Rhizobium leguminosarum bv. viciae strain GB30 in accordance with the MIGS recommendations [37] published by the Genome Standards Consortium [38]. Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [51].

Symbiotaxonomy

bv. viciae strain GB30 was obtained from pea nodules (P. sativum cv. Ramrod) growing in sandy loam (N:P:K 0.157:0.014:0.013 %) in Janow near Lublin (Poland). The soil contained a relatively high number of bv. viciae, bv. trifolii and bv. phaseoli cells i.e., 9.2 × 103, 4.2 ÷ 103 and 1.5 × 103 bacteria/g of soil, respectively, as determined by the most probable number (MPN) method [10]. Plants were grown on 1 m2 plot for six weeks between May and June, 2008. Five randomly chosen pea plants growing in each other’s vicinity were harvested; the nodules were collected, surface-sterilized and the microsymbionts isolated [10]. One of the most abundant isolates, GB30, formed nodules (Nod+) and fixed N2 (Fix+) with P. sativum and (cv. Wista) increasing the wet mass weight by 54 and 38 %, respectively. Plants inoculated with GB30 also showed a 2.6 fold increase in nodule number and a 2.2 fold increase in seed pod number.

Genome sequencing and annotation information

Genome project history

This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Genomic Encyclopedia of Bacteria and Archaea, The Root Nodulating Bacteria chapter (GEBA-RNB) project at the U.S. Department of Energy, Joint Genome Institute [12]. The genome project is deposited in the Genomes OnLine Database [20] and the high-quality permanent draft genome sequence in IMG [21]. Sequencing, finishing and annotation were performed by the JGI using state of the art sequencing technology [22]. A summary of the project information is shown in Table 2.
Table 2

Genome sequencing project information for Rhizobium leguminosarum bv. viciae strain GB30

MIGS IDPropertyTerm
MIGS-31Finishing qualityHigh-quality permanent draft
MIGS-28Libraries usedIllumina Std PE
MIGS-29Sequencing platformsIllumina Hiseq 2000
MIGS-31.2Fold coverage121.9 x Illumina
MIGS-30AssemblersVelvet version 1.1.04; ALLPATHS v. r41043
MIGS-32Gene calling methodsProdigal 1.4
Locus TagA3A3
GenBank IDATTP00000000
GenBank Date of ReleaseJuly 9, 2013
GOLD IDGp0009658 [52]
BIOPROJECTPRJNA165299
MIGS-13Source Material IdentifierGB30
Project relevanceSymbiotic N2 fixation, agriculture
Genome sequencing project information for Rhizobium leguminosarum bv. viciae strain GB30

Growth conditions and genomic DNA preparation

bv. viciae strain GB30 was grown to mid logarithmic phase in TY rich media [23] on a gyratory shaker at 28 °C. DNA was isolated from 60 mL of cells using a CTAB (Cetyl trimethyl ammonium bromide) bacterial genomic DNA isolation method [24].

Genome sequencing and assembly

The draft genome of bv. viciae GB30 was generated at the DOE Joint Genome Institute [22]. An Illumina Std shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 25,943,396 reads totaling 3,891.5 Mbp. All general aspects of library construction and sequencing performed at the JGI can be found at the JGI web site [25]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artefacts (Mingkun L, Copeland A, Han J. unpublished). Following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet version 1.1.04 [26] (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim [27] (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r41043) [28]. Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: −very_clean yes –export-Filtered yes –min_contig_lgth 500 –scaffolding no –cov_cutoff 10) 2) wgsim (−e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs: PHRED_64 = 1 PLOIDY = 1 FRAG_COVERAGE = 125 JUMP_COVERAGE = 25 LONG_JUMP_COV = 50, RunAllpathsLG: THREADS = 8 RUN = std_shredpairs TARGETS = standard VAPI_WARN_ONLY = True OVERWRITE = True). The final draft assembly contained 78 contigs in 78 scaffolds. The total size of the genome is 7.5 Mbp and the final assembly is based on 910.4 Mbp of Illumina data, which provides an average of 121.9× coverage.

Genome annotation

Genes were identified using Prodigal [29], as part of the DOE-JGI genome annotation pipeline [30, 31]. The predicted CDSs were translated and used to search the National Centre for Biotechnology Information (NCBI) non-redundant database, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool [32] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [33]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [34]. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes-Expert Review (IMG-ER) system [35] developed by the Joint Genome Institute, Walnut Creek, CA, USA.

Genome Properties

The genome is 7,468,464 nucleotides with 60.81 % GC content (Table 3) and comprised of 78 scaffolds of 78 contigs. From a total of 7,302 genes, 7,227 were protein encoding and 75 RNA only encoding genes. The majority of genes (79.57 %) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COGs functional categories is presented in Table 4.
Table 3

Genome Statistics for Rhizobium leguminosarum bv. viciae strain GB30

AttributeValue% of Total
Genome size (bp)7,468,464100.00
DNA coding (bp)6,497,89887.00
DNA G + C (bp)4,541,55860.81
DNA scaffolds78100.00
Total genes7,302100.00
Protein coding genes7,22798.97
RNA genes751.03
Pseudo genes00
Genes in internal clusters4706.44
Genes with function prediction5,81079.57
Genes assigned to COGs5,18270.97
Genes with Pfam domains6,02582.51
Genes with signal peptides6348.68
Genes with transmembrane proteins1,64622.54
CRISPR repeats1
Table 4

Number of genes associated with the general COG functional categories.

CodeValue% ageDescription
J2333.90Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K5979.98Transcription
L1282.14Replication, recombination and repair
B20.03Chromatin structure and dynamics
D350.59Cell cycle control, Cell division, chromosome partitioning
V1191.99Defense mechanisms
T2854.77Signal transduction mechanisms
M3105.18Cell wall/membrane/envelope biogenesis
N931.56Cell motility
U580.97Intracellular trafficking, secretion, and vesicular transport
O2063.44Posttranslational modification, protein turnover, chaperones
C3255.43Energy production and conversion
G64410.77Carbohydrate transport and metabolism
E68911.52Amino acid transport and metabolism
F1161.94Nucleotide transport and metabolism
H2704.52Coenzyme transport and metabolism
I2414.03Lipid transport and metabolism
P3175.30Inorganic ion transport and metabolism
Q1863.11Secondary metabolite biosynthesis, transport and catabolism
R69511.62General function prediction only
S3816.37Function unknown
-2,12029.03Not in COGS

The total is based on the total number of protein coding genes in the genome.

Genome Statistics for Rhizobium leguminosarum bv. viciae strain GB30 Number of genes associated with the general COG functional categories. The total is based on the total number of protein coding genes in the genome.

Conclusion

bv. viciae GB30 belongs to a group of Alpha-rhizobia strains isolated from in Poland. Strain GB30 is part of the GEBA-RNB project that sequenced 24 strains and 12 bv. viciae strains [12]. Phylogenetic analysis revealed that GB30 is most closely related to bv. trifolii CB782 and WSM1689, both part of the GEBA-RNB project [12]. Full genome comparison of GB30 and WSM1689 [19] revealed that GB30 has the largest genome (7.4 Mbp), with the highest COG count (5,182), the lowest Pfam % (82.51) and the lowest TIGRfam % (22.13 %). The genome attributes of bv. viciae GB30, in conjunction with the other genomes, will be important for on-going comparative and functional analyses of the plant microbe interactions required for the successful establishment of agricultural crops.
  35 in total

1.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

2.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

Review 3.  Establishing nitrogen-fixing symbiosis with legumes: how many rhizobium recipes?

Authors:  Catherine Masson-Boivin; Eric Giraud; Xavier Perret; Jacques Batut
Journal:  Trends Microbiol       Date:  2009-09-18       Impact factor: 17.079

4.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

5.  Rhizobium gallicum sp. nov. and Rhizobium giardinii sp. nov., from Phaseolus vulgaris nodules.

Authors:  N Amarger; V Macheret; G Laguerre
Journal:  Int J Syst Bacteriol       Date:  1997-10

6.  Revision of the taxonomic status of the species Rhizobium leguminosarum (Frank 1879) Frank 1889AL, Rhizobium phaseoli Dangeard 1926AL and Rhizobium trifolii Dangeard 1926AL. R. trifolii is a later synonym of R. leguminosarum. Reclassification of the strain R. leguminosarum DSM 30132 (=NCIMB 11478) as Rhizobium pisi sp. nov.

Authors:  Martha Helena Ramírez-Bahena; Paula García-Fraile; Alvaro Peix; Angel Valverde; Raúl Rivas; José M Igual; Pedro F Mateos; Eustoquio Martínez-Molina; Encarna Velázquez
Journal:  Int J Syst Evol Microbiol       Date:  2008-11       Impact factor: 2.747

7.  Rhizobium laguerreae sp. nov. nodulates Vicia faba on several continents.

Authors:  Sabrine Saïdi; Martha-Helena Ramírez-Bahena; Nery Santillana; Doris Zúñiga; Estela Álvarez-Martínez; Alvaro Peix; Ridha Mhamdi; Encarna Velázquez
Journal:  Int J Syst Evol Microbiol       Date:  2013-09-25       Impact factor: 2.747

8.  Complete genome sequence of Rhizobium leguminosarum bv. trifolii strain WSM1325, an effective microsymbiont of annual Mediterranean clovers.

Authors:  Wayne Reeve; Graham O'Hara; Patrick Chain; Julie Ardley; Lambert Bräu; Kemanthi Nandesena; Ravi Tiwari; Alex Copeland; Matt Nolan; Cliff Han; Thomas Brettin; Miriam Land; Galina Ovchinikova; Natalia Ivanova; Konstantinos Mavromatis; Victor Markowitz; Nikos Kyrpides; Vanessa Melino; Matthew Denton; Ron Yates; John Howieson
Journal:  Stand Genomic Sci       Date:  2010-06-15

Review 9.  The genome of Rhizobium leguminosarum has recognizable core and accessory components.

Authors:  J Peter W Young; Lisa C Crossman; Andrew W B Johnston; Nicholas R Thomson; Zara F Ghazoui; Katherine H Hull; Margaret Wexler; Andrew R J Curson; Jonathan D Todd; Philip S Poole; Tim H Mauchline; Alison K East; Michael A Quail; Carol Churcher; Claire Arrowsmith; Inna Cherevach; Tracey Chillingworth; Kay Clarke; Ann Cronin; Paul Davis; Audrey Fraser; Zahra Hance; Heidi Hauser; Kay Jagels; Sharon Moule; Karen Mungall; Halina Norbertczak; Ester Rabbinowitsch; Mandy Sanders; Mark Simmonds; Sally Whitehead; Julian Parkhill
Journal:  Genome Biol       Date:  2006-04-26       Impact factor: 13.583

10.  Improving microbial genome annotations in an integrated database context.

Authors:  I-Min A Chen; Victor M Markowitz; Ken Chu; Iain Anderson; Konstantinos Mavromatis; Nikos C Kyrpides; Natalia N Ivanova
Journal:  PLoS One       Date:  2013-02-12       Impact factor: 3.240

View more
  1 in total

1.  Electrophoretic profiles of lipopolysaccharides from Rhizobium strains nodulating Pisum sativum do not reflect phylogenetic relationships between these strains.

Authors:  Jolanta Kutkowska; Monika Marek-Kozaczuk; Jerzy Wielbo; Marek Wójcik; Teresa Urbanik-Sypniewska
Journal:  Arch Microbiol       Date:  2017-04-06       Impact factor: 2.552

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.