High-quality permanent draft genome sequence of the Lebeckia ambigua-nodulating Burkholderia sp. strain WSM4176.

Sofie E De Meyer1, Rui Tian1, Rekha Seshadri2, Tbk Reddy2, Victor Markowitz3, Natalia Ivanova2, Amrita Pati2, Tanja Woyke2, Nikos Kyrpides4, Ron Yates5, John Howieson1, Wayne Reeve1.   

Abstract

Burkholderia sp. strain WSM4176 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective N2-fixing root nodule of Lebeckia ambigua collected in Nieuwoudtville, Western Cape of South Africa, in October 2007. This plant persists in infertile, acidic and deep sandy soils, and is therefore an ideal candidate for a perennial-based agricultural system in Western Australia. Here we describe the features of Burkholderia sp. strain WSM4176, which represents a potential inoculant-quality strain for L. ambigua, together with its genome sequence and annotation. The 9,065,247 bp high-quality-draft genome is arranged in 13 scaffolds of 65 contigs, contains 8369 protein-coding genes and 128 RNA-only encoding genes, and is part of the GEBA-RNB project proposal (Project ID 882).

Keywords:  Betaproteobacteria; GEBA-RNB; Nitrogen fixation; Rhizobia; Root-nodule bacteria

Year:  2015        PMID: 26478785      PMCID: PMC4609093          DOI: 10.1186/s40793-015-0072-3

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Leguminous pasture species are important in Western Australian agriculture because the soils are inherently infertile. With rainfall patterns also changing, this agricultural system cannot continue to rely on the annual legumes currently in commercial use. Deep-rooted herbaceous perennial legumes, including Rhynchosia and Lebeckia species from the Cape Floristic Region (CFR) in South Africa, have been investigated because of their adaptation to acid and infertile soils [1-3]. These plants occur naturally in the CFR, which is one of the richest areas for plants in the world and covers 553,000 ha of land protected by UNESCO as a World Heritage site. Elevations in this area range from 2077 m in the Groot Winterhoek to sea level in the De Hoop Nature Reserve. Moreover, a great part of the area is characterized by mountains, rivers, waterfalls and pools. In areas where L. ambigua is native, rainfall ranges between 150 and 400 mm annually. Parts of the CFR thus have soil and climate conditions similar to those of Western Australia. In four expeditions to the Western Cape of South Africa, held between 2002 and 2007, nodules and seeds were collected and stored as previously described [4]. The isolation of bacteria from these nodules gave rise to a collection of 23 strains that were identified as Burkholderia. Unlike most of the previously studied rhizobial strains, this South African group appears to associate with papilionoid forage legumes, rather than Mimosa species. WSM4176 belongs to a subgroup of strains that were isolated in 2004 from nodules collected near Nieuwoudtville in the Western Cape of South Africa [3]. The site of collection was a moderately grazed rangeland field owned by the Louw family, and the soil was composed of stony sand with a pH of 6. Burkholderia sp. strain WSM4176 is highly effective at fixing nitrogen with L. ambigua, with which it forms crotaloid, indeterminate nodules [3].
WSM4176 thus represents a potential inoculant-quality strain for L. ambigua, which is being developed as a grazing legume adapted to infertile soils that receive 250–400 mm annual rainfall, where climate change has necessitated the domestication of agricultural species with altered characteristics. Therefore, this strain is of special interest to the IMG/GEBA project. Here we present a summary classification and a set of general features for Burkholderia sp. strain WSM4176 together with the description of the genome sequence and annotation.

Organism information

Classification and features

Burkholderia sp. strain WSM4176 is a motile, Gram-negative, non-spore-forming rod (Fig. 1 Left, Center) in the order Burkholderiales of the class Betaproteobacteria. The rod-shaped form varies in size, with dimensions of 0.1–0.2 μm in width and 2.0–3.0 μm in length (Fig. 1 Left). It is fast growing, forming 0.5–1 mm diameter colonies after 24 h when grown on half strength Lupin Agar (½LA) [5] and TY [6] at 28 °C. Colonies on ½LA are white-opaque, slightly domed and moderately mucoid with smooth margins (Fig. 1 Right).
Fig. 1

Images of Burkholderia sp. strain WSM4176 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right)

Figure 2 shows the phylogenetic relationship of Burkholderia sp. strain WSM4176 in a 16S rRNA gene sequence based tree. This strain clusters closest to STM678T and AC1100T with 99.86 and 97.28 % sequence identity, respectively. Minimum Information about the Genome Sequence (MIGS) is provided in Table 1.
Fig. 2

Phylogenetic tree highlighting the position of Burkholderia sp. strain WSM4176 (shown in blue print) relative to other type and non-type strains in the Burkholderia genus (1322 bp internal region). Cupriavidus taiwanensis LMG 19424T was used as outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [27]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [9] are in bold print and the GOLD ID is mentioned after the NCBI accession number. Published genomes are designated with an asterisk

Table 1

Classification and general features of Burkholderia sp. strain WSM4176 in accordance with the MIGS recommendations [28] published by the Genome Standards Consortium [29]

| MIGS ID | Property | Term | Evidence code |
| | Classification | Domain Bacteria | TAS [30] |
| | | Phylum Proteobacteria | TAS [31, 32] |
| | | Class Betaproteobacteria | TAS [33] |
| | | Order Burkholderiales | TAS [34] |
| | | Family Burkholderiaceae | TAS [35] |
| | | Genus Burkholderia | TAS [36] |
| | | Species Burkholderia sp. | TAS [3] |
| | | (Type) strain WSM4176 | TAS [3] |
| | Gram stain | Negative | IDA [36] |
| | Cell shape | Rod | IDA |
| | Motility | Motile | IDA |
| | Sporulation | Non-sporulating | IDA [36] |
| | Temperature range | Not reported | |
| | Optimum temperature | 28 °C | IDA |
| | pH range; Optimum | Not reported | |
| | Carbon source | Not reported | |
| MIGS-6 | Habitat | Soil, root nodule on host | TAS [3] |
| MIGS-6.3 | Salinity | Not reported | |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Free living, symbiotic | TAS [3] |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| MIGS-4 | Geographic location | South Africa | TAS [3] |
| MIGS-5 | Sample collection | 2007 | TAS [3] |
| MIGS-4.1 | Latitude | −31.381 | TAS [3] |
| MIGS-4.2 | Longitude | 19.30 | TAS [3] |
| MIGS-4.4 | Altitude | 789 m | IDA |

Evidence codes – IDA inferred from direct assay, TAS traceable author statement (i.e., a direct report exists in the literature), NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [37]

Symbiotaxonomy

Burkholderia sp. strain WSM4176 belongs to a group of Burkholderia strains that nodulate papilionoid forage legumes rather than the classical Burkholderia hosts, Mimosa spp. (Mimosoideae) [7]. The strain was assessed for nodulation and nitrogen fixation on three separate L. ambigua genotypes (CRSLAM-37, CRSLAM-39 and CRSLAM-41) [3]. It nodulated and fixed nitrogen effectively on CRSLAM-39 and CRSLAM-41 but was only partially effective on CRSLAM-37 [3].

Genome sequencing information

Genome project history

This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Genomic Encyclopedia of Bacteria and Archaea, Root Nodulating Bacteria (GEBA-RNB) chapter project at the U.S. Department of Energy Joint Genome Institute (JGI) for projects of relevance to agency missions [8]. The genome project is deposited in the Genomes OnLine Database (GOLD) [9] and the high-quality permanent draft genome sequence in IMG [10]. Sequencing, finishing and annotation were performed by the JGI using state-of-the-art sequencing technology [11]. A summary of the project information is shown in Table 2.
Table 2

Genome sequencing project information for Burkholderia sp. strain WSM4176

| MIGS ID | Property | Term |
| MIGS-31 | Finishing quality | High-quality-permanent-draft |
| MIGS-28 | Libraries used | Illumina CLIP PE and Illumina Std PE Unamplified |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Fold coverage | 361 × Illumina |
| MIGS-30 | Assemblers | ALLPATHS V.r41554 |
| MIGS-32 | Gene calling methods | Prodigal 1.4, GenePRIMP |
| | Locus Tag | B014 |
| | Genbank ID | ARCY00000000 |
| | Genbank Date of Release | July 11, 2014 |
| | GOLD ID | Gi08873 |
| | BIOPROJECT | PRJNA169686 |
| MIGS-13 | Source Material Identifier | WSM4176 |
| | Project relevance | Symbiotic N2 fixation, agriculture |

Growth conditions and genomic DNA preparation

Burkholderia sp. strain WSM4176 was grown to mid-logarithmic phase in TY rich media [6] on a gyratory shaker at 28 °C. DNA was isolated from 60 mL of cells using a CTAB bacterial genomic DNA isolation method [12].

Genome sequencing and assembly

The genome of Burkholderia sp. strain WSM4176 was sequenced at the DOE Joint Genome Institute (JGI) using Illumina technology [13]. For this genome, we constructed and sequenced an Illumina short-insert paired-end library with an average insert size of 270 bp, which generated 7,496,994 reads, and an Illumina long-insert paired-end library with an average insert size of 6899.89 ± 882.09 bp, which generated 11,773,350 reads, totaling 2891 Mbp of Illumina data (unpublished, Feng Chen). All general aspects of library construction and sequencing performed at the JGI can be found at the JGI’s web site [11]. The initial draft assembly contained 66 contigs in eight scaffolds. The initial draft data was assembled with Allpaths, version r41554 [14], and the consensus was computationally shredded into 10 Kbp overlapping fake reads (shreds). The Illumina draft data was also assembled with Velvet, version 1.1.05 [15], and the consensus sequences were computationally shredded into 1.5 Kbp overlapping fake reads (shreds). The Illumina draft data was assembled again with Velvet using the shreds from the first Velvet assembly to guide the next assembly. The consensus from the second Velvet assembly was shredded into 1.5 Kbp overlapping fake reads. The fake reads from the Allpaths assembly and both Velvet assemblies and a subset of the Illumina CLIP paired-end reads were assembled using parallel phrap, version 4.24 (High Performance Software, LLC). Possible mis-assemblies were corrected with manual editing in Consed [16-18]. Gap closure was accomplished using repeat resolution software (Wei Gu, unpublished), and sequencing of bridging PCR fragments with Sanger and/or PacBio (unpublished, Cliff Han) technologies. For improved high quality draft and non-contiguous finished projects, one round of manual/wet lab finishing may have been completed. Primer walks, shatter libraries, and/or subsequent PCR reads may also be included for a finished project.
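The shredding step described above can be illustrated with a minimal Python sketch. The shred lengths (10 Kbp and 1.5 Kbp) come from the text, but the amount of overlap between shreds is not stated, so the `overlap` parameter and the function name `shred` are assumptions made only for illustration and do not reflect the JGI implementation.

```python
def shred(consensus, shred_len, overlap):
    """Cut a consensus sequence into overlapping fixed-length pieces.

    `overlap` is a hypothetical parameter: the source gives shred lengths
    but not how far neighbouring shreds overlap.
    """
    if not 0 <= overlap < shred_len:
        raise ValueError("overlap must be >= 0 and smaller than shred_len")
    step = shred_len - overlap  # distance between consecutive shred starts
    shreds = []
    for start in range(0, len(consensus), step):
        shreds.append(consensus[start:start + shred_len])
        if start + shred_len >= len(consensus):
            break  # this shred already reaches the end of the consensus
    return shreds


# Toy example: a 24-base consensus cut into 10-base shreds overlapping by 5.
toy_consensus = "ACGTACGTACGTACGTACGTACGT"
toy_shreds = shred(toy_consensus, shred_len=10, overlap=5)
for piece in toy_shreds:
    print(piece)
```

With real data the same call would use `shred_len=10_000` for the Allpaths consensus and `shred_len=1_500` for the Velvet consensus, with an overlap chosen by the user.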
A total of 11 PCR PacBio consensus sequences were completed to close gaps and to raise the quality of the final sequence. The total size of the genome is 9.1 Mb and the final assembly is based on 2891 Mbp of Illumina draft data, which provides an average 318× coverage of the genome.
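The average coverage quoted above follows from dividing the total amount of sequence data by the genome size. The short Python sketch below repeats that arithmetic with the figures given in the text (2891 Mbp of Illumina data and a 9,065,247 bp genome); the function name `fold_coverage` is illustrative only.

```python
ILLUMINA_BASES = 2_891_000_000  # 2891 Mbp of Illumina draft data (from the text)
GENOME_SIZE_BP = 9_065_247      # final genome size in bp (from the text)


def fold_coverage(total_bases, genome_size):
    """Average fold coverage: sequenced bases divided by genome length."""
    return total_bases / genome_size


coverage = fold_coverage(ILLUMINA_BASES, GENOME_SIZE_BP)
print(f"Average coverage: {coverage:.1f}x")
```

Because the 2891 Mbp figure is itself rounded, the result agrees with the reported 318× only to within about one fold.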

Genome annotation

Genes were identified using Prodigal [19] as part of the DOE-JGI Annotation pipeline [17], followed by a round of manual curation using the JGI GenePRIMP pipeline [20]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) nonredundant database and the UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and InterPro databases. These data sources were combined to assert a product description for each predicted protein. Non-coding genes and miscellaneous features were predicted using tRNAscan-SE [21], RNAmmer [22], Rfam [23], TMHMM [24] and SignalP [23]. Additional gene prediction analyses and functional annotation were performed within the Integrated Microbial Genomes (IMG) platform [24].

Genome properties

The genome is 9,065,247 nucleotides long with 62.89 % GC content (Table 3) and comprises 13 scaffolds and 65 contigs (Fig. 3). Of a total of 8497 genes, 8369 were protein-encoding genes and 128 were RNA-only encoding genes. The majority of genes (75.46 %) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COG functional categories is presented in Table 4.
Table 3

Genome statistics for Burkholderia sp. strain WSM4176

| Attribute | Value | % of total |
| Genome size (bp) | 9,065,247 | 100.00 |
| DNA coding (bp) | 7,632,174 | 84.19 |
| DNA G+C (bp) | 5,701,432 | 62.89 |
| DNA scaffolds | 13 | |
| Total genes | 8497 | 100.00 |
| Protein-coding genes | 8369 | 98.49 |
| RNA genes | 128 | 1.51 |
| Pseudo genes | 0 | 0.00 |
| Genes in internal clusters | 648 | 7.63 |
| Genes with function prediction | 6412 | 75.46 |
| Genes assigned to COGs | 5491 | 64.62 |
| Genes with Pfam domains | 6766 | 79.63 |
| Genes with signal peptides | 738 | 8.69 |
| Genes with transmembrane helices | 1865 | 21.95 |
| CRISPR repeats | 0 | 0.00 |
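
The percentages in Table 3 can be recomputed from the raw counts. In the sketch below, base-pair attributes are taken as fractions of the genome size and gene attributes as fractions of the total gene count; that choice of denominators is an assumption inferred from the "100.00" rows, and the variable names are illustrative.

```python
GENOME_SIZE_BP = 9_065_247
TOTAL_GENES = 8497

# Attributes measured in base pairs (denominator: genome size).
bp_counts = {
    "DNA coding": 7_632_174,
    "DNA G+C": 5_701_432,
}

# Attributes measured in genes (denominator: total genes).
gene_counts = {
    "Protein-coding genes": 8369,
    "RNA genes": 128,
    "Genes with function prediction": 6412,
    "Genes assigned to COGs": 5491,
}


def percent(part, whole):
    """Share of `whole` represented by `part`, rounded to two decimals."""
    return round(100 * part / whole, 2)


bp_percent = {name: percent(n, GENOME_SIZE_BP) for name, n in bp_counts.items()}
gene_percent = {name: percent(n, TOTAL_GENES) for name, n in gene_counts.items()}

for name, value in {**bp_percent, **gene_percent}.items():
    print(f"{name}: {value:.2f} %")
```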
Fig. 3

Graphical map of the genome of Burkholderia sp. strain WSM4176. First four large scaffolds are shown according to size. From the bottom to the top of each scaffold: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew

Table 4

Number of protein coding genes of Burkholderia sp. strain WSM4176 associated with the general COG functional categories

| Code | Value | %age | COG category |
| J | 200 | 3.21 | Translation |
| A | 1 | 0.02 | RNA processing and modification |
| K | 596 | 9.55 | Transcription |
| L | 299 | 4.79 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 38 | 0.61 | Cell cycle control, mitosis and meiosis |
| V | 74 | 1.19 | Defense mechanisms |
| T | 270 | 4.33 | Signal transduction mechanisms |
| M | 389 | 6.23 | Cell wall/membrane biogenesis |
| N | 105 | 1.68 | Cell motility |
| U | 146 | 2.34 | Intracellular trafficking and secretion |
| O | 172 | 2.76 | Posttranslational modification, protein turnover, chaperones |
| C | 461 | 7.39 | Energy production conversion |
| G | 495 | 7.93 | Carbohydrate transport and metabolism |
| E | 611 | 9.79 | Amino acid transport metabolism |
| F | 101 | 1.62 | Nucleotide transport and metabolism |
| H | 210 | 3.37 | Coenzyme transport and metabolism |
| I | 323 | 5.18 | Lipid transport and metabolism |
| P | 317 | 5.08 | Inorganic ion transport and metabolism |
| Q | 225 | 3.61 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 727 | 11.65 | General function prediction only |
| S | 479 | 7.68 | Function unknown |
| - | 3006 | 35.38 | Not in COGS |

The total is based on the total number of protein coding genes in the genome
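
As a rough check of Table 4, the sketch below recomputes the percentages from the counts. It rests on an assumption that is not stated by the authors: the per-category values are reproduced when the denominator is the sum of all category assignments, while the "Not in COGS" row is reproduced when the denominator is the total gene count of 8497 from Table 3. This is an observation from the arithmetic only.

```python
# (count, published percentage) for each COG code, copied from Table 4.
COG_TABLE = {
    "J": (200, 3.21), "A": (1, 0.02), "K": (596, 9.55), "L": (299, 4.79),
    "B": (1, 0.02), "D": (38, 0.61), "V": (74, 1.19), "T": (270, 4.33),
    "M": (389, 6.23), "N": (105, 1.68), "U": (146, 2.34), "O": (172, 2.76),
    "C": (461, 7.39), "G": (495, 7.93), "E": (611, 9.79), "F": (101, 1.62),
    "H": (210, 3.37), "I": (323, 5.18), "P": (317, 5.08), "Q": (225, 3.61),
    "R": (727, 11.65), "S": (479, 7.68),
}
NOT_IN_COGS = 3006
TOTAL_GENES = 8497  # from Table 3

# Assumed denominator for the category rows: the sum of all assignments.
total_assignments = sum(count for count, _ in COG_TABLE.values())

recomputed = {
    code: round(100 * count / total_assignments, 2)
    for code, (count, _) in COG_TABLE.items()
}

# Codes whose recomputed value differs from the published one.
mismatches = [
    code for code, (_, published) in COG_TABLE.items()
    if abs(recomputed[code] - published) > 0.006
]

not_in_cogs_percent = round(100 * NOT_IN_COGS / TOTAL_GENES, 2)

print("Total COG assignments:", total_assignments)
print("Rows that do not match:", mismatches)
print("Not in COGs (%):", not_in_cogs_percent)
```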

Conclusion

Burkholderia sp. WSM4176 belongs to a group of Beta-rhizobia isolated from L. ambigua from the fynbos biome in South Africa [3]. WSM4176 is phylogenetically most closely related to STM678T. STM678T and WSM4176 have comparable genome sizes of 8.3 and 9.1 Mbp, respectively. Recently, two more genomes from strains originating from L. ambigua were investigated, WSM3556 and WSM5005 [25]. Both of these strains have a genome size of 7.7 Mbp, which is considerably smaller than that of WSM4176. All four strains, STM678T, WSM3556, WSM4176 and WSM5005, contain a large number of genes assigned to transport and metabolism of amino acids (9.79–10.94 %) and carbohydrates (7.93–8.38 %), and to transcription (9.55–9.94 %). Interestingly, STM678T was initially isolated from Aspalathus species but does not nodulate this host; however, it has been shown to nodulate Cyclopia species from the same fynbos biome in South Africa as L. ambigua [26]. Consistent with the ability of these strains to nodulate and fix nitrogen effectively with legumes, they share many of the genes responsible for the nitrogenase pathway (IMG pathway number 798). The genome sequence of WSM4176 thus provides an unprecedented opportunity to study the host range and nitrogen fixation capacities of these fynbos bacteria.
  25 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  Improved prediction of signal peptides: SignalP 3.0.

Authors:  Jannick Dyrløv Bendtsen; Henrik Nielsen; Gunnar von Heijne; Søren Brunak
Journal:  J Mol Biol       Date:  2004-07-16       Impact factor: 5.469

3.  GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes.

Authors:  Amrita Pati; Natalia N Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Sean D Hooper; Athanasios Lykidis; Nikos C Kyrpides
Journal:  Nat Methods       Date:  2010-05-02       Impact factor: 28.547

4.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

5.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

6.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

7.  Base-calling of automated sequencer traces using phred. II. Error probabilities.

Authors:  B Ewing; P Green
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

8.  Consed: a graphical tool for sequence finishing.

Authors:  D Gordon; C Abajian; P Green
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

9.  The Genomic Standards Consortium.

Authors:  Dawn Field; Linda Amaral-Zettler; Guy Cochrane; James R Cole; Peter Dawyndt; George M Garrity; Jack Gilbert; Frank Oliver Glöckner; Lynette Hirschman; Ilene Karsch-Mizrachi; Hans-Peter Klenk; Rob Knight; Renzo Kottmann; Nikos Kyrpides; Folker Meyer; Inigo San Gil; Susanna-Assunta Sansone; Lynn M Schriml; Peter Sterk; Tatiana Tatusova; David W Ussery; Owen White; John Wooley
Journal:  PLoS Biol       Date:  2011-06-21       Impact factor: 8.029

10.  Genome sequence of the Lebeckia ambigua-nodulating "Burkholderia sprentiae" strain WSM5005(T.).

Authors:  Wayne Reeve; Sofie De Meyer; Jason Terpolilli; Vanessa Melino; Julie Ardley; Tian Rui; Ravi Tiwari; John Howieson; Ron Yates; Graham O'Hara; Megan Lu; David Bruce; Chris Detter; Roxanne Tapia; Cliff Han; Chia-Lin Wei; Marcel Huntemann; James Han; I-Min Chen; Konstantinos Mavromatis; Victor Markowitz; Ernest Szeto; Natalia Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Ioanna Pagani; Amrita Pati; Lynne Goodwin; Lin Peters; Sam Pitluck; Tanja Woyke; Nikos Kyrpides
Journal:  Stand Genomic Sci       Date:  2013-12-15
