Literature DB >> 26388968

High-quality permanent draft genome sequence of the Lebeckia - nodulating Burkholderia dilworthii strain WSM3556(T).

Sofie E De Meyer1, Rui Tian1, Rekha Seshadri2, Natalia Ivanova2, Amrita Pati2, Victor Markowitz3, Tanja Woyke2, Ron Yates4, John Howieson1, Nikos Kyrpides5, Wayne Reeve1.   

Abstract

Burkholderia dilworthii strain WSM3556(T) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective N2-fixing root nodule of Lebeckia ambigua collected near Grotto Bay Nature Reserve, in the Western Cape of South Africa, in October 2004. This plant persists in infertile and deep sandy soils with acidic pH, and is therefore an ideal candidate for a perennial based agriculture system in Western Australia. WSM3556(T) thus represents a potential inoculant quality strain for L. ambigua for which we describe the general features, together with genome sequence and annotation. The 7,679,067 bp high-quality permanent draft genome is arranged in 140 scaffolds of 141 contigs, contains 7,059 protein-coding genes and 64 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.

Entities:  

Keywords:  Betaproteobacteria; GEBA-RNB; Lebeckia; Nitrogen fixation; Root-nodule bacteria; South Africa

Year:  2015        PMID: 26388968      PMCID: PMC4575442          DOI: 10.1186/s40793-015-0048-3

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Over the last decade, agricultural scientists have sought to discover perennial legumes from a wide range of natural environments to develop new plants for grazing systems [1]. It is thought that these plants might be more resilient to changing rainfall patterns, such as in the target environments of Western Australia. Here, winter rainfall has declined by 20 % in the last two decades [2], although more frequent summer rainfall events have been experienced. In the fynbos biome of South Africa, several species that offer potential for domestication have been discovered [1, 3]. These legumes are frequently nodulated by bacteria in the class [3, 4]. The symbiosis between these and legumes from the genera Lebeckia and Rhynchosia fix atmospheric nitrogen to enable their cultivation on infertile soils [4-7]. is proving well adapted to Western Australia [1] because in areas where it is naturally found in South Africa the soil and climatic conditions approximate those of Western Australia. Nodules and seeds of L. ambigua were collected in four expeditions to the Western Cape of South Africa between 2002 and 2007. The isolation of bacteria from these nodules gave rise to a collection of 23 strains that were identified as [3]. Unlike most of the previously studied nodulating strains, this South African group appears to associate with papilionoid forage legumes, rather than Mimosa species. WSM3556 belongs to a subgroup of strains that were isolated in 2004 from nodules collected south west of Darling, in a natural rangeland site on the southern border of the Grotto Bay Nature Reserve [3]. The soil at the site of collection was deep sand with a pH of 6. strain WSM3556 was isolated from those nodules and is effective at fixing nitrogen with L. ambigua and L. sepiaria. The nodules formed by these symbioses are crotaloid and indeterminate [3]. WSM3556 thus represents a potential inoculant quality strain for L. ambigua, which is being developed as a grazing legume adapted to infertile soils that receive 250–400 mm annual rainfall in southern Australia and is therefore of special interest to the RNB chapter of the GEBA project. Here we present a summary classification and a set of general features for strain WSM3556 together with the description of the permanent draft genome sequence and annotation.

Organism information

Classification and features

strain WSM3556 is a motile, Gram-negative, non-spore-forming rod (Fig. 1 Left, Center) in the order of the class . The rod-shaped form varies in size with dimensions of 0.9–2 μm in width and 0.4–3.0 μm in length (Fig. 1 Left). It is fast growing, forming 0.4–2 mm diameter colonies after 24 h when grown on half Lupin Agar [8] and TY [9] at 28 °C. Colonies on ½LA are white-opaque, slightly domed, moderately mucoid with smooth margins (Fig. 1 Right). Additional physiological properties of this strain were previously published [5].
Fig. 1

Images of Burkholderia dilworthii strain WSM3556T using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right)

Images of Burkholderia dilworthii strain WSM3556T using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right) Figure 2 shows the phylogenetic relationship of strain WSM3556 in a 16S rRNA gene sequence based tree. This strain is most similar to WSM3937 and PsJNT based on the 16S rRNA with sequence identities of 98.50 % and 98.11 %, respectively, as determined using the EzTaxon-e server [10]. WSM3937 has been isolated from , a herbaceous legume from the fynbos biome in South Africa [7]. PsJNT was isolated from surface sterilized onion roots and has plant growth promoting properties on various plants, however it has not been reported in association with legumes [11]. Minimum Information about the Genome Sequence of WSM3556 is provided in Table 1.
Fig. 2

Phylogenetic tree highlighting the position of Burkholderia dilworthii strain WSM3556T (shown in blue print), relative to other strains in the Burkholderia genus using a 1,322 bp internal region of the 16S rRNA gene. Cupriavidus taiwanensis LMG 19424T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [31]. The tree was build using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [14] are in bold print and the GOLD ID is provided after the NCBI accession number. Published genomes are designated with an asterisk

Table 1

Classification and general features of Burkholderia dilworthii WSM3556T in accordance with the MIGS recommendations [32] published by the Genome Standards Consortium [33]

MIGS IDPropertyTermEvidence code
ClassificationDomain Bacteria TAS [34]
Phylum Proteobacteria TAS [35]
Class Betaproteobacteria TAS [36, 37]
Order Burkholderiales TAS [37, 38]
Family Burkholderiaceae TAS [37, 39]
Genus Burkholderia TAS [37, 40]
Species Burkholderia dilworthii TAS [5]
(Type) strain: WSM3556T TAS [5]
Gram stainNegativeTAS [5]
Cell shapeRodTAS [5]
MotilityMotileTAS [5]
SporulationNon-sporulatingTAS [39]
Temperature range15–37 °CTAS [5]
Optimum temperature28 °CTAS [5]
pH range; Optimum5.5–8; 7TAS [5]
Carbon sourceLarge rangeTAS [5]
MIGS-6HabitatSoil, root nodule on hostIDA
MIGS-6.3Salinity0–10 %TAS [5]
MIGS-22Oxygen requirementAerobicIDA
MIGS-15Biotic relationshipFree living, symbioticIDA
MIGS-14PathogenicityNon-pathogenicNAS
MIGS-4Geographic locationSouth AfricaTAS [3]
MIGS-5Sample collection2004TAS [3]
MIGS-4.1 MIGS-4.2Longitude18.44TAS [3]
Latitude−33.49TAS [3]
MIGS-4.4Altitude237IDA

Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [41]

Phylogenetic tree highlighting the position of Burkholderia dilworthii strain WSM3556T (shown in blue print), relative to other strains in the Burkholderia genus using a 1,322 bp internal region of the 16S rRNA gene. Cupriavidus taiwanensis LMG 19424T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [31]. The tree was build using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [14] are in bold print and the GOLD ID is provided after the NCBI accession number. Published genomes are designated with an asterisk Classification and general features of Burkholderia dilworthii WSM3556T in accordance with the MIGS recommendations [32] published by the Genome Standards Consortium [33] Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [41]

Symbiotaxonomy

strain WSM3556 belongs to a group of strains that nodulate papilionoid forage legumes rather than the classical Mimosa host species (Mimosoideae) described for other microsymbionts [12]. strain WSM3556 was assessed for nodulation and nitrogen fixation on three separate L. ambigua genotypes (CRSLAM-37, CRSLAM-39 and CRSLAM-41) [3]. It could nodulate and fix effectively on CRSLAM-41 but was partially effective on CRSLAM-37 and CRSLAM-39 [3]. Moreover, WSM3556 also nodulates and fixes nitrogen in association with .

Genome sequencing information

Genome project history

This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Genomic Encyclopedia of Bacteria and Archaea, The Root Nodulating Bacteria chapter project at the U.S. Department of Energy, Joint Genome Institute for projects of relevance to agency missions [13]. The genome project is deposited in the Genomes OnLine Database [14] and the high-quality permanent draft genome sequence in IMG [15]. Sequencing, finishing and annotation were performed by the JGI using state of the art sequencing technology [16]. A summary of the project information is shown in Table 2.
Table 2

Genome sequencing project information for Burkholderia dilworthii WSM3556T

MIGS IDPropertyTerm
MIGS-31Finishing qualityHigh-quality-permanent-draft
MIGS-28Libraries usedIllumina Std
MIGS-29Sequencing platformsIllumina HiSeq 2000
MIGS-31.2Fold coverage367 × Illumina
MIGS-30AssemblersVelvet 1.1.04, ALLPATHS V.r37348
MIGS-32Gene calling methodsProdigal 1.4
Locus TagF759
Genbank IDAWZT00000000
Genbank Date of ReleaseDecember 12, 2013
GOLD IDGp0010131
BIOPROJECTPRJNA182743
MIGS-13Source Material IdentifierWSM3556, LMG 27173, HAMBI3353
Project relevanceSymbiotic N2fixation, agriculture
Genome sequencing project information for Burkholderia dilworthii WSM3556T

Growth conditions and genomic DNA preparation

strain WSM3556 was grown on TY solid medium [9] for 3 days, a single colony was selected and used to inoculate 5 ml TY broth medium. The culture was grown for 48 h on a gyratory shaker (200 rpm) at 28 °C. Subsequently 1 ml was used to inoculate 60 ml TY broth medium and grown on a gyratory shaker (200 rpm) at 28 °C until OD 0.6 was reached. DNA was isolated from 60 mL of cells using a CTAB bacterial genomic DNA isolation method [17]. Final concentration of the DNA was 0.5 mg/ml.

Genome sequencing and assembly

The genome of strain WSM3556 was sequenced at the DOE Joint Genome Institute using state of the art technology [18]. For this genome, an Illumina standard shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform, which generated 9,394,768 reads totalling 2,818.4 Mbp of Illumina data. All general aspects of library construction and sequencing performed at the JGI can be found on the JGI web site [16]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, Copeland A, Han J. unpublished). The following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet, version 1.1.04 [19], (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim [20], (3) Illumina reads were assembled with simulated read pairs using Allpaths (version r37348) [21]. Parameters for assembly steps were: 1) Velvet -exp_cov 90 -cov_cutoff 20 -exportFiltered yes -very_clean yes), 2) wgsim (−e 0–1 76–2 76 -r 0 -R 0 -X 0 -d 3000 -s 300 -N 1266735), 3) Allpaths–LG (PrepareAllpathsInputs: PHRED_64 = 1 PLOIDY = 1 JUMP_COVERAGE = 25 FRAG_COVERAGE = 125, RunAllpathsLG: RUN = 125std + 25xfakedpairs TARGETS = standard VAPI_WARN_ONLY = True OVERWRITE = True). The final draft assembly contained 141 contigs in 140 scaffolds. The total size of the genome is 7.7 Mbp and the final assembly is based on 2,818.4 Mbp of Illumina draft data, which provides an average of 367x coverage of the genome.

Genome annotation

Genes were identified using Prodigal [22], as part of the DOE-JGI genome annotation pipeline [23, 24] followed by a round of manual curation using GenePRIMP [25] for finished genomes and Draft genomes in fewer than 10 scaffolds. The predicted CDSs were translated and used to search the NCBI non-redundant database, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool [26] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [27]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [28]. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes-Expert Review system [29] developed by the Joint Genome Institute, Walnut Creek, CA, USA.

Genome properties

The genome is 7,679,067 nucleotides with 61.77 % GC content (Table 3) and comprised of 140 scaffolds and 141 contigs. From a total of 7,123 genes, 7,059 were protein encoding and 64 RNA only encoding genes. The majority of genes (76.25 %) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COG functional categories is presented in Table 4.
Table 3

Genome statistics for Burkholderia dilworthii strain WSM3556T

AttributeValue% of total
Genome size (bp)7,679,067100.00
DNA coding (bp)6,485,06384.45
DNA G + C (bp)4,743,59861.77
DNA scaffolds140100.00
Total genes7,123100.00
Protein-coding genes7,05999.10
RNA genes640.90
Pseudo genes00.00
Genes in internal clusters4265.98
Genes with function prediction5,43176.25
Genes assigned to COGs4,70466.04
Genes with Pfam domains5,73080.44
Genes with signal peptides6429.01
Genes with transmembrane helices1,58522.25
CRISPR repeats00
Table 4

Number of genes associated with general COG functional categories

CodeValue% ageCOG category
J1863.50Translation, ribosomal structure and biogenesis
A10.02RNA processing and modification
K5289.94Transcription
L1833.44Replication, recombination and repair
B10.02Chromatin structure and dynamics
D340.64Cell cycle control, Cell division, chromosome partitioning
V500.94Defense mechanisms
T2354.42Signal transduction mechanisms
M3105.83Cell wall/membrane/envelope biogenesis
N921.73Cell motility
U1332.50Intracellular trafficking, secretion, and vesicular transport
O1592.99Posttranslational modification, protein turnover, chaperones
C3626.81Energy production and conversion
G4458.38Carbohydrate transport and metabolism
E58110.94Amino acid transport and metabolism
F891.68Nucleotide transport and metabolism
H1953.67Coenzyme transport and metabolism
I2554.80Lipid transport and metabolism
P2624.93Inorganic ion transport and metabolism
Q1793.37Secondary metabolite biosynthesis, transport and catabolism
R60011.29General function prediction only
S4318.11Function unknown
-241933.96Not in COGS

The total is based on the total number of protein coding genes in the genome

Genome statistics for Burkholderia dilworthii strain WSM3556T Number of genes associated with general COG functional categories The total is based on the total number of protein coding genes in the genome

Conclusion

WSM3556 belongs to a group of Beta-rhizobia isolated from from the fynbos biome in South Africa [3]. WSM3556 is phylogeneticaly most closely related to WSM3937 and PsJNT. Of these strains only WSM3556 and WSM3937 are legume microsymbionts. Out of 13 strains that are known legume microsymbionts, only four (WSM3556, WSM4176, WSM5005, STM678T) nodulate South African papilionoid species. A comparison of these nodulating strains reveals that WSM3556 has the smallest genome (7.7 Mbp), the smallest KOG count (1295) and the lowest GC (61.77 %) percentage in this group. These four genomes share the nitrogenase-RXN MetaCyc pathway catalyzed by a multiprotein nitrogenase complex. Strains WSM3556, WSM4176, WSM5005 [30] have been shown to fix nitrogen with provenances with varying degrees of effectiveness. WSM3556 is partially effective on two out of three L. ambigua provenances, WSM4176 is partially effective on only one L. ambigua provenance and WSM5005 is effective on all three L. ambigua provenances. The genome sequences of these fynbos bacteria provides an unprecedented opportunity to reveal the genetic determinants required for effective nitrogen fixation with .
  27 in total

1.  GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes.

Authors:  Amrita Pati; Natalia N Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Sean D Hooper; Athanasios Lykidis; Nikos C Kyrpides
Journal:  Nat Methods       Date:  2010-05-02       Impact factor: 28.547

2.  List of new names and new combinations previously effectively, but not validly, published.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2006-01       Impact factor: 2.747

3.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

4.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

5.  Burkholderia rhynchosiae sp. nov., isolated from Rhynchosia ferulifolia root nodules.

Authors:  Sofie E De Meyer; Margo Cnockaert; Julie K Ardley; Robert D Trengove; Giovanni Garau; John G Howieson; Peter Vandamme
Journal:  Int J Syst Evol Microbiol       Date:  2013-05-24       Impact factor: 2.747

6.  Burkholderia spp. are the most competitive symbionts of Mimosa, particularly under N-limited conditions.

Authors:  Geoffrey N Elliott; Jui-Hsing Chou; Wen-Ming Chen; Guido V Bloemberg; Cyril Bontemps; Esperanza Martínez-Romero; Encarna Velázquez; J Peter W Young; Janet I Sprent; Euan K James
Journal:  Environ Microbiol       Date:  2008-11-10       Impact factor: 5.491

7.  Burkholderia dilworthii sp. nov., isolated from Lebeckia ambigua root nodules.

Authors:  Sofie E De Meyer; Margo Cnockaert; Julie K Ardley; Ben-Erik Van Wyk; Peter A Vandamme; John G Howieson
Journal:  Int J Syst Evol Microbiol       Date:  2013-12-24       Impact factor: 2.747

8.  The Genomic Standards Consortium.

Authors:  Dawn Field; Linda Amaral-Zettler; Guy Cochrane; James R Cole; Peter Dawyndt; George M Garrity; Jack Gilbert; Frank Oliver Glöckner; Lynette Hirschman; Ilene Karsch-Mizrachi; Hans-Peter Klenk; Rob Knight; Renzo Kottmann; Nikos Kyrpides; Folker Meyer; Inigo San Gil; Susanna-Assunta Sansone; Lynn M Schriml; Peter Sterk; Tatiana Tatusova; David W Ussery; Owen White; John Wooley
Journal:  PLoS Biol       Date:  2011-06-21       Impact factor: 8.029

9.  Genome sequence of the Lebeckia ambigua-nodulating "Burkholderia sprentiae" strain WSM5005(T.).

Authors:  Wayne Reeve; Sofie De Meyer; Jason Terpolilli; Vanessa Melino; Julie Ardley; Tian Rui; Ravi Tiwari; John Howieson; Ron Yates; Graham O'Hara; Megan Lu; David Bruce; Chris Detter; Roxanne Tapia; Cliff Han; Chia-Lin Wei; Marcel Huntemann; James Han; I-Min Chen; Konstantinos Mavromatis; Victor Markowitz; Ernest Szeto; Natalia Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Ioanna Pagani; Amrita Pati; Lynne Goodwin; Lin Peters; Sam Pitluck; Tanja Woyke; Nikos Kyrpides
Journal:  Stand Genomic Sci       Date:  2013-12-15

10.  Improving microbial genome annotations in an integrated database context.

Authors:  I-Min A Chen; Victor M Markowitz; Ken Chu; Iain Anderson; Konstantinos Mavromatis; Nikos C Kyrpides; Natalia N Ivanova
Journal:  PLoS One       Date:  2013-02-12       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.