| Literature DB >> 25197442 |
Robert Walker1, Elizabeth Watkin1, Rui Tian2, Lambert Bräu3, Graham O'Hara2, Lynne Goodwin4, James Han5, Tatiparthi Reddy5, Marcel Huntemann5, Amrita Pati5, Tanja Woyke5, Konstantinos Mavromatis5, Victor Markowitz6, Natalia Ivanova5, Nikos Kyrpides5, Wayne Reeve2.
Abstract
Burkholderia sp. strain WSM2232 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod that was trapped in 2001 from acidic soil collected from Karijini National Park (Australia) using Gastrolobium capitatum as a host. WSM2232 was effective in nitrogen fixation with G. capitatum but subsequently lost symbiotic competence during long-term storage. Here we describe the features of Burkholderia sp. strain WSM2232, together with genome sequence information and its annotation. The 7,208,311 bp standard-draft genome is arranged into 72 scaffolds of 72 contigs containing 6,322 protein-coding genes and 61 RNA-only encoding genes. The loss of symbiotic capability can now be attributed to the loss of nodulation and nitrogen fixation genes from the genome. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.Entities:
Keywords: Betaproteobacteria; nitrogen fixation; rhizobia; root-nodule bacteria
Year: 2013 PMID: 25197442 PMCID: PMC4148988 DOI: 10.4056/sigs.5018795
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Images of strain WSM2232using scanning (A) and transmission (B) electron microscopy.
Classification and general features of strain WSM2232 according to the MIGS recommendations [19]
| | | ||
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | IDA | ||
| Strain WSM2232 | IDA | ||
| Gram stain | Negative | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Non-sporulating | NAS | |
| Temperature range | Mesophile | IDA | |
| Optimum temperature | 30°C | IDA | |
| Salinity | Non-halophile | IDA | |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| Carbon source | Varied | IDA | |
| Energy source | Chemoorganotroph | NAS | |
| MIGS-6 | Habitat | Soil, root nodule, on host | IDA |
| MIGS-15 | Biotic relationship | Free living, symbiotic | IDA |
| MIGS-14 | Pathogenicity | Non-pathogenic | IDA |
| Biosafety level | 1 | TAS | |
| Isolation | Root nodule of | IDA | |
| MIGS-4 | Geographic location | Karijini National Park, Australia | IDA |
| MIGS-5 | Soil collection date | September, 2001 | IDA |
| MIGS-4.1 | Latitude | 117.99 | IDA |
| MIGS-4.3 | Depth | 0-10 cm | IDA |
| MIGS-4.4 | Altitude | Not recorded | IDA |
Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [28].
Figure 2Phylogenetic tree showing the relationship of strain WSM2232 (shown in bold print) to other members of the order based on aligned sequences of the 16S rRNA gene (1,242 bp internal region). All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA [29], version 5. The tree was built using the Maximum-Likelihood method with the General Time Reversible model [30]. Bootstrap analysis [31] with 500 replicates was performed to assess the support for the clusters. Type strains are indicated with a superscript T. Brackets after the strain name contain a DNA database accession number and/or a GOLD ID (beginning with the prefix G) for a sequencing project registered in GOLD [32]. Published genomes are indicated with an asterisk.
Compatibility of WSM2232 with nine legume species for nodulation (Nod) and N2-Fixation (Fix).
| | | | |||
|---|---|---|---|---|---|
| Bitter Pea | Perennial | +1 | +1 | IDA | |
| -2 | -2 | IDA | |||
| Coral Vine | Perennial | - | - | IDA | |
| Sturts Desert Pea | Annual | - | - | IDA | |
| - | Annual | - | - | IDA | |
| Shaggy Pea | Perennial | - | - | IDA | |
| Jam Wattle | Perennial | - | - | IDA | |
| Weeping Mulga | Perennial | - | - | IDA | |
| - | Perennial | - | - | IDA | |
| Sensitive Plant | Perennial | - | - | IDA |
1 result obtained from trapping experiment
2 authentication result following long-term storage
IDA: Inferred from Direct Assay from http://www.geneontology.org/GO.evidence.shtml of the Gene Ontology project [28].
Reduction of tetrazolium dye by NADH produced by respiring cells of WSM2232 in the Biolog Phenotype Microarray
| | | ||||
|---|---|---|---|---|---|
| L-Arabinose | + | Chondroitin Sulfate C | - | Ammonia | + |
| N-Acetyl-D Glucosamine | + | α-Cyclodextrin | - | Nitrite | + |
| D-Saccharic Acid | + | β-Cyclodextrin | - | Nitrate | + |
| Succinic Acid | + | γ-Cyclodextrin | - | Urea | + |
| D-Galactose | + | Dextrin | + | Biuret | - |
| L-Aspartic Acid | + | Gelatin | - | L-Alanine | + |
| L-Proline | + | Glycogen | - | L-Arginine | + |
| D-Alanine | + | Inulin | - | L-Asparagine | + |
| D-Trehalose | + | Laminarin | - | L-Aspartic Acid | + |
| D-Mannose | + | Mannan | - | L-Cysteine | + |
| Dulcitol | + | Pectin | - | L-Glutamic Acid | + |
| D-Serine | - | N-Acetyl-D-Galactosamine | + | L-Glutamine | + |
| D-Sorbitol | + | N-Acetyl-Neuraminic Acid | - | Glycine | + |
| Glycerol | + | β-D-Allose | - | L-Histidine | + |
| L-Fucose | + | Amygdalin | - | L-Isoleucine | + |
| D-Glucuronic Acid | + | D-Arabinose | + | L-Leucine | + |
| D-Gluconic Acid | + | D-Arabitol | + | L-Lysine | + |
| D,L-α-Glycerol-Phosphate | + | L-Arabitol | + | L-Methionine | + |
| D-Xylose | + | Arbutin | - | L-Phenylalanine | + |
| L-Lactic Acid | + | 2-Deoxy-D-Ribose | + | L-Proline | + |
| Formic Acid | + | I-Erythritol | - | L-Serine | + |
| D-Mannitol | + | D-Fucose | + | L-Threonine | + |
| L-Glutamic Acid | + | 3-0-β-D-Galacto-pyranosyl-DArabinose | - | L-Tryptophan | + |
| D-Glucose-6-Phosphate | + | Gentiobiose | - | L-Tyrosine | + |
| D-Galactonic Acid-γ-Lactone | + | L-Glucose | - | L-Valine | + |
| D,L-Malic Acid | + | Lactitol | - | D-Alanine | + |
| D-Ribose | + | D-Melezitose | - | D-Asparagine | + |
| Tween 20 | + | Maltitol | - | D-Aspartic Acid | + |
| L-Rhamnose | + | α-Methyl-D-Glucoside | - | D-Glutamic Acid | + |
| D-Fructose | + | β-Methyl-D-Galactoside | + | D-Lysine | + |
| Acetic Acid | + | 3-Methyl Glucose | - | D-Serine | + |
| α-D-Glucose | + | β-Methyl-D-Glucuronic Acid | - | D-Valine | + |
| Maltose | - | α-Methyl-D-Mannoside | - | L-Citrulline | + |
| D-Melibiose | - | β-Methyl-D-Xyloside | - | L-Homoserine | + |
| Thymidine | - | Palatinose | - | L-Ornithine | + |
| L-Asparagine | + | D-Raffinose | - | N-Acetyl-D,L-Glutamic Acid | + |
| D-Aspartic Acid | - | Salicin | - | N-Phthaloyl-L-Glutamic Acid | - |
| D-Glucosaminic Acid | + | Sedoheptulosan | - | L-Pyroglutamic Acid | + |
| 1,2-Propanediol | - | L-Sorbose | - | Hydroxylamine | + |
| Tween 40 | + | Stachyose | - | Methylamine | + |
| α-Keto-Glutaric Acid | + | D-Tagatose | + | N-Amylamine | + |
| α-Keto-Butyric Acid | + | Turanose | + | N-Butylamine | + |
| α-Methyl-D-Galactoside | - | Xylitol | + | Ethylamine | - |
| α-D-Lactose | - | N-Acetyl-D-Glucosaminitol | + | Ethanolamine | + |
| Lactulose | + | γ-Amino Butyric Acid | + | Ethylenediamine | - |
| Sucrose | - | δ-Amino Valeric Acid | + | Putrescine | + |
| Uridine | + | Butyric Acid | + | Agmatine | - |
| L-Glutamine | + | Capric Acid | - | Histamine | - |
| M-Tartaric Acid | + | Caproic Acid | + | β-Phenylethylamine | + |
| D-Glucose-1-Phosphate | + | Citraconic Acid | + | Tyramine | - |
| D-Fructose-6-Phosphate | + | Citramalic Acid | + | Acetamide | + |
| Tween 80 | + | D-Glucosamine | + | Formamide | + |
| α-Hydroxy Glutaric Acid-γ-Lactone | - | 2-Hydroxy Benzoic Acid | - | Glucuronamide | + |
| α-Hydroxy Butyric Acid | + | 4-Hydroxy Benzoic Acid | + | D,L-Lactamide | + |
| β-Methyl-D-Glucoside | - | β-Hydroxy Butyric Acid | + | D-Glucosamine | + |
| Adonitol | + | γ-Hydroxy Butyric Acid | + | DGalactosamine | + |
| Maltotriose | - | α-Keto Valeric Acid | - | DMannosamine | + |
| 2-Deoxy Adenosine | - | Itaconic Acid | - | N-Acetyl-D-Glucosamine | + |
| Adenosine | + | 5-Keto-D-Gluconic Acid | - | N-Acetyl-D-Galactosamine | - |
| Glycy-L-Aspartic Acid | + | D-Lactic Acid Methyl Ester | + | N-Acetyl-D-Mannosamine | - |
| Citric Acid | + | Malonic Acid | + | Adenine | + |
| M-Inositol | + | Melibionic Acid | + | Adenosine | + |
| D-Threonine | - | Oxalic Acid | + | Cytidine | + |
| Fumaric Acid | + | Oxalomalic Acid | + | Cytosine | + |
| Bromo Succinic Acid | + | Quinic Acid | + | Guanine | - |
| Propionic Acid | + | D-Ribono-1,4-Lactone | - | Guanosine | + |
| Mucic Acid | + | Sebacic Acid | + | Thymine | + |
| Glycolic Acid | - | Sorbic Acid | + | Thymidine | - |
| Glyoxylic Acid | + | Succinamic Acid | + | Uracil | + |
| D-Cellobiose | - | D-Tartaric Acid | + | Uridine | + |
| Inosine | + | L-Tartari c Acid | + | Inosine | + |
| Glycyl-L-Glutamic Acid | + | Acetamide | - | Xanthine | + |
| Tricarballylic Acid | + | L-Alaninamide | + | Xanthosine | + |
| L-Serine | + | N-Acetyl-L-Glutamic Acid | + | Uric Acid | + |
| L-Threonine | + | L-Arginine | + | Alloxan | + |
| L-Alanine | + | Glycine | - | Allantoin | + |
| L-Allnyl-Glycine | + | L-Histidine | + | Parabanic Acid | + |
| Acetoacetic Acid | + | L-Homoserine | + | D,L-α-Amino-N-Butyric Acid | + |
| N-Acetyl-β-D-Mannosamine | - | Hydroxy-L-Proline | + | γ-Amino-N-Butyric Acid | + |
| Mono Methyl Succinate | + | L-Isoleucine | + | ε-Amino-N-Caproic Acid | - |
| Methyl Pyruvate | + | L-Leucine | + | D,L-α-Amino-Caprylic Acid | - |
| D-Malic Acid | + | L-Lysine | + | δ-Amino-N-Valeric Acid | + |
| L-Malic Acid | + | L-Methionine | - | α-Amino-N-Valeric Acid | + |
| Glycyl-L-Proline | + | L-Ornithine | + | Ala-Asp | + |
| p-Hydroxy Phenyl Acetic Acid | + | L-Phenylalanine | + | Ala-Gln | + |
| m-Hydroxy Phenyl Acetic Acid | - | L-Pyroglutamic Acid | + | Ala-Glu | + |
| Tyramine | - | L-Valine | + | Ala-Gly | + |
| D-Psicose | - | D,L-Carnitine | + | Ala-His | + |
| L-Lyxose | + | Sec-Butylamine | - | Ala-Leu | + |
| Glucuronamide | - | D,L-Octopamine | - | Ala-Thr | + |
| Pyruvic Acid | + | Putrescine | - | Gly-Asn | + |
| L-Galactonic Acid-γ-Lactone | + | Dihydroxy Acetone | - | Gly-Gln | + |
| D-Galacturonic Acid | + | 2,3-Butanediol | + | Gly-Glu | + |
| Phenylethylamine | + | 2,3-Butanone | + | Gly-Met | + |
| 2-Aminoethanol | + | 3-Hydrox y-2-Butanone | - | Met-Ala | + |
Genome sequencing project information for WSM2232.
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Standard draft |
| MIGS-28 | Libraries used | One Illumina fragment library |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Sequencing coverage | Illumina: 255× |
| MIGS-30 | Assemblers | Velvet version 1.1.04; Allpaths-LG version r37348 |
| MIGS-32 | Gene calling methods | Prodigal 1.4 |
| GOLD ID | Gi08832a | |
| NCBI project ID | 182741 | |
| Database: IMG | 2508501125b | |
| Project relevance | Symbiotic N2 fixation, agriculture |
Genome Statistics for strain WSM2232
| | | ||||
|---|---|---|---|---|---|
| Genome size (bp) | 7,208,311 | 100.00 | |||
| DNA coding region (bp) | 6,203,174 | 86.06 | |||
| DNA G+C content (bp) | 4,548,885 | 63.11 | |||
| Number of scaffolds | 72 | ||||
| Number of contigs | 72 | ||||
| Total gene | 6,383 | 100.00 | |||
| RNA genes | 61 | 0.96 | |||
| rRNA operons* | 1 | 0.02 | |||
| Protein-coding genes | 6,322 | 99.04 | |||
| Genes with function prediction | 5,164 | 80.90 | |||
| Genes assigned to COGs | 5,151 | 80.70 | |||
| Genes assigned Pfam domains | 5,425 | 84.99 | |||
| Genes with signal peptides | 645 | 10.10 | |||
| Genes with transmembrane helices | 1,497 | 23.45 | |||
| CRISPR repeats | 1 | ||||
*4 copies of 5S, 2 copies of 16S and 1 copy of 23S rRNA.
Figure 3Graphical map of the four largest scaffolds genome for the genome of strain WSM2232. From bottom to the top of each scaffold: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew.
Number of protein coding genes of strain WSM2232 associated with the general COG functional categories.
| | |||
|---|---|---|---|
| J | 474 | 8.15 | Carbohydrate transport and metabolism |
| A | 3 | 0.05 | RNA processing and modification |
| K | 151 | 2.60 | Replication, recombination and repair |
| L | 559 | 9.61 | Transcription |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 42 | 0.72 | Cell cycle control, cell division and chromosome partioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 0 | 0.0 | Defense mechcanism |
| T | 318 | 5.47 | Signal transduction mechanisms |
| M | 371 | 6.38 | Cell wall/membrane/envelope biogenesis |
| N | 125 | 2.15 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 2 | 0.03 | Extracellular structures |
| U | 154 | 2.65 | Intracellular trafficking, secretion, and vesicular transport |
| O | 183 | 3.15 | Posttranslational modification, protein turnover, chaperones |
| C | 384 | 6.60 | Energy production conversion |
| G | 194 | 3.34 | Translation, ribosomal structure and biogenesis |
| E | 569 | 9.79 | Amino acid transport and metabolism |
| F | 100 | 1.72 | Nucleotide transport and metabolism |
| H | 213 | 3.66 | Coenzyme transport and metabolism |
| I | 277 | 4.76 | Lipid transport and metabolism |
| P | 269 | 4.63 | Inorganic ion transport and metabolism |
| Q | 199 | 3.42 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 673 | 11.58 | General function prediction only |
| S | 500 | 8.60 | Function unknown |
| - | 1,232 | 19.30 | Not in COGs |