| Literature DB >> 26478785 |
Sofie E De Meyer1, Rui Tian1, Rekha Seshadri2, Tbk Reddy2, Victor Markowitz3, Natalia Ivanova2, Amrita Pati2, Tanja Woyke2, Nikos Kyrpides4, Ron Yates5, John Howieson1, Wayne Reeve1.
Abstract
Burkholderia sp. strain WSM4176 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective N2-fixing root nodule of Lebeckia ambigua collected in Nieuwoudtville, Western Cape of South Africa, in October 2007. This plant persists in infertile, acidic and deep sandy soils, and is therefore an ideal candidate for a perennial based agriculture system in Western Australia. Here we describe the features of Burkholderia sp. strain WSM4176, which represents a potential inoculant quality strain for L. ambigua, together with sequence and annotation. The 9,065,247 bp high-quality-draft genome is arranged in 13 scaffolds of 65 contigs, contains 8369 protein-coding genes and 128 RNA-only encoding genes, and is part of the GEBA-RNB project proposal (Project ID 882).Entities:
Keywords: Betaproteobacteria; GEBA-RNB; Nitrogen fixation; Rhizobia; Root-nodule bacteria
Year: 2015 PMID: 26478785 PMCID: PMC4609093 DOI: 10.1186/s40793-015-0072-3
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Images of Burkholderia sp. strain WSM4176 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right)
Fig. 2Phylogenetic tree highlighting the position of Burkholderia sp. strain WSM4176 (shown in blue print) relative to other type and non-type strains in the Burkholderia genus (1322 bp internal region). Cupriavidus taiwanensis LMG 19424T was used as outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [27]. The tree was build using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [9] are in bold print and the GOLD ID is mentioned after the NCBI accession number. Published genomes are designated with an asterisk
Classification and general features of Burkholderia sp. strain WSM4176 in accordance with the MIGS recommendations [28] published by the Genome Standards Consortium [29]
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| (Type) strain WSM4176 | TAS [ | ||
| Gram stain | Negative | IDA [ | |
| Cell shape | Rod | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Non-sporulating | IDA [ | |
| Temperature range | Not reported | ||
| Optimum temperature | 28 °C | IDA | |
| pH range; Optimum | Not reported | ||
| Carbon source | Not reported | ||
| MIGS-6 | Habitat | Soil, root nodule on host | TAS [ |
| MIGS-6.3 | Salinity | Not reported | |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Free living, symbiotic | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| MIGS-4 | Geographic location | South Africa | TAS [ |
| MIGS-5 | Sample collection | 2007 | TAS [ |
| MIGS-4.1 | Latitude | −31.381 | TAS [ |
| MIGS-4.2 | Longitude | 19.30 | TAS [ |
| MIGS-4.4 | Altitude | 789 m | IDA |
Evidence codes – IDA inferred from direct assay, TAS traceable author statement (i.e., a direct report exists in the literature), NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [37]
Genome sequencing project information for Burkholderia sp. strain WSM4176
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality-permanent-draft |
| MIGS-28 | Libraries used | Illumina CLIP PE and Illumina Std PE Unamplified |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Fold coverage | 361 × Illumina |
| MIGS-30 | Assemblers | ALLPATHS V.r41554 |
| MIGS-32 | Gene calling methods | Prodigal 1.4, GenePRIMP |
| Locus Tag | B014 | |
| Genbank ID | ARCY00000000 | |
| Genbank Date of Release | July 11, 2014 | |
| GOLD ID | Gi08873 | |
| BIOPROJECT | PRJNA169686 | |
| MIGS-13 | Source Material Identifier | WSM4176 |
| Project relevance | Symbiotic N2fixation, agriculture |
Genome statistics for Burkholderia sp. strain WSM4176
| Attribute | Value | % of total |
|---|---|---|
| Genome size (bp) | 9,065,247 | 100.00 |
| DNA coding (bp) | 7,632,174 | 84.19 |
| DNA G+C (bp) | 5,701,432 | 62.89 |
| DNA scaffolds | 13 | |
| Total genes | 8497 | 100.00 |
| Protein-coding genes | 8369 | 98.49 |
| RNA genes | 128 | 1.51 |
| Pseudo genes | 0 | 0.00 |
| Genes in internal clusters | 648 | 7.63 |
| Genes with function prediction | 6412 | 75.46 |
| Genes assigned to COGs | 5491 | 64.62 |
| Genes with Pfam domains | 6766 | 79.63 |
| Genes with signal peptides | 738 | 8.69 |
| Genes with transmembrane helices | 1865 | 21.95 |
| CRISPR repeats | 0 | 0.00 |
Fig. 3Graphical map of the genome of Burkholderia sp. strain WSM4176. First four large scaffolds are shown according to size. From the bottom to the top of each scaffold: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew
Number of protein coding genes of Burkholderia sp. strain WSM4176 associated with the general COG functional categories
| Code | Value | % age | COG category |
|---|---|---|---|
| J | 200 | 3.21 | Translation |
| A | 1 | 0.02 | RNA processing and modification |
| K | 596 | 9.55 | Transcription |
| L | 299 | 4.79 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 38 | 0.61 | Cell cycle control, mitosis and meiosis |
| V | 74 | 1.19 | Defense mechanisms |
| T | 270 | 4.33 | Signal transduction mechanisms |
| M | 389 | 6.23 | Cell wall/membrane biogenesis |
| N | 105 | 1.68 | Cell motility |
| U | 146 | 2.34 | Intracellular trafficking and secretion |
| O | 172 | 2.76 | Posttranslational modification, protein turnover, chaperones |
| C | 461 | 7.39 | Energy production conversion |
| G | 495 | 7.93 | Carbohydrate transport and metabolism |
| E | 611 | 9.79 | Amino acid transport metabolism |
| F | 101 | 1.62 | Nucleotide transport and metabolism |
| H | 210 | 3.37 | Coenzyme transport and metabolism |
| I | 323 | 5.18 | Lipid transport and metabolism |
| P | 317 | 5.08 | Inorganic ion transport and metabolism |
| Q | 225 | 3.61 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 727 | 11.65 | General function prediction only |
| S | 479 | 7.68 | Function unknown |
| - | 3006 | 35.38 | Not in COGS |
The total is based on the total number of protein coding genes in the genome