| Literature DB >> 26388968 |
Sofie E De Meyer1, Rui Tian1, Rekha Seshadri2, Natalia Ivanova2, Amrita Pati2, Victor Markowitz3, Tanja Woyke2, Ron Yates4, John Howieson1, Nikos Kyrpides5, Wayne Reeve1.
Abstract
Burkholderia dilworthii strain WSM3556(T) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective N2-fixing root nodule of Lebeckia ambigua collected near Grotto Bay Nature Reserve, in the Western Cape of South Africa, in October 2004. This plant persists in infertile and deep sandy soils with acidic pH, and is therefore an ideal candidate for a perennial based agriculture system in Western Australia. WSM3556(T) thus represents a potential inoculant quality strain for L. ambigua for which we describe the general features, together with genome sequence and annotation. The 7,679,067 bp high-quality permanent draft genome is arranged in 140 scaffolds of 141 contigs, contains 7,059 protein-coding genes and 64 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.Entities:
Keywords: Betaproteobacteria; GEBA-RNB; Lebeckia; Nitrogen fixation; Root-nodule bacteria; South Africa
Year: 2015 PMID: 26388968 PMCID: PMC4575442 DOI: 10.1186/s40793-015-0048-3
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Images of Burkholderia dilworthii strain WSM3556T using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right)
Fig. 2Phylogenetic tree highlighting the position of Burkholderia dilworthii strain WSM3556T (shown in blue print), relative to other strains in the Burkholderia genus using a 1,322 bp internal region of the 16S rRNA gene. Cupriavidus taiwanensis LMG 19424T was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [31]. The tree was build using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [14] are in bold print and the GOLD ID is provided after the NCBI accession number. Published genomes are designated with an asterisk
Classification and general features of Burkholderia dilworthii WSM3556T in accordance with the MIGS recommendations [32] published by the Genome Standards Consortium [33]
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| (Type) strain: WSM3556T | TAS [ | ||
| Gram stain | Negative | TAS [ | |
| Cell shape | Rod | TAS [ | |
| Motility | Motile | TAS [ | |
| Sporulation | Non-sporulating | TAS [ | |
| Temperature range | 15–37 °C | TAS [ | |
| Optimum temperature | 28 °C | TAS [ | |
| pH range; Optimum | 5.5–8; 7 | TAS [ | |
| Carbon source | Large range | TAS [ | |
| MIGS-6 | Habitat | Soil, root nodule on host | IDA |
| MIGS-6.3 | Salinity | 0–10 % | TAS [ |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Free living, symbiotic | IDA |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| MIGS-4 | Geographic location | South Africa | TAS [ |
| MIGS-5 | Sample collection | 2004 | TAS [ |
| MIGS-4.1 MIGS-4.2 | Longitude | 18.44 | TAS [ |
| Latitude | −33.49 | TAS [ | |
| MIGS-4.4 | Altitude | 237 | IDA |
Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [41]
Genome sequencing project information for Burkholderia dilworthii WSM3556T
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality-permanent-draft |
| MIGS-28 | Libraries used | Illumina Std |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Fold coverage | 367 × Illumina |
| MIGS-30 | Assemblers | Velvet 1.1.04, ALLPATHS V.r37348 |
| MIGS-32 | Gene calling methods | Prodigal 1.4 |
| Locus Tag | F759 | |
| Genbank ID | AWZT00000000 | |
| Genbank Date of Release | December 12, 2013 | |
| GOLD ID | Gp0010131 | |
| BIOPROJECT | PRJNA182743 | |
| MIGS-13 | Source Material Identifier | WSM3556, LMG 27173, HAMBI3353 |
| Project relevance | Symbiotic N2fixation, agriculture |
Genome statistics for Burkholderia dilworthii strain WSM3556T
| Attribute | Value | % of total |
|---|---|---|
| Genome size (bp) | 7,679,067 | 100.00 |
| DNA coding (bp) | 6,485,063 | 84.45 |
| DNA G + C (bp) | 4,743,598 | 61.77 |
| DNA scaffolds | 140 | 100.00 |
| Total genes | 7,123 | 100.00 |
| Protein-coding genes | 7,059 | 99.10 |
| RNA genes | 64 | 0.90 |
| Pseudo genes | 0 | 0.00 |
| Genes in internal clusters | 426 | 5.98 |
| Genes with function prediction | 5,431 | 76.25 |
| Genes assigned to COGs | 4,704 | 66.04 |
| Genes with Pfam domains | 5,730 | 80.44 |
| Genes with signal peptides | 642 | 9.01 |
| Genes with transmembrane helices | 1,585 | 22.25 |
| CRISPR repeats | 0 | 0 |
Number of genes associated with general COG functional categories
| Code | Value | % age | COG category |
|---|---|---|---|
| J | 186 | 3.50 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.02 | RNA processing and modification |
| K | 528 | 9.94 | Transcription |
| L | 183 | 3.44 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 34 | 0.64 | Cell cycle control, Cell division, chromosome partitioning |
| V | 50 | 0.94 | Defense mechanisms |
| T | 235 | 4.42 | Signal transduction mechanisms |
| M | 310 | 5.83 | Cell wall/membrane/envelope biogenesis |
| N | 92 | 1.73 | Cell motility |
| U | 133 | 2.50 | Intracellular trafficking, secretion, and vesicular transport |
| O | 159 | 2.99 | Posttranslational modification, protein turnover, chaperones |
| C | 362 | 6.81 | Energy production and conversion |
| G | 445 | 8.38 | Carbohydrate transport and metabolism |
| E | 581 | 10.94 | Amino acid transport and metabolism |
| F | 89 | 1.68 | Nucleotide transport and metabolism |
| H | 195 | 3.67 | Coenzyme transport and metabolism |
| I | 255 | 4.80 | Lipid transport and metabolism |
| P | 262 | 4.93 | Inorganic ion transport and metabolism |
| Q | 179 | 3.37 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 600 | 11.29 | General function prediction only |
| S | 431 | 8.11 | Function unknown |
| - | 2419 | 33.96 | Not in COGS |
The total is based on the total number of protein coding genes in the genome