| Literature DB >> 26629309 |
Yanling Pan1, Ka Fai Kong1, Jimmy S H Tsang1.
Abstract
Burkholderia caribensis MBA4 was isolated from soil for its capability to grow on haloacids. This bacterium has a genome size of 9,482,704 bp. Here we report the genome sequences and annotation, together with characteristics of the genome. The complete genome sequence consists of three replicons, comprising 9056 protein-coding genes and 80 RNA genes. Genes responsible for dehalogenation and uptake of haloacids were arranged as an operon. While dehalogenation of haloacetate would produce glycolate, three glycolate operons were identified. Two of these operons contain an upstream glcC regulator gene. It is likely that the expression of one of these operons is responsive to haloacetate. Genes responsible for the metabolism of dehalogenation product of halopropionate were also identified.Entities:
Keywords: Burkholderia caribensis; Dehalogenase; Genome sequence; Glycolate operon; Haloacid degradation
Year: 2015 PMID: 26629309 PMCID: PMC4665904 DOI: 10.1186/s40793-015-0109-7
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Micrograph of Burkholderia caribensis MBA4
Classification and general features of Burkholderia caribensis MBA4 according to MIGS recommendations [21]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain: | IDA | ||
| Gram stain |
| IDA | |
| Cell shape |
| IDA | |
| Motility |
| IDA | |
| Sporulation |
| IDA | |
| Temperature range |
| IDA | |
| Optimum temperature |
| IDA | |
| pH range; Optimum |
| IDA | |
| Carbon source |
| IDA | |
| MIGS-6 | Habitat |
| IDA |
| MIGS-6.3 | Salinity |
| IDA |
| MIGS-22 | Oxygen requirement |
| IDA |
| MIGS-15 | Biotic relationship |
| IDA |
| MIGS-14 | Pathogenicity |
| IDA |
| MIGS-4 | Geographic location |
| IDA |
| MIGS-5 | Sample collection |
| IDA |
| MIGS-4.1 | Latitude |
| IDA |
| MIGS-4.2 | Longitude |
| IDA |
| MIGS-4.4 | Altitude |
| IDA |
a Evidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [41]
Fig. 2Phylogenetic tree highlighting the relative position of B. caribensis MBA4 in the Burkholderia genus. The phylogenetic tree was constructed with MEGA6 [34] based on analysis of 16S rDNA sequences. The evolutionary distances were computed using the Maximum Composite Likelihood method [35] and are in the units of the number of base substitutions per site. Numbers at nodes are bootstrap values inferred from 500 replicates. The GenBank accession number and the bacterial species are illustrated
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Four Illumina paired-end libraries, one 454 library, one PacBio 10–20 kb library |
| MIGS 29 | Sequencing platforms | Illumina HisSeq 2000, 454 GS FLX Titanium and PacBio RS II |
| MIGS 31.2 | Fold coverage | 850× |
| MIGS 30 | Assemblers | GLC Genomic Workbench 6.0.1, SMRT Analysis v2.3.0 HGAP.2 |
| MIGS 32 | Gene calling method | RAST and PGAAP |
| Locus Tag | K788 | |
| GenBank ID | CP012746, CP012747, CP012748 | |
| GenBank Date of Release | November, 2015 | |
| GOLD ID | Ga0082378 | |
| BIOPROJECT | PRJNA197459 | |
| MIGS 13 | Source Material Identifier | MBA4 |
| Project relevance | Biotechnological, environmental |
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 9,482,704 | 100.00 |
| DNA coding (bp) | 8,209,808 | 86.58 |
| DNA G + C (bp) | 5,922,869 | 62.46 |
| DNA scaffolds | 3 | 100.00 |
| Total genes | 9151 | 100.00 |
| Protein coding genes | 9056 | 98.96 |
| RNA genes | 80 | 0.87 |
| Pseudo genes | 15 | 0.16 |
| Genes in internal clusters | Not determined | Not determined |
| Genes with function prediction | 7327 | 80.07 |
| Genes assigned to COGs | 6596 | 72.84 |
| Genes with Pfam domains | 6737 | 74.39 |
| Genes with signal peptides | 824 | 9.10 |
| Genes with transmembrane helices | 2008 | 22.17 |
| CRISPR repeats | 10 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with the general COG functional categories
| Code | Value | %agea | Description |
|---|---|---|---|
| J | 215 | 2.37 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.01 | RNA processing and modification |
| K | 809 | 8.93 | Transcription |
| L | 215 | 2.37 | Replication, recombination and repair |
| B | 4 | 0.04 | Chromatin structure and dynamics |
| D | 44 | 0.49 | Cell cycle control, Cell division, chromosome partitioning |
| V | 65 | 0.72 | Defense mechanisms |
| T | 528 | 5.83 | Signal transduction mechanisms |
| M | 470 | 5.19 | Cell wall/membrane biogenesis |
| N | 159 | 1.76 | Cell motility |
| U | 180 | 1.99 | Intracellular trafficking and secretion |
| O | 224 | 2.47 | Posttranslational modification, protein turnover, chaperones |
| C | 611 | 6.75 | Energy production and conversion |
| G | 625 | 6.90 | Carbohydrate transport and metabolism |
| E | 816 | 9.01 | Amino acid transport and metabolism |
| F | 110 | 1.21 | Nucleotide transport and metabolism |
| H | 246 | 2.72 | Coenzyme transport and metabolism |
| I | 356 | 3.93 | Lipid transport and metabolism |
| P | 359 | 3.96 | Inorganic ion transport and metabolism |
| Q | 253 | 2.79 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 931 | 10.28 | General function prediction only |
| S | 615 | 6.79 | Function unknown |
| - | 2460 | 27.16 | Not in COGs |
aThe total is based on the total number of protein coding genes in the genome
Fig. 3Genome maps of B. caribensis MBA4. The outer circle indicates the location of all ORFs. All ORFs were colored according to their COG functional groups. Light venetian red and medium rose colored arrows indicate tRNA and rRNA genes, respectively. GC content is in black and GC skew + and – is in green and fuchsia, respectively. The sizes of the replicons are not drawn to scale
Fig. 4Schematic representation of the genomic organization of three glycolate oxidase genes in B. caribensis MBA4. Glycolate oxidase genes comprising glcDEF were identified in replicons CP012746, CP012747 and CP012748. In replicons CP012747 and CP012748, a glcC regulator gene was also discovered. In replicon CP012747, a glcB gene, encoding malate synthase, was found downstream of glcDEF