| Literature DB >> 27688836 |
Shumei Zhang1, Wei Jiang1, Jing Li1, Liqiang Meng1, Xu Cao1, Jihua Hu2, Yushuai Liu1, Jingyu Chen2, Changqing Sha3.
Abstract
Bacillus amyloliquefaciens TF28 is a biocontrol endophytic bacterium that is capable of inhibition of a broad range of plant pathogenic fungi. The strain has the potential to be developed into a biocontrol agent for use in agriculture. Here we report the whole-genome shotgun sequence of the strain. The genome size of B. amyloliquefaciens TF28 is 3,987,635 bp which consists of 3754 protein-coding genes, 65 tandem repeat sequences, 47 minisatellite DNA, 2 microsatellite DNA, 63 tRNA, 7rRNA, 6 sRNA, 3 prophage and CRISPR domains.Entities:
Keywords: Bacillus amyloliquefaciens; Biocontrol; Broad spectrum; Endophytic bacterium; Genome sequence
Year: 2016 PMID: 27688836 PMCID: PMC5031281 DOI: 10.1186/s40793-016-0182-6
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1A scanning electron micrograph of B. amyloliquefaciens TF 28 cells
Classification and general features of B.amyloliquifaciens TF28 as per MIGS recommendation [26]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain: TF28 | TAS [ | ||
| Gram stain | Positive | TAS [ | |
| Cell shape | Rod | TAS [ | |
| Motility | Motile | TAS [ | |
| Sporulation | Endospore-forming | TAS [ | |
| Temperature range | 15–37 °C | TAS [ | |
| Optimum temperature | 30 °C | TAS [ | |
| pH range; Optimum | 5–9, 7.5 | TAS [ | |
| Carbon source | Glucose, lactose, starch | TAS [ | |
| MIGS-6 | Habitat | Soil, Plant | TAS [ |
| MIGS-6.3 | Salinity | 0–3 % W/V | TAS [ |
| MIGS-22 | Oxygen requirement | Aerobic | TAS [ |
| MIGS-15 | Biotic relationship | Free-living | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogen | NAS |
| MIGS-4 | Geographic location | China/Heilongjiang | TAS [ |
| MIGS-5 | Sample collection | 2006-06-10 | TAS [ |
| MIGS-4.1 | Latitude | Not reported | |
| MIGS-4.2 | Longitude | Not reported | |
| MIGS-4.4 | Altitude | Not reported |
aEvidence codes - NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [39]
Fig. 2Phylogenetic trees based on 16S rRNA gene sequences highlighting the position of B. amyloliquefaciens TF 28 (shown in bold). The GenBank accession numbers are shown in parentheses. Sequences were aligned using CLUSTALW, and phylogenetic inferences were constructed using the neighbor-joining method within the MEGA 5.10 software (Additional file 7: Table S7). Numbers at the nodes represent percentages of bootstrap values obtained by repeating the analysis 1000 times to generate a majority consensus tree. The scale bar indicates 0.0005 nucleotide change per nucleotide position, respectively
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | Illumina Paired-End Library (500 bp insert size) and Mate Pair Library (6,000 bp insert size) |
| MIGS 29 | Sequencing platforms | Illumina Hiseq2000 |
| MIGS 31.2 | Fold coverage | 150× |
| MIGS 30 | Assemblers | SOAPdenovo software 2.04 |
| MIGS 32 | Gene calling method | Glimmer |
| Locus Tag | TH57 | |
| GenBank ID | JUDU00000000 | |
| GenBank Date of Release | 2015/01/21 | |
| GOLD ID | - | |
| BIOPROJECT | PRJNA268537 | |
| MIGS 13 | Source Material Identifier | TF28 |
| Project relevance | Biocontrol, Agriculture |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 3,987,635 | 100.00 |
| DNA coding (bp) | 3,571,596 | 89.57 |
| DNA G + C (bp) | 1,849,465 | 46.38 |
| DNA scaffolds | 3 | - |
| Total genes | 3863 | 100.00 |
| Protein coding genes | 3754 | 97.18 |
| RNA genes | 76 | 1.97 |
| Pseudo genes | 38 | 0.98 |
| Genes in internal clusters | 1554 | 40.23 |
| Genes with function prediction | 2941 | 76.13 |
| Genes assigned to COGs | 3218 | 83.30 |
| Genes with Pfam domains | 3292 | 85.21 |
| Genes with signal peptides | 204 | 5.28 |
| Genes with transmembrane helices | 1041 | 26.95 |
| CRISPR repeats | 3 | - |
Fig. 3Circle map of strain TF28 genome. From outer to inner circle, circle 1 shows protein-coding genes colored by COG categories; circle 2 shows G + C% content plot; circle 3 shows GC skew
Number of genes associated with general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 136 | 3.02 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0 | RNA processing and modification |
| K | 269 | 5.97 | Transcription |
| L | 114 | 2.53 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 34 | 0.75 | Cell cycle control, Cell division, chromosome partitioning |
| V | 49 | 1.09 | Defense mechanisms |
| T | 134 | 2.97 | Signal transduction mechanisms |
| M | 170 | 3.77 | Cell wall/membrane biogenesis |
| N | 54 | 1.19 | Cell motility |
| U | 45 | 0.99 | Intracellular trafficking and secretion |
| O | 96 | 2.13 | Posttranslational modification, protein turnover, chaperones |
| C | 178 | 3.95 | Energy production and conversion |
| G | 243 | 5.39 | Carbohydrate transport and metabolism |
| E | 342 | 7.58 | Amino acid transport and metabolism |
| F | 78 | 1.73 | Nucleotide transport and metabolism |
| H | 123 | 2.73 | Coenzyme transport and metabolism |
| I | 116 | 2.57 | Lipid transport and metabolism |
| P | 208 | 4.61 | Inorganic ion transport and metabolism |
| Q | 122 | 2.71 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 429 | 9.51 | General function prediction only |
| S | 277 | 6.14 | Function unknown |
| - | 1291 | 28.63 | Not in COGs |
The total % age is based on the total number of protein coding genes in the annotated genome
Fig. 4GO annotation of protein-coding genes
Fig. 5KEGG annotation of protein-coding genes
Number of strain-specific genes with general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 2 | 0.99 | Translation,ribosomal structure and biogenesis |
| A | 0 | 0 | RNA processing and modification |
| K | 5 | 2.49 | Transcription |
| L | 4 | 1.99 | Replication, recombination and repair |
| D | 5 | 2.49 | Cell cycle control, Cell division, chromosome partitioning |
| V | 9 | 4.48 | Defense mechanisms |
| T | 7 | 3.48 | Signal transduction mechanisms |
| N | 1 | 0.49 | Cell motility |
| C | 3 | 1.49 | Energy production and conversion |
| G | 14 | 6.97 | Carbohydrate transport and metabolism |
| E | 8 | 3.98 | Amino acid transport and metabolism |
| F | 4 | 1.99 | Nucleotide transport and metabolism |
| H | 6 | 2.99 | Coenzyme transport and metabolism |
| I | 1 | 0.49 | Lipid transport and metabolism |
| P | 1 | 0.49 | Inorganic ion transport and metabolism |
| Q | 1 | 0.49 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 9 | 4.48 | General function prediction only |
| S | 3 | 1.49 | Function unknown |
| - | 116 | 57.71 | Not in COGs |
Fig. 6Phylogenetic trees based on gyrA(a) and rpoB(b). The GenBank accession numbers are shown in parentheses. Sequences were aligned using CLUSTALW, and phylogenetic inferences were constructed using the neighbor-joining method within the MEGA 5.10 software. Numbers at the nodes represent percentages of bootstrap values obtained by repeating the analysis 1000 times to generate a majority consensus tree. The scale bar indicates 0.2 (gyrA) and 0.1 (rpoB) nucleotide change per nucleotide position, respectively