| Literature DB >> 35242952 |
Christian Blumenscheit1, Jennifer Jähne1, Andy Schneider1, Jochen Blom2, Thomas Schweder3, Peter Lasch1, Rainer Borriss3,4.
Abstract
Here, we report the complete genome sequence data of the biocontrol strains Bacillus velezensis BP1.2A and BT2.4 isolated from Vietnamese crop plants. The size of the genomes is 3,916,868 bp (BP1.2A), and 3,922,686 bp (BT2.4), respectively. The BioProjects have been deposited at NCBI GenBank. The GenBank accession numbers for the B. velezensis strains are PRJNA634914 (BP1.2A) and PRJNA634832 (BT2.4) for the BioProjects, CP085504 (BP1.2A) and CP085505 (BT2.4) for the chromosomes, GCA_013284785.2 (BP2.1A), and GCA_013284785.2 (BT2.4) for GenBank assembly accessions, and SAMN15012571 (BP1.2A) and SAMN15009897 (BT2.4) for the BioSamples. Both genomes were closely related to FZB42, the model strain for plant growth promoting bacilli.Entities:
Keywords: Bacillus velezensis; Complete genome; Lipopeptides; Macrolactin; Phylogenetic analysis; Polyketides
Year: 2022 PMID: 35242952 PMCID: PMC8885614 DOI: 10.1016/j.dib.2022.107978
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Sequence comparison of BP1.2A, and BT2.4 with FZB4242 using blastn, and ANIb [11]. The italic numbers set in brackets indicate the overlap of the sequences used in the comparison. Analysis of singletons was performed with the EDGAR software package [12].
| BP1.2A (CP085504.1) | BT2.4 (CP085505.1) | FZB42 (CP000560.2) | |
|---|---|---|---|
| BP1.2A | * | 100 (99.74) | 100.00 (99.64) |
| BT2.4 | 100.00 (99.67) | * | 99.99 (99.58) |
| FZB42 | 100.00 (99.64) | 99.99 (99.61) | * |
| Query BP1.2A | Query BT2.4 | Query FZB42 | |
| BP1.2A cover | 100 | 99.854% | 98.877% |
| BP1.2A identities | 100 | 99.995% | 99.989% |
| BP1.2A different nts | 0 | 184/3,916,940 | 426/3,874,585 |
| BP1.2A gaps | 0 | 74/3,916,940 | 102/3,874,585 |
| BT2.4 cover | 100% | 100% | 99.866% |
| BT2.4 identities | 99.996% | 100 | 99.993% |
| BT2.4 different nts | 174/3,916,868 | 0 | 274/3,911,604 |
| BT2.4 gaps | 25/3,916,868 | 0 | 21/3,911,604 |
| FZB42 cover | 99.697% | 98.026% | 100 |
| FZB42 identities | 99.987% | 99.990% | 100 |
| FZB42 different nts | 490/3,904,992 | 382/3,845,221 | 0 |
| FZB42 gaps | 182/3,904,992 | 192/3,845,221 | 0 |
| BP1.2A | BT2.4 | FZB42 | |
| BP1.2A | * | 1 | 41 |
| BT2.4 | 0 | * | 40 |
| FZB42 | 67 | 67 | * |
General genomic features of B. velezensis BP1.2A (CP085504.1), and BT2.4 (CP085505.1) compared with FZB42 (NC_009725.2). Methods used for generating the data are set in brackets (PGAP, RAST, EDGAR). Differences to FZB42 are labelled in red letters.
| BP1.2A | BT2.4 | FZB42 | |
|---|---|---|---|
| Genome size (bp) | 3,916,868 | 3,922,686 | 3,918,596 |
| G+C% | 46.5 | 46.5 | 46,5 |
| Number of genes (PGAP) | 3871 | 3870 | 3855 |
| CDSs total (PGAP) | 3753 | 3752 | 3734 |
| CDS core genome (EDGAR) | 3633 | 3633 | 3633 |
| CDS pan genome (EDGAR) | 3757 | 3757 | 3757 |
| RNA genes (RAST) | 118 | 118 | 118 |
| rRNAs (PGAP) | 27 | 27 | 29 |
| tRNAs (PGAP) | 86 | 86 | 88 |
| ncRNAs (PGAP) | 5 | 5 | 4 |
| Pseudo genes (PGAP) | 71 | 69 | 59 |
| Number of coding sequences (RAST) | 3939 | 3946 | 3938 |
| Number of Subsystems (RAST) | 324 | 324 | 324 |
| Cofactors, Vitamins, Prosthetic Groups, Pigments | 147 | 147 | 147 |
| Cell Wall and Capsule | 73 | 73 | 73 |
| Virulence, Disease and Defense | 38 | 38 | 38 |
| Potassium metabolism | 3 | 3 | 3 |
| Miscellaneous | 24 | 24 | 24 |
| Phages, Prophages, Transposable elements, Plasmids | 0 | 0 | 0 |
| Membrane Transport | 42 | 42 | 42 |
| Iron acquisition and metabolism | 25 | 25 | 25 |
| RNA metabolism | 63 | 63 | 64 |
| Nucleosides and Nucleotides | 95 | 95 | 95 |
| Protein Metabolism | 209 | 209 | 211 |
| Cell Division and Cell Cycle | 6 | 6 | 6 |
| Motility and Chemotaxis | 42 | 42 | 42 |
| Regulation and Cell signaling | 28 | 28 | 28 |
| Secondary Metabolism | 6 | 6 | 6 |
| DNA Metabolism | 63 | 63 | 63 |
| Fatty Acids, Lipids, and Isoprenoids | 53 | 53 | 53 |
| Nitrogen Metabolism | 20 | 20 | 20 |
| Dormancy and Sporulation | 91 | 91 | 91 |
| Respiration | 40 | 40 | 40 |
| Stress Response | 43 | 43 | 43 |
| Metabolism of Aromatic Compounds | 12 | 12 | 13 |
| Amino Acids and Derivatives | 299 | 300 | 301 |
| Sulfur Metabolism | 6 | 6 | 6 |
| Phosphorus Metabolism | 12 | 12 | 12 |
| Carbohydrates | 215 | 215 | 215 |
Detection of gene clusters involved in synthesis of secondary metabolites in the genomes of B. velezensis BP1.2A (CP085504), and B.velezensis BT2.4 (CP085505). For comparison FZB42 (CP000560.2) was also analyzed. Similarity to known metabolites listed in the MIBiG 2.0 repository [8] is indicated.
| Region | CP085504 | CP085505 | CP000560.2 | Similarity | ||||
|---|---|---|---|---|---|---|---|---|
| Surfactin | 318,208 | 383,067 | 318,208 | 383,067 | 322,723 | 387,582 | 95% | BGC0000433 |
| Plantazolicin | 717,159 | 740,336 | 717,099 | 740,276 | 721,674 | 744,851 | 100% | BGC0000569 |
| Ketoacyl:ACP synthase | 935,682 | 976,926 | 935,298 | 976,542 | 940,739 | 981,983 | 100% | Bacillus |
| Squalene/phytoene synthase | 1062,552 | 1079,781 | 1062,168 | 1079,397 | 1074,783 | 1075,523 | 100% | Bacillus |
| Macrolactin H | 1366,841 | 1453,226 | 1366,457 | 1452,842 | 1371,897 | 1458,282 | 100% | BGC0000181 |
| Bacillaene | 1676,755 | 1777,357 | 1676,371 | 1776,973 | 1681,811 | 1782,413 | 100% | BGC0001089 |
| Fengycin | 1866,123 | 1903,373 | 1865,739 | 1902,989 | 1871,179 | 1908,429 | 100% | BGC0001095 |
| Bacillomycin D | 1907,878 | 1963,948 | 1918,319 | 1963,564 | 1923,759 | 1969,004 | 100% | BGC0001090 |
| Squalene-hopene synthase | 2010,880 | 2032,763 | 2010,496 | 2032,379 | 2024,219 | 2026,102 | 100% | Bacillus |
| T3PKS | 2099,249 | 2140,349 | 2098,865 | 2139,965 | 2102,588 | 2143,688 | 100% | Bacillus |
| Difficidin | 2269,142 | 2362,931 | 2268,758 | 2362,547 | 2344,012 | 2286,309 | 100% | BGC0000176 |
| PK-5x Cys | 2851,295 | 2900,808 | 2850,911 | 2906,712 | 2873,990 | 2884,225 | 88% | B.velezensis |
| Bacillibactin | 3017,800 | 3024,927, | 3023,696 | 3030,823 | 3021,021 | 3033,995 | 100% | BGC0000309 |
| Amylocyclicin | 3039,655 | 3045,228, | 3045,551 | 3051,124 | 3043,470 | 3049,481 | 100% | BGC0000616 |
| Bacilysin | 3574,134 | 3615,552 | 3580,030 | 3621,448 | 3593,882 | 3599,780 | 100% | BGC0001184 |
Fig. 1Phylogenetic tree of B.velezensis strains BP1.2A (CP085504), and BT2.4 (CP085505) labelled in red letters. The tree, based on whole genome sequences, was inferred with FastME 2.1.6.1 [9] from GBDP distances calculated from genome sequences. The branch lengths are scaled in terms of GBDP distance formula. The numbers below branches are GBDP pseudo-bootstrap support values from 100 replications, with an average branch support of 57.3%.
Fig. 2Pairwise comparison of the genomes of B. velezensis BP1.2A, and BT2.4 with B. velezensis FZB42, and the type strain of B. velezensis CCUG 50,740 using ANIb [11].
Fig. 3Venn diagram of the genomes of FZB42 (1), BP1.2A (2), and BT2.4 (3). Please note: The singleton numbers don´t necessarily correspond to the numbers in the “Singleton” interface (Table 3). The Venn diagram constructed with EDGAR shows the number of best hits between subsets of genomes. But: A gene without reciprocal best hit to another genome is not necessarily a singleton [12].
| Subject | Biological sciences |
| Specific subject area | Molecular Phylogenetics |
| Type of data | Table, Figure, genome sequencing data in FASTA format. |
| How the data were acquired | Short reads were generated with Illumina HiSeq at LGC Genomics (Berlin, Germany). Long reads were obtained with Oxford Nanopore MinION. |
| Data format | Analyzed DNA sequence data in FASTA, NEWICK and text format. |
| Description of data collection | Pure cultures of BP1.2A and BT2.4 were used to isolate genomic DNA and to obtain the genomic data. Genome annotation was carried out using NCBI Genome Automatic Annotation Pipeline (PGAP) and RAST. |
Data source location | BP1.2A was isolated from black pepper roots (Viet Nam; Chu Se, Gia Lei), and BT2.4 was isolated from dragon fruit tree (Viet Nam: Ham thuan Nam, Binh Thuan) by Le Thi Thanh Tam, PPRI Hanoi Viet Nam |
| Data accessibility | The BioProjects have been deposited at NCBI GenBank under the following accession numbers: Bioprojects: PRJNA634914 (BP1.2A), and PRJNA634832 (BT2.4), Biosamples: SAMN15012571 (BP1.2A), and SAMN15009897 (BT2.4), Sequences of the chromosomes: CP085504.1 (BP1.2A) and CP085505.1 (BT2.4), GenBank assembly accessions: GCA_013285085.2 (BP1.2A), and GCA_013284785.2 (BT2.4). The SRA records could be accessed for BP1.2A, and BT2.4 from their corresponding links from the BioProjects. |
| With the article | |
| L.T.T. Tam, J. Jähne, P.T. Luong, L.T.P. Thao, L.T.K. Chung, A. Schneider, C. Blumenscheit, P. Lasch, T. Schweder, R. Borriss. Draft genome sequences of 59 endospore-forming Gram-positive bacteria associated with crop plants grown in Vietnam. Microbiol. Resour. Announc. 9 (2020): e01154–20 |