| Literature DB >> 28066362 |
Nan Zhang1, Dongqing Yang1, Joshua R A Kendall2, Rainer Borriss3, Irina S Druzhinina4, Christian P Kubicek4, Qirong Shen1, Ruifu Zhang5.
Abstract
Bacillus subtilis and its sister species B. amyloliquefaciens comprise an evolutionary compact but physiologically versatile group of bacteria that includes strains isolated from diverse habitats. Many of these strains are used as plant growth-promoting rhizobacteria (PGPR) in agriculture and a plant-specialized subspecies of B. amyloliquefaciens-B. amyloliquefaciens subsp. plantarum, has recently been recognized, here we used 31 whole genomes [including two newly sequenced PGPR strains: B. amyloliquefaciens NJN-6 isolated from Musa sp. (banana) and B. subtilis HJ5 from Gossypium sp. (cotton)] to perform comparative analysis and investigate the genomic characteristics and evolution traits of both species in different niches. Phylogenomic analysis indicated that strains isolated from plant-associated (PA) habitats could be distinguished from those from non-plant-associated (nPA) niches in both species. The core genomes of PA strains are more abundant in genes relevant to intermediary metabolism and secondary metabolites biosynthesis as compared with those of nPA strains, and they also possess additional specific genes involved in utilization of plant-derived substrates and synthesis of antibiotics. A further gene gain/loss analysis indicated that only a few of these specific genes (18/192 for B. amyloliquefaciens and 53/688 for B. subtilis) were acquired by PA strains at the initial divergence event, but most were obtained successively by different subgroups of PA stains during the evolutional process. This study demonstrated the genomic differences between PA and nPA B. amyloliquefaciens and B. subtilis from different niches and the involved evolutional traits, and has implications for screening of PGPR strains in agricultural production.Entities:
Keywords: Bacillus subtilis and Bacillus amyloliquefaciens; genome evolution; horizontal gene transfer; phylogenomics; plant-associated; rhizosphere adaptation
Year: 2016 PMID: 28066362 PMCID: PMC5169363 DOI: 10.3389/fmicb.2016.02039
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Summary statistics and information on the genome and the isolation of the 31 .
| NJN-6 | 4.05 | 8 | China | This study | ||||
| LFB112 | 3.94 | 9 | Chinese herbs | Cai et al., | ||||
| Y2 | 4.24 | 10 | He et al., | |||||
| CAU B946 | 4.02 | 10 | Blom et al., | |||||
| NAU-B3 | 4.2 | 11 | Wu et al., | |||||
| YAU B9601-Y2 | 4.24 | 10 | He et al., | |||||
| SQR9 | 4.12 | 7 | Zhang et al., | |||||
| CC178 | 3.92 | 9 | Korea | Kim et al., | ||||
| AS 43.3 | 3.96 | 10 | USA | Dunlap et al., | ||||
| UCMB5033 | 4.07 | 10 | Tajikistan | Niazi et al., | ||||
| UCMB5036 | 3.91 | 9 | Manzoor et al., | |||||
| UCMB5113 | 3.89 | 9 | Soil | Ukraine | Niazi et al., | |||
| FZB42T | 3.92 | 10 | Sugar beet field | Germany | Chen et al., | |||
| n.a. | IT-45 | 3.93 | 10 | n.a. | n.a. | CP004065 | ||
| DSM7T | 3.98 | 10 | n.a. | Germany | Rückert et al., | |||
| LL3 | 4 | 7 | Korean bibimbap | Korea | Geng et al., | |||
| TA208 | 3.94 | 7 | Soil | China | Zhang et al., | |||
| XH7 | 3.94 | 7 | n.a. | Yang et al., | ||||
| n.a. | HJ5 | 4.01 | 7 | China | This study | |||
| BAB-1 | 4.02 | 11 | Guo et al., | |||||
| n.a. | XF-1 | 4.06 | 10 | Guo et al., | ||||
| n.a. | BSn5 | 4.09 | 11 | Deng et al., | ||||
| 168 | 4.22 | 10 | n.a | USA | Kunst et al., | |||
| ATCC 6051 | 4.22 | 10 | n.a. | Kabisch et al., | ||||
| n.a. | PY79 | 4.03 | 10 | n.a. | Schroeder and Simmons, | |||
| RO-NN-1 | 4.01 | 10 | Desert soil | Earl et al., | ||||
| BSP1 | 4.04 | 10 | Poultry | Switzerland | Schyns et al., | |||
| n.a. | QB928 | 4.15 | 10 | n.a. | n.a. | Yu et al., | ||
| BEST195 | 4.09 | 10 | Miyagino-based natto | Japan | Kamada et al., | |||
| TU-B-10T | 4.21 | 10 | Soil | Tunisia | Earl et al., | |||
| W23 | 4.03 | 8 | n.a. | n.a. | Zeigler, |
n.a. not available
Based on the information from NCBI (.
These two strains are exactly the same strain which have been independent sequenced and annotated by different laboratories.
Figure 1Pan- and core-genome analysis of 18 . The lines in gray and black represent the pan- and core-genomes, respectively. The pan-genome increased with addition of new strains to the study (9037 OGs), while the core genome decreased at a slow rate with added strains (2409).
Figure 2Evolutionary relationship between the 31 Bacillus strains and hierarchal clustering. (A) The maximum-likehood phylogeny derived from the alignment of 1835 concatenation core genes. Leaf clipart indicates association with plants. Colors indicate biogeographic origin of the strains. All nodes shown on the phylogram are supported by >85% bootstrap values and >0.94 posterior probabilities obtained after the 0.2 M mcmc generations in Bayesian analysis. Black circles indicate support for the most important nodes. (B) Hierarchal clustering among genomes using presence/absence of orthologous groups Almost all of nodes on the phylogram are supported by 97% >approximately unbiased P-values.
Figure 3Gene counts for each COG category in the core genome for non-plant-associated (nPA) vs. plant-associated (PA) groups. Each pair of columns represents the COG category of the specific genes. Cellular process and signaling: D, cell cycle control, cell division, chromosome partitioning; M, cell wall/membrane/envelope biogenesis; N, cell motility; O, posttranslational modification, protein turnover, chaperones; T, signal transduction mechanisms; V, defense mechanisms. Metabolism: C, energy production and conversion; E, amino acid transport and metabolism; F, nucleotide transport and metabolism; G, carbohydrate transport and metabolism; H, coenzyme transport and metabolism; I, lipid transport and metabolism; P, inorganic ion transport and metabolism; Q, secondary metabolites biosynthesis, transport and catabolism. Information storage and processing: J, translation, ribosomal structure, and biogenesis; K, transcription; L, replication, recombination and repair. Poorly characterized: R, general function prediction only; S, function unknown.
Figure 4Results of gene gain/loss analysis. Boxes on nodes and tips of the phylogeny show genome size (genome size calculations exclude one of the orthologous groups that was excluded from the analysis, see Results). Numbers on branches indicate the number of genes gained (left, through HGT, gene duplication, or gene birth) and lost (right) at the divergence event.
Figure 5Heat map representing the COG categories across the . The dendrogram was constructed using the pairwise Pearson's r correlation calculated for the protein cluster membership distributions. The cell color in the heatmap represents the percentage of gene numbers of each COG category, as revealed by the legend.