| Literature DB >> 24651173 |
Jian-Bo Xie1, Zhenglin Du2, Lanqing Bai1, Changfu Tian1, Yunzhi Zhang1, Jiu-Yan Xie1, Tianshu Wang1, Xiaomeng Liu1, Xi Chen1, Qi Cheng3, Sanfeng Chen1, Jilun Li1.
Abstract
We provide here a comparative genome analysis of 31 strains within the genus Paenibacillus including 11 new genomic sequences of N2-fixing strains. The heterogeneity of the 31 genomes (15 N2-fixing and 16 non-N2-fixing Paenibacillus strains) was reflected in the large size of the shell genome, which makes up approximately 65.2% of the genes in pan genome. Large numbers of transposable elements might be related to the heterogeneity. We discovered that a minimal and compact nif cluster comprising nine genes nifB, nifH, nifD, nifK, nifE, nifN, nifX, hesA and nifV encoding Mo-nitrogenase is conserved in the 15 N2-fixing strains. The nif cluster is under control of a σ(70)-depedent promoter and possesses a GlnR/TnrA-binding site in the promoter. Suf system encoding [Fe-S] cluster is highly conserved in N2-fixing and non-N2-fixing strains. Furthermore, we demonstrate that the nif cluster enabled Escherichia coli JM109 to fix nitrogen. Phylogeny of the concatenated NifHDK sequences indicates that Paenibacillus and Frankia are sister groups. Phylogeny of the concatenated 275 single-copy core genes suggests that the ancestral Paenibacillus did not fix nitrogen. The N2-fixing Paenibacillus strains were generated by acquiring the nif cluster via horizontal gene transfer (HGT) from a source related to Frankia. During the history of evolution, the nif cluster was lost, producing some non-N2-fixing strains, and vnf encoding V-nitrogenase or anf encoding Fe-nitrogenase was acquired, causing further diversification of some strains. In addition, some N2-fixing strains have additional nif and nif-like genes which may result from gene duplications. The evolution of nitrogen fixation in Paenibacillus involves a mix of gain, loss, HGT and duplication of nif/anf/vnf genes. This study not only reveals the organization and distribution of nitrogen fixation genes in Paenibacillus, but also provides insight into the complex evolutionary history of nitrogen fixation.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24651173 PMCID: PMC3961195 DOI: 10.1371/journal.pgen.1004231
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Paenibacillus strains used in study.
| Strains | Source | Nitrogen fixer | Genome sequence |
|
| Sweetgum stem wood, Florida, USA | − |
|
|
| Obsidian hot spring, Montana, USA | − |
|
|
| Soil of Tianmu Mountain, Zhejiang, China | − | Unpublished |
|
| Soil of maize-farming fields, Guizhou, China | − | Unpublished |
|
| Rhizosphere soil, Shandong, China | − |
|
|
| Rhizosphere of winter barley, Chonnam, South Korea | − |
|
|
| Rhizosphere of pepper, Guizhou, China | − |
|
|
| Soil, Kobe city, Japan | − | Unpublished |
|
| Human intestinal microflora, USA | − | Unpublished |
|
| Human intestinal microflora, USA | − | Unpublished |
|
| Soil, Tel Aviv, Israel | − |
|
|
| Soil samples, Hangzhou, China | − |
|
|
| Milk, Belgium | − | Unpublished |
|
| Soil, Republic of Korea | − |
|
|
| Oral swab from female patient, USA | − | Unpublished |
|
| Rhizosphere, Tel Aviv, Israel | − |
|
|
| Bamboo rhizosphere, Beijing, China |
| Unpublished |
|
| Scutellaria rhizosphere, Liaoning, China |
| This study |
|
| Corn rhizosphere, Shanxi, China |
| This study |
|
| Wheat rhizosphere, Beijing, China |
| This study |
|
| Corn rhizosphere, Shanxi, China |
| This study |
|
| Root of | + |
|
|
| Soil of forest residue, Daejeon, Republic of Korea | + |
|
|
| Wheat roots, Parana state, Brazil |
| This study |
|
| Maize rhizosphere soil, Ramonville, France |
| This study |
|
| Rhizosphere of Ku Caihua, Xinjiang, China |
| This study |
|
| Rhizosphere of |
| This study |
|
| Willow rhizosphere, Beijing, China |
| This study |
|
| Pepper rhizosphere, Hubei, China |
| This study |
|
| Forsythia rhizosphere, Beijing, China |
| This study |
|
| Rhizosphere of |
| Unpublished |
Genomic features of Paenibacillus strains.
| Species | Status | GenBank accession number | Genome size (Mb) | G+C content | tRNA genes | Protein-coding sequences (CDSs) |
|
| Complete | CP001656. | 7.18 | 50.3 | 87 | 6213 |
|
| Complete | CP001793 | 7.12 | 51.2 | 73 | 6238 |
|
| Complete | CP002869 | 8.66 | 58.4 | 108 | 7811 |
|
| Complete | CP003422 | 8.77 | 58.2 | 189 | 7252 |
|
| Complete | CP003235 | 8.74 | 58.3 | 170 | 7057 |
|
| Complete | CP000154 | 5.39 | 45.8 | 91 | 4805 |
|
| Complete | CP002213 | 6.24 | 44.6 | 91 | 6032 |
|
| Complete | AEDD00000000 | 5.45 | 51.9 | 101 | 4824 |
|
| Draft | AEXS00000000 | 6.95 | 51.0 | 71 | 6496 |
|
| Draft | AFDH00000000 | 6.28 | 52.8 | 72 | 5992 |
|
| Draft | AHKH00000000 | 6.38 | 54.0 | 31 | 5660 |
|
| Draft | AFHW00000000 | 7.96 | 52.4 | 51 | 7777 |
|
| Draft | AGIP00000000 | 6.81 | 51.8 | 74 | 6149 |
|
| Draft | AGFX00000000 | 5.77 | 46.4 | 81 | 5073 |
|
| Draft | ACIH00000000 | 4.90 | 51.8 | 69 | 4460 |
|
| Draft | ADHJ00000000 | 6.39 | 48.8 | 57 | 5928 |
|
| Draft | ALJV00000000 | 5.92 | 45.1 | 54 | 5729 |
|
| Draft | ASSA00000000 | 6.10 | 45.0 | 50 | 5697 |
|
| Draft | ASRZ00000000 | 6.00 | 44.2 | 69 | 5731 |
|
| Draft | ASSB00000000 | 5.44 | 46.0 | 59 | 5599 |
|
| Draft | ASRY00000000 | 5.65 | 46.4 | 56 | 5628 |
|
| Draft | AGFI00000000 | 5.79 | 46.6 | 73 | 5275 |
|
| Complete | CP003107 | 6.08 | 46.8 | 89 | 5525 |
|
| Draft | ASSE00000000 | 6.32 | 48.4 | 63 | 5722 |
|
| Draft | ASSG00000000 | 7.08 | 50.4 | 61 | 7081 |
|
| Draft | AJTY00000000 | 7.61 | 50.4 | 46 | 7705 |
|
| Draft | ASQQ00000000 | 5.44 | 50.8 | 37 | 5924 |
|
| Draft | ASSF00000000 | 8.52 | 47.9 | 83 | 9087 |
|
| Draft | ASSD00000000 | 5.12 | 50.9 | 50 | 5622 |
|
| Draft | ASSC00000000 | 5.19 | 53.0 | 37 | 5552 |
|
| Complete | CP004078 | 5.27 | 52.6 | 82 | 5250 |
Figure 1Genomic diversity of strains in the genus Paenibacillus.
Each strain is represented by an oval that is colored: N2-fixing strains (red), non- N2-fixing strains (purple). The number of orthologous coding sequences (CDSs) shared by all strains (i.e., the core genome) is in the center. Overlapping regions show the number of CDSs conserved only within the specified genomes. Numbers in non-overlapping portions of each oval show the number of CDSs unique to each strain. The total number of protein coding genes within each genome is listed below the strain name.
Figure 2Functional classification of gene content of the 31 Paenibacillus strains.
(A) Profiles of Cluster of Orthologous Groups (COG) showing percentage of genes in each category out of total annotated genes. Taxa from inside of circle to outside of circle are from Paenibacillus sp. JDR 2 (top in the strain list) to P. sabinae T27 (down in the strain list). (B) Profiles of COG showing function categories for genes in core genomes. Taxa from inside of circle to outside of circle are from Paenibacillus sp. JDR 2 (top in the strain list) to P. sabinae T27 (down in the strain list).
Figure 3Organization of nif, vnf, anf and nif-like genes in N2-fixing Paenibacillus strains.
nif, vnf, anf and nif-like genes are marked with different colors. The 9 nif genes nifBHDKENXhesAnifV are contiguous within Sub-group I and there is a orf between nifX and hesA within Sub-group II.
Figure 4Comparison of G+C contents of the nif clusters with those of the average of the chromosomal genomes.
Figure 5Synteny of the chromosomal regions flanking the nif gene cluster among each sub-group.
(A) nif clusters of Sub-group I. (B) The chromosomal regions of non-N2-fixing strains corresponding to those flanking the nif gene cluster of Sub-group I. (C) nif clusters of Sub-group II.
Figure 6Bayesian inferred phylogenetic tree of concatenated NifHDK homologs.
The interior node values of the tree are clade credibility values, values lower than 100% are indicated. Branches are colored blue (Mo-nitrogenase, Nif), green (V-nitrogenase, Vnf), purple (Fe-nitrogenase, Anf), light blue (uncharacterized homolog), dark yellow (uncharacterized nitrogenase). The text colored red was Paenibacillus.
Figure 7Maximum-likelihood phylogenetic tree of Paenibacillus strains and the 4 possible evolutionary pathways of nitrogen fixation in Paenibacillus.
The tree was constructed based on 275 single-copy core proteins shared by the 31 Paenibacillus genomes and the rooting strain B. subtilis 168. Four likely pathways are marked with number 1–4.
Figure 8Maximum-likelihood phylogenetic tree of Mo-, Fe- and V-nitrogenases and nitrogenase-like sequences.
Nif/Vnf/AnfH, Nif/Vnf/AnfD, Nif/Vnf/AnfK, Nif/Vnf/AnfH-like, Nif/Vnf/AnfD-like and Nif/Vnf/AnfK sequences were derived from the 15 N2-fixing Paenibacillus strains and other representative species.