| Literature DB >> 31405380 |
Xiaoxiao Dang1,2,3, Zhihong Xie4,5, Wei Liu1,3, Yu Sun1,2,3, Xiaolin Liu1,2,3, Yongqiang Zhu6, Christian Staehelin7.
Abstract
BACKGROUND: Ensifer alkalisoli YIC4027, a recently characterized nitrogen-fixing bacterium of the genus Ensifer, has been isolated from root nodules of the host plant Sesbania cannabina. This plant is widely used as green manure and for soil remediation. E. alkalisoli YIC4027 can grow in saline-alkaline soils and is a narrow-host-range strain that establishes a symbiotic relationship with S. cannabina. The complete genome of this strain was sequenced to better understand the genetic basis of host specificity and adaptation to saline-alkaline soils.Entities:
Keywords: Comparative genomics; Complete genome sequencing; Ensifer alkalisoli; Environmental adaptation; Host-specific symbiosis
Mesh:
Substances:
Year: 2019 PMID: 31405380 PMCID: PMC6689892 DOI: 10.1186/s12864-019-6004-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Circular plots of the three replicons of E. alkalisoli YIC4027. The outermost and second-outermost circles represent the predicted protein coding sequences on the forward and the reverse strand, respectively. They are colored according to the assigned cluster of orthologous group classes. The innermost and second-innermost circles show the GC skew values and GC content, respectively. GC skew values are colored purple for plus values and yellow for minus values. GC content circles show deviations from the average (projecting outward for values higher than the average and projecting inward for values lower than the average). For the chromosome, the predicted RNA sequences on the forward and the reverse strands are shown in the circles 3 and 4 from the outermost to the innermost circles, respectively. The genes for rRNA are marked in red and those for tRNA in blue. All three replicons are drawn at the same scale
General genomic features of E. alkalisoli YIC4027
| Feature | |
|---|---|
| Genome size (bp) | 6,128,433 |
| Chromosome (bp) | 3,690,234 |
| pYIC4027b (bp) | 1,981,775 |
| pYIC4027a (bp) | 456,424 |
| GC content(%) | 62.2 |
| CDS coverage(%) | 86.2 |
| CDS number | 6024 |
| Number of genes with known function | 4540 (75.4%) |
| Hypothetical proteins | 1484 (24.6%) |
| tRNA genes | 55 |
| rRNA operons | 3 |
| Average gene length (bp) | 876 |
| Genes assigned to COG | 4555 (75.6%) |
| Genes assigned to KEGG | 2114 (35.1%) |
COG categorization of E. alkalisoli YIC4027 CDSs
| COG functional categories | CDSs | % of CDSs |
|---|---|---|
| Metabolism | ||
| C-Energy production and conversion | 298 | 4.95 |
| E-Amino acid transport and metabolism | 557 | 9.25 |
| F-Nucleotide transport and metabolism | 86 | 1.43 |
| G-Carbohydrate transport and metabolism | 438 | 7.27 |
| H-Coenzyme metabolism | 158 | 2.62 |
| I-Lipid metabolism | 139 | 2.31 |
| P-Inorganic ion transport and metabolism | 181 | 3.00 |
| Q-Secondary metabolites biosynthesis, transport, and catabolism | 113 | 1.88 |
| Cellular processes and signaling | ||
| D-Cell division and chromosome partitioning | 32 | 0.53 |
| M-Cell envelope biogenesis, outer membrane | 229 | 3.80 |
| N-Cell motility and secretion | 77 | 1.28 |
| O-Post-translational modification, protein turnover, chaperones | 175 | 2.91 |
| T-Signal transduction mechanisms | 184 | 3.05 |
| U-Intracellular trafficking and secretion | 91 | 1.51 |
| V-Defense mechanisms | 54 | 0.90 |
| Information storage and processing | ||
| B-Chromatin structure and dynamics | 2 | 0.03 |
| J-Translation, ribosomal structure and biogenesis | 169 | 2.81 |
| K-Transcription | 382 | 6.34 |
| L-DNA replication, recombination, and repair | 193 | 3.20 |
| Pooly characterized | ||
| R-General function prediction only | 492 | 8.17 |
| S-Function unknown | 505 | 8.38 |
| Total | 4555 | 75.61 |
KEGG pathway categorization of E. alkalisoli YIC4027 CDSs
| KEGG pathway functional categories | CDSs | % of CDSs |
|---|---|---|
| Metabolism | ||
| – Carbohydrate metabolism | 273 | 4.53 |
| – Energy metabolism | 223 | 3.70 |
| – Lipid metabolism | 92 | 1.53 |
| – Nucleotide metabolism | 112 | 1.86 |
| – Amino acid metabolism | 295 | 4.90 |
| – Metabolism of other amino acids | 87 | 1.44 |
| – Glycan biosynthesis and metabolism | 37 | 0.61 |
| – Metabolism of cofactors and vitamins | 163 | 2.71 |
| – Metabolism of terpenoids and polyketides | 35 | 0.58 |
| – Biosynthesis of other secondary metabolites | 25 | 0.42 |
| – Xenobiotic biodegradation and metabolism | 102 | 1.69 |
| Genetic information processing | ||
| – Transcription | 4 | 0.07 |
| – Translation | 81 | 1.34 |
| – Folding, sorting and degradation | 45 | 0.75 |
| – Replication and repair | 53 | 0.88 |
| Environmental information processing | ||
| – Membrane transport | 285 | 4.73 |
| – Signal transduction | 93 | 1.54 |
| Cellular processes | ||
| – Cell growth and death | 40 | 0.66 |
| – Cell motility | 55 | 0.91 |
| – transport and catabolism | 14 | 0.23 |
| Total | 2114 | 35.09 |
Fig. 2Phylogenetic relationships of E. alkalisoli YIC4027 and other rhizobia based on concatenated core protein sequences. The phylogenetic tree was constructed by the Maximum-Likelihood method using MEGA version 6.0 with 1000 bootstrap replicates
Fig. 3Venn diagram displaying overlaps and differences of orthologous genes in YIC4027 and E. fredii strains. Numbers represent the unique genes for an individual genome or common genes that are present in two or three genomes
Fig. 4Arrangement of nodulation genes in E. alkalisoli YIC4027, E. fredii strains NGR234, HH103, and USDA257. Black arrows represent structural genes involved in NF synthesis. White arrows indicate genes involved in the transcriptional regulation of NFs. Gray arrows represent nonfunctional nodulation genes (truncated or mutated genes)
Genes of E. alkalisoli YIC4027 related to protein secretion systems
| System | Gene identification | Localization |
|---|---|---|
| Type I secretion | ||
| | EKH55_1288 | Chromosome |
| | EKH55_2900 to 2901 | Chromosome |
| | EKH55_5479 to 5480 | pYIC4027b |
| Type III secretion (T3SS) | ||
| T3SS-I | ||
| | EKH55_5609 to 5632 | pYIC4027a |
| T3SS-II | ||
| | EKH55_0998 to 1018 | Chromosome |
| Type IV secretion (T4SS) | ||
| T4SS-I | ||
| | EKH55_5752 to 5930 | pYIC4027a |
| T4SS-II | ||
| | EKH55_4625 to 4740 | pYIC4027b |
| Twin-arginine translocation (TAT) system | ||
| | EKH55_1307 to 1309 | Chromosome |
| Secretion system | ||
| | EKH55_3274 | Chromosome |
| | EKH55_0177 | Chromosome |
| | EKH55_1104 | Chromosome |
| | EKH55_1142 | Chromosome |
| | EKH55_1213 | Chromosome |
| | EKH55_1317 | Chromosome |
| | EKH55_2587 | Chromosome |
| | EKH55_1316 | Chromosome |
| | EKH55_0075 | Chromosome |
| | EKH55_0711 | Chromosome |
| SRP (signal recognition particle) components | ||
| | EKH55_3169 | Chromosome |
| | EKH55_3175 | Chromosome |
Fig. 5Comparison of the T3SS gene clusters in E. alkalisoli YIC4027 and Ensifer fredii strains NGR234, USDA257, and HH103. Rhizobium conserved (rhc) genes are shown in gray. Nodulation outer proteins (nop) genes are shaded in black. The positions of the tts boxes are indicated by vertical arrows
Fig. 6Chemotaxis behavior of E. alkalisoli YIC4027 and constructed cheA mutants. a A representative soft agar plate with proline as the sole carbon source. b The percentages of the chemotactic ring diameters of the mutants relative to those of the wild-type strain. Error bars represent standard deviations (SD) calculated from three independent experiments
Fig. 7Nodulation efficiency of the constructed cheA1 and cheA2 mutants in competitive nodulation tests. S. cannabina seedlings were co-inoculated with the E. alkalisoli YIC4027 wild-type (WT) and an indicated mutant strain at ratios 1:1 and 1:10. Nodule occupancy was determined at the time of harvest. Error bars represent standard deviations calculated from at least three repetitions