| Literature DB >> 32845829 |
Benjamin J Perry1, John T Sullivan1, Elena Colombi2, Riley J T Murphy2, Joshua P Ramsay2, Clive W Ronson1.
Abstract
Mesorhizobium is a genus of soil bacteria, some isolates of which form an endosymbiotic relationship with diverse legumes of the Loteae tribe. The symbiotic genes of these mesorhizobia are generally carried on integrative and conjugative elements termed symbiosis islands (ICESyms). Mesorhizobium strains that nodulate Lotus spp. have been divided into host-range groupings. Group I (GI) strains nodulate L. corniculatus and L. japonicus ecotype Gifu, while group II (GII) strains have a broader host range, which includes L. pedunculatus. To identify the basis of this extended host range, and better understand Mesorhizobium and ICESym genomics, the genomes of eight Mesorhizobium strains were completed using hybrid long- and short-read assembly. Bioinformatic comparison with previously sequenced mesorhizobia genomes indicated host range was not predicted by Mesorhizobium genospecies but rather by the evolutionary relationship between ICESym symbiotic regions. Three radiating lineages of Loteae ICESyms were identified on this basis, which correlate with Lotus spp. host-range grouping and have lineage-specific nod gene complements. Pangenomic analysis of the completed GI and GII ICESyms identified 155 core genes (on average 30.1 % of a given ICESym). Individual GI or GII ICESyms carried diverse accessory genes with an average of 34.6 % of genes unique to a given ICESym. Identification and comparative analysis of NodD symbiotic regulatory motifs - nod boxes - identified 21 branches across the NodD regulons. Four of these branches were associated with seven genes unique to the five GII ICESyms. The nod boxes preceding the host-range gene nodZ in GI and GII ICESyms were disparate, suggesting regulation of nodZ may differ between GI and GII ICESyms. The broad host-range determinant(s) of GII ICESyms that confer nodulation of L. pedunculatus are likely present amongst the 53 GII-unique genes identified.Entities:
Keywords: ICE; Loteae; Lotus; Mesorhizobium; evolution; host range; nodulation; symbiosis
Year: 2020 PMID: 32845829 PMCID: PMC7643969 DOI: 10.1099/mgen.0.000426
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Comparison of GI and GII genomes and ICESyms
|
Species |
Strain designation |
Synonymous strain ID |
Geographic origin |
|
Host-range group |
ONT coverage |
Illumina coverage |
Genome size (bp) |
Genome GC% |
ICESym structure |
ICESym size (bp) |
ICESym GC% |
Strain original publication |
Prior genome-sequencing data |
GenBank accession |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
R7A |
Reisolate of: ICMP 3153; NZP2238; Lc265Da |
New Zealand (Ireland) |
|
GI |
27 |
490 |
6 529 994 |
62.9 |
Monopartite |
501 815 |
59.3 |
Sullivan |
Kelly |
CP033366.1 |
|
|
R88B |
ICESym-Transconjugate: ICMP 3153 |
New Zealand |
|
GI |
36 |
361 |
7 207 166 |
62.4 |
Monopartite |
501 815 |
59.3 |
Sullivan |
Reeve |
CP033367.1 |
|
|
MAFF303099 |
|
Japan |
|
GI |
|
|
7 596 297 |
62.7 |
Monopartite |
611 008 |
59.7 |
Kaneko |
|
BA000012.4, BA000013.4, AP003017.1 |
|
|
NZP2234 |
ICMP 10866; CC811 |
United States |
|
GI |
19 |
37 |
6 749 717 |
63.1 |
Monopartite |
422 420 |
59.6 |
Bailey |
|
CP033364.1 |
|
|
NZP2298 |
ICMP 12619 |
Canada |
|
GI |
50 |
40 |
7 336 816 |
62.8 |
Monopartite |
467 156 |
59.7 |
Charlton (1981) [ |
|
CP033365.1 |
|
|
NZP2014 |
ICMP 10717 |
New Zealand |
|
GII |
38 |
126 |
6 602 217 |
62.9 |
Monopartite |
528 908 |
59.2 |
Bailey |
Haskett |
CP033361.1 |
|
|
NZP2042 |
ICMP 10765 |
New Zealand |
|
GII |
24 |
136 |
6 870 350 |
63.1 |
Tripartite |
526 858 |
59.6 |
Bailey |
Haskett |
CP033334.1 |
|
|
NZP2037 |
ICMP 1326 |
New Zealand |
|
GII |
|
|
7 481 739 |
62.9 |
Tripartite |
562 308 |
59.4 |
Bailey |
Haskett |
CP016079.1, CP016080.1 |
|
|
SU343 |
ICMP 10808; NZP2196 |
United States |
|
GII |
21 |
120 |
7 200 913 |
63.0 |
Tripartite |
556 132 |
59.4 |
Crow |
Haskett |
CP033368.1, CP033369.1, CP033370.1 |
|
|
ATCC 700743T |
ICMP 4682; ATCC 33669T |
|
|
GII |
16 |
139 |
7 201 057 |
63.0 |
Tripartite |
556 132 |
59.4 |
Martinez-Hidalgo |
|
CP033507.1, CP033508.1, CP033509.1 |
Fig. 1.Summary of GI and GII pangenomic comparison and GII-ICESym uniquely conserved genes. (a) Pangenome calculation of GI and GII core and accessory genes as a function of the AAi thresholds used. (b) Pangenome calculation of core and accessory genes within GI and GII ICESym regions as a function of AAi threshold used. (c) Total uniquely conserved GII-ICESym genes across genomes versus ICESyms as a function of AAi threshold used for pangenome calculations. (d) Abundance of COG single-letter functional groups annotated to GII-ICESym uniquely conserved genes.
Fig. 2.Average nucleotide identity of genomes containing NodZ, NodD1 and NodD2 homologues. genomes were selected based on the conservation of homologues of all three of NodZ, NodD1 and NodD2 of the GI and GII strains at 85 % AAi or greater. Putative genospecies with ANI >95 % are grouped with black borders. Origin indicates the region a strain was originally isolated from: AFR – Africa, ASI – Asia, EUR – Europe, JAP – Japan, NAM – North America, SAM – South America. L1 – Lineage 1 Loteae ICESyms, L2 – Lineage 2 Loteae ICESyms, L3 – Lineage 3 Loteae ICESyms.
Fig. 3.Shared synteny of non-isogenic GI and GII ICESyms. Coloured regions indicate LCBs across ICESyms. Colourless stretches indicate ICESym unique regions, and regions below the axis indicate regions inverted relative to the R7A ICESym. For tripartite ICESyms, boundaries of the α, β and γ fragments are indicated with vertical red lines. The Minimal-ICESym maintains the conserved order of LCBs shared by all ICESyms.
Fig. 4.Average nucleotide identity of ICESym symbiotic regions. ANI comparison of ICESym symbiotic regions extracted from the 43 strains shown in Fig. 2. Symbiotic regions were identified based on the presence of a nod box or nifA binding motif within an assembled contig, and genomic DNA was trimmed from contigs using ICESym integrase attP sites. Pairwise comparisons of the 43 ICESym symbiotic regions spanned an average of 175 949 bp (±74 537 bp sd). Groupings representing Loteae ICESym lineages are indicated with black borders.
Fig. 5.HMM analysis of nod box regulatory motifs of GI and GII ICESyms. (a) Consensus nod box motif of the HMM training set and the nod box motifs identified in the GI and GII ICESym regions. (b) Maximum-likelihood phylogeny with 1000 bootstraps of the 95 nod box motifs identified. Bootstrap values greater than 60 are indicated in bold. (c) Overview of nod box motif presence or absence in the nod box regulons of unique GI and GII ICESyms. Duplicated or triplicate nod boxes within an ICESym are indicated with red borders.