Literature DB >> 35238735

Taxonomy of Rhizobiaceae revisited: proposal of a new framework for genus delimitation.

Nemanja Kuzmanović¹, Camilla Fagorzi², Alessio Mengoni², Florent Lassalle³, George C diCenzo⁴.

Abstract

Entities: Chemical

Keywords: Ensifer; Rhizobiaceae; Rhizobium; Sinorhizobium; Xaviernesmea; genus boundaries

Mesh：

Substances：

Year: 2022 PMID： 35238735 PMCID： PMC9558580 DOI： 10.1099/ijsem.0.005243

Source DB: PubMed Journal: Int J Syst Evol Microbiol ISSN： 1466-5026 Impact factor: 2.689

× No keyword cloud information.

Data Summary

All genome sequences used in this work were previously published, and the assembly accessions are provided in Dataset S1 (available in the online version of this article). Twelve supplementary figures and three supplementary datasets are included in the online version of this article. All raw data (core-proteome average amino acid identity data, whole-proteome average amino acid identity data, average nucleotide identity data, percentage of conserved proteins, and a Newick formatted phylogeny) used to generate the figures presented in this manuscript are available through Figshare at https://doi.org/10.6084/m9.figshare.17076455.v1 [1]. A pipeline to extract the 170 marker proteins used for phylogenetic reconstruction and in calculating core-proteome average amino acid identity values, is available through GitHub at github.com/flass/cpAAI_Rhizobiaceae.

Introduction

The family of the order was proposed in 1938 and has since undergone numerous, and at times contentious, taxonomic revisions [2, 3]. Currently, this family comprises the genera , , , , (syn. ), , , , , , , , ‘ ’, , , , , and (syn. ; https://lpsn.dsmz.de/) [4]. The family contains phenotypically diverse organisms, including N2-fixing legume symbionts (known as rhizobia), plant pathogens, bacterial predators, and other soil bacteria. The agricultural and ecological significance of the family has prompted the isolation and whole genome sequencing of hundreds of strains at a rate outpacing taxonomic refinement of the family. As a result, some species and genera within the family are well known to be paraphyletic [5], while others that are monophyletic likely represent multiple species/genera [6]. In addition, most currently named genera have been delineated based on genomic relatedness – as per current taxonomic guidelines [7] – but lack synapomorphic traits that would give them biological and ecological significance [8]. To aid in the taxonomic classification of this family, here we propose a general framework for defining genera in the family . This framework is based on a set of baseline genomic relatedness measures meant to serve as minimal thresholds for genus demarcation, while allowing for more closely related species to be divided into separate genera when supported by supplemental genomic and/or biological data. By applying this framework, we propose the formation of a new genus – Xaviernesmea – on the basis of the genomic relatedness measures, and provide support for the recently described genus . In addition, despite genomic relatedness values that would not support genus demarcation based on the proposed baseline thresholds, we argue that current phylogenetic, genomic (e.g. pentanucleotide frequency) and biological (e.g. division by budding) data indicate that the genera Casida et al. 1982 [9] and Chen 1988 [10] are not synonymous, meaning that the unification of the genera and in Opinion 84 of the Judicial Commission is no longer justified.

Methods

Dataset

The analysis was performed on a dataset of 94 genomes of strains, among which the majority were type strains of the corresponding species (Dataset S1). As an outgroup, we included the genomes of three strains, belonging to the related family . Moreover, for calculation of some overall genome relatedness indices (OGRIs) to support additional taxonomic revisions, genomes of two and two strains were included (Dataset S1). To verify the authenticity of genomes used for taxonomic reclassifications proposed in this paper, we compared the reference 16S rRNA gene sequences (as well as housekeeping gene sequences in ambiguous cases) associated with the original species publication with the sequences retrieved from genome sequences (Dataset S1). Whole genome sequences generated to support new species description in original publications were considered as authentic (Dataset S1).

Core-genome gene phylogeny

The core-genome phylogeny was obtained using the GET_HOMOLOGUES software package version 10032020 [11] and the GET_PHYLOMARKERS software package version 2.2.8_18Nov2018 [12], as described previously [13]. As a result, a set of 170 non-recombining single-copy core marker genes was selected, and a concatenation of their codon-based alignments was used as input for IQ-TREE ModelFinder, with which a search for the best sequence evolution model was conducted. The model ‘GTR+F+ASC+R8’ was selected based on a Bayesian information criterion. The maximum-likelihood (ML) core genome phylogeny was inferred under this model using IQ-TREE [14], with branch supports assessed with approximate Bayes test (-abayes) and ultrafast bootstrap with 1000 replicates (-bb 1000).

Overall genome relatedness indices calculations

Whole-proteome average amino-acid identity (wpAAI; usually simply known as AAI) was computed using the CompareM software (github.com/dparks1134/CompareM) using the aai_wf command with default parameters, i.e., ortholog identification with DIAMOND [15], e-value <1e-3, percent identity >30 %, and alignment length >70 % the length of the protein. Core-proteome average amino-acid identity (cpAAI) was computed as the proportion of substitutions in pairwise comparisons of sequences from the 170 non-recombining, single-copy core marker genes identified using GET_PHYLOMARKERS [12], using a custom R script that notably relied on the dist.aa() function from the ‘ape’ package [16]. Percentage of conserved proteins (POCP) was determined using publicly available code (github.com/hoelzer/pocp) and the ortholog identification thresholds defined by Qin et al. [17], namely, e-value <1e-5, percent identity >40 %, and alignment length >50 % the length of the protein. This pipeline involved the reannotation of genomes with Prodigal version 2.6.3 [18] and ortholog identification using the blast+ package, version 2.10.1 [19]. The average nucleotide identity (ANIb) comparisons were conducted using PyANI version 0.2.9, with scripts employing the blast+ algorithm to align the input sequences (https://github.com/widdowquinn/pyani). The digital DNA–DNA hybridization (dDDH) computations were performed with the Genome-to-Genome Distance Calculator (GGDC 2.1; https://ggdc.dsmz.de/distcalc2.php) using the recommended blast+ alignment and formula 2 (identities/HSP length) [20].

16S rRNA gene phylogeny

The RNA fasta files for the 157 or strains analysed in our recent study [21] were downloaded from the National Centre for Biotechnology Information database, and all 16S rRNA gene sequences ≥1000 nt were extracted. The 16S rRNA gene sequences were aligned using mafft version 7.3.10 with the localpair option [22], and trimmed using trimAl version 1.4.rev22 with the automated1 option [23]. A ML phylogeny was prepared using raxmlHPC-HYBRID-AVX2 version 8.2.12 with the GTRCAT model [24]. The final phylogeny is the bootstrap best tree following 756 bootstrap replicates, as determined by the extended majority-rule consensus tree criterion.

Code availability

To facilitate the adoption of our framework for genus demarcation in the family , a custom pipeline was prepared to extract the 170 marker proteins from other genomes. The pipeline, together with the 170 marker proteins from the 97 strains analysed in the current study, is available at github.com/flass/cpAAI_Rhizobiaceae. Briefly, the pipeline will use tblastn [19] to identify genes encoding orthologs of all 170 marker proteins in the input genomes, which will then be extracted and translated. Each set of protein orthologs is then aligned with mafft [22] or Clustal Omega [25] to the pre-computed alignment of the orthologs from the 97 strains analysed here, and a concatenated alignment prepared. The output files can then be used for phylogenetic reconstruction or cpAAI calculations, with sample code for cpAAI calculations also provided.

Results and discussion

Overall genomic relatedness indices measurements in the family

To develop a framework for genus demarcation within the family , we examined a selection of 94 genomes of isolates, most of which are species type strains. , an obligate intra-cellular pathogen with a highly reduced genome, was excluded from our selection of organisms to avoid biassing the analysis by overly reducing the conserved gene set. We reasoned that good practices for genome sequence-based genus delineation should consider both phylogenetic relatedness of species based on a concatenated alignment of core-genome genes (Figs 1 and S1), and one or more OGRI measurement. We initially considered four OGRIs, calculated as described in the Methods: (i) average nucleotide identity (ANIb), (ii) whole-proteome average amino acid identity (wpAAI); (iii) core-proteome average amino acid identity (cpAAI) based on the proportion of substitutions between the concatenated translated sequences of the core marker gene set used for the core-genome phylogeny; and (iv) the percentage of conserved proteins (POCP) as defined by Qin et al. [17]. Digital DNA–DNA hybridization (dDDH; Dataset S3) was also performed for some strains to verify they represented distinct species; however, dDDH was not considered when defining genera.

Fig. 1.

Maximum-likelihood core-genome phylogeny of the family . A maximum-likelihood phylogeny 94 strains is shown. The number of strains included in each collapsed clade is indicated. Clades of focus in the current study are expanded along the righthand side of the figure. The phylogeny is built from the concatenated alignments of 170 nonrecombinant loci using IQ-TREE [14]. The numbers on the nodes indicate the approximate Bayesian posterior probabilities support values (first value) and ultra-fast bootstrap values (second value). The tree was rooted using three spp. sequences as the outgroup. The scale bar represents the number of expected substitutions per site under the best-fitting GTR+F+ASC+R8 model. An expanded phylogeny is provided as Fig. S1. On the assumption that genera are not artificial divisions of a continuum of species, but that they instead represent biologically meaningful differentiation of groups of species, we reasoned that an OGRI threshold for delimiting genera should correspond to a drop in the OGRI frequency distribution. We therefore plotted histograms of all pairwise comparisons to identify potential genera boundaries (Figs 2 and S2–S4). It was previously suggested that a 50 % POCP threshold is a good measure of genus boundaries in other families [17]. However, we found that 3885 out of the 4371 (89 %) pairwise comparisons in our dataset gave a POCP value ≥50 %, with no clear breaks in the frequency distribution (Fig. S2). We therefore concluded that POCP is not a useful OGRI measurement for defining genera in the family . Similar observations were also reported for some other taxa, such as members of the roseobacter group [26].

Fig. 2.

Distribution of core-proteome AAI (cpAAI) comparisons of the family . Pairwise cpAAI values were calculated based on 170 nonrecombinant loci from the core-genome of 94 members of the family Rhizobiaceae. Results are summarized as a histogram with a bin width of 0.5 %. cpAAI data was recently used to delineate genera among other bacterial families [26, 27] and stands as a promising metric for genus demarcation in the family (Fig. 2). We observed a break in the frequency distribution at ~93 % to~94 %, but it was too stringent to use for genus demarcation as it would result in the majority of the 94 strains being classified into their own genera. Likewise, the break at ~73 % to~74 % was too lenient for genus delimitation as all strains would be grouped as a single genus, except for those belonging to the genera , , ‘ ’, and . Instead, the drop in the frequency distribution at 86–86.7 % (inclusive), within which only five of the 4371 pairwise comparisons fell, appeared to be a reasonable threshold to aid with defining genera in the family (Fig. 2). The drop in the frequency distribution at ~86 % was also visible in scatterplots showing the relationships between cpAAI and either wpAAI or ANIb (Fig. S5). Notably, this corresponds nicely with a recent study that used a cpAAI threshold of 86 % in genus demarcation in the roseobacter group of the class α-Proteobacteria [26]. Using a cpAAI threshold of ~86 %, combined with the phylogeny of Fig. 1, we were able to largely preserve the current taxonomy of the family , recovering the genera , , , (as previously defined), , , , ‘ ’, , , and (Figs 3 and S6). Such a threshold would, however, split the genera , , , and Rhizobium sensu stricto into two or more genera.

Fig. 3.

Core-proteome AAI (cpAAI) matrix of the family . A matrix showing the pairwise cpAAI values for each pair of 94 members of the family . Values were clustered using the core-genome gene phylogeny of Figs 1 and S1. Several named genera are indicated with red boxes, as indicated. A version of this matrix with a colour scheme representing the full range of cpAAI values is provided as Fig. S6. Both wpAAI and ANIb were correlated with cpAAI (Fig. S5), although neither relationship was linear. However, there was less support for the presence of genus-level drops in the wpAAI or ANIb frequency distributions (Figs S3 and S4). Nonetheless, the wpAAI frequency distribution density increased sharply below 76.5 %, while there was a sharp increase in the ANIb frequency distribution below 78.5 %. Although noisier, a wpAAI threshold of 76.5 % or a ANIb threshold of 78.5 % returned similar genus demarcations as did a cpAAI threshold of ~86 %, with a few exceptions (Figs S7–S10). In the case of wpAAI, the genus was recovered as a single genus, was split into fewer genera, and the separation between the genus and its sister taxon was less clear. When using ANIb, and were combined as a single genus, was split into three genera instead of two, the genus was recovered as a single genus, and the separation between the genus and its sister taxon was less clear.

Proposal for a framework for genus delineation in the family

Based on the results summarized above, we propose that genera within the family be defined as monophyletic groups (as determined by a phylogenetic reconstruction using a core-genome analysis approach; Fig. 1) separated from related species using a pairwise cpAAI threshold of approximately 86 % calculated as described in the methods. We specify ‘approximately 86%’ to provide some flexibility in the threshold to allow for differences in the evolution of each genus; for example, a cpAAI value of 85.7 % appears better suited for the genus . We strongly recommend the use of cpAAI over wpAAI or ANIb due to (1) its natural agreement – by construction – with the core-genome gene phylogeny, (2) clearer gaps in its distribution of values among , and (3) the fact that it would not be sensitive to the wide genome size variation within the , notably due to the variation in presence of large mobile genetic elements, including symbiotic and tumor-inducing megaplasmids. We do not, however, propose that cpAAI serve as the sole information source for genus demarcation as nearly all biological rules have exceptions. We therefore propose that genus demarcation using a cpAAI threshold higher than 86 % can be justified by the presence of alternate genomic or phenotypic evidence (as proposed below for splitting of the genus ), while a lower cpAAI threshold may be appropriate when considering historical classifications of genera within the family.

Taxonomic implications of the proposed framework

Following the criteria for genus demarcation outline above would notably lead to the formation of several new genera for species currently assigned to the genus , which is notoriously paraphyletic. They also imply that a few genera ( , , , , and Rhizobium sensu stricto) may be candidates for division. We also note that there is a clear break in the distribution of cpAAI values at ~73 % to ~74 % that may represent an appropriate threshold for delimiting the family . If adopted, this threshold would result in the genera and being transferred to their own families, while the genera and ‘ ’ would form another family. However, a proposal for family-level demarcations in the order is outside the scope of this work.

Proposal of a new genus encompassing the species and ‘ ’

In a recent study presenting a phylogeny of 571 and strains (ML tree based on 155 concatenated core proteins) [28], the type strains of the species ( ) [29] and ‘ ’ formed a well-delineated clade (with 100 % bootstrap support) that was clearly separated from the closest validly published genus type, i.e. strain H152T. This pattern was also evident from an ML phylogeny of 797 produced in another study based on the concatenation of 120 near-universal bacterial core genes [6]. The analyses presented in the current study further support the separation of the /‘ ’ clade (RoRr clade; two species type strains) not only from the clade (four species type strains), but also from a sister clade consisting solely of rhizobial strains from unnamed species (unRhsp clade; including sp. strains Leaf383, Leaf371, 9140 and NFR03) (Fig. 1); all three clades in the phylogenetic tree are supported by 100 % bootstrap values. All within-clade pairwise cpAAI values were above 85.7, 91 and 94 % for the clade, RoRr clade, and the unnamed clade, respectively (Fig. 3). In contrast, all pairwise cpAAI values between the RoRr clade and the or unRhsp clades were less than 81%, while pairwise cpAAI values between the and unRhsp clades were below 83.5 % (Fig. 3). All three clades thus represent separate genera according to the criteria proposed above, and this remains true when the analysis is repeated with an expanded set of strains (Fig. S11). We therefore propose to define a new genus encompassing the RoRr clade, for which we propose the name Xaviernesmea (see below for formal description). As no strains belonging to the unRhsp clade have been deposited in any international culture collection, we leave the task of describing new species and genera within this clade to others who have access to these strains.

Taxonomy of the ‘ complex’

The ‘ complex’ was initially identified as a sister taxon of the genus [29], with subsequent work demonstrating that it is instead located on a clade neighbouring the genus [13]. Moreover, the latter study suggested that ‘ complex’ includes members of the genus and that it may represent a novel genus on the basis of phylogenetic and multiple OGRI data, although the authors advised that further investigation was required [13]. It was recently suggested that the ‘ complex’ be split into two genera [30]. It was proposed that , and be transferred to the genus , while , , and be transferred to the novel genus along with the novel species ‘ ’ [30]; some of these changes have recently been validated [31]. The analyses presented in the current study included 13 strains belonging to the ‘ complex’ (Fig. 1). The genus demarcation framework proposed here supports the previous studies indicating that the ‘ complex’ is separate from the genus . A group of seven species that included all species present in our analysis ( DSM 1111T, ‘ ’ AOL15T, ‘Rhizobium glycinendophyticum’ CL12T, shin9-1T, ‘ ’ 7209-2T, ‘ ’ W3T and ‘ ’ W44T) formed a monophyletic group with 100 % bootstrap support (Fig. 1). All pairwise cpAAI values within this group were >88 %, while all pairwise cpAAI values against the other six ‘ complex’ species were <84.7 % (Fig. 3). These results support the formation of the genus [30], which should also include , as well as ‘ ’, and ‘R. glycinendophyticum’. We therefore propose that be transferred to the genus (see below for formal description). The species ‘ ’ and ‘R. glycinendophyticum’ should also be transferred to once their names are validly published. The remaining six ‘ complex’ species formed a monophyletic group that could be further sub-divided into two clades. One clade corresponded to a group of three species including the genus type strain, while the other clade contained DSM 17795T, TSY03bT and ATCC BAA-1503T (Fig. 1). All within-group cpAAI values were >86.5 % while all between-group cpAAI values were ≤85.4 %, providing support for these two clades representing separate genera. However, the bootstrap support for the split of these two clades in the phylogeny is only 80 %, and the topology of the tree in this region (Fig. 1) differs from the tree reported by Rahi et al., wherein , and were not monophyletic (see Fig. 2 of [30]). Overall, the data presented here are not in agreement with , and belonging to the genus . Instead, we propose this clade be referred to as the ‘ complex’ pending further study – enabled by the availability of additional genomes of strains belonging to these clades – to resolve whether these species belong to the genus or whether they should be transferred to a novel genus.

Proposal for the emendation of the genus as a distinct genus from

Taxonomy of the genus / has been the subject of discussion since the early 2000s. The genus was proposed in 1982 to describe , a bacterial predator [9]. Subsequently, the genus was proposed in 1988 when was reclassified as [10], which was followed by the emendation of this genus by de Lajudie et al. in 1994 [32]. In 2002, as the 16S rRNA gene sequence of became available, the Subcommittee on the Taxonomy of and (hereafter ‘the subcommittee’) of the International Committee on Systematics of Prokaryotes (ICSP) noted that this taxon is a part of [33]. Although the subcommittee pointed out that the name has priority, conservation of the name was endorsed in contravention of the rules of the International Code of Nomenclature of Prokaryotes (ICNP). Neighbour-joining trees reconstructed from 16S rRNA gene sequences or partial recA gene sequences, together with phenotypic data, provided further data interpreted as supporting the synonymy and unification of the genera and , leading Willems et al. to propose the new combination ‘ ’ [34]. Accordingly, in their Request for an Opinion to the Judicial Commission, Willems et al. officially proposed to conserve the name [34]. As the primary argument for conservation of the name , the authors indicated that the name would cause misunderstanding and confusion in the scientific community. A few months later, in a Request for an Opinion to the Judicial Commission, J. M. Young argued that , not , was the valid name for the unified genus, as had priority [35]. At the same time, J. M. Young emended the description of the genus , and transferred previously described species to this genus [35]. The Judicial Commission of the ICSP (Judicial Opinion 84) later confirmed that had priority over , pointed out that the name ‘ ’ is not validly published, and supported the transfer of members of the genus to [36]. In this Opinion, it was claimed that the transfer of the members of the genus to the genus would not cause confusion. The subcommittee, however, disagreed with this justification [37]. J. M. Young criticized these actions of the subcommittee [38], which was also later acknowledged by Tindall [39]. As predicted by Willems et al. [34], adoption of the genus name continues to be met with resistance from many rhizobiologists [40]. Earlier phylogenetic studies noted that was an outgroup of the genus [41], providing some support that represented a distinct genus; however, it was suggested that further evidence would be required prior to redefining genera within this clade [41]. Significant phylogenomic and phenotypic data now exists providing strong evidence that the genera and as defined Casida 1982 [9] and Chen et al. 1988 [10], respectively, refer to closely related, yet separate, taxa. At least seven studies, including the Genome Taxonomy Database, have presented phylogenetic trees containing two well-defined clades within the genus [21, 28, 42–46]. These phylogenies were built on the basis of gene (up to 1652 genes) or protein (up to 155 proteins) sequences using ML or Bayesian inference analysis approaches, indicating that the observed clades are robust to the choice of phylogenetic approach. Notably, our recent study presents an ML phylogeny where the genus is split into two clades of 12 and 20 genospecies with 100 % bootstrap support for the split, which we then defined as the ‘nonsymbiotic’ and ‘symbiotic’ clades, respectively [21]. The split is also observed in an ML phylogeny of the 16S rRNA genes of the same strains, with 62 % bootstrap support (Fig. S12). We similarly see a split of the genus into two clades of three species type strains (including Casida AT, the type strain of the type species of the genus Casida 1982) and 12 species type strains (including USDA 205T, the type strain of the type species of the genus Chen et al. 1988) in our core-genome gene phylogeny, with 100 % bootstrap support (Fig. 1), representing the nonsymbiotic and symbiotic clades, respectively. However, all between-clade cpAAI values were above the suggested 86 % threshold as a baseline criterion for genus delimitation. Despite this, and following our proposed framework, we argue that there is sufficient other genomic and phenotypic data supporting the division of this genus (cf. Figs 3–5 of [21]). We describe the distinctive traits and respective synapomorphies of these clades in Table 1 and below.

Table 1.

Characteristics differentiating the previously-defined nonsymbiotic and symbiotic clades of the genus , corresponding to the emended genera and , respectively

Characteristic	Nonsymbiotic clade (emended genus Ensifer )	Symbiotic clade (emended genus Sinorhizobium )	Reference
GANTC sites per kb	0.9–1.3 (mean: 1.06)	1.5 to 1.8 (mean: 1.70)	[47]
Number of CDS	5816–7682 (mean: 6876)	5516 to 8629 (mean: 6550)	[21]
Ribosomal RNA operons	5	3	[21]
Carries nod and nif genes	No*	Yes	[21]
Bacterial predation ability	Yes	No	[9, 52, 53]
Division by budding	Yes	No	[50]
Growth in unmodified LB medium	Yes	Poor	[21]
Starch hydrolysis	Yes	No	[51]
Growth at 37 ˚C	No (generally)	Yes (generally)	[21]
Fatty acids	More C_16 : 0 3OH	More C_18 : 1 ω9c	[51]
Carbon sources used (Biolog PM1/PM2)	69–87 (mean: 81)	50–81 (mean: 65)	[21]
pH tolerance (Biolog PM9)	Better low pH tolerance	Better high pH tolerance	[21]

*Except for the species E. sesbaniae, whose nine strains are legume symbionts.

Characteristics differentiating the previously-defined nonsymbiotic and symbiotic clades of the genus , corresponding to the emended genera and , respectively Characteristic Nonsymbiotic clade (emended genus ) Symbiotic clade (emended genus ) Reference GANTC sites per kb 0.9–1.3 (mean: 1.06) 1.5 to 1.8 (mean: 1.70) [47] Number of CDS 5816–7682 (mean: 6876) 5516 to 8629 (mean: 6550) [21] Ribosomal RNA operons 5 3 [21] Carries nod and nif genes No* Yes [21] Bacterial predation ability Yes No [9, 52, 53] Division by budding Yes No [50] Growth in unmodified LB medium Yes Poor [21] Starch hydrolysis Yes No [51] Growth at 37 ˚C No (generally) Yes (generally) [21] Fatty acids More C16 : 0 3OH More C18 : 1 ω9c [51] Carbon sources used (Biolog PM1/PM2) 69–87 (mean: 81) 50–81 (mean: 65) [21] pH tolerance (Biolog PM9) Better low pH tolerance Better high pH tolerance [21] *Except for the species E. sesbaniae, whose nine strains are legume symbionts. The genome-wide frequency of the pentanucleotide GANTC is higher in all genomes of the symbiotic clade compared to all genomes of the nonsymbiotic clade, with a statistically significant average difference of 60 % (1.70 vs 1.06 GANTC sites per kb, P-value<1×10−10 using a two-sample t-test) [47]. As the GANTC motif is methylated by the highly conserved cell cycle-regulated methyltransferase CcrM [48, 49], this difference may reflect an important difference in the cell cycle biology of these two clades [47]. Indeed, species of the nonsymbiotic clade ( and ) are capable of division by budding, unlike species of the symbiotic clade [50]. It has also been shown that the ability to hydrolyze starch [51] and robustly grow in LB broth lacking Ca2+ and Mg2+ ion supplementation [21] is specific to the nonsymbiotic clade. Stress tolerance of the two clades also differs (based on an analysis of 10 representative strains), with strains of the nonsymbiotic clade generally being more tolerant to alkaline conditions while strains of the symbiotic clade were generally more acid-tolerant and heat-tolerant [21]. Although many catabolic abilities could be found in at least a subset of each clade, which is unsurprising given both clades have open pangenomes, species of the nonsymbiotic clade are capable of catabolizing an average of 81 (out of 190 tested) carbon sources compared to an average of 65 for the symbiotic clade [21]. These differences in general phenotypic traits, together with the additional genomic and phenotypic differences outlined in Table 1, indicate marked differences in the biology of strains from these two clades. Indeed, at least two genospecies of the nonsymbiotic clade have been described as bacterial predators [9, 52, 53], a lifestyle that has not been attributed to any members of the symbiotic clade. Moreover, these two clades display significant differences in relation to their interactions with plant species, specifically, a biased distribution of the nod and nif genes required for establishment of nitrogen-fixing symbiosis with legumes [21, 45]. A recent study showed that whereas the core nodABC and nifHDK genes were present in strains from all 20 genospecies of the symbiotic clade, they were observed in just one of the 12 genospecies belonging the nonsymbiotic clade ( , with all nine reported strains, isolated from three different geographic origins, being symbiotic) [21, 51]. Symbiotic traits are linked to the presence of an accessory megaplasmid in the genome, and thus should not be considered relevant in delineating taxa [7]. However, this almost unique ability of genomes from the symbiotic clade to host symbiotic megaplasmids with respect to their relatives from the nonsymbiotic clade likely reflects differences in their genetic background. These discrepancies in symbiotic potential could thus be interpreted as a further marker of differentiated biology between these two clades. Taken together, these genomic and phenotypic data suggest that the organisms in these two clades significantly differ in their biology and ecology, reminiscent of the stable ecotype model for bacterial species [54]. Notably, the type species of the genus ( ) is found within the nonsymbiotic clade, while the original type species of the genus ( ) is found within the symbiotic clade. Given we established the taxonomic position of these type species-containing clades to be well separated, we argue that the proposal of Willems et al. [34] to unify the genera Casida 1982 and Chen et al. 1988, and the Judicial Opinion 84 enacting the transfer of the members of the genus to the genus [36], are no longer supported. Instead, we propose that Casida 1982 and Chen et al. 1988 refer to closely related sister genera, of which and , respectively, are the legitimate names in accordance with Rules 51a and 23a of the ICNP [55]. We note that the subcommittee has previously indicated support for this proposal [56], while also stating they are not in favour of creating subgenera for these taxa [40]. Formal genus and species emendations and circumscriptions are provided below.

Taxonomy of the genus

More study is required to resolve the taxonomic relationships between the ‘Neorhizobium sensu stricto’ clade (Fig. 1) – which includes , N. alkalisolii, N. hautlense, and , as well as – and related taxa. The core-genome gene phylogeny (Fig. 1) and the cpAAI data (Fig. 3) suggest that ‘Neorhizobium lilium’ represents a new genus, as does the clade formed by sp. NCHU2750, , and . However, because bootstrap values provided only moderate support for the topology of the tree in the extended clade and the clades were not well-resolved by the cpAAI data, we defer the proposal of new genera until publication of further genomic evidence.

Additional taxonomic implications of the proposed framework for genus delimitation

DSM 26482T formed a clade with ‘Neorhizobium sensu stricto’ (Fig. 1). Pairwise cpAAI values were all >90 % when was compared against ‘Neorhizobium sensu stricto’ species type strains (Fig. 3). We therefore propose that be transferred to the genus (see below for formal description). CCTC AB 2011022T formed a clade with the genus (Fig. 1). Pairwise cpAAI values were all >88 % when was compared against the four species type strains, but <85 % when compared against all other species (Fig. 3). We therefore propose that be transferred to the genus (see below for formal description). MIM27T formed a clade with the genus . Pairwise cpAAI values were 87.4, 87.4 and 85.7 % when was compared against the three species type strains, but <84 % when compared against all other species (Fig. 3). We therefore propose that be transferred to the genus (see below for formal description). DSM 100211T formed a clade with ‘ ’ JC85T and DSM 7138T (Fig. 1). Pairwise cpAAI values between these three species type strains were all >89 %, while cpAAI values against strains outside of this clade were all <85 % (Fig. 3). We therefore propose that be transferred to the genus (see below for formal description). CCTCC AB 2014007T formed a clade with (Fig. 1). The pairwise cpAAI value between these two species was >95 %, while cpAAI values against strains outside of this clade were all <84 % (Fig. 3). We therefore propose that be transferred to the genus (see below for formal description). To confirm the distinct taxonomic positions of the above-mentioned species and support their transfer to the respective genera, we compared them to other genus members using ANIb and dDDH indices. These indices are regarded as standard measures of relatedness between prokaryotic species that were widely used for species delimitation [57, 58]. In all cases, the obtained values were clearly below the thresholds for species delimitation (95–96 % for ANI or 70 % for DDH) (Datasets S2 and S3), confirming the authenticity of these species. In addition, the following species are candidates as type species for new genera: ‘R. album’, R. populii, and . The reclassification of R. album into a new genus was also suggested by Young et al. [6]. Moreover, CGMC 1.12192T, DSM 29514T and DSM 19479T formed a monophyletic group as a sister taxon to the genus (Fig. 1). This clade of three species type strains is another candidate for reclassification as a new genus as pairwise cpAAI values within the clade were all >86.5 % while all cpAAI values with species outside of the clade were all <83 % (Fig. 3).

Description of Xaviernesmea gen. nov.

Xaviernesmea (gza.vje.nem’e.a.; N.L. fem. n., in honour of Dr. Xavier Nesme, taxonomist of agrobacteria and rhizobia who pioneered the use of reverse ecology approaches to infer the ecology of genomic species from comparative genomic analyses [8]). Cells are Gram-negative, rod-shaped, and aerobic. Oxidase- and catalase-positive. Can utilize adonitol, raffinose and succinic acid. The pH range for growth is pH 5.0–11.0 [59]. The G+C content of the genomic DNA is in the range 62.8–64.7 mol%. The genus Xaviernesmea has been separated from other genera based on a core-genome phylogeny and whole- and core-proteome relatedness indices (wpAAI and cpAAI). The type species is Xaviernesmea oryzae.

Description of Xaviernesmea oryzae comb. nov.

Xaviernesmea oryzae (o.ry’zae. L. gen. fem. n. oryzae, of rice, referring to the host of isolation of the type strain). Basonym: Peng et al. 2008 [60]. Homotypic synonym: (Peng et al., 2008) Mousavi et al. 2015. The description is as provided by Peng et al. 2008 [60] and Mousavi et al. 2015 [29]. X. oryzae can be differentiated from other species of the genus Xaviernesmea based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 62.8 mol%. Its approximate genome size is 5.39 Mbp. The type strain is Alt 505T (=LMG 24253T=CGMCC 1.7048T), isolated from Oryza alta growing in the Wild Rice Core Collection Nursery of South China Agricultural University. The NCBI RefSeq Assembly accession number for the genome sequence is GCF_900109605.1.

Description of Xaviernesmea rhizosphaerae sp. nov.

Xaviernesmea rhizosphaerae (rhi.zo.sphae’rae. N.L. gen. fem. n. rhizosphaerae, of the rhizosphere, referring to host plant compartment of isolation of the type strain). The description is as provided by Zhao et al. [59]. X. rhizosphaerae can be differentiated from other species of the genus Xaviernesmea based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 64.7 mol%. Its approximate genome size is 5.18 Mbp. The type strain is MH17T (=ACCC 19963T=KCTC 52414T), which was isolated from the roots of rice collected from Beijing, PR China. The NCBI RefSeq assembly accession number for the genome sequence is GCF_001938945.1. We note that the name ‘ ’ that was proposed in the original publication [59] has yet to be validly published.

Emended description of the genus Casida 1982

The description is as given by Casida 1982 [9] with the following emendations. The optimal growth temperature is 27–28 °C. Some species can grow at 37 °C. Capable of growth in unmodified lysogeny broth (LB). Capable of hydrolysing starch. Resistant to multiple antibiotics including ampicillin and erythromycin. The genomic G+C content is ~61–63 mol%. The genomic GANTC pentanucleotide frequency is ~0.9–1.3 sites per kb. Most strains carry five rRNA operons. The genus can be differentiated from other genera based on core-genome gene phylogenies. The type species is . The emended genus contains the species , and . The species , , , , E. garamanticum, , , , , , E. mexicanum, , , , E. sofinae, and are transferred to the genus .

Emended description of the genus Chen et al. 1988 emend. de Lajudie et al. 1994

The description is as given by de Lajudie et al. [32] with the following emendations, drawing also from Young [35]. The optimal growth temperature is 25–33 °C, but some strains can grow at 12 °C and others can grow at 44 °C. Optimum pH is 6–7, but some strains can grow at pH 5.0 and others at pH 10.0. Starch is not utilized. Ammonium salts, nitrate, nitrite, and many amino acids can serve as nitrogen sources for most strains. Most strains produce cytochrome oxidase and catalase. The genomic G+C content is ~61–64 mol%. The genomic GANTC pentanucleotide frequency is ~1.5–1.8 sites per kb. Most strains carry three rRNA operons. The genus can be differentiated from other genera based on core-genome gene phylogenies. The type species is . The emended genus contains the species S. alkalisoli, , , , S. garamanticum, S. glycinis, , , , , S. mexicanum, S. numidicum, S. psoraleae, , S. sofinae, S. sojae and .

Description of Sinorhizobium alkalisoli comb. nov.

Sinorhizobium alkalisoli (al.ka.li.so’li. N.L. neut. n. alkali, alkali (from Arabic al-qaliy); L. neut. n. solum, soil; N.L. gen. neut. n. alkalisoli, of alkaline soil, referring to the saline-alkali soil where the bacterium was isolated). Basonym: Li et al. 2016. The description is as provided by Li et al. 2016 [61]. S. alkalisoli can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 62.2 mol%. Its approximate genome size is 6.13 Mbp. The type strain is YIC4027T (=HAMBI 3655T=LMG 29286T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_001723275.1.

Emended description of Toledo et al. 2004

(a.me.ri.ca’num. N.L. neut. adj. americanum, American, referring to the isolation of the type strain from the Colorado Plateau). Homotypic synonym: (Toledo et al. 2004) Wang et al. 2015 emend. Hördt et al. 2020. The description is as provided by Hördt et al. 2020 [5]. can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 62.3 mol%. Its approximate genome size is 6.75 Mbp. The type strain is CFNEI 156T (=ATCC BAA-532T=CIP 108390T=DSM 15007T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_001651855.1.

Emended description of Nick et al. 1999

(ar’bo.ris. L. fem. n. arbour, tree; L. gen. fem. n. arboris, of a tree). Homotypic synonym: (Nick et al. 1999) Young 2003 emend. Hördt et al. 2020. The description is as provided by Hördt et al. 2020 [5]. can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 62.0 mol%. Its approximate genome size is 6.85 Mbp. The type strain is HAMBI 1552T (=ATCC BAA-226T=DSM 13375T=LMG 14919T=NBRC 100383T=TTR 38T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_000427465.1.

Emended description of (Scholla and Elkan 1984) Chen et al. 1988

(fred'i.i. N.L. gen. neut. n. fredii, of Fred, named after of E.B. Fred). Homotypic synonym: (Scholla and Elkan 1984) Young 2003 emend. Hördt et al. 2020. Heterotypic synonym: (Chen et al. 1988) Young 2003 [43] The description is as provided by Hördt et al. 2020 [5]. can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 62.2 mol%. Its approximate genome size is 7.15 Mbp. The type strain is USDA 205T (=ATCC 35423T=CCUG 27877T=DSM 5851T=HAMBI 2075T=ICMP 11139T=IFO 14780T=JCM 20967T=LMG 6217T=NBRC 14780T=NRRL B-14241T=NRRL B-14594T=PRC 205T). The NCBI RefSeq Assembly accession number for the genome sequence is GCF_001461695.1.

Description of Sinorhizobium garamanticum comb. nov.

Sinorhizobium garamanticum (ga.ra.man’ti.cum. N.L. neut. adj. garamanticum, pertaining to Garamante, Garamantian, the country of Garamantes, from which the strains were isolated). Basonym: Merabet et al. 2010. The description is as provided by Merabet et al. 2010 [62]. S. garamanticum can be differentiated from other species of the genus by phylogenetic analysis based on several housekeeping (recA, glnA, gltA, thrC and atpD) genes and 16S rRNA gene sequencing. The genomic G+C content of the type strain is approximately 62.4 mol% (HPLC). The type strain is ORS 1400T (=CIP 109916T=LMG 24692T).

Description of Sinorhizobium glycinis comb. nov.

Sinorhizobium glycinis (gly.ci’nis. N.L. gen. n. glycinis, of the botanical genus Glycine, the soybean, named for its nodulation characteristics and symbiotic genes). Basonym: Yan et al. 2016. The description is as provided by Yan et al. 2016 [63]. S. glycinis can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 62.4 mol%. Its approximate genome size is 6.04 Mbp. The type strain is CCBAU 23380T (=HAMBI 3645T=LMG 29231T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_001651865.1. (kos.ti.en’se. N.L. neut. adj. kostiense, pertaining to Kosti, the region in Sudan where most of these organisms have been isolated). Homotypic synonym: (Nick et al. 1999) Young 2003. The description is as provided by Nick et al. 1999 [64]. can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 61.7 mol%. Its approximate genome size is 6.33 Mbp. The type strain is DSM 13372T (=ATCC BAA-227T=HAMBI 1489T=LMG 15613T=LMG 19227T=NBRC 100382T=TTR 15T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_017874595.1.

Emended description of Wei et al. 2002

(kum.me.ro’wi.ae. N.L. gen. fem. n. kummerowiae, of Kummerowia, a genus of leguminous plants, referring to the host from which the bacterium was isolated). Homotypic synonym: (Wei et al. 2002) Young 2003. The description is as provided by Wei et al. 2002 [65]. can be differentiated from other species of the genus by phylogenetic analysis based on 16S rRNA gene sequences. The genomic G+C content of the type strain is approximately 61.6 mol% (Tm). The type strain is CCBAU 71714T (=CGMCC 1.3046T=CIP 108026T=NBRC 100799T).

Emended description of Rome et al. 1996

(me’di.cae. L. gen. fem. n. medicae, of/from lucerne (plant belonging to the genus Medicago). Homotypic synonym: (Rome et al. 1996) Young 2003. The description is as provided by Rome et al. 1996 [66]. can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 61.2 mol%. Its approximate genome size is 6.53 Mbp. The type strain is A 321T (=11-3 21 aT=HAMBI 2306T=ICMP 13798T=LMG 19920T=NBRC 100384T=USDA 1037T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_009599935.1.

Emended description of (Dangeard 1926) de Lajudie et al. 1994

(me.li.lo’ti. N.L. masc. n. Melilotus, generic name of sweet clover; N.L. gen. masc. n. meliloti, of Melilotus). Homotypic synonym: (Dangeard 1926) Young 2003. The description is as provided by de Lajudie et al. 1994 [32]. can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 62.0 mol%. Its approximate genome size is 7.34 Mbp. The type strain is USDA 1002T (=ATCC 9930T=CCUG 27879T=CFBP 5561T=DSM 30135T=HAMBI 2148T=ICMP 12623T=IFO 14782T=JCM 20682T=LMG 6133T=NBRC 14782T=NCAIM B.01520T=NRRL L-45T=NZP 4027T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_009601385.1.

Description of Sinorhizobium mexicanum comb. nov.

Sinorhizobium mexicanum (me.xi.ca’num. N.L. neut. adj. mexicanum, of or belonging to Mexico, where the strains were isolated). Basonym: Lloret et al. 2011. The description is as provided by Lloret et al. 2011 [67]. S. mexicanum can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 61.4 mol%. Its approximate genome size is 7.14 Mbp. The type strain is ITTG R7T (=ATCC BAA-1312T=CFN ER1001T=CIP 109033T=DSM 18446T=HAMBI 2910T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_013488225.1.

Description of Sinorhizobium numidicum comb. nov.

Sinorhizobium numidicum (nu.mi’di.cum. N.L. neut. adj. numidicum, pertaining to the country of Numidia, Numidian, the Roman denomination of the region in North Africa from which the majority of the organisms were isolated). Basonym: Merabet et al. 2010. The description is as provided by Merabet et al. 2010 [62]. S. numidicus can be differentiated from other species of the genus by phylogenetic analysis based on several housekeeping (recA, glnA, gltA, thrC and atpD) genes and 16S rRNA gene sequencing. The genomic G+C content of the type strain is approximately 62.8 mol% (HPLC). The type strain is ORS 1407T (=CIP 109850T=LMG 27395T).

Description of Sinorhizobium psoraleae comb. nov.

Sinorhizobium psoraleae (pso.ra.le’ae N.L. gen. fem. n. psoraleae, of Psoralea, referring to the main host of the species). Basonym: Wang et al. 2013. The description is as provided by Wang et al. 2013 [51]. S. psoraleae can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 61.3 mol%. Its approximate genome size is 7.43 Mbp. The type strain is CCBAU 65732T (=HAMBI 3286T=LMG 26835T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_013283645.1.

Emended description of de Lajudie et al. 1994

(sa’hel.i. N.L. gen. neut. n. saheli, of the Sahel, the region in Africa from which they were isolated). Homotypic synonym: (de Lajudie et al. 1994) Young 2003 emend. Hördt et al. 2020. The description is as provided by Hördt et al. 2020 [5]. can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 63.6 mol%. Its approximate genome size is 5.99 Mbp. The type strain is LMG 7837T (=ATCC 51690T=DSM 11273T=HAMBI 215T=ICMP 13648T=NBRC 100386T=ORS 609T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_001651875.1.

Description of Sinorhizobium shofinae comb. nov.

Sinorhizobium shofinae (sho.fi’nae. N.L. fem. gen. n. shofinae from Shofine, a company name, referring to the fact that the type strain of this species was isolated from root nodule of soybean grown in the farm of the company, Shandong Shofine Seed Technology Company Ltd., located in Jiaxiang County, Shandong Province of China). Basonym: Chen et al. 2017 emend. Hördt et al. 2020. The description is as provided by Hördt et al. 2020 [5]. S. shofinae can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 59.9 mol%. Its approximate genome size is 6.21 Mbp. The type strain is CCBAU 251167T (=HAMBI 3507T=LMG 29645T=ACCC 19939T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_001704765.1.

Description of Sinorhizobium sojae comb. nov.

Sinorhizobium sojae (so'ja.e. N.L. gen. n. sojae, of soja, of soybean, referring to the source of the first isolates). Basonym: Li et al. 2011 emend. Hördt et al. 2020. The description is as provided by Hördt et al. 2020 [5]. S. sojae can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 60.9 mol%. Its approximate genome size is 6.09 Mbp. The type strain is CCBAU 5684T (=DSM 26426T=HAMBI 3098T=LMG 25493T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_000261485.1. [te’ran.gae. N.L. n. terengae, hospitality (from West African Wolof n. terenga, hospitality); N.L. gen. n. terangae, of hospitality, intended to mean that this species is isolated from different host plants]. Homotypic synonym: (de Lajudie et al. 1994) Young 2003. The description is as provided by de Lajudie et al. 1994 [32]. can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 61.4 mol%. Its approximate genome size is 6.79 Mbp. The type strain is SEMIA 6460T (=ATCC 51692T=DSM 11282T=HAMBI 220T=ICMP 13649T=LMG 7834T=NBRC 100385T=ORS 1009T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_014197705.1.

Description of Endobacterium yantingense comb. nov.

Endobacterium yantingense (yan. ting. en’se. N.L. neut. adj. yantingense referring to Yanting district, Sichuan Province, PR China, where the organism was isolated). Basonym: Chen et al. 2015. The description is as provided by Chen et al. 2015 [68]. E. yantingense can be differentiated from another species of the genus ( corrig. Menéndez et al. 2021) based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 59.5 mol%. Its approximate genome size is 5.82 Mbp. The type strain is H66T (=CCTCC AB 2014007T=LMG 28229T), which was isolated from the surfaces of weathered rock (purple siltstone) in Yanting (Sichuan, PR China). The JGI IMG accession number for the genome sequence is Ga0196656.

Description of Neorhizobium petrolearium comb. nov.

Neorhizobium petrolearium (pe.tro.le.a’ri.um. L. fem. n. petra, rock; L. neut. olearium related to oil, of or belonging to oil; N.L. neut. adj. petrolearium related to mineral oil). Basonym: Zhang et al. 2012. The description is as provided by Zhang et al. 2012 [69]. N. petrolearium can be differentiated from another species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 60.5 mol%. Its approximate genome size is 6.97 Mbp. The type strain, SL-1T (=ACCC 11238T=KCTC 23288T), was isolated from petroleum-contaminated sludge samples in Shengli oilfield, Shandong Province, China. The JGI IMG accession number for the genome sequence is Ga0196653.

Description of Pararhizobium arenae comb. nov.

Pararhizobium arenae (a.re’nae. L. fem. gen. n. arenae of sand, the isolation source of the type strain). Basonym: Zhang et al. 2017. The description is as provided by Zhang et al. 2017 [70]. P. arenae can be differentiated from another species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 59.8 mol%. Its approximate genome size is 4.94 Mbp. The type strain is MIM27T (=KCTC 52299T=MCCC 1K03215T), isolated from sand of the Mu Us Desert, PR China. The NCBI RefSeq Assembly accession number for the genome sequence is GCF_001931685.1.

Description of Peteryoungia aggregata comb. nov.

Peteryoungia aggregata (ag.gre.ga’ta. L. fem. part. adj. aggregata, joined together, referring to the frequent formation of rosettes). Basonym: Kaur et al. 2011 [71]. Homotypic synonym: Hirsch and Muller 1986 [72]. The description is as provided by Kaur et al. 2011 [71]. P. aggregata can be differentiated from another species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 62.7 mol%. Its approximate genome size is 4.81 Mbp. The type strain is IFAM 1003T (= DSM 1111T=ATCC 43293T). The JGI IMG accession number for the genome sequence is Ga0196658.

Description of Pseudorhizobium tarimense comb. nov.

Pseudorhizobium tarimense (ta.rim.en’se. N.L. neut. adj. tarimense, pertaining to Tarim basin in Xinjiang Uyghur autonomous region of China, where the type strain was isolated). Basonym: Turdahon et al. 2013. The description is as provided by Turdahon et al. 2013 [73]. P. tarimense can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 61.2 mol%. Its approximate genome size is 4.83 Mbp. The type strain is CCTCC AB 2011011T (=NRRL B-59556T=PL-41T). The JGI IMG accession number for the genome sequence is Ga0196649.

Description of Mycoplana azooxidifex comb. nov.

Mycoplana azooxidifex [a.zo.o.xi’di.fex. N.L. neut. n. azooxidum, dinitrogenmonoxide; L. masc. suff. -fex, the maker; N.L. masc. n. azooxidifex, the dinitrogenmonoxide maker (nominative in apposition)]. Basonym: Behrendt et al. 2016. The description is as provided by Behrendt et al. 2016 [74]. M. azooxidifex can be differentiated from other species of the genus based on OGRI calculations (ANI and dDDH). The genomic G+C content of the type strain is 64.3 mol%. Its approximate genome size is 5.89 Mbp. The type strain is DSM 100211T (=Po 20/26T=LMG 28788T). The NCBI RefSeq assembly accession number for the genome sequence is GCF_014196765.1. Click here for additional data file. Click here for additional data file.

65 in total

1. Proposal to include the rank of phylum in the International Code of Nomenclature of Prokaryotes.

Authors: Aharon Oren; Milton S da Costa; George M Garrity; Fred A Rainey; Ramon Rosselló-Móra; Bernhard Schink; Iain Sutcliffe; Martha E Trujillo; William B Whitman
Journal: Int J Syst Evol Microbiol Date: 2015-11 Impact factor: 2.747

2. Phylogenomics reveals the basis of adaptation of Pseudorhizobium species to extreme environments and supports a taxonomic revision of the genus.

Authors: Florent Lassalle; Seyed M M Dastgheib; Fang-Jie Zhao; Jun Zhang; Susanne Verbarg; Anja Frühling; Henner Brinkmann; Thomas H Osborne; Johannes Sikorski; Francois Balloux; Xavier Didelot; Joanne M Santini; Jörn Petersen
Journal: Syst Appl Microbiol Date: 2020-12-15 Impact factor: 4.022

3. The correct name of the taxon that contains the type strain of Rhodococcus equi.

Authors: B J Tindall
Journal: Int J Syst Evol Microbiol Date: 2014-01 Impact factor: 2.747

4. Valid publication of new names and new combinations effectively published outside the IJSEM.

Authors: Aharon Oren; George M Garrity
Journal: Int J Syst Evol Microbiol Date: 2021-08 Impact factor: 2.747

5. Rhizobium rosettiformans sp. nov., isolated from a hexachlorocyclohexane dump site, and reclassification of Blastobacter aggregatus Hirsch and Muller 1986 as Rhizobium aggregatum comb. nov.

Authors: Jaspreet Kaur; Mansi Verma; Rup Lal
Journal: Int J Syst Evol Microbiol Date: 2010-06-28 Impact factor: 2.747

6. Rhizobium arenae sp. nov., isolated from the sand of Desert Mu Us, China.

Authors: Shengnan Zhang; Shanshan Yang; Wei Chen; Yong Chen; Mingjuan Zhang; Xinai Zhou; Guohua Fan; Fu Ying Feng
Journal: Int J Syst Evol Microbiol Date: 2017-06-26 Impact factor: 2.747

7. Advantages of multilocus sequence analysis for taxonomic studies: a case study using 10 housekeeping genes in the genus Ensifer (including former Sinorhizobium).

Authors: Miet Martens; Peter Dawyndt; Renata Coopman; Monique Gillis; Paul De Vos; Anne Willems
Journal: Int J Syst Evol Microbiol Date: 2008-01 Impact factor: 2.747

8. Multilocus sequence analysis of root nodule isolates from Lotus arabicus (Senegal), Lotus creticus, Argyrolobium uniflorum and Medicago sativa (Tunisia) and description of Ensifer numidicus sp. nov. and Ensifer garamanticus sp. nov.

Authors: C Merabet; M Martens; M Mahdhi; F Zakhia; A Sy; C Le Roux; O Domergue; R Coopman; A Bekki; M Mars; A Willems; P de Lajudie
Journal: Int J Syst Evol Microbiol Date: 2009-08-05 Impact factor: 2.747

9. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.

Authors: Lam-Tung Nguyen; Heiko A Schmidt; Arndt von Haeseler; Bui Quang Minh
Journal: Mol Biol Evol Date: 2014-11-03 Impact factor: 16.240

10. GET_PHYLOMARKERS, a Software Package to Select Optimal Orthologous Clusters for Phylogenomics and Inferring Pan-Genome Phylogenies, Used for a Critical Geno-Taxonomic Revision of the Genus Stenotrophomonas.

Authors: Pablo Vinuesa; Luz E Ochoa-Sánchez; Bruno Contreras-Moreira
Journal: Front Microbiol Date: 2018-05-01 Impact factor: 5.640

3 in total

Review 1. New Insights into the Taxonomy of Bacteria in the Genomic Era and a Case Study with Rhizobia.

Authors: Luisa Caroline Ferraz Helene; Milena Serenato Klepa; Mariangela Hungria
Journal: Int J Microbiol Date: 2022-05-21

2. Taxonomy of Rhizobiaceae revisited: proposal of a new framework for genus delimitation.

Authors: Nemanja Kuzmanović; Camilla Fagorzi; Alessio Mengoni; Florent Lassalle; George C diCenzo
Journal: Int J Syst Evol Microbiol Date: 2022-03 Impact factor: 2.689

3. The two-component regulatory system CenK-CenR regulates expression of a previously uncharacterized protein required for salinity and oxidative stress tolerance in Sinorhizobium meliloti.

Authors: Eukene O Bensig; Cecilio Valadez-Cano; ZiYu Kuang; Isabela R Freire; Adrian Reyes-Prieto; Shawn R MacLellan
Journal: Front Microbiol Date: 2022-09-30 Impact factor: 6.064

3 in total