Literature DB >> 30809202

Complete Genome Sequence of 3-Chlorobenzoate-Degrading Bacterium Cupriavidus necator NH9 and Reclassification of the Strains of the Genera Cupriavidus and Ralstonia Based on Phylogenetic and Whole-Genome Sequence Analyses.

Ryota Moriuchi1,2, Hideo Dohra1, Yu Kanesaki1, Naoto Ogawa2,3.   

Abstract

Cupriavidus necator NH9, a 3-chlorobenzoate (3-CB)-degrading bacterium, was isolated from soil in Japan. In this study, the complete genome sequence of NH9 was obtained via PacBio long-read sequencing to better understand the genetic components contributing to the strain's ability to degrade aromatic compounds, including 3-CB. The genome of NH9 comprised two circular chromosomes (4.3 and 3.4 Mb) and two circular plasmids (427 and 77 kb) containing 7,290 coding sequences, 15 rRNA and 68 tRNA genes. Kyoto Encyclopedia of Genes and Genomes pathway analysis of the protein-coding sequences in NH9 revealed a capacity to completely degrade benzoate, 2-, 3-, or 4-hydroxybenzoate, 2,3-, 2,5-, or 3,4-dihydroxybenzoate, benzoylformate, and benzonitrile. To validate the identification of NH9, phylogenetic analyses (16S rRNA sequence-based tree and multilocus sequence analysis) and whole-genome sequence analyses (average nucleotide identity, percentage of conserved proteins, and tetra-nucleotide analyses) were performed, confirming that NH9 is a C. necator strain. Over the course of our investigation, we noticed inconsistencies in the classification of several strains that were supposed to belong to the two closely-related genera Cupriavidus and Ralstonia. As a result of whole-genome sequence analysis of 46 Cupriavidus strains and 104 Ralstonia strains, we propose that the taxonomic classification of 41 of the 150 strains should be changed. Our results provide a clear delineation of the two genera based on genome sequences, thus allowing taxonomic identification of strains belonging to these two genera.

Entities:  

Keywords:  ANI (average nucleotide identity); Cupriavidus; Ralstonia; TNA (tetra-nucleotide analysis); aromatic degradation; reclassification

Year:  2019        PMID: 30809202      PMCID: PMC6379261          DOI: 10.3389/fmicb.2019.00133

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


Introduction

The Gram-negative bacterial genera Cupriavidus and Ralstonia belong to the family Burkholderiaceae and the class β-proteobacteria. The two genera are closely related and have a complex taxonomic history, which was addressed by Yabuuchi et al. (1995) and Vandamme and Coenye (2004). The genus Cupriavidus was established in 2004 (Vandamme and Coenye, 2004), with members of this genus isolated from a variety of environments, including soil (Poehlein et al., 2011), ground water (Ray et al., 2015), activated sludge (Shafie et al., 2017), root nodules (Amadou et al., 2008), spacecraft-associated environments (Monsieurs et al., 2014), and human clinical specimens (Monsieurs et al., 2013). These divergent ecological niches explain the diversity of the genus, which currently comprises 17 species (http://www.bacterio.net/cupriavidus.html). To date, the genomes of a variety of Cupriavidus species have been sequenced, and show several common features. In particular, all Cupriavidus species examined have multi-replicon genomes, often including large plasmids, containing metal resistance genes and genes involved in the biodegradation of persistent aromatic compounds (Amadou et al., 2008; Janssen et al., 2010; Lykidis et al., 2010; Poehlein et al., 2011; Ray et al., 2015; Suenaga et al., 2015; Wang X. et al., 2015; Fang et al., 2016; Shafie et al., 2017). As halogenated or non-halogenated aromatic compounds are abundant in the environment as pollutants (e.g., chlorobenzenes and polychlorinated biphenyls, PCBs), understanding the degradation for these recalcitrant aromatics by microorganisms is of great interest for characterizing the behavior of soil-dwelling microorganisms and for the development of novel bioremediation processes (Reineke and Knackmuss, 1988). Cupriavidus necator NH9 (formerly known as Alcaligenes eutrophus or Ralstonia eutropha) was isolated from a soil sample of the ground near a building of National Institute for Agro-Environmental Sciences (currently, Institute for Agro-Environmental Sciences, NARO) of Tsukuba city, Japan by using 3-chlorobenzoate (3-CB) as a sole source of carbon and energy (Ogawa and Miyashita, 1995). In strain NH9, 3-CB is thought to be first converted to 3- or 4-chlorocatechols by chromosomally-encoded enzymes. The resultant chlorocatechols are converted to β-ketoadipate, a central metabolite of soil bacteria, by the enzymes of the chlorocatechol ortho-cleavage pathway. These enzymes of strain NH9 are encoded by the cbnABCD genes, which are contained on plasmid pENH91 (Ogawa and Miyashita, 1995, 1999). Chlorocatechols are key intermediate metabolites in the aerobic microbial degradation pathways of various chlorinated aromatic compounds (Reineke, 1998). The genes for degradation of chlorocatechols are often encoded on large plasmids. For example, the tfdCDEF, clcABDE, and tcbCDEF genes encoding enzymes of chlorocatechol ortho-cleavage pathway are carried on the plasmids pJP4 of Cupriavidus pinatubonensis JMP134 (Don et al., 1985), pAC27 of Pseudomonas putida AC866 (Frantz and Chakrabarty, 1987; Kasberg et al., 1997) and pP51 of Pseudomonas sp. P51 (van der Meer et al., 1991), respectively. Accordingly, the genes encoding chlorocatechol ortho-cleavage pathway enzymes could spread beyond boundaries of bacterial species. In addition to their simple structures, the production of chlorocatechols as intermediates makes chlorobenzoates suitable model substrate compounds for the study of microbial degradation of chlorinated aromatics (Morimoto et al., 2005). Moreover, chlorobenzoates themselves are the intermediate products of the degradation of PCBs (Reineke and Knackmuss, 1988). In Comamonas testosterone BR60 (formerly Alcaligenes sp. BR60), 3-CB is known to be converted to 5-chloroprotocatechuate or protocatechuate by the products of the cbaABC genes and further metabolized via protocatechuate meta-ring fission pathway (Nakatsu et al., 1997). Several critical features of the ability of strain NH9 to degrade 3-CB have been characterized by analyses of the substrate specificity and application of chlorocatechol 1,2-dioxygenase (CbnA) (Liu et al., 2005; Ohmiya et al., 2009), and by biochemical and structural analyses of CbnR, a LysR-type transcriptional regulator controlling the expression of the cbnABCD genes (Moriuchi et al., 2017; Koentjoro et al., 2018). While these studies have been beneficial for gaining knowledge on both basic and applied aspects of the degradation ability of NH9 or its enzymes, genomic analysis of NH9 would provide further insights into the genes involved in the catabolism of aromatic compounds by this strain. In the course of our analysis of the genetic characteristics of the strain NH9 to degrade aromatic compounds in comparison with related bacterial strains, we noticed inconsistency of phylogenetic identification of several strains of the genera Cupriavidus and Ralstonia, which is the genus most close to Cupriavidus. Thus, in order to precisely understand genetic characteristic of the strain NH9 among phylogenetically related bacteria, accurate taxonomic identification of NH9 is required. The genus Ralstonia was first established by Yabuuchi et al. in 1995 to accommodate several misplaced species, including A. eutrophus (currently the genus Cupriavidus), Burkholderia pickettii, and B. solanacearum (Yabuuchi et al., 1995). As of May 2018, genome data of 104 strains that belong to four Ralstonia species have been deposited in the GenBank database. Ralstonia solanacearum, the most sequenced species, is an important phytopathogen that causes bacterial wilt in a variety of economically important crops (Hayward, 1991). R. solanacearum strains are divided into four phylotypes based on their geographic origins: Asia (phylotype I), America (IIA and IIB), Africa (III), and Indonesia-Japan (IV) (Castillo and Greenberg, 2007; Safni et al., 2014). The remaining three Ralstonia species, Ralstonia pickettii, R. insidiosa, and R. mannitolilytica, are commonly found in moist environments (e.g., water and soil) and are opportunistic human pathogens (Ryan and Adley, 2014). R. pickettii also has the capacity to degrade many toxic substances and, like Cupriavidus strains, is found in diverse habitats (Ryan et al., 2007). Because of the decreasing cost of genome sequencing, a growing number of Cupriavidus and Ralstonia genomes are being sequenced and deposited in public databases. However, taxonomic problems have arisen at the species and genus levels because of the diversity and complex taxonomic history of these two closely related genera. While several studies aimed at inferring the phylogeny of R. solanacearum have been performed (Prior et al., 2016; Zhang and Qiu, 2016), the phylogenetic relationships between the genera Cupriavidus and Ralstonia have never been elucidated. In this study, the complete genome sequence of C. necator NH9 was revealed using PacBio long-reads-based sequencing, allowing us to infer its capacity for the degradation of aromatic compounds. The phylogenetic relationships between Cupriavidus and Ralstonia were also investigated based on whole-genome sequences. Overall, our findings provide a detailed and well-supported description of the phylogenetic relationships between these two genera.

Materials and Methods

Genomic DNA Extraction and Genome Sequencing and Assembly

C. necator NH9 was cultured in basal salts medium (Ogawa and Miyashita, 1995) containing 5 mM 3-CB as the sole source of carbon and energy at 30°C. NH9 genomic DNA was extracted using a DNeasy Blood and Tissue Kit (QIAGEN) and then used as template for whole-genome sequencing via the PacBio RSII system (Pacific Biosciences) by Macrogen Inc. (http://www.macrogen.com), with the resulting assembly confirmed using the MiSeq platform (Illumina) at the Instrumental Research Support Office, Research Institute of Green Science and Technology, Shizuoka University. PacBio RSII sequencing produced 181,370 raw reads, which were filtered using PreAssembler Filter v1 of the RS HGAP Assembly.3 Protocol in SMRT Analysis Software version 2.3.0 (Chin et al., 2013). A minimum polymerase read quality cut-off of 0.75 and a minimum subread length of 7.5 kb were used. We obtained a total of 86,406 filtered subreads, with an N50 read length of 12,367 bp and a max read length of 41,609 bp, resulting in 1,050,061,719 bp of sequence with ~127-fold coverage. These high quality subreads were then de novo assembled using HGAP.3 (Chin et al., 2013) with a minimum seed read length of 15 kb. The resulting four contigs were polished using AssemblyPolishing v1 Quiver (RS HGAP Assembly.3 Protocol) and Arrow (https://github.com/PacificBiosciences/GenomicConsensus), and then closed using Circlator version 1.1.1 (Hunt et al., 2015). To identify errors in the final PacBio assembled contigs, Illumina sequencing data were also collected. A paired-end library was constructed for MiSeq sequencing using a KAPA HyperPlus Kit (KAPA BIOSYSTEMS), resulting in 3,436,955 paired-end reads (2 × 301 bp). Low-quality reads (quality score, < Q15), adapter sequences, reads < 150 bp, and the terminal 301 bases were filtered using Trimmomatic version 0.33 (Bolger et al., 2014), yielding 2,305,131 paired reads corresponding to a coverage of ~138-fold. These high-quality short reads were aligned against the four polished circular contigs using BWA-MEM (Li, 2013) and manually checked using Integrative Genomics Viewer (Thorvaldsdottir et al., 2013). When an error was identified, the relevant position was manually curated. The final complete genome sequence of NH9 has been deposited in DDBJ/ENA/GenBank under accession numbers CP017757–CP017760.

Genome Annotation

Four complete genome sequences of NH9 were annotated using the NCBI Prokaryotic Genome Automatic Annotation Pipeline (PGAAP) (Tatusova et al., 2016), the Microbial Genome Annotation Pipeline (MiGAP, http://www.migap.org), and Prokka software version 1.11 (Seemann, 2014). PGAAP annotation data was manually curated with respect to start codon position and missing genes by referencing it against the MiGAP and Prokka annotation data with the aid of GenomeMatcher (Ohtsubo et al., 2008), Geneious software version 11.0.4 (Kearse et al., 2012), BLASTP analysis (Altschul et al., 1997), and InterProScan (Jones et al., 2014). All putative proteins identified in the NH9 genome were functionally classified based on Clusters of Orthologous Groups (COG) analysis using RPS-BLAST (Altschul et al., 1997). BlastKOALA (Kanehisa et al., 2016a) was used for functional characterization of the NH9 complete genome to reconstruct aromatic compound degradation pathways using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database (Kanehisa et al., 2016b).

Genome Sequence Data Collection

All genome sequence data for the Cupriavidus and Ralstonia strains used in this study were obtained from the assembly summary report file (ftp://ftp.ncbi.nlm.nih.gov/genomes/ASSEMBLY_REPORTS/assembly_summary_refseq.txt). R. pickettii DTP0602 was also added manually. R. solanacearum BBAC-C1 was removed from all analyses because of low genome sequence coverage that adversely affected results. Complete or draft genome sequences, GenBank files, protein coding sequences, amino acid sequences, and RNA gene sequences for the 46 Cupriavidus and 104 Ralstonia named strains selected for analysis were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/) in May 2018.

16S rRNA and Multilocus Sequence Analysis

Phylogenetic analysis was performed using MEGA software version 7.0 (Kumar et al., 2016). For the 16S rRNA gene-based phylogenetic analysis, alignments were carried out using ClustalW and analysis was performed using the maximum likelihood method and the GTR + G substitution model. In addition to the genome-sequenced strains, the 16S rRNA genes of several Cupriavidus and Ralstonia type strains (Cupriavidus basilensis, C. gilardii, C. oxalaticus, C. pauculus, C. pinatubonensis, R. insidiosa, and R. mannitolilytica) were downloaded from the NCBI database. As only partial 16S rRNA gene sequences were available for Cupriavidus metallidurans NE12, C. oxalaticus NBRC 13593, Cupriavidus taiwanensis STM 6018, C. taiwanensis STM 6070, Cupriavidus sp. amp6, Cupriavidus sp. GA3-3, Cupriavidus sp. IDO, Cupriavidus sp. UYPR2.512, R. solanacearum P673, R. solanacearum Y45, and R. solanacearum Rs-10-244, these strains were not included in the analysis. Following alignment, all gaps were eliminated, resulting in a shared 1,386-bp sequence for the final analysis. The 16S rRNA gene sequence of Paraburkholderia xenovorans LB400 (Sawana et al., 2014) was used as the outgroup for the analysis. To evaluate the phylogenetic tree topology, a bootstrap analysis of 1,000 replicates was performed. For the multilocus sequence analysis (MLSA), we screened for the presence of several single-copy housekeeping genes in the genomes of the Cupriavidus and Ralstonia strains. As a result, we found that the following four genes were present in all of the strains except Cupriavidus sp. SK-3, thus the four genes were used for MLSA: atpD (β-subunit of ATP synthase F0F1 gene), leuS (leucine-tRNA ligase gene), rplB (50S ribosomal protein L2 gene), and gyrB (β-subunit of DNA gyrase gene). Multiple alignments were performed with respect to each gene using ClustalW, and all positions containing gaps or missing data were excluded. All aligned genes were then concatenated in the following order: atpD-leuS-rplB-gyrB. The final lengths of each gene and the complete concatenated sequence were: atpD, 1,278 bp; leuS, 2,571 bp; rplB, 822 bp; gyrB, 1,662 bp; concatenated sequence, 6,333 bp. Maximum likelihood analysis using the GTR + G substitution model was performed with 1,000 bootstrap replicates. The corresponding P. xenovorans LB400 gene sequences were used as the outgroup for the analysis.

Whole-Genome Comparisons

Average nucleotide identity (ANI) (Goris et al., 2007) and percentage of conserved proteins (POCP) (Qin et al., 2014) analyses were used to compare whole-genome sequences. The ANI value, resulting from the mean identity of BLASTN matches between the virtually-fragmented query and reference genomes, was calculated using ani.rb script from the enveomics collection (Rodriguez and Konstantinidis, 2016) with default settings. POCP was used to identify conserved proteins between a pair of genomes using BLASTP analysis and to provide accurate genus cut-off values. To calculate POCP values, a POCP script developed by Harris et al. (2017) was used with the following parameters: E-value < 1e−5, sequence identity ≥ 40%, and alignable region of the query protein sequences ≥ 50%. A dendrogram was constructed based on the Unweighted Pair Group Method with Arithmetic Mean clustering method with a distance of (1 – ANI) in R program version 3.4.4 (https://www.r-project.org/).

Tetra-Nucleotide Analysis (TNA)

The tetra-nucleotide frequencies of all Cupriavidus and Ralstonia genome sequences were calculated using the compseq program from the EMBOSS package (http://emboss.sourceforge.net/apps/cvs/emboss/apps/compseq.html). Results of TNA were visualized by generating a three-dimensional plot of principal component analysis (PCA) in R package rgl. The frequencies of all 256 possible tetra-nucleotides were used as input for PCA.

Results and Discussion

General Properties and Structure of the NH9 Genome

Genome statistics are presented in Table 1 and a circular genome map is depicted in Figure 1. The genome of C. necator strain NH9 comprises two circular chromosomes (Chr), Chr1 (4,347,557 bp, 65.8% G+C) (Figure 1A) and Chr2 (3,395,604 bp, 65.5% G+C) (Figure 1B), along with two circular plasmids, pENH91 (77,172 bp, 64.2% G+C) (Figure 1C) and pENH92 (426,602 bp, 61.8% G+C) (Figure 1D). A total of 7,290 coding sequences (CDSs) were predicted by homology analysis. The NH9 genome contained 68 tRNA genes, two of which were located on pENH92. Chr1 and Chr2 contained three and two rRNA gene operons (5S, 16S, and 23S rRNA genes), respectively, while Chr1 also had one tmRNA and three ncRNAs. Although the G+C contents of all replicons were similar, those of the two plasmids, particularly pENH92, were lower than those of the two chromosomes (Table 1).
Table 1

General properties of the C. necator NH9 genome.

Chromosome 1Chromosome 2pENH91pENH92Genome
Accession numberCP017757CP017758CP017760CP017759
Sequence length (bp)4,347,5573,395,60477,172426,6028,246,935
GC (%)65.865.564.261.865.5
tRNA genes5790268
tmRNA genes10001
ncRNA genes30003
rRNA gene operons32005
Coding (%)87.386.792.285.287.0
Total number of CDSs3,9122,932783687,290
CDSs assigned to COG3,166 (80.9%)2,300 (78.4%)46 (59.0%)249 (67.7%)5,761 (79.0%)
CDSs assigned to KEGG Ontology2,828 (72.3%)1,901 (64.8%)52 (66.7%)236 (64.1%)5,017 (68.8%)
Figure 1

Schematic representation of the four replicons of NH9. Circular maps of chromosome 1 (A), chromosome 2 (B), pENH91 (C), and pENH92 (D) are shown. The circles represent (from the inside): 1, GC skew; 2, GC content; 3, RNA genes (except for pENH91); 4, Clusters of Orthologous Groups (COG) assignments for coding sequences (CDSs) on the reverse strand; 5, COG assignments for CDSs on the forward strand; 6, scale in Mb or kb. Note that maps are not drawn to scale relative to the sizes of each replicon, very short RNA genes are enlarged to enhance visibility, and pENH91 does not contain any RNA genes.

General properties of the C. necator NH9 genome. Schematic representation of the four replicons of NH9. Circular maps of chromosome 1 (A), chromosome 2 (B), pENH91 (C), and pENH92 (D) are shown. The circles represent (from the inside): 1, GC skew; 2, GC content; 3, RNA genes (except for pENH91); 4, Clusters of Orthologous Groups (COG) assignments for coding sequences (CDSs) on the reverse strand; 5, COG assignments for CDSs on the forward strand; 6, scale in Mb or kb. Note that maps are not drawn to scale relative to the sizes of each replicon, very short RNA genes are enlarged to enhance visibility, and pENH91 does not contain any RNA genes. To analyze the functional content of the genome and the distribution of CDSs across the replicons, COG functional classification analysis was conducted for all proteins in the NH9 genome (Figure 2). In addition, the percentage of proteins assigned to COG categories in each replicon was compared between chromosomes and between plasmids (Table S1). A significant difference in functionality was observed between the replicons, with the main chromosome, Chr1, encoding proteins responsible for core cellular functions, including protein processing (class O), translational machinery (class J), DNA replication and repair (class L), amino acid metabolism (class E), and nucleotide metabolism (class F). In comparison, the smaller chromosome, Chr2, showed a functional bias toward cell motility (class N), transcription (class K), and energy metabolism (classes C, I, and Q), indicating that proteins encoded on Chr2 are mainly related to adaptation and survival. These biases are similar to those observed in other Cupriavidus genomes (Janssen et al., 2010; Wang X. et al., 2015). As expected, the two plasmids coded for a higher percentage of proteins involved in partitioning (class D) and plasmid replication (class L), as well as proteins of unknown function (class -), compared with the chromosomes (Figure 2 and Table S1), as has been reported previously (Leplae et al., 2006). The smaller plasmid, pENH91, uniquely coded for proteins involved in intracellular trafficking and secretion (class U), while the larger plasmid, pENH92, was not associated with any significantly different protein functions, but did show a functional bias toward energy metabolism, including amino acid, nucleotide, carbohydrate, coenzyme, and lipid metabolism (classes C, E, F, G, H, and I).
Figure 2

Functional classification of proteins encoded on each replicon of the NH9 genome based on Clusters of Orthologous Groups (COG) categories. COG categories are shown on the horizontal axis, with the percentage of proteins belonging to each category for each replicon plotted on the vertical axis (percentages >25% are not shown). *p < 0.05 and **p < 0.01, respectively, as determined by Fisher's test with false discovery rate adjustments. COG functional annotations and specific values are summarized in Table S1.

Functional classification of proteins encoded on each replicon of the NH9 genome based on Clusters of Orthologous Groups (COG) categories. COG categories are shown on the horizontal axis, with the percentage of proteins belonging to each category for each replicon plotted on the vertical axis (percentages >25% are not shown). *p < 0.05 and **p < 0.01, respectively, as determined by Fisher's test with false discovery rate adjustments. COG functional annotations and specific values are summarized in Table S1. A dnaA homolog and three DnaA boxes [TT(A/T)TNCACA] (Schaper and Messer, 1995) with no or only one mismatch compared with the consensus were located around the putative replication origin (oriV) of Chr1 (Figure S1A). Chr2 contained both repA and parAB homologs and 20 iteron-like repeats that may be involved in RepA binding [CGC(A/T)GA(A/T)(A/T)(C/T)(A/C/G)GGT(A/T)CG(C/G) consensus sequences] (Figure S1B), indicating that the partitioning system of Chr2 may be plasmid-like. Chr2 also contained three DnaA boxes with only one mismatch compared with the consensus. These chromosomal replication initiation systems were identical to those of other Cupriavidus strains such as C. necator H16 (Pohlmann et al., 2006), C. metallidurans CH34 (Janssen et al., 2010), and C. gilardii CR3 (Wang X. et al., 2015). Plasmid pENH91 harbored a trfA gene with 100% amino acid sequence identity to that of incompatibility (Inc) P-1β plasmid pA81 of Achromobacter xylosoxidans A8 (Jencova et al., 2008) (Figure S1C). Because Inc plasmid groups are classified based on the amino acid sequence of the replication initiation protein (Shintani et al., 2015), we propose that pENH91 is a member of the IncP-1β plasmid family. Using a BLASTN analysis of ArcWithColor (Ohtsubo et al., 2008) with the following parameters: wordsize, 21; E-value, < 1e−5; and filter query sequence, F, we determined that 74,985 bp (97.2%) of the 77,172-bp pENH91 nucleotide sequence showed 100% identity to the sequence of the corresponding part of pA81 (98,192 bp). In our previous study, we proposed that pENH91 is an IncP-1 group plasmid based on incompatibility test results (Ogawa and Miyashita, 1995). Therefore, the results of the current study support the previous classification of pENH91 as an IncP-1 plasmid. Five putative TrfA-binding sites (iterons) (Norberg et al., 2014) [(A/C/G)(A/C/T)GCCCC(C/T)CA(A/T)GTGTCA consensus sequences] were located between hypothetical protein-coding gene (BJN34_0385) and addiction module antitoxin-coding gene (BJN34_37215) (Figure S1C), suggesting that oriV was located in this region. pENH92 contained both repB and parAB and at least three DnaA boxes with no, one, or two mismatches compared with the consensus (Figure S1D). Interestingly, the predicted RepB and ParAB protein sequences showed greater similarity to those of pOLGA1 from Burkholderia sp. OLGA172 (Ricker et al., 2016) and pBN1 from Paraburkholderia aromaticivorans BN5 (Lee and Jeon, 2018) (63.8–82.8% amino acid identity) than to those of pBB1 from C. necator N-1 (Poehlein et al., 2011) and pRALTA from C. taiwanensis LMG19424 (Amadou et al., 2008) (39.3–71.9% amino acid identity). As mobile genetic elements (integrase and transposase) were identified either side of repB and parAB in pENH92 (Figure S1D), this region may have been acquired via horizontal transfer.

Capacity of NH9 to Catabolize Aromatic Compounds

To analyze the ability of NH9 to catabolize recalcitrant compounds other than 3-CB, the metabolic pathway for aromatic compound degradation was reconstructed using the KEGG database. Pathway analysis suggested that NH9 should be able to completely degrade benzoate, 2-, 3-, or 4-hydroxybenzoate, 2,3-, 2,5-, or 3,4-dihydroxybenzoate, benzoylformate, and benzonitrile (Table 2 and Figure 3). The catabolic capacity of NH9 was similar to that of other Cupriavidus strains; however, it lacked the complete set of phenol-degrading genes that are found in a number of other strains (Figure S2 and Table S2).
Table 2

Putative genes of C. necator NH9 involved in degradation of aromatic and relative compounds.

Compound nameGene designationaK numberEC numberDefinitionaAnnotated genesb,c
BenzonitrilenthAK01721EC:4.2.1.84Nitrile hydratase alpha subunitBJN34_31490 (C2)
nthBK20807EC:4.2.1.84Nitrile hydratase beta subunitBJN34_31495 (C2)
amiEK01426EC:3.5.1.4AmidaseBJN34_07485 (C1), 23735 (C2), 30640 (C2), 33905 (C2), 34230 (C2)
BenzoylformatemdlCK01576EC:4.1.1.7Benzoylformate decarboxylaseBJN34_05885 (C1)
mdlDK00141EC:1.2.1.28Benzaldehyde dehydrogenaseBJN34_05890 (C1)
BenzoatebenAK05549EC:1.14.12.10Benzoate 1,2-dioxygenase alpha subunitBJN34_08560 (C1)
benBK05550EC:1.14.12.10Benzoate 1,2-dioxygenase beta subunitBJN34_08565 (C1)
benCK05784EC:1.14.12.10Benzoate 1,2-dioxygenase reductase componentBJN34_08570 (C1)
benDK05783EC:1.3.1.251,6-dihydroxycyclohexa-2,4-diene−1-carboxylate dehydrogenaseBJN34_08575 (C1)
2-hydroxybenzoate(Salicylate)nahGK00480EC:1.14.13.1Salicylate hydroxylaseBJN34_24950 (C2)
3-hydroxybenzoatenagXK22270EC:1.14.13.243-hydroxybenzoate 6-monooxygenaseBJN34_30895 (C2)
nagIK00450EC:1.13.11.4Gentisate 1,2-dioxygenaseBJN34_30910 (C2)
nagLK01801EC:5.2.1.4Maleylpyruvate isomeraseBJN34_02270 (C1)d
nagKK16165EC:3.7.1.20Fumarylpyruvate hydrolaseBJN34_30905 (C2)
4-hydroxybenzoatepobAK00481EC:1.14.13.24-hydroxybenzoate 3-monooxygenaseBJN34_33835 (C2)
pcaGK00448EC:1.13.11.3Protocatechuate 3,4-dioxygenase, alpha subunitBJN34_33855 (C2)
pcaHK00449EC:1.13.11.3Protocatechuate 3,4-dioxygenase, beta subunitBJN34_33860 (C2)
pcaBK01857EC:5.5.1.23-carboxy-cis,cis-muconate cycloisomeraseBJN34_33850 (C2)
pcaLK14727EC:4.1.1.444-carboxymuconolactone decarboxylaseBJN34_33845 (C2)
2,3-dihydroxybenzoatedhbDK14333EC:4.1.1.462,3-dihydroxybenzoate decarboxylaseBJN34_29435 (C2)
3-chlorocatecholcbnAK15253EC:1.13.11.-Chlorocatechol 1,2-dioxygenaseBJN34_37380 (p1)
cbnBK01860EC:5.5.1.7.Chloromuconate cycloisomeraseBJN34_37385 (p1)d
cbnCK01061EC:3.1.1.45Dienelactone hydrolaseBJN34_37395 (p1)
cbnDK00217EC:1.3.1.32Maleylacetate reductaseBJN34_37400 (p1)d
CatecholcatAK03381EC:1.13.11.1Catechol 1,2-dioxygenaseBJN34_08555 (C1), 18150 (C1), 26685 (C2), 28970 (C2), 30600 (C2)
catBK01856EC:5.5.1.1Muconate cycloisomeraseBJN34_24340 (C2)
catCK03464EC:5.3.3.4Muconolactone isomeraseBJN34_29745 (C2)
catDK01055EC:3.1.1.243-oxoadipate enol-lactonaseBJN34_29740 (C2)
3-oxoadipatepcaIK01031EC:2.8.3.63-oxoadipate CoA-transferase, alpha subunitBJN34_21015 (C2)
pcaJK01032EC:2.8.3.63-oxoadipate CoA-transferase, beta subunitBJN34_21020 (C2)
pcaFK00632EC:2.3.1.163-oxoadipyl-CoA thiolaseBJN34_21025 (C2)d

Gene designation and definition from KEGG annotation were manually modified.

Genes listed in this table were manually selected.

Gene locations were shown as below; C1, Chromosome1; C2, Chromosome 2; p1, pENH91.

2nd best hit.

Figure 3

Aromatic compound degradation pathways in NH9. Enzyme-encoding genes are shown by red text, including locus tag(s). 3-chlorobenzoate, benzoate, benzonitrile, and benzoylformate are shaded in yellow, magenta, green, and purple, respectively. Hydroxybenzoate and dihydroxybenzoate are shaded in blue and orange, respectively.

Putative genes of C. necator NH9 involved in degradation of aromatic and relative compounds. Gene designation and definition from KEGG annotation were manually modified. Genes listed in this table were manually selected. Gene locations were shown as below; C1, Chromosome1; C2, Chromosome 2; p1, pENH91. 2nd best hit. Aromatic compound degradation pathways in NH9. Enzyme-encoding genes are shown by red text, including locus tag(s). 3-chlorobenzoate, benzoate, benzonitrile, and benzoylformate are shaded in yellow, magenta, green, and purple, respectively. Hydroxybenzoate and dihydroxybenzoate are shaded in blue and orange, respectively. Genes of NH9 involved in the decomposition of the compounds described above were mostly located on chromosome, especially Chr2, although the ben genes, coding for the key elements of benzoate and 3-CB degradation, were located on Chr1 (Table 2). Almost all of the genes involved in the dissimilation of aromatic compounds were found within clusters, although genes required for the degradation of 3-hydroxybenzoate and catechol were dispersed between Chr1 (nagL and catA) and Chr2 (catAB) (Table 2). BLASTN analysis showed that nagL homolog (BJN34_30900) was located between nagX (BJN34_30895) and nagK (BJN34_30905) on Chr2, indicative of an operon structure. A cat gene cluster containing all the three genes, catA, catB, and catC, for catechol degradation was not observed in the NH9 genome, suggesting that the cat genes were not acquired at the same time. In various Gram-negative bacteria (e.g., P. putida and Pseudomonas resinovorans), cat genes form an operon including catR, a LysR-type transcriptional regulator. Two catR homologs were identified upstream of catA (BJN34_08555) on Chr1 and catB (BJN34_24340) on Chr2, respectively, in the NH9 genome. Because catechol degradation is one of the central pathways in the metabolism of a variety of aromatic compounds (Broderick, 1999), the acquisition of the various cat genes is intriguing from an evolutionary standpoint in terms of the ability of NH9 to degrade aromatic compounds. Interestingly, several putative CatA-coding genes were present on chromosome 1 and 2 (Table 2). Catechol 1,2-dioxygenase, the product of catA, cleaves the aromatic ring of catechol between two hydroxyl groups (intradiol cleavage) (Figure 3). Other homologous ring-cleaving dioxygenase enzymes such as chlorocatechol dioxygenase and protocatechuate 3,4-dioxygenase catalyze similar reactions (Neidle et al., 1988), with this aromatic ring cleavage recognized as a critical step in the complete degradation of chlorinated or non-chlorinated aromatic compounds by soil bacteria (Harwood and Parales, 1996; Reineke, 1998; Broderick, 1999). In summary, strain NH9 shared several putative pathways for aerobic degradation of aromatic compounds with other Cupriavidus strains (Table 2, Figure S2, and Table S2). The characteristic feature of strain NH9 is its ability to degrade 3-CB which is an intermediate metabolite of the degradation of PCBs (Reineke and Knackmuss, 1988). Benzoate, hydroxybenzoates and dihydroxybenzoates are intermediate metabolites of the degradation of plant-derived compounds and polycyclic aromatic hydrocarbons (PAHs) (Seo et al., 2009; Wang J. Y. et al., 2015), suggesting that NH9 may be adapted for catabolism of those simple aromatic compounds in the soil environment.

Phylogenetic Analyses

In order to understand the genetic characteristics of the predicted degradation ability of the strain NH9 among related bacteria, precise taxonomic identification of NH9 was required. The necessity also arose from an intertwined history of the genera Cupriavidus with several other β-proteobacteria. In particular, there have been several taxonomic problems within the genera Cupriavidus and Ralstonia because of their genomic diversity and similarities (Vandamme and Coenye, 2004). For instance, R. pickettii DTP0602 and Ralstonia sp. PBA were proposed to belong to the genus Cupriavidus (Zhang and Qiu, 2016; Kim and Gan, 2017). It is possible that there are other bacteria currently belonging to these two genera that should be reclassified. Previous studies of related bacterial genera suggested relationship between the degradation ability of aromatic compounds and the phylogenetic location (Harwood and Parales, 1996; Perez-Pantoja et al., 2012). Thus, to understand first the distribution of aromatic degradation capabilities in the genus Cupriavidus and Ralstonia, orthologous genes and potential abilities to degrade aromatic compounds in selected Cupriavidus and Ralstonia strains were surveyed (Figure S2 and Table S2). These results indicated that the putative phenotype of degradation ability and taxonomic classification of those strains were not consistent absolutely, although there is a tendency that Ralstonia strains lack genes for degradation of several aromatic compounds which are present in most Cupriavidus strains. The above results led us to examining the genomes of all strains of the genera Cupriavidus and Ralstonia whose complete or draft genome sequences are available from the NCBI database in order to perform genome-based phylogenetic comparison. As a result, we propose reclassification of several strains belonging to these two genera. All Cupriavidus and Ralstonia strains examined in this study are shown in Table 3 and Table S3.
Table 3

Cupriavidus and Ralstonia strains used in this study.

GenusSpeciesStrainPhylotype sequevar# assembly accessionANI clusterTNA cluster
CupriavidusalkaliphilusASC-732aGCF_900094595.13B
basilensis4G11GCF_000832305.17E
KF708GCF_000876015.19C
OR16GCF_000243095.17E
gilardiiCR3GCF_001281465.110I
JZ4GCF_001658125.110I
metalliduransCH34aGCF_000196015.116L
Ni-2GCF_002944765.116L
H1130GCF_000496715.116L
NA1GCF_000709025.116L
NA4GCF_000709045.116L
NBRC 101272GCF_001598775.116L
NDB3NO24GCF_001543455.116L
NE12GCF_000709065.116L
nantongensisX1aGCF_001598055.13B
necatorH16GCF_000009285.11D
N-1aGCF_000219215.11D
NH9GCF_002011925.21D
A5-1GCF_000744095.12G
NBRC 102504GCF_001598755.11D
PHE3-6GCF_001853325.11D
oxalaticusNBRC 13593GCF_001592245.14cBc
pauculusKF709GCF_000974605.115cKc
UM1GCF_002858765.114J
pinatubonensisJMP134GCF_000203875.16cKc
taiwanensisLMG19424aGCF_000069785.13B
STM 6018GCF_000472465.13B
STM6070GCF_000372525.13B
sp.USMAA1020GCF_001854325.111A
USMAA2-4GCF_001854305.111A
USMAHM13GCF_001854285.111A
amp6GCF_000426345.14G
BIS7GCF_000292345.117cKc
D384GCF_001652915.114J
GA3-3GCF_000389805.11D
HMR-1GCF_000319775.116L
HPC(L)GCF_000307735.310I
IDOGCF_000812465.15G
OV038GCF_900112215.112H
OV096GCF_900115455.112H
SHEGCF_000812445.116L
SK-3GCF_000611145.18F
SK-4GCF_000611125.11D
UYPR2.512GCF_000379565.11D
WSGCF_000395345.19C
YR651GCF_900101625.113cKc
RalstoniainsidiosaATCC 49129GCF_001663855.120N
FC1138GCF_001653935.120N
WCHRI065162GCF_002939165.120N
WCHRI065437GCF_002939035.120N
mannitolilyticaSN82F48GCF_000954135.119O
SN83A39GCF_001628775.119O
GML-Rals1-TRGCF_002863525.119O
MRY14-0246GCF_000953875.119O
WCHRM065694GCF_002939115.119O
WCHRM065837GCF_002939145.119O
pickettii12DGCF_000023425.118M
12JGCF_000020205.118M
DTP0602GCF_000471925.12G
FDAARGOS_410GCF_002393485.119O
52GCF_002849525.118M
5_7_47FAAGCF_000165085.118M
ATCC 27511aGCF_000743455.118M
CW2GCF_000607185.118M
H2Cu2GCF_001699795.118M
H2Cu5GCF_001699815.118M
ICMP-8657GCF_002516395.118M
NBRC 102503GCF_001544155.118M
OR214GCF_000372665.118M
SSH4GCF_000607165.118M
solanacearumGMI1000I-18GCF_000009125.123P
CQPS-1IGCF_002220465.123P
EP1IGCF_001891105.123P
FJAT-91IGCF_002155245.123P
FJAT-1458IGCF_001887535.123P
FQY_4IGCF_000348545.123P
Rs-09-161IGCF_000671335.123P
Rs-10-244IGCF_000671315.123P
P781I-14GCF_001644865.123P
UW757I-14GCF_001645725.123P
CaRs-MepIGCF_001855495.123P
SD54IGCF_000430925.223P
Y45IGCF_000223115.123P
UW25aIIA-7GCF_002251695.125P
Grenada 9-1IIA-6GCF_000825845.125P
B50IIA-24GCF_000825785.125P
IBSBF1900IIA-24GCF_001373275.125P
CIP120IIA-38GCF_001644795.125P
P597IIA-38GCF_001644805.125P
UW551IIB-1GCF_002251655.125P
UY031IIB-1GCF_001299555.125P
IBSBF1503IIB-4GCF_001587155.125P
Po82IIB-4GCF_000215325.125P
UW163IIB-4GCF_001587135.125P
CFBP3858IIB-1GCF_001373335.125P
IPO1609IIB-1GCF_001050995.125P
NCPPB 909IIB-1GCF_000710695.125P
POPS2IIB-1GCF_000750585.125P
RS2IIB-1GCF_001373295.125P
UW365IIB-1GCF_001696865.125P
UW491IIB-1GCF_001696845.125P
NCPPB 282IIB-2GCF_000750575.125P
CFBP1416IIB-3GCF_000825925.125P
CIP417IIB-3GCF_000825825.125P
MolK2IIB-3GCF_000212635.325P
CFBP6783IIB-4GCF_001644815.125P
P673IIB-4GCF_000525615.125P
UW179IIB-4GCF_000825805.125P
23-10BRIIB-27GCF_000749995.125P
CFBP7014IIB-59GCF_001373255.125P
CFIA906IIBGCF_000710135.225P
CMR15IIIGCF_000427195.123P
CFBP3059III-48GCF_001644855.123P
PSI07aIVGCF_000283475.124P
KACC10709NAbGCF_001708525.123P
KACC 10722NAbGCF_001586135.124P
OE1-1NAbGCF_001879565.123P
RS 488NAbGCF_002501565.125P
RS 489NAbGCF_002549815.125P
RSCMNAbGCF_002894285.123P
SEPPX05NAbGCF_002162015.123P
YC40-MNAbGCF_001663415.123P
58_RSOLNAbGCF_001065525.120N
GEO_6NAbGCF_002894765.125P
GEO_55NAbGCF_002894845.125P
GEO_57NAbGCF_002029885.125P
GEO_81NAbGCF_002894785.125P
GEO_96NAbGCF_002029895.125P
GEO_99NAbGCF_002029865.125P
GEO_230NAbGCF_002894795.125P
GEO_304NAbGCF_002894775.125P
PSS4NAbGCF_001876985.123P
PSS190NAbGCF_001870825.123P
PSS216NAbGCF_001876975.123P
PSS1308NAbGCF_001870805.123P
RD15NAbGCF_001854265.123P
Rs-T02NAbGCF_001484095.123P
UTT-25NAbGCF_002930085.123P
UW24NAbGCF_001696855.125P
UW181NAbGCF_001373315.125P
sp.25mfcol4.1GCF_900104095.114J
5_2_56FAAGCF_000227255.218M
A12GCF_000801955.122N
AU12-08GCF_000442475.120N
MD27GCF_001078575.121N
NFACC01GCF_900115545.118M
NT80GCF_001485395.120N
PBAGCF_000272025.1NAb, cKc
UNC404CL21ColGCF_000620465.118M
UNCCL144GCF_900099845.118M

Type strains that are proposed in List of prokaryotic names with standing in nomenclature (.

Not assigned.

Contradictory clusterings between ANI analysis and TNA.

Cupriavidus and Ralstonia strains used in this study. Type strains that are proposed in List of prokaryotic names with standing in nomenclature (. Not assigned. Contradictory clusterings between ANI analysis and TNA. Phylogenetic relationships between the genera Cupriavidus and Ralstonia, including strain NH9 and several type strains, were inferred based on 16S rRNA gene sequences (Figure 4). A clear separation of the genus Cupriavidus from the genus Ralstonia could be seen in the phylogenetic tree (Figure 4A). Overall, only Ralstonia sp. PBA was not grouped into either of the clades. The tree generated for the genus Cupriavidus showed a number of clades and included two strains currently classified as Ralstonia (Figure 4B). R. pickettii DTP0602 clustered into the C. necator clade (type strain C. necator N-1) and Ralstonia sp. 25mfcol4.1 was also included in a clade consisting only of Cupriavidus sp. strains. C. pinatubonensis JMP134 and C. pinatubonensis 1245T formed a homogeneous group with very high similarity (99.6%), as previously reported (Sato et al., 2006). Four different species, C. oxalaticus, C. taiwanensis, C. nantongensis, and C. alkaliphilus, were contained in one clade, reflecting the high level of similarity among the 16S rRNA gene sequences of these species (>97.2%). In particular, C. taiwanensis LMG19424T, C. nantongensis X1T, and C. alkaliphilus ASC-732T showed very high sequence similarities (>99.1%), as previously reported (Sun et al., 2016). Strain NH9 shared 99.2% 16S rRNA nucleotide sequence similarity with C. necator N-1T and, as expected, was categorized into the C. necator clade. The genus Ralstonia formed two large clades, one consisting of R. pickettii (type strain R. pickettii ATCC 27511) and R. mannitolilytica and the other comprising R. insidiosa and R. solanacearum (type strain R. solanacearum UW25) (Figure 4C). However, the R. solanacearum clade exhibited aberrant branching; for example, strains CMR15 and CFBP3059, which belong to phylotype III, did not form a clade. Additionally, the R. insidiosa clade was included within the R. solanacearum clade.
Figure 4

Maximum likelihood trees showing the phylogeny of the genera Cupriavidus and Ralstonia based on 16S rRNA gene sequences. Phylogenetic trees generated for both genera (A), the genus Cupriavidus (B), and the genus Ralstonia (C). Full-length 16S rRNA gene sequences were aligned and used in the phylogenetic analysis (see section Materials and Methods for detail). Proposed type-strains are designated with a “T.” Strain NH9 is marked with a red circle. Bootstrap values are represented at the branching points (only values >50% are shown).

Maximum likelihood trees showing the phylogeny of the genera Cupriavidus and Ralstonia based on 16S rRNA gene sequences. Phylogenetic trees generated for both genera (A), the genus Cupriavidus (B), and the genus Ralstonia (C). Full-length 16S rRNA gene sequences were aligned and used in the phylogenetic analysis (see section Materials and Methods for detail). Proposed type-strains are designated with a “T.” Strain NH9 is marked with a red circle. Bootstrap values are represented at the branching points (only values >50% are shown). 16S rRNA gene sequence-based phylogenetic analysis is a basic approach used in prokaryotic taxonomy; however, it cannot provide sufficient resolution to discriminate sequences to the species level. Because MLSA can help to resolve phylogenetic relationships at the genus and species levels, we also constructed a MLSA-based phylogenetic tree using four single-copy housekeeping genes, atpD, leuS, rplB, and gyrB, obtained from 45 Cupriavidus and 104 Ralstonia strains (Figure 5). leuS, rplB, and gyrB were also used in a recent MLSA study of R. solanacearum (Zhang and Qiu, 2016). The concatenated gene sequence-based phylogenetic tree showed broadly similar patterns to the 16S rRNA-based phylogeny and all clades were clearly resolved, although Ralstonia sp. PBA was grouped within the same clade as the genus Cupriavidus (Figure 5A). This branching of strain PBA was consistent with a previously reported maximum likelihood tree (Kim and Gan, 2017). Additionally, the phylogenetic positions of Cupriavidus sp. BIS7, Cupriavidus sp. YR651, and Ralstonia sp. A12 differed compared with the 16S rRNA-based tree (Figures 5B,C). As has been reported previously, a clade consisting of the R. solanacearum strains showed rational branching (Zhang and Qiu, 2016). Furthermore, the R. insidiosa clade was separated from the R. solanacearum clade (Figure 5C). Bootstrap values in the concatenated gene tree were significantly higher than those in the 16S rRNA gene-based tree, indicating that the observed branching patterns were reliable.
Figure 5

Maximum likelihood phylogenetic trees constructed using concatenated atpD, leuS, rplB, and gyrB nucleotide sequences from strains belonging to the genera Cupriavidus and Ralstonia. Phylogenetic trees generated for both genera (A), the genus Cupriavidus (B), and the genus Ralstonia (C). The final dataset contained 6,333 positions (see section Materials and Methods for details). Strain NH9 is marked with a red circle. Bootstrap values are represented at the branching points (only values >50% are shown).

Maximum likelihood phylogenetic trees constructed using concatenated atpD, leuS, rplB, and gyrB nucleotide sequences from strains belonging to the genera Cupriavidus and Ralstonia. Phylogenetic trees generated for both genera (A), the genus Cupriavidus (B), and the genus Ralstonia (C). The final dataset contained 6,333 positions (see section Materials and Methods for details). Strain NH9 is marked with a red circle. Bootstrap values are represented at the branching points (only values >50% are shown).

Average Nucleotide Identity Analysis

ANI analysis is usually used for bacterial species delineation along with MLSA (Goris et al., 2007). To assess genome similarities, ANI analysis was performed using all downloaded Cupriavidus and Ralstonia genome sequences (Figures 6, 7, and Table S4). Clusters determined by ANI analysis are described in Table 3. The ANI cut-off value for species distinction is generally 95–96% (Richter and Rossello-Mora, 2009; Kim et al., 2014; Ciufo et al., 2018). While this threshold value was applicable to many Ralstonia strains examined in this study (Table S4), we propose that in the case of the genus Cupriavidus and the species R. pickettii, 90% is a more reasonable threshold considering the results of ANI, phylogenetic analyses, and TNA. For instance, the ANI score of strain NH9 was 91.16% (ANI1 and 2) when compared with C. necator N-1T (Table S4), which would suggest that NH9 does not belong to the species C. necator when using a standard ANI threshold value. Likewise, some of the R. pickettii strains were not correctly grouped at a 95–96% cut-off value. Richter and Rossello-Mora reported that at ANI values below 90%, taxonomic differences became more evident, suggesting that ANI values above 90% produce more robust results (Richter and Rossello-Mora, 2009). Furthermore, it has been discovered that several genera contain species with non-standard ANI cut-off points, which is thought to reflect species diversity (Kim et al., 2014; Ciufo et al., 2018). As noted above, the genus Cupriavidus and the species R. pickettii are very diverse. The relationships between strains categorized at a 90% ANI cut-off were almost identical to the groupings identified by phylogenetic analysis and TNA. Therefore, we propose relaxation of the ANI cut-off value from 95–96 to 90% for the genus Cupriavidus and the species R. pickettii.
Figure 6

Heatmap and dendrogram of average nucleotide identity (ANI) values of the 48 putative Cupriavidus genomes. Strains with ANI values >90% are shown within blue squares (see Table 3 and Table S4 for a description of cluster designations). All squares were assigned numbers from 1 to 17.

Figure 7

Heatmap and dendrogram of average nucleotide identity (ANI) values of the 101 putative Ralstonia genomes. Strains with ANI values >90 and >95% are shown within blue and red squares, respectively (see Table 3 and Table S4 for a description of cluster designations). All squares were assigned a number from 18 to 25.

Heatmap and dendrogram of average nucleotide identity (ANI) values of the 48 putative Cupriavidus genomes. Strains with ANI values >90% are shown within blue squares (see Table 3 and Table S4 for a description of cluster designations). All squares were assigned numbers from 1 to 17. Heatmap and dendrogram of average nucleotide identity (ANI) values of the 101 putative Ralstonia genomes. Strains with ANI values >90 and >95% are shown within blue and red squares, respectively (see Table 3 and Table S4 for a description of cluster designations). All squares were assigned a number from 18 to 25. We also performed additional DNA comparisons whereby the two main replicons from each of the complete genome-sequenced Cupriavidus and Ralstonia strains were, respectively, compared with those from all of the other strains. It was expected that the ANI cut-off values produced from the comparisons between main replicons and between second replicons would be dramatically improved relative to the original results; however, this was not the case (data not shown). This result thus seems to endorse the validity of ANI analysis using the whole-genome sequence of each strain. Species grouped with ANI values >90 and 95% are shown in blue and red boxes, respectively, in Figures 6, 7. Cupriavidus and Ralstonia strains formed 17 and 8 groups, respectively, indicating greater diversity within the genus Cupriavidus compared with Ralstonia. Only Ralstonia sp. PBA did not share similarity with either genus, as was also observed in the 16S rRNA gene-based phylogeny (Table S4). Interestingly, group 3 included three different species (ANI values >92%): C. alkaliphilus, C. nantongensis, and C. taiwanensis. This is not surprising given the abovementioned high degree of 16S rRNA nucleotide sequence similarity (>99.1%) between the three species. While ANI-based groupings were consistent with the MLSA-based phylogenetic relationships, some variations in the grouping patterns were observed. C. necator A5-1 and R. pickettii DTP0602 (group 2) were separated from the species C. necator (group 1), while the members of group 9, C. basilensis KF708 and Cupriavidus sp. WS, were distinguished from the species C. basilensis (group 7). Cupriavidus sp. BIS7 (group 17), Ralstonia sp. MD27 (group 21), and Ralstonia sp. A12 (group 22), which were categorized into the C. metallidurans, R. insidiosa, and R. pickettii clades, respectively, in the MLSA-based phylogeny, were separated from their respective groups in this analysis. Further, Ralstonia sp. A12 showed higher similarity to R. insidiosa (group 20) than to R. pickettii (group 18). The species R. solanacearum formed three subgroups, consisting of phylotype I and III (group 23), phylotype IV (group 24), and phylotype II (group 25) strains, with an ANI value of 95%. It has previously been proposed that R. solanacearum should be classified into three species, R. pseudosolanacearum (phylotype I and III), R. solanacearum (phylotype II), and R. syzygii (phylotype IV) (Safni et al., 2014), which was validated by genomic, phylogenetic, and proteomic approaches (Prior et al., 2016; Zhang and Qiu, 2016). The result of ANI analysis supported this classification.

Percentage of Conserved Proteins Analysis

Phylogenetic analyses and ANI matrixes showed that two strains currently categorized as Ralstonia, R. pickettii DTP0602 and Ralstonia sp. 25fmcol4.1, belong to the genus Cupriavidus, and suggested that Ralstonia sp. PBA does not belong to either Cupriavidus or Ralstonia. However, while these analyses are useful for species delineation, they are not suitable for determining genera (Qin et al., 2014). Because the POCP method can provide comprehensive information for prokaryotic genus definition and delimitation, we performed POCP analysis for all Cupriavidus and Ralstonia strains (Figure 8). The POCP value threshold for a genus boundary is generally 50%; however, in the present case, we propose that a 60% POCP value is more rational. While most of the Cupriavidus and Ralstonia strains were correctly categorized into the appropriate genus, the two Ralstonia strains mentioned above showed same behavior as the Cupriavidus genus strains. This result confirmed that R. pickettii DTP0602 and Ralstonia sp. 25fmcol4.1 should be reclassified into the genus Cupriavidus. Ralstonia sp. PBA had POCP values of 51.4 and 52.6% when compared with Cupriavidus and Ralstonia strains, respectively, suggesting that it likely does not belong to either genus (Figure 8).
Figure 8

Comparison of average percentage of conserved proteins (POCP) values between Cupriavidus (A) and Ralstonia (B) named strains. Ralstonia group and Cupriavidus group strains are shown to the left and right of the vertical dotted bar, respectively. The two red dots indicate strains that are currently classified as Ralstonia but grouped into the genus Cupriavidus in this analysis. Blue dots indicate identical average POCP values when compared with Cupriavidus and Ralstonia strains. The red horizontal line indicates the genus boundary threshold (a POCP value of 60% was used in this study). Organism names are abbreviated as follows: 25fmcol4.1, Ralstonia sp. 25fmcol4.1; DTP0602, R. pickettii DTP0602; PBA, Ralstonia sp. PBA.

Comparison of average percentage of conserved proteins (POCP) values between Cupriavidus (A) and Ralstonia (B) named strains. Ralstonia group and Cupriavidus group strains are shown to the left and right of the vertical dotted bar, respectively. The two red dots indicate strains that are currently classified as Ralstonia but grouped into the genus Cupriavidus in this analysis. Blue dots indicate identical average POCP values when compared with Cupriavidus and Ralstonia strains. The red horizontal line indicates the genus boundary threshold (a POCP value of 60% was used in this study). Organism names are abbreviated as follows: 25fmcol4.1, Ralstonia sp. 25fmcol4.1; DTP0602, R. pickettii DTP0602; PBA, Ralstonia sp. PBA.

Tetra-Nucleotide Analysis

TNA and a PCA were also performed because tetra-nucleotide usage could be an alternative marker for clustering bacteria based on similarities in their genome sequence features (Richter and Rossello-Mora, 2009). These analyses showed that Cupriavidus strains formed 12 scattered clusters (clusters A–L) while Ralstonia strains formed only four clear clusters (clusters M–P), indicating the greater diversity of the Cupriavidus genome structure (Figure 9). Cluster assignments based on PCA are described in Table 3. Strain NH9 was categorized into Cluster D, consisting mainly of C. necator strains. PCA grouped C. necator A5-1 and R. pickettii DTP0602 into the same cluster (cluster G) as shown in the ANI matrix, with the addition of Cupriavidus sp. amp6 and Cupriavidus sp. IDO (Figure 9 and Table 3). Because Cupriavidus sp. amp6 and Cupriavidus sp. IDO have comparatively high ANI values when compared with C. necator A5-1 and R. pickettii DTP0602 (Table S4), this clustering is appropriate. However, C. oxalaticus NBRC 13593, which had an ANI score of >90% when compared with Cupriavidus sp. amp6, was noticeably separated from cluster G and was instead categorized into cluster B, comprised of C. taiwanensis, C. nantongensis, and C. alkaliphilus strains. Cluster K contained several species, including C. pauculus KF709, C. pinatubonensis JMP134, Cupriavidus sp. BIS7, Cupriavidus sp. YR651, and Ralstonia sp. PBA. Surprisingly, the genome features of Ralstonia sp. PBA were more similar to those of Cupriavidus strains than to Ralstonia strains. As expected, R. pickettii DTP0602 and Ralstonia sp. 25fmcol4.1 were categorized into the Cupriavidus-derived clusters (Table 3). Ralstonia sp. A12 was classified into cluster N, consisting mainly of R. insidiosa strains. This classification is inconsistent with the results of MLSA, but similar to the results of ANI analysis. R. solanacearum strains (cluster P) were not separated into three groups as shown in the ANI matrix, but instead formed a single grouping. Cluster designations of all other Cupriavidus and Ralstonia strains based on TNA were the same as those determined by ANI analysis.
Figure 9

Three-dimensional plot of principal components analysis results for all 150 Cupriavidus and Ralstonia genomes. Differences in color between clusters indicates divergence using the first three principal components. Sixteen clusters (A to P) are shown (see Table 3 for a description of cluster assignments).

Three-dimensional plot of principal components analysis results for all 150 Cupriavidus and Ralstonia genomes. Differences in color between clusters indicates divergence using the first three principal components. Sixteen clusters (A to P) are shown (see Table 3 for a description of cluster assignments). ANI analysis and TNA produced contradictory results in the designation of the several strains (C. oxalaticus NBRC 13593, C. pauculus KF709, C. pinatubonensis JMP134, Cupriavidus sp. BIS7, Cupriavidus sp. YR651, and Ralstonia sp. PBA) (Table 3). Richter and Rossello-Mora (Richter and Rossello-Mora, 2009) considered two scenarios as the causes for this paradoxical observation; (i) evolutionary or environmental forces may impede modifications of the genomic signature, resulting in a tetra-nucleotide frequency that does not reflect the actual phylogenetic position of the strain, and (ii) the amount of aligned sequence should be taken into account when ANI analysis was performed. We confirmed that adequate amounts of sequence were used for pairwise DNA sequence alignment in our analysis. When compared with the results of phylogenetic analyses and ANI analysis, we propose that ANI method is more reliable and suitable for inferring phylogenetic relationships as the results are more clear cut than TNA method.

Reclassification of the Strains of the Genera Cupriavidus and Ralstonia

The proposed reclassification of several Cupriavidus and Ralstonia strains is summarized in Table 4. Phylogenetic and whole-genome sequence analyses confirmed that strain NH9 belongs to the species C. necator, but also suggested that 41 of the strains examined in this study should be corrected in terms of their taxonomic classifications. While phylogenetic analysis did not support a change in taxonomic classification for C. basilensis KF708 and C. necator A5-1, whole-genome sequence analyses suggested that the species designations of these strains were incorrect. The classification of R. pickettii DTP0602 was also called into question based on the results of whole-genome sequence analyses (Table 4). Overall, these results suggest that the combination of phylogenetic and whole-genome sequence analyses can identify the correct taxonomic assignments for bacterial strains, with whole-genome sequence analyses being particularly useful for improving the resolution, although biochemical characterization is required for complete taxonomic classification.
Table 4

Summary of proposed reclassification of Cupriavidus and Ralstonia strains.

GenBank strain nameClassificationPhylogenetic analysesWhole-genome sequence analyses
CurrentProposed16SMLSAANITNAPOCP
KF708C. basilensisCupriavidus sp.C. basilensisC. basilensis++Cupriavidus
A5-1C. necatorCupriavidus sp.C. necatorC. necator++Cupriavidus
NH9C. necatorC. necator++++Cupriavidus
UM1C. pauculusCupriavidus sp.++++Cupriavidus
GA3-3Cupriavidus sp.C. necatorN. T.+++Cupriavidus
HMR-1Cupriavidus sp.C. metallidurans++++Cupriavidus
HPC(L)Cupriavidus sp.C. gilardii++++Cupriavidus
SHECupriavidus sp.C. metallidurans++++Cupriavidus
SK-4Cupriavidus sp.C. necator++++Cupriavidus
UYPR2.512Cupriavidus sp.C. necatorN. T.+++Cupriavidus
DTP0602R. pickettiiCupriavidus sp.C. necatorC. necator++Cupriavidus
FDAARGOS_410R. pickettiiR. mannitolilytica++++Ralstonia
CQPS-1R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
EP1R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
P781R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
UW757R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
CaRs-MepR. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
CFBP3059R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
KACC10709R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
KACC 10722R. solanacearumR. syzygii+++R. solanacearumRalstonia
OE1-1R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
RSCMR. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
SEPPX05R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
YC40-MR. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
58_RSOLR. solanacearumR. insidiosa++++Ralstonia
PSS4R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
PSS190R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
PSS216R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
PSS1308R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
RD15R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
Rs-T02R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
UTT-25R. solanacearumR. pseudosolanacearum+++R. solanacearumRalstonia
25mfcol4.1Ralstonia sp.Cupriavidus sp.++++Cupriavidus
5_2_56FAARalstonia sp.R. pickettii++++Ralstonia
A12Ralstonia sp.R. insidiosa+R. pickettiiRalstonia sp.+Ralstonia
AU12-08Ralstonia sp.R. insidiosa++++Ralstonia
MD27Ralstonia sp.R. insidiosa++Ralstonia sp.+Ralstonia
NFACC01Ralstonia sp.R. pickettii++++Ralstonia
NT80Ralstonia sp.R. insidiosa++++Ralstonia
UNC404CL21ColRalstonia sp.R. pickettii++++Ralstonia
UNCCL144Ralstonia sp.R. pickettii++++Ralstonia
PBARalstonia sp.Other+Cupriavidus sp.+Cupriavidus sp.+

The symbol “+” indicates same taxonomy as “Proposed.” N. T., Not tested.

Summary of proposed reclassification of Cupriavidus and Ralstonia strains. The symbol “+” indicates same taxonomy as “Proposed.” N. T., Not tested. All analysis methods clearly indicated that R. pickettii DTP0602 and Ralstonia sp. 25mfcol4.1 should be reclassified into the genus Cupriavidus. Strain DTP0602 has also previously been flagged for reclassification into the genus Cupriavidus (Zhang and Qiu, 2016), but robust taxonomic analysis has not been performed until now. In contrast, the current study is the first to reveal strain 25mfcol4.1 as a member of the genus Cupriavidus. Interestingly, three different species, C. alkaliphilus, C. nanotongensis, and C. taiwanensis showed high similarities in all analyses. ANI-based categorization (cut-off value 90%), especially, suggested that these bacteria are the same species (ANI values > 92%), although the three species have been already reported to be separated via phenotypic characterization and DNA-DNA hybridization (Sun et al., 2016). Strains H16 and JMP134 (formerly known as R. eutropha) were reported to share high similarities with C. necator and C. pinatubonensis type-strains, respectively, based on DNA-DNA hybridization analyses (Vandamme and Coenye, 2004; Sato et al., 2006). However, detailed taxonomic experiments, and therefore a robust classification, have never been performed for strain H16. Our results of 16S rRNA gene-based phylogenetic analysis, MLSA, ANI, and TNA agreed with previous classifications. By using several discrimination methods, the taxonomic positions of strains H16 and JMP134, which have been widely studied as a polyhydroxybutyrate producer (Pohlmann et al., 2006; Kutralam-Muniasamy and Perez-Guevara, 2018) and a 2,4-dichlorophenoxyacetic acid degrader (Lykidis et al., 2010), respectively, have been clarified in the current study. The species R. solanacearum were clearly separated into three subgroups based on phylogenetic and ANI analyses, and this result was consistent with previous studies (Safni et al., 2014; Prior et al., 2016; Zhang and Qiu, 2016). In our research, newly 20 strains were proposed to be reclassified into appropriate taxonomic positions (Table 4). Contradictory results were obtained regarding the phylogenetic position of Ralstonia sp. A12, as described above. Considering all of the results for this strain, we concluded that it could also be classified as R. insidiosa (Table 4). Although a species classification could not be determined for Ralstonia sp. MD27 at an ANI value of 95%, phylogenetic analyses and TNA clearly indicated that it belongs to the species R. insidiosa. While Ralstonia sp. PBA shows phylogenetic affinity to the genus Cupriavidus based on 16S rRNA nucleotide sequence analysis (Figure 4) and tetra-nucleotide usage (Figure 9), it is unlikely to belong to either the genus Cupriavidus or the genus Ralstonia based on POCP identities of <60% when compared with Cupriavidus and Ralstonia strains (Figure 8). Based on POCP analysis, Kim and Gan proposed that Ralstonia sp. PBA should be classified into the genus Cupriavidus; however, they only performed pairwise comparison of the proteome of strain PBA with proteins from Cupriavidus and Burkholderia strains (Kim and Gan, 2017). Regardless, all results presented so far confirm that strain PBA is a member of the family Burkholderiaceae. A more comprehensive analysis including strains belonging to related genera will provide a more appropriate classification of strain PBA.

Conclusion

In the present work, the complete genome sequence of C. necator NH9 was obtained. Analyses of general genome properties, genome structure, and the aromatic compound degradation capacity of NH9 demonstrated that this bacterium had similar characteristics to other Cupriavidus strains. The presence of several dioxygenase-encoding genes suggested a versatile role for NH9 in the degradation of aromatic compounds in contaminated soil. Based on comprehensive phylogenetic and genomic analyses, NH9 was clearly identified as belonging to the species C. necator. Further analyses of 46 Cupriavidus and 104 Ralstonia strains also indicated that 41 of these strains should be reclassified at either the genus or species level. In particular, two Ralstonia strains should be reclassified into the genus Cupriavidus. The combination of several discrimination methods allowed more precise classification of these bacteria, which have a complex taxonomic history. We determined that the ANI method was a particularly powerful tool for classification of bacteria at the species level. However, we propose that standard ANI cut-off values of 90 and 95% be applied to Cupriavidus and Ralstonia strains, respectively, because the species diversity within the genus Cupriavidus is higher than that of the genus Ralstonia. In addition, a 90% ANI threshold should also be applied to the species R. pickettii because of its similarly high level of diversity. On the other hand, while the phylogenetic re-location of the strain DTP0602 as a species of Cupriavidus by our analysis turned out to accord with the tendency of degradation ability of aromatic compounds by Cupriavidus, incongruence was observed between the delineation of the two genera and the tendency of the aromatic degradation abilities (Figure S2). This suggested that some genetic events such as horizontal transfer of the degradation genes in the past beyond the genus.

Author Contributions

RM, HD, and NO conceived and designed the experiments. RM performed the experiments, analyzed the data, prepared all tables and figures, and wrote the manuscript. HD and YK provided assistance with analysis tools. HD, YK, and NO critically reviewed and curated the manuscript. NO is responsible for the project.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  5 in total

1.  Genomic characterization of denitrifying methylotrophic Pseudomonas aeruginosa strain AAK/M5 isolated from municipal solid waste landfill soil.

Authors:  Ashish Kumar Singh; Rakesh Kumar Gupta; Hemant J Purohit; Anshuman Arun Khardenavis
Journal:  World J Microbiol Biotechnol       Date:  2022-06-16       Impact factor: 3.312

2.  Genomic, morphological, and biochemical analyses of a multi-metal resistant but multi-drug susceptible strain of Bordetella petrii from hospital soil.

Authors:  Urmi Halder; Raju Biswas; Ashutosh Kabiraj; Rajendar Deora; Moitri Let; Rajendra Kr Roy; Annapurna Chitikineni; Krishnendu Majhi; Shrabana Sarkar; Bhramar Dutta; Anubhab Laha; Arunava Datta; Dibyendu Khan; Rajeev K Varshney; Dipnarayan Saha; Saswati Chattopadhyay; Rajib Bandopadhyay
Journal:  Sci Rep       Date:  2022-05-19       Impact factor: 4.996

3.  Cupriavidus metallidurans CH34 Possesses Aromatic Catabolic Versatility and Degrades Benzene in the Presence of Mercury and Cadmium.

Authors:  Pablo Alviz-Gazitua; Roberto E Durán; Felipe A Millacura; Franco Cárdenas; Luis A Rojas; Michael Seeger
Journal:  Microorganisms       Date:  2022-02-21

4.  Whole genome sequence analysis of Cupriavidus campinensis S14E4C, a heavy metal resistant bacterium.

Authors:  Gorkhmaz Abbaszade; Attila Szabó; Balázs Vajna; Rózsa Farkas; Csaba Szabó; Erika Tóth
Journal:  Mol Biol Rep       Date:  2020-05-13       Impact factor: 2.316

5.  Natural Chromosome-Chromid Fusion across rRNA Operons in a Burkholderiaceae Bacterium.

Authors:  Jiro F Mori; Robert A Kanaly
Journal:  Microbiol Spectr       Date:  2022-01-05
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.