Literature DB >> 29755434

A Polyphasic and Taxogenomic Evaluation Uncovers Arcobacter cryaerophilus as a Species Complex That Embraces Four Genomovars.

Alba Pérez-Cataluña1, Luis Collado2, Oscar Salgado2,3, Violeta Lefiñanco2, María J Figueras1.   

Abstract

The species Arcobacter cryaerophilus is found in many food products of animal origin and is the dominating species in wastewater. In addition, it is associated with cases of farm animal and human infectious diseases,. The species embraces two subgroups i.e., 1A (LMG 24291T = LMG 9904T) and 1B (LMG 10829) that can be differentiated by their 16S rRNA-RFLP pattern. However, some authors, on the basis of the shared intermediate levels of DNA-DNA hybridization, have suggested abandoning the subgroup classification. This contradiction indicates that the taxonomy of this species is not yet resolved. The objective of the present study was to perform a taxonomic evaluation of the diversity of A. cryaerophilus. Genomic information was used along with a Multilocus Phylogenetic Analysis (MLPA) and phenotypic characterization on a group of 52 temporally and geographically dispersed strains, coming from different types of samples and hosts from nine countries. The MLPA analysis showed that those strains formed four clusters (I-IV). Values of Average Nucleotide Identity (ANI) and in silico DNA-DNA Hybridization (isDDH) obtained between 13 genomes representing strains of the four clusters were below the proposed cut-offs of 96 and 70%, respectively, confirming that each of the clusters represented a different genomic species. However, none of the evaluated phenotypic tests enabled their unequivocal differentiation into species. Therefore, the genomic delimited clusters should be considered genomovars of the species A. cryaerophilus. These genomovars could have different clinical importance, since only the cluster I included strains isolated from human specimens. The discovery of at least one stable distinctive phenotypic character would be needed to define each cluster or genomovar as a different species. Until then, we propose naming them "A. cryaerophilus gv. pseudocryaerophilus" (Cluster I = LMG 10229T), "A. cryaerophilus gv. crypticus" (Cluster II = LMG 9065T), "A. cryaerophilus gv. cryaerophilus" (Cluster III = LMG 24291T) and "A. cryaerophilus gv. occultus" (Cluster IV = LMG 29976T).

Entities:  

Keywords:  ANI; Arcobacter cryaerophilus; MLPA; genomovar; isDDH

Year:  2018        PMID: 29755434      PMCID: PMC5934430          DOI: 10.3389/fmicb.2018.00805

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


Introduction

The genus Arcobacter, within the family Campylobacteraceae, was proposed by Vandamme et al. (1991) to reclassify two species that were, at that time, assigned to the genus Campylobacter i.e., Campylobacter nitrofigilis (Arcobacter nitrofigilis, that was selected as the representative or the type species of the genus) and Campylobacter cryaerophila (now Arcobacter cryaerophilus). The phenotypic characteristics that differentiate Campylobacter and Arcobacter are the ability of the latter to grow in aerobic conditions and at lower temperatures (Vandamme et al., 1991; Collado and Figueras, 2011). Using more than 4000 genomes, Waite et al. (2017) recently analyzed the 16S and 23S rRNA genes and 120 protein sequences and as a result they moved the Epsilonproteobacteria to the phylum level with the name Epsilonbacteraeota. In addition, they created a new family Arcobacteraceae that includes only the genus Arcobacter. Currently, the genus Arcobacter includes 27 species (Park et al., 2016; Whiteduck-Léveillée et al., 2016; Diéguez et al., 2017; Figueras et al., 2017; Tanaka et al., 2017; Pérez-Cataluña et al., 2018), four of which have been linked with human disease: Arcobacter butzleri, A. cryaerophilus, A. thereius, and A. skirrowii (Collado and Figueras, 2011; Figueras et al., 2014; Ferreira et al., 2015). The species A. cryaerophilus has been found in many food products of animal origin (like poultry, pork, lamb, and seafood and in dairy food processing facilities (Collado et al., 2008; Collado and Figueras, 2011). On the basis of the different Restriction Fragment Length Polymorphism (RFLP) of the 16S and 23S rRNA genes, Kiehlbauch et al. (1991) and Vandamme et al. (1992) divided the species A. cryaerophilus into two subgroups, subgroup 1 or 1A and subgroup 2 or 1B (from here on we will call them subgroups 1A and 1B), represented by strains LMG 24291T (= LMG 9904T) and LMG 10829, respectively. Additionally, it was demonstrated that the two subgroups showed different whole-cell protein and fatty acid contents (Vandamme et al., 1992) and clustered apart by their Amplified Fragment Length Polymorphism (AFLP) patterns (On et al., 2003). A 16S rDNA-RFLP identification method established the separation of the subgroups on the basis of their restriction patterns (Figueras et al., 2008). Despite strains belonging to both subgroups having been found at the same time in animal and human clinical samples and in food products, 1B is generally much more frequently found than 1A (Collado and Figueras, 2011 and references therein). In 2010, Debruyne et al. (2010) reassessed the taxonomy of these two subgroups of A. cryaerophilus using 59 strains isolated mainly from aborted animals (74% of the strains) and human faces (19%). The clustering of the strains obtained by AFLP and by the phylogenetic analysis of the cpn60 gene, together with the shared intermediate levels of DNA-DNA hybridization observed between the strains lead the authors to conclude that despite A. cryaerophilus having a complex taxonomy, the subgroup nomenclature should be abandoned (Debruyne et al., 2010). Furthermore, it was considered that the type strain (LMG 24291T = LMG 9904T) of A. cryaerophilus was not representative of the species because it corresponded with the less abundant 1A subgroup. They therefore proposed that it should be changed for the strain LMG 10829, representative of subgroup 1B (Debruyne et al., 2010). However, a recent metagenomic analysis of Arcobacter populations recovered from sewage samples of the wastewater treatment plant in the city of Reus (Spain) and from various cities of the United States gave evidence that both A. cryaerophilus subgroups (1A and 1B) were dominating in this environment (Fisher et al., 2014). In addition, a different prevalence of the two A. cryaerophilus subgroups was found depending on the wastewater temperature, 1B dominating in wastewater samples with temperatures above 20°C. Fisher et al. (2014) concluded that this finding is relevant because understanding the ecological factors that affect the fate of Arcobacter spp. in wastewater may help to better understand the risks associated with these emerging pathogens. The latter study showed that both subgroups of A. cryaerophilus were abundant and represented two different ecotypes. Therefore, based on those findings, a new polyphasic re-evaluation of the taxonomic diversity of this species is required. The aim of the present study was to investigate the taxonomy of A. cryaerophilus, evaluating strains from 9 different countries recovered from wastewater, different types of shellfish, human faces and various types of animal samples (feces, various viscera from fetuses, uterus, and milk). To our knowledge, this is the most diverse collection of strains of this species studied so far. The polyphasic study involved a phylogenetic analysis of the sequences of the 16S and 23S rRNA genes and of several housekeeping genes, an analysis of 13 genomes (7 of which were obtained in this study) from a representative strains and a phenotypic characterization.

Materials and methods

Strains used in this study

The study included a total of 52 strains that were widely distributed, both geographically and by the type of sample from which they were isolated that, included different host species (humans, pigs, cow, deer, clams, etc.) and environments (water, milk, reclaimed water etc.) as show in Table 1. Six strains possessed their genomes available at the GenBank database, 36 were field isolates from different sources and countries collected over a broad time frame (1985–2013) and 10 strains were from the BCCM/LMG Bacteria Culture Collection (Table 1). Among the latter was the type strain of A. cryaerophilus LMG 24291T that corresponds to subgroup 1A and the reference strain LMG 10829 of the subgroup 1B (Table 1). The 46 strains were re-evaluated or ascribed to subgroups 1A or 1B using the 16S rDNA-RFLP method described by Figueras et al. (2008, 2012). The method consists of the digestion of an amplified fragment (1026 bp) of the 16S rRNA gene with the enzyme MseI, which produces a pattern with different band sizes for subgroup 1A (395, 216, 143, 138 bp) and for subgroup 1B (365, 216, 143, and 138 bp). The RFLP patterns of the six genomes from the GenBank database (genomes L397 to L401 and L406) were obtained by an in silico simulation of the enzymatic digestion using GeneQuest software (DNASTAR, USA). When a different pattern from that expected for A. cryaerophilus was obtained, it was compared with those patterns described for the type strains of all the Arcobacter species by Figueras et al. (2008, 2012). In addition the identity of the strains were confirmed by sequencing the rpoB gene using primers and conditions described in other studies (Collado et al., 2009; Levican et al., 2015).
Table 1

Strains used (n = 52) in this study included field isolates, the type and reference collection strains of the species A. cryaerophilus and genomes from the NCBI databasea and 7 obtained in this studyb (accession numbers in Table 2).

CountryStrainSourceIsolation year16S-RFLP PatternCluster
BrazilF196Aborted porcine fetus19971BI
BrazilUF1TUterus, sow19971BI
BrazilUF2TUterus, sow19971BI
BrazilUPER3Uterus, sow19971BI
CanadaLMG 10229bKidney, aborted porcine fetus19901BI
CanadaLMG 10241Kidney, aborted porcine fetus19901BI
CanadaLMG 10210bAborted bovine fetus19901BIV
CanadaL397aWastewater20081BI
CanadaL398aWater20081BI
CanadaL399aWastewater20081BI
CanadaL400aWastewater20081BI
CanadaL401aGoose feces20091BI
CanadaL406aWater20081BI
ChileAB3AAbomasum, aborted bovine fetus20111BI
ChileAB74AAbomasum, aborted bovine fetus20131BI
ChileAO2ALungs, aborted ovine fetus20111BI
ChileAL 20-1Clam20111BII
ChileCV-152Feces, deer20131AIII
ChileCV-2101Feces, deer20131AIII
ChileEMU-3Feces, emu20131AIII
ChileFE7Feces, chicken2005AbutzII
ChileHHS 118AFeces, asymtomatic human20131BI
ChileHHS 133AFeces, asymtomatic human20131BI
ChileHHS 188AFeces, asymtomatic human20131BI
ChileHHS 191AFeces, asymtomatic human20131BI
ChileHHS 205AFeces, asymtomatic human20131BI
ChileMC 2-2Surf clam2011NPI
ChileMCV 42-1Feces, cow20111BI
ChileME 15-4Mussel2011AbutzII
ChileNAV 15-1Razor clam20111AIV
ChileNAV12-2Razor clam2011NPI
ChileNB14AJejunum, calf20111BI
Costa Rica14 PHAViscera, chicken20111BI
Costa Rica20 PHFViscera, chicken20111BI
IrelandLMG 24291TbBrain, aborted bovine fetus19851AIII
IrelandLMG 9065bPlacenta, aborted ovine fetus19891AII
IrelandLMG 9861bPeritoneum, aborted bovine fetus19901BI
IrelandLMG 9863bPlacenta, aborted ovine fetus1990AbutzII
IrelandLMG 29976bEye, aborted porcine fetus19901AIV
IrelandLMG 9871bKidney, aborted bovine fetus1990AbutzII
Italy284/1Cow milk20121BI
ItalyBUF3Buffalo milk20121BI
ItalyFEBU4Feces, buffalo20121BI
New Zealand8749401Diarrhoeic feces, human20081BI
New Zealand8756347Diarrhoeic feces, human20081BI
Spain8122333Diarrhoeic feces, human20121BI
SpainRW15-1Reclaimed water20131AIV
SpainRW17-4Reclaimed water20131AIV
SpainRW25-5Reclaimed water20131AI
SpainRW33-8Reclaimed water20131AI
SpainRW45-3Reclaimed water20131AIV
USALMG 10829Human blood19901BI
Strains used (n = 52) in this study included field isolates, the type and reference collection strains of the species A. cryaerophilus and genomes from the NCBI databasea and 7 obtained in this studyb (accession numbers in Table 2).
Table 2

Characteristics of the 13 genomes from representative strains from each of the clusters.

CLUSTERS
IIIIIIIV
FeaturesLMG 10229T (NXGD01a)LMG 9861 (NXGJ01a)L397 (LRUQ01a)L398 (LRUR01a)L399 (LRUS01a)L400 (LRUT01a)L401 (LRUU01a)L406 (LRUV01a)LMG9065T (NXGI01a)LMG9871 (NXGH01a)LMG 24291T (NXGK01a)LMG 29976T (NXGK01a)LMG 10210 (NXGE01a)
Deep Coverage25X37X180X129X77X106X133X190X74X19X196X187X160X
Size (Mb)2.062.022.312.032.102.202.172.022.052.082.052.192.26
Contigs2732967192928565561809132270
N50 (Kb)1991095664545458641383854241355
G+C%27.327.627.027.227.427.327.127.427.328.227.230.027.6
Genes (Total)2,1342,0742,3732,1002,2142,2582,2052,1022,1392,2372,1412,4752,346
CDS (Coding)2,0712,0002,2462,0022,1002,1382,1172,0202,0702,1362,0812,2882,255
Genes (RNA)46523331383730345450495959
tRNAs36402725302925293840404643
ncRNAs2332322222333

Genome accession number; ncRNAs, non-coding RNAs.

Phylogenetic analysis

A Multilocus Phylogenetic Analysis (MLPA) was carried out by amplifying and sequencing 4 housekeeping genes (gyrB, rpoB, atpA, and cpn60) following protocols described by Levican Asenjo (2013). In addition, these genes and the 16S and 23S rRNA genes were extracted from the 7 obtained genomes and from the 6 downloaded from the GenBank database. Accession number or locus tag of each gene and strain are show in Supplementary Table S1. Genes were aligned (Supplementary Figure S4) using CLUSTALW (Larkin et al., 2007) implemented in MEGA 6 software (Tamura et al., 2013). The same software was used for the phylogenetic analysis using Neighbor-Joining (NJ) algorithm (Kimura, 1980; Saitou and Nei, 1987) and the bootstrap support for individual nodes was calculated with 1,000 replicates.

Whole genome sequencing and analysis

The genome sequence of the type strain of A. cryaerophilus (LMG 24291T) and of six additional strains (LMG 10229T, LMG 9861, LMG 9065T, LMG 9871, LMG 29976T, and LMG 10210) representative of the different MLPA clusters were obtained in the present study using Illumina MiSeq platform (San Diego, CA, USA). The genomic DNA was extracted from pure cultures using the Easy-DNA™ gDNA Purification kit (Invitrogen, Madrid, Spain). Genomic libraries were prepared with the Nextera® XT DNA Sample Preparation Kit (Illumina) following manufacturer's instructions. Genome assembly was carried out with the SPAdes 3.9 (Nurk et al., 2013) and the CGE assemblers (Larsen et al., 2012) and the best results were selected for further analysis. Assembled genomes were annotated using Prokka v1.11 software (Seemann, 2014). Additionally, the protein-encoding sequences (CDS) were annotated using the Rapid Annotation Subsystem Technology (RAST) (Aziz et al., 2008) and the PATRIC server v3.5.2. (Wattam et al., 2017). The general characteristics derived from the NCBI Prokaryotic Genome Automatic Annotation Pipeline (PGAAP) and described for the 13 genomes (6 from the GenBank database and 7 from this study) were: genome size (Mb), number of contigs, N50 (bp), G+C content (%) and the number of predicted CDS. Furthermore, the genomes were compared by the Average Nucleotide Identity (ANI) and the in silico DNA-DNA hybridization (isDDH) indices using OrthoANI (Lee et al., 2015) and Genome-to-Genome Distance Calculator software (Meier-Kolthoff et al., 2013), respectively. Additionally, a phylogenetic analysis of the 13 genomes (LMG 24291T, LMG 10229T, LMG 9861, L397-L401, L406, LMG 9065T, LMG 9871, LMG 29976T, and LMG 10210) was carried out using the Maximum Likelihood estimation using RAxML (Stamatakis, 2014) with the pipeline implemented in the PATRIC server (Wattam et al., 2017). The genome of A. trophiarum LMG 25534Twas used as outgroup. As a first step, the phylogeny was constructed using a set of homologous proteins identified with BLASTp (Boratyn et al., 2013) and clustered with the Markov Cluster Algorithm (MCL) (Dongen, 2000). The second step was an alignment of the protein set using MUSCLE (Edgar, 2004) and the Hidden Markov Models (HMM) were constructed with HMMER tools (Eddy, 1998).

Virulence and antibiotic resistance genes

Virulence genes were searched by BLASTn analysis with default parameters using the Virulence Factors of Pathogenic Bacteria Database (VFDB) (Chen et al., 2005), Victors Database (University of Michigan, USA) and PATRIC_VF (Wattam et al., 2017). Antibiotic resistance genes were searched using the Antibiotic Resistance Database (ARDB) (Liu and Pop, 2009) and the Comprehensive Antibiotic Resistance Database (CARD) (Jia et al., 2017). The five mentioned databases are included at the Specialty Genes tool available at the PATRIC server (Wattam et al., 2017). Furthermore, the Antibiotic Resistance Gene-Annotation database (ARG-ANNOT) (Gupta et al., 2014) was also used to search antibiotic resistance genes by BLASTp analysis using default parameters and the database ARG-ANNOT AA V3 (March 2017). Virulence and resistance mechanisms were also searched for with RAST (Aziz et al., 2008) and PATRIC servers (Wattam et al., 2017). Additionally, genes related with the virulence of Arcobacter (Collado and Figueras, 2011; Douidah et al., 2012; Levican et al., 2013a) were searched for with BLASTn using sequences obtained from GenBank and from the annotated Arcobacter genomes of A. butzleri RM4018, A. nitrofigilis DSM 7299 and Arcobacter sp. L. The genes studied were cadF and cj1349, which encode two fibronectin binding proteins; ciaB encodes the invasion protein CiaB, mviN gene related to peptidoglycan synthesis; pldA gene encodes a phospholipase; tlyA gene codifies for a hemolysine; hecB related to hemolysis activation; hecA gene that encodes an adhesion protein and finally the gene irgA that codifies an iron-regulated outer membrane protein (Collado and Figueras, 2011; Douidah et al., 2012; Levican et al., 2013a). The accession number or locus tag of those genes are show in Supplementary Table S2. A phylogenetic analysis was conducted using the three virulence genes (cj1349, mviN, and pldA) present in all the studied genomes to evaluate their genetic relatedness and evolution.

Comparison of the genome derived metabolic and phenotypic information

The genomes of the seven representative strains from each cluster (LMG 10229T, LMG 9861, LMG 9065T, LMG 9871, LMG 24291T, LMG 29976T, and LMG 10210) were compared using the Functional Comparison Tool implemented in the Seed Viewer (Overbeek et al., 2014). This software uses the protein sequences of each compared genome annotated with RAST (Aziz et al., 2008) and reconstructs the metabolic pathways. On the other hand, the phenotypic traits derived from each genome were obtained with Traitar software (Weimann et al., 2016) using the protein annotations obtained with Prokka v1.2 (Seemann, 2014). This software infers phenotypic traits using data from the Global Infectious Disease and Epidemiology Online Network (GIDEON) and from the Bergey's Systematic Bacteriology (Goodfellow et al., 2012). The software works with a total of 67 traits that embrace different microbiological or biochemical characteristics involved in enzyme activity, growth, oxygen requirements, morphology, and hydrogen sulfide production (Weimann et al., 2016).

Phenotypic characterization

Phenotypic characterization of the 46 strains included 9 tests recommended in the guidelines for defining new species of the family Campylobacteraceae (Ursing et al., 1994; On et al., 2017) and 7 additional tests used in the description of other Arcobacter spp. (Donachie et al., 2005; Houf et al., 2005). Most of these tests were chosen using as a criterion the biochemical tests that gave variable results for both A. cryaerophilus subgroups in the previous study by On (1996), in which a total of 67 phenotypic tests were analyzed from 9 and 10 strains of subgroups 1A and 1B, respectively. Growth conditions on blood agar were tested (BD Difco, NJ, USA) at 37° and 42°C at three different atmospheres: aerobic, microaerobic, and anaerobic conditions. The biochemical properties were tested at 30°C in aerobic conditions for the 46 strains using positive and negative controls in parallel for each specific test. To evaluate inter laboratory reproducibility, the strains LMG 9065, LMG 9861, LMG 9871, LMG 10229 and LMG 24291Twere tested in parallel in two different laboratories in different countries (Chile and Spain).

Results and discussion

Molecular identification and phylogeny

Table 1 shows that 46 of the 52 strains gave RFLP patterns defined by Figueras et al. (2008) for A. cryaerophilus and 4 showed the one for A. butzleri (FE7, ME15-4, LMG 9863, and LMG 9871). However, strains NAV12-2 and MC2-2 produced a new RFLP pattern different to the described ones (Figueras et al., 2008, 2012). From the 46 strains that gave the pattern of A. cryaerophilus, 34 gave the pattern of the subgroup 1B (including the in silico simulated patterns obtained from the 16S rRNA genes of the 6 GenBank genomes L397- L401 and L406) and 12 the one of the subgroup 1A. This demonstrated once more that subgroup 1B is more abundant than 1A, in agreement with results of previous studies (Debruyne et al., 2010; Collado and Figueras, 2011; Fisher et al., 2014). As Figueras et al. (2012) explained when describing the 16S rDNA-RFLP identification method, different RFLP patterns from those expected for the Arcobacter spp. can obtained for new species or might be due to the existence of a mutation on the targeted site of the endonucleases in a known species. The former occurred for instance in A. mytili (Collado et al., 2009) and A. molluscorum (Figueras et al., 2011a) among other species (Figueras et al., 2011b; Levican et al., 2012, 2013b, 2015). Mutations at the binding site of the endonuclease MseI were described in the strains LMG 9863 and LMG 9871 (used in this study, Table 1), but in this case instead of resulting in a new pattern they were responsible for generating the pattern for A. butzleri instead of A. cryaerophilus (Figueras et al., 2012). The MLPA with the concatenated sequences (2,408 bp) of the four housekeeping genes (gyrB, rpoB, atpA, and cpn60) of the 52 strains showed that they grouped into four main clusters (Figure 1). Cluster I had 36 strains, most of them (88.8%) from the subgroup 1B, and included the reference strain for the 1B subgroup LMG 10829. The other four strain of this cluster presented the pattern of subgroup 1A (n = 2) and a different pattern to those described (n = 2). Cluster II (n = 6) corresponded to the four strains that showed a 16S rDNA-RFLP pattern similar to the one described for A. butzleri (Figueras et al., 2008) and two other strains with the patterns for the subgroups 1A and 1B. Cluster III, included the type strain of A. cryaerophilus LMG 24291T and three field isolates from Chilean animals all belonging to the subgroup 1A, and Cluster IV comprised six strains, mostly from subgroup 1A (n = 5). Interestingly, strains recovered from human specimens belonged exclusively to Cluster I, suggesting potential host specificity because strains associated with farm animal abortions were present in the four clusters (Figure 1).
Figure 1

Neighbor joining tree based on the concatenated sequences of gyrB, rpoB, atpA, and hsp60 (2,408 bp) genes showing the distribution of the 52 strains in four clusters. Bootstrap values (>50%) based on 1,000 replications are shown at the nodes of the tree. Bar indicates 1 substitutions per 100 bp. Isolation source: •, Human; ◦, Animal; ■, Shellfish; ▴, Water.

Neighbor joining tree based on the concatenated sequences of gyrB, rpoB, atpA, and hsp60 (2,408 bp) genes showing the distribution of the 52 strains in four clusters. Bootstrap values (>50%) based on 1,000 replications are shown at the nodes of the tree. Bar indicates 1 substitutions per 100 bp. Isolation source: •, Human; ◦, Animal; ■, Shellfish; ▴, Water. A representative type strain was selected from each cluster (I–IV) for further analysis and for constructing a 16S rRNA gene phylogenetic tree (Figure 2). The tree showed that the four strains formed separated branches, strains LMG 24291T and LMG 29976T being the nearest ones. The percentage of similarity of the 16S rRNA gene between the type strains ranged from 99.5% between strains LMG 10229T (Cluster I) and LMG 9065T (Cluster II) to 99.9% between the original type strain of A. cryaerophilus LMG 24291T (Cluster III) and the representative strain of Cluster IV (LMG 29976T). These results agree with what occurs between other species of Arcobacter, such as A. ellisii and A. cloacae (Figueras et al., 2011b; Levican et al., 2013b), where the 16S rRNA gene does not have enough resolution to differentiate the species. The phylogeny of the 23S rRNA gene (Supplementary Figure S1) and the one carried out with the concatenated sequences of the two rRNA genes (Supplementary Figure S2) presented the same topology shown with the 16S rRNA gene (Figure 2) and confirmed that the strains of Cluster III are more closely related to Cluster IV than to the other clusters.
Figure 2

Neighbor joining tree based on 16S rRNA (1,496 bp) sequences showing the phylogenetic position of the representative strains of the four clusters of A. cryaerophilus within the genus Arcobacter. Bootstrap values (>50%) based on 1,000 replications are shown at the nodes of the tree. Bar, 5 substitutions per 1,000 bp.

Neighbor joining tree based on 16S rRNA (1,496 bp) sequences showing the phylogenetic position of the representative strains of the four clusters of A. cryaerophilus within the genus Arcobacter. Bootstrap values (>50%) based on 1,000 replications are shown at the nodes of the tree. Bar, 5 substitutions per 1,000 bp.

Genome analysis

The characteristics of the 13 compared genomes (8 representatives of Cluster I, two of clusters II and IV and one of Cluster III) are shown in Table 2. The quality of the genome sequences was in general in agreement with the minimal standards established for the use of genome data for taxonomical purposes, that embraces characteristics of the sequencing and assembly of the genomes like the depth of coverage, the value of N50 and the number of contigs (Chun et al., 2018). The exceptions were the genome sequence data of strains LMG 10229T, LMG 9871, and LMG 9861 that presented a depth of coverage lower than 50X proposed in the standards (Table 2). Globally, the genomic characteristics of the 13 compared genomes shown in Table 2 were very similar, with sizes that did not differ in more than 0.29 Mb, with a %mol G+C content ranging between 27.0 and 30.0% and with a number of coding sequences or CDS of around 2000 (Table 2). The G+C values were in agreement with those (24.6–31%) described in the recent emended description of the genus Arcobacter (Sasi Jyothsna et al., 2013). Table 3 shows the results from the calculated overall genome related taxonomical indices i.e., ANI and isDDH. For species delineation the generally accepted ANI and isDDH boundary values are 95–96 and 70%, respectively (Goris et al., 2007; Richter and Rossello-Mora, 2009; Meier-Kolthoff et al., 2013; Chun et al., 2018). However, for the genus Arcobacter, ANI values above 96% were the ones that better correlated with isDDH results above 70% in previous studies (Figueras et al., 2017; Pérez-Cataluña et al., 2018) in agreement with what happens in other genera (Beaz-Hidalgo et al., 2015; Figueras et al., 2017; Liu et al., 2017). The ANI values of the representative strains from each of the four different clusters were below the 96% cut-off indicating that the compared genomes belonged to different species, while the intra-cluster ANI values ranged from 96.6 to 98.6%. The isDDH results of <70% found between strains of the four clusters confirmed as the ANI results did that each cluster represented an independent species. The core genome phylogenetic tree inferred from 893 protein sequences of the 13 genomes obtained with PATRIC showed that the genomes also grouped into four different well-supported clusters with bootstraps of 100% (Figure 3). Interestingly, clusters IV and I formed a separate branch from clusters II and III. This indicates that the proteins of the genomes of clusters I and IV are more similar than the nucleotide sequences, and this was also in agreement with the higher values observed with ANI and isDDH for these two clusters.
Table 3

Results of Average Nucleotide Identity (ANI) and in silico DNA-DNA hibridization (isDDH) between representative genomes of the four clusters.

CLUSTER
IIIIIIIV
LMG 10229TLMG 9861L397L398L399L400L401L406LMG 9065TLMG 9871LMG 24291TLMG 10210LMG 29976T
Cluster I
LMG 10229T74.774.372.373.573.571.975.750.350.450.163.762.0
LMG 986197.269.669.668.770.369.772.350.249.649.763.062.1
L39797.196.673.575.378.471.982.149.349.049.260.159.9
L39896.996.697.178.474.987.775.349.750.349.360.159.7
L39997.196.697.397.775.379.075.849.849.549.560.259.2
L40097.196.797.797.297.374.183.449.149.349.160.359.7
L40196.996.697.198.697.797.174.649.949.950.560.759.5
L40697.396.998.197.397.498.297.220.150.049.761.760.5
Cluster II
LMG 9065T93.193.192.993.193.192.993.193.081.456.151.151.7
LMG 987193.193.192.993.193.092.993.193.098.149.751.052.2
Cluster III
LMG 24291T92.992.892.692.692.792.592.992.994.394.452.354.4
Cluster IV
LMG 1021095.795.895.295.295.395.395.295.593.593.493.473.4
LMG 29976T95.295.595.194.994.995.194.995.193.593.593.997.1

Values in bold in the lower triangle corresponds to ANI and in the upper triangle to isDDH.

Figure 3

Maximum likelihood tree based on the concatenated sequences of 893 core protein sequences showing the distribution of the 13 genomes used in the same four clusters shown in Figure 1. Boostrap values based on 1,000 replications are shown at the nodes of the tree. Bar indicates 2 substitution per 100 aa.

Characteristics of the 13 genomes from representative strains from each of the clusters. Genome accession number; ncRNAs, non-coding RNAs. Results of Average Nucleotide Identity (ANI) and in silico DNA-DNA hibridization (isDDH) between representative genomes of the four clusters. Values in bold in the lower triangle corresponds to ANI and in the upper triangle to isDDH. Maximum likelihood tree based on the concatenated sequences of 893 core protein sequences showing the distribution of the 13 genomes used in the same four clusters shown in Figure 1. Boostrap values based on 1,000 replications are shown at the nodes of the tree. Bar indicates 2 substitution per 100 aa. Of the different methods and databases used for recognizing virulence factors (Victors, VFDB and PATRIC_VF) none of them were useful for recognizing virulence genes. There were only a few exceptions. The phospholipase C identified with the databases PATRIC_VF and Victors in the genome of the strain LMG 29976T (Cluster IV). The enzyme UDP-N-acetylglucosamine 4-6 dehidratase involved in flagelline glycosylation and identified with the VFDB database in the genomes L397 and L399 (Cluster I). Finally, the Pspa protein (EC 2.3.1.41), essential for gluconeogenesis, identified using PATRIC_VF in the genome LMG 9871 of Cluster II (Table 4). However the BLASTn carried out for the detection of virulence genes showed the presence of different genes related with adhesion (cj1349), invasion (ciaB and mviN) and phospholipase activity (pldA) (Table 4). None of the genomes showed the cadF, hecA and hecB genes that encode a fibronectin binding protein, an adhesion protein and a factor for hemolysis activation, respectively. These results agree with those obtained for the genome of A. thereius LMG 24486T (Rovetto et al., 2017). The irgA and tlyA genes that encode an iron-regulated outer membrane protein and a hemolysine, respectively, were the only ones found i.e. the gene irgA in the genomes L398 and L401 (Cluster I); the gene tlyA in LMG 24291T (Cluster III) and LMG 9065T (Cluster II). The phylogenetic analysis of the concatenated sequences of the four virulence genes present in all the genomes (cj1349, mviN, pldA and ciaB) formed the same four clusters (Supplementary Figure S3). However, the distribution of the clusters was similar to the one obtained with the core genome tree (Figure 3), where clusters I and IV formed a separated branch from clusters II and III.
Table 4

Antibiotic resistant genes and virulence factors.

CLUSTERS
IIIIIIIV
12345678910111213
ANTIBIOTIC RESISTANCE
Multidrug efflux pumps
    CmeABC systema+++++++++++++
    MFS Superfamilya+++++++
Macrolids
    MacAB−TolCa+++++++++++++
Quinolones
    gyrA mutation
    23S rRNA mutations
    OqxBb+++++++++++++
β-lactamics
    β-lactamasea+d
Colistin
Mcr−1b++++++++++
Mcr−2b++++++++++
Acriflavin resistancea+++++++++++++
Streptomycin/Spectinomycina+
VIRULENCE FACTORS
Invasion
    ciaBc+++++++++++++
    mviNc+++++++++++++
Adhesion
    cj1349c+++++++++++++
    cadFc
Filamentous hemmaglutinin
    hecAc
Hemolysis
    hecBc
    tlyAc++
Outer membarne protein
    irgAc++
Phospholipase
    pldAc++++++++++++e+

1, A. cryaerophilus gv. pseudocryaerophilus LMG 10229.

RAST/PATRIC results,

ARG–ANNOT results,

BLASTn of virulence genes results (See Supplementary Table ,

β-lactamase class D,

Phospholipase A and C.

Antibiotic resistant genes and virulence factors. 1, A. cryaerophilus gv. pseudocryaerophilus LMG 10229. RAST/PATRIC results, ARG–ANNOT results, BLASTn of virulence genes results (See Supplementary Table , β-lactamase class D, Phospholipase A and C. Regarding the presence of antibiotic resistant mechanisms, all the genomes showed the cmeABC multidrug efflux pump, the MacAB-TolC system for macrolide resistance, the oxqB gene related with quinolone resistance and genes related with the resistance to acriflavine. Resistance to colistin by the genes mcr-1 and mcr-2 were present in 85% of the genomes. The genome L406 was the only one that possessed a β-lactamase gene of class D. Resistance to β-lactamic compounds have been reported in other studies (Atabay and Aydin, 2001; Fera et al., 2003) and the same β-lactamase gene is present in the genome of A. butleri RM4018. However, this gene is absent in the genome of A. thereius LMG 24486T (Rovetto et al., 2017). The genome LMG 29976 was the only one that presented genes for the resistance to streptomycin/spectomycin. The susceptibility of A. cryaerophilus to streptomycin has been previously demonstrated (Kabeya et al., 2004; Rahimi, 2014). However, this is the first report that show the presence of resistance genes to this antimicrobial compound. Mutations on the 23S rRNA (Ren et al., 2011) and the gyrA gene (Carattoli et al., 2002) for erythromycin and quinolone resistance were not detected, despite gyrA mutations have been found in some quinolone-resistant A. cryaerophilus strains (Abdelbaqi et al., 2007; Van den Abeele et al., 2016).

Functional and phenotypic inference

Several subsystems where found to be characteristic of each Cluster on the basis of the functional-based comparison between the representative genomes (Figure 4). Cluster I genomes (LMG 10229T and LMG 9861) carry specifically multi-subunit cation antiporters [Na(+) H(+) cation antiporter ABCDEFG] whose function includes sodium tolerance and pH homeostasis in an alkaline environment (Ito et al., 2017). Cluster II genomes (LMG 9065T and LMG 9871) were the only ones that did not show the chromate transport protein ChrA, which confers resistance to chromate compounds present in the other studied genomes. Cluster III (LMG 24291T) was the only one that presented transposable elements as the putative transposase TniA and the Nucleotide Triphosphate binding protein TniB. Finally, the enzyme Adenosine deaminase (EC 3.5.4.4) involved in purine metabolism was only detected in Cluster IV genomes (LMG 29976T and LMG 10210).
Figure 4

Function based comparison between the representative genomes of each cluster using RAST annotation results. Black squares represent de presence and white squares the absence of each subsystem/protein. 1-3 Polar lipids genes: 1. Phosphatidilglycerol phosphatase A (pspA); 2. phosphatidate cytidylyltransferase (cdsA); 3. phosphatidyl serine decarboxylase (psd). Genes pspA and cdsA are involved in the synthesis of phosphatidilglycerol (PG) and psd gene in the synthesis of phosphatidylethanolamine (PE).

Function based comparison between the representative genomes of each cluster using RAST annotation results. Black squares represent de presence and white squares the absence of each subsystem/protein. 1-3 Polar lipids genes: 1. Phosphatidilglycerol phosphatase A (pspA); 2. phosphatidate cytidylyltransferase (cdsA); 3. phosphatidyl serine decarboxylase (psd). Genes pspA and cdsA are involved in the synthesis of phosphatidilglycerol (PG) and psd gene in the synthesis of phosphatidylethanolamine (PE). From the 67 phenotypic inferred traits analyzed with Traitar 11(16.4%) were found in all the analyzed genomes while 12 were only found in some of them (Figure 5). The genomes of Cluster I (LMG 10229T and LMG 9861) were predicted to produce hydrogen sulfide while those of Cluster IV (LMG 29976T and LMG 10210) showed acetate utilization and bile susceptibility. However, none of these characteristics have been observed when they have been tested in the laboratory on those strains. This might be due to the inability to reproduce the necessary conditions in the laboratory for the expression of these features. None of the other nine traits recognized in some genomes enabled us to differentiate between the IV Clusters.
Figure 5

Phenotypic inference using Traitar software for the representative genomes of each cluster. The software uses two prediction models, the phypat model (predicts the presence/absence of proteins found in the phenotype of 234 bacterial species) and a combination of phypat+PGL models (uses the information of phypat combined with the information of the acquisition or loss of protein families and phenotypes through the evolution), to determine the phenotypic characteristics.

Phenotypic inference using Traitar software for the representative genomes of each cluster. The software uses two prediction models, the phypat model (predicts the presence/absence of proteins found in the phenotype of 234 bacterial species) and a combination of phypat+PGL models (uses the information of phypat combined with the information of the acquisition or loss of protein families and phenotypes through the evolution), to determine the phenotypic characteristics.

Phenotypical characterization

Table 5 shows the phenotypical results obtained from the strains of each of the four clusters. In agreement with what was found in previous studies where phenotypic test did not differentiate between subgroups 1A and 1B (Neill et al., 1985; Vandamme et al., 1992; On, 1996), none of the performed phenotypic tests enabled to clearly distinguish strains from each of the four phylogenetic clusters. Most of the tests gave variable results except for Cluster IV. However, this might be due to the small number of strains (n = 2) analyzed in this group. Considering these results, each of the three genetically recognized new species (clusters I, II, and IV) should be considered a different genomovar (gv.) of the species A. cryaerophilus. A genomovar is a well-delimited group of strains that correspond to a new species by genomic information but that cannot be phenotypically differentiated (Ursing et al., 1995). Cluster III represents the original species A. cryaerophilus because it embraces the type strain of the species. The value of the phenotypic characterization has already been questioned considering the lack of reproducibility of results between laboratories and some authors have suggested it is now time to base the description of new taxa on the genome sequence analysis (Moore et al., 2010; Sutcliffe, 2015). According to Sutcliffe (2015), phenotypic characterization is harder to evaluate nowadays than the genotype. Considering that genomic characterization is objective and reproducible, we agree with Sutcliffe (2015) that we should be able to define species on the basis of genetic characters like the ones evaluated in this study. This will favor the faster discover of the large number of taxa waiting to be described (Sutcliffe, 2015). However, this will require a modification of the Bacteriological Code, which we hope will happen in the near future.
Table 5

Phenotypic characteristics of the four clusters.

Characteristics1234
Growth in/on
Air at 37°C+V(+)+
Microaerobiosis at 37°C+V(+)V(+)
Anaerobiosis at 37°CV(+)V(−)V(−)
Air at 42°CV(−)
2% (w/v) NaClV(+)+V(+)+
3% NaClV(−)
4% (w/v) NaCl
1% bilis+V(+)V(+)+
1.5% bilis++V(+)+
2% bilis++V(+)+
1% (w/v) glycineV(−)
0.1% sodium deoxycholateV(+)V(+)V(+)+
MacConkeyV(+)+V(+)+
CdCl2V(−)V(−)V(−)V
Resistance to
Cefoperazone (64 mg/L)V(+)+++
Enzyme activity
Catalase++++

Taxa: 1, A. cryaerophilus gv. pseudocryaerophilus (n = 27) [Cluster I]; 2, A. cryaerophilus gv. crypticus (n = 5) [Cluster II]; 3, A. cryaerophilus gv. cryaerophilus(n = 3) [Cluster III]; 4, A. cryaerophilus gv. occultus (n = 2) [Cluster IV]. The specific responses for type strains were coincidental or expressed in brackets. Unless otherwise indicated: +, ≥95% strains positive; −, ≤ 11% strains positive; V, variable; (), main result of the strains; CO.

Phenotypic characteristics of the four clusters. Taxa: 1, A. cryaerophilus gv. pseudocryaerophilus (n = 27) [Cluster I]; 2, A. cryaerophilus gv. crypticus (n = 5) [Cluster II]; 3, A. cryaerophilus gv. cryaerophilus(n = 3) [Cluster III]; 4, A. cryaerophilus gv. occultus (n = 2) [Cluster IV]. The specific responses for type strains were coincidental or expressed in brackets. Unless otherwise indicated: +, ≥95% strains positive; −, ≤ 11% strains positive; V, variable; (), main result of the strains; CO.

Conclusion

The phylogenetic and genomic analysis showed that the strains of the species A. cryaerophilus represent four separated species. In addition, phenotypical and functional traits were in evidence for the genomes selected as representative of each cluster. Despite all the results, phenotypic characterization carried out at the laboratory showed a high inter- and intra-cluster variability that did not allow us to determine specific phenotypic characteristics or therefore to define the three uncovered clusters as three new species. Following current bacterial taxonomic rules, we will not be able to define these species until we find phenotypical characteristics that allow us to discriminate the three new species from each other and from the species A. cryaerophilus. Therefore, we describe them as four genomovars with the names “A. cryaerophilus gv. pseudocryaerophilus” (pseu.do.cry.a.e.ro'phi.lus. Gr. adj. pseudês false, N.L. masc. adj. cryaerophilus specific epithet of an Arcobacter species; N.L. masc. adj. pseudocryaerophilus false cryaerophilus; Cluster I = LMG 10229T), “A. cryaerophilus gv. crypticus” (cryp'ti.cus. L. masc. adj. Crypticus hidden; Cluster II = LMG 9065T), A. cryaerophilus gv. cryaerophilus (Cluster III = LMG 24291T) and “A. cryaerophilus gv. occultus” (oc.cul′tus. L. adj. occultus occulted, hidden; Cluster IV = LMG 29976T). Unfortunately, the phenotype derived from the genome could not be reproduced in the laboratory, either. This might be due to the inability to mimic in vitro the conditions for the expression of these pathways or characteristics. The phenotypic characterization limits a proper description and it might be considered an important shortcoming in the genomic era in which all the molecular and genomic data leave no doubts about the existence of four different species among the investigated A. cryaerophilus strains.

Authors contributions

LC and MF: designed the work; LC and OS: carried out the phylogenetic analysis; VL and AP-C: carried out the phenotypic characterization of the strains; AP-C: carried out the genome sequencing and analysis; LC, MF, and AP-C: wrote the paper.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  71 in total

1.  Arcobacter molluscorum sp. nov., a new species isolated from shellfish.

Authors:  Maria José Figueras; Luis Collado; Arturo Levican; Jessica Perez; Maria Josep Solsona; Clara Yustes
Journal:  Syst Appl Microbiol       Date:  2010-12-24       Impact factor: 4.022

2.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

3.  Restriction fragment length polymorphisms in the ribosomal genes for species identification and subtyping of aerotolerant Campylobacter species.

Authors:  J A Kiehlbauch; B D Plikaytis; B Swaminathan; D N Cameron; I K Wachsmuth
Journal:  J Clin Microbiol       Date:  1991-08       Impact factor: 5.948

4.  Susceptibility of Arcobacter butzleri isolates to 23 antimicrobial agents.

Authors:  H I Atabay; F Aydin
Journal:  Lett Appl Microbiol       Date:  2001-12       Impact factor: 2.858

5.  Arcobacter halophilus sp. nov., the first obligate halophile in the genus Arcobacter.

Authors:  Stuart P Donachie; John P Bowman; Stephen L W On; Maqsudul Alam
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

6.  Reassessment of the taxonomy of Arcobacter cryaerophilus.

Authors:  Lies Debruyne; Kurt Houf; Laid Douidah; Sarah De Smet; Peter Vandamme
Journal:  Syst Appl Microbiol       Date:  2009-11-28       Impact factor: 4.022

7.  Presence of Arcobacter spp. in environmental waters correlates with high levels of fecal pollution.

Authors:  Luis Collado; Isabel Inza; Josep Guarro; Maria Jose Figueras
Journal:  Environ Microbiol       Date:  2008-01-21       Impact factor: 5.491

8.  Prevalence of Arcobacter species in retail meats and antimicrobial susceptibility of the isolates in Japan.

Authors:  Hidenori Kabeya; Soichi Maruyama; Yukio Morita; Tomoko Ohsuga; Saori Ozawa; Yoko Kobayashi; Maiko Abe; Yasuji Katsube; Takeshi Mikami
Journal:  Int J Food Microbiol       Date:  2004-02-01       Impact factor: 5.277

9.  Arcobacter anaerophilus sp. nov., isolated from an estuarine sediment and emended description of the genus Arcobacter.

Authors:  T S Sasi Jyothsna; K Rahul; E V V Ramaprasad; Ch Sasikala; Ch V Ramana
Journal:  Int J Syst Evol Microbiol       Date:  2013-08-05       Impact factor: 2.747

Review 10.  Mrp Antiporters Have Important Roles in Diverse Bacteria and Archaea.

Authors:  Masahiro Ito; Masato Morino; Terry A Krulwich
Journal:  Front Microbiol       Date:  2017-11-23       Impact factor: 5.640

View more
  9 in total

Review 1.  The unexpected habitat in sewer pipes for the propagation of microbial communities and their imprint on urban waters.

Authors:  Sandra L McLellan; Adélaïde Roguet
Journal:  Curr Opin Biotechnol       Date:  2019-01-22       Impact factor: 9.740

2.  Revisiting the Taxonomy of the Genus Arcobacter: Getting Order From the Chaos.

Authors:  Alba Pérez-Cataluña; Nuria Salas-Massó; Ana L Diéguez; Sabela Balboa; Alberto Lema; Jesús L Romalde; Maria J Figueras
Journal:  Front Microbiol       Date:  2018-09-04       Impact factor: 5.640

3.  Prevalence and antimicrobial susceptibility of Arcobacter species in human stool samples derived from out- and inpatients: the prospective German Arcobacter prevalence study Arcopath.

Authors:  Greta Gölz; Markus M Heimesaat; Vanessa Brückner; Ulrike Fiebiger; Ralf Ignatius; Johannes Friesen; Martin Eisenblätter; Marlies Höck; Thomas Alter; Stefan Bereswill
Journal:  Gut Pathog       Date:  2020-04-15       Impact factor: 4.181

4.  Taxogenomics Resolves Conflict in the Genus Rhodobacter: A Two and Half Decades Pending Thought to Reclassify the Genus Rhodobacter.

Authors:  G Suresh; Tushar D Lodha; B Indu; Ch Sasikala; Ch V Ramana
Journal:  Front Microbiol       Date:  2019-10-31       Impact factor: 5.640

5.  Characterization of Arcobacter strains isolated from human stool samples: results from the prospective German prevalence study Arcopath.

Authors:  Markus M Heimesaat; Greta Gölz; Vanessa Brückner; Ulrike Fiebiger; Ralf Ignatius; Johannes Friesen; Martin Eisenblätter; Marlies Höck; Thomas Alter; Stefan Bereswill
Journal:  Gut Pathog       Date:  2020-01-08       Impact factor: 4.181

6.  Antimicrobial Susceptibility and Genomic Analysis of Aliarcobacter cibarius and Aliarcobacter thereius, Two Rarely Detected Aliarcobacter Species.

Authors:  Ingrid Hänel; Eva Müller; Belén González Santamarina; Herbert Tomaso; Helmut Hotzel; Anne Busch
Journal:  Front Cell Infect Microbiol       Date:  2021-03-17       Impact factor: 5.293

7.  Alkalihalobacterium elongatum gen. nov. sp. nov.: An Antibiotic-Producing Bacterium Isolated From Lonar Lake and Reclassification of the Genus Alkalihalobacillus Into Seven Novel Genera.

Authors:  Amaraja Joshi; Sonia Thite; Prachi Karodi; Neetha Joseph; Tushar Lodha
Journal:  Front Microbiol       Date:  2021-10-11       Impact factor: 5.640

8.  Guts of the Urban Ecosystem: Microbial Ecology of Sewer Infrastructure.

Authors:  Adélaïde Roguet; Ryan J Newton; A Murat Eren; Sandra L McLellan
Journal:  mSystems       Date:  2022-06-28       Impact factor: 7.324

9.  Species classification and novel plasmid identifications in Arcobacter cryaerophilus and Arcobacter cryaerophilus-like organisms.

Authors:  Guilan Zhou; Min Wang; Hairui Wang; Xiaoli Chen; Yixin Gu; Zhujun Shao; Jianzhong Zhang; Maojun Zhang
Journal:  Front Microbiol       Date:  2022-09-21       Impact factor: 6.064

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.