Thea G Fennell1,2,3, Grace A Blackwell4,1, Nicholas R Thomson5,1, Matthew J Dorman1,2. 1. Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, UK. 2. Churchill College, Storey's Way, Cambridge, CB3 0DS, UK. 3. Present address: Sainsbury Laboratory, University of Cambridge, Bateman Street, Cambridge, UK. 4. EMBL-EBI, Wellcome Genome Campus, Hinxton, CB10 1SD, UK. 5. London School of Hygiene and Tropical Medicine, Keppel St., Bloomsbury, London, WC1E 7HT, UK.
Abstract
Members of the bacterial genus Vibrio utilize chitin both as a metabolic substrate and a signal to activate natural competence. Vibrio cholerae is a bacterial enteric pathogen, sub-lineages of which can cause pandemic cholera. However, the chitin metabolic pathway in V. cholerae has been dissected using only a limited number of laboratory strains of this species. Here, we survey the complement of key chitin metabolism genes amongst 195 diverse V. cholerae. We show that the gene encoding GbpA, known to be an important colonization and virulence factor in pandemic isolates, is not ubiquitous amongst V. cholerae. We also identify a putatively novel chitinase, and present experimental evidence in support of its functionality. Our data indicate that the chitin metabolic pathway within V. cholerae is more complex than previously thought, and emphasize the importance of considering genes and functions in the context of a species in its entirety, rather than simply relying on traditional reference strains.
Members of the bacterial genus Vibrio utilize chitin both as a metabolic substrate and a signal to activate natural competence. Vibrio cholerae is a bacterial enteric pathogen, sub-lineages of which can cause pandemic cholera. However, the chitin metabolic pathway in V. cholerae has been dissected using only a limited number of laboratory strains of this species. Here, we survey the complement of key chitin metabolism genes amongst 195 diverse V. cholerae. We show that the gene encoding GbpA, known to be an important colonization and virulence factor in pandemic isolates, is not ubiquitous amongst V. cholerae. We also identify a putatively novel chitinase, and present experimental evidence in support of its functionality. Our data indicate that the chitin metabolic pathway within V. cholerae is more complex than previously thought, and emphasize the importance of considering genes and functions in the context of a species in its entirety, rather than simply relying on traditional reference strains.
The authors confirm that all supporting data, code and protocols have been provided within the article or through supplementary data files.No whole-genome sequencing data were generated in this study. Accession numbers for the publicly available sequences used for these analyses are listed in Tables 2 and S1 (available in the online version of this article), and the Methods.Metadata, accession numbers and references for each genome included in this study are provided in Table S1, deposited in Figshare: (https://dx.doi.org/10.6084/m9.figshare.14398688).Supplementary tables, figures and references are available in the Supplementary Material file, deposited in Figshare: (https://dx.doi.org/10.6084/m9.figshare.14398688).All other data which underpin the figures in this paper, including pangenome data matrices, modified and unmodified sequence alignments and phylogenetic trees, original images of gels and immunoblots, raw fluorescence data, amplicon sequencing reads, and the R code used to generate Fig. 5, are available in Figshare: https://dx.doi.org/10.6084/m9.figshare.13169189 (DOI).It is thought that the ability to metabolize chitin is ubiquitous amongst species, and that this enables these species to survive in aqueous and estuarine environmental contexts. Although chitin metabolism pathways have been detailed in several members of this genus, little is known about how these processes vary within a single species. Here, we present the distribution of genes encoding key chitinase and chitin-binding proteins across diverse , and show that our canonical understanding of this pathway in this species is challenged when isolates from non-pandemic lineages are considered alongside those linked to pandemics. Furthermore, we show that genes previously thought to be species core genes are not in fact ubiquitous, we identify novel components of the chitin metabolic cascade in this species, and we present functional validation for these observations.
Introduction
The genus of marine gammaproteobacteria contains a number of virulent human pathogens, of significant public health concern [1]. Most notorious of these pathogens is , members of which are the aetiological agent of cholera in humans [2, 3]. Two biochemically defined and distinct biotypes are associated with cholera pandemics. Classical biotype are believed to have caused the first six pandemics [2-4], whilst the current seventh pandemic (1961 to the present) is attributed to El Tor biotype [5, 6]. Genomic evidence has shown that classical form a discrete phylogenetic lineage from the lineage causing the seventh cholera pandemic, dubbed the seventh pandemic El Tor lineage (7PET) [7-12]. It is to these two pandemic lineages that commonly used El Tor and classical biotype laboratory strains belong. Although cholera is estimated to cause 100 000 deaths annually worldwide [13], other species can also cause enteric and extraintestinal disease in humans. For example, can cause septicaemia and systemic infection in humans [14], and can cause gastrointestinal infection, septicaemia and wound infections [1, 15]. Other vibrios may be pathogenic to livestock and other animals, such as Vibrio nigripulchritudino, which is a pathogen of farmed shrimp [16, 17], , which causes vibriosis in multiple species of fish [18], and , which is an important pathogen of both fish and shrimp [19].In spite of differences in the types of disease which species may cause, there are several commonalities amongst members of this genus. For example, it has been suggested that the ability to grow on chitin is a ubiquitous phenotype amongst the [20], and therefore that all species are capable of metabolizing chitin, a highly abundant polymer of N-acetyl-β-d-glucosamine (GlcNAc) [21]. This is directly relevant to the environmental lifestyles of vibrios – for example, colonizes and grows on the surface of chitinaceous animals such as shellfish [22]. secretes a chitinase and can adsorb onto particulate chitin and copepods [23]. Similarly, can metabolize chitin [24], has chitinase activity and can adsorb on chitinous substrates [25], and can colonize chitinous surfaces such as those of copepods [26]. Chitin metabolism is linked to other aspects of biology, including the regulation of natural competence [27-29], and to the survival of in the context of the intestine during an infection [30].The pathways by which chitin is degraded and utilized by have been described in detail [31], as it has been in other members of the genus (e.g. [32-39]). Although a comprehensive review of the chitin utilization pathway is beyond the scope of this paper, it is important to appreciate the complexity of this pathway. Chitin degradation, import and metabolism in involves at least 27 proteins, 24 encoded by genes on chromosome 1, and three by genes on the smaller chromosome 2 [20]. Here, focus will be directed to the initial stages of chitin metabolism – adhesion to a chitinous substrate, and expression of extracellular degradative chitinase enzymes.The first step in chitin metabolism is the attachment of to chitinaceous surfaces through interactions with GlcNAc. This is mediated both by the mannose-sensitive haemagglutinin (MSHA) pilus and the chitin adhesin GbpA (encoded by VC_A0811, accession no. AAF96709.1) [24, 40]. Although GbpA was initially identified as a putative chitinase enzyme [24], it was shown to be an adhesin induced by GlcNAc which enabled to attach to chitinous substrates [24]. Subsequently, it was found that as well as mediating attachment of to chitin, GbpA is also required for the successful colonization of the intestine [40]. This is thought to occur through interactions with mucin – GbpA interacts with mucin in the intestine, and gbpA transcription increases upon exposure of to mucin [41]. The crystal structure and domain architecture of GbpA have been determined [42], and the fourth domain of GbpA is structurally similar to the chitin-binding domain of known chitinases [42]. Evidence also suggests that GbpA has lytic polysaccharide monooxygenase activity [43], and that GbpA activity is higher at low population densities due to the activity of quorum-sensing-regulated proteases [44].Once adheres to a chitinaceous surface, extracellular endochitinase enzymes are required for the bacterium to hydrolyse complex chitin polymers into oligosaccharides which can be imported into the cell for further metabolism [35]. As many as seven putative endochitinases have been identified in [20, 45, 46], two of which, ChiA-1 (encoded by VC_1952, accession no. AAF95100.1) and ChiA-2 (encoded by VC_A0027, accession no. AAF95941.1), are the principal chitinases required for chitin catabolism [24, 31, 45, 47]. ChiA-1 was first shown to be an extracellular chitinase in 1998 [48]; subsequently, ChiA-2 was shown to be important for intestinal colonization and for metabolizing mucin in the intestine by strain N16961 (N16961) [30]. ChiA-2 is also the most highly expressed chitinase in El Tor biotype strain E7946 [45]. Both ChiA-1 and ChiA-2 are essential for to grow in media supplemented with colloidal chitin [24]. Once chitin oligomers have been digested by extracellular chitinases, the resultant oligosaccharides are thought to enter the bacterial periplasm via the chitoporin ChiP (encoded by VC_0972, accession no. AAF94134.1) and by other as-yet-uncharacterized porins [24, 36, 49], and subsequently transported to the cytoplasm via phosphotransferase (PTS) and ABC-type transporters [20, 36] (Fig. 1).
Fig. 1.
Initial steps in chitin uptake and catabolism. Schematic summarizing the principal stages in chitin degradation and import by . Comprehensive descriptions of this pathway are reported in the literature [20, 36]. The MSHA adhesin has not been included in this diagram.
Initial steps in chitin uptake and catabolism. Schematic summarizing the principal stages in chitin degradation and import by . Comprehensive descriptions of this pathway are reported in the literature [20, 36]. The MSHA adhesin has not been included in this diagram.Previous work used the genomes of 20 diverse (including seven ) to determine the presence and absence of genes involved in metabolizing chitin across this family of bacteria [20]. However, it is important to note that the chitin degradation pathway of has been described using reference strains of the species (particularly N16961 [24]), and although data exist on how the chitin catabolism pathway varies amongst members of the genus [20], less is known about how this pathway varies within a single species. This is particularly relevant because emerging evidence suggests that non-7PET lineages of cause different patterns of disease, even if they harbour some or all of the canonical pathogenicity determinants associated with cholera cases [8]. However, because the chitin metabolic pathway has principally been studied in N16961, a 7PET strain, we know little about the extent to which it varies amongst non-pandemic members of .In this study, we focused specifically on genes that encode components of the initial steps of the chitin degradation pathway across . We focused on these because the functions of many of these genes have been characterized experimentally, and we sought to determine how well the observations in the literature reflect the true distribution of these genes, and their functions, across a diverse species. We generated a pangenome from 195 annotated genome sequences, which were chosen to obtain as balanced and unbiased a view of as possible (i.e. without focusing solely on epidemic and pandemic lineages). We find that the distribution of these genes is not uniform within , and we identify variation amongst the chitinases encoded by diverse . We also identify a putatively novel chitinase gene, and present experimental evidence in support of its functional classification.
Methods
Strains, plasmids and oligonucleotides
Strains, plasmids and oligonucleotide primers (Sigma-Aldrich) used for experimental work in this study are listed in Table 1. Bacteria were cultured routinely on LB media supplemented with chloramphenicol (10 µg ml−1; LB-Cm) where appropriate.
Table 1.
Strains, plasmids and oligonucleotides used in this study
Restriction enzyme recognition sites are underlined. The primer sequence incorporating a C-terminal 6xHis translational fusion into chiA-3 is presented in lowercase, and the sequences of ribosome binding sites, start and STOP codons are in bold. CmR, chloramphenicol resistant; StrR, streptomycin resistant.
Internal strain ID
Strain name
Genotype/details
Source/reference
Vibrio cholerae
MJD382
NCTC 30
Non-pandemic V. cholerae harbouring chiA-3
Thomson lab stocks; National Centre for Type Cultures, batch 3, sequenced previously [65]
Strains, plasmids and oligonucleotides used in this studyRestriction enzyme recognition sites are underlined. The primer sequence incorporating a C-terminal 6xHis translational fusion into chiA-3 is presented in lowercase, and the sequences of ribosome binding sites, start and STOP codons are in bold. CmR, chloramphenicol resistant; StrR, streptomycin resistant.Internal strain IDStrain nameGenotype/detailsSource/referenceMJD382NCTC 30Non-pandemic harbouring chiA-3Thomson lab stocks; National Centre for Type Cultures, batch 3, sequenced previously [65]MJD1506NEB 10-betaΔ(ara-leu) 7697 araD139 fhuA ΔlacX74 galK16 galE15 e14- ϕ80dlacZΔM15 recA1 relA1 endA1 nupG rpsL (StrR) rph spoT1 Δ(mrr-hsdRMS-mcrBC)New England BiolabsMJD1507NEB NiCo21(DE3)can::CBD fhuA2 [lon] ompT gal (λ DE3) [dcm] arnA::CBD slyD::CBD glmS6Ala ∆hsdS λ DE3=λ sBamHIo ∆EcoRI-B int::(lacI::PlacUV5::T7 gene1) i21 ∆nin5New England BiolabsMJD1481MJD148110-beta harbouring pMJD157. CmRThis studyMJD1495MJD149510-beta harbouring pBAD33. CmRThis studyMJD1496MJD1496NiCo21(DE3) harbouring pMJD157. CmRThis studyMJD1499MJD1499NiCo21(DE3) harbouring pBAD33. CmRThis studyPlasmid nameGenotype/detailsSource/referencepBAD33Arabinose-inducible expression plasmid; pACYC184 replication origin. CmRThomson lab stocks [88]pMJD157chiA-3 cloned into pBAD33. CmRThis studyPrimer IDOther nameSequence 5′−3′oMJD202TF_SacI_Chitinase_FGCGAGCTC
AGGAGGATCTCTATGAAAAAAACAGTCATTGCTACCoMJD203TF_Chitinase_6xHis_STOP_SalI_RGCGTCGAC
TTAgtgatggtgatggtgatgTTTGATCGTTTCAAACATGGToMJD204pBAD33_check_FGCCATAGCATTTTTATCCATAoMJD205pBAD33_check_RGCCAGGCAAATTCTGTTTTAT
Genome sequences and accession numbers
The 198 genome sequences used to calculate the pangenome described in this paper are listed in Table S1. Accession numbers for additional chromosome sequences to which the text refers are as follows: chromosome 2 (accession no. CP009468.1); chromosome 2 (accession no. BA000032.2). Accession numbers for the chitinase protein sequences referred to in Hunt et al. [20] and used for blastp comparisons are listed in Table 2.
Table 2.
Pairwise blastp alignments between chiA-3 and chitinases from Hunt et al. [20]
No significant alignment was found between chiA-3 and VAS14_08875, VAS14_08910, V12G01_01435, V12G01_22308 or VF1146. A multiple sequence alignment containing each of these protein sequences has been included in the Figshare repository for this study.
Chitinase gene
Species
Accession no.
Subject length (aa)
e-value
Covered by query (%)
Identity (%)
chiA-3 (self)
Vibrio cholerae
n/a
431
0
100
100
SKA34_14935
Photobacterium sp.
EAR55415.1
441
5e-116
99
39.63
SKA34_13330
Photobacterium sp.
EAR55445.1
399
6e-08
21
28.97
P3TCK_21620
Photobacterium profundum
EAS45126.1
948
2e-04
30
27.47
VAS14_08875
Vibrio angustum
EAS63573.1
560
n/a
n/a
n/a
VAS14_08910
Vibrio angustum
EAS63580.1
732
n/a
n/a
n/a
V12G01_01435
Vibrio alginolyticus
EAS76629.1
718
n/a
n/a
n/a
V12G01_22308
Vibrio alginolyticus
EAS77796.1
307
n/a
n/a
n/a
VF1146
Aliivibrio fischeri
AAW85641.1
789
n/a
n/a
n/a
VPA1177
Vibrio parahaemolyticus
BAC62520.1
430
0
100
76.57
Pairwise blastp alignments between chiA-3 and chitinases from Hunt et al. [20]No significant alignment was found between chiA-3 and VAS14_08875, VAS14_08910, V12G01_01435, V12G01_22308 or VF1146. A multiple sequence alignment containing each of these protein sequences has been included in the Figshare repository for this study.Chitinase geneSpeciesAccession no.Subject length (aa)e-valueCovered by query (%)Identity (%)chiA-3 (self)n/a4310100100SKA34_14935sp.EAR55415.14415e-1169939.63SKA34_13330sp.EAR55445.13996e-082128.97P3TCK_21620EAS45126.19482e-043027.47VAS14_08875Vibrio angustumEAS63573.1560n/an/an/aVAS14_08910Vibrio angustumEAS63580.1732n/an/an/aV12G01_01435EAS76629.1718n/an/an/aV12G01_22308EAS77796.1307n/an/an/aVF1146AAW85641.1789n/an/an/aVPA1177BAC62520.1430010076.57
Genome assemblies
genome sequences were assembled from short-read data using SPAdes v3.8.2 [50], as part of a high-throughput pipeline [51]. Assemblies were annotated automatically using Prokka v1.5 [52] and a genus-specific reference database [53]. If raw sequencing reads were unavailable for genome sequences, assemblies were downloaded and similarly annotated using the automated Prokka-based pipeline.
Pangenome and phylogenetic calculations
A pangenome was produced from 198 Prokka-annotated genome assemblies using Roary v3.12.0 [54] (parameters: '-p 10 -e --mafft -s -cd 97'). A core-gene alignment of 2520 genes and 1 096 140 nucleotides was produced from this pangenome calculation. The alignment was trimmed using trimAl v1.4.1 [55] and used to produce an alignment of 183 896 single nucleotide variants (SNVs) using SNP-sites v2.5.1 [56]. A maximum-likelihood phylogeny was produced using IQ-Tree v1.6.10 [57] from the SNV-only alignment (options: '-nt 10 -m GTR+ASC -bb 5000 -alrt 5000').
Protein sequence alignments, domain prediction and comparative genomics
Protein sequences were aligned using blastp [58] and were annotated using the InterProScan web server [59]. Comparative genomic figures were generated using blastn [58] sequence alignments and visualized using ACT v13 and v18.0.2 [60], and Easyfig v2.2.2 [61].
Confirmation of gene presence/absence by mapping
Reads were mapped to reference sequences using SMALT v0.7.4 (https://www.sanger.ac.uk/tool/smalt-0/) and the method described by Harris et al. [62], as part of automated analysis pipelines run by Wellcome Sanger Institute Pathogen Informatics. All of the software developed by Pathogen Informatics is freely available for download from GitHub under an open source licence, GNU GPL 3 (https://github.com/sanger-pathogens/vr-codebase). Ordered BAM files were visualized against reference sequences using Artemis v16 and v18.0.2, which incorporate BamView [63, 64].
Molecular cloning
Plasmid DNA was extracted from using the QIAprep Spin Miniprep kit (Qiagen; #27104). Genomic DNA (gDNA) was extracted from NCTC 30 as described previously [65]. Cloning intermediates were purified using the QIAquick PCR Purification kit (Qiagen; #28104). gDNA from NCTC 30 was used as a template from which to amplify chiA-3 using primers oMJD202 and oMJD203, high-fidelity Phusion Hot Start Flex polymerase [New England Biolabs (NEB); #M0535S] using the supplied GC buffer, DMSO (3 %, v/v, final concentration) and dNTPs (Thermo Scientific; #R0191). Twenty-nine PCR cycles were performed using the manufacturer’s protocol (annealing temperature: 55 °C, extension time: 2 min). The amplicon was purified and digested using 30 units of SacI-HF and SalI-HF (NEB; #R3156S and R3138S respectively) at 37 °C for 45 min. pBAD33 was similarly treated with SacI-HF and SalI-HF, and after 15 min incubation at 37 °C, the plasmid digestion was supplemented with 1.5 units of recombinant shrimp alkaline phosphatase (rSAP; NEB; #M0371S). Digested insert and vector were purified and ligated together at room temperature for 30 min using T4 DNA ligase (NEB; #M0202S) in approximately a 3 : 1 molar ratio. Chemically competent 10-beta (NEB; #C3019I) were transformed with ligated DNA according to the manufacturer’s instructions, and transformants were selected for on LB agar supplemented with chloramphenicol (10 µg ml−1).Chloramphenicol-resistant colonies were resuspended in 30 µl PBS. A screen for clones containing an insert into pBAD33 was carried out using 1 µl of this suspension as a PCR template using primers oMJD204 and oMJD205 and OneTaq Quickload 2X Master Mix (NEB; #M0486S), according to the manufacturer’s instructions (annealing temperature 45 °C, extension time 3 min). Plasmids were extracted from overnight cultures of clones from which PCR produced an amplicon of the expected size (1548 bp). The presence of an insertion into pBAD33 was verified by digesting purified plasmid DNA with SacI-HF and SalI-HF as described above. Plasmids were then sequence-confirmed by amplicon sequencing (GATC/Eurofins Genomics) in both directions across the pBAD33 multiple cloning site using primers oMJD204 and oMJD205. Sequence-verified plasmids were transformed into chemically competent NiCo21(DE3) cells (NEB; #C2529H) following the manufacturer’s instructions, and these transformants were used for protein expression purposes.
Protein expression and immunoblotting
Single colonies of NiCo21(DE3) harbouring pMJD157 and pBAD33 (empty vector) were inoculated into 3 ml LB-Cm and cultured at 37 °C with shaking (200 r.p.m.) for 8 h. These were then diluted 1 : 100 into baffled flasks containing 25 ml LB-Cm supplemented with either ᴅ-(+)-glucose (BDH, now VWR; #101176K) or ʟ-(+)-arabinose (Sigma-Aldrich; #A3256), both at 0.4 % (w/v) final concentration. These cultures were grown for 18 h at 23 °C with shaking (200 r.p.m.). Cells were collected by centrifugation (3900 , 5 min) and the supernatant was filter-sterilized (0.22 µm) and stored at −20 °C. Cell pellets were lysed in 3 ml BugBuster HT (Millipore; #70922–4) for 30 min at room temperature on a rotator. Debris was collected by centrifugation (3900 , 5 min) and discarded. Lysates were stored at −20 ˚C.Sixty microlitres of filtered supernatants and lysates was mixed 1 : 1 with 2× tris-glycine-SDS sample buffer (Invitrogen; #LC2676), boiled at 100 °C for 5 min, and 30 µl of each sample was used to load duplicate NuPAGE 4–12 % Bis-Tris acrylamide gels (Invitrogen; #NP0321) which were electrophoresed simultaneously, in the same gel tank. Stained and unstained protein ladders (NEB; #P7719S and #P7717S, Invitrogen; #LC5925) were used for size estimation where appropriate. One gel of the pair was stained with InstantBlue (Expedeon/Abcam; #ISB1L) according to the manufacturer’s instructions prior to imaging; the other was used for Western immunoblotting.For Western blotting, electrophoresed proteins were transferred from an acrylamide gel onto a nitrocellulose membrane using the iBlot 2 dry blotting system and transfer stack (ThermoFisher; #IB21001 and #IB23001). After transfer, the membrane was blocked for 3 h in 5 % (w/v) Marvel milk powder dissolved in PBS-Tween 20 (Marvel-PBS-T) at 4 °C, with rocking. An antibody recognizing the 6xHis epitope and directly conjugated to horseradish peroxidase (Abcam; #ab1187) was diluted in Marvel-PBS-T according to the manufacturer’s instructions and used to probe the membrane for 30 min at 4 °C, with rocking. The membrane was then washed in PBS-T for 15 min three times, and then incubated with Clarity Western ECL substrate (Bio-Rad; #170-5060) for 5 min. Luminescence signal was allowed to decay overnight, and the blot was then imaged with Amersham Hyperfilm ECL film (GE Healthcare; #28906836). Coloured protein size standards were marked manually on the developed film.
Chitinase assay
Chitinase activity was assayed using fluorogenic substrates (Sigma-Aldrich; #CS1030). The kit was used according to the manufacturer’s instructions, with the following modifications, Ten microlitres of cell lysate or supernatant was used per assay well. Five microlitres of the supplied chitinase control enzyme was used per positive control reaction, rather than a 1 : 200 dilution of the control enzyme, to ensure that fluorescence was detectable. Assays were carried out in black Nunc flat-bottomed microtitre plates (Sigma-Aldrich; #P8741), and technical triplicates were included for each sample. Once mixed, reaction plates were incubated for 30 min (37 °C, static) before the addition of stop solution. Fluorescence was measured using a FLUOstar Omega plate reader (BMG LabTech), set to excitation and emission wavelengths of 360 and 450 nm, respectively. A 1 % gain was applied to the fluorescence measured by the reader. Blank fluorescence was subtracted from each sample reading prior to analysis.
Statistics, data visualization and figure generation
Figures were produced using R v3.5.1 [66], ggpubr v0.2.3 (https://github.com/kassambara/ggpubr), ggplot2 v3.2.1 [67], ggforce v0.3.1.9000 (https://github.com/thomasp85/ggforce), and the Phandango web server [68]. Statistical tests were performed using R v3.5.1 [66]. Where required, figures were modified manually using InkScape v0.92.4 and Adobe Illustrator CC v23.1.1.
Results
Distribution of chitinase genes amongst
The key components of chitin catabolism summarized in Fig. 1 have been previously described [24, 45, 69]. The presence and absence of orthologues of each of the principal chitin-binding proteins and extracellular chitinases [45] known to be encoded by the 7PET reference strain N16961 (based on their N16961 locus identifiers) were identified in a pangenome calculated from 195 genomes, plus three species genomes used as an outgroup (Table S1). Genes that were annotated as encoding putative chitinases, as well as those genes known to be present in N16961, were identified in the pangenome (Tables S2 and S3). A phylogenetic tree was calculated using an SNV-only alignment of 2520 core genes taken from the pangenome, and the distribution of these chitinase genes across the phylogeny is presented in Fig. 2.
Fig. 2.
Distribution of chitinase genes amongst diverse . Visualization of the presence and absence of genes encoding key chitinase enzymes and chitin adhesion factors (Fig. 1). The seven genes encoding putative endochitinases described by Hayes et al. [45] are listed, as well as two additional putative chitinases identified in this analysis. The figure was generated using Phandango [68]. Isolate assignments to lineages were taken verbatim from the literature [65, 89], and are named after Domman et al. [8]. Chromosomal location assigned to genes present in N16961. The presence of VC_A0027 in three genomes was confirmed manually, as described in the text. Colour coding of gbpA, chiA-1, chiA-2 and chiA-3 is consistent among figures in this paper.
Distribution of chitinase genes amongst diverse . Visualization of the presence and absence of genes encoding key chitinase enzymes and chitin adhesion factors (Fig. 1). The seven genes encoding putative endochitinases described by Hayes et al. [45] are listed, as well as two additional putative chitinases identified in this analysis. The figure was generated using Phandango [68]. Isolate assignments to lineages were taken verbatim from the literature [65, 89], and are named after Domman et al. [8]. Chromosomal location assigned to genes present in N16961. The presence of VC_A0027 in three genomes was confirmed manually, as described in the text. Colour coding of gbpA, chiA-1, chiA-2 and chiA-3 is consistent among figures in this paper.
chiA-2 is ubiquitous amongst diverse , but gbpA and chiA-1 are not
The first, and most striking, observation made from these data was that gbpA (VC_A0811) did not appear to be ubiquitous amongst all of the genomes included in this study. This was surprising because this gene had previously been reported to be ubiquitous amongst [70]. We found that gbpA was present in only 189 of 195 genomes (96.9 %; Fig. 2, Table S2). We manually inspected the genome assembly for each isolate which lacked gbpA, to guard against this being an artefact of the computational approach taken (Fig. S1).Three genomic arrangements were observed at this locus – the presence of an intact VC_A0811 locus as found in gbpA
+ genomes, a deletion of gbpA and two adjacent genes (VC_A0811-VC_A0813), and the replacement of these three genes with additional sequence in the genome of NCTC 30 (Fig. S1). In order to ensure that the VC_A0811-VC_A0813 genes were not present at a different position in the NCTC 30 genome, we mapped the Illumina short-reads for this isolate to the N16961 reference sequence and inspected the mapping coverage across this region. This confirmed that the absence of the genes VC_A0811–VC_A0813 from NCTC 30 was not a result of a mis-assembly (Fig. S2). The two genes adjacent to gbpA, VC_A0812 and VC_A0813, encode LapX and Lap, respectively. Both genes are putatively regulated by the HapR master quorum-sensing regulator, and encode proteins that were detected in an hapA mutant [71]. Both Lap and LapX were found to be putative components of the Type 2 secretome in N16961 [72], and lap has been used as a polymorphic locus in multilocus enzyme electrophoresis (MLEE) schemes for classifying [73, 74]. We were unable to find published evidence linking these genes to GbpA activity or chitin adhesion more generally, although we note that lap and lapX are oriented in the same direction as gbpA, and we cannot exclude the possibility that these three genes are co-regulated or co-transcribed.We found that VC_A0027 (encoding ChiA-2) was near-ubiquitous, initially being detected in 192/195 (Fig. 2; Table S2). Manual inspection of the assemblies for those three isolates which appeared to lack the gene confirmed that the majority of this gene was in fact present; assembly and resultant annotation errors were likely to be responsible for this result (data not shown). This suggests that VC_A0027 is core to (195/195 genomes). This is consistent with ChiA-2 being the most highly expressed chitinase enzyme in the species, and with the observation that deletion of this gene alone causes a significant growth defect on minimal media containing chitin as a sole carbon source [45].However, although VC_1952 (ChiA-1) was present in all pandemic isolates (defined as those isolates which were members of the 7PET and Classical lineages), it was not ubiquitous across the species, and was only identified in 61.2 % of the non-pandemic in this dataset (101/165; Fig. 2; Table S2). This observation was surprising, because both ChiA-1 and ChiA-2 have been shown to be necessary for N16961 to grow in media supplemented with colloidal chitin [24]. Keymer and colleagues previously observed, using microarray approaches, that some diverse environmental isolates of varied in terms of their VC_1952 genotype [75]. We propose that our data recapitulate this observation, albeit in silico. We manually examined the region surrounding the VC_1952 locus in a subset of the genome assemblies for isolates lacking this gene, and found both that the gene was absent in its entirety, and that this did not appear to affect the genes adjacent to chiA-1 (Figs S3 and S4). Moreover, the distribution of putative chitinases (Fig. 2) suggested that isolates lacking ChiA-1 may encode additional chitinases. Since ChiA-1 is known to have a functional role in chitin metabolism, this led us to speculate that these additional putative chitinases, if functional, might be able to provide chitinase activity in the absence of ChiA-1.
Identification and characterization of chiA-3
Eleven gene clusters in the pangenome included genes with the annotation ‘chitinase’ or ‘putative chitinase’ (Table S3). Five of these were found only in one genome, of which four were found only in the non- outgroup. Of the remaining six genes, four are known to be present in N16961 (Tables S2 and S3). On further examination, the products of one of the two gene clusters, ‘endo I_2’, were not predicted in silico to contain a chitinase domain, although a putative chitin-binding domain was identified (Fig. S5; Table S3). The second gene identified was predicted to encode a protein containing a chitinase domain (Fig. 3a). The molecular weight (47.69 kDa) and domain composition of the protein were distinct from those of chiA-2 and chiA-1 (Fig. 3a), as was the genomic context and location of the gene, which was inserted between VC_A0620 and VC_A0621 on chromosome 2 (Fig. 3b). This gene was therefore referred to as chiA-3, to differentiate it from the two previously described genes. chiA-3 was identified in 87 genomes, and was absent from all of the genomes belonging to both pandemic lineages included in this study. Additionally, 57 of the 67 isolates which lacked chiA-1 harboured chiA-3 (85.0%).
Fig. 3.
ChiA-3 is a protein distinct from ChiA-1 and ChiA-2. (a) Cartoons of the protein domains predicted to be present in each of ChiA-1, ChiA-2 and ChiA-3 are presented. Predicted molecular weights are indicated. Proteins are not to scale. (b) chiA-3 is integrated between VC_A0620 and VC_A0621 in the smaller chromosome. This genomic position is conserved in species which harbour chiA-3 orthologues [76, 79]. The figure was generated using Easyfig [61] and blastn comparisons [58].
ChiA-3 is a protein distinct from ChiA-1 and ChiA-2. (a) Cartoons of the protein domains predicted to be present in each of ChiA-1, ChiA-2 and ChiA-3 are presented. Predicted molecular weights are indicated. Proteins are not to scale. (b) chiA-3 is integrated between VC_A0620 and VC_A0621 in the smaller chromosome. This genomic position is conserved in species which harbour chiA-3 orthologues [76, 79]. The figure was generated using Easyfig [61] and blastn comparisons [58].In order to determine whether chiA-3 had been identified previously in other species, the gene was compared to the nine genes listed by Hunt et al. as chitinases found in non-cholera vibrios [20] (Table 2). The most similar protein to ChiA-3 (76.57 % aa identity) was that encoded by VPA1177 (chiA, accession no. BAC62520.1), found in strain RIMD 2210633 (Table 2) [76]. VPA1177 encodes a 430 aa protein (47.98 kDa) which previous genetic analyses have shown to make a minimal contribution to the ability of to degrade chitin; ChiA-2 (encoded by VPA0055, accession no. BAC61398.1) is the major protein responsible for this phenotype in [77]. Transcription of VPA1177 has been shown to be significantly reduced in the presence of chitin [77]; however, the VPA1177 protein has been shown to be expressed by , albeit at very low levels in culture supernatants [36].A previous report had also identified a functional secreted chitinase from of a similar molecular weight (47 kDa) to both VPA1177 and ChiA-3 [78]. The ATCC 33843 genome [79] contains a gene encoding a putative chitinase (blastp: 100 % query coverage, 77.73 % amino acid identity to ChiA-3; predicted molecular weight 48.0 kDa) in a similar genomic context on chromosome 2 to that of chiA-3 in NCTC 30 (Fig. 3b). This is distinct from the location of the functionally characterized chiA gene (LA59_20935) which encodes an 850 aa ChiA chitinase precursor (accession no. Q9AMP1 [37, 80, 81]), and from other functionally characterized β-N-acetylglucosaminidases [38]. This protein is also 90.9 % identical to VPA1177. As well as their high amino acid identity, each of these proteins was predicted to contain similar domain compositions and configurations across the three species (Fig. S6). It is reasonable to infer that these enzymes are orthologues of ChiA-3.Since VPA1177 has been shown to be transcribed [77] and to produce a translated protein in [36], we sought to determine whether the product of chiA-3 from had chitinase activity. We amplified the gene from the genome of NCTC 30, a non-pandemic lineage , and cloned it directionally into pBAD33 such that expression of the gene was governed by the arabinose-inducible PBAD promoter and the translated product linked to a C-terminal 6xHis tag, similar to previous reports [40, 45] (denoted pMJD157, Fig. S7).harbouring pMJD157 produced a His-tagged protein of the expected molecular weight that was retained in the cell pellet when cultured with arabinose at 23 °C (Fig. 4). We used a commercial fluorogenic assay for chitinase activity which relies on the hydrolysis of 4-methylumbelliferyl (4-MU) chitin analogues to detect chitinase activity. A similar assay has been used previously to assay chitinase activity in vibrios [78]. We found that samples from cultures expressing 6xHis-tagged ChiA-3 demonstrated statistically significant activity on 4-MU-linked substrates (Fig. 5). These data were consistent with the His-tagged protein detected in Fig. 4) (ChiA-3-6xHis) having endochitinase and chitobiosidase activities, but lacking β-N-acetylglucosaminidase activity. These results are also fully consistent with previous studies of chitinase enzyme activity when expressed in [48].
Fig. 4.
Molecular cloning of chiA-3 and expression of ChiA-3-6xHis. (a) InstantBlue-stained acrylamide gel of proteins present in supernatants and cell pellet lysates from cultures grown at 23 °C supplemented with arabinose (induction +) or glucose (induction -). No induced bands were easily discerned. (b) Western immunoblot produced from an identically loaded acrylamide gel to that presented in (a) run in parallel with the gel in (a) and probed with an α−6xHis antibody (see Methods). A band corresponding to the expected molecular weight of ChiA-3-6xHis (48.51 kDa) was only detected in the cell pellet lysate of harbouring pMJD157 (plasmid +). This size is consistent with the retention of the fusion protein without the cleavage of the putative signal sequence. Protein ladders: NEB #P7719S and #P7717S. EV=empty vector (pBAD33).
Fig. 5.
ChiA-3-6xHis displays chitobiosidase and endochitinase activities, but not β-N-acetylglucosaminidase activity. Lysates and supernatants included in Fig. 4(a, b) were assayed for chitinase enzyme activity using a fluorometric chitinase assay kit (see Methods for details). Lysed cells from cultures harbouring pMJD157 and cultured in the presence of arabinose were the only samples which produced detectable and statistically significant signals on triacetylchitotriose and chitobiose substrates (a, b). No signal was detected in the presence of glucosaminide substrate (c). All plots are scaled equivalently. P=pellet; S=supernatant; EV=empty vector (pBAD33). Parametric t-tests were performed where indicated, testing for statistically significant differences between cell pellets of both induced and non-induced cultures harbouring each plasmid (effect of arabinose induction on chitinase activity), as well as between pellets from induced cultures (effect of plasmid insert on activity). ns=not significant; ***P<0.001; ****P<0.0001.
Molecular cloning of chiA-3 and expression of ChiA-3-6xHis. (a) InstantBlue-stained acrylamide gel of proteins present in supernatants and cell pellet lysates from cultures grown at 23 °C supplemented with arabinose (induction +) or glucose (induction -). No induced bands were easily discerned. (b) Western immunoblot produced from an identically loaded acrylamide gel to that presented in (a) run in parallel with the gel in (a) and probed with an α−6xHis antibody (see Methods). A band corresponding to the expected molecular weight of ChiA-3-6xHis (48.51 kDa) was only detected in the cell pellet lysate of harbouring pMJD157 (plasmid +). This size is consistent with the retention of the fusion protein without the cleavage of the putative signal sequence. Protein ladders: NEB #P7719S and #P7717S. EV=empty vector (pBAD33).ChiA-3-6xHis displays chitobiosidase and endochitinase activities, but not β-N-acetylglucosaminidase activity. Lysates and supernatants included in Fig. 4(a, b) were assayed for chitinase enzyme activity using a fluorometric chitinase assay kit (see Methods for details). Lysed cells from cultures harbouring pMJD157 and cultured in the presence of arabinose were the only samples which produced detectable and statistically significant signals on triacetylchitotriose and chitobiose substrates (a, b). No signal was detected in the presence of glucosaminide substrate (c). All plots are scaled equivalently. P=pellet; S=supernatant; EV=empty vector (pBAD33). Parametric t-tests were performed where indicated, testing for statistically significant differences between cell pellets of both induced and non-induced cultures harbouring each plasmid (effect of arabinose induction on chitinase activity), as well as between pellets from induced cultures (effect of plasmid insert on activity). ns=not significant; ***P<0.001; ****P<0.0001.
Discussion
In this study, we present three major observations: first, that gbpA is not ubiquitous amongst ; second, we show that there is additional variability in the chitinase genes harboured by diverse , which show phylogenetic signals in their distribution; third, we present functional evidence that one of these putatively novel genes encodes a protein with chitinase activity.The fact that gbpA is not present in all is important, given that gbpA had previously been suggested to be a candidate diagnostic gene for the detection of [70, 82]. This was based both on the high level of conservation of gbpA amongst tested , and on the number of differences between gbpA in and alleles found in other species [70]. In addition to our results, others have noted that gbpA can be found in non-cholera vibrios and in non-pathogenic , suggesting that this makes gbpA an unreliable marker for quantitative study of [83].The biological consequences of the absence of gbpA from these bacteria is interesting to consider. As discussed previously, GbpA is an important factor in both environmental and pathogenic colonization. The fact that gbpA is absent from non-pandemic that appear to be basal to the rest of the species (Fig. 2) suggests that its role in pathogenicity may be more complex than previously thought. It might be that acquisition of gbpA by was an important step in its evolution as a human pathogen. Conversely, because several of the isolates in the lineage lacking gbpA are of clinical as well as environmental origin [8, 65, 84–86], including some which were isolated from cases of acute or ‘choleraic’ diarrhoea [65, 84, 85], it might be that gbpA may not be essential for pathogenic colonization. It remains to be seen whether the natural absence of gbpA affects the ability of such to colonize both the intestinal mucosa and chitinous surfaces. The roles played by other adhesins in these diverse , such as MSHA, should also be considered in the future.Although ChiA-3 orthologues have been examined in other vibrios, we believe that this is the first report of this gene in , and the first report that the V. cholerae chiA-3 gene encodes a functional chitinase. The fact that chiA-3 was found only in non-pandemic is also intriguing. It is not yet known whether non-pandemic harbouring chiA-3 can respire chitin as effectively, or more effectively, than N16961 or other laboratory strains. A limitation of our study is that the genome sequences available to us are derived from isolates that are biased towards clinical cases of disease. Thus, our study may underestimate the diversity within the species [87], as well as the numbers of chitinases present in the species. Future research should include sequencing the genomes of both clinical and non-clinical , to address this implicit bias in genomic datasets.There are fundamental differences between from pandemic and non-pandemic lineages, in terms of both their ability to cause cholera epidemics and their basic biology. We still do not fully understand these differences, but in order to do so, we must study pathogenicity in conjunction with more fundamental biological processes. It is currently unclear whether variation in the complements of chitinases and chitin-binding proteins encoded by have physiological consequences for different lineages of the species. However, given the importance of these genes to pathogenicity [30, 40], environmental lifestyles [24] and natural competence [29], it is plausible that these differences reflect differences in the ecological niches occupied by different lineages of the species. Research in this area may provide further insights into the genetic and biochemical differences between lineages that cause dramatically different patterns of disease worldwide. Collectively, these findings underline the fact that, as we continue to study diverse , our understanding of the nuance and specifics of this species will improve and be refined.Click here for additional data file.Click here for additional data file.
Authors: N A Daniels; L MacKinnon; R Bishop; S Altekruse; B Ray; R M Hammond; S Thompson; S Wilson; N H Bean; P M Griffin; L Slutsker Journal: J Infect Dis Date: 2000-05-15 Impact factor: 5.226
Authors: Alison M Devault; G Brian Golding; Nicholas Waglechner; Jacob M Enk; Melanie Kuch; Joseph H Tien; Mang Shi; David N Fisman; Anna N Dhody; Stephen Forrest; Kirsten I Bos; David J D Earn; Edward C Holmes; Hendrik N Poinar Journal: N Engl J Med Date: 2014-01-08 Impact factor: 91.245
Authors: Nur A Hasan; Seon Young Choi; Mark Eppinger; Philip W Clark; Arlene Chen; Munirul Alam; Bradd J Haley; Elisa Taviani; Erin Hine; Qi Su; Luke J Tallon; Joseph B Prosper; Keziah Furth; M M Hoq; Huai Li; Claire M Fraser-Liggett; Alejandro Cravioto; Anwar Huq; Jacques Ravel; Thomas A Cebula; Rita R Colwell Journal: Proc Natl Acad Sci U S A Date: 2012-06-18 Impact factor: 11.205
Authors: Aleksandra E Sikora; Ryszard A Zielke; Daniel A Lawrence; Philip C Andrews; Maria Sandkvist Journal: J Biol Chem Date: 2011-03-08 Impact factor: 5.157