Literature DB >> 25745421

Characterization of Vibrio parahaemolyticus clinical strains from Maryland (2012-2013) and comparisons to a locally and globally diverse V. parahaemolyticus strains by whole-genome sequence analysis.

Julie Haendiges1, Ruth Timme2, Marc W Allard2, Robert A Myers1, Eric W Brown2, Narjol Gonzalez-Escalona2.   

Abstract

Vibrio parahaemolyticus is the leading cause of foodborne illnesses in the US associated with the consumption of raw shellfish. Previous population studies of V. parahaemolyticus have used Multi-Locus Sequence Typing (MLST) or Pulsed Field Gel Electrophoresis (PFGE). Whole genome sequencing (WGS) provides a much higher level of resolution, but has been used to characterize only a few United States (US) clinical isolates. Here we report the WGS characterization of 34 genomes of V. parahaemolyticus strains that were isolated from clinical cases in the state of Maryland (MD) during 2 years (2012-2013). These 2 years saw an increase of V. parahaemolyticus cases compared to previous years. Among these MD isolates, 28% were negative for tdh and trh, 8% were tdh positive only, 11% were trh positive only, and 53% contained both genes. We compared this set of V. parahaemolyticus genomes to those of a collection of 17 archival strains from the US (10 previously sequenced strains and 7 from NCBI, collected between 1988 and 2004) and 15 international strains, isolated from geographically-diverse environmental and clinical sources (collected between 1980 and 2010). A WGS phylogenetic analysis of these strains revealed the regional outbreak strains from MD are highly diverse and yet genetically distinct from the international strains. Some MD strains caused outbreaks 2 years in a row, indicating a local source of contamination (e.g., ST631). Advances in WGS will enable this type of analysis to become routine, providing an excellent tool for improved surveillance. Databases built with phylogenetic data will help pinpoint sources of contamination in future outbreaks and contribute to faster outbreak control.

Entities:  

Keywords:  NGS; SNPs; Vibrio parahaemolyticus; WGS; clinical; phylogenetic analysis; phylogeny

Year:  2015        PMID: 25745421      PMCID: PMC4333860          DOI: 10.3389/fmicb.2015.00125

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


Introduction

Vibrio parahaemolyticus is a natural inhabitant of temperate and tropical coastal waters and is the leading cause of seafood-borne gastroenteritis in the United States (US) (Scallan et al., 2011). Cases of illness are usually associated with eating raw or undercooked seafood. Strains of V. parahaemolyticus carrying genes for thermostable direct hemolysin (tdh) and/or thermostable direct hemolysin-related hemolysin (trh) are considered pathogenic (Turner et al., 2013) and typically represent <1% of environmental V. parahaemolyticus strains (DePaola et al., 2000). However, this frequency may depend on the location, sample sources, and detection methods (Kaysner et al., 1990; Alam et al., 2002; Cook et al., 2002; Hervio-Heath et al., 2002; Martinez-Urtaza et al., 2008). During the last two decades V. parahaemolyticus infections and outbreaks have increased throughout the world. Most of these new cases belong to a pandemic clonal complex, known as CC3, first identified in in February of 1996 in India (Martinez-Urtaza et al., 2005; Nair et al., 2007; Gonzalez-Escalona et al., 2008; Haendiges et al., 2014). The emergence of CC3 caused public health concerns about the potential worldwide spread of virulent V. parahaemolyticus, which previously had been restricted to particular regions. Other clonal complexes of V. parahaemolyticus, specifically CC36 and CC34, have been observed among coastal US strains (Gonzalez-Escalona et al., 2008), and strains of Sequence Type (ST) 36, the ancestral type of CC36, have also been detected on the western coast of Canada (Banerjee et al., 2014). Although infections in the US are typically caused by strains from CC36, which is endemic to the West Coast (Abbott et al., 1989; Gonzalez-Escalona et al., 2008), a Maryland outbreak in August 2012 (Haendiges et al., 2014) was caused by strains belonging to CC3. Foodborne illnesses due to V. parahaemolyticus are uncommonly reported in the US and in the state of Maryland. Forty-six cases of V. parahaemolyticus gastroenteritis associated illnesses were reported between 2012 and 2013. From those cases, 34 strains were isolated. During the summer of 2012 a multistate outbreak associated with harvested shellfish was reported on the East Coast of the US (Newton et al., 2014), caused by V. parahaemolyticus strains belonging to CC36 (Martinez-Urtaza et al., 2013), subsequently shown to be ST36. That outbreak affected 28 persons in nine states (Newton et al., 2014), and the isolated strains were of the O4:K12 or O4:K (unknown) serotypes. Usually strains from this serotype/ST isolated in the US are from the West Coast (Gonzalez-Escalona et al., 2008; Newton et al., 2014). Another outbreak, involving 104 cases, occurred in the summer of 2013 affecting people in 13 US states and were caused by the same ST36 (Newton et al., 2014). An outbreak in Spain, during the summer of 2012, was attributed to strains that were ST36 as well (Martinez-Urtaza et al., 2013), associated with cooked seafood that had been cooled with untreated local seawater (Martinez-Urtaza et al., 2013); contrarily to the infections in the US which are often linked to the consumption of contaminated oysters (Newton et al., 2014). The numbers of V. parahaemolyticus infections appears to have increased approximately 4 times in US in the last decade (Mead et al., 1999; Scallan et al., 2011). Scallan et al. (2011) reported that the domestically acquired foodborne average for V. parahaemolyticus infections annually were around 35,000 in the period 2000–2008, while Mead et al. (1999) reported around 8000 infections annually in the period 1992–1997. The increasing number of infections could have detrimental impacts on public health and economic growth, particularly in regions where seafood harvest and consumption are important. However, active surveillance depends upon having effective ways to identify and monitor the nature or identity of the V. parahaemolyticus strains causing outbreaks. Various typing methods have been used to distinguish bacterial isolates for epidemiological investigations (Foxman et al., 2005). Pulsed Field Gel Electrophoresis (PFGE) has been a favored method for genotyping V. parahaemolyticus isolates (Marshall et al., 1999) and it is considered the “gold standard” for outbreak investigations (Parsons et al., 2007; Wagley et al., 2008; Dauros et al., 2011; Banerjee et al., 2014; Haendiges et al., 2014; Ma et al., 2014; Pazhani et al., 2014). While useful for short-term epidemiology, PFGE does not provide details of the genetic relationships among isolates (e.g., evolutionary relationships) (Foxman et al., 2005). Another common method for characterizing V. parahaemolyticus isolates is Multilocus Sequence Typing (MLST). This method is based on direct sequence analysis of housekeeping genes, making MLST better for long term evolutionary studies (Gonzalez-Escalona et al., 2008; Harth et al., 2009; Yan et al., 2011; Gavilan et al., 2013; Martinez-Urtaza et al., 2013; Turner et al., 2013). A public database was established to archive V. parahaemolyticus sequences (http://pubmlst.org/vparahaemolyticus) (Gonzalez-Escalona et al., 2008). In the last 5 years, scientists have begun using genomic techniques to analyze historical collections of pathogens and outbreak isolates, providing new insights for outbreak investigations. Whole genome sequencing (WGS) and WGS-SNP data analyses allow us to better understand population dynamics and mechanisms contributing to increased virulence among foodborne bacterial pathogens, including outbreaks of Salmonella Montevideo in 2010 (Bakker et al., 2011; Allard et al., 2012), Vibrio cholerae in Haiti in 2010 (Chin et al., 2011), E. coli O104:H4 (in Germany in 2011, Rasko et al., 2011) and Salmonella Enteritidis in the US in 2010 (Allard et al., 2013). In the current project, we utilize a similar approach to explore the diversity and relationships among V. parahaemolyticus isolates causing outbreaks in Maryland. Specifically, in order to better understand potential changes in V. parahaemolyticus populations in the state of Maryland and to investigate whether the spikes in cases during the summers of 2012 and 2013 were related to this recently introduced East Coast clone, ST36, we used WGS to compare genomes from those outbreaks with those of the other V. parahaemolyticus strains causing illnesses in recent years (2012–2013). We began by sequencing the 34 strains from MD, then 10 additional historical V. parahaemolyticus strains from different sources (clinical and environmental) from the East and West Coasts of US were also sequenced and used for phylogenetic comparative analysis. Additional available genomes from global V. parahaemolyticus strains were also used in the phylogenetic comparative analysis.

Materials and methods

Bacterial strains and media

The V. parahaemolyticus isolates sequenced for this project are listed, along with their assigned CFSAN numbers, in Tables 1, 2. Table 1 lists the 34 clinical isolates from Maryland, and Table 2 lists the national and international V. parahaemolyticus isolates used for comparison. All isolates were retrieved from storage (−80°C freezer), transferred to Luria-Bertani (LB) medium with 3% NaCl and incubated at 250 rpm at 37°C.
Table 1

Clinical .

Isolate nameAccession no(s) WGSCFSAN numberSTsCountyStateCollection dateSourcetdh, trh presenceaReferences
VP1JNSM01000000CFSAN007429631Anne ArundelMD6/15/2012stool2,1Haendiges et al., 2014
VP8JNSN01000000CFSAN007430631CecilMD7/12/2012stool2,1Haendiges et al., 2014
VP9JNSO01000000CFSAN007431631Queen Anne'sMD7/17/2012stool2,1Haendiges et al., 2014
VP31JNSP01000000CFSAN007432631HarfordMD6/16/2013stool2,1Haendiges et al., 2014
VP35JNSQ01000000CFSAN007433631Baltimore CityMD7/11/2013stool2,1Haendiges et al., 2014
VP41JNSR01000000CFSAN007434631UnknownMD7/17/2013stool2,1Haendiges et al., 2014
VP44JNSS01000000CFSAN007435631UnknownMD8/3/2013stool2,1Haendiges et al., 2014
VP45JNST01000000CFSAN007436631Anne ArundelMD8/9/2013stool2,1Haendiges et al., 2014
VP2JNSU01000000CFSAN007437651MontgomeryMD5/31/2012stool0,0Haendiges et al., 2014
VP4JNSW01000000CFSAN007439653Out of State-DEMD6/17/2012stool0,2Haendiges et al., 2014
VP34JNSX01000000CFSAN007440653Prince George'sMD7/11/2013stool0,2Haendiges et al., 2014
VP7JNSZ01000000CFSAN007442113CharlesMD7/10/2012stool2,1Haendiges et al., 2014
VP10JNTC01000000CFSAN00744543MontgomeryMD6/12/2012stool2,1Haendiges et al., 2014
VP16JNTG01000000CFSAN0074493Anne ArundelMD8/21/2012stool1 and 2, 0Haendiges et al., 2014
VP17JNTH01000000CFSAN0074503Anne ArundelMD8/22/2012stool1 and 2, 0Haendiges et al., 2014
VP18JNTI01000000CFSAN0074513Anne ArundelMD8/24/2012stool1 and 2, 0Haendiges et al., 2014
VP39JNTL00000000CFSAN007455896WicomicoMD7/19/2013stool0,0Haendiges et al., 2014
VP12JNTM00000000CFSAN00612936MontgomeryMD8/3/2012stool2,1Haendiges et al., 2014
VP32JNTN00000000CFSAN00613136Anne ArundelMD6/30/2013stool2,1Haendiges et al., 2014
VP33JNTO01000000CFSAN00613236HowardMD6/17/2013stool2,1Haendiges et al., 2014
VP36JNTP01000000CFSAN00613336Baltimore CountyMD7/5/2013stool2,1Haendiges et al., 2014
VP38JNTQ00000000CFSAN00613436Baltimore CityMD7/16/2013stool2,1Haendiges et al., 2014
VP40JNTR00000000CFSAN00613536Baltimore CountyMD7/21/2013stool2,1Haendiges et al., 2014
VP42JNTS00000000CFSAN00746036Baltimore CityMD8/7/2013stool2,1Haendiges et al., 2014
VP43JNTT00000000CFSAN00746136TalbotMD7/31/2013stool2,1Haendiges et al., 2014
VP30JNTV00000000CFSAN00613036Pringe George'sMD6/2/2013stool2,1Haendiges et al., 2014
VP46JNTU00000000CFSAN00746236Pringe George'sMD8/27/2013stool2,1Haendiges et al., 2014
VP5*JNSY01000000CFSAN007441113CalvertMD6/5/2012wound0,0Haendiges et al., 2014
VP13*JNTD01000000CFSAN007446678CharlesMD8/5/2012wound0,0Haendiges et al., 2014
VP6*JNTB01000000CFSAN007444677Baltimore CityMD6/25/2012wound0,0Haendiges et al., 2014
VP15*JNTF01000000CFSAN007448679Out of State-PAMD8/3/2012wound0,0Haendiges et al., 2014
VP3*JNSV01000000CFSAN007438652Anne ArundelMD6/8/2012wound0,0Haendiges et al., 2014
VP11*JNTA01000000CFSAN007443113SomersetMD7/23/2012ear0,0Haendiges et al., 2014
VP14*JNTE01000000CFSAN007447162WicomicoMD8/10/2012ear0,0Haendiges et al., 2014

The isolates were organized by their isolation source.

type of tdh or trh gene present in that strain by in silico reference mapping using CLC Genomics workbench. Example: 2,1 means tdh2 and trh1 types, respectively, and 0 means not present.

Strains isolated from wound or ear.

Table 2

Locally and globally diverse .

Isolate nameAccession no(s) WGSCFSAN numberSTsStateCountryIsolation yearSourceReferences
029-1(b)JNTW00000000CFSAN00161136ORUSA1997EHaendiges et al., 2014
48057JNTX00000000CFSAN00161236WAUSA1990CHaendiges et al., 2014
K1198JNTY01000000CFSAN00161459AKUSA2004EHaendiges et al., 2014
10292JNTZ00000000CFSAN00161750WAUSA1997CHaendiges et al., 2014
48291JNUA00000000CFSAN00161836WAUSA1990CHaendiges et al., 2014
F11-3AJNUB00000000CFSAN00161936WAUSA1988EHaendiges et al., 2014
NY-3483JNUC00000000CFSAN00162036NYUSA1998EHaendiges et al., 2014
K1203JNUD00000000CFSAN00117359AKUSA2004EHaendiges et al., 2014
98-513-F52JNUE00000000CFSAN00116034LAUSA1998EHaendiges et al., 2014
10290JNUF00000000CFSAN00161336WAUSA1997CHaendiges et al., 2014
10329NZ_AFBW01000000N/A36WAUSA1998CGonzalez-Escalona et al., 2011
RIMD 2210633NC_004603, NC_004605N/A3OsakaJapan1996CMakino et al., 2003
BB22OPCP003973.1, CP003972.1N/A88?Bangladesh1980EJensen et al., 2013
AN-5034ACFO00000000N/A3?Bangladesh1998?Chen et al., 2011
AQ3810AAWQ00000000N/A87?Singapore1983CUnpublished
AQ4037ACFN00000000N/A96?Maldives1985?Chen et al., 2011
K5030ACKB00000000N/A3??2005?Chen et al., 2011
PCV08-7AOCL00000000N/A808SelangorMalaysia2008ETiruvayipati et al., 2013
Peru-466ACFM00000000N/A3?Peru1996?Chen et al., 2011
SNUVpS-1AMRZ00000000N/A917?Korea2009EJun et al., 2013
V110AQPJ00000000N/A809?China2010ELiu and Chen, 2013
3259AVOL01000000N/A479?USA2007CUnpublished
949AVPV01000000N/A3?USA2006CUnpublished
NIHCB0603AVOM00000000N/A3?Bangladesh2006CUnpublished
NIHCB0757AVPX01000000N/A65?Bangladesh2006CUnpublished
VP-NY4AVON01000000N/A3?India1997CUnpublished
VP2007-095NZ_AVOI01000000N/A631FLUSA2007CUnpublished
VP232NZ_AVOJ01000000N/A3?India1998CUnpublished
VP250NZ_AVOK01000000N/A3?India1998CUnpublished
VPCR-2010NZ_AVPW01000000N/A308?USA2010EUnpublished
12310AYXP00000000N/A36WAUSA2006CUnpublished
3256AZGS00000000N/A36WAUSA2007CUnpublished

N/A, not applicable; ?, unknown.

Clinical . The isolates were organized by their isolation source. type of tdh or trh gene present in that strain by in silico reference mapping using CLC Genomics workbench. Example: 2,1 means tdh2 and trh1 types, respectively, and 0 means not present. Strains isolated from wound or ear. Locally and globally diverse . N/A, not applicable; ?, unknown.

DNA extraction and quantification

Genomic DNA from each isolate was isolated from overnight cultures using the DNeasy Blood and Tissue Kit (QIAGEN, Valencia, CA). The quality of the DNA was checked using a NanoDrop 1000 (Thermo Scientific, Rockford, IL) and the concentration was determined using a Qubit double-stranded DNA HS assay kit and a Qubit 2.0 fluorometer (Life Technologies, Grand Island, NY), according to each manufacturer's instructions.

Whole genome sequencing, contig assembly, and annotation

The genomes of the historical isolates were sequenced using 200 bp reads chemistry and the 36 MD outbreak isolates were sequenced using 300 bp reads chemistry, using an Ion Torrent (Thermo Scientific) sequencer, according to manufacturer's instructions, at approximately 20X coverage (Haendiges et al., 2014). Genomic sequence contigs were de novo assembled using CLC Genomics Workbench (QIAGEN). These draft genomes were annotated using the NCBI Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP, http://www.ncbi.nlm.nih.gov/genomes/static/Pipeline.html) (Klimke et al., 2009).

In silico MLST phylogenetic analysis

The initial analysis and identification of the isolates was performed using an in silico V. parahaemolyticus MLST approach, based on information available at the V. parahaemolyticus MLST website (http://pubmlst.org/vparahaemolyticus). Seven loci (dnaE, gyrB, recA, dtdS, pntA, pyrC, and tnaA), described previously for V. parahaemolyticus (Gonzalez-Escalona et al., 2008), were used for MLST analysis. The same V. parahaemolyticus MLST database was also used to assign numbers for alleles and STs.

Assignment to clonal complexes

The program goeBURST v1.2.1 was used to identify CCs among the isolates (http://goeburst.phyloviz.net) (Francisco et al., 2009). To qualify as members of the same CC, isolates must share at least 5 of the 7 alleles used in the V. parahaemolyticus MLST scheme (Feil et al., 2004). Those STs that differ by two alleles are considered double locus variants (DLV), and those which differ at only a single locus are single locus variants (SLV).

Phylogenomic analysis and targeted SNP analysis

We performed two phylogenetic analyses using the software kSNP v2 (Gardner and Hall, 2013) (kmer length = 31). The first one included a broad SNP analysis of all the genomes included in the analysis, including local, national and international isolates, and was performed to provide an evolutionary context for all the V. parahaemolyticus isolates used for the analysis. The second was conducted to provide a more detailed phylogenetic relationship among the isolates found in each individual cluster and were targeted for more refined SNP analyses. Our goal was to recover variations unique to each respective lineage. To do this, we created a matrix that included only core SNPs (i.e., SNPs that were present in all genomes) for each lineage, using the reference-free SNP- finding program, kSNP v2 (Gardner and Hall, 2013) (kmer length = 31). Maximum-likelihood phylogenies for each core SNP matrix were then constructed within the kSNP, labeled with the number of unique SNPs present in every descendant of that node. Draft assemblies were used to detect SNPs using the program, kSNP. Filtering was applied to eliminate detectable horizontal gene transfer such as mobile elements and paralogs, but we were not able to filter for genome recombination. Although important, reconstructing the reticulate evolutionary history of V. parahaemolyticus was outside the scope of this research effort. Using the kSNP approach offers several advantages. First, the kSNP approach to gathering reference-free SNPs for downstream phylogenetic analysis has been effective for microbial-scale phylogenomic analyses (Sahl et al., 2013; Timme et al., 2013), Second, these kSNP analyses are not as vulnerable to assembly errors as other methods because the putative SNPs are extracted from kmers (in our present analysis, one SNP per 25 mer), effectively eliminating any influence of assembly bias. Third, the algorithm behind SNP identification is especially conservative about common pyrosequencing errors, such as the homopolymeric runs commonly observed in data from 454 and Ion Torrent sequencing procedures: if an insertion or deletion is present in a region, the resulting 25-mer will no longer match the other homologous kmer regions, thus that SNP will be missing in that taxon. Additionally, kSNP omits kmers where all SNPs are in the center of a homopolymer repeat (Gardner and Slezak, 2010). These features make our approach conservative and prevent repeat regions from becoming a factor in our analyses: any k-mers comprising such regions will fail to be unique, and therefore will not be used.

Nucleotide sequence accession numbers

The draft genome sequences for all 46 V. parahaemolyticus strains in our analyses are available in GenBank under the accession numbers listed in Tables 1, 2.

Results and discussion

Maryland V. parahaemolyticus outbreak strains and in silico MLST

Our MLST analyses provide a broad look at the diversity of V. parahaemolyticus clinical samples collected during 2010–2013 in the state of Maryland. These 34 V. parahaemolyticus strains isolated from patients—stool (80%), ear (6%), and wound (14%) were sequenced using Ion Torrent. A quarter of these strains (25%) came from an outbreak in the summer of 2013. All the reported cases occurred during the warmer months between June and August (Table 1). In silico MLST (Gonzalez-Escalona et al., 2008) identified 13 different STs, showing that the sporadic outbreaks in Maryland were caused by highly diverse V. parahaemolyticus strains. Seven STs were novel (STs: 651, 652, 653, 677, 678, 679, and 896) and represented strains that have only been found (so far) in Maryland (Table 1). The most prevalent STs were ST36 (28%), ST631 (22%), ST3 (8%), ST113 (8%), and ST653 (6%). The rest of the STs were represented by a single strain. Some of the identified STs were isolated from more than one location, more than one person, and in different years (Table 1 and Figure 1); most of the illnesses associated with these isolates occurred in MD coastal counties (Figure 1). Interestingly, most of the cases on 2013 (88%) were caused by ST631 (31%) and ST36 (56%). This suggests these two STs are either more prevalent and/or have replaced the endemic V. parahaemolyticus populations in this region. Additional surveys of pathogenic V. parahaemolyticus in MD could confirm or clarify our theories.
Figure 1

Sequence types (STs) isolation locations on the state of Maryland, USA. STs found in more than one location are in red fonts.

Sequence types (STs) isolation locations on the state of Maryland, USA. STs found in more than one location are in red fonts. Our eBURST analysis (goeBURST) showed that the MD isolates do not form any clonal complexes and were not related to each other (Figure 2). Therefore, the outbreaks during this period in MD were caused by V. parahaemolyticus belonging to different populations. However, some findings suggested we had reached the limits of the MLST method: the ST36 observed in MD could not be distinguished from ST36 from the West Coast by this method. It is at this point we must turn to the higher discriminatory power of WGS analyses.
Figure 2

goeBURST population snapshot using the . Font sizes varied according to the frequency of the ST.

goeBURST population snapshot using the . Font sizes varied according to the frequency of the ST.

Phylogenomic analysis

Using 7 V. parahaemolyticus genomes available from NCBI (www.ncbi.nlm.nih.gov/genome/?term=vibrio parahaemolyticus) and the 10 genomes from historical US V. parahaemolyticus strains (Table 2), we determined the phylogenetic relationships among the MD strains using a whole-genome SNP analysis (Figure 3). As can be seen from Figure 3, V. parahaemolyticus is a highly diverse microorganism, documented by the high number of SNPs defining each branch, and a low-resolution tree is obtained when using this entire genome database. Nevertheless, a similar clustering to the one achieved by MLST analysis can still be observed for the Maryland V. parahaemolyticus strains, where the same STs were also grouping at the genome level. Five clusters of related strains were identified, cluster I (ST36), cluster II (ST653), cluster III (ST113), cluster IV (ST631), and cluster V (ST3).
Figure 3

Phylogenetic clustering analysis of the outbreak strains of . A k-SNP (Timme et al., 2013) analysis was performed. The maximum-likelihood tree shows V. parahaemolyticus MD outbreak isolates (in blue fonts), historical US strains sequenced in this study (in red fonts) and other related but unassociated V. parahaemolyticus strains (retrieved from NCBI, black fonts). Maximum-likelihood phylogeny was constructed from a 46,963 SNP matrix. Samples are annotated as follows: strain name, sequence type (ST), source (C, clinical; E, environmental), country of isolation (B, Bangladesh; CN, China; M, Malaysia; Mal, Maldives; USA, United States of America; Sing, Singapore; I, India, P, Peru; and K, Korea), and year when the samples were collected. All gene alignments of the SNPs observed in the entire dataset as well as in each individual cluster are available on request from the authors.

Phylogenetic clustering analysis of the outbreak strains of . A k-SNP (Timme et al., 2013) analysis was performed. The maximum-likelihood tree shows V. parahaemolyticus MD outbreak isolates (in blue fonts), historical US strains sequenced in this study (in red fonts) and other related but unassociated V. parahaemolyticus strains (retrieved from NCBI, black fonts). Maximum-likelihood phylogeny was constructed from a 46,963 SNP matrix. Samples are annotated as follows: strain name, sequence type (ST), source (C, clinical; E, environmental), country of isolation (B, Bangladesh; CN, China; M, Malaysia; Mal, Maldives; USA, United States of America; Sing, Singapore; I, India, P, Peru; and K, Korea), and year when the samples were collected. All gene alignments of the SNPs observed in the entire dataset as well as in each individual cluster are available on request from the authors. Besides Maryland ST36 strains (which appear in blue text on the Figure 3), Cluster I also contained strains that were isolated on the West Coast (1988–2004) (represented by the red text in Figure 3) and are part of CC36 (Gonzalez-Escalona et al., 2008). In this broad phylogeny, little differentiation can be observed among these ST36 strains. The same can be observed for most other clusters. But this broad phylogeny does allow us to confirm the diversity of populations causing the outbreaks in MD during that 2 year period. Given these patterns, we then used a cluster-based analysis that enabled us to identify relationships among strains that would not have been found by examining the entire dataset (Figure 4). Each individual cluster (when more than three strains) was analyzed separately. This type of analysis helped us to clearly distinguish strains in ST36 from MD (2012–2013) from those originating on the West Coast (1988–1998). These groups differed by at least 150 SNPs (Figure 4A). Interestingly, MD strains were similar to ST36, also from the West Coast, of US but from a later time point: 2006–2007 (strains 12310 and 3256). This clearly indicates that ST36 strains have evolved in the last decade and the new ST36 strains from the West Coast are more similar to the ST36 from MD (2012–2013) than the historical ST36 from the West Coast. Also, it seems that the outbreak in 2013 in MD of ST36 was also composed by strains from at least three different subgroups within lineage ST36. One subgroup contained the strain responsible for the cases in 2012 and most 2013 cases (VP43, 12, 42, 33, 36, 40, and 38), and the other two subgroups were composed of one (VP32) and two strains (VP30 and VP46). These differences in subgroups within lineage ST36 could have arisen from adaptations to surviving in different oysters/habitats and/or to the introduction of new variants.
Figure 4

Maximum-likelihood trees for each individual . Maximum-likelihood phylogeny was constructed from a 347, 931, and 88 core SNP matrix for clusters I, V, and VI, respectively. (A) Cluster I, (B) cluster V, and (C) cluster VI. Samples are annotated as in Figure 3. All gene alignments of the SNPs observed in the entire dataset as well as in each individual cluster are available on request from the authors.

Maximum-likelihood trees for each individual . Maximum-likelihood phylogeny was constructed from a 347, 931, and 88 core SNP matrix for clusters I, V, and VI, respectively. (A) Cluster I, (B) cluster V, and (C) cluster VI. Samples are annotated as in Figure 3. All gene alignments of the SNPs observed in the entire dataset as well as in each individual cluster are available on request from the authors. This is why WGS-SNP analyses provide clearer insights than earlier methods: many of these strains were unable to be distinguished from one another by MLST and PFGE analysis. The general understanding of V. parahaemolyticus evolution suggests that the typical mechanism is recombination, with a recombination ratio estimated at 2.5:1 and 8.8:1 by allele and site, respectively (Gonzalez-Escalona et al., 2008). Example of this are observed among strains belonging to CC345, where all known variants (SLV) of the ancestral ST type of the CC (ST345) have arisen by recombination, mainly in the recA allele (González-Escalona et al., in press). Cluster II and Cluster III were composed by two and three strains each, respectively; no other genomes have been reported for those STs, therefore we could not use a targeted SNP analysis to further analyze those strains. Nevertheless, these clustered with other US strains (Figure 3) indicating that they are probably strains endemic to the US. Strains belonging to ST113 were isolated from wound, ear, and stool in MD, suggesting that Clusters II and III are composed of local strains. ST113 was first described for a V. parahaemolyticus strain isolated from oysters in Louisiana state in 2007 (Johnson et al., 2009). The second most numerous ST observed among the MD outbreak strains was Cluster IV (ST631): these strains were highly similar, suggesting that they came from the same region 2 years in a row. In our analyses, the Cluster IV MD strains also clustered with a strain isolated in FL in 2007 (Figure 4B). If we fit this data together with the fact that ST631 had initially been reported through a strain isolated in Prince Edward Island (Canada) in 2009 (Banerjee et al., 2014), we have reason to believe that ST631 is a regional ST, endemic to the East Coast. Finally, Cluster V (ST3), included the MD 3 strains recovered from an outbreak that caused six illnesses during the summer of 2012 (Haendiges et al., 2014). These strains had previously been characterized by Haendiges et al. (2014), using a different phylogenetic tool (BIGSdb genome comparator tool). Due to homopolymer errors (inherent to Ion Torrent sequencing), no further resolution could be observed among strains belonging to Cluster V, so we could not determine whether the strains causing the outbreak were the same or not. The other analyses performed in our study were not impacted by such errors, since the miscalls were pruned from the analysis and only true high quality SNPs were included. With this targeted SNP based analysis, these strains were undistinguishable and can be clearly separated from other ST3 strain which genome were available at NCBI by at least 57 unique SNPs and can conclusively be called as being undistinguishable and probably the same strain and coming from the same source (Figure 4C). Therefore, this targeted SNP analysis can be used for outbreak detection caused by a specific V. parahaemolyticus strain.

Presence and absence of tdh and trh genes and their relation to etiology

There are five different subtypes of tdh genes that differ by sequence (tdh1 through tdh5); these subtypes share 96–98% identity (Nishibuchi and Kaper, 1990). In the case of trh genes there are two recognized subtypes (trh1 and 2), which share 84% identity (Kishishita et al., 1992). Most of the clinical MD V. parahaemolyticus analyzed carried tdh and/or trh genes (72%) (Table 1). Ten isolates (28%) were negative for tdh and trh, three (8%) were only tdh positive, four (11%) were only trh positive, and the remaining 19 (53%) isolates contained both genes. It was unusual to find that 28% of these clinical V. parahaemolyticus did not carry tdh or trh, since these strains are usually considered non-pathogenic (Nishibuchi and Kaper, 1995; Blackstone et al., 2003; DePaola et al., 2003). There have been some recent reports of V. parahaemolyticus without either of these two pathogenicity markers causing sporadic cases of illness (Jones et al., 2012; Banerjee et al., 2014) and having these 10 genomes may help researchers discover what makes these strains pathogenic. Jones et al. (2012) reported that 27% of their clinical isolates collected across US from July 2006 to November 2007 were tdh/trh negative while Banerjee et al. (2014) reported the same genotype for 4% of the clinical V. parahaemolyticus collected in Canada during 2000–2009. Furthermore, the majority of the tdh/trh negative strains (70%) were isolated from either wound or ear infections (Table 1). This further indicates that these two markers may not be necessary for human illness to occur (Broberg et al., 2011), although it is possible that these patients had some kind of immune deficiencies that could make them more prone to infections (chronic liver disease), as observed for V. vulnificus (Kim et al., 2014). The actual percentage of tdh/trh negative clinical strains is currently hard to estimate and might depend on several factors including sample size, V. parahaemolyticus endemic populations, type of infection, as well as errors caused by sporadic outbreaks that often go unnoticed or underreported (Scallan et al., 2011). Most of the tdh and/or trh positive strains carried tdh2 and trh1 combination. Most clinical US strains carry both tdh/trh (Okuda et al., 1997a; Jones et al., 2012; Paranjpye et al., 2012; Turner et al., 2013) while pandemic strains carried tdh1 and tdh2 (Okuda et al., 1997b; Matsumoto et al., 2000; Makino et al., 2003; Gonzalez-Escalona et al., 2005). Among the MD V. parahaemolyticus strains, only three were both tdh1 and two positive and these were identified as belonging to the pandemic clonal complex and were ST3 (Haendiges et al., 2014). Only two MD strains carried exclusively trh2. A study of environmental V. parahaemolyticus in Norway also identified strains containing only the trh gene that were also trh2, however those strains belonged different STs than the one observed in this study (Ellingsen et al., 2013). The availability of these genomes would help in future analysis in determining which other genes besides tdh and trh are necessary for pathogenesis in this marine bacterium.

Distribution of T3SS and other genomic regions in MD V. parahaemolyticus clinical strains

We performed an in silico analysis of each strain's genome using CLC Genome Workbench software (QIAGEN) for 24 known regions described for pandemic strains, including both T3SS, T3SS1 and 2α (Boyd et al., 2008) and well as for T3SS2β (Park et al., 2000), containing the ure genes and trh gene and described previously for strains carrying both tdh and trh genes (Supplementary Table 1). All strains carried NK, T3SS-1, Osmotolerance (chromosome I), Gametolysin, Osmotolerance (chromosome II), CPS, Type I secretion, Type I pilus, Multidrug efflux, and Ferric uptake. Most strains lacked all the pathogenicity islands described for the pandemic strain RIMD2210633 (VPaI1-7). Four other MD strains carried either VpaI1 (VP10) or VpaI2 (VP2, and VP14), although carrying a sequence different from that of the pandemic strain. Of the nine MD strains lacking tdh or trh gene, six carried a T6SS different from the pandemic, for the presence/absent of the other elements or PIs were very similar to the strains carrying tdh, trh, or tdh/trh strains. Seven of these strains were isolated from wound or ear infections. Since tdh/trh and T3SS2s were described as needed for producing infections at the intestinal level [because tdh/trh causes intestinal fluid secretion as well as cytotoxicity in a variety of cell types (Raimondi et al., 2000) while TTSS2 has a role in enterotoxicity (Park et al., 2004)], then these strains could have caused infections by a rather opportunistic way than by an actual infection as observed for gastroenteritis. These strains still possess TSS1 which is known for being involved in the cytotoxicity (Park et al., 2004). A more detailed study could reveal additional elements we did were not able to assess in this study; such analyses will certainly shed more light on the infection mechanisms of these unusual V. parahaemolyticus strains.

Conclusions

Our study used WGS to provide a high resolution investigation of genetic diversity and relationships of clinical V. parahaemolyticus strains isolated in the state of Maryland during 2012–2013 period in comparison to those isolated from other geographical locations. Having an archive of V. parahaemolyticus genomes allowed us to perform a phylogenetic analysis of strains and revealed that strains causing outbreaks in MD were highly diverse, genetically distinct, and clearly different from other regional and global strains. Despite the geographic breadth of our strains, we are aware that our study captured only a fraction of the actual number of V. parahaemolyticus strains causing illness in the state of Maryland, due in part to the sporadic nature of outbreaks, the low number of strains recovered per outbreak, and the fact that many outbreaks are never investigated (Scallan et al., 2011). We suggest using a similar approach during future outbreaks to analyze the identity of suspect V. parahaemolyticus strains and their relationship to strains causing previous outbreaks. Our WGS analysis of V. parahaemolyticus strains highlights the need for a genome database for V. parahaemolyticus to improve traceback and responses to outbreaks. We have created a new BioProject (PRJNA245882) at NCBI, spearheaded by FDA-CFSAN and the state of Maryland's DHMH, in which we are going to continually deposit new available genomes of V. parahaemolyticus causing outbreaks that would improve detection of new outbreak strains, track the emergence of new clonal strains in geographical regions where these strains are not endemic, and will be used in source tracking. We suggest and encourage the scientific community to deposit further V. parahaemolyticus genomes from their respective countries in centralize databases, such as NCBI, in order to improve source tracking.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  65 in total

1.  Genetic characterization of DNA region containing the trh and ure genes of Vibrio parahaemolyticus.

Authors:  K S Park; T Iida; Y Yamaichi; T Oyagi; K Yamamoto; T Honda
Journal:  Infect Immun       Date:  2000-10       Impact factor: 3.441

2.  Analysis of the thermostable direct hemolysin (tdh) gene and the tdh-related hemolysin (trh) genes in urease-positive strains of Vibrio parahaemolyticus isolated on the West Coast of the United States.

Authors:  J Okuda; M Ishibashi; S L Abbott; J M Janda; M Nishibuchi
Journal:  J Clin Microbiol       Date:  1997-08       Impact factor: 5.948

3.  Occurrence of pathogenic vibrios in coastal areas of France.

Authors:  D Hervio-Heath; R R Colwell; A Derrien; A Robert-Pillot; J M Fournier; M Pommepuy
Journal:  J Appl Microbiol       Date:  2002       Impact factor: 3.772

4.  Comparison of V. parahaemolyticus isolated from seafoods and cases of gastrointestinal disease in the UK.

Authors:  Sariqa Wagley; Kegakilwe Koofhethile; James B Wing; Rachel Rangdale
Journal:  Int J Environ Health Res       Date:  2008-08       Impact factor: 3.411

5.  Incidence of urea-hydrolyzing Vibrio parahaemolyticus in Willapa Bay, Washington.

Authors:  C A Kaysner; C Abeyta; R F Stott; J L Lilja; M M Wekell
Journal:  Appl Environ Microbiol       Date:  1990-04       Impact factor: 4.792

6.  Survival of Vibrio vulnificus genotypes in male and female serum, and production of siderophores in human serum and seawater.

Authors:  Hye-young Kim; Mesrop Ayrapetyan; James D Oliver
Journal:  Foodborne Pathog Dis       Date:  2013-10-26       Impact factor: 3.171

7.  Pandemic Vibrio parahaemolyticus O3:K6, Europe.

Authors:  Jaime Martinez-Urtaza; Lourdes Simental; David Velasco; Angelo DePaola; Masanori Ishibashi; Yoshitsugu Nakaguchi; Mitsuaki Nishibuchi; Dolores Carrera-Flores; Carmen Rey-Alvarez; Anxela Pousa
Journal:  Emerg Infect Dis       Date:  2005-08       Impact factor: 6.883

8.  Phylogenetic diversity of the enteric pathogen Salmonella enterica subsp. enterica inferred from genome-wide reference-free SNP characters.

Authors:  Ruth E Timme; James B Pettengill; Marc W Allard; Errol Strain; Rodolphe Barrangou; Chris Wehnes; Joann S Van Kessel; Jeffrey S Karns; Steven M Musser; Eric W Brown
Journal:  Genome Biol Evol       Date:  2013       Impact factor: 3.416

9.  Genome anatomy of the gastrointestinal pathogen, Vibrio parahaemolyticus of crustacean origin.

Authors:  Suma Tiruvayipati; Subha Bhassu; Narender Kumar; Ramani Baddam; Sabiha Shaik; Anil Kumar Gurindapalli; Kwai Lin Thong; Niyaz Ahmed
Journal:  Gut Pathog       Date:  2013-12-11       Impact factor: 4.181

10.  Population structure of clinical and environmental Vibrio parahaemolyticus from the Pacific Northwest coast of the United States.

Authors:  Jeffrey W Turner; Rohinee N Paranjpye; Eric D Landis; Stanley V Biryukov; Narjol González-Escalona; William B Nilsson; Mark S Strom
Journal:  PLoS One       Date:  2013-02-07       Impact factor: 3.240

View more
  16 in total

1.  Distribution of phenotypic and genotypic antimicrobial resistance and virulence genes in Vibrio parahaemolyticus isolated from cultivated oysters and estuarine water.

Authors:  Saharuetai Jeamsripong; Winn Khant; Rungtip Chuanchuen
Journal:  FEMS Microbiol Ecol       Date:  2020-08-01       Impact factor: 4.194

2.  blaNDM-1-producing Vibrio parahaemolyticus and V. vulnificus isolated from recreational beaches in Lagos, Nigeria.

Authors:  Abolade A Oyelade; Olawale Olufemi Adelowo; Obasola Ezekiel Fagade
Journal:  Environ Sci Pollut Res Int       Date:  2018-09-29       Impact factor: 4.223

3.  Rapid biosynthesized AgNPs from Gelidiella acerosa aqueous extract mitigates quorum sensing mediated biofilm formation of Vibrio species-an in vitro and in vivo approach.

Authors:  Lakkakula Satish; Sivasubramanian Santhakumari; Shanmugaraj Gowrishankar; Shunmugiah Karutha Pandian; Arumugam Veera Ravi; Manikandan Ramesh
Journal:  Environ Sci Pollut Res Int       Date:  2017-09-30       Impact factor: 4.223

4.  Phylogenetic and Biogeographic Patterns of Vibrio parahaemolyticus Strains from North America Inferred from Whole-Genome Sequence Data.

Authors:  John J Miller; Bart C Weimer; Ruth Timme; Catharina H M Lüdeke; James B Pettengill; DJ Darwin Bandoy; Allison M Weis; James Kaufman; B Carol Huang; Justin Payne; Errol Strain; Jessica L Jones
Journal:  Appl Environ Microbiol       Date:  2021-01-15       Impact factor: 4.792

5.  Parallel Evolution of Two Clades of an Atlantic-Endemic Pathogenic Lineage of Vibrio parahaemolyticus by Independent Acquisition of Related Pathogenicity Islands.

Authors:  Feng Xu; Narjol Gonzalez-Escalona; Kevin P Drees; Robert P Sebra; Vaughn S Cooper; Stephen H Jones; Cheryl A Whistler
Journal:  Appl Environ Microbiol       Date:  2017-08-31       Impact factor: 4.792

6.  A Nonautochthonous U.S. Strain of Vibrio parahaemolyticus Isolated from Chesapeake Bay Oysters Caused the Outbreak in Maryland in 2010.

Authors:  Julie Haendiges; Jessica Jones; Robert A Myers; Clifford S Mitchell; Erin Butler; Magaly Toro; Narjol Gonzalez-Escalona
Journal:  Appl Environ Microbiol       Date:  2016-05-16       Impact factor: 4.792

7.  A Comprehensive Epidemiological Research for Clinical Vibrio parahaemolyticus in Shanghai.

Authors:  Huan Li; Rong Tang; Yang Lou; Zelin Cui; Wenjing Chen; Qing Hong; Zhaohuan Zhang; Pradeep K Malakar; Yingjie Pan; Yong Zhao
Journal:  Front Microbiol       Date:  2017-06-08       Impact factor: 5.640

8.  Sero-Prevalence and Genetic Diversity of Pandemic V. parahaemolyticus Strains Occurring at a Global Scale.

Authors:  Chongxu Han; Hui Tang; Chuanli Ren; Xiaoping Zhu; Dongsheng Han
Journal:  Front Microbiol       Date:  2016-04-22       Impact factor: 5.640

9.  Outbreak of Vibrio parahaemolyticus Sequence Type 120, Peru, 2009.

Authors:  Narjol Gonzalez-Escalona; Ronnie G Gavilan; Magaly Toro; Maria L Zamudio; Jaime Martinez-Urtaza
Journal:  Emerg Infect Dis       Date:  2016-07       Impact factor: 6.883

10.  Spreading of Pandemic Vibrio parahaemolyticus O3:K6 and Its Serovariants: A Re-analysis of Strains Isolated from Multiple Studies.

Authors:  Dongsheng Han; Fei Yu; Hui Tang; Chuanli Ren; Caiyun Wu; Pan Zhang; Chongxu Han
Journal:  Front Cell Infect Microbiol       Date:  2017-05-18       Impact factor: 5.293

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.