Literature DB >> 21205308

Comparative genomic analysis reveals significant enrichment of mobile genetic elements and genes encoding surface structure-proteins in hospital-associated clonal complex 2 Enterococcus faecalis.

Margrete Solheim1, Mari C Brekke, Lars G Snipen, Rob J L Willems, Ingolf F Nes, Dag A Brede.   

Abstract

BACKGROUND: Enterococci rank among the leading causes of nosocomial infections. The failure to identify pathogen-specific genes in Enterococcus faecalis has led to a hypothesis where the virulence of different strains may be linked to strain-specific genes, and where the combined endeavor of the different gene-sets result in the ability to cause infection. Population structure studies by multilocus sequence typing have defined distinct clonal complexes (CC) of E. faecalis enriched in hospitalized patients (CC2, CC9, CC28 and CC40).
RESULTS: In the present study, we have used a comparative genomic approach to investigate gene content in 63 E. faecalis strains, with a special focus on CC2. Statistical analysis using Fisher's exact test revealed 252 significantly enriched genes among CC2-strains. The majority of these genes were located within the previously defined mobile elements phage03 (n = 51), efaB5 (n = 34) and a vanB associated genomic island (n = 55). Moreover, a CC2-enriched genomic islet (EF3217 to -27), encoding a putative phage related element within the V583 genome, was identified. From the draft genomes of CC2-strains HH22 and TX0104, we also identified a CC2-enriched non-V583 locus associated with the E. faecalis pathogenicity island (PAI). Interestingly, surface related structures (including MSCRAMMs, internalin-like and WxL protein-coding genes) implicated in virulence were significantly overrepresented (9.1%; p = 0.036, Fisher's exact test) among the CC2-enriched genes.
CONCLUSION: In conclusion, we have identified a set of genes with potential roles in adaptation or persistence in the hospital environment, and that might contribute to the ability of CC2 E. faecalis isolates to cause disease.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21205308      PMCID: PMC3022643          DOI: 10.1186/1471-2180-11-3

Source DB:  PubMed          Journal:  BMC Microbiol        ISSN: 1471-2180            Impact factor:   3.605


Background

For many years, Enterococcus faecalis was considered as an intestinal commensal, which only sporadically caused opportunistic infections in immunocompromised patients. During the last thirty years, however, E. faecalis has gained notoriety as one of the primary causative agents of nosocomial infections [1,2], including urinary tract infections, endocarditis, intra-abdominal infections and bacteremia. The ability of E. faecalis to cause infection has been connected to inherent enterococcal traits, enabling the bacterium to tolerate diverse and harsh growth conditions. Moreover, several putative enterococcal virulence factors have been characterized (reviewed in [3]), and the role of these virulence factors in pathogenicity have been further established in various animal infection models [4-8] and cultured cell lines [9,10]. Reportedly, several of the proposed virulence determinants are enriched among infection-derived E. faecalis and/or E. faecium isolates, including esp (enterococcal surface protein) [11], hyl (hyaluronidase) [12], genes encoding collagen binding adhesins [13,14] and other matrix-binding proteins [15], and pilin loci [16,17]. On the other hand, recent studies on enterococcal pathogenicity have shown that a number of the putative virulence traits are present not only in infectious isolates but also in animal and environmental isolates [18-23]. This widespread distribution of putative virulence determinants in enterococcal isolates strongly suggest that enterococcal pathogenicity is not a result of any single virulence factor, but rather a more intricate process. Indeed, the virulence potential of the newly sequenced laboratory strain E. faecalis OG1RF was, despite its lack of several factors, comparable to that of the clinical isolate E. faecalis V583 [24]. Bourgogne et al. [24] proposed a scenario where the virulence of V583 and OG1RF may be linked to genes that are unique to each of the two strains, but where the combined endeavor of the different gene-sets result in the ability to cause infection. Population structure studies of E. faecalis by multilocus sequence typing (MLST) have previously defined distinct clonal complexes (CC) of E. faecalis enriched in hospitalized patients (CC2, CC9, CC28 and CC40), designated high-risk enterococcal clonal complexes (HiRECCs) [25,26]. In one of our previous studies, we reported an overall correlation between MLST and Bayesian phylogenetic analysis of gene content as revealed by microarray-based comparative genomic hybridization (CGH) [27]. This observation led us to speculate whether the virulence of different HiRECCs may be due to lineage-specific gene sets. In the present study we have used the comparative genomics approach to further investigate variation in gene content within E. faecalis, with a special focus on CC2. This complex was chosen on the basis of previous Bayesian-based phylogenetic reconstruction [27]. CC2 is equivalent to the previously designated BVE complex, and comprises several clinically important E. faecalis isolates, including the first known beta-lactamase producing isolate HH22, the first U.S. vancomycin-resistant isolate V583, and pathogenicity island (PAI)-harboring clinical bacteremia isolate MMH594 [26,28,29]. This CC represents a globally dispersed hospital-associated lineage, and identification of CC2-enriched genes may unravel novel fitness factors implicated in survival and spread of E. faecalis clones in the hospital environment.

Results and discussion

Overall genomic diversity

To explore the genetic diversity among E. faecalis, BLAST comparison was performed with 24 publicly available sequenced draft genomes, including the two CC2-strains TX0104 (ST2), which is an endocarditis isolate, and HH22 (ST6; mentioned above) against the genome of strain V583, which is also a ST6 isolate. The number of V583 genes predicted to be present varied between 2385 (OG1RF) and 2831 (HH22) for the 24 strains (Additional file 1). In addition, we used CGH to investigate variation in gene content within 15 E. faecalis isolated in European hospital environments, with a special focus on a hospital-adapted subpopulation identified by MLST (CC2). Of the 3219 V583 genes represented on the array, the number of V583 orthologous genes classified as present ranged from 2359 (597/96) to 2883 (E4250). Analysis of the compiled data set (in silico and CGH), revealed a total of 1667 genes present in all strains, thus representing the E. faecalis core genome. None of the annotated V583 genes were found to be divergent in all the isolates analyzed.

Putative CC2-enriched elements

In a previous study, we identified a set of potential pathogen-specific genes, which were entirely divergent in a collection of commensal baby isolates [27]. None of these genes were found to be present in all hospital-related isolates analyzed in the present study, neither was any gene found to be unique to any HiRECC. In order to identify genes specifically enriched among strains belonging to CC2, data from the present study were supplemented with hybridization data from an additional 24 strains of various origins ([27,30] and M. Solheim, unpublished data). The additional data sets were obtained by hybridization to the same array as described above. All together, data from a total of 63 strains were analyzed, in addition to V583 (Table 1). A genome-atlas presentation of the gene content in all the strains analyzed by CGH compared to the V583 genome is shown in Figure 1.
Table 1

Enterococcus faecalis isolates used in this study. CC; clonal complex, CGH; comparative genomic hybridization, MLST; multilocus sequence typing, S; singleton, ST; sequence type.

StrainYearCountrySourceMLSTApplicationReference
STCC
TX0104USAClinical22In silico[65]
609/961996PolandWound62CGH, PCR[25]
372-562007NorwayBlood62CGH, PCR
226B2005NorwayFeces62PCR[27]
368-422007NorwayBlood62PCR
442/052005PolandCSF62PCR[25]
E18282001SpainBlood62PCR[26]
MMH5941985USAClinical62CGHC, PCR[66]
V5831989USABlood62CGH, PCR[67]
158B2005NorwayFeces62CGHB, PCR[27]
HH22≤1982USAUrine62In silico[29]
LMGT330362CGHD, PCR
E18342001SpainBlood512CGH, PCR[26]
E42502007NetherlandsFeces1832CGH, PCR
HIP117042002USAClinical44In silico[68]
E18412001SpainBlood99CGH, PCR[26]
Vet1791999NorwayDog_urine99CGHD, PCR[69]
CH1881980sUSALiver99In silico[70]
E18072002SpainFeces179CGH, PCR[26]
X981934Feces1919In silico[71]
OG1RF≤1975USAOral121CGHC, PCR[72]
E19602001SpainFeces821CGH, PCR[26]
T8≤1992JapanUrine821In silico[73]
2426/032003PolandFeces2121CGH, PCR[25]
ATCC 29200≤1974CanadaUrogenital2121In silico[74]
T1≤19502121In silico[73]
LMGT34061999DenmarkPoultry_feces2221CGHD, PCR
111A2005NorwayFeces16121CGHB, PCR[27]
TX1322USA16121In silico[65]
3339/042004PolandBlood2325CGH, PCR[25]
UC11/46FinlandFeces9725CGH, PCR[19]
1892002-2003NorwayFeces16225CGHB, PCR[27]
Symbioflor 1GermanyFeces24825CGHC, PCR[75]
T2≤1992JapanUrine1128In silico[73]
E11881997GreeceBlood2828CGH, PCR[26]
383/042004PolandBlood8728CGH, PCR[25]
E1052NetherlandsFeces3030CGHD, PCR
852008NorwayFeces3030CGHB, PCR[27]
597/961996PolandUlcer4040CGH, PCR[25]
LMGT2333IcelandFish4040CGHD, PCR
JH1≤1974United KingdomClinical4040In silico[76]
LMGT3209GreeceFood_cheese4040CGHD, PCR
16452007DenmarkBlood22040CGH, PCR
29C2004NorwayFeces4444CGHB, PCR[27]
92A2005NorwayFeces4444CGHB[27]
DS5≤19745555In silico[77]
E2370HungaryWound1658CGH, PCR
1052002-2003NorwayFeces1658CGHB, PCR[27]
D6DenmarkPig1658In silico[31]
E1Sol1960sSolomon IslandsFeces9393In silico[78]
Merz962002USABlood103103In silico[79]
R712USAClinical103103In silico[65]
S613USAClinical103103In silico[65]
LMGT34051999DenmarkPoultry_feces116116CGHD, PCR
LMGT34071999DenmarkPoultry_feces34121CGHD, PCR
Fly12005USADrosophila101101AIn silico[31]
Vet1381998NorwayDog_ear164119ACGHD, PCR[69]
822008NorwayPoultry_feces65SCGHD, PCR
T11≤1992JapanUrine65SIn silico[73]
622002-2003NorwayFeces66SCGHB, PCR[27]
ATCC 42001926Blood105SIn silico
AR01/DG2001New ZealandDog108SIn silico[80]
2662002-2003NorwayFeces163SCGHB, PCR[27]
LMGT3143SpainAnimal_wood pigeon165SCGHD, PCR
LMGT3208GreeceFood_cheese166SCGHD, PCR
842008NorwayPoultry_feces249SCGHD, PCR
TuSoD ef11USAClinical364SIn silico[65]

AClonal complexes were no predicted founder was proposed by eBURST.

BIn Solheim et al. 2009.

CIn Vebø et al. 2010.

DMS, unpublished work.

Figure 1

Genome-atlas presentation of CGH data compared to the V583 genome and arranged by clonal relationship according to MLST. From inner to outer lanes: 1) percent AT, 2) GC skew, 3) global inverted repeats, 4) global direct repeats, 5) position preference, 6) stacking energy, 7) intrinsic curvature, 8) 189, 9) LMGT3208, 10) LMGT3407, 11) 92A, 12) 29C, 13) E1960, 14) 111A, 15) 105, 16) E2370, 17) 84, 18) 383/04, 19) E1188, 20) Vet179, 21) EF1841, 22) E1807, 23) LMGT3143, 24) LMGT3405, 25) OG1RF, 26) 2426/03, 27) LMGT3406, 28) 85, 29) E1052, 30) 1645, 31) LMGT3209, 32) LMGT2333, 33) 597/96, 34) 62, 35) Vet138, 36) 266, 37) UC11/96, 38) Symbioflor 1, 39) 3339/04, 40) 82, 41) E1834, 42) E4250, 43) LMGT3303, 44) 158B, 45) MMH594, 46) 372-56, 47) 609/96 and 48) annotations in V583. Elements enriched in CC2-strains are indicated with an asterisk.

Enterococcus faecalis isolates used in this study. CC; clonal complex, CGH; comparative genomic hybridization, MLST; multilocus sequence typing, S; singleton, ST; sequence type. AClonal complexes were no predicted founder was proposed by eBURST. BIn Solheim et al. 2009. CIn Vebø et al. 2010. DMS, unpublished work. Genome-atlas presentation of CGH data compared to the V583 genome and arranged by clonal relationship according to MLST. From inner to outer lanes: 1) percent AT, 2) GC skew, 3) global inverted repeats, 4) global direct repeats, 5) position preference, 6) stacking energy, 7) intrinsic curvature, 8) 189, 9) LMGT3208, 10) LMGT3407, 11) 92A, 12) 29C, 13) E1960, 14) 111A, 15) 105, 16) E2370, 17) 84, 18) 383/04, 19) E1188, 20) Vet179, 21) EF1841, 22) E1807, 23) LMGT3143, 24) LMGT3405, 25) OG1RF, 26) 2426/03, 27) LMGT3406, 28) 85, 29) E1052, 30) 1645, 31) LMGT3209, 32) LMGT2333, 33) 597/96, 34) 62, 35) Vet138, 36) 266, 37) UC11/96, 38) Symbioflor 1, 39) 3339/04, 40) 82, 41) E1834, 42) E4250, 43) LMGT3303, 44) 158B, 45) MMH594, 46) 372-56, 47) 609/96 and 48) annotations in V583. Elements enriched in CC2-strains are indicated with an asterisk. By Fisher's exact testing (q < 0.01), 252 genes were found to be more prevalent among CC2-strains than in non-CC2-strains (Additional file 2). The CC2-enriched genes included large parts of phage03 (p03; n = 51), efaB5 (n = 34) and a phage-related region identified by McBride et al. [31](EF2240-82/EF2335-51; n = 55), supporting the notion that the p03 genetic element may confer increased fitness in the hospital environment [27]. Indeed, prophage-related genes constituted a predominant proportion of the CC2-enriched genes (55.5%; p < 2.2e-16, Fisher's exact test). Interestingly, the Tn916-like efaB5 element has previously also been suggested to play a role in niche adaptation (Leavis, Willems et al. unpublished data): CGH analysis identified an efaB5-orthologous element in E. faecium that appeared to be common for HiRECC E. faecalis and CC17 E. faecium, a hospital-adapted subpopulation identified by MLST. To further confirm the presence of the relevant MGEs in E. faecalis, we used PCR combining internal primers with primers targeting the genes flanking p03, efaB5 and the vanB-associated phage-related element in V583, to monitor conserved V583 junctions on either side of the elements in 44 strains (Table 1). Seven strains contained the junctions on both sides of p03, of which six strains were CC2-strains. Eleven strains were positive for the junctions on both sides of efaB5, including nine CC2-strains, while thirteen strains gave positive PCR for both junctions of the phage-related element surrounding vanB, of which eleven strains belonged to CC2 (Additional file 3). These results substantiate the theory of p03, efaB5 and the vanB-associated phage as CC2-enriched elements. A total of 178 of the 252 putative CC2-enriched genes identified here, were associated with previously defined MGEs identified in V583 [32]. In addition to p03, efaB5 and the vanB-surrounding phage element, these included p01 (n = 5), PAI (n = 7), p04 (n = 21), p06 (n = 1) and pTEF1 and pTEF2 (n = 5) (Additional file 2). In addition, a ten-gene cluster (EF3217 to -27) with significant GC skew compared to the genome-average (31.6 and 37.4%, respectively), was found to be significantly more frequent in strains belonging to CC2 than in non-CC2 strains. The deviation in GC content suggests that this genetic element may also be of foreign origin. This notion was further supported by the sequence similarities of several of the genes with known phage-related transcriptional regulators (EF3221, EF3223 and EF3227). Moreover, EF3221 to -22 showed high degree of identity (>85%) to EfmE980_2492 to -93 of the newly sequenced Enterococcus faecium E980 [33]. EfmE980_2492 holds a domain characteristic of the aspartate aminotransferase superfamily of pyridoxal phosphate-dependent enzymes. Interestingly, EF3217 encodes a putative helicase, while EF3218 encodes a putative MutT protein, both with implications in DNA repair [34,35]. A potential role of these genes in protection against oxidative DNA damage induced in the hospital environment and during infection is plausible. To further investigate the distribution of EF3217 to -27 in E. faecalis, 44 strains were screened by PCR (Additional file 3): 10 CC2-strains held all ten genes, while 19 strains including two CC2-strains were devoid of the entire element. Moreover, 2 strains contained EF3225 only, 3 strains contained EF3217 to -18, while 8 strains, including OG1RF, contained EF3226 only. The two latter patterns of presence and divergence of EF3217 to -27 were also obtained with BLASTN analysis of TX0104 and OG1RF, respectively, corroborating that these are indeed genuine polymorphisms in this locus. Notably, in the OG1RF genome five more genes (OG1RF_0214 to -18) are also located between the homologs of EF3216 and EF3230 [24], suggesting this locus may represent a hot spot for insertions. Partial sequencing across the junction between EF3216 and EF3230 suggested that several of the non-CC2 strains carry genes homologous to OG1RF_0214 to -18 in this locus (results not shown). Mobile DNA constitutes a substantial fraction of the E. faecalis V583 genome and transfer of MGEs and transposons thus plays an important role in the evolution of E. faecalis genomes [32]. The large pool of mobile elements also represents an abundant source of pseudogenes, as indel events occurring within coding regions often render genes nonfunctional. To verify the expression of the CC2-enriched genes, we correlated the list of enriched genes with data from two transcriptional analyses performed in our laboratory with the same array as used in the CGH experiment described in present study ([30] and Solheim, unpublished work). Transcription was confirmed for all but fifteen of the CC2-enriched genes (results not shown), thus validating the expression of these reading frames. The fifteen genes, for which no transcripts were detected, were mainly located within efaB5 and phage04. A constraint of the comparative genomic analyses presented here, is that the comparison of gene content is based on a single reference strain only (V583). To compensate, we conducted a CC2 pangenome analysis with the draft genomes of CC2-strains HH22 and TX0104 to identify putative CC2-enriched non-V583 genes. The pangenome analysis identified a total of 298 non-V583 ORFs in the HH22 and TX0104 (Additional file 4). Among these ORFs, one gene cluster was identified as particularly interesting (Fisher's exact; Additional file 4 and Figure 2). Notably, HMPREF0348_0426 in TX0104 represented the best BLAST hit for all the three ORFs HMPREF0364_1864 to -66 in HH22, suggesting discrepancy in annotation between the two strains. Sequencing across the gap between contig 00034 and contig 00035 in TX0104 confirmed that HMPREF0348_0427 and HMPREF0348_0428 represent the two respective ends of a gene homologous to HMPREF0346_1863 in HH22. (Additional file 5). The presence of the putative non-V583 CC2-enriched gene cluster among E. faecalis was further elucidated by PCR in our collection of strains (Additional file 3). Strains were screened for the presence of three individual genes (HMPREF0346_1861, HMPREF0346_1864 and HMPREF0346_1868) and the entire element, with primers hmpref0346_1868-F and hmpref0346_1861-R. Fisher's exact testing (q < 0.01) on the basis of the PCR data confirmed that the gene cluster was significantly enriched among CC2. Comparative sequence analysis of the flanking regions suggests that the gene cluster is located in the HH22 and TX0104 versions of the E. faecalis pathogenicity island [36]. Recently, a microarray-based assessment of PAI-content in a set of clinical E. faecalis isolates revealed high degree of variation within the island, and an evidently modular evolution of the PAI [37], which would be consistent with acquisition by an indel event of this locus in the PAI of TX0104, HH22 and other positive CC2-strains.
Figure 2

Schematic representation of a putative non-V583 CC2-enriched gene cluster, as annotated in the ACIX00000000 and ACGL00000000, respectively). The EF-numbers of flanking genes indicate the insert site location compared to the E. faecalis V583 pathogenicity island.

Schematic representation of a putative non-V583 CC2-enriched gene cluster, as annotated in the ACIX00000000 and ACGL00000000, respectively). The EF-numbers of flanking genes indicate the insert site location compared to the E. faecalis V583 pathogenicity island.

CC2-enriched surface-related structures

Lepage et al. [38] have previously identified eight genes as potential markers for the V583/MMH594-lineage, of which all except one gene (EF2513) are found among the CC2-enriched genes in this study. Interestingly, several of these genes were later assigned to a recently classified family of surface proteins, with a C-terminal WxL domain, proposed to form multi-component complexes on the cell surface [39,40]. Siezen et al. [40] termed these genes cell-surface complex (csc) genes and postulated a role in carbon source acquisition. Independently, Brinster et al. [39] showed that WxL domains are involved in peptidoglycan-binding. A total of nine WxL protein-coding genes, divided into three clusters (EF2248 to -54, EF3153 to -55 and EF3248 to -53), were identified as putative CC2-enriched genes in the present study. Note that EF3153 to - 55 does not represent a complete csc gene cluster, as not all four csc gene families (cscA - cscD) are present in the cluster [40]. Interestingly, the OG1RF genome sequence revealed homologues loci encoding WxL-proteins corresponding to the gene clusters EF3153 to -55 and EF3248 to -53 in V583 (50-75% sequence identity) [24]. Such homologs may possibly explain the divergence observed between CC2 and non-CC2-strains in the present study. Indeed, BLAST analysis with the OG1RF sequences against the E. faecalis draft genomes suggested that the OG1RF_0209-10 and OG1RF_0224-25 are widely distributed among non-CC2 E. faecalis. Given the putative function in carbon metabolism, the observed sequence variation may be related to substrate specificity. In addition to the WxL domain, EF2250 also encodes a domain characteristic for the internalin family [39]. Internalins are characterized by the presence of N-terminal leucine-rich repeats (LRRs). The best characterized bacterial LRR proteins are InlA and InlB from Listeria monocytogenes, known to trigger internalization by normally non-phagocytic cells [41]. Two internalin-like proteins were identified in E. faecalis V583 (EF2250 and elrA (EF2686)) [41,42]. Recently, Brinster et al. [42] presented evidence of that ElrA play a role in E. faecalis virulence, both in early intracellular survival in macrophages and by stimulating the host inflammatory response through IL-6 induction. Moreover, by quantitative real-time PCR Shepard and Gilmore [43] found that elrA was induced in E. faecalis MMH594 during exponential growth in serum and during both exponential and stationary growth in urine. Contradictory data have, however, been published for this and other strains using different methods [42,44]. Although it is tempting to speculate that EF2250 contributes to the interaction with the mammalian host, the role of internalins in E. faecalis pathogenesis is still not understood, and it may therefore be premature to extrapolate function solely on the basis of shared structural domains. Glycosyl transferase family proteins are involved in the formation of a number of cell surface structures such as glycolipids, glycoproteins and polysaccharides [45]. E. faecalis is in possession of several capsular polysaccharides [46-48], with Cps and Epa being the best characterized. The epa (enterococcal polysaccharide antigen) cluster represents a rhamnose-containing polysaccharide which was originally identified in E. faecalis OG1RF [46]. The version of the epa cluster found in the V583 genome contains an insertion of four genes (EF2185 to -88) compared to OG1RF. This insertion appeared to be enriched among CC2. While EF 2185 and EF2187 encodes transposases of the IS256 family, the two remaining genes showed 100% identity to the two respective ends of a racemase domain protein in E. faecalis TX0104. Neighboring the epa cluster, two glycosyl transferases (EF2170 and EF2167) proposed as potential virulence factors [32], are part of a three operon locus (EF2172 to -66), possibly associated with lipopolysaccharide production. Five of the genes within this locus were also found to be enriched among CC2 in the present study. Paulsen et al. [32] also listed other putative surface-exposed virulence genes, including a choline-binding protein (CBP; EF2662) and a putative MSCRAMM (microbial surface components recognizing adhesive matrix molecules; EF2347) that based on our analysis were found to be enriched in CC2. A role of CBPs in pneumococcal colonization and virulence has been established [49,50]. A number of putative MSCRAMMs have been identified in E. faecalis [51], however, only Ace (adhesion of collagen from E. faecalis; EF1099) has been characterized in detail: Ace was shown to mediate binding to collagen (type I and IV), dentin and laminin [52-54]. Lebreton et al. [55] recently presented evidence of an in vivo function of Ace in enterococcal infections other than involvement in the interaction with extracellular matrix. It was demonstrated that an ace deletion mutant was significantly impaired in virulence, both in an insect model and in an in vivo-in vitro murine macrophage models. The authors suggested that Ace may promote E. faecalis phagocytosis and that it may also be possible that Ace is involved in survival of enterococci inside phagocytic cells. Also the structurally related MSCRAMM, Acm, found in E. faecium was recently reported to contribute to the pathogenesis of this bacterium [56]. Mucins are high molecular weight glycoproteins expressed by a wide variety of epithelial cells, including those of the gastrointestinal tract, and located at the interface between the cell and the surrounding environment [57]. The binding of bacteria to mucins through mucin-binding domain proteins is thought to promote colonization [58]. Diversity in the carbohydrate side chains creates a significant heterogeneity among mucins of different origin (e.g. different organisms or body sites), facilitating bacterial attachment to epithelial cells [58]. The non-V583 CC2-enriched gene cluster identified through in silico analysis in the present study harboured an ORF (HMPREF0346_1863 and HMPREF0348_0427/HMPREF0348_0428 in HH22 and TX0104, respectively) with homology to known mucin-binding domain proteins.

Conclusions

In conclusion, we have identified a set of genes that appear to be enriched among strains belonging to CC2. Since a significant proportion (9.1%; p = 0.036, Fisher's exact test) of these genes code for proteins associated with cell surface structures, absence of or divergence in these loci may lead to antigenic variation. Indeed, both MSCRAMMs and internalins have been identified as potential antigens of E. faecalis or other Gram-positive bacteria [59-61]. It is noteworthy that the genes encoding any of the established enterococcal virulence factors were not among the CC2-enriched genes. Surface structures that promote adhesion of pathogenic bacteria to human tissue are also promising targets for creation of effective vaccines. However, functional studies of the individual CC2-enriched genes are required in order to distinguish their implications in enterococcal virulence.

Methods

Bacterial strain and growth conditions

Bacterial strains used in this study are listed in Table 1. E. faecalis strains were grown overnight (ON) in brain heart infusion broth (BHI; Oxoid) at 37° without shaking. All the strains have previously been sequence typed by the MLST scheme proposed by Ruiz-Garbajosa et al. [26].

Comparative genomic hybridization

Microarrays

The microarray used in this work has been described previously [27]. The microarray design has been deposited in the ArrayExpress database with the accession number A-MEXP-1069 and A-MEXP-1765.

DNA isolation

Genomic DNA was isolated by using the FP120 FastPrep bead-beater (BIO101/Savent) and the QiaPrep MiniPrep kit (Qiagen) as previously described [27].

Fluorescent labeling and hybridization

Fifteen hospital-associated E. faecalis strains were selected for CGH based on their representation of MLST sequence types (STs) belonging to major CCs and potential HiRECCs, with a special focus on CC2, and their variety of geographical origins within Europe. Genomic DNA was labeled and purified with the BioPrime Array CGH Genomic labeling System (Invitrogen) and Cyanine Smart Pack dUTP (PerkinElmer Life Sciences), according to the manufacturer's protocol. Purified samples were then dried, prior to resuspension in 140 μl hybridization solution (5 × SSC, 0.1% (w/v) SDS, 1.0% (w/v) bovine serum albumin, 50% (v/v) formamide and 0.01% (w/v) single-stranded salmon sperm DNA) and hybridized for 16 h at 42°C to the E. faecalis oligonucleotide array in a Tecan HS 400 pro hybridization station (Tecan). Arrays were washed twice at 42°C with 2 × SSC + 0.2% SDS, and twice at 23°C with 2 × SSC, followed by washes at 23°C with 1) 0.2 × SSC and 2) H2O. Two replicate hybridizations (dye-swap) were performed for each test strain. Hybridized arrays were scanned at wavelengths of 532 nm (Cy3) and 635 nm (Cy5) with a Tecan scanner LS (Tecan). Fluorescent intensities and spot morphologies were analyzed using GenePix Pro 6.0 (Molecular Devices), and spots were excluded based on slide or morphology abnormalities. All water used for the various steps of the hybridization and for preparation of solutions was filtered (0.2 μM) MilliQ dH20.

Data analysis

Standard methods in the LIMMA package [62] in R http://www.r-project.org/, available from the Bioconductor http://www.bioconductor.org were employed for preprocessing and normalization. Within-array normalization was first conducted by subtracting the median from the log-ratios for each array. A standard loess-normalization was then performed, where smoothing was based only on spots with abs(log-ratio) < 2.0 to avoid biases due to extreme skewness in the log-ratio distribution. For the determination of present and divergent genes a method that predicts sequence identity based on array signals was used, as described by Snipen et al. [63]. A threshold of 0.75 was used in order to obtain a categorical response of presence or divergence, i. e. genes with Sb-value > 0.75 were classified as present, while genes with Sb-value < 0.75 were classified as divergent. Genes with Sb-value = 0.75 remained unclassified. All genes were tested for significant enrichment among the CC2-strains by using the Fisher's exact test.

Microarray data accession number

The microarray data have been deposited in the ArrayExpress database with the series accession number E-TABM-905.

Polymerase chain reaction

The presence of selected genes was verified by means of polymerase chain reactions (PCR). A similar approach was also applied to investigate the presence of selected mobile genetic elements (MGEs). Primers targeting the genes flanking the MGEs were combined with internal primers to monitor the presence of the junctions on either side of each MGE. PCR was carried out in 20 μl reaction volumes containing 1× buffer, 250 μM of each deoxynucleotide triphosphate and 1 U DyNAZyme II polymerase (Finnzymes). The reaction conditions included an initial denaturation step at 95°C and 35 cycles of 95°C for 30 s, 56-60°C for 30 s and 72°C for 1-5 min, followed by a final extension step at 72°C for 7 min. The primers used in this study are listed in Table 2.
Table 2

Primers used in this study.

Target genePrimer sequences (5' → 3')Amplicon size (bp)Application
ef1415F:TGTTGCGGTTTCTGCATTAG2818PCR on junction between EF1415 and EF1417
ef1417R:GCATCTCGATAGACAATTCGPCR on junction between EF1415 and EF1417
ef1489F:GAATCGAACTAGCATTTTTGGG465PCR on junction between EF1489 and EF1490
ef1490R:ATGGAACGAACCATTGGAAAPCR on junction between EF1489 and EF1490
ef1843F:GGAGCCGTTAGACAGACAGC2457PCR on junction between EF1843 and EF1847
ef1847R:GCTTGCTTTACAGCCTCAAGAPCR on junction between EF1843 and EF1847
ef1895F:GCACAACAAATTTCAATTCCA4573PCR on junction between EF1895 and EF1898
ef1898R:ATTGAAGTGGTTCGCTACGGPCR on junction between EF1895 and EF1898
ef2239F:AACTGCTGTCAAGCGTAGCA1252PCR on junction between EF2239 and EF2240
ef2240R:TGTGGCATTTTGGACTGTTGPCR on junction between EF2239 and EF2240
ef2350F:ATAACTGAGTGATTTTCACAATTGC654PCR on junction between EF2350 and EF2352
ef2352R:GATCCGTGGAAGTTCCTCAAPCR on junction between EF2350 and EF2352
ef3216F:TCGGCGTTGAAGACTATGAA-Sequencing of junction between EF3216 and EF3230
ef3217F:ATTGGGAATGACGGCTACACR:TTGCGTATTTCGCAGCATAA499PCR
ef3218F:TCGCGTAGTAGGAGCAATCAR:TTTTGTTCAGTTCCCACACCT396PCR
ef3220F:AGCTTTTGGCGAAGGAGATTR:TTTATTGCGGGTTCCTCAGT495PCR
ef3221F:TGAACGAAAATGAAGGTGGTR:TCATCAATCTCCAACGCATC196PCR
ef3222F:CAAAGAAGAATCAGCCGATTAAAR:ATATTTGGGCATTTGCATGG183PCR
ef3223F:AATTGGGAAAAAGGGGTCAGR:TTCGTGATCTGCTTGTTGTTCT501PCR
ef3224F:GTTGGGCTGGACGTATGAATR:TGTGGCTTTATAGGCTGTAGCA214PCR
ef3225F:ATTACTTCACCGCCCATGACR:CGCTGGAAGTCTGCTCTTG474PCR
ef3226F:GATGATTTAACCGCACAAGGAR:TTTTTATTTCGAGCGGATGC499PCR
ef3227F:ACAGGAAGCCATTCACAAACTR:CTGATTCGTGGAAGTCCAACT162PCR
ef3230R:TCCTGACTTCCGTTCTGCTT-Sequencing of junction between EF3216 and EF3230
hmpref0346_1861F:CGAGTTAGAGGAAGCGTTGG630PCR
R:CCAGACAATTTGGGCGTACT
hmpref0346_1864F:GAAATTTTCTGAAAGTGAAGACAAGA299PCR
R:TGATTAGCAGTCACAACAGCAA
hmpref0346_1868F:TGTACACAAGCTACCCGGATT538PCR
R:TTCCCACCTGCGTCTATTTT
hmpref0348_0427R:GAGACTTCAACCACTCCACAAAAACC-Sequencing of gap between contig00034-35 in TX0104
hmpref0348_0428F:CCTGTAGAAGTATTGTCCATTTTAACGCTATCSequencing of gap between contig00034-35 in TX0104
Primers used in this study.

Validation of microarray data by sequencing

Sequencing was performed using the ABI Prism Big dye Cycle Sequencing Ready Reaction kit (Applied Biosystems) in an ABI PrismTM 3100 Genetic Analyzer and primers listed in Table 2.

In silico comparison of E. faecalis draft genomes

Whole genome blast comparison against the V583 reference genome was conducted for 24 E. faecalis strains whose draft genomes were publicly available (GenBank accession numbers in parenthesis; Table 1): E. faecalis ARO1/DG (ACAK01000000); E. faecalis ATCC 4200 (ACAG01000000); E. faecalis ATCC 29200 (ACOX00000000); E. faecalis CH188 (ACAV01000000); E. faecalis D6 (ACAT01000000); E. faecalis DS5 (ACAI01000000); E. faecalis E1Sol (ACAQ01000000); E. faecalis Fly1 (ACAR01000000): E. faecalis HIP11704 (ACAN01000000); E. faecalis HH22 (ACIX00000000); E. faecalis JH1 (ACAP01000000); E. faecalis Merz96 (ACAM01000000); E. faecalis OG1RF (ABPI01000001); E. faecalis R712 (ADDQ00000000); E. faecalis S613 (ADDP00000000); E. faecalis T1 (ACAD01000000); E. faecalis T2 (ACAE01000000); E. faecalis T3 (ACAF01000000); E. faecalis T8 (ACOC01000000); E. faecalis T11 (ACAU01000000); E. faecalis TuSoD ef11(ACOX00000000); E. faecalis TX0104 (ACGL00000000); E. faecalis TX1322 (ACGM00000000); E. faecalis X98 (ACAW01000000) [64,65], as follows: the annotated V583 genes were blasted (BLASTN) against each genome, and presence and divergence was predicted based on a score calculated as number of identical nucleotides divided by the length of the query gene. Genes obtaining a score >0.75 were predicted to be present.

CC2 pangenome content analysis

Among the newly released E. faecalis draft genomes were two CC2-strains; HH22 and TX0104. In order to extend the list of CC2-enriched genes beyond V583, we conducted a BLAST search using the annotated genes of these two strains as queries against the full genome sequences of the other draft genomes. Again, a cutoff of 75% identity to the query was used to distinguish present from divergent genes.

Authors' contributions

MS conceived and designed the study, carried out the experimental work, analyzed the data, assisted in the bioinformatic analysis and drafted the manuscript. MCB performed the experimental work and assisted in critical review of the manuscript. LS contributed analysis tools, performed the statistical and bioinformatic analyses and assisted in the critical review of the manuscript. RJLW conceived and designed the study, contributed material and assisted in critical review of the manuscript. IFN conceived the study, contributed material and assisted in critical review of the manuscript. DAB participated in the design and coordination of the study, performed bioinformatic analysis and helped to draft the manuscript. All authors read and approved the final manuscript.

Additional file 1

BLAST comparison of . Data from BLAST comparison of 24 E. faecalis draft genomes with the annotated genes of strain V583. Click here for file

Additional file 2

V583 genes which were identified as significantly enriched among CC2-strains in the present study. A list of V583 genes which were identified as significantly enriched among CC2-strains in the present study. Click here for file

Additional file 3

PCR screening. An overview of results from PCR screening of a collection of E. faecalis isolates. Click here for file

Additional file 4

Enrichment analysis of CC6 non-V583 genes by Fisher's exact test. An overview of the presence non-V583 genes in 24 E. faecalis draft genomes CC6 including data from enrichment analysis by Fisher's exact test. Click here for file

Additional file 5

Amino acid alignment of HMPREF0346_1863 in . An amino acid alignment of HMPREF0346_1863 in Enterococcus faecalis HH22 and its homologue in E. faecalis TX0104. Click here for file
  79 in total

1.  Modulation of virulence within a pathogenicity island in vancomycin-resistant Enterococcus faecalis.

Authors:  Nathan Shankar; Arto S Baghdayan; Michael S Gilmore
Journal:  Nature       Date:  2002-06-13       Impact factor: 49.962

2.  Multilocus sequence typing scheme for Enterococcus faecalis reveals hospital-adapted genetic complexes in a background of high rates of recombination.

Authors:  Patricia Ruiz-Garbajosa; Marc J M Bonten; D Ashley Robinson; Janetta Top; Sreedhar R Nallapareddy; Carmen Torres; Teresa M Coque; Rafael Cantón; Fernando Baquero; Barbara E Murray; Rosa del Campo; Rob J L Willems
Journal:  J Clin Microbiol       Date:  2006-06       Impact factor: 5.948

3.  Clonal structure of Enterococcus faecalis isolated from Polish hospitals: characterization of epidemic clones.

Authors:  Magdalena Kawalec; Zbigniew Pietras; Emilia Daniłowicz; Aleksandra Jakubczak; Marek Gniadkowski; Waleria Hryniewicz; Rob J L Willems
Journal:  J Clin Microbiol       Date:  2006-11-08       Impact factor: 5.948

4.  InlA and InlC2 of Listeria monocytogenes serotype 4b are two internalin proteins eliciting humoral immune responses common to listerial infection of various host species.

Authors:  Wei Ling Yu; Hanhong Dan; Min Lin
Journal:  Curr Microbiol       Date:  2008-01-29       Impact factor: 2.188

Review 5.  The MutT proteins or "Nudix" hydrolases, a family of versatile, widely distributed, "housecleaning" enzymes.

Authors:  M J Bessman; D N Frick; S F O'Handley
Journal:  J Biol Chem       Date:  1996-10-11       Impact factor: 5.157

6.  Contribution of the pAD1-encoded cytolysin to the severity of experimental Enterococcus faecalis endophthalmitis.

Authors:  B D Jett; H G Jensen; R E Nordquist; M S Gilmore
Journal:  Infect Immun       Date:  1992-06       Impact factor: 3.441

7.  Recovery of resistance (R) factors from a drug-free community.

Authors:  P Gardner; D H Smith; H Beer; R C Moellering
Journal:  Lancet       Date:  1969-10-11       Impact factor: 79.321

8.  Comparative genomic hybridization analysis of Enterococcus faecalis: identification of genes absent from food strains.

Authors:  E Lepage; S Brinster; C Caron; Céline Ducroix-Crepy; L Rigottier-Gois; G Dunny; C Hennequet-Antier; P Serror
Journal:  J Bacteriol       Date:  2006-10       Impact factor: 3.490

9.  Occurrence of virulence factors among human intestinal enterococcal isolates.

Authors:  H Lempiäinen; K Kinnunen; A Mertanen; A von Wright
Journal:  Lett Appl Microbiol       Date:  2005       Impact factor: 2.858

10.  The capsular polysaccharide of Enterococcus faecalis and its relationship to other polysaccharides in the cell wall.

Authors:  Lynn E Hancock; Michael S Gilmore
Journal:  Proc Natl Acad Sci U S A       Date:  2002-02-05       Impact factor: 11.205

View more
  21 in total

1.  Genome Modification in Enterococcus faecalis OG1RF Assessed by Bisulfite Sequencing and Single-Molecule Real-Time Sequencing.

Authors:  Wenwen Huo; Hannah M Adams; Michael Q Zhang; Kelli L Palmer
Journal:  J Bacteriol       Date:  2015-03-30       Impact factor: 3.490

2.  Distribution of antimicrobial resistance determinants, virulence-associated factors and clustered regularly interspaced palindromic repeats loci in isolates of Enterococcus faecalis from various settings and genetic lineages.

Authors:  Iwona Gawryszewska; Katarzyna Malinowska; Alicja Kuch; Dorota Chrobak-Chmiel; Lucja Laniewska- Trokenheim; Waleria Hryniewicz; Ewa Sadowy
Journal:  Pathog Dis       Date:  2017-03-01       Impact factor: 3.166

3.  Comparison of antibiotic resistance, biofilm formation and conjugative transfer of Staphylococcus and Enterococcus isolates from International Space Station and Antarctic Research Station Concordia.

Authors:  Katarzyna Schiwon; Karsten Arends; Katja Marie Rogowski; Svea Fürch; Katrin Prescha; Türkan Sakinc; Rob Van Houdt; Guido Werner; Elisabeth Grohmann
Journal:  Microb Ecol       Date:  2013-02-15       Impact factor: 4.552

4.  Genetic basis for daptomycin resistance in enterococci.

Authors:  Kelli L Palmer; Anu Daniel; Crystal Hardy; Jared Silverman; Michael S Gilmore
Journal:  Antimicrob Agents Chemother       Date:  2011-04-18       Impact factor: 5.191

5.  A genomic virulence reference map of Enterococcus faecalis reveals an important contribution of phage03-like elements in nosocomial genetic lineages to pathogenicity in a Caenorhabditis elegans infection model.

Authors:  Sabina Leanti La Rosa; Lars-Gustav Snipen; Barbara E Murray; Rob J L Willems; Michael S Gilmore; Dzung B Diep; Ingolf F Nes; Dag Anders Brede
Journal:  Infect Immun       Date:  2015-03-16       Impact factor: 3.441

6.  Construction and application of a luxABCDE reporter system for real-time monitoring of Enterococcus faecalis gene expression and growth.

Authors:  Sabina Leanti La Rosa; Dzung B Diep; Ingolf F Nes; Dag Anders Brede
Journal:  Appl Environ Microbiol       Date:  2012-07-27       Impact factor: 4.792

7.  In vivo assessment of growth and virulence gene expression during commensal and pathogenic lifestyles of luxABCDE-tagged Enterococcus faecalis strains in murine gastrointestinal and intravenous infection models.

Authors:  Sabina Leanti La Rosa; Sabina Leanti La Rosa; Pat G Casey; Colin Hill; Dzung B Diep; Ingolf F Nes; Dag A Brede
Journal:  Appl Environ Microbiol       Date:  2013-04-19       Impact factor: 4.792

Review 8.  Friend turned foe: evolution of enterococcal virulence and antibiotic resistance.

Authors:  Daria Van Tyne; Michael S Gilmore
Journal:  Annu Rev Microbiol       Date:  2014-06-18       Impact factor: 15.500

9.  Nonclinical and clinical Enterococcus faecium strains, but not Enterococcus faecalis strains, have distinct structural and functional genomic features.

Authors:  Eun Bae Kim; Maria L Marco
Journal:  Appl Environ Microbiol       Date:  2013-10-18       Impact factor: 4.792

10.  Extensive Comparative Genomic Analysis of Enterococcus faecalis and Enterococcus faecium Reveals a Direct Association between the Absence of CRISPR-Cas Systems, the Presence of Anti-Endonuclease (ardA) and the Acquisition of Vancomycin Resistance in E. faecium.

Authors:  Kodjovi D Mlaga; Vincent Garcia; Philippe Colson; Ruimy Raymond; Jean-Marc Rolain; Seydina M Diene
Journal:  Microorganisms       Date:  2021-05-21
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.