| Literature DB >> 22291916 |
Jessica Galloway-Peña1, Jung Hyeob Roh, Mauricio Latorre, Xiang Qin, Barbara E Murray.
Abstract
Recent studies have pointed to the existence of two subpopulations of Enterococcus faecium, one containing primarily commensal/community-associated (CA) strains and one that contains most clinical or hospital-associated (HA) strains, including those classified by multi-locus sequence typing (MLST) as belonging to the CC17 group. The HA subpopulation more frequently has IS16, pathogenicity island(s), and plasmids or genes associated with antibiotic resistance, colonization, and/or virulence. Supporting the two clades concept, we previously found a 3-10% difference between four genes from HA-clade strains vs. CA-clade strains, including 5% difference between pbp5-R of ampicillin-resistant, HA strains and pbp5-S of ampicillin-sensitive, CA strains. To further investigate the core genome of these subpopulations, we studied 100 genes from 21 E. faecium genome sequences; our analyses of concatenated sequences, SNPs, and individual genes all identified two distinct groups. With the concatenated sequence, HA-clade strains differed by 0-1% from one another while CA clade strains differed from each other by 0-1.1%, with 3.5-4.2% difference between the two clades. While many strains had a few genes that grouped in one clade with most of their genes in the other clade, one strain had 28% of its genes in the CA clade and 72% in the HA clade, consistent with the predicted role of recombination in the evolution of E. faecium. Using estimates for Escherichia coli, molecular clock calculations using sSNP analysis indicate that these two clades may have diverged ≥1 million years ago or, using the higher mutation rate for Bacillus anthracis, ∼300,000 years ago. These data confirm the existence of two clades of E. faecium and show that the differences between the HA and CA clades occur at the core genomic level and long preceded the modern antibiotic era.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22291916 PMCID: PMC3266884 DOI: 10.1371/journal.pone.0030187
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Position of the 100 genes analyzed on the TX16 (DO) chromosome.
The chromosomal map of the Enterococcus faecium genome of the endocarditis isolate TX16, commonly referred to as DO. The arrows represent the 100 genes chosen for clade and SNP analysis and their position on the chromosome. The red arrows depict genes transcribed in the forward orientations, and blue arrows depict genes transcribed in the reverse orientation. The general location of a few subset of the 100 genes are labeled.
The sequence type, country, date, and source of isolation for the 21 sequenced E. faecium genomes used in this study.
| Strain | ST | Country | year | Source | Accession # |
| C68 | 16 | USA (OH) | 1998 | Endocarditis patient (blood) | ACJQ00000000 |
| Com12 | 107 | USA | 2006 | Healthy volunteer fecal sample | ACBC00000000 |
| Com15 | 583 | USA | 2006 | Healthy volunteer fecal sample | ACBD00000000 |
| D344SRF | 21 | France | 1985 | Clinical isolate | ACZZ00000000 |
| E980 | 94 | Netherlands | 1998 | Healthy volunteer fecal sample | ABQA01000001 |
| E1039 | 42 | Netherlands | 1998 | Healthy volunteer fecal sample | ACOS00000000 |
| E1071 | 32 | Netherlands | 2000 | Hospitalized patient fecal sample | ABQI01000001 |
| E1162 | 17 | France | 1997 | Blood Culture of Hospitalized Patient | ABQJ00000000 |
| E1636 | 106 | Netherlands | 1961 | Blood Culture of Hospitalized Patient | ABRY01000001 |
| E1679 | 114 | Brazil | 1998 | Swab of Vascular Catheter Tip | ABSC01000001 |
| TX16 | 18 | USA (TX) | 1992 | Endocarditis patient (blood) | ACIY00000000 |
| TX82 | 17 | USA (TX) | 1999 | Endocarditis patient (blood) | AEBU00000000 |
| TX0133 | 17 | USA (TX) | 2006 | Endocarditis patient (blood) | AECH00000000 |
| TX1330 | 107 | USA | 1994 | Healthy volunteer fecal sample | ACHL00000000 |
| U0317 | 78 | Netherlands | 2005 | UTI of Hospitalized Patient (Urine) | ABSW01000001 |
| 1141733 | 327 | unk. | unk. | Blood Culture of Hospitalized Patient | ACAZ00000000 |
| 1230933 | 18 | unk. | unk. | Wound Swab of Hospitalized Patient | ACAS00000000 |
| 1231408 | 582 | unk. | unk. | Blood Culture of Hospitalized Patient | ACBB00000000 |
| 1231410 | 17 | unk. | unk. | Skin and Soft Tissue Infection | ACBA00000000 |
| 1231501 | 52 | unk. | unk. | Blood Culture of Hospitalized Patient | ACAY00000000 |
| 1231502 | 203 | unk. | unk. | Blood Culture of Hospitalized Patient | ACAX00000000 |
ST is the sequence type by multilocus sequence typing.
“unk.” means the information for this isolate is unknown.
Percent identity and divergence score matrix of the 100 concatenated gene nucleotide sequence.
| Percent Identity (%) | ||||||||||||||||||||||
| Divergence | 1141733 | Com12 | Com15 | E980 | TX1330 | 1231408 | 1230933 | 1231410 | 1231501 | 1231502 | C68 | D344SRF | TX16 | E1039 | E1071 | E1162 | E1636 | E1679 | TX82 | TX0133A | U0317 | |
|
|
|
|
|
|
| 95.9 | 96 | 96.2 | 96 | 95.8 | 95.9 | 95.9 | 96.1 | 96 | 95.9 | 96 | 95.9 | 95.9 | 95.8 | 96 | ||
|
|
|
|
|
|
| 96 | 96 | 96.2 | 96 | 95.8 | 95.9 | 96 | 96.2 | 96 | 95.9 | 96 | 96 | 95.9 | 95.9 | 96 | ||
|
|
|
|
|
|
| 95.9 | 96 | 96.2 | 96.1 | 95.8 | 95.9 | 95.9 | 96.2 | 96.1 | 96 | 96.1 | 96 | 96 | 95.9 | 96.1 | ||
|
|
|
|
|
|
| 96.2 | 96.3 | 96.5 | 96.3 | 96 | 96.2 | 96.2 | 96.5 | 96.3 | 96.2 | 96.3 | 96.2 | 96.2 | 96.2 | 96.3 | ||
|
|
|
|
|
|
| 96 | 96 | 96.2 | 96 | 95.8 | 95.9 | 96 | 96.2 | 96 | 95.9 | 96 | 96 | 95.9 | 95.9 | 96 | ||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
|
| 4.2 | 4.2 | 4.2 | 3.9 | 4.2 |
| 99.9 | 99.2 | 99.8 | 99.6 | 99.7 | 100 | 99.2 | 99.6 | 99.8 | 99.4 | 99.6 | 99.8 | 99.8 | 99.8 | ||
|
| 4.1 | 4.1 | 4.1 | 3.9 | 4.1 |
| 0.1 | 99.2 | 99.8 | 99.7 | 99.6 | 99.9 | 99.2 | 99.6 | 99.9 | 99.3 | 99.5 | 99.9 | 99.8 | 99.8 | ||
|
| 3.9 | 3.9 | 3.9 | 3.6 | 3.9 |
| 0.8 | 0.9 | 99.1 | 99 | 99.3 | 99.2 | 99.2 | 99.1 | 99.2 | 99.4 | 99.2 | 99.2 | 99.3 | 99.1 | ||
|
| 4.1 | 4.1 | 4.1 | 3.8 | 4.1 |
| 0.2 | 0.2 | 0.9 | 99.5 | 99.5 | 99.8 | 99.2 | 99.7 | 99.7 | 99.2 | 99.5 | 99.7 | 99.7 | 100 | ||
|
| 4.4 | 4.4 | 4.4 | 4.1 | 4.4 |
| 0.4 | 0.3 | 1 | 0.5 | 99.5 | 99.6 | 99 | 99.5 | 99.8 | 99.2 | 99.4 | 99.8 | 99.7 | 99.5 | ||
|
| 4.3 | 4.2 | 4.2 | 3.9 | 4.2 |
| 0.3 | 0.4 | 0.7 | 0.5 | 0.5 | 99.7 | 99.2 | 99.5 | 99.7 | 99.7 | 99.7 | 99.7 | 99.8 | 99.5 | ||
|
| 4.2 | 4.2 | 4.2 | 3.9 | 4.2 |
| 0 | 0.1 | 0.8 | 0.2 | 0.4 | 0.3 | 99.2 | 99.6 | 99.8 | 99.4 | 99.5 | 99.8 | 99.8 | 99.8 | ||
|
| 4 | 4 | 3.9 | 3.7 | 4 |
| 0.8 | 0.8 | 0.8 | 0.8 | 1 | 0.8 | 0.8 | 99.2 | 99.2 | 99.5 | 99.2 | 99.2 | 99.2 | 99.2 | ||
|
| 4.1 | 4.1 | 4.1 | 3.8 | 4.1 |
| 0.4 | 0.4 | 0.9 | 0.3 | 0.5 | 0.5 | 0.4 | 0.8 | 99.7 | 99.3 | 99.6 | 99.7 | 99.6 | 99.7 | ||
|
| 4.2 | 4.2 | 4.2 | 3.9 | 4.2 |
| 0.2 | 0.1 | 0.8 | 0.3 | 0.2 | 0.3 | 0.2 | 0.8 | 0.3 | 99.4 | 99.6 | 100 | 99.9 | 99.7 | ||
|
| 4.1 | 4.1 | 4.1 | 3.8 | 4.1 |
| 0.6 | 0.7 | 0.6 | 0.8 | 0.8 | 0.3 | 0.6 | 0.5 | 0.7 | 0.6 | 99.5 | 99.4 | 99.5 | 99.2 | ||
|
| 4.2 | 4.2 | 4.2 | 3.9 | 4.2 |
| 0.4 | 0.5 | 0.8 | 0.5 | 0.6 | 0.3 | 0.5 | 0.8 | 0.4 | 0.4 | 0.5 | 99.6 | 99.7 | 99.5 | ||
|
| 4.2 | 4.2 | 4.2 | 3.9 | 4.2 |
| 0.2 | 0.1 | 0.8 | 0.3 | 0.2 | 0.3 | 0.2 | 0.8 | 0.3 | 0 | 0.6 | 0.4 | 99.9 | 99.7 | ||
|
| 4.3 | 4.3 | 4.3 | 4 | 4.3 |
| 0.2 | 0.2 | 0.7 | 0.3 | 0.3 | 0.2 | 0.2 | 0.8 | 0.4 | 0.1 | 0.5 | 0.3 | 0.1 | 99.7 | ||
|
| 4.1 | 4.1 | 4.1 | 3.8 | 4.1 |
| 0.2 | 0.2 | 0.9 | 0 | 0.5 | 0.5 | 0.2 | 0.8 | 0.3 | 0.3 | 0.8 | 0.5 | 0.3 | 0.3 | ||
The numbers in italics (upper left) are the percent identity and divergence scores of the CA strains.
The numbers in bold are the percent identity and divergence scores of the hybrid strain.
The numbers in regular text (down right) are the percent identity and divergence scores of the HA strains.
Figure 2UPGMA phylogenetic tree of the concatenated 100 gene sequence (106,818 bp).
The results are based on a pair-wise analysis of the concatenated 100 gene nucleotide sequence for each of the 21 E. faecium strains using the Poisson correction method in MEGA4.0.2 software. An UPGMA tree was constructed depicting the evolutionary distance between these sequences. The tree is drawn to scale with the branch lengths representing the evolutionary distances, the scale can be seen at the bottom of the tree.
Analysis of 92 individual genes.
| Community Clade | Hospital Clade | Hybrid Clade |
| 1141733 (92+0) | C68 (0+92) | 1231408 (26+66) 66) |
| Com12 (91+1) | D344SRF (0+92) | |
| TX1330 (91+1) | E1162 (0+92) | |
| Com15 (90+2) | TX82 (0+92) | |
| E980 (84+8) | TX0133 (0+92) | |
| TX16 (1+91) | ||
| 1230933 (1+91) | ||
| 1231410 (1+91) | ||
| E1071 (2+90) | ||
| E1636 (2+90) | ||
| E1679 (4+88) | ||
| U0317 (4+88) | ||
| 1231502 (4+88) | ||
| E1039 (5+87) | ||
| 1231501 (5+87) |
Numbers in parentheses are given as number of genes which grouped with the community-associated clade plus the number of genes that grouped with the hospital-associated clade.
Molecular clock/time of divergence estimates based on sSNP analysis within and between clades.
| Based on the mutation rate of | |||
| Generations/yr | Hospital Clade | Community Clade | Two clades |
| 100 | 380,535 | 929,036 | 3,617,453 |
| 200 | 190,267 | 464,518 | 1,808,726 |
| 300 | 126,845 | 309,679 | 1,205,818 |