| Literature DB >> 20398277 |
Willem van Schaik1, Janetta Top, David R Riley, Jos Boekhorst, Joyce E P Vrijenhoek, Claudia M E Schapendonk, Antoni P A Hendrickx, Isaäc J Nijman, Marc J M Bonten, Hervé Tettelin, Rob J L Willems.
Abstract
BACKGROUND: The Gram-positive bacterium Enterococcus faecium is an important cause of nosocomial infections in immunocompromized patients.Entities:
Mesh:
Year: 2010 PMID: 20398277 PMCID: PMC2858755 DOI: 10.1186/1471-2164-11-239
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Origins of the strains of which the genomes were sequenced.
| Strain | Source | MLST sequence type | CC17 | Year of isolation | Country |
|---|---|---|---|---|---|
| E980 | Faecal isolate collected during community surveillance program [ | 94 | No | 1998 | NLD |
| E1039 | Faecal isolate collected during community surveillance program [ | 42 | No | 1998 | NLD |
| E1071 | Faecal isolate. Hospital surveillance program, hospitalized patient. | 32 | No | 2000 | NLD |
| E1162 | Clinical isolate (bloodstream infection). | 17 | Yes | 1997 | FRA |
| E1636 | Clinical isolate (bloodstream infection). | 106 | No | 1961 | NLD |
| E1679 | Clinical isolate from a vascular catheter. | 114 | No | 1998 | BRA |
| U0317 | Clinical isolate (urinary tract infection). | 78 | Yes | 2005 | NLD |
Country abbreviations. NLD: The Netherlands; FRA: France; BRA: Brazil.
General features of the E. faecium genomes.
| Features | E980 | E1039 | E1071 | E1162 | E1636 | E1679 | U0317 |
|---|---|---|---|---|---|---|---|
| Coverage of sequence | 19.1× | 24.4× | 19.9× | 20.0× | 19.3× | 16.7× | 17.0× |
| Average sequence read length | 240.0 | 246.0 | 254.3 | 256.7 | 237.6 | 239.8 | 235.6 |
| Number of contigs | 131 | 124 | 96 | 139 | 223 | 340 | 227 |
| Number of assembled bases | 2792626 | 2503230 | 2700770 | 2711396 | 2838335 | 2928184 | 2893029 |
| Contig N50 (bp) | 52826 | 46767 | 91386 | 50721 | 38772 | 20200 | 31583 |
| Largest contig (bp) | 145466 | 151666 | 228099 | 153370 | 152616 | 150593 | 162623 |
| G + C content, % | 38.1 | 38.0 | 37.9 | 38.0 | 37.8 | 37.7 | 37.7 |
| Protein coding DNA, % | 86.4 | 85.0 | 85.9 | 86.3 | 85.9 | 85.7 | 85.8 |
| Number of predicted CDS | 2869 | 2587 | 2714 | 2694 | 2940 | 3043 | 2965 |
| Average gene length | 841 | 822 | 855 | 869 | 830 | 824 | 837 |
| Bases with quality score <20, % | 0.051 | 0.035 | 0.027 | 0.034 | 0.104 | 0.196 | 0.155 |
Figure 1Phylogenomic analysis of . Panel (A): Unrooted maximum likelihood tree of E. faecium based on the concatenated alignments of 649 orthologous proteins (containing 11639 residues). Bootstrap values are based on 1000 permutations. Panel (B): Unrooted neighbour-joining tree depicting differences in gene content in E. faecium. The distance matrix used to generate this phylogenomic tree is provided as Additional file 7.
Figure 2Analysis of . Estimates of new gene discovery rate (panel A), total E. faecium gene pool (pan-genome) (panel B) and core-genome size (panel C) are shown for increasing values of the number n of E. faecium genomes sequenced. Colored circles represent the number of new or core genes present when a particular genome is added to the subset of the remaining genomes. Grey circles represent the total gene repertoire for a random collection of organisms. Medians of the distributions are indicated by red diamonds. The curve for the estimation of the gene discovery rate is a least squares fit of the power law Fnew(n) = κnew n-to medians. The curve for the estimation of the size of the E. faecium pan-genome is a least squares power law (Fpan(n) = κpan nγ) fit to the medians. The size of the core genome was estimated by fitting the exponential curve Fcore(n) = κcore exp[-n/τcore] + tgcore(θ) to medians.
CDS of bacteriophage origin in the sequenced genomes
| Strain | Number of CDS of bacteriophage origin | Number of genome-specific CDS of bacteriophage origin |
|---|---|---|
| E980 | 148 (5.2) | 89 (3.1) |
| E1039 | 115 (4.4) | 56 (2.2) |
| E1071 | 63 (2.3) | 40 (1.5) |
| E1162 | 86 (3.2) | 29 (1.1) |
| E1636 | 101 (3.4) | 32 (1.1) |
| E1679 | 149 (4.9) | 60 (2.0) |
| U0317 | 117 (3.9) | 32 (1.1) |
Values in brackets denote the percentage of CDS relative to the total number of CDS in the genome. A CDS was scored as unique for a specific genome when the encoded protein did not have a BLAST-hit with 90% amino acid identity or higher in any of the other sequenced isolates.
Figure 3Transmisssion electron micrographs of phage particles released by the indicated . Scale bars are 50 nm.
Figure 4Sequence alignment of the . The red-colored bands represent matching regions between the esp PAIs. A small blue band represents the inversion of a transposase fragment between E. faecium E1162 and E1679. Arrows indicate CDS and direction of transcription. The yellow arrows indicate the fragment of the esp PAI, that is identical to a fragment of the PAI of E. faecalis MMH594 [46]. The green arrows indicate the CDS that are homologous to EfaB5. The purple arrows in E1679 indicate the CDS predicted to encode the pathway of inositol metabolism. Orange arrows indicate transposases and integrases. White arrows indicate flanking genes that are not part of the esp PAI. The positions of the rpsI, tuf and esp genes are indicated.
Figure 5Transfer of the . Panel (A): representative ethidium bromide stained PFGE gel of SmaI-digested chromosomal DNA of the donor strain (E1162Δesp; lane 1), the recipient strain (BM4105RF; lane 2) and a transconjugant (lane 3). The gel band that has shifted in the recipient strain due to the insertion of the esp PAI is indicated by the white arrow. Panel (B): Southern blots of the PFGE gel hybridized with a probe for esp and a probe covering the insertion site of the esp PAI. Panel (C): PCR analysis of the esp PAI insertion site in the donor strain (E1162Δesp), the recipient strain (BM4105RF) and a transconjugant (TC). PCRs were performed with primers covering the esp PAI insertion site (reactions A), covering the 5' end of the esp PAI and the 5' flanking end of the insertion site (reactions B), and covering the 3' end of the esp PAI and the 3' flanking end of the insertion site (reactions C). The marker (M) is Invitrogen's 1 Kb Plus DNA Ladder. Note that, because of the size of the esp PAI, no products can be obtained in PCRs with primers on the 5' and 3' flanking regions of the esp PAI integration site in esp+ strains.