| Literature DB >> 27610213 |
Joran E Michiels1, Bram Van den Bergh1, Maarten Fauvart2, Jan Michiels1.
Abstract
Enterococcus faecium, traditionally considered a harmless gut commensal, is emerging as an important nosocomial pathogen showing increasing rates of multidrug resistance. We report the draft genome sequence of E. faecium strain LMG 8148, isolated in 1968 from a human in Gothenburg, Sweden. The draft genome has a total length of 2,697,490 bp, a GC-content of 38.3 %, and 2,402 predicted protein-coding sequences. The isolation of this strain predates the emergence of E. faecium as a nosocomial pathogen. Consequently, its genome can be useful in comparative genomic studies investigating the evolution of E. faecium as a pathogen.Entities:
Keywords: Draft genome; Enterococcus faecium; Gut commensal; Human isolate; Nosocomial pathogen
Year: 2016 PMID: 27610213 PMCID: PMC5015329 DOI: 10.1186/s40793-016-0187-1
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Enterococcus faecium strain LMG 8148 according to the MIGS recommendations [8]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain LMG 8148 | NAS | ||
| Gram stain | Positive | TAS [ | |
| Cell shape | Coccus | TAS [ | |
| Motility | Non-motile | NAS | |
| Sporulation | Non-sporulating | TAS [ | |
| Temperature range | 5-50 °C | TAS [ | |
| Optimum temperature | 37 °C | TAS [ | |
| pH range; Optimum | 4.6-9.9; 7.5 | TAS [ | |
| Carbon source | Glucose, citrate, complex carbon sources | TAS [ | |
| MIGS-6 | Habitat | Gastro-intestinal tracts of humans and other mammals | TAS [ |
| MIGS-6.3 | Salinity | 0-6.5 % | TAS [ |
| MIGS-22 | Oxygen requirement | Facultatively anaerobic | TAS [ |
| MIGS-15 | Biotic relationship | Commensal | TAS [ |
| MIGS-14 | Pathogenicity | Pathogenic | TAS [ |
| MIGS-4 | Geographic location | Sweden | NAS |
| MIGS-5 | Sample collection | 1961 | TAS [ |
| MIGS-4.1 | Latitude | Unknown | NAS |
| MIGS-4.2 | Longitude | Unknown | NAS |
| MIGS-4.4 | Altitude | Unknown | NAS |
aEvidence codes - IDA inferred from direct assay, TAS traceable author statement (i.e., a direct report exists in the literature); NAS: Non-traceable VAuthor Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [26]
Fig. 1Phase-contrast micrograph of E. faecium LMG 8148
Fig. 216S rRNA phylogenetic tree indicating the position of E. faecium LMG 8148 relative to other E. faecium strains and other enterococcal species (type strain = T). Lactobacillus plantarum was included as an outgroup. Genbank accession numbers of the aligned sequences are indicated between brackets. 16S rDNA sequences were aligned using MUSCLE, and the phylogenetic tree was determined using the neighbour-joining algorithm with the Kimura 2-parameter distance model in MEGA (version 7) [27]. A gamma distribution (shape parameter = 1) was used for rate variation among sites. The optimal tree with the sum of branch lengths = 0.1983 is shown, and nodes that appeared in more than 50 % of replicate trees in the bootstrap test (1000 replicates) are marked with their bootstrap support values
Project information
| MIGS-ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | One paired-end Illumina library (Nextera) |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Fold coverage | 317 |
| MIGS-30 | Assemblers | CLC NGS Cell 7.5.1 |
| MIGS-32 | Gene calling method | GeneMarkS+ |
| Locus Tag | AUC59 | |
| Genbank ID | LOHT00000000 | |
| GenBank Date of Release | 2016/02/26 | |
| GOLD ID | - | |
| BIOPROJECT | PRJNA305395 | |
| MIGS-13 | Source Material Identifier | LMG 8148 |
| Project relevance | Evolution |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 2,697,490 | 100.00 |
| DNA coding (bp) | 2,136,945 | 79.22 |
| DNA G + C (bp) | 1,034,256 | 38.34 |
| DNA scaffolds | 366 | 100.00 |
| Total genes | 2,772 | 100.00 |
| Protein coding genes | 2,402 | 86.65 |
| RNA genes | 67 | 2.42 |
| Pseudo genes | 303 | 10.93 |
| Genes in internal clusters | - | - |
| Genes with function prediction | 2,235 | 80.63 |
| Genes assigned to COGs | 2,153 | 77.67 |
| Genes with Pfam domains | 2,078 | 74.96 |
| Genes with signal peptides | 120 | 4.33 |
| Genes with transmembrane helices | 631 | 22.76 |
| CRISPR repeats | 1 | - |
Number of genes associated with general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 150 | 6.24 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | RNA processing and modification |
| K | 185 | 7.70 | Transcription |
| L | 148 | 6.16 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 21 | 0.87 | Cell cycle control, cell division, chromosome partitioning |
| V | 49 | 2.04 | Defense mechanisms |
| T | 88 | 3.66 | Signal transduction mechanisms |
| M | 114 | 4.75 | Cell wall/membrane biogenesis |
| N | 13 | 0.54 | Cell motility |
| U | 27 | 1.12 | Intracellular trafficking and secretion |
| O | 58 | 2.41 | Posttranslational modification, protein turnover, chaperones |
| C | 74 | 3.08 | Energy production and conversion |
| G | 253 | 10.53 | Carbohydrate transport and metabolism |
| E | 144 | 6.00 | Amino acid transport and metabolism |
| F | 78 | 3.25 | Nucleotide transport and metabolism |
| H | 55 | 2.29 | Coenzyme transport and metabolism |
| I | 57 | 2.37 | Lipid transport and metabolism |
| P | 109 | 4.54 | Inorganic ion transport and metabolism |
| Q | 22 | 0.92 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 263 | 10.95 | General function prediction only |
| S | 245 | 10.20 | Function unknown |
| - | 249 | 10.37 | Not in COGs |
The total is based on the total number of protein coding genes in the genome