Literature DB >> 30795795

Draft genome sequence of Enterococcus faecium SP15, a potential probiotic strain isolated from spring water.

Fauzia Aziz1,2,3, Muhammad Nasim Khan4, Safia Ahmed5, Simon Colin Andrews6.   

Abstract

OBJECTIVES: Enterococci are Gram-positive lactic acid bacteria and common inhabitants of the gastrointestinal tract of mammals, including humans. They are also widely distributed in diverse environments such as soil, water, vegetables and food. Enterococcus faecium is able to produce antimicrobial compounds (enterocins) and thus can act as a probiotic. E. faecium SP15 is a newly identified enterocin-producing strain from spring water that has been subjected to genome sequence analysis to provide understanding of its antimicrobial and probiotic properties. DATA DESCRIPTION: The draft genome of E. faecium SP15 comprises of 2,783,033 bp with a G+C content of 38.08%. Five genetic loci predicted to specify enterocin production were identified, but no virulence factors could be detected and only two potential antibiotic resistance genes were noted.

Entities:  

Keywords:  Draft genome assembly; Enterocin; Enterococcus faecium; Probiotic; Spring water

Mesh:

Year:  2019        PMID: 30795795      PMCID: PMC6387508          DOI: 10.1186/s13104-019-4136-0

Source DB:  PubMed          Journal:  BMC Res Notes        ISSN: 1756-0500


Objective

Enterococci are Gram-positive lactic acid bacteria with a wide environmental distribution, encompassing many species from a wide variety of ecological niches [1]. Enterococcus faecium is a major nosocomial pathogen often causing neonatal meningitis or endocarditis [2]. However, certain strains of E. faecium have beneficial effects on human health due to their probiotic activity [3]. For example, E. faecium T-110 is a syndicate member in several probiotic products including BIO-THREER which is widely prescribed for human, animal and aquacultural use [3] and E. faecium strain L-3 is the principle organism in the probiotic Laminolact [4]. E. faecium is well known for its ability to produce bacteriocins, but there are relatively few reports in the literature on the genome sequence of E. faecium from non-clinical sources [5, 6] and there are no current NCBI database genome entries for E. faecium isolated from natural water sources.

Data description

Enterococcus faecium SP15 was isolated from spring water in Rawalakot (Azad Kashmir, Pakistan), a site of relative isolation with little waste water contamination. The strain exhibited strong antimicrobial activity against a panel of seven indicator strains, including Listeria monocytogenesis, indicative of enterocin production. E. faecium SP15 genomic DNA was extracted using a GeneJET genomic DNA purification kit (Thermofisher Scientific) as recommended by the vendor and assessed using a NanoDrop ND-1000 spectrophotometer and 0.7% agarose gel electrophoresis. Genome sequencing was performed by MicrobesNG (University of Birmingham, UK) using Illumina MiSeq and HiSeq 2500 platforms (Illumina, UK) with 2 × 250 bp paired-end reads. The reads were trimmed using Trimmomatic version 0.30 [7] and the quality was assessed using in-house scripts combined with BWA-MEM software 0.7.16 [8]. De novo assembly was performed with SPAdes software version 3.9.0 [9] and assembly metrics were calculated using QUAST version 2.0 [10]. Gene prediction and annotation were carried out using the Pathosystems Resources Integration Center (PATRIC) web server [11], RAST version 2.0 [12] and the NCBI, PGAP version 4.6 [13]. The tRNA genes were predicted by tRNA scan-SE 2.0 [14]. Antimicrobial mechanisms (e.g. enterocin production) were explored with BAGEL 3 [15] and anti-SMASH V4 [16]. Virulence factors were identified using the virulence factor database, VFDB [17]. Antibiotic resistance gene were identified using the Comprehensive Antibiotic Resistance Database, CARD [18], and acquired resistance genes were predicted by Resistance Finder 3.0 [19]. Contigs were ordered by alignment against the most closely related sequence in GenBank [20] (E. faecium T110, CP006030; 99% identity) using progressive Mauve version 2.4.0 [21]. Intact and incomplete prophage regions were identified through the integrated search and annotation tool, PHAST [22]. Clustered regularly interspaced short palindromic repeat (CRISPR) arrays were identified using CRISPR finder [23]. The draft genome assembly consisted of 121 contigs with a total size of 2,783,033 bp (Table 1). The genome sequence data was at 30× coverage with an N50 of 102,590 bp and mean GC content 38.08%. A total of 2900 protein-encoding genes were predicted of which 2063 were assigned putative functions while 837 remain hypothetical. A total of 63 tRNA structural genes were identified. BAGLE 3 predicted five bacteriocin biosynthetic gene clusters (enterocin-HF, enterocin-P, enterocin SE-K4, enterocin L50A/L50B and enterolysin). No virulence factors (AS, Ace, Acm, Scm, EfaA, EcbA, Esp, Cyl, GelE and SprE) were detected. Two antibiotic resistance genes aac (6′)-li and msrC (98 and 97% identity, respectively) were identified conferring resistance to aminoglycosides, and macrolides and streptogramin B antibiotics, but no acquired resistance determinants were found. Four prophage loci were predicted of which three were intact (Strept_9871, Lactob_phig1e and Staphy_SPbeta (40, 36.9 and 32.7 kb, respectively) and one incomplete (Salmon_SJ46, 17.4 kb). One CRISPR array was identified of 190 bp, containing three spacers with a highly conserved 24 bp DR region, and although two cas gene clusters (cas3_typeI, cas4_typeI-II) were found, these were not associated with the CRISPR array.
Table 1

Overview of data files

LabelName of data file/data setFile types (file extension)Data repository and identifier (DOI or accession number)
Data fileWhole genome shotgun projectFASTADDBJ/ENA/GenBank (accession RDQA00000000)
Overview of data files Two major replicons are apparent: a chromosome of ~ 2,545,000 bp and a plasmid of ~ 149,300 bp (related to plasmid pNB2354 from E. faecium NRRL-B-2354, CP004064).

Limitations

Current data is based on the draft level genome such that the exact length of the genome, and the number of rRNA genes and repetitive elements, cannot be absolutely determined. Furthermore, the genome includes extrachromosomal elements that cannot be predicted precisely.
  23 in total

Review 1.  Enterococci at the crossroads of food safety?

Authors:  C M Franz; W H Holzapfel; M E Stiles
Journal:  Int J Food Microbiol       Date:  1999-03-01       Impact factor: 5.277

2.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

Review 3.  Vancomycin-resistant enterococci: the clinical effect of a common nosocomial pathogen.

Authors:  P K Linden; C B Miller
Journal:  Diagn Microbiol Infect Dis       Date:  1999-02       Impact factor: 2.803

4.  progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement.

Authors:  Aaron E Darling; Bob Mau; Nicole T Perna
Journal:  PLoS One       Date:  2010-06-25       Impact factor: 3.240

5.  PHAST: a fast phage search tool.

Authors:  You Zhou; Yongjie Liang; Karlene H Lynch; Jonathan J Dennis; David S Wishart
Journal:  Nucleic Acids Res       Date:  2011-06-14       Impact factor: 16.971

6.  antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences.

Authors:  Marnix H Medema; Kai Blin; Peter Cimermancic; Victor de Jager; Piotr Zakrzewski; Michael A Fischbach; Tilmann Weber; Eriko Takano; Rainer Breitling
Journal:  Nucleic Acids Res       Date:  2011-06-14       Impact factor: 16.971

7.  VFDB: a reference database for bacterial virulence factors.

Authors:  Lihong Chen; Jian Yang; Jun Yu; Zhijian Yao; Lilian Sun; Yan Shen; Qi Jin
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  The RAST Server: rapid annotations using subsystems technology.

Authors:  Ramy K Aziz; Daniela Bartels; Aaron A Best; Matthew DeJongh; Terrence Disz; Robert A Edwards; Kevin Formsma; Svetlana Gerdes; Elizabeth M Glass; Michael Kubal; Folker Meyer; Gary J Olsen; Robert Olson; Andrei L Osterman; Ross A Overbeek; Leslie K McNeil; Daniel Paarmann; Tobias Paczian; Bruce Parrello; Gordon D Pusch; Claudia Reich; Rick Stevens; Olga Vassieva; Veronika Vonstein; Andreas Wilke; Olga Zagnitko
Journal:  BMC Genomics       Date:  2008-02-08       Impact factor: 3.969

10.  CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats.

Authors:  Ibtissem Grissa; Gilles Vergnaud; Christine Pourcel
Journal:  Nucleic Acids Res       Date:  2007-05-30       Impact factor: 16.971

View more
  1 in total

1.  High-quality genome sequence assembly of R.A73 Enterococcus faecium isolated from freshwater fish mucus.

Authors:  Rim El Jeni; Kais Ghedira; Monia El Bour; Sonia Abdelhak; Alia Benkahla; Balkiss Bouhaouala-Zahar
Journal:  BMC Microbiol       Date:  2020-10-23       Impact factor: 3.605

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.