Literature DB >> 29167245

Complete Genome Sequence of Enterococcus thailandicus Strain a523 Isolated from Urban Raw Sewage.

Gustavo Ybazeta1, Laura Douglas2, Jenna Graham2, Nya L Fraleigh2, Yanal Murad3, Justo Perez4, Francisco Diaz-Mitoma3,5, Kim Tilbe6,3, Reza Nokhbeh1,6,3.   

Abstract

We report here the first complete circularly closed genome sequence of Enterococcus thailandicus strain a523 isolated from raw urban sewage. This genome contains 2,646,250 bp with a G+C content of 36.8%, 2,499 genes, 2,370 protein-coding sequences, 6 rRNA operons, 65 tRNAs, and 6 clustered regularly interspaced short palindromic repeat arrays.
Copyright © 2017 Ybazeta et al.

Entities:  

Year:  2017        PMID: 29167245      PMCID: PMC5701470          DOI: 10.1128/genomeA.01298-17

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

We present here the first full genome sequence of Enterococcus thailandicus isolated from urban raw sewage. Comparison of this genome sequence to previously reported draft genomes of isolates from cattle (1) and fermented sausages showed differences (GenBank accession no. NZ_JXLE00000000). Indexed paired-end shotgun DNA libraries (2 × 125 bp) were prepared using the NEB Ultra II kit (New England Biolabs, Inc.) and sequenced using the Illumina HiSeq 2500 paired-end 125-bp platform. In addition, the Oxford Nanopore MinION sequencing platform (2D sequencing) was used to generate long sequence reads (2). The Unicycler version 0.4.0 (3) and SPAdes version 3.10.1 (4) pipelines were used to assemble the genome. A hybrid alignment strategy was employed to assemble the genome using long reads generated by MinION, which served as scaffolds to accurately determine the order of Illumina HiSeq contigs and to close the genome. A total of 15,845,454 HiSeq reads (125 bp, paired end) and 15,163 MinION long reads were used in the de novo assembly of the E. thailandicus strain a523 genome with 798-fold average coverage per base pair. The closed genome contains 2,646,250 bp with a G+C content of 36.8%. Genome-wide open reading frames (ORFs) were predicted and annotated using Prokka version 1.12 (5) and the NCBI Prokaryotic Genome Automatic Annotation Pipelines (PGAAP), leading to the identification of 2,499 genes, 2,370 coding sequences, and 42 pseudogenes. The genome of E. thailandicus a523 presents 6 rRNA operons, 65 tRNAs, and 1 transfer-messenger RNA. We identified several clustered regularly interspaced short palindromic repeat (CRISPR)-associated (Cas) systems, which included type II CRISPR-cas9, type II-cas1, cas2, and type II A-Csn2. Six arrays of repeats were detected that contained 16, 9, 8, 5, 7, and 6 repeats per array, respectively. The search for prophages, using Phaster (6, 7), identified a chromosomal island corresponding to a prophage sequence of 19.7 kb with a total of nine ORFs, of which six were annotated. We also identified at least four genomic cassettes associated with the production of bacteriocins. Searching within the Virulence Factor Database (VFDB) (8), analysis of this genome appeared to encode a hemolysin-like protein. Alignment of the amino acid sequence of this protein against the NCBI nonredundant protein database produced 99% identity to multispecies hemolysin III from Bacillus cereus. This alignment was further verified in Seaview (9). The Comprehensive Antibiotic Resistance Database (10, 11) was used to identify genes potentially implicated in antimicrobial resistance. This search produced three hits requiring in vitro validation for their activities. These include an aminoglycoside phosphotransferase (APH) and nucleotidyltransferase (ANT), a tetracycline target protection protein, and the lincosamide nucleotidyltransferase Lnu(G). Given the adaptive versatility of enterococci in diverse niches, and the scarcity of E. thailandicus-related publications, releasing this first completed genome will assist researchers in better understanding the biology of E. thailandicus and its role in multibacterial communities.

Accession number(s).

The complete genome sequence for E. thailandicus strain a523 was deposited in GenBank under accession number CP023074.
  10 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  SeaView version 4: A multiplatform graphical user interface for sequence alignment and phylogenetic tree building.

Authors:  Manolo Gouy; Stéphane Guindon; Olivier Gascuel
Journal:  Mol Biol Evol       Date:  2009-10-23       Impact factor: 16.240

3.  The comprehensive antibiotic resistance database.

Authors:  Andrew G McArthur; Nicholas Waglechner; Fazmin Nizam; Austin Yan; Marisa A Azad; Alison J Baylay; Kirandeep Bhullar; Marc J Canova; Gianfranco De Pascale; Linda Ejim; Lindsay Kalan; Andrew M King; Kalinka Koteva; Mariya Morar; Michael R Mulvey; Jonathan S O'Brien; Andrew C Pawlowski; Laura J V Piddock; Peter Spanogiannopoulos; Arlene D Sutherland; Irene Tang; Patricia L Taylor; Maulik Thaker; Wenliang Wang; Marie Yan; Tennison Yu; Gerard D Wright
Journal:  Antimicrob Agents Chemother       Date:  2013-05-06       Impact factor: 5.191

4.  Prokka: rapid prokaryotic genome annotation.

Authors:  Torsten Seemann
Journal:  Bioinformatics       Date:  2014-03-18       Impact factor: 6.937

5.  PHAST: a fast phage search tool.

Authors:  You Zhou; Yongjie Liang; Karlene H Lynch; Jonathan J Dennis; David S Wishart
Journal:  Nucleic Acids Res       Date:  2011-06-14       Impact factor: 16.971

6.  CARD 2017: expansion and model-centric curation of the comprehensive antibiotic resistance database.

Authors:  Baofeng Jia; Amogelang R Raphenya; Brian Alcock; Nicholas Waglechner; Peiyao Guo; Kara K Tsang; Briony A Lago; Biren M Dave; Sheldon Pereira; Arjun N Sharma; Sachin Doshi; Mélanie Courtot; Raymond Lo; Laura E Williams; Jonathan G Frye; Tariq Elsayegh; Daim Sardar; Erin L Westman; Andrew C Pawlowski; Timothy A Johnson; Fiona S L Brinkman; Gerard D Wright; Andrew G McArthur
Journal:  Nucleic Acids Res       Date:  2016-10-26       Impact factor: 16.971

7.  Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.

Authors:  Ryan R Wick; Louise M Judd; Claire L Gorrie; Kathryn E Holt
Journal:  PLoS Comput Biol       Date:  2017-06-08       Impact factor: 4.475

8.  PHASTER: a better, faster version of the PHAST phage search tool.

Authors:  David Arndt; Jason R Grant; Ana Marcu; Tanvir Sajed; Allison Pon; Yongjie Liang; David S Wishart
Journal:  Nucleic Acids Res       Date:  2016-05-03       Impact factor: 16.971

9.  Draft Genome Sequence of an Enterococcus thailandicus Strain Isolated from Bovine Feces.

Authors:  Alicia G Beukers; Rahat Zaheer; Noriko Goji; Shaun R Cook; Kingsley K Amoako; Alexandre V Chaves; Michael P Ward; Tim A McAllister
Journal:  Genome Announc       Date:  2016-07-21

10.  VFDB 2016: hierarchical and refined dataset for big data analysis--10 years on.

Authors:  Lihong Chen; Dandan Zheng; Bo Liu; Jian Yang; Qi Jin
Journal:  Nucleic Acids Res       Date:  2015-11-17       Impact factor: 16.971

  10 in total
  1 in total

1.  Complete Genome Sequences of a Diverse Group of 13 Propionibacterium acnes Bacteriophages Isolated from Urban Raw Sewage.

Authors:  Gustavo Ybazeta; Jenna Graham; Jelena Trifkovic; Lyne Giroux; Mazen Saleh; Syed A Sattar; Reza Nokhbeh
Journal:  Genome Announc       Date:  2018-06-28
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.