Literature DB >> 25125650

A Gapless, Unambiguous Genome Sequence of the Enterohemorrhagic Escherichia coli O157:H7 Strain EDL933.

Haythem Latif1, Howard J Li1, Pep Charusanti1, Bernhard Ø Palsson1, Ramy K Aziz2.   

Abstract

Escherichia coli EDL933 is the prototypic strain for enterohemorrhagic E. coli serotype O157:H7, associated with deadly food-borne outbreaks. Because the publicly available sequence of the EDL933 genome has gaps and >6,000 ambiguous base calls, we here present an updated high-quality, unambiguous genome sequence with no assembly gaps.
Copyright © 2014 Latif et al.

Entities:  

Year:  2014        PMID: 25125650      PMCID: PMC4132626          DOI: 10.1128/genomeA.00821-14

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Escherichia coli serotype O157: H7, a causative agent in food poisoning outbreaks leading to hemorrhagic colitis or hemolytic uremic syndrome, gained public attention following its association with an outbreak in 1993 related to the U.S. fast-food chain Jack-in-the-Box (1, 2) and another large outbreak among schoolchildren in Sakai, Japan in 1996 (3). Moreover, E. coli O157:H7 strains are known for their prophage-rich genomes and are currently considered the bacterial genomes with the largest number of integrated phages (4, 5). Strain EDL933 (ATCC 43895), isolated from ground beef linked to a massive hamburger outbreak in Michigan, USA in 1982 (6), is the prototypic reference strain representing this pathotype. Although the full genome of EDL933 was sequenced and published in 2001 (5), the deposited assembled genome has >6,000 ambiguous base calls and a chromosomal gap of 4,000 bp. While the utility of this reference genome, cited in >3,200 publications, is indisputable, several analyses reliant on a pristine reference (e.g., single nucleotide polymorphism studies) are hindered by those ambiguities and gaps. EDL933 has long phage-associated repeat regions >7 kb. Microbial genomes with these characteristics are the most complex to assemble (7), so we resorted to single-molecule sequencing using PacBio followed by polishing using Illumina short-reads to complete the EDL933 sequence. This produced a gapless genome assembly, with no ambiguous base calls, and an updated genome annotation. Genomic DNA from the EDL933 strain was prepared for PacBio and Illumina sequencing. PacBio libraries were prepared according to standard library preparation procedures with Blue Pippen size selection for >20-kb fragments and sequenced using P5/C3 chemistry and 3 h movies on the RS II system at the UCSD Genomics Core, San Diego, CA. Illumina libraries were prepared according to the TruSeq DNA PCR-Free sample preparation kit protocol (Illumina) and paired-end sequenced (2×250) on a MiSeq. SMRTAnalaysis 2.2.0 HGAP v2 assembly of PacBio reads (66,927) produced three polished contigs: two corresponding to the chromosome and one the plasmid. When compared with the reference, NC_002655, a region of high read density within the two chromosomal contigs was shown to be a large duplication that unites the two contigs. After the plasmid and chromosome were circularized, reads were mapped back to the assembled sequences to check for variants by first using Bridge Mapper (RS_BridgeMapper.1) with PacBio reads and then Breseq v0.24rc6 (8) with Illumina short reads. Coverage was ~100× for PacBio data and ~300 for Illumina data. The final assembled genome was automatically annotated, then manually corrected, through the RAST server using SEED annotation tools (9, 10). The updated EDL933 genome consists of a 5,547,323-bp chromosome and a 92,076-bp plasmid compared with 5,528,445 and 92,077 bp in the current EDL933 assembly. This gapless assembly eliminates 6,641 ambiguous base calls in the current EDL933 chromosome including 2,413 non-N ambiguous bases and 4,000 N’s belonging to a chromosomal gap. The updated genome has 5,675 and 97 annotated coding sequences (CDSs) compared with 5,286 and 99 CDSs found in the current reference chromosome and plasmid, respectively.

Nucleotide sequence accession numbers.

This whole-genome shotgun project has been deposited in GenBank under the accession numbers CP008957 and CP008958. The RAST version is available at http://rast.nmpdr.org under job 157998.
  10 in total

Review 1.  Escherichia coli O157.

Authors:  Hugh Pennington
Journal:  Lancet       Date:  2010-10-23       Impact factor: 79.321

Review 2.  Prophage genomics.

Authors:  Carlos Canchaya; Caroline Proux; Ghislain Fournous; Anne Bruttin; Harald Brüssow
Journal:  Microbiol Mol Biol Rev       Date:  2003-06       Impact factor: 11.056

3.  Genome sequence of enterohaemorrhagic Escherichia coli O157:H7.

Authors:  N T Perna; G Plunkett; V Burland; B Mau; J D Glasner; D J Rose; G F Mayhew; P S Evans; J Gregor; H A Kirkpatrick; G Pósfai; J Hackett; S Klink; A Boutin; Y Shao; L Miller; E J Grotbeck; N W Davis; A Lim; E T Dimalanta; K D Potamousis; J Apodaca; T S Anantharaman; J Lin; G Yen; D C Schwartz; R A Welch; F R Blattner
Journal:  Nature       Date:  2001-01-25       Impact factor: 49.962

4.  Cloning and sequencing of the gene encoding universal stress protein from Escherichia coli O157:H7 isolated from Jack-in-a-Box outbreak.

Authors:  J Chen; M W Griffiths
Journal:  Lett Appl Microbiol       Date:  1999-08       Impact factor: 2.858

5.  Massive outbreak of Escherichia coli O157:H7 infection in schoolchildren in Sakai City, Japan, associated with consumption of white radish sprouts.

Authors:  H Michino; K Araki; S Minami; S Takaya; N Sakai; M Miyazaki; A Ono; H Yanagawa
Journal:  Am J Epidemiol       Date:  1999-10-15       Impact factor: 4.897

6.  Hemorrhagic colitis associated with a rare Escherichia coli serotype.

Authors:  L W Riley; R S Remis; S D Helgerson; H B McGee; J G Wells; B R Davis; R J Hebert; E S Olcott; L M Johnson; N T Hargrett; P A Blake; M L Cohen
Journal:  N Engl J Med       Date:  1983-03-24       Impact factor: 91.245

7.  Identification of mutations in laboratory-evolved microbes from next-generation sequencing data using breseq.

Authors:  Daniel E Deatherage; Jeffrey E Barrick
Journal:  Methods Mol Biol       Date:  2014

8.  SEED servers: high-performance access to the SEED genomes, annotations, and metabolic models.

Authors:  Ramy K Aziz; Scott Devoid; Terrence Disz; Robert A Edwards; Christopher S Henry; Gary J Olsen; Robert Olson; Ross Overbeek; Bruce Parrello; Gordon D Pusch; Rick L Stevens; Veronika Vonstein; Fangfang Xia
Journal:  PLoS One       Date:  2012-10-24       Impact factor: 3.240

9.  The RAST Server: rapid annotations using subsystems technology.

Authors:  Ramy K Aziz; Daniela Bartels; Aaron A Best; Matthew DeJongh; Terrence Disz; Robert A Edwards; Kevin Formsma; Svetlana Gerdes; Elizabeth M Glass; Michael Kubal; Folker Meyer; Gary J Olsen; Robert Olson; Andrei L Osterman; Ross A Overbeek; Leslie K McNeil; Daniel Paarmann; Tobias Paczian; Bruce Parrello; Gordon D Pusch; Claudia Reich; Rick Stevens; Olga Vassieva; Veronika Vonstein; Andreas Wilke; Olga Zagnitko
Journal:  BMC Genomics       Date:  2008-02-08       Impact factor: 3.969

10.  Reducing assembly complexity of microbial genomes with single-molecule sequencing.

Authors:  Sergey Koren; Gregory P Harhay; Timothy P L Smith; James L Bono; Dayna M Harhay; Scott D Mcvey; Diana Radune; Nicholas H Bergman; Adam M Phillippy
Journal:  Genome Biol       Date:  2013       Impact factor: 13.583

  10 in total
  28 in total

1.  Escherichia coli O157:H7 Strain EDL933 Harbors Multiple Functional Prophage-Associated Genes Necessary for the Utilization of 5-N-Acetyl-9-O-Acetyl Neuraminic Acid as a Growth Substrate.

Authors:  Nadja Saile; Anja Voigt; Sarah Kessler; Timo Stressler; Jochen Klumpp; Lutz Fischer; Herbert Schmidt
Journal:  Appl Environ Microbiol       Date:  2016-09-16       Impact factor: 4.792

2.  Sorbitol-Fermenting Enterohemorrhagic Escherichia coli O157:H- Isolates from Czech Patients with Novel Plasmid Composition Not Previously Seen in German Isolates.

Authors:  Andreas Bauwens; Monika Marejková; Barbara Middendorf-Bauchart; Rita Prager; Annelene Kossow; Wenlan Zhang; Helge Karch; Alexander Mellmann; Martina Bielaszewska
Journal:  Appl Environ Microbiol       Date:  2017-11-16       Impact factor: 4.792

3.  Draft Genome Sequences of Escherichia coli O157:H7 Strains Rafaela_II (Clade 8) and 7.1_Anguil (Clade 6) from Cattle in Argentina.

Authors:  Ariel Fernando Amadio; Natalia Amigo; Andrea Fabiana Puebla; Marisa Diana Farber; Angel Adrián Cataldi
Journal:  Genome Announc       Date:  2015-06-11

4.  Draft Genome Sequences of Three European Laboratory Derivatives from Enterohemorrhagic Escherichia coli O157:H7 Strain EDL933, Including Two Plasmids.

Authors:  Lea Fellner; Christopher Huptas; Svenja Simon; Anna Mühlig; Siegfried Scherer; Klaus Neuhaus
Journal:  Genome Announc       Date:  2016-04-07

5.  Whole-Genome Sequence of Escherichia coli Serotype O157:H7 Strain EDL932 (ATCC 43894).

Authors:  Gaylen A Uhlich; George C Paoli; Chin-Yi Chen; Bryan J Cottrell; Xinmin Zhang; Xianghe Yan
Journal:  Genome Announc       Date:  2016-07-14

6.  Short-term evolution of Shiga toxin-producing Escherichia coli O157:H7 between two food-borne outbreaks.

Authors:  Lauren A Cowley; Timothy J Dallman; Stephen Fitzgerald; Neil Irvine; Paul J Rooney; Sean P McAteer; Martin Day; Neil T Perry; James L Bono; Claire Jenkins; David L Gally
Journal:  Microb Genom       Date:  2016-09-08

7.  Systems biology-guided identification of synthetic lethal gene pairs and its potential use to discover antibiotic combinations.

Authors:  Ramy K Aziz; Jonathan M Monk; Robert M Lewis; Suh In Loh; Arti Mishra; Amrita Abhay Nagle; Chitkala Satyanarayana; Saravanakumar Dhakshinamoorthy; Michele Luche; Douglas B Kitchen; Kathleen A Andrews; Nicole L Fong; Howard J Li; Bernhard O Palsson; Pep Charusanti
Journal:  Sci Rep       Date:  2015-11-04       Impact factor: 4.379

Review 8.  Tackling Drug Resistant Infection Outbreaks of Global Pandemic Escherichia coli ST131 Using Evolutionary and Epidemiological Genomics.

Authors:  Tim Downing
Journal:  Microorganisms       Date:  2015-05-20

9.  Whole Genome Sequencing for Genomics-Guided Investigations of Escherichia coli O157:H7 Outbreaks.

Authors:  Brigida Rusconi; Fatemeh Sanjar; Sara S K Koenig; Mark K Mammel; Phillip I Tarr; Mark Eppinger
Journal:  Front Microbiol       Date:  2016-06-30       Impact factor: 5.640

10.  Discovery of numerous novel small genes in the intergenic regions of the Escherichia coli O157:H7 Sakai genome.

Authors:  Sarah M Hücker; Zachary Ardern; Tatyana Goldberg; Andrea Schafferhans; Michael Bernhofer; Gisle Vestergaard; Chase W Nelson; Michael Schloter; Burkhard Rost; Siegfried Scherer; Klaus Neuhaus
Journal:  PLoS One       Date:  2017-09-13       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.