High-Quality Whole-Genome Sequences for 77 Shiga Toxin-Producing Escherichia coli Strains Generated with PacBio Sequencing.

Pooja N Patel, Rebecca L Lindsey, Lisley Garcia-Toledo, Lori A Rowe, Dhwani Batra, Samuel W Whitley, Daniel Drapeau, Devon Stoneburg, Haley Martin, Phalasy Juieng, Vladimir N Loparev, Nancy Strockbine.

Abstract

Shiga toxin-producing Escherichia coli (STEC) is an enteric foodborne pathogen that can cause mild to severe illness. Here, we report the availability of high-quality whole-genome sequences for 77 STEC strains generated using the PacBio sequencing platform.

Year:  2018        PMID: 29748405      PMCID: PMC5946054          DOI: 10.1128/genomeA.00391-18

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Shiga toxin-producing Escherichia coli (STEC) is a major foodborne pathogen responsible for outbreaks and sporadic cases of diarrheal illness (1). Although the majority of reported STEC infections in the United States are caused by E. coli O157:H7, non-O157 serotypes have grown to be a public health concern both in the United States and internationally, as they can cause severe illness comparable to that caused by STEC O157 (2, 3). Non-O157 STEC has been linked to a range of clinical illnesses, from asymptomatic shedding and mild diarrhea to hemorrhagic colitis and potentially fatal hemolytic-uremic syndrome (HUS); more than 100 STEC serotypes have been linked to such human disease (4). Many of these non-O157 serotypes do not have publicly available PacBio-sequenced genomes. Here, we report whole-genome sequences for 77 STEC strains representing 43 serotypes.

The STEC cultures were grown overnight on blood agar plates at 37°C, and genomic DNA was extracted according to the manufacturer’s protocol (ArchivePure; 5 Prime, Gaithersburg, MD). The DNA was sheared to 20 kb using needle shearing, and the prepared libraries were further size selected using BluePippin (Sage Scientific, Beverly, MA). The large SMRTbell libraries were generated using standard library protocols of the Pacific Biosciences DNA template preparation kit (Pacific Biosciences, Menlo Park, CA). Each strain was sequenced using one, two, or three single-molecule real-time (SMRT) cells. The finished libraries were bound to proprietary P6 version 2 polymerase and sequenced on a PacBio RS II platform using C4 chemistry for 360-min movies. The sequence reads were then filtered and assembled de novo using Falcon, Canu, or the PacBio Hierarchical Genome Assembly Process version 3 (5–7). For 30 strains, whole-genome optical maps were generated using the Argus platform (OpGen, Gaithersburg, MD), and the sequence order was verified using corresponding AflII and NcoI whole-genome maps.
The detected serotypes, accession numbers, and assembly metrics for each genome are listed in Table 1. The average G+C content for all 77 chromosomal sequences was 50.6%. Coverage ranged from 39.5× to 230.8×, with a mean of 109×. All but nine chromosomal sequences were circularized and had overlapping ends. Of the nine genomes that could not be circularized because of collapsed or unresolved repeats, five (2014C-3741, 2014C-3716, 89-3506, 2013C-3996, and 2013C-3304) yielded a single chromosomal sequence, and the remaining four (2013C-3925, 03-3375, 2014C-4638, and 2012C-4196) had two or more chromosomal contigs. The average genome size of the 73 isolates with a single chromosomal sequence was 5,287,902 bp, ranging from 4,717,123 to 5,858,766 bp. The genomes contained up to seven plasmids each; four strains carried none (Table 1).
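The G+C content and coverage figures above are simple ratios. As a minimal sketch of how such values could be computed (the paper does not describe its own calculation, so the function names `gc_content` and `mean_coverage` and the toy inputs are assumptions for illustration only):

```python
def gc_content(sequence: str) -> float:
    """Return the G+C percentage of a nucleotide sequence.

    Only A, C, G, and T are counted; ambiguous symbols such as N are ignored.
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    total = gc + seq.count("A") + seq.count("T")
    if total == 0:
        raise ValueError("sequence contains no A, C, G, or T bases")
    return 100.0 * gc / total


def mean_coverage(read_lengths, genome_size: int) -> float:
    """Return average depth: total sequenced bases divided by genome size."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return sum(read_lengths) / genome_size


# Toy inputs, not data from the paper.
toy_gc = gc_content("GGCCAATT")
toy_depth = mean_coverage([10_000] * 50, 5_000)
```

In practice the sequence would be read from a FASTA file and the read lengths from the sequencing output; both are left out here to keep the sketch self-contained.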
TABLE 1

Accession numbers and assembly metrics of 77 STEC whole-genome sequences

Escherichia coli strain ID^a | Serotype | Chromosomal GenBank accession no. | No. of contigs | Chromosome size (bp) | Associated plasmid size(s) (bp) (GenBank accession no.)
2015C-3163 | O103:H2 | CP027219 | 2 | 5,500,189 | 94,104 (CP027220)
2015C-3101 | O111:H8 | CP027221 | 3 | 5,313,278 | 48,390 (CP027222), 72,543^c (CP027223)
2015C-3108 | O111:H8 | CP027307 | 3 | 5,364,442 | 63,664^c (CP027308), 93,724 (CP027309)
2014C-4135 | O113:H21 | CP027310 | 2 | 4,949,048 | 133,438 (CP027311)
2013C-3181 | O113:H21 | CP027312 | 1 | 5,167,951 | No plasmids
2014C-3550 | O118:H16 | CP027313 | 4 | 5,549,395 | 59,928 (CP027314), 88,840 (CP027315), 179,514 (CP027316)
2015C-3107 | O121:H19 | CP027317 | 2 | 5,388,260 | 81,954 (CP027318)
2014C-3084 | O145:H28 | CP027319 | 4 | 4,717,123 | 78,854^c (CP027320), 84,276^c (CP027321), 706,680 (CP027322)
2013C-3033 | O146:H21 | CP027323 | 2 | 5,426,201 | 127,667 (CP027324)
2013C-4830 | O165:H25 | CP027325 | 3 | 5,135,675 | 74,671 (CP027326), 93,170 (CP027327)
2014C-3741 | O174:H8 | CP027328 | 3 | 5,394,679^c | 128,345^c (CP027329), 102,897^c (CP027330)
2013C-3277 | O26:H11 | CP027331 | 4 | 5,438,694 | 20,839^c (CP027332), 46,866^c (CP027333), 181,066^c (CP027334)
2014C-3716 | O26:H11 | CP027335 | 3 | 5,568,215^c | 144,060^c (CP027336), 62,870^c (CP027337)
2013C-3925 | O5:H9 | PVMF00000000 | 6 | 331,062^c, 4,413,627^c, 496,511^c | 77,933^c, 170,283^c, 99,102^c
2014C-3051 | O71:H11 | CP027338 | 2 | 5,597,475 | 92,644 (CP027339)
2015C-3121 | O91:H14 | CP027340 | 2 | 5,366,577 | 104,198^c (CP027341)
2014C-4587 | OUND:H19 | CP027342 | 2 | 5,040,163 | 131,410 (CP027343)
2014C-3946 | O111:H8 | CP027344 | 3 | 5,264,938 | 22,197^c (CP027345), 18,123^c (CP027346)
2013C-4361 | O111:H8 | CP027347 | 2 | 5,317,846 | 70,613^c (CP027348)
2014C-3655 | O121:H19 | CP027351 | 2 | 5,442,537 | 97,117^c (CP027350)
2012C-4606 | O26:H11 | CP027352 | 3 | 5,647,195 | 20,881^c (CP027353), 57,720^c (CP027354)
2013C-4390 | O76:H19 | CP027484 | 2 | 5,353,719 | 147,394 (CP027485)
2013C-4991 | O80:H2 | CP027355 | 4 | 5,367,251 | 71,714^c (CP027356), 131,463^c (CP027357), 110,001 (CP027358)
2014C-4639 | O26:H11 | CP027361 | 3 | 5,325,246 | 54,873^c (CP027359), 329,873 (CP027360)
95-3192 | O145:H28 | CP027362 | 1 | 5,385,516 | No plasmids
88-3001 | O165:H25 | CP027363 | 2 | 5,195,753 | 74,659 (CP027364)
89-3156 | O174:H21 | CP027366 | 2 | 5,065,883 | 125,561 (CP027367)
03-3375 | O145:H25 | PVMG00000000 | 4 | 5,199,239^c, 40,965^c | 30,901^c, 83,963^c
2014C-3307 | O178:H19 | CP027368 | 3 | 4,965,987 | 109,641 (CP027369), 176,149^c (CP027370)
2015C-3905 | O181:H49 | CP027371 | 2 | 4,901,620 | 175,427^c (CP027372)
2014C-4638 | O26:H11 | PVMH00000000 | 4 | 261,681^c, 2,112,842^c, 3,317,231^c | 88,223^c
05-3629 | O8:H16 | CP027373 | 3 | 4,904,151 | 91,648 (CP027374), 118,863^c (CP027375)
2013C-4404 | O91:H14 | CP027376 | 4 | 5,009,822 | 70,152 (CP027377), 113,102^c (CP027378), 104,889 (CP027379)
2013C-3250 | O111:H8 | CP027380 | 6 | 5,401,672 | 24,547^c (CP027381), 36,491^c (CP027382), 73,784^c (CP027383), 27,224^c (CP027384), 118,259 (CP027385)
2014C-3057 | O26:H11 | CP027387 | 2 | 5,645,983 | 54,452^c (CP027386)
2011C-4251 | O45:H2 | CP027388 | 2 | 5,440,026 | 68,062^c (CP027389)
2015C-4944 | O26:H11 | CP027390 | 2 | 5,802,748 | 98,724 (CP027391)
97-3250 | O26:H11 | CP027599 | 3 | 5,942,969 | 120,604 (CP027600), 92,590^c (CP027601)
2014C-3599 | O121:H19 | CP027435 | 2 | 5,400,138 | 83,611 (CP027436)
2012C-4221^b | O101:H6 | CP027437 | 3 | 5,012,557 | 74,904 (CP027438), 107,188 (CP027439)
2012C-4502 | O185:H28 | CP027440 | 2 | 4,892,666 | 173,714 (CP027441)
2013C-3252 | O69:H11 | CP027442 | 3 | 5,636,732 | 95,157 (CP027443), 91,399 (CP027444)
2013C-3492^b | O172:H25 | CP027445 | 2 | 5,196,105 | 74,269 (CP027446)
2014C-3075 | O36:H42 | CP027447 | 2 | 5,168,620 | 170,848 (CP027448)
2014C-3097^b | O181:H49 | CP027449 | 3 | 5,077,228 | 34,867 (CP027450), 173,649 (CP027451)
2014C-3338^b | O183:H18 | CP027452 | 2 | 4,799,014 | 159,611 (CP027453)
2014C-4423^b | O121:H19 | CP027454 | 3 | 5,338,915 | 73,262 (CP027455), 79,682 (CP027456)
88-3493^b | O137:H41 | CP027457 | 2 | 5,001,754 | 107,796 (CP027458)
90-3040^b | O172:H25 | CP027459 | 2 | 5,253,712 | 74,247 (CP027460)
95-3322^b | O22:H5 | CP027461 | 1 | 5,095,223 | No plasmids
07-4299^b | O130:H11 | CP027462 | 2 | 4,847,172 | 125,059^c (CP027463)
2013C-4248 | O186:H2 | CP027464 | 8 | 5,243,827 | 113,063 (CP027465), 10,950^c (CP027466), 62,602 (CP027467), 97,439^c (CP027468), 62,881^c (CP027469), 80,206 (CP027470), 243,267 (CP027471)
2014C-3050^b | O118:H16 | CP027472 | 2 | 5,671,594 | 81,624^c (CP027473)
89-3506^b | O126:H27 | CP027520 | 3 | 5,178,386^c | 160,231^c (CP027521), 93,253^c (CP027522)
2013C-3264^b | O103:H25 | CP027544 | 2 | 5,486,407 | 101,089 (CP027545)
2013C-4187^b | O71:H11 | CP027546 | 2 | 5,509,931 | 95,367 (CP027547)
2014C-3061^b | O156:H25 | CP027548 | 2 | 5,303,935 | 94,116 (CP027549)
2014C-4705^b | O112:H21 | CP027640 | 2 | 5,329,029 | 126,957 (CP027641)
2015C-4136CT1^b | O145:H34 | CP027550 | 2 | 4,836,918 | 162,810 (CP027551)
2015C-4498^b | O117:H8 | CP027552 | 2 | 5,434,442 | 67,055 (CP027553)
2013C-3513^b | O186:H11 | CP027555 | 3 | 5,584,939 | 70,129^c (CP027554), 91,046^c (CP027556)
2013C-3996 | O26:H11 | CP027572 | 2 | 5,858,766^c | 96,937 (CP027571)
2013C-4081^b | O111:H8 | CP027573 | 4 | 5,411,943 | 48,183 (CP027574), 95,952^c (CP027575), 78,427 (CP027576)
2013C-4225^b | O103:H11 | CP027577 | 2 | 5,646,446 | 87,714^c (CP027578)
2013C-4282^b | O77:H45 | CP027579 | 3 | 5,030,044 | 54,544^c (CP027580), 118,822 (CP027581)
2013C-4538^b | O118:H16 | CP027582 | 2 | 5,680,428 | 88,339^c (CP027583)
2014C-3003^b | O76:H19 | CP027672 | 3 | 5,234,640 | 88,529^c (CP027673), 133,420 (CP027674)
2015C-3125^b | O145:H28 | CP027763 | 3 | 5,471,132 | 66,944^c (CP027764), 66,388 (CP027765)
00-3076^b | O113:H21 | CP027584 | 2 | 4,997,979 | 160,576 (CP027585)
2012C-4196 | O145:H25 | PVZZ00000000 | 5 | 3,847,435^c, 1,375,699^c | 26,290^c, 111,344^c, 65,126^c
2012EL-2448^b | O91:H14 | CP027586 | 1 | 5,272,286 | No plasmids
2013C-4974^b | O5:H9 | CP027587 | 2 | 5,235,560 | 58,109^c (CP027588)
2014C-3011^b | O177:H25 | CP027591 | 4 | 5,168,350 | 75,065^c (CP027589), 92,449^c (CP027590), 17,880^c (CP027592)
2013C-3304 | O71:H8 | CP027593 | 4 | 5,309,950^c | 14,119^c (CP027594), 36,845^c (CP027595), 87,855 (CP027596)
86-3153^b | O5:H9 | CP027597 | 2 | 5,342,528 | 74,505^c (CP027598)
88-3510^b | O172:H25 | CP027675 | 2 | 5,140,386 | 65,738^c (CP027676)
2013C-3342 | O117:H8 | CP027766 | 2 | 5,489,451 | 66,545 (CP027767)

^a ID, identification.

^b Strain for which an optical map was generated and used to confirm the sequence order.

^c A linear sequence that could not be circularized due to unresolved or collapsed repeats.
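In the rows of Table 1, the contig count appears to equal the number of chromosomal sequences plus the number of plasmids. A minimal sketch of checking that relationship on a few rows copied from the table follows; the `Assembly` record and the helper names are hypothetical and are not part of the published data or any tool used by the authors.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Assembly:
    """One Table 1 row (hypothetical record layout for illustration)."""
    strain: str
    n_contigs: int
    chromosome_sizes: List[int]
    plasmid_sizes: List[int] = field(default_factory=list)


# Four rows transcribed from Table 1 (sizes in bp).
ROWS = [
    Assembly("2015C-3163", 2, [5_500_189], [94_104]),
    Assembly("2013C-3181", 1, [5_167_951], []),
    Assembly("2013C-3925", 6, [331_062, 4_413_627, 496_511],
             [77_933, 170_283, 99_102]),
    Assembly("2013C-4248", 8, [5_243_827],
             [113_063, 10_950, 62_602, 97_439, 62_881, 80_206, 243_267]),
]


def contig_count_consistent(row: Assembly) -> bool:
    """True if contigs == chromosomal sequences + plasmids."""
    return row.n_contigs == len(row.chromosome_sizes) + len(row.plasmid_sizes)


def total_size(row: Assembly) -> int:
    """Sum of all chromosomal and plasmid sequence lengths in bp."""
    return sum(row.chromosome_sizes) + sum(row.plasmid_sizes)


inconsistent = [r.strain for r in ROWS if not contig_count_consistent(r)]
```

An empty `inconsistent` list means every transcribed row satisfies the relationship; the same check could be run over all 77 rows once they are loaded from the table.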


Accession number(s).

The whole-genome sequences have been deposited in DDBJ/ENA/GenBank under the accession numbers listed in Table 1. The versions described in this paper are the first versions.
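Records deposited in GenBank can generally be retrieved by accession through the NCBI E-utilities `efetch` endpoint. The sketch below only builds the request URL for a FASTA download; the helper name `efetch_fasta_url` is an assumption for illustration, and the accession shown is the first chromosomal entry in Table 1.

```python
from urllib.parse import urlencode

# Base URL of the NCBI E-utilities efetch service.
EFETCH_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_fasta_url(accession: str) -> str:
    """Build an efetch URL that requests a nucleotide record as FASTA text."""
    params = {
        "db": "nuccore",      # nucleotide database
        "id": accession,      # e.g. a CP0xxxxx accession from Table 1
        "rettype": "fasta",
        "retmode": "text",
    }
    return f"{EFETCH_BASE}?{urlencode(params)}"


url = efetch_fasta_url("CP027219")
```

The resulting URL can be passed to `urllib.request.urlopen` or a command-line downloader. Bulk retrieval should respect NCBI's request-rate guidance.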
REFERENCES

1.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.
Authors:  Chen-Shan Chin; David H Alexander; Patrick Marks; Aaron A Klammer; James Drake; Cheryl Heiner; Alicia Clum; Alex Copeland; John Huddleston; Evan E Eichler; Stephen W Turner; Jonas Korlach
Journal:  Nat Methods       Date:  2013-05-05

2.  Phased diploid genome assembly with single-molecule real-time sequencing.
Authors:  Chen-Shan Chin; Paul Peluso; Fritz J Sedlazeck; Maria Nattestad; Gregory T Concepcion; Alicia Clum; Christopher Dunn; Ronan O'Malley; Rosa Figueroa-Balderas; Abraham Morales-Cruz; Grant R Cramer; Massimo Delledonne; Chongyuan Luo; Joseph R Ecker; Dario Cantu; David R Rank; Michael C Schatz
Journal:  Nat Methods       Date:  2016-10-17

3.  Non-O157 Shiga toxin-producing Escherichia coli infections in the United States, 1983-2002.
Authors:  John T Brooks; Evangeline G Sowers; Joy G Wells; Katherine D Greene; Patricia M Griffin; Robert M Hoekstra; Nancy A Strockbine
Journal:  J Infect Dis       Date:  2005-09-14

4.  The emerging clinical importance of non-O157 Shiga toxin-producing Escherichia coli.
Authors:  Kristine E Johnson; Cheleste M Thorpe; Cynthia L Sears
Journal:  Clin Infect Dis       Date:  2006-11-09

5.  The non-O157 shiga-toxigenic (verocytotoxigenic) Escherichia coli; under-rated pathogens.
Authors:  Karl A Bettelheim
Journal:  Crit Rev Microbiol       Date:  2007

6.  Food-related illness and death in the United States.
Authors:  P S Mead; L Slutsker; V Dietz; L F McCaig; J S Bresee; C Shapiro; P M Griffin; R V Tauxe
Journal:  Emerg Infect Dis       Date:  1999 Sep-Oct

7.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.
Authors:  Sergey Koren; Brian P Walenz; Konstantin Berlin; Jason R Miller; Nicholas H Bergman; Adam M Phillippy
Journal:  Genome Res       Date:  2017-03-15
