Literature DB >> 29326203

High-Quality Whole-Genome Sequences for 21 Enterotoxigenic Escherichia coli Strains Generated with PacBio Sequencing.

Peyton Smith1,2, Rebecca L Lindsey3, Lori A Rowe1, Dhwani Batra1, Devon Stripling1, Lisley Garcia-Toledo1,2, Daniel Drapeau1,2, Kristen Knipe1, Nancy Strockbine1.   

Abstract

Enterotoxigenic Escherichia coli (ETEC) is an important diarrheagenic pathogen. We report here the high-quality whole-genome sequences of 21 ETEC strains isolated from patients in the United States, international diarrheal surveillance studies, and cruise ship outbreaks.

Entities:  

Year:  2018        PMID: 29326203      PMCID: PMC5764927          DOI: 10.1128/genomeA.01311-17

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Enterotoxigenic Escherichia coli (ETEC) infections are the leading cause of travelers’ diarrhea and the most common cause of diarrhea among children in developing countries (1). ETEC causes approximately 210 million infections and 380,000 deaths per year (2). ETEC infections in humans are characterized by the production of either or both heat-stable (ST) and heat-labile (LT) enterotoxins (3). Here, we report the availability of high-quality genome sequences for 21 ETEC strains generated by PacBio sequencing. Each of the 21 genomes contained one chromosomal sequence, and 17 of these were determined to be circular with overlapping ends that were trimmed from one end. ETEC genomic DNA was extracted using Archive Pure according to the manufacturer’s protocol (5 Prime, Gaithersburg, MD, USA). The genomic DNA was sheared to 20 kb using needle shearing. These libraries were further size selected using BluePippin (Sage Scientific, Beverly, MA, USA). The sheared DNA was used to generate large SMRTbell libraries using the standard library protocols of the Pacific Biosciences DNA template preparation kit (Pacific Biosciences, Menlo Park, CA, USA). All strains were sequenced using one single-molecule real-time (SMRT) cell, except strain 9276-90, which used two. The finished libraries were bound to proprietary P6 v2 polymerase and sequenced on a PacBio RSII system using C4 chemistry for 360-min movies. Sequence reads were filtered and assembled de novo utilizing the PacBio Hierarchical Genome Assembly Process version 3 (4). Table 1 lists the accession numbers, detected serotype (genotypic), assembly metrics, and reference (if available) for each ETEC whole-genome sequence. A single chromosomal sequence was obtained for all genomes with an average coverage of 102.6× (range, 42.3× to 188.6×). The average G+C content for each chromosomal sequence was 50.68%, ranging from 50.37% to 50.82%. The chromosomal sequences are circular with overlapping ends for all but four genomes. The single chromosomal sequences for isolates 90-9276, 90-9269, F5176C6, and 00-3279 could not be circularized because of unresolved or collapsed repeats. Each ETEC genome contained between one and five plasmids. Table 1 notes whether overlap was found to form circular chromosomal or plasmid contigs.
TABLE 1 

Accession numbers and assembly metrics of 21 ETEC whole-genome sequences

E. coli strain no.(reference)SerotypeChromosomal GenBank accession no.Average coverage (×)Chromosome size (bp)Associated plasmid size (bp)(GenBank accession no.)
2014EL-1345-2O169:H41CP02422392.34,943,397a145,086a (CP024227)
85,864 (CP024226)
27,947 (CP024225)
20,005 (CP024224)
ATCC 43886 or E2539C1 (5, 6)O25:H16CP024256148.14,914,654a95,515a (CP024255)
107,732a (CP024254)
D181 (7)O182:H21CP024252101.64,891,230a46,427 (CP024253)
99,099a (CP024250)
167,230a (CP024249)
35,752 (CP024251)
90-9281 (8)O128:H27CP024243117.54,978,613a152,012a (CP024244)
90-9276 (8)O114:H49CP02429979.85,004,571167,764 (CP024297)
110,577 (CP024298)
90-9280 (8)O114:H49CP024240188.64,966,338a74,644a (CP024242)
104,674 (CP024241)
90-9269 (8)OUND:H4CP024661150.64,759,941160,351a (CP024662)
12,610 (CP024663)
52,270a (CP024664)
96,720 (CP024665)
20,753 (CP024666)
90-9272 (8)O15:H11CP02423948.34,906,680a274,465 (CP024238)
ATCC 43896 (or TX1) (9, 10)O78:H12CP024278107.75,088,038a84,894a (CP024281)
52,655 (CP024280)
28,860 (CP024279)
M9682-C1 (11)O6:H16CP024275156.64,778,550a100,184a (CP024277)
38,177 (CP024276)
B4103-1 (12)O27:H7CP02424583.14,708,118a138,289a (CP024248)
68,864 (CP024247)
36,948 (CP024246)
F6326-C1O169:H41CP02426378.14,934,701a150,389a (CP024265)
72,060 (CP024264)
F5176C6 (13)O167:H5CP02466786.45,069,317167,071a (CP024668)
110,612a (CP024669)
46,919a (CP024670)
31,494 (CP024671)
2014EL-1343-2O25:H16 (genotypic), O25:NM (phenotypic)CP02422891.44,848,034a82,510a (CP024231)
73,915 (CP024230)
38,467 (CP024229)
F5656C1 (14)O6:H16CP02426063.84,733,683a119,846a (CP024262)
45,056a (CP024261)
2014EL-1346-6O6:H16CP02423298.84,872,840a152,713a (CP024237)
226,119a (CP024236)
62,188a (CP024235)
40,223 (CP024234)
30,162 (CP024233)
F9792 (14, 15)O169:H41CP02427392.24,875,605a145,089a (CP024274)
F5505-C1O25:H16CP02425794.34,886,938a94,817a (CP024259)
96,607a (CP024258)
F6699O6:H16CP02426642.34,881,899a95,350 (CP024268)
34,042 (CP024267)
F8111-1SC3 (14)O169:H41CP02426978.64,905,023a147,766a (CP024272)
103,618 (CP024271)
35,769 (CP024270)
00-3279O78:H12CP024293155.0 5,116,480101,907a (CP024294)
103,995a (CP024295)
97,297a (CP024296)

A sequence that is circular with overlapping ends.

Accession numbers and assembly metrics of 21 ETEC whole-genome sequences A sequence that is circular with overlapping ends.

Accession number(s).

This whole-genome shotgun project has been deposited in DDBJ/ENA/GenBank under the accession numbers listed in Table 1. The versions described in this paper are the first versions.
  14 in total

1.  Human diarrheal disease caused by enterotoxigenic Escherichia coli.

Authors:  R B Sack
Journal:  Annu Rev Microbiol       Date:  1975       Impact factor: 15.500

2.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

Authors:  Chen-Shan Chin; David H Alexander; Patrick Marks; Aaron A Klammer; James Drake; Cheryl Heiner; Alicia Clum; Alex Copeland; John Huddleston; Evan E Eichler; Stephen W Turner; Jonas Korlach
Journal:  Nat Methods       Date:  2013-05-05       Impact factor: 28.547

3.  Plasmid-mediated properties of a heat-stable enterotoxin-producing Escherichia coli associated with infantile diarrhea.

Authors:  I K Wachsmuth; S Falkow; R W Ryder
Journal:  Infect Immun       Date:  1976-08       Impact factor: 3.441

4.  Prevalence of enterotoxigenic Escherichia coli-associated diarrhoea and carrier state in the developing world.

Authors:  Christine Wennerås; Valdemar Erling
Journal:  J Health Popul Nutr       Date:  2004-12       Impact factor: 2.000

5.  Endemically acquired foodborne outbreak of enterotoxin-producing Escherichia coli serotype O169:H41.

Authors:  Rose A Devasia; Timothy F Jones; Jennifer Ward; Lila Stafford; Henrietta Hardin; Cheryl Bopp; Mark Beatty; Eric Mintz; William Schaffner
Journal:  Am J Med       Date:  2006-02       Impact factor: 4.965

6.  Outbreaks of enterotoxigenic Escherichia coli infection in American adults: a clinical and epidemiologic profile.

Authors:  C B Dalton; E D Mintz; J G Wells; C A Bopp; R V Tauxe
Journal:  Epidemiol Infect       Date:  1999-08       Impact factor: 2.451

7.  Bacterial, viral and parasitic enteric pathogens associated with acute diarrhea in hospitalized children from northern Jordan.

Authors:  M Youssef; A Shurman; M Bougnoux; M Rawashdeh; S Bretagne; N Strockbine
Journal:  FEMS Immunol Med Microbiol       Date:  2000-07

8.  Epidemic diarrhea at Crater Lake from enterotoxigenic Escherichia coli. A large waterborne outbreak.

Authors:  M L Rosenberg; J P Koplan; I K Wachsmuth; J G Wells; E J Gangarosa; R L Guerrant; D A Sack
Journal:  Ann Intern Med       Date:  1977-06       Impact factor: 25.391

9.  Heat-labile enterotoxin production in isolates from a shipboard outbreak of human diarrheal illness.

Authors:  K Wachsmuth; J Wells; P Shipley; R Ryder
Journal:  Infect Immun       Date:  1979-06       Impact factor: 3.441

10.  Enterotoxin-producing Escherichia coli O169:H41, United States.

Authors:  Mark E Beatty; Cheryl A Bopp; Joy G Wells; Kathy D Greene; Nancy D Puhr; Eric D Mintz
Journal:  Emerg Infect Dis       Date:  2004-03       Impact factor: 6.883

View more
  6 in total

1.  Comparative Pathogenomics of Escherichia coli: Polyvalent Vaccine Target Identification through Virulome Analysis.

Authors:  J R Clark; A M Maresso
Journal:  Infect Immun       Date:  2021-07-15       Impact factor: 3.441

2.  High-Quality Whole-Genome Sequences for 59 Historical Shigella Strains Generated with PacBio Sequencing.

Authors:  Justin Kim; Rebecca L Lindsey; Lisley Garcia-Toledo; Vladimir N Loparev; Lori A Rowe; Dhwani Batra; Phalasy Juieng; Devon Stoneburg; Haley Martin; Kristen Knipe; Peyton Smith; Nancy Strockbine
Journal:  Genome Announc       Date:  2018-04-12

3.  Comparative genomic analysis and molecular examination of the diversity of enterotoxigenic Escherichia coli isolates from Chile.

Authors:  David A Rasko; Felipe Del Canto; Qingwei Luo; James M Fleckenstein; Roberto Vidal; Tracy H Hazen
Journal:  PLoS Negl Trop Dis       Date:  2019-11-20

4.  Long-read-sequenced reference genomes of the seven major lineages of enterotoxigenic Escherichia coli (ETEC) circulating in modern time.

Authors:  Astrid von Mentzer; Grace A Blackwell; Derek Pickard; Christine J Boinett; Enrique Joffré; Andrew J Page; Ann-Mari Svennerholm; Gordon Dougan; Åsa Sjöling
Journal:  Sci Rep       Date:  2021-04-29       Impact factor: 4.379

5.  Genome wide characterization of enterotoxigenic Escherichia coli serogroup O6 isolates from multiple outbreaks and sporadic infections from 1975-2016.

Authors:  Vaishnavi Pattabiraman; Lee S Katz; Jessica C Chen; Andre E McCullough; Eija Trees
Journal:  PLoS One       Date:  2018-12-31       Impact factor: 3.240

6.  Strain-Level Metagenomic Data Analysis of Enriched In Vitro and In Silico Spiked Food Samples: Paving the Way towards a Culture-Free Foodborne Outbreak Investigation Using STEC as a Case Study.

Authors:  Assia Saltykova; Florence E Buytaers; Sarah Denayer; Bavo Verhaegen; Denis Piérard; Nancy H C Roosens; Kathleen Marchal; Sigrid C J De Keersmaecker
Journal:  Int J Mol Sci       Date:  2020-08-08       Impact factor: 5.923

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.