Literature DB >> 34292065

Complete Closed Genome Sequence of the Extremely Heat-Resistant Strain Escherichia coli AW1.7.

Manita Guragain1, James L Bono1, Joseph M Bosilevac1.   

Abstract

Escherichia coli isolate AW1.7 is an extremely heat-resistant bacterium and has been widely used as a reference strain in extreme heat resistance studies for almost a decade. Here, we report its complete closed genome sequence.

Entities:  

Year:  2021        PMID: 34292065      PMCID: PMC8297447          DOI: 10.1128/MRA.00502-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

AW1.7 is an extremely heat-resistant Escherichia coli strain recovered from an unpasteurized dressed beef carcass at a Canadian beef slaughter plant in 2001, isolated on SD-39 agar (42°C, 18 h) (1), passaged four times, and archived at −80°C. Here, we report the sequence of its complete closed genome. An overnight culture of AW1.7 in Luria-Bertani (LB) broth was diluted 1:100 in LB broth and then shaken (150 rpm) at 37°C for 3 h. Genomic DNA (gDNA) was extracted using a Qiagen Genomic-tip 100/G kit (Valencia, CA) and sheared to an average size of 20 kb using a Covaris g-TUBE (Woburn, MA). A sequencing library was prepared using the SMRTbell template prep kit v1.0 (Pacific Biosciences, Menlo Park, CA) and size selected for 20-kb fragments using the BluePippin high pass Plus cassette (Sage Science, Inc., Beverly, MA). PacBio sequencing using the Sequel system (Menlo Park, CA) with v3.0 chemistry produced 21,618 subreads (331,276,790 total subread bases) with an average subread length of 15,220 bp. For the Illumina library, gDNA was sheared to an average size of 350 bp using a Covaris microtube; the library was prepared using the TruSeq DNA PCR-free library preparation (LP) kit (Illumina, San Diego, CA). An average genome coverage of 56× was obtained, with 847,900 paired-end reads (16,994,571 total bases) generated on a MiSeq device using a v2 kit (300 cycles) (Illumina). Quality trimming and adapter clipping of the raw reads was conducted using Trimmomatic v0.39 with the following parameters: ILLUMINACLIP:TruSeq3-PE.fa:2:30:10:2:keepBothReads LEADING:3 TRAILING:3 MINLEN:36 (2). Default settings were used for all software unless otherwise noted. After assembly with the Microbial Assembly application (PacBio SMRT analysis pipeline v8.0), there were 3 contigs with an average coverage of 62×. One contig was circularized using the Microbial Assembly application and was determined to be the chromosome, while the remaining two were determined to be plasmids. The assembled genome sequence was analyzed using Geneious Prime v2020.1.2 (Biomatters Ltd., New Zealand). Further mapping of the circularized chromosome with the error-corrected PacBio and Illumina reads using the Geneious mapper confirmed the assembler’s results. Iterative mapping using the error-corrected PacBio and Illumina reads produced overlapping sequence ends on the two contigs as visualized using LastZ v1.02.00 (3, 4). This allowed the overlapping sequences to be trimmed and the contigs concatenated to produce closed circular plasmids. Position 1 of the chromosome was set to the origin of replication using Ori-Finder v2.0 (5). Using the Geneious mapper, the error-corrected PacBio and Illumina reads produced an overall even coverage of the mapped reads with no single-nucleotide polymorphism (SNP) differences using Find variation/SNPs. Four additional plasmids were identified using the unused reads with the Geneious assembler. The replicon regions were identified by in silico analysis of the plasmid sequences using PlasmidFinder v2.1 (6). Finally, the closed chromosome and plasmids were polished using the Illumina reads with Pilon v1.23 (7). The polished genome sequence was mapped against the shotgun sequences from the source lab to confirm the identity of the isolate. The characteristics of the chromosome and plasmids can be found in Table 1.
TABLE 1

Characteristics of contigs from E. coli AW1.7 genome assembly

Contig nameSequence length (bp)% GCGenBank accession no.a
Chromosome4,947,57750.7CP072539
pAW1.7_1103,12950.1CP072540
pAW1.7_266,90954.2CP072541
pAW1.7_34,84251.5CP072542
pAW1.7_41,93451.3CP072543
pAW1.7_53,21149.1CP072544
pAW1.7_62,45448.8CP072545

Genome sequence annotated using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v5.1 (8, 9).

Characteristics of contigs from E. coli AW1.7 genome assembly Genome sequence annotated using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v5.1 (8, 9).

Data availability.

The GenBank accession numbers for the PGAP-annotated genome sequence are provided in Table 1. The BioProject accession number for the raw sequence reads is PRJNA663878.
  8 in total

1.  PlasmidFinder and In Silico pMLST: Identification and Typing of Plasmid Replicons in Whole-Genome Sequencing (WGS).

Authors:  Alessandra Carattoli; Henrik Hasman
Journal:  Methods Mol Biol       Date:  2020

2.  Genotypic analysis of Escherichia coli recovered from product and equipment at a beef-packing plant.

Authors:  M Aslam; G G Greer; F M Nattress; C O Gill; L M McMullen
Journal:  J Appl Microbiol       Date:  2004       Impact factor: 3.772

3.  Human-mouse alignments with BLASTZ.

Authors:  Scott Schwartz; W James Kent; Arian Smit; Zheng Zhang; Robert Baertsch; Ross C Hardison; David Haussler; Webb Miller
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

4.  Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.

Authors:  Bruce J Walker; Thomas Abeel; Terrance Shea; Margaret Priest; Amr Abouelliel; Sharadha Sakthikumar; Christina A Cuomo; Qiandong Zeng; Jennifer Wortman; Sarah K Young; Ashlee M Earl
Journal:  PLoS One       Date:  2014-11-19       Impact factor: 3.240

5.  Ori-Finder 2, an integrated tool to predict replication origins in the archaeal genomes.

Authors:  Hao Luo; Chun-Ting Zhang; Feng Gao
Journal:  Front Microbiol       Date:  2014-09-15       Impact factor: 5.640

6.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

7.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

8.  RefSeq: an update on prokaryotic genome annotation and curation.

Authors:  Daniel H Haft; Michael DiCuccio; Azat Badretdin; Vyacheslav Brover; Vyacheslav Chetvernin; Kathleen O'Neill; Wenjun Li; Farideh Chitsaz; Myra K Derbyshire; Noreen R Gonzales; Marc Gwadz; Fu Lu; Gabriele H Marchler; James S Song; Narmada Thanki; Roxanne A Yamashita; Chanjuan Zheng; Françoise Thibaud-Nissen; Lewis Y Geer; Aron Marchler-Bauer; Kim D Pruitt
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

  8 in total
  1 in total

1.  Genomic and Phenotypic Analysis of Heat and Sanitizer Resistance in Escherichia coli from Beef in Relation to the Locus of Heat Resistance.

Authors:  Xianqin Yang; Frances Tran; Peipei Zhang; Hui Wang
Journal:  Appl Environ Microbiol       Date:  2021-09-22       Impact factor: 4.792

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.