Literature DB >> 29348347

Completed Genome Sequences of Strains from 36 Serotypes of Salmonella.

James Robertson1, Catherine Yoshida2, Simone Gurnik1, John H E Nash3.   

Abstract

We report here the completed closed genome sequences of strains representing 36 serotypes of Salmonella These genome sequences will provide useful references for understanding the genetic variation between serotypes, particularly as references for mapping of raw reads or to create assemblies of higher quality, as well as to aid in studies of comparative genomics of Salmonella. © Crown copyright 2018.

Entities:  

Year:  2018        PMID: 29348347      PMCID: PMC5773732          DOI: 10.1128/genomeA.01472-17

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Salmonella spp. are the leading cause of bacterial gastroenteritis in North America, with over 1.7 million cases per annum (1). Public health jurisdictions are replacing traditional serotyping with whole-genome sequencing (WGS) methodologies for quicker and more accurate outbreak detection and surveillance activities (2). To this end, we previously developed an in silico serotyping platform for Salmonella (3, 4). Unfortunately, the large amount of raw data available in the SRA are primarily composed of Illumina short reads which cannot circularize the Salmonella genome as one contiguous nucleic acid molecule. As of November 2017, the number of fully closed genomes is 501 for Salmonella enterica and 4 for Salmonella bongori. Therefore, we sequenced 36 diverse serotypes of Salmonella using a combination of Illumina and PacBio technologies to produce high-quality genomes for public health and comparative genomics applications. This data set represents 25 novel serotypes with closed reference genomes. Genomic DNA was isolated using the automated Qiagen EZ1 DNA tissue kit, using the manufacturer’s protocol, except 180 μl of G2 buffer was used with 10 μl of proteinase K and 10 μl of lysozyme (10 mg/ml; Sigma-Aldrich, Gillingham, UK). PacBio sequencing was performed at the Génome Québec Innovation Centre (McGill University, Quebec, Canada) using single-molecule real-time (SMRT) cells in an RSII sequencer, which produced 100,000 to 150,000 reads per sample, with an average read length of 6,000 bp. The PacBio read sets were assembled into circular consensus sequences using the HGAP workflow 1.1.13. Illumina sequencing on MiSeq version 3 (600-cycle kit) using Nextera XT libraries was performed at the National Microbiology Laboratory at Winnipeg (Winnipeg, Manitoba, Canada) to a target of 60-fold coverage. The quality of the Illumina read sets was examined using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Illumina read correction was performed using Lighter version 1.1.1 (https://github.com/mourisl/Lighter). Corrected Illumina reads were then mapped to the PacBio assembly using Bowtie2 version 2.1.0 (http://bowtie-bio.sourceforge.net/bowtie2/index.shtml) using the very-sensitive-local option. The output was sorted and converted into a bam file using SAMtools version 1.3 (http://samtools.sourceforge.net/) and input to Pilon version 1.2.2 (https://github.com/broadinstitute/pilon). The process was performed iteratively on the corrected assemblies until no changes were made to the output. Final assemblies were examined using Gap5 software version 1.2.14 (http://www.sanger.ac.uk/science/tools/gap5). Completed assemblies were processed through the Salmonella In Silico Typing Resource (SISTR) (3, 4) to confirm that the in silico predictions matched the serotype previously performed by our OIE Reference Laboratory for Salmonellosis in Guelph, Ontario, Canada. Closed reference genomes provide great value to an understanding of the biology of pathogens, and as such, it is important that genome repositories contain as many of them as possible. These would make important contributions as reference sequences for the WGS assembly of isolates of the same or highly similar serotypes, as well as provide more accurate genomes for comparative and epidemiological studies on outbreak detection and surveillance of Salmonella.

Accession number(s).

The genome sequences for these 36 Salmonella isolates have been deposited in DDBJ/ENA/NCBI under BioProject no. PRJNA294295. The GenBank accession numbers are listed in Table 1. The raw sequence data are available in the Sequence Read Archive.
TABLE 1 

Salmonella strains sequenced in this study, by serotype

SerotypeIsolate no.GenBank accession no.Genome size (bp)
AntsalovaS01-0511CP0191164,648,086
ApapaSA20060561CP0194034,801,658
BardoSA20113257CP0194044,849,139
BergenST350CP0194054,801,835
BlegdamS-1824CP0194064,693,979
BorrezeSA20041063CP0194074,777,558
BraenderupSA20026289CP0224904,734,880
Crossness1422-74CP0194084,847,468
DerbySA20035215CP0224944,850,334
DjakartaS-1087CP0194094,668,861
HillingdonN1529-D3CP0194104,618,056
HvittingfossSA20014981CP0225034,940,239
IndiaSA20085604CP0220155,395,280
JohannesburgST203CP0194114,651,794
KentuckySA20030505CP0225004,782,363
KoessenS-1501CP0194124,566,169
KrefeldSA20030536CP0194134,942,273
MacclesfieldS-1643CP0221174,822,139
ManchesterST278CP0194144,532,753
ManhattanSA20084699CP0224974,732,484
MbandakaSA20026234CP0224894,796,292
MoscowS-1843CP0194154,690,402
NitraS-1687CP0194164,691,807
OnderstepoortSA20060086CP0220344,774,926
OuakamSA20034636CP0221164,874,915
QuebecS-1267CP0220194,626,699
SaintpaulSA20031783CP0224914,775,303
subsp. II 55:k:z391315KCP0221394,859,044
subsp. II 57:z29:z42ST114CP0224674,719,375
subsp. IIIa 53:z4,z23,z32:-SA20100345CP0225044,586,333
subsp. IIIb 50:k:zMZ0080CP0221425,076,950
subsp. IIIb 65:c:zSA20044251CP0221354,913,978
subsp. V 66:z41:-SA19983605CP0221204,468,959
WandsworthSA20092095CP0194174,916,040
WaycrossSA20041608CP0221384,812,886
YovokomeS-1850CP0194184,640,929
Salmonella strains sequenced in this study, by serotype
  4 in total

1.  The global burden of nontyphoidal Salmonella gastroenteritis.

Authors:  Shannon E Majowicz; Jennie Musto; Elaine Scallan; Frederick J Angulo; Martyn Kirk; Sarah J O'Brien; Timothy F Jones; Aamir Fazil; Robert M Hoekstra
Journal:  Clin Infect Dis       Date:  2010-03-15       Impact factor: 9.079

Review 2.  PulseNet International: Vision for the implementation of whole genome sequencing (WGS) for global food-borne disease surveillance.

Authors:  Celine Nadon; Ivo Van Walle; Peter Gerner-Smidt; Josefina Campos; Isabel Chinen; Jeniffer Concepcion-Acevedo; Brent Gilpin; Anthony M Smith; Kai Man Kam; Enrique Perez; Eija Trees; Kristy Kubota; Johanna Takkinen; Eva Møller Nielsen; Heather Carleton
Journal:  Euro Surveill       Date:  2017-06-08

3.  The Validation and Implications of Using Whole Genome Sequencing as a Replacement for Traditional Serotyping for a National Salmonella Reference Laboratory.

Authors:  Chris A Yachison; Catherine Yoshida; James Robertson; John H E Nash; Peter Kruczkiewicz; Eduardo N Taboada; Matthew Walker; Aleisha Reimer; Sara Christianson; Anil Nichani; Celine Nadon
Journal:  Front Microbiol       Date:  2017-06-09       Impact factor: 5.640

4.  The Salmonella In Silico Typing Resource (SISTR): An Open Web-Accessible Tool for Rapidly Typing and Subtyping Draft Salmonella Genome Assemblies.

Authors:  Catherine E Yoshida; Peter Kruczkiewicz; Chad R Laing; Erika J Lingohr; Victor P J Gannon; John H E Nash; Eduardo N Taboada
Journal:  PLoS One       Date:  2016-01-22       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.