Literature DB >> 32499355

Whole-Genome Sequences of Salmonella Isolates from an Ecological Wastewater Treatment System.

Charles J Connolly1, Laura Kaminsky1, Gabriella N Pinto1, Priscilla C Sinclair1, Gyasu Bajracharya2, Runan Yan2, Erin M Nawrocki2, Edward G Dudley2, Jasna Kovac3,2.   

Abstract

Twenty-seven Salmonella isolates were collected from four locations within an ecological wastewater treatment system located at The Pennsylvania State University and were subjected to whole-genome sequencing. The sequences obtained were used for in silico characterization, including serotyping and phylogenetic relatedness analysis.
Copyright © 2020 Connolly et al.

Entities:  

Year:  2020        PMID: 32499355      PMCID: PMC7272564          DOI: 10.1128/MRA.00456-20

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Twenty-seven Salmonella isolates were isolated from wastewater samples collected from four locations (wastewater delivery truck, anoxic tank, clarifier, and pond) within an experimental ecological wastewater treatment system located at The Pennsylvania State University. Samples were collected on four dates (31 May, 6 June, 21 June, and 2 July) in 2019. The collected wastewater samples were centrifuged, filtered, and enriched for Salmonella spp. following the Food and Drug Administration’s Bacteriological Analytical Manual protocol (1). Presumptive Salmonella colonies isolated from selective differential agars were subcultured in brain heart infusion (BHI) broth at 35°C for 24 h and were used for DNA extraction with DNeasy blood and tissue kits (Qiagen). Extracted DNA was used to prepare Illumina Nextera XT libraries, which were sequenced on an Illumina MiSeq system with 300-bp paired-end reads, using a 600-cycle v3 kit. Read quality was assessed with FastQC v0.11.5 (2) using default parameters. Low-quality bases and adapters were removed with Trimmomatic v0.36 (3) using the parameters recommended in the program manual. Trimmed reads were assembled de novo with SPAdes v3.14.0 (4), using k-mer lengths of 99 and 127 and a “careful” option. Assembly qualities were assessed with QUAST 4.6.1 (5). Average coverage was calculated using BWA v0.7.12 (6) and SAMtools v1.5 (7). Sequencing and SPAdes assembly metrics are shown in Table 1. The assemblies available in the NCBI nucleotide database were generated prior to this study using SKESA v2.2 (8) and were annotated using the NCBI Prokaryotic Genome Annotation Pipeline (9), but they were not used in this study. Serotypes were identified using sistr_cmd v1.0.2 (10). Core single-nucleotide polymorphisms (SNPs) for all 27 isolates were identified using kSNP3 v3.1.2 (11) with a k-mer size of 19 and were used to construct a maximum likelihood tree in RAxML v8.2, using a GTR-GAMMA model with Lewis correction and 1,000 bootstrap iterations (12). Another tree, containing only isolates of Salmonella enterica serotype Montevideo, was constructed using core genome SNPs identified with the CFSAN SNP pipeline Galaxy v1.0.1 (13), using accession number SRR9853881 as the reference genome. The cytolethal distending toxin-encoding gene cdtB (NP_456275) was added to the BTyper 2.3.1 virulence database and searched for using default parameters (14). BTyper was also used for the detection of antimicrobial resistance genes, with default parameters.
TABLE 1

Sequencing metrics for the 27 Salmonella whole-genome sequences and assemblies

SRA accession no.Total no. of contigsNo. of contigs ≥1,000 bpAssembly length (bp)N50 (bp)Avg coverage (×)GC content (%)No. of reads
SRR9853520121564,974,711216,9028552.141,802,482
SRR985352983374,648,670272,20210752.192,173,272
SRR985353074314,646,343362,5806852.191,340,334
SRR9853590114784,643,063108,2525852.201,136,098
SRR985387693594,642,802149,1955552.191,104,466
SRR985388171334,645,578151,19813452.202,992,046
SRR985388271344,643,296229,8534052.19789,878
SRR985388779414,644,479225,3286452.191,265,342
SRR9853888127684,679,950110,3616552.161,386,824
SRR985389075334,646,670272,0479252.191,840,696
SRR985389173314,646,815272,2025852.191,151,244
SRR98539061501024,646,13383,1214052.19878,332
SRR985404773244,675,646426,3024352.17859,994
SRR985404883434,644,127225,7038052.201,565,354
SRR985407079394,646,207225,32813152.192,835,590
SRR985407275314,648,893272,2027252.191,431,874
SRR985407584454,645,124221,8057252.191,459,050
SRR985407677364,614,194225,3279452.192,087,984
SRR9854078100614,639,441140,73711752.202,277,850
SRR985407973364,671,031144,8196052.211,195,606
SRR985408179364,646,637227,1529952.192,167,306
SRR985409896584,672,037147,7397152.201,412,384
SRR985410094424,652,181229,9317652.181,557,636
SRR985410474284,717,129418,8926552.241,319,400
SRR985425576384,644,689225,7035952.191,218,688
SRR994313387414,644,304261,5683652.20711,564
SRR99435781581114,637,28572,4942652.22566,540
Sequencing metrics for the 27 Salmonella whole-genome sequences and assemblies The average coverage, number of contigs larger than 1,000 bp, GC content, and N50 value for the assemblies obtained were 68.84×, 47.11, 52.19%, and 217,735 bp, respectively. Isolates were identified as S. enterica serotypes Montevideo (n = 21), Javiana (n = 2), Infantis (n = 1), Paratyphi B (n = 2), and Braenderup (n = 1). Untreated wastewater samples housed the greatest diversity of serotypes, as they had all five serotypes detected. Only S. Montevideo isolates were detected in samples from the anoxic tank and pond (Fig. 1A and B), while S. Montevideo, S. Javiana, and S. Paratyphi B were detected in samples from the clarifier. Among isolates of S. Montevideo, four pairs of isolates obtained from the same location on the same day had nearly identical core genomes (Fig. 1B). There was also a cluster of highly similar isolates spanning several sampling dates and locations (Fig. 1B), suggesting potential persistence of a specific genotype within the wastewater treatment system during the sampling period. The cdtB gene (15) was detected in all isolates of S. Montevideo and S. Javiana. Nine antimicrobial resistance genes were also detected, i.e., aac6-ly (n = 26), fosA7 (n = 21), qnrD (n = 2), aac6-laa, aadA1-pm, sul1, sul2, tetA, and tetR (n = 1).
FIG 1

Phylogenetic diversity of Salmonella isolates collected from an experimental ecological wastewater treatment system. (A) Maximum likelihood tree of all isolates, built using core genome SNPs identified with kSNP3. Isolate accession numbers are colored according to the serotype identified using sistr_cmd v1.0.2 and are labeled with the sampling date and location. (B) Maximum likelihood tree of Salmonella Montevideo isolates, built using core genome SNPs identified with the CFSAN SNP pipeline Galaxy v1.0.1. Isolate SRR9853881 was used as the reference genome. Isolate accession numbers are colored according to the sampling location and are labeled with the sampling date. Bootstrap values below 60 are not displayed.

Phylogenetic diversity of Salmonella isolates collected from an experimental ecological wastewater treatment system. (A) Maximum likelihood tree of all isolates, built using core genome SNPs identified with kSNP3. Isolate accession numbers are colored according to the serotype identified using sistr_cmd v1.0.2 and are labeled with the sampling date and location. (B) Maximum likelihood tree of Salmonella Montevideo isolates, built using core genome SNPs identified with the CFSAN SNP pipeline Galaxy v1.0.1. Isolate SRR9853881 was used as the reference genome. Isolate accession numbers are colored according to the sampling location and are labeled with the sampling date. Bootstrap values below 60 are not displayed.

Data availability.

The sequencing reads were deposited in the NCBI database under BioProject number PRJNA357723. The SRA accession numbers are SRR9854075, SRR9853529, SRR9853882, SRR9853520, SRR9854070, SRR9854072, SRR9854081, SRR9853906, SRR9854255, SRR9853590, SRR9943578, SRR9853887, SRR9853530, SRR9853876, SRR9853890, SRR9854100, SRR9854047, SRR9854104, SRR9853888, SRR9853891, SRR9854079, SRR9854076, SRR9854098, SRR9854078, SRR9853881, SRR9943133, and SRR9854048.
  13 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  kSNP3.0: SNP detection and phylogenetic analysis of genomes without genome alignment or reference genome.

Authors:  Shea N Gardner; Tom Slezak; Barry G Hall
Journal:  Bioinformatics       Date:  2015-04-25       Impact factor: 6.937

3.  QUAST: quality assessment tool for genome assemblies.

Authors:  Alexey Gurevich; Vladislav Saveliev; Nikolay Vyahhi; Glenn Tesler
Journal:  Bioinformatics       Date:  2013-02-19       Impact factor: 6.937

4.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

5.  Rapid, High-Throughput Identification of Anthrax-Causing and Emetic Bacillus cereus Group Genome Assemblies via BTyper, a Computational Tool for Virulence-Based Classification of Bacillus cereus Group Isolates by Using Nucleotide Sequencing Data.

Authors:  Laura M Carroll; Jasna Kovac; Rachel A Miller; Martin Wiedmann
Journal:  Appl Environ Microbiol       Date:  2017-08-17       Impact factor: 4.792

6.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

Review 7.  Dynamic Duo-The Salmonella Cytolethal Distending Toxin Combines ADP-Ribosyltransferase and Nuclease Activities in a Novel Form of the Cytolethal Distending Toxin.

Authors:  Rachel Miller; Martin Wiedmann
Journal:  Toxins (Basel)       Date:  2016-04-25       Impact factor: 4.546

8.  The Salmonella In Silico Typing Resource (SISTR): An Open Web-Accessible Tool for Rapidly Typing and Subtyping Draft Salmonella Genome Assemblies.

Authors:  Catherine E Yoshida; Peter Kruczkiewicz; Chad R Laing; Erika J Lingohr; Victor P J Gannon; John H E Nash; Eduardo N Taboada
Journal:  PLoS One       Date:  2016-01-22       Impact factor: 3.240

9.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

10.  SKESA: strategic k-mer extension for scrupulous assemblies.

Authors:  Alexandre Souvorov; Richa Agarwala; David J Lipman
Journal:  Genome Biol       Date:  2018-10-04       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.