| Literature DB >> 32127544 |
Nanna Munck1, Pimlapas Leekitcharoenphon2, Eva Litrup3, Rolf Kaas2, Anika Meinen4, Laurent Guillier5, Yue Tang6, Burkhard Malorny7, Federica Palma5, Maria Borowiak7, Michèle Gourmelon8, Sandra Simon9, Sangeeta Banerji9, Liljana Petrovska6, Timothy J Dallman10, Tine Hald2.
Abstract
Zoonotic Salmonella causes millions of human salmonellosis infections worldwide each year. Information about the source of the bacteria guides risk managers on control and preventive strategies. Source attribution is the effort to quantify the number of sporadic human cases of a specific illness to specific sources and animal reservoirs. Source attribution methods for Salmonella have so far been based on traditional wet-lab typing methods. With the change to whole genome sequencing there is a need to develop new methods for source attribution based on sequencing data. Four European datasets collected in Denmark (DK), Germany (DE), the United Kingdom (UK) and France (FR) are presented in this descriptor. The datasets contain sequenced samples of Salmonella Typhimurium and its monophasic variants isolated from human, food, animal and the environment. The objective of the datasets was either to attribute the human salmonellosis cases to animal reservoirs or to investigate contamination of the environment by attributing the environmental isolates to different animal reservoirs.Entities:
Mesh:
Year: 2020 PMID: 32127544 PMCID: PMC7054362 DOI: 10.1038/s41597-020-0417-7
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Flow chart of data generation, data management and quality control. dcc_vivaldi[8]: a private datahub (https://www.ebi.ac.uk/ena/pathogens/login) set up for data sharing and hosted at the European Nucleotide Archive (ENA). Computerome: a local server used by DTU Food (https://www.computerome.dk). Prokka: Rapid Prokaryotic genome annotation[32]. Roary: Rapid prokaryotic genome annotation[33]. cgMLST and wgMLST obtained using the Enterobase scheme[11] in BioNumerics version 7.6 (Applied Maths, Sint Martens Latem, Belgium).
Overview of data collected and sequence quality. SA: Source attribution. BN: BioNumerics.
| Dataset | Years included | Total sequences collected for the study | Poor quality | Excluded for other reasons | Included in SA study | ||
|---|---|---|---|---|---|---|---|
| n total = 1781 | FoodQC | BN QC | FoodQC + BN QC | n total = 1259 | |||
| DK food | 2013, 2014 | 211 | 1 | 0 | 0 | 0 | 210 |
| DK human | 2013, 2014 | 181 | 0 | 40 | 0 | 0 | 141 |
| UK Food | 2014, 2015, 2016 | 329 | 4 | 41 | 0 | 2 | 282 |
| UK human | 2014, 2015, 2016 | 596 | 29 | 273 | 0 | 117 | 177 |
| DE food | 2014, 2015, 2016 | 193 | 0 | 3 | 0 | 0 | 190 |
| DE human | 2014, 2015, 2016 | 166 | 0 | 5 | 0 | 0 | 161 |
| FR animal | 2010–2015 | 71 | 0 | 2 | 0 | 0 | 69 |
| FR environmental | 2010–2015 | 34 | 0 | 0 | 5 | 0 | 29 |
Metadata variables.
| Sequence information |
|---|
| sample ID |
| Data provider |
| Pathogen Organism |
| Taxonomic Name/Serovar |
| Year of sampling |
| Primary Source |
| Imported Food (if relevant) |
| Country of sample origin |
| Outbreak ID (if relevant) |
| Patient Travel |
| MLST |
Number of Salmonella Typhimurium and its monophasic variants included in the Danish dataset.
| DK dataset, 2013–2014 Source | 2013 | 2014 | Number of isolates |
|---|---|---|---|
| Pigs (DK) | 84 | 41 | 125 |
| Pigs (Import) | 20 | 14 | 34 |
| Broilers (DK) | 13 | 21 | 34 |
| Ducks (Import) | 0 | 11 | 11 |
| Layers (DK) | 3 | 1 | 4 |
| Cattle (DK) | 1 | 0 | 1 |
| Cattle (Import) | 0 | 1 | 1 |
| Total animal | 121 | 89 | 210 |
| Human | 29 | 112 | 141 |
Number of Salmonella Typhimurium and its monophasic variants included in the German dataset.
| DE dataset, 2014–2016 Source | 2014 | 2015 | 2016 | Number of isolates |
|---|---|---|---|---|
| Birds (DE) | 0 | 0 | 1 | 1 |
| Broilers (DE) | 5 | 4 | 1 | 10 |
| Cattle (DE) | 22 | 26 | 21 | 69 |
| Game (DE) | 1 | 0 | 0 | 1 |
| Layers (DE) | 5 | 20 | 12 | 37 |
| Pigs (DE) | 25 | 26 | 21 | 72 |
| Total animal | 58 | 76 | 56 | 190 |
| Human | 43 | 49 | 69 | 161 |
Number of Salmonella Typhimurium and its monophasic variants included in the British dataset.
| UK dataset, 2014–2016 Source | 2014 | 2015 | 2016 | Number of isolates |
|---|---|---|---|---|
| Broilers (UK) | 4 | 2 | 3 | 9 |
| Cattle (UK) | 11 | 2 | 7 | 20 |
| Game (UK) | 3 | 9 | 6 | 18 |
| Layers (UK) | 4 | 2 | 1 | 7 |
| Other mammals (UK) | 14 | 8 | 20 | 42 |
| Pigs (UK) | 71 | 69 | 23 | 163 |
| Reptiles (UK) | 0 | 1 | 1 | 2 |
| Sheep (UK) | 0 | 1 | 6 | 7 |
| Turkey (UK) | 9 | 2 | 3 | 14 |
| Total animal | 116 | 96 | 70 | 282 |
| Human | 53 | 118 | 6 | 177 |
Number of Salmonella Typhimurium and its monophasic variants included in the French dataset.
| FR dataset, 2010–2015 Source | 2010 | 2011 | 2012 | 2013 | 2014 | 2015 | Number of isolates |
|---|---|---|---|---|---|---|---|
| Broilers (FR) | 0 | 1 | 4 | 0 | 2 | 0 | 7 |
| Cattle (FR) | 1 | 0 | 1 | 2 | 1 | 0 | 5 |
| Ducks (FR) | 0 | 1 | 0 | 0 | 1 | 0 | 2 |
| Layers (FR) | 0 | 0 | 2 | 0 | 0 | 0 | 2 |
| Pigs (FR) | 0 | 16 | 21 | 6 | 6 | 0 | 49 |
| Sheep goat (FR) | 0 | 1 | 0 | 0 | 0 | 0 | 1 |
| Turkey (FR) | 0 | 0 | 0 | 0 | 3 | 0 | 3 |
| Total animal | 1 | 19 | 28 | 8 | 13 | 0 | 69 |
| Environmental | 1 | 2 | 1 | 11 | 13 | 1 | 29 |
Quality control criteria of interest for all datasets. Depth of coverage is calculated as bases(MB)*1,000,000/total base pairs. Mean (minimum value; maximum value) reported.
| Data | N50 | No. contigs | Total base pairs of assembly | Depth of coverage |
|---|---|---|---|---|
| DK human | 274,445 (148,864; 490,821) | 63 (35; 151) | 4,932,862 (4,730,908; 5,217,740) | 90 (46; 296) |
| DK food and animal | 280,405 (151,473; 507,939) | 60 (36; 184) | 4,945,037 (4,769,818; 5,254,474) | 104 (65; 154) |
| DE human | 299,886 (188,812; 393,902) | 61 (33; 326) | 4,937,911 (4,682,559; 5,162,175) | 103 (45; 276) |
| DE food and animal | 137,544 (19,719*; 377,001) | 112 (35; 466) | 4,937,743 (4,730,293; 5,240,180) | 79 (33; 173) |
| UK human | 211,889 (53,936; 550,741) | 69 (36; 170) | 4,941,438 (4,723,064; 5,284,498) | 74 (36; 178) |
| UK food and animal | 267,335 (143,510; 464,171) | 68 (31; 125) | 5,037,113 (4,804,529; 8,982,655**) | 83 (31; 215) |
| FR enviromental | 263,923 (143,153; 356,020) | 64 (42; 101) | 4,939,325 (4,771,190; 5,090,477) | 236 (56; 473) |
| FR animal | 261,383 (143,153; 356,020) | 64 (42; 89) | 4,935,805 (4,810,678; 5,084,053) | 215 (56; 431) |
*This sequence was included despite its relatively low N50 value because the number of contigs and total base pairs were acceptable.
**This sequence was included despite its large size because the associated number of contigs and N50 value were acceptable and no contamination was observed from a kmer analysis using KmerFinder freely available from the Center for Genomic Epidemiology (www.genomicepidemiology.org).
Fig. 2Phylogeny of the Danish dataset. //: Branch length to outgroup ST36 reduced by 30 from 0.71421 to 0.023807. Isolates are annotated by source. Light red: domestically produced pigs, Pigs(DK), light pink: imported pigs, Pigs(Import), yellow; imported ducks, Ducks(Import), blue: domestically produced broilers, Broilers(DK), turquoise: domestically produced eggs, Layers(DK), light green: domestically produced cattle, Cattle(DK), dark green: imported cattle, Cattle(Import), dark grey: Danish human salmonellosis cases, Humans(DK).
Fig. 3Phylogeny of the German dataset. //: Branch length to outgroup ST36 reduced by 30 from 0.73924 to 0.0246413333. Isolates are annotated by source. Light red: domestically produced pigs, Pigs(DE), red: domestically birds, Birds(DE), blue: domestically produced broilers, Broilers(DE), turquoise: domestically produced eggs, Layers(DE), dark green: domestically produced cattle, Cattle(DE), purple: domestically game, Game(DE), dark grey: German human salmonellosis cases, Human(DE).
Fig. 5Phylogeny of the French dataset. Isolates are annotated by source. Light red: domestically produced pigs, Pigs(FR), green: domestically produced turkey, Turkey(FR), light blue: domestically produced ducks, Ducks(FR), blue: domestic produced broilers, Broilers(FR), orange: domestically produced eggs, Layers(FR), dark green: domestically produced cattle, Cattle(FR), brown: domestically produced sheep goats, SheepGoat(FR), purple: domestically produced crustaceans, Crustaceans(FR), dark grey: French environment samples, Environment(FR).
| Measurement(s) | sequence_assembly • DNA |
| Technology Type(s) | sequence assembly process • DNA sequencing • Paired-End Sequencing |
| Factor Type(s) | geographic location • Salmonella Typhimurium isolate source |
| Sample Characteristic - Organism | Salmonella |
| Sample Characteristic - Location | Kingdom of Denmark • Germany • United Kingdom • French Republic |