| Literature DB >> 29558450 |
Amber Hilliard1,2, Dara Leong3, Amy O'Callaghan4,5, Eamonn P Culligan6,7, Ciara A Morgan8,9, Niall DeLappe10, Colin Hill11,12, Kieran Jordan13, Martin Cormican14, Cormac G M Gahan15,16,17.
Abstract
Listeria monocytogenes is a major human foodborne pathogen that is prevalent in the natural environment and has a high case fatality rate. Whole genome sequencing (WGS) analysis has emerged as a valuable methodology for the classification of L. monocytogenes isolates and the identification of virulence islands that may influence infectivity. In this study, WGS was used to provide an insight into 25 L. monocytogenes isolates from cases of clinical infection in Ireland between 2013 and 2015. Clinical strains were either lineage I (14 isolates) or lineage II (11 isolates), with 12 clonal complexes (CC) represented, of which CC1 (6) and CC101 (4) were the most common. Single nucleotide polymorphism (SNP) analysis demonstrated that clinical isolates from mother-infant pairs (one isolate from the mother and one from the infant) were highly related (3 SNP differences in each) and also identified close similarities between isolates from otherwise distinct cases (1 SNP difference). Clinical strains were positive for common virulence-associated loci and 13 isolates harbour the LIPI-3 locus. Pulsed-field gel electrophoresis (PFGE) was used to compare strains to a database of 1300 Irish food and food processing environment isolates and determined that 64% of clinical pulsotypes were previously encountered in the food or food processing environment. Five of the matching food and food processing environment isolates were sequenced and results demonstrated a correlation between pulsotype and genotype. Overall, the work provides insights into the nature of L. monocytogenes strains currently causing clinical disease in Ireland and indicates that similar isolates can be found in the food or food processing environment.Entities:
Keywords: Listeria monocytogenes; SNP; clinical; genome; genomics; sequence; single nucleotide polymorphism
Year: 2018 PMID: 29558450 PMCID: PMC5867892 DOI: 10.3390/genes9030171
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Listeria monocytogenes strains sequenced in this study.
| Isolate | Genbank Accession Number | ST 1 | CC 2 | Lineage | Serotype | Year of Isolation | Sample Type | Pulsotype |
|---|---|---|---|---|---|---|---|---|
| MQ130026 | MUZG00000000 | ST-1 | CC1 | I | 4b | 2013 | Blood | P2 * |
| L970 | PJJD00000000 | ST-1 | CC1 | I | 4b | 2013 | Food production ** | P2 * |
| MQ130029 | MVED00000000 | ST-1 | CC1 | I | 4b | 2013 | CSF 3 | P2 * |
| MQ130032 | MVEE00000000 | ST-1 | CC1 | I | 4b | 2013 | Blood | P2 * |
| MQ130042 | MVEG00000000 | ST-1 | CC1 | I | 4b | 2013 | Pleural Swab | P1 * |
| MQ140025 | MVEK00000000 | ST-1 | CC1 | I | 4b | 2014 | Ear Swab | P68 |
| MQ140031 | MVEN00000000 | ST-1 | CC1 | I | 4b | 2014 | Blood | P2 * |
| MQ140033 | MVEP00000000 | ST-1 | CC1 | I | 4b | 2014 | Blood | P1 * |
| L2113 | PJJE00000000 | ST-1 | CC1 | I | 4b | 2015 | Food production | P2 * |
| MQ150012 | MVEY00000000 | ST-6 | CC6 | I | 4b | 2015 | Blood | P13 * |
| MQ150005 | MVEU00000000 | ST-6 | CC6 | I | 4b | 2015 | Blood | P13 * |
| MQ130058 | MVEH00000000 | ST-6 | CC6 | I | 4b | 2013 | Blood | P13 * |
| MQ140030 | MVEM00000000 | ST-4 | CC4 | I | 4b | 2014 | CSF 3 | NC 4 |
| MQ150004 | MVET00000000 | ST-54 | CC54 | I | 4b | 2015 | Placental Swab | P6 * |
| MQ130033 | MVEF00000000 | ST-54 | CC54 | I | 4b | 2013 | Blood | P12 |
| L2259 | PJJF00000000 | ST-54 | CC54 | I | 4b | 2015 | Food production | P6 * |
| MQ150013 | MVEZ00000000 | ST-2 | CC2 | I | 4b | 2015 | Blood | P16 * |
| MQ130037 | MVFA00000000 | ST-18 | CC18 | II | 1/2a | 2013 | Blood | P32 * |
| MQ150011 | MVEX00000000 | ST-20 | CC20 | II | 1/2a | 2015 | Nasal Swab | NC |
| MQ150001 | MVES00000000 | ST-37 | CC37 | II | 1/2a | 2015 | Blood | P32 * |
| MQ140029 | MVEL00000000 | ST-7 | CC7 | II | 1/2a | 2014 | Blood | P31 * |
| L1445 | PJJG00000000 | ST-7 | CC7 | II | 1/2a | 2014 | Food production | P31 * |
| L1976 | PJJI00000000 | ST-8 | CC8 | II | 1/2c | 2015 | Food production | P48 * |
| MQ140034 | MVEQ00000000 | ST-121 | CC121 | II | 1/2a | 2014 | Blood—Mother/Infant | P59 * |
| L2256 | PJJH00000000 | ST-121 | CC121 | II | 1/2c | 2015 | Food production | P59 * |
| MQ140035 | MVER00000000 | ST-121 | CC121 | II | 1/2a | 2014 | Ear Swab—Mother/Infant | P59 * |
| MQ140032 | MVEO00000000 | ST-425 | CC90 | II | 1/2a | 2014 | Blood | NC |
| MQ140012 | MVEJ00000000 | ST-101 | CC101 | II | 1/2a | 2014 | Blood—Mother/Infant | P30 |
| MQ140011 | MVEI00000000 | ST-101 | CC101 | II | 1/2a | 2014 | Placental Surface Swab—Mother/Infant | P30 |
| MQ150008 | MVEW00000000 | ST-431 | CC101 | II | 1/2a | 2015 | Blood | P30 |
| MQ150007 | MVEV00000000 | ST-431 | CC101 | II | 1/2a | 2015 | CSF 3 | P30 |
1 Sequence Type; 2 Clonal Complex; 3 Cerebrospinal fluid; 4 NC = not classified; * Indicates pulsotypes associated with persistence in the food processing environment in a 3 year study of Irish foods and food production facilities [21]; ** These samples are from food or food production environments (non-clinical) and were chosen for sequencing based upon pulsed-field gel electrophoresis (PFGE) similarities to clinical isolates (see text).
Figure 1(A) Clustering of isolates based upon multi-locus sequence typing (MLST). The evolutionary history was inferred by using the Maximum Likelihood method based on the Tamura 3-parameter model [30] as outlined in detail in Materials and Methods. The tree with the highest log likelihood (−5971.3729) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. Evolutionary analyses were conducted in MEGA7 [31]; (B) Clustering of isolates based upon pulsed field gel electrophoresis (PFGE). Dendograms were generated with BioNumerics v7.0 software (Applied Maths) using UPGMA (unweighted pair group method with averages) and the Pearson coefficient with 1% tolerance. Using either method (MLST or PFGE) strains cluster into two distinct groups dependent upon lineage. ST: sequence type; CC: clonal complex.
Figure 2Phylogeny of the isolates as determined by single nucleotide polymorphism (SNP) analysis. (A) All serotype 4b isolates were compared with strain F2365 as the reference genome. The evolutionary history was inferred by using the Maximum Likelihood method based on the General Time Reversible model [34]. The tree with the highest log likelihood (−67933.7374) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 18 nucleotide sequences. Codon positions included were 1st + 2nd + 3rd + Noncoding. All positions with less than 95% site coverage were eliminated. That is, fewer than 5% alignment gaps, missing data, and ambiguous bases were allowed at any position. There were a total of 12,069 positions in the final dataset. Evolutionary analyses were conducted in MEGA7 [31]. (B) All serotype 1/2a isolates were compared with strain EGDe as the reference genome. The evolutionary history was inferred by using the Maximum Likelihood method based on the General Time Reversible model [34]. The tree with the highest log likelihood (−214301.5165) is shown. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 15 nucleotide sequences. Codon positions included were 1st + 2nd + 3rd + Noncoding. All positions with less than 95% site coverage were eliminated. That is, fewer than 5% alignment gaps, missing data, and ambiguous bases were allowed at any position. There were a total of 30,102 positions in the final dataset. Evolutionary analyses were conducted in MEGA7 [31]. * Indicates strains from a food production source.
Relatedness of strains as determined by fine single nucleotide polymorphism (SNP) analysis with appropriate reference strains.
| Sequence Type | Isolates | Reference Genome | Minimum SNPs | Maximum SNPs |
|---|---|---|---|---|
| ST1 | L970, L2113, MQ130026, MQ130029, MQ130032 *, MQ130042, MQ140025, MQ140031, MQ140033 | F2365 | 43 | 261 |
| F2365, L2113, MQ130026, MQ130029, MQ130032 *, MQ130042, MQ140025, MQ140031, MQ140033 | L970 | 42 | 256 | |
| F2365, L970, MQ130026, MQ130029, MQ130032 *, MQ130042, MQ140025, MQ140031, MQ140033 | L2113 | 42 | 254 | |
| F2365, L970, MQ130029, MQ130032 *, MQ130042 | MQ130026 | 44 | 190 | |
| F2365, L2113, MQ140031, MQ140033 * | MQ140025 | 66 | 259 | |
| ST6 | H7858, MQ130058, MQ150012 * | MQ150005 | 199 | 373 |
| ST54 | LM07-01337, MQ130033, MQ150004 * | L2259 | 65 | 115 |
| ST7 | J2692, L1846, L2676, MQ140029 * | L1445 | 1 | 409 |
| ST121 | 4423, 6179, L2256, La111, Lm1880, N53-1, MQ140035 * | MQ140034 | 3 | 461 |
| ST101 | 2012-L5240, 2012-L5323, Lm1840, MQ140012, MQ150007, MQ150008 * | MQ140011 | 1 | 145 |
| 2012-L5240, 2012-L5323, Lm1840, MQ150008 * | MQ150007 | 2 | 146 |
* Indicates strain with closest similarity to the reference genome (lowest number of SNPs).
Figure 3Pan-genome, constructed using GView [35], of 34 genomes including 25 clinical isolates, six food-associated isolates and three reference genomes. The pangenome is constructed by iteratively appending unique regions onto the initial seed genome in this case F2365. Gaps indicate that the region is missing in a particular gene but is found in others. Strains are grouped according to CC and colour coded as indicated.
Figure 4The presence and absence of genes can be used to cluster the isolates into serotypes and sequence types.
Presence or absence of key loci encoding L. monocytogenes virulence-associated elements or loci putatively involved in environmental survival.
| Isolate | ST 1 | CC 2 | LIPI1 | LIPI3 | LIPI4 | InlA | SSI-1 |
|---|---|---|---|---|---|---|---|
| MQ130026 | ST-1 | CC1 | + 3 | + | − 4 | + 5 | − |
| L970 | ST-1 | CC1 | + | + | − | + | − |
| MQ130029 | ST-1 | CC1 | + | + | − | + | − |
| MQ130032 | ST-1 | CC1 | + | + | − | + | − |
| MQ130042 | ST-1 | CC1 | + | + | − | + | − |
| MQ140025 | ST-1 | CC1 | + | + | − | + | − |
| MQ140031 | ST-1 | CC1 | + | + | − | + | − |
| MQ140033 | ST-1 | CC1 | + | + | − | + | − |
| L2113 | ST-1 | CC1 | + | + | − | + | − |
| MQ150012 | ST-6 | CC6 | + | + | − | + | − |
| MQ150005 | ST-6 | CC6 | + | + | − | + | − |
| MQ130058 | ST-6 | CC6 | + | + | − | + | − |
| MQ140030 | ST-4 | CC4 | + | + | + | + | − |
| MQ150004 | ST-54 | CC54 | + | + | − | + | − |
| MQ130033 | ST-54 | CC54 | + | + | − | + | − |
| L2259 | ST-54 | CC54 | + | + | − | + | − |
| MQ150013 | ST-2 | CC2 | + | − | − | + | − |
| MQ130037 | ST-18 | CC18 | + | − | − | + | + |
| MQ150011 | ST-20 | CC20 | + | − | − | + | − |
| MQ150001 | ST-37 | CC37 | + | − | − | + | − |
| MQ140029 | ST-7 | CC7 | + | − | − | + | + |
| L1445 | ST-7 | CC7 | + | − | − | + | + |
| L1976 | ST-8 | CC8 | + | − | − | + | + |
| MQ140034 | ST-121 | CC121 | + | − | − | + | − |
| L2256 | ST-121 | CC121 | + | − | − | − | − |
| MQ140035 | ST-121 | CC121 | + | − | − | + | − |
| MQ140032 | ST-425 | CC90 | + | − | − | + | − |
| MQ140012 | ST-101 | CC101 | + | − | − | + | − |
| MQ140011 | ST-101 | CC101 | + | − | − | + | − |
| MQ150008 | ST-431 | CC101 | + | − | − | + | − |
| MQ150007 | ST-431 | CC101 | + | − | − | + | − |
1 Sequence Type; 2 Clonal Complex; 3 Presence of genes; 4 Absence of genes; 5 Indicates encoding predicted full-length InlA.