| Literature DB >> 33036450 |
Hee Jin Kwon1, Zhao Chen1, Peter Evans2, Jianghong Meng1, Yi Chen2.
Abstract
Recently developed nanopore sequencing technologies offer a unique opportunity to rapidly close the genome and to identify complete sequences of mobile genetic elements (MGEs). In this study, 17 isolates of Listeria monocytogenes (Lm) epidemic clone II (ECII) from seven ready-to-eat meat or poultry processing facilities, not known to be associated with outbreaks, were shotgun sequenced, and among them, five isolates were further subjected to long-read sequencing. Additionally, 26 genomes of Lm ECII isolates associated with three listeriosis outbreaks in the U.S. and South Africa were obtained from the National Center for Biotechnology Information (NCBI) database and analyzed to evaluate if MGEs may be used as a high-resolution genetic marker for identifying and sourcing the origin of Lm. The analyses identified four comK prophages in 11 non-outbreak isolates from four facilities and three comK prophages in 20 isolates associated with two outbreaks that occurred in the U.S. In addition, three different plasmids were identified among 10 non-outbreak isolates and 14 outbreak isolates. Each comK prophage and plasmid was conserved among the isolates sharing it. Different prophages from different facilities or outbreaks had significant genetic variations, possibly due to horizontal gene transfer. Phylogenetic analysis showed that isolates from the same facility or the same outbreak always closely clustered. The time of most recent common ancestor of the Lm ECII isolates was estimated to be in March 1816 with the average nucleotide substitution rate of 3.1 × 10-7 substitutions per site per year. This study showed that complete MGE sequences provide a good signal to determine the genetic relatedness of Lm isolates, to identify persistence or repeated contamination that occurred within food processing environment, and to study the evolutionary history among closely related isolates.Entities:
Keywords: Listeria monocytogenes; evolutionary analysis; mobile genetic elements; nanopore long-read sequencing; whole-genome sequencing
Year: 2020 PMID: 33036450 PMCID: PMC7599586 DOI: 10.3390/pathogens9100822
Source DB: PubMed Journal: Pathogens ISSN: 2076-0817
Seventeen non-outbreak Listeria monocytogenes (Lm) isolates from ready-to-eat meat or poultry processing facilities in the U.S. including their sources, locations, dates of isolation, and the identified plasmids and the comK prophages with their lengths.
| Isolate | Source | NCBI Accession Number/SRA | State | Facility | Date of Isolation | Plasmid (Kb) | |
|---|---|---|---|---|---|---|---|
| OB020621 1 | Food | CP053478 | NC | A | Sep-2002 | NO 2 | 40,697 |
| OB020790 | Food | SRR12481282 | NC | A | Nov-2002 | NO | 40,697 |
| OB030029 1 | Food | CP053628-29 | IN | B | Jan-2003 | ~56 | 39,610 |
| OB040119 1 | Food | CP053630-31 | PA | C | Jun-2004 | ~56 | 40,199 |
| OB050272 | Food | SRR12481281 | PA | C | Aug-2005 | ~56 | 40,199 |
| OB050226 1 | Food | CP053632-33 | PA | D | Jul-2005 | ~56 | 39,677 |
| OB050347 | Environmental | SRR12481277 | PA | D | Oct-2005 | ~56 | 39,677 |
| OB050350 | Environmental | SRR12481276 | PA | D | Oct-2005 | ~56 | 39,677 |
| OB050351 | Environmental | SRR12481275 | PA | D | Oct-2005 | ~56 | 39,677 |
| OB050355 | Environmental | SRR12481274 | PA | D | Oct-2005 | ~56 | 39,677 |
| OB070122 | Environmental | SRR12481273 | PA | D | May-2007 | ~56 | 39,677 |
| OB080183 1 | Food | CP060526-27 | NY | E | May-2008 | ~90 | NO |
| OB080396 | Environmental | SRR12481271 | NJ | F | Aug-2008 | NO | NO |
| OB080398 | Environmental | SRR12481270 | NJ | F | Aug-2008 | NO | NO |
| OB080487 | Environmental | SRR12481280 | NJ | F | Oct-2008 | NO | NO |
| OB080567 | Environmental | SRR12481279 | NJ | F | Dec-2009 | NO | NO |
| OB090318 | Environmental | SRR12481278 | NY | G | Aug-2009 | NO | NO |
The National Center for Biotechnology Information (NCBI) accession number and/or sequence read archive (SRA) identifier for each isolate are included. 1 Complete genomes generated in this study. 2 NO, no plasmid or comK prophage was identified.
The complete genomes of five non-outbreak isolates using both long-reads and short-reads generated by Unicycler assembler.
| Isolate | Facility | Circular | Number of Contigs | Length of Chromosome (bp) | Length of Plasmid (bp) | Most Closely Related Plasmid in NCBI | Position of the |
|---|---|---|---|---|---|---|---|
| OB020621 | A | Yes | 1 | 2,949,231 | NO 1 | NO | 2,365,689 to 2,406,385 |
| OB030029 | B | Yes | 2 | 2,994,628 | 55,801 | pJ1776 | 2,411,970 to |
| OB040119 | C | Yes | 2 | 2,953,956 | 55,803 | pJ1776 | 2,370,708 to |
| OB050226 | D | Yes | 2 | 2,953,451 | 55,798 | pJ1776 | 2,370,726 to |
| OB080183 | E | Yes | 2 | 2,908,728 | 90,421 | pLM-F-131 | NO |
All five genomes were closed (circular) and OB020621 had one circular contig of chromosome, while each OB030029, OB040119, OB050226, and OB080183 had two circular contigs, a chromosome and a plasmid. For each plasmid that was identified from complete genomes, BLAST analysis was used to identify the most closely related Lm plasmid deposited in the NCBI database. The positions of each modified comK prophage on the complete genomes are included. 1 NO, no plasmid or comK prophage was identified.
Twenty-six listeriosis outbreak-related isolates analyzed in this study.
| Isolate | Source | NCBI Accession Number/SRA | State | Facility | Date of Isolation | Plasmid (Kb) | |
|---|---|---|---|---|---|---|---|
| H7355 | Clinical | SRR1814362 | n/a 1 | n/a | Nov-1998 | NO 2 | 40,606 |
| H7738 | Food | SRR3707884 | OH | n/a | Dec-1998 | ~82 | 40,606 |
| H7762 | Food | SRR3707885 | MI | n/a | Nov-1998 | ~82 | 40,606 |
| H7961 | Food | SRR1814399 | OH | n/a | Jan-1999 | ~82 | 40,606 |
| H7962 | Food | SRR1815438 | OH | n/a | Jan-1998 | ~82 | 40,606 |
| H7550 | Clinical | SRR1815437 | NY | n/a | Oct-1998 | ~82 | 40,815 |
| H7557 | Food | SRR3707886 | NY | n/a | Nov-1998 | NO | 40,815 |
| H7596 | Food | SRR1815440 | NY | n/a | Sep-1998 | NO | 40,815 |
| H7969 | Clinical | SRR3707879 | OH | n/a | Jan-1998 | ~82 | 40,815 |
| J1776 3 | Clinical | NC_021839.1 | NJ | n/a | Sep-2002 | ~56 | 39,947 |
| J1816 3 | Environmental | NC_022047.1 | PA | X | Oct-2002 | NO | 39,928 |
| J1817 3 | Environmental | NC_021830.2 | PA | X | Oct-2002 | ~56 | 39,932 |
| J1926 3 | Food | NC_021840.1 | PA | n/a | Nov-2002 | ~56 | 39,947 |
| J1703 | Clinical | SRR3707894 | PA | n/a | Sep-2002 | ~56 | 39,947 |
| J1705 | Clinical | SRR3707893 | PA | n/a | Sep-2002 | ~56 | 39,947 |
| J1735 | Clinical | SRR3707726 | n/a | n/a | Jan-2002 | ~56 | 39,947 |
| J1736 | Clinical | SRR1815439 | PA | n/a | Sep-2002 | NO | 39,947 |
| J1815 | Environmental | SRR3707728 | MI | Y | Jan-2002 | NO | 39,932 |
| J1925 | Food | SRR1814333 | n/a | n/a | Nov-2001 | ~56 | 39,947 |
| J1927 | Food | SRR3707892 | PA | n/a | Nov-2002 | ~56 | 39,947 |
| HM00113468 3 | Food | NZ_CP058256 | n/a | n/a | Feb-2018 | NO | NO |
| HM00108598 | Food | SRR7056256 | n/a | n/a | Jan-2018 | NO | NO |
| HM00110618 | Environmental | SRR7056247 | n/a | n/a | Feb-2018 | NO | NO |
| IG01149260 | Clinical | SRR7056255 | n/a | n/a | Dec-2017 | NO | NO |
| YA00079283 | Clinical | SRR7056251 | n/a | n/a | Dec-2017 | NO | NO |
| YA00082404 | Food | SRR7056250 | n/a | n/a | Jan-2018 | NO | NO |
The NCBI accession numbers for the complete genomes and SRA identifiers for the draft genomes are included. The plasmids and comK prophages determined in this study are shown with their lengths. 1 n/a, metadata not available. 2 NO, no plasmid or comK prophage was determined. 3 Complete genomes available in the NCBI database.
Figure 1(a) Phylogenetic tree constructed by Center for Food Safety and Applied Nutrition (CFSAN) single nucleotide polymorphism (SNP) pipeline with bootstraps values (n = 1000) labeled on each branch. The isolates sequenced by Oxford Nanopore Technologies MinION are indicated (*). Isolates originated from the same facility or the same outbreak clustered together under one clade. (b) The presence or absence of each plasmid and prophage is indicated (presence: black/absence: blank). Identical comK prophages are indicated with the same color.
Figure 2Artemis Comparison Tool comparison of seven complete comK prophages after modification using the disrupted comK gene. Each comK prophage contained ~65 genes in ~41 Kb regions. The two different comK prophages from the 1998 U.S. outbreak isolates shared 79% of genes. The comK prophage from the 2002 U.S. outbreak isolates shared higher percentage of genes with the H7355 comK prophage than H7596 comK prophage, 85% and 72% of genes, respectively. The comK prophages of Facilities A and B shared 22% of genes and the comK prophages of Facilities C and D shared 72% of the total number of genes.
Figure 3Mauve alignment of plasmids. (a) Plasmid #1: this plasmid (GenBank ID: CP006612.1) was found in J1776 of the 2002 U.S. outbreak isolates. The complete plasmid genome of Facilities B, C, and D isolates showed 100% alignment coverage (AC) and 99% sequence identity (SI) with pJ1776. (b) Plasmid #2 and #3: plasmid of LM-F-75 (GenBank ID: KY613765.1) from the NCBI database showed 98% AC and 99% SI to Plasmid #2 from OB080183 (Facility E). The strain LM-F-131 (GenBank ID: QADR01000015.1) has a plasmid with ~92% AC and 99% SI to Plasmid #3 found in the six shotgun genomes of the 1998 U.S. outbreak. The plasmid of LM-F-131 showed 100% of SI but 89% AC to Plasmid #2 (~90 Kb).
Figure 4The maximum clade credibility (MCC) tree of 39 ECII isolates from Bayesian evolutionary analysis by sampling trees (BEAST) analysis. The mean heights were labeled on each branch and its 95% highest posterior density (HPD) was displayed in a horizontal bar. The possible acquisition events of plasmids and prophages are displayed; the color code of comK prophage matches Figure 1. The time of the most recent common ancestor (tMRCA) was estimated: (i) July 1968 (95% HPD, January 1961 to June 1974) for the 1998 U.S. outbreak isolates, (ii) February 1950 (95% HPD, January 1928 to November 1967) for the 2002 U.S. outbreak isolates, (iii) May 2015 (95% HPD, December 2012 to April 2017) for the South African outbreak isolates, and (iv) March 1816 (95% HPD, January 1716 to February 1896) for the entire 39 ECII isolates.