| Literature DB >> 23825666 |
Henk C den Bakker1, Christopher A Desjardins, Allison D Griggs, Joseph E Peters, Qiandong Zeng, Sarah K Young, Chinnappa D Kodira, Chandri Yandava, Theresa A Hepburn, Brian J Haas, Bruce W Birren, Martin Wiedmann.
Abstract
Listeria monocytogenes, a foodborne bacterial pathogen, is comprised of four phylogenetic lineages that vary with regard to their serotypes and distribution among sources. In order to characterize lineage-specific genomic diversity within L. monocytogenes, we sequenced the genomes of eight strains from several lineages and serotypes, and characterized the accessory genome, which was hypothesized to contribute to phenotypic differences across lineages. The eight L. monocytogenes genomes sequenced range in size from 2.85-3.14 Mb, encode 2,822-3,187 genes, and include the first publicly available sequenced representatives of serotypes 1/2c, 3a and 4c. Mapping of the distribution of accessory genes revealed two distinct regions of the L. monocytogenes chromosome: an accessory-rich region in the first 65° adjacent to the origin of replication and a more stable region in the remaining 295°. This pattern of genome organization is distinct from that of related bacteria Staphylococcus aureus and Bacillus cereus. The accessory genome of all lineages is enriched for cell surface-related genes and phosphotransferase systems, and transcriptional regulators, highlighting the selective pressures faced by contemporary strains from their hosts, other microbes, and their environment. Phylogenetic analysis of O-antigen genes and gene clusters predicts that serotype 4 was ancestral in L. monocytogenes and serotype 1/2 associated gene clusters were putatively introduced through horizontal gene transfer in the ancestral population of L. monocytogenes lineage I and II.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23825666 PMCID: PMC3692452 DOI: 10.1371/journal.pone.0067511
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Genomes and strains used for analyses.
| Strain | Sero-type | Line-age | Source | Sequencing center | Coverage & Status | Accession |
|
| ||||||
| 10403S | 1/2a | II | Streptomycin resistant lab strain, USA | Broad Institute v6 | 48×finished | AARZ00000000.3 |
| Finland1998 | 3a | II | Butter associated outbreak, Finland | Broad Institute v3 | 48×finished | AART00000000.2 |
| FSL R2-561 | 1/2c | II | Human sporadic case, England | Broad Institute v2 | 48×finished | AARS00000000.2 |
| J0161 | 1/2a | II | Human clinical case, USA | Broad Institute v8 | 29×finished | AARW00000000.3 |
| F6900 | 1/2a | II | Human clinical case, USA | Broad Institute v3 | 26×draft | AARU00000000.2 |
| FSL J1-194 | 1/2b | I | Human sporadic case, USA | Broad Institute v3 | 22×draft | AARJ00000000.2 |
| FSL J2-071 | 4c | III | Bovine clinical case, USA | Broad Institute v5 | 21×draft | AARN00000000.4 |
| FSL N1-017 | - | I | Food processing environment, USA | Broad Institute v5 | 22×draft | AARP00000000.4 |
| FSL R2-503 | 1/2b | I | Human clinical case, USA | Broad Institute v3 | 35×draft | AARR00000000.2 |
| FSL N3-165 | 1/2a | II | Farm environment, USA | Broad Institute v3 | 22×draft | AARQ00000000.2 |
| Aureli1997(HPB2262) | 4b | I | Human clinical case, Italy | Broad Institute v3 | 20×draft | AARL00000000.2 |
| J2818 | 1/2a | II | Food, outbreak related, USA | Broad Institute v3 | 24×draft | AARX00000000.2 |
| EGD-e | 1/2a | II | Lab strain, origin unknown | Institut Pasteur | finished | NC_003210 |
| H7858 | 4b | I | Food, outbreak related, USA | TIGR | 8×draft | AADR00000000 |
| F6854 | 1/2a | II | Food (Turkey Franks), USA | TIGR | 8×draft | AADQ00000000 |
| F2365 | 4b | I | Food, USA | TIGR | 8×finished | AE017262 |
| CLIP80459 | 4b | I | Human clinical case, France | Institut Pasteur | finished | NC_012488 |
| 08_5578 | 1/2a | II | Human clinical case, Canada | Public Health Agency of Canada | 40×finished | CP001602 |
| 08_5923 | 1/2a | II | Human clinical case, Canada | Public Health Agency of Canada | 36×finished | CP001604 |
| HCC23 | 4a | III | Healthy catfish, USA | Mississippi State University | 15×finished | NC_011660 |
| FSL J1-208 | 4a | IV | Caprine clinical case, outbreak related | Cornell University/Broad Institute | 200×draft | AEIS01000000 |
|
| ||||||
| FSL S4-120 | 6a | Forest soil, NY, USA, 2001 | Cornell University/Life Technologies | 200×draft | CM001047.1 | |
|
| ||||||
| CLIP11262 | 6a | Wild type lab strain | Institut Pasteur | finished | NC_003212 | |
| FSL S4-378 | 4ab | Puddle of water, NY, USA, 2002 | Cornell University/Life Technologies | 200×draft | NZ_CM001048.1 | |
| FSL J1-023 | 4b | Obtained from Qualicon | Cornell University/Life Technologies | 200×draft | NZ_CM001049.1 | |
|
| ||||||
| SLCC5334 | 6b | Decaying vegetation, USA | Justus-Liebig-University | 6.4×finished | AM263198 | |
|
| ||||||
| SLCC3954 | 1/2b | Soil, Germany | Justus-Liebig-University | 7×finished | NC_013891 | |
| FSL N1-067 | 7 | Food processing plant | Cornell University/Life Technologies | 200×draft | NZ_CM001051.1 | |
| FSL S4-171 | 4c | Urban environment, NY, USA, 2001 | Cornell University/Life Technologies | 200×draft | NZ_CM001052.1 | |
|
| ||||||
| PAM 55 | 5 | Sheep, Spain | Institut Pasteur | 8×finished | NC_016011.1 | |
| ATCC 49954 | 5 | Food, France | Cornell University/Life Technologies | 200×draft | NZ_CM001050.1 | |
While FSL N1-017 has been reported to be serotype 4b (http://www.broadinstitute.org/annotation/genome/listeria_group/GenomeDescriptions.html), gene content analysis suggests that it should be serotype 1/2b.
These genomes were previously sequenced [56], but here the assemblies were improved and the genomes were re-annotated.
Coverage information for these genome sequences could not be determined.
Genome sequences only used in O-antigen related analyses. Serotypes for these strains, with the exception of L. ivanovii, were newly determined in this study.
Genome statistics of L. monocytogenes genome sequences used in this study.
| Strain | No. of Scaffolds | Genome Size (Mb) | No. of Genes | Contig N50 (kb)1 | Q40 bases (%) | No. of Accessory Genes | No. of alignable Non-Phage Acc. Genes | No. of Unique Genes |
| 10403S | 1 | 2.90 | 2828 | N/A | 99.9 | 389 | 336 | 12 |
| F6900 | 23 | 2.97 | 3005 | 524 | 99.9 | 567 | 439 | 5 |
| Finland1998 | 1 | 2.87 | 2762 | N/A | 98.8 | 323 | 323 | 11 |
| FSL J1-194 | 30 | 2.99 | 3013 | 354 | 99.8 | 577 | 430 | 32 |
| FSL J2-071 | 53 | 2.85 | 2822 | 145 | 99.3 | 398 | 334 | 72 |
| FSL N1-017 | 79 | 3.14 | 3187 | 92 | 99.5 | 753 | 448 | 134 |
| FSL R2-503 | 55 | 2.99 | 3027 | 173 | 99.5 | 592 | 426 | 47 |
| FSL N3-165 | 39 | 2.88 | 2890 | 147 | 99.7 | 451 | 441 | 40 |
| FSL R2-561 | 1 | 2.97 | 2910 | N/A | 99.7 | 471 | 347 | 29 |
| Aureli1997 (HPB2262) | 79 | 2.99 | 3053 | 143 | 99.6 | 616 | 434 | 71 |
| J0161 | 1 | 3.00 | 2973 | N/A | 99.9 | 534 | 403 | 3 |
| J2818 | 24 | 2.97 | 3084 | 489 | 99.8 | 646 | 503 | 38 |
| EGDe | 1 | 2.94 | 2864 | N/A | – | 425 | 344 | 8 |
| H7858 | 181 | 2.97 | 3111 | N/A | – | 700 | 530 | 227 |
| F6854 | 133 | 2.95 | 2967 | N/A | – | 568 | 455 | 67 |
| F2365 | 1 | 2.91 | 2821 | N/A | – | 382 | 382 | 31 |
| CLIP80459 | 1 | 2.91 | 2766 | N/A | – | 327 | 327 | 12 |
| 08_5578 | 1 | 3.03 | 3010 | N/A | – | 571 | 404 | 33 |
| 08_5923 | 1 | 3.00 | 2966 | N/A | – | 527 | 406 | 5 |
| HCC23 | 1 | 2.98 | 2974 | N/A | – | 535 | 338 | 127 |
| FSL J1-208 | 8 | 2.74 | 2765 | 527 | – | 401 | 401 | 96 |
Contig N50 values are given for draft (unfinished) genomes assembled here, while the percent Q40 bases is given for all genomes assembled here.
N/A = not available, because the genome sequence is closed.
Percentage Q40 bases is only given for genome sequences newly presented in this publication.
Figure 1phylogenetic tree and accessory genome distribution plots.
Plots show the number of accessory genes in between each core gene as ordered in the reference strain EGDe. Insertion sites of prophages (P), integrated conjugative elements (ICE), and Listeria genomic islands (LGI) as detailed in Table 4 are indicated above each accessory genome distribution plot. Vertical dotted lines with a question mark indicate prophages, which are not assembled in a single contiguous piece, but are hypothesized to be present in the location based on presence of the appropriate phage genes in unalignable fraction of the assembly. Plots are colored by lineage: I, red, II, blue, III, green, IV, purple. Serotypes are shown to the right of each plot. The phylogenetic tree is based on a maximum likelihood analysis of the concatenated alignments of 2,086 core genes.
Overview of prophage and Inserted Conjugative Elements (ICE) insertion sites in L. monocytogenes.
| Mobile element integration site in | Type of element | gene closest to integration site (attC) | Insertion site occupied in |
| ICE1 | ICE | FlaR (lmo1412) | FSL N3-165 |
| ICE2 | Integrated conjugative element (ICE) | guaA (lmo1096) | EGD-e (tn916-like: ICELm1) |
| LGI1 | Genomic Island | RNA methyltransferase, TrmA family-Fosfomycin resistance protein FosX | 08-5578, 08-5923 |
| P1 | Prophage | tRNA-Lys | L99, HCC23 (PSA-like) |
| P2 | Prophage | tRNA-Arg | 08-5578, 08-5923, FSL R2-561, FSL N1-017 (B025-like) |
| P3 | Prophage | ribosomal protein S9 | L99, HCC23 (A500-like) |
| P4 | Prophage | ComK | J0161, F6854, J2818, F6900, 10403S, 08-5578, 08-5923, EGD-e, FSL R2-561, FSL J1-194, FSL R2-503, FSL N1-017.Aureli 1997, H7858 (A118-like) |
| P5 | Prophage | tRNA-Thr-4 | J0161, F6854, J2818, F6900 (A118-like) |
| P6 | Prophage | tRNA-Ser | 08-5578, 08-5923 (A118-like) |
| P7 | Prophage | hypothetical protein LMHG_11046 | FSL N1-017 (B054-like) |
Selection of strains in which this insertion site is occupied by a mobile element or prophage. Between parenthesis resemblance to sequenced phages is indicated.
Figure 2Cumulative distribution of the accessory genome throughout the chromosome in L. monocytogenes (n = 21), Staphylococcus aureus (n = 17) and strains of the Bacillus cereus group (n = 16).
The circular genome position starts at the origin of replication, which is at 0 degrees.
Top 25 most abundant Gene Ontology (GO) terms which are significantly enriched in the accessory genome versus the core genome of Listeria monocytogenes.
| Original GO term | No. of genes in | P value | GO description | |
| Core genome | Accessory genome | |||
| GO:0009401 | 1690 | 411 | 2.04E-45 | phosphoenolpyruvate-dependent sugar phosphotransferase system |
| GO:0008982 | 1235 | 335 | 1.10E-44 | protein-N(PI)-phosphohistidine-sugar phosphotransferase activity |
| GO:0005351 | 1020 | 333 | 1.35E-59 | sugar:hydrogen symporter activity |
| GO:0009986 | 302 | 246 | 2.81E-103 | cell surface |
| GO:0005618 | 363 | 233 | 1.25E-82 | cell wall |
| GO:0043565 | 533 | 150 | 6.87E-20 | sequence-specific DNA binding |
| GO:0009273 | 178 | 124 | 1.52E-45 | peptidoglycan-based cell wall biogenesis |
| GO:0015074 | 42 | 101 | 4.78E-67 | DNA integration |
| GO:0005576 | 169 | 70 | 4.30E-15 | extracellular region |
| GO:0004802 | 0 | 68 | 1.48E-67 | transketolase activity |
| GO:0005518 | 42 | 64 | 2.42E-35 | collagen binding |
| GO:0005529 | 250 | 60 | 1.53E-04 | sugar binding |
| GO:0004803 | 20 | 53 | 3.15E-35 | transposase activity |
| GO:0006313 | 62 | 53 | 2.95E-21 | transposition, DNA-mediated |
| GO:0004351 | 0 | 51 | 5.75E-50 | glutamate decarboxylase activity |
| GO:0006536 | 42 | 51 | 5.63E-25 | glutamate metabolic process |
| GO:0015755 | 104 | 48 | 3.86E-11 | fructose transport |
| GO:0015573 | 125 | 45 | 2.12E-07 | beta-glucoside transmembrane transporter activity |
| GO:0008861 | 0 | 42 | 1.15E-40 | formate C-acetyltransferase activity |
| GO:0000150 | 0 | 41 | 1.24E-39 | recombinase activity |
| GO:0008706 | 42 | 40 | 1.04E-16 | 6-phospho-beta-glucosidase activity |
| GO:0043624 | 126 | 40 | 4.22E-05 | cellular protein complex disassembly |
| GO:0047632 | 0 | 38 | 1.56E-36 | agmatine deiminase activity |
| GO:0006306 | 21 | 36 | 5.30E-20 | DNA methylation |
| GO:0006323 | 42 | 35 | 3.70E-13 | DNA packaging |
P values were calculated using Fisher’s exact test and Bonferroni corrected.
Accessory genome loci that distinguish lineages I and II.
| Presence in Lineage | |||||
| I | II | III | IV | Loci | Putative Function |
| + | – | + | + | LMOf2365_0374 | Internalin |
| + | – | +/− | + | LMOf2365_0413-0417 | ABC-type antimicrobial peptide transport system, cell-wall-anchored protein |
| + | – | – | + | LMOf2365_0693-0694 | Cell-wall-anchored proteins |
| + | – | + | + | LMOf2365_1131 | Unknown |
| + | – | + | + | LMOf2365_1142-1143 | Unknown |
| + | – | +/− | +/− | LMOf2365_1252-1254 | Internalin, cell-wall-anchored |
| + | – | +/− | – | LMOf2365_1681-1683 | N-acetylmuramic acid specific PTS |
| + | – | – | – | LMOf2365_2059 | Regulatory protein |
| + | – | – | + | LMOf2365_2361 | cAMP-binding protein |
| + | – | + | + | LMOf2365_2416 | Internalin |
| + | – | – | – | LMOf2365_2638 | Cell-wall-anchored protein |
| – | + | +/− | – | lmo0147 | Unknown |
| – | + | – | + | lmo0171 | Internalin, cell-wall-anchored |
| – | + | – | – | lmo0332 | Unknown |
| – | + | +/− | – | lmo0341 | Bacteriocin Immunity protein |
| – | + | +/− | – | lmo0421-0423 | Cell division, lineage-specific thermal regulator protein, RNA polymerase factor sigma C |
| lmo0525 | Unknown | ||||
| – | + | – | – | lmo0734-0739 | PTS, putative pentose phosphate specific |
| – | + | +/− | – | lmo0749-750 | putative regulatory protein |
| – | + | + | – | lmo0780 | Unknown |
| – | + | – | – | lmo1060-1063 | heavy metal associated two component response system and ABC transporter |
| – | + | + | + | lmo1125 | Unknown |
| – | + | + | + | lmo1289 | Internalin-like protein, Cell-wall-anchored |
| – | + | + | + | lmo1307 | Unknown |
| – | + | – | – | lmo1968-1974 | Creatinine amidohydrolase, KDPG and KHGaldolase, L-ascorbate specific PTS |
| – | + | + | – | lmo2169 | Unknown |
| – | + | +/− | – | lmo2576 | Putative collagen adhesion protein, cell-wall-anchored |
| – | + | + | + | lmo2644 | Regulation of 1,3-beta-glucan synthase |
| – | + | – | – | lmo2686 | Unknown |
| – | + | +/− | – | lmo2786-2788 | glucose–glucoside (Glc) family PTS |
+ = present, – = absent, +/− = present in some strains.
The putative function is inferred from the initial gene annotation.
Presence/absence of orthologs in each of the four lineages is listed, as well as putative function and the locus identifier(s) in the reference genome, either F2365 (lineage I) or EGDe (lineage II).
Figure 3Clade membership plot of individual genes plotted against the genome of L. monocytogenes F2365.
The order of genome rings is listed in the circle center, with F2365 being the outermost ring. The 7 outermost rings represent lineage I (serotype 4b and 1/2b), the next three rings represent lineage III and lineage IV strains (serotype 4a and 4c), and the last 11 rings represent lineage II strains (serotype 1/2a, 1/2c, and 3a). Clade membership of the individual genes is indicated by color; blue indicates lineage II, red indicates lineage I, and gray is unresolved membership. The two O-antigen gene clusters are highlighted in green and yellow. Genes in these clusters found in serotype 1/2b lineage I cluster phylogenetically with orthologs found in lineage II clade.
Figure 4Synteny and gene-specific phylogenetic history of the two O-antigen specific gene clusters.
The organismal phylogeny of the genus Listeria is shown in the upper panel (A), while the syntenic relationships of the two O-antigen gene clusters between the two major serotype divisions and the phylogenetic tree based on a representative serotype specific gene are shown in the two lower panels (B and C). Genes are colored by their phylogenetic histories: Serotype-specific genes (i.e., genes found only in specific serotypes) are colored green, while genes displaying an organismal phylogeny across the Listera genus are colored blue. Genes which follow a serotype-related phylogeny across Listeria are shown in orange. Values on the branches represent bootstrap values based on 100 bootstrap replicates. The organismal tree is based on a 10 locus multi-locus sequence analysis as described in Den Bakker et al. [52]. The topology of this tree is congruent with a tree based on the MLST scheme used in Ragon et al. [6].
Figure 5Phylogenetic reconstruction of serotype evolution in Listeria. Serotype 4 is shown in red while serotype 1/2 is shown in green.
This construction suggests that serotype 1/2 genes were horizontally transferred from L. seeligeri to an ancestor of L. monocytogenes lineages I and II. The origin of the serotype 1/2 cluster is unclear, we hypothesize that this cluster putatively originated in the most recent common ancestor of the L. seeligeri and L. ivanovii clade (as indicated by the dashed line). Serotype 4 genes appear to be largely inherited by vertical descent, except for a lateral transfer of genes from L. welshimeri into some strains of L. monocytogenes lineage III (dotted red line).