| Literature DB >> 26484097 |
Debarati Paul1, Chelsea Steele2, Janet R Donaldson3, Michelle M Banes2, Ranjit Kumar2, Susan M Bridges4, Mark Arick5, Mark L Lawrence2.
Abstract
More than 98% of reported human listeriosis cases are caused by specific serotypes within genetic lineages I and II. The genome sequence of Listeria monocytogenes lineage III strain HCC23 (serotype 4a) enables whole genomic comparisons across all three L. monocytogenes lineages. Protein cluster analysis indicated that strain HCC23 has the most unique protein pairs with nonpathogenic species Listeria innocua. Orthology analysis of the genome sequences of representative strains from the three L. monocytogenes genetic lineages and L. innocua (CLIP11262) identified 319 proteins unique to nonpathogenic strains HCC23 and CLIP11262 and 58 proteins unique to pathogenic strains F2365 and EGD-e. BLAST comparison of these proteins with all the sequenced L. monocytogenes and L. innocua revealed 126 proteins unique to serotype 4a and/or L. innocua; 14 proteins were only found in pathogenic serotypes. Some of the 58 proteins unique to pathogenic strains F2365 and EGD-e were previously published and are already known to contribute to listerial virulence.Entities:
Keywords: Comparative genomics; Genetic lineage; Listeria monocytogenes; Virulence
Year: 2014 PMID: 26484097 PMCID: PMC4536003 DOI: 10.1016/j.gdata.2014.06.010
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
List of L. monocytogenes and L. innocua strains used for BLAST analysis.
| 1/2a | 1/2b | 1/2c | 3a | 3b | 3c | 4a | 4b | 4c | 4e | 7 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| EGD-e | SLCC2755 | FSLR2-561 | Finland 1998 | SLCC2540 | SLCC2479 | HCC23 | 07PF0776 | SLCC2376 | SLCC2378 | SLCC2482 | CLIP 11262 |
| La111 | FSL J1-175 | SLCC2372 | SLCC7179 | L99 | ATCC 19117 | FSL J2-071 | FSL J1-023 | ||||
| 08-5578 | FSL J1-194 | LO28 | M7 | J1-220 | FSL S4-378 | ||||||
| 08-5923 | FSL J2-064 | FSL J1-208 | J1816 | ATCC 33091 | |||||||
| 10403S | FSL R2-503 | FSL F2-208 | L312 | ||||||||
| J0161 | CLIP 80459 | ||||||||||
| N53-1 | F2365 | ||||||||||
| SLCC5850 | LL195 | ||||||||||
| F6900 | FSL N1-017 | ||||||||||
| FSL F2-515 | HPB2262 | ||||||||||
| FSL J2-003 | H7858 | ||||||||||
| FSL N3-165 | |||||||||||
| J2818 | |||||||||||
| F6854 | |||||||||||
| NCCP No. 15743 |
General summaries of three L. monocytogenes genomes and a L. innocua genome.
| Strain | HCC23 | EGD-e | F2365 | CLIP11262 |
|---|---|---|---|---|
| Serotype | 4a | 1/2a | 4b | |
| Chromosome size | 2,976,212 | 2,944,528 | 2,905,187 | 3,093,113 |
| G + C content (%) | 38.2 | 37.98 | 38.04 | 37.38 |
| No. of CDSs | 2974 | 2846 | 2821 | 2968 |
| No. of rRNA genes | 18 | 18 | 18 | 18 |
| No. of tRNA genes | 67 | 67 | 67 | 66 |
Fig. 1Circular map of the Listeria monocytogenes serotype 4a strain HCC23 genome. From outside to inside, the circles represent: 1) COG group classification of proteins encoded on the positive strand, 2) CDS, tRNA, and rRNA on the positive strand, 3) CDS, tRNA and rRNA on the negative strand, 4) COG group classifications of proteins encoded on the negative strand, 5) BLAST results with L. monocytogenes serotype 1/2a strain EGD-e, 6) BLAST results with L. monocytogenes serotype 4b strain F2365, 7) G + C content, and 8) GC skew. BLAST results identified 11 clusters of strain HCC23-specific genes (labeled 1–11).
COG group classifications:.
Energy production and conversion
Cell cycle control mitosis and meiosis
Amino acid transport and metabolism
Nucleotide transport and metabolism
Carbohydrate metabolism and transport
Coenzyme transport and metabolism
Lipid transport and metabolism
Translation
Transcription
DNA replication, recombination, and repair
Cell wall/membrane biogenesis
Cell motility
Posttranslational modification, protein turnover
Inorganic ion transport and metabolism
Secondary metabolism biosynthesis, transport and catabolism
Function unknown
Signal transduction mechanisms
Intracellular trafficking and secretion
Defense mechanisms
Fig. 2Visualization of synteny analysis of six Listeria strains. Parallel axes with lengths corresponding to the genome sizes are used to represent each genome, and a ribbon corresponding to the width of each gene is drawn to connect corresponding genes on the parallel axes. The use of a color spectrum across the genome is used to facilitate visualization of inversions, deletions, and insertions.
Number of CDS and orthologous clusters present in six Listeria strains.
| Gene category | Number of orthologous | Number of CDS (% of proteins) |
|---|---|---|
| Core | 2,231 (56.5) | 13,473 (78.8) |
| Distributed | 734 (18.6) | 2,641 (15.4) |
| Unique | 984 (24.9) | 984 (5.8) |
| Total | 3,949 (100) | 17,098 (100) |
Core clusters: clusters that contain an orthologous protein from all six strains.
Distributed clusters: clusters that contain orthologous proteins from more than one but not all strains.
Unique clusters: clusters containing only one protein from a single strain.
Numbers of orthologous clusters for individual Listeria strains.
| Strain name | Number of orthologous clusters | Percent unique clusters | Percent distributed clusters |
|---|---|---|---|
| Li_11262 | 2666 | 8.8 | 14.2 |
| Lm_81459 | 2713 | 0.9 | 17.1 |
| Lm_EGD-e | 2712 | 4.1 | 16.9 |
| Lw_SLCC5334 | 2461 | 11.6 | 7.4 |
| Lm_HCC23 | 2687 | 9.3 | 14.7 |
| Lm_F2365 | 2715 | 2.8 | 17.2 |
‡ Percentage of distributed orthologous clusters relative to the total number of orthologous clusters in each strain.
Total number of orthologous clusters present in each strain.
Percentage of unique orthologous clusters relative to the total number of orthologous clusters in each strain.
Fig. 3Protein cluster analysis of six Listeria strains. Shared clusters are the number of protein clusters shared by both strains. Different clusters are the number of protein clusters present in only one of the strains. Pair unique clusters are the number of protein clusters present in these two strains and no other strains.
Sequence identity of some virulence related and surface proteins in L. monocytogenes strains EGD-e and F2365 compared to HCC23.
| Gene name | EGD-e | F2365 |
|---|---|---|
| PC-PLC | 96.9 | 98.9 |
| PlcA | 97.5 | 97.2 |
| ActA | 92.2 | 96.2 |
| PrfA | 99.2 | 99.2 |
| InlA | 97.6 | 99.1 |
| InlB | 92.4 | 96.7 |
| Ami | 98.6 | 98.1 |
| Lyt-G | 90.3 | 90.7 |
| Pbp-3 | 98.4 | 98.7 |
| Pbp-5 | 95.3 | 95.7 |
| GtcA | 84.9 | 98.3 |
| GltC | 100 | 99.7 |
Percent identity compared to strain HCC23 ortholog based on Clustal W alignment using MegAlign (Lasergene).