Wen Zou, Hung-Chia Chen, Kelley B Hise, Hailin Tang, Steven L Foley, Joe Meehan, Wei-Jiun Lin, Rajesh Nayak, Joshua Xu, Hong Fang, James J Chen.
Abstract
A database was constructed consisting of 45,923 Salmonella pulsed-field gel electrophoresis (PFGE) patterns. The patterns, randomly selected from all submissions to CDC PulseNet during 2005 to 2010, included the 20 most frequent serotypes and 12 less frequent serotypes. Meta-analysis was applied to all of the PFGE patterns in the database. In the range of 20 to 1100 kb, serotype Enteritidis averaged the fewest bands (12) and Paratyphi A the most (19), with most of the 32 serotypes in the 13-15 range. The 10 most frequent bands for each of the 32 serotypes were sorted and distinguished, and the results were in concordance with those from distance matrix and two-way hierarchical cluster analyses of the patterns in the database. The hierarchical cluster analysis divided the 32 serotypes into three major groups according to dissimilarity measures, and revealed for the first time the similarities among the PFGE patterns of serotype Saintpaul to serotypes Typhimurium, Typhimurium var. 5-, and I 4,[5],12:i:-; of serotype Hadar to serotype Infantis; and of serotype Muenchen to serotype Newport. The results of the meta-analysis indicated that the pattern similarities/dissimilarities determined the serotype discrimination of the PFGE method, and that the possible PFGE markers may have utility for serotype identification. The presence of distinct, serotype-specific patterns may provide useful information to aid in determining the distribution of serotypes in the population and potentially reduce the need for laborious analyses, such as traditional serotyping.
Year: 2013 PMID: 23516614 PMCID: PMC3597626 DOI: 10.1371/journal.pone.0059224
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The composition of the database of Salmonella PFGE fingerprints.
| Serotypes | Number of patterns | Ranks | Total/1996−2009 | Percent/1996−2009 |
| Agona | 1954 | 14 | 7376 | 1.4 |
| Braenderup | 2008 | 13 | 7807 | 1.5 |
| Enteritidis | 2338 | 2 | 90328 | 17.4 |
| Hadar | 1981 | 19 | 5263 | 1.0 |
| Heidelberg | 2114 | 4 | 24819 | 4.8 |
| I 4,[5],12:i:- | 2281 | 11 | 7912 | 1.5 |
| Infantis | 2078 | 12 | 7857 | 1.5 |
| Javiana | 2102 | 5 | 19170 | 3.7 |
| Mississippi | 1999 | 16 | 5430 | 1.0 |
| Montevideo | 2041 | 6 | 12855 | 2.5 |
| Muenchen | 1970 | 8 | 10652 | 2.1 |
| Newport | 2005 | 3 | 44483 | 8.6 |
| Oranienburg | 1951 | 10 | 9042 | 1.7 |
| Paratyphi B var. L(+) tartrate+ | 2011 | 18 | 5305 | 1.0 |
| Poona | 1956 | 20 | 4101 | 0.8 |
| Saintpaul | 2252 | 9 | 9606 | 1.9 |
| Thompson | 2045 | 15 | 7208 | 1.4 |
| Typhi | 1941 | 17 | 5371 | 1.0 |
| Typhimurium | 2064 | 1 | 91028 | 17.6 |
| Typhimurium var. 5- | 2146 | 7 | 12688 | 2.4 |
| Sub Total | | | 388301 | 74.9 |
| Anatum | 478 | 23 | 2863 | 0.6 |
| Bareilly | 426 | 24 | 2829 | 0.5 |
| Berta | 502 | 21 | 3059 | 0.6 |
| Derby | 393 | 30 | 2080 | 0.4 |
| Hartford | 531 | 27 | 2431 | 0.5 |
| Litchfield | 401 | 28 | 2424 | 0.5 |
| Mbandaka | 432 | 25 | 2727 | 0.5 |
| Panama | 516 | 29 | 2206 | 0.4 |
| Paratyphi A | 135 | 35 | 1678 | 0.3 |
| Schwarzengrund | 225 | 26 | 2544 | 0.5 |
| Senftenberg | 189 | 32 | 1989 | 0.4 |
| Stanley | 460 | 22 | 2914 | 0.6 |
| Sub Total | | | 418045 | 80.6 |
| Total | 45923 | Total/1996−2009 | 518419 | 100.0 |
The table shows the information on Salmonella isolates from human sources during 1996−2009, which was derived and calculated from the Salmonella Annual Summary of 2006 (http://www.cdc.gov/ncidod/dbmd/phlisdata/salmtab/2006/SalmonellaTable1_2006.pdf) and the Salmonella Annual Summary Tables 2009 (http://www.cdc.gov/ncezid/dfwed/PDFs/SalmonellaAnnualSummaryTables2009.pdf).
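The percentage column appears to be each serotype's 1996−2009 isolate count divided by the overall total of 518,419. The short sketch below checks a few rows copied from the table; the function name `percent_of_total` is illustrative and not part of the study.

```python
# Overall number of human-source Salmonella isolates, 1996-2009 (the "Total" row).
GRAND_TOTAL = 518419

# A few rows copied from the table: serotype -> isolates reported 1996-2009.
totals = {
    "Typhimurium": 91028,
    "Enteritidis": 90328,
    "Newport": 44483,
    "Poona": 4101,
}


def percent_of_total(count, grand_total=GRAND_TOTAL):
    """Share of all isolates in percent, rounded to one decimal as in the table."""
    return round(100.0 * count / grand_total, 1)


for serotype, count in totals.items():
    print(serotype, percent_of_total(count))

# The two "Sub Total" rows can be checked the same way.
print("Top 20 serotypes:", percent_of_total(388301))
print("All 32 serotypes:", percent_of_total(418045))
```

The second subtotal is cumulative: it covers the 20 most frequent serotypes plus the 12 less frequent ones.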
Figure 1. Band numbers of various Salmonella serotypes in the database.
The number under each bar indicates the number of the bands, and the number on top of each bar shows the number of serotypes.
Top 10 most frequent bands and the percentages of 20 most frequent serotypes in the database.
| Top 10 bands (kb) | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| Agona | 290.79 (93%) | 21.33 (89%) | 103.77 (88%) | 247.23 (86%) | 334.37 (75%) | 357.52 (73%) | 256.7 (70%) | 710.89 (65%) | 322.42 (64%) | 538.28 (63%) |
| Braenderup | 103.77 (97%) | 75.75 (97%) | 666.12 (89%) | 308.85 (89%) | 334.37 (87%) | 21.33 (85%) | 247.2 (81%) | 32.79 (81%) | 168.29 (79%) | 459.88 (77%) |
| Enteritidis | 308.85 (98%) | 666.12 (90%) | 110.39 (86%) | 53.56 (86%) | 21.33 (86%) | 247.23 (77%) | 37.2 (77%) | 1037 (74%) | 290.79 (73%) | 175.00 (70%) |
| Hadar | 84.87 (98%) | 75.75 (97%) | 308.85 (93%) | 211.97 (92%) | 256.74 (90%) | 237.54 (89%) | 103.77 (89%) | 30.88 (84%) | 66.11 (83%) | 334.37 (76%) |
| Heidelberg | 459.88 (98%) | 411.77 (98%) | 103.77 (95%) | 666.12 (94%) | 70.79 (94%) | 61.22 (90%) | 357.52 (89%) | 21.33 (89%) | 42.79 (87%) | 25.36 (83%) |
| I 4,[5],12:i:- | 70.79 (96%) | 373.92 (95%) | 97.28 (94%) | 84.87 (94%) | 42.79 (93%) | 223.18 (92%) | 66.11 (92%) | 710.89 (90%) | 21.33 (89%) | 247.23 (68%) |
| Infantis | 290.79 (97%) | 84.87 (95%) | 103.77 (88%) | 21.33 (87%) | 66.11 (85%) | 308.85 (81%) | 392.06 (79%) | 168.29 (78%) | 75.75 (74%) | 211.97 (68%) |
| Javiana | 183.39 (95%) | 66.11 (92%) | 21.33 (88%) | 256.74 (83%) | 223.18 (81%) | 481.15 (79%) | 290.79 (74%) | 168.29 (71%) | 275.55 (68%) | 645.62 (57%) |
| Mississippi | 21.33 (89%) | 70.79 (77%) | 411.77 (70%) | 103.77 (70%) | 183.39 (68%) | 290.79 (60%) | 97.28 (48%) | 342.77 (46%) | 247.23 (46%) | 75.75 (45%) |
| Montevideo | 290.79 (95%) | 21.33 (88%) | 42.79 (87%) | 127.24 (81%) | 247.23 (77%) | 66.11 (74%) | 322.42 (63%) | 256.74 (60%) | 334.37 (56%) | 308.85 (54%) |
| Muenchen | 103.77 (93%) | 21.33 (86%) | 75.75 (85%) | 308.85 (79%) | 290.79 (76%) | 97.28 (70%) | 513.42 (63%) | 66.11 (54%) | 168.29 (53%) | 247.23 (49%) |
| Newport | 21.33 (89%) | 290.79 (85%) | 97.28 (85%) | 308.85 (78%) | 75.75 (78%) | 223.18 (76%) | 175.00 (73%) | 42.79 (71%) | 183.39 (67%) | 168.29 (61%) |
| Oranienburg | 290.79 (98%) | 308.85 (95%) | 42.79 (94%) | 481.15 (90%) | 127.24 (89%) | 110.39 (89%) | 21.33 (89%) | 136.07 (88%) | 411.77 (86%) | 256.74 (85%) |
| Paratyphi B var. L(+) tartrate+ | 308.85 (89%) | 21.33 (87%) | 290.79 (81%) | 247.23 (75%) | 75.75 (74%) | 103.77 (68%) | 42.79 (62%) | 66.11 (56%) | 168.29 (55%) | 194.6 (52%) |
| Poona | 275.55 (88%) | 256.74 (83%) | 66.11 (81%) | 21.33 (78%) | 223.18 (74%) | 194.6 (74%) | 308.85 (59%) | 237.54 (59%) | 32.79 (59%) | 373.92 (54%) |
| Saintpaul | 290.79 (98%) | 21.33 (84%) | 459.88 (83%) | 42.79 (82%) | 710.89 (80%) | 247.23 (79%) | 790.7 (75%) | 103.77 (69%) | 223.18 (67%) | 373.92 (64%) |
| Thompson | 75.75 (97%) | 459.88 (95%) | 290.79 (95%) | 168.29 (95%) | 66.11 (95%) | 61.22 (95%) | 481.15 (94%) | 103.77 (93%) | 21.33 (89%) | 322.42 (82%) |
| Typhi | 75.75 (95%) | 32.79 (95%) | 61.22 (92%) | 160.06 (89%) | 136.07 (89%) | 211.97 (88%) | 70.79 (87%) | 21.33 (85%) | 275.55 (83%) | 53.56 (79%) |
| Typhimurium | 373.92 (97%) | 70.79 (97%) | 42.79 (94%) | 290.79 (92%) | 223.18 (90%) | 97.28 (87%) | 21.33 (84%) | 666.12 (78%) | 66.11 (78%) | 25.36 (75%) |
| Typhimurium var. 5- | 373.92 (97%) | 70.79 (96%) | 42.79 (93%) | 223.18 (91%) | 290.79 (84%) | 21.33 (83%) | 247.23 (74%) | 84.87 (73%) | 97.28 (72%) | 25.36 (72%) |
Top 10 most frequent bands and the percentages of 12 less frequent serotypes in the database.
| Top 10 bands (kb) | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| Anatum | 97.28 (95%) | 411.77 (90%) | 75.75 (88%) | 194.6 (87%) | 275.55 (84%) | 66.11 (84%) | 42.79 (80%) | 21.33 (73%) | 666.12 (72%) | 61.22 (72%) |
| Bareilly | 84.87 (87%) | 42.79 (87%) | 21.33 (85%) | 290.79 (80%) | 308.85 (79%) | 223.18 (67%) | 247.23 (63%) | 97.28 (54%) | 256.74 (51%) | 160.06 (49%) |
| Berta | 211.97 (98%) | 308.85 (97%) | 103.77 (96%) | 175.00 (91%) | 247.23 (89%) | 21.33 (87%) | 70.79 (86%) | 666.12 (81%) | 560.06 (74%) | 710.89 (72%) |
| Derby | 275.55 (91%) | 256.74 (89%) | 118.78 (84%) | 223.18 (81%) | 32.79 (81%) | 308.85 (80%) | 21.33 (80%) | 247.23 (74%) | 61.22 (71%) | 1037 (69%) |
| Hartford | 290.79 (98%) | 84.87 (98%) | 75.75 (94%) | 136.07 (90%) | 21.33 (89%) | 513.42 (87%) | 334.37 (86%) | 145.81 (83%) | 183.39 (82%) | 168.29 (82%) |
| Litchfield | 127.24 (97%) | 308.85 (93%) | 75.75 (93%) | 21.33 (89%) | 168.29 (86%) | 175.00 (81%) | 290.79 (79%) | 183.39 (79%) | 850.72 (77%) | 790.7 (64%) |
| Mbandaka | 145.81 (98%) | 103.77 (95%) | 459.88 (94%) | 357.52 (92%) | 175.00 (87%) | 21.33 (87%) | 247.23 (84%) | 256.74 (77%) | 42.79 (71%) | 136.07 (60%) |
| Panama | 256.74 (94%) | 290.79 (93%) | 275.55 (88%) | 308.85 (86%) | 84.87 (85%) | 194.6 (78%) | 223.18 (76%) | 183.39 (72%) | 21.33 (71%) | 645.62 (59%) |
| Paratyphi A | 275.55 (89%) | 290.79 (87%) | 32.79 (83%) | 127.24 (82%) | 666.12 (81%) | 136.07 (81%) | 28.82 (81%) | 308.85 (79%) | 25.36 (76%) | 75.75 (75%) |
| Schwarzengrund | 194.6 (88%) | 75.75 (87%) | 21.33 (86%) | 47.45 (80%) | 103.77 (77%) | 322.42 (75%) | 290.79 (72%) | 30.88 (68%) | 1101 (56%) | 183.39 (56%) |
| Senftenberg | 247.23 (85%) | 710.89 (80%) | 97.28 (73%) | 21.33 (72%) | 70.79 (68%) | 160.06 (65%) | 61.22 (65%) | 168.29 (63%) | 290.79 (62%) | 322.42 (58%) |
| Stanley | 75.75 (97%) | 136.07 (93%) | 308.85 (92%) | 97.28 (92%) | 168.29 (91%) | 21.33 (87%) | 183.39 (85%) | 582.12 (81%) | 790.7 (73%) | 237.54 (73%) |
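A ranking like the ones in these two tables can be sketched as a simple frequency count: for each band position, count the patterns of a serotype that contain it, then keep the most frequent positions. The example below is a minimal illustration that assumes bands have already been assigned to standardized positions (in kb). The function name `top_bands` and the toy patterns are hypothetical and are not data from the study.

```python
from collections import Counter


def top_bands(patterns, k=10):
    """Return the k most frequent band sizes as (band, percent of patterns) pairs."""
    counts = Counter()
    for bands in patterns:
        counts.update(set(bands))  # count each band at most once per pattern
    n = len(patterns)
    # Sort by descending count; break ties by ascending band size.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [(band, round(100.0 * count / n)) for band, count in ranked[:k]]


# Hypothetical toy patterns (band sizes in kb); NOT data from the study.
toy_patterns = [
    [21.33, 70.79, 373.92],
    [21.33, 70.79, 290.79],
    [70.79, 373.92],
    [42.79, 70.79, 373.92],
]

print(top_bands(toy_patterns, k=3))
```

In this toy set, the 70.79 kb band occurs in all four patterns, so it ranks first.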
Figure 2. Distance matrix of 32 serotypes.
The heatmap shows the distance matrix, which presents the dissimilarity between any two patterns in the entire database. The dissimilarity of PFGE patterns between or within serotypes was calculated as the Jaccard distance, with values ranging from 0 (green) to 1 (red), as shown in the index.
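The Jaccard distance between two band sets is one minus the ratio of shared bands to all bands present in either set. The sketch below is only an illustration. As a shortcut, it compares the top-10 band lists from the tables above rather than individual PFGE patterns, which is what the figure is based on. The function name `jaccard_distance` is an assumption for this example.

```python
def jaccard_distance(bands_a, bands_b):
    """Jaccard distance: 1 - |A and B| / |A or B|, from 0 (identical) to 1 (disjoint)."""
    a, b = set(bands_a), set(bands_b)
    union = a | b
    if not union:
        return 0.0  # two empty band sets are treated as identical
    return 1.0 - len(a & b) / len(union)


# Top-10 band lists (kb) copied from the tables above.
typhimurium = [373.92, 70.79, 42.79, 290.79, 223.18, 97.28, 21.33, 666.12, 66.11, 25.36]
typhimurium_var5 = [373.92, 70.79, 42.79, 223.18, 290.79, 21.33, 247.23, 84.87, 97.28, 25.36]
enteritidis = [308.85, 666.12, 110.39, 53.56, 21.33, 247.23, 37.2, 1037, 290.79, 175.00]

print(jaccard_distance(typhimurium, typhimurium_var5))
print(jaccard_distance(typhimurium, enteritidis))
```

On these lists, Typhimurium comes out much closer to Typhimurium var. 5- than to Enteritidis. A comparison based on full patterns could give different values.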
Figure 3. Two-way hierarchical clustering analysis of the 32 serotypes in the database.
The color histogram shows the proportions of the bands present at every designated band location with values ranging from 0 to 1. The hierarchical cluster analysis was applied based on the dissimilarity measures of any two serotypes calculated by the Euclidean distance of the characteristic parameters. Both serotypes and band locations were clustered according to dissimilarity measures. The numbers 1, 2, and 3 and the letters A (a1, a2), B (b1, b2), and C (c1, c2) stand for the groups (and sub-groups) of band locations and serotypes.
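The clustering step can be sketched in pure Python as agglomerative clustering over band-proportion vectors with Euclidean distance. The caption does not state the linkage rule, so average linkage here is an assumption. The profile names and proportions are hypothetical toy values, not results from the study. The sketch clusters serotypes only; a two-way analysis would repeat the procedure on the transposed matrix to cluster band locations.

```python
import math
from itertools import combinations


def euclidean(u, v):
    """Euclidean distance between two equal-length proportion vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def average_linkage(members_a, members_b, profiles):
    """Mean pairwise distance between the members of two clusters (assumed linkage)."""
    dists = [euclidean(profiles[m], profiles[n]) for m in members_a for n in members_b]
    return sum(dists) / len(dists)


def agglomerate(profiles):
    """Repeatedly merge the two closest clusters; return the merge order."""
    clusters = {name: [name] for name in profiles}
    merges = []
    while len(clusters) > 1:
        a, b = min(
            combinations(sorted(clusters), 2),
            key=lambda pair: average_linkage(clusters[pair[0]], clusters[pair[1]], profiles),
        )
        merges.append((a, b))
        clusters[f"{a}+{b}"] = clusters.pop(a) + clusters.pop(b)
    return merges


# Hypothetical proportions of patterns showing a band at four band locations.
toy_profiles = {
    "A": [0.97, 0.97, 0.05, 0.10],
    "B": [0.97, 0.96, 0.08, 0.12],
    "C": [0.10, 0.05, 0.98, 0.90],
    "D": [0.15, 0.10, 0.90, 0.86],
}

for step in agglomerate(toy_profiles):
    print(step)
```

Profiles with similar band proportions merge first, which is the behavior the dendrogram groups in the figure reflect.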