| Literature DB >> 19450271 |
Alexandre P Francisco1, Miguel Bugalho, Mário Ramirez, João A Carriço.
Abstract
BACKGROUND: Multilocus Sequence Typing (MLST) is a frequently used typing method for the analysis of the clonal relationships among strains of several clinically relevant microbial species. MLST is based on the sequence of housekeeping genes that result in each strain having a distinct numerical allelic profile, which is abbreviated to a unique identifier: the sequence type (ST). The relatedness between two strains can then be inferred by the differences between allelic profiles. For a more comprehensive analysis of the possible patterns of evolutionary descent, a set of rules were proposed and implemented in the eBURST algorithm. These rules allow the division of a data set into several clusters of related strains, dubbed clonal complexes, by implementing a simple model of clonal expansion and diversification. Within each clonal complex, the rules identify which links between STs correspond to the most probable pattern of descent. However, the eBURST algorithm is not globally optimized, which can result in links, within the clonal complexes, that violate the rules proposed.Entities:
Mesh:
Year: 2009 PMID: 19450271 PMCID: PMC2705362 DOI: 10.1186/1471-2105-10-152
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Comparison between eBURST and the proposed implementation.
| # diffs | Group | Ties | Links | ||||
| Dataset | founder | size | # | breaker | created | deleted | |
| 1 | 17 | 268 | 3 | SLV | 50–177 | 50–204 | |
| 3 | 124 | 78 | 3 | SLV | 392–2803 | 440–2803 | |
| 138 | 189 | 14 | SLV | 171–361 | 338–3163 | ||
| 217 | 33 | 2 | SLV | 618–3577 | 1325–1331 | ||
| 4 | 48 | 348 | 3 | SLV | 435–667 | 116–435 | |
| 48 | 348 | 53 | DLV | 211–300 | 56–99 | ||
| 48 | 348 | 3 | SLV | 70–290 | 66–67 | ||
| 48 | 348 | 4 | DLV | 24–643 | 570–643 | ||
| 5 | 21 | 849 | 29 | SLV | 104–492 | 474–577 | |
| 21 | 849 | 14 | SLV | 353–462 | 2395–2517 | ||
| 21 | 849 | 2 | SLV | 2141–2842 | 1076–2951 | ||
| 21 | 849 | 4 | SLV | 824–2141 | 878–2141 | ||
| 177 | 53 | 2 | SLV | 1022–1503 | 1387–2162 | ||
| 7 | 11 | 296 | 18 | SLV | 8–1058 | 66–1058 | |
| 11 | 296 | 1 | SLV | 10–2174 | 10–5091 | ||
| 22 | 344 | 23 | SLV | 23–1062 | 1062–1625 | ||
| 60 | 164 | 9 | SLV | 1157–1421 | 1421–1649 | ||
| 269 | 322 | 21 | SLV | 275–352 | 352–1163 | ||
| 1583 | 67 | 1 | SLV | 1905–6717 | 1579–1901 | ||
| 1583 | 67 | 4 | SLV | 1590–1928 | 1599–1903 | ||
The following data sets presented no differences: Helicobacter pylori, Streptococcus pyogenes, Staphylococcus aureus, Bacillus cereus, Pseudomonas aeruginosa, Enterococcus faecalis, Klebsiella pneumoniae, Campylobacter upsaliensis, Streptococcus suis, Haemophilus influenzae, Streptococcus uberis, Staphylococcus epidermidis and Streptococcus agalactiae.
Figure 1Partial snapshot of Burkholderia pseudomallei CC 48. Partial snapshot of Burkholderia pseudomallei CC 48 highlighting one of the differences (bold green lines) between current eBURST implementation (dashed lines) (ST 99 – ST56) and the proposed algorithm (full lines)(ST211 – ST300). The gray interrupted line represents the decision in the tiebreak that needs to be taken. The tiebreak level was at DLV level indicated by the green color of the link. See text for further details.
Figure 2Partial snapshot of Burkholderia pseudomallei CC 48. Partial snapshot of Burkholderia pseudomallei CC 48 highlighting one of the differences (bold blue lines) between current eBURST implementation (dashed lines)(ST66–ST67) and the proposed algorithm (full lines)(ST70–ST290). The gray interrupted line represents the decision in the tiebreak that needs to be taken. The tiebreak level for this example was at SLV indicated by the blue color of the link. See text for further details.
Figure 3Clonal Complex 17 of Enterococcus faecium. Representation of the largest clonal complex for Enterococcus faecium, with ST17 as the determined founder.
Figure 4Clonal Complex 48 of Burkholderia pseudomallei. Representation of the largest clonal complex for Burkholderia pseudomallei, with ST48 as the determined founder.
Figure 5Clonal Complex 41 of Neisseria spp. Representation of the largest clonal complex for Neisseria spp., with ST41 as the determined founder.