| Literature DB >> 23009728 |
Jon Ahlinder1, Caroline Öhrman, Kerstin Svensson, Petter Lindgren, Anders Johansson, Mats Forsman, Pär Larsson, Andreas Sjödin.
Abstract
BACKGROUND: Recent advances in sequencing technologies offer promising tools for generating large numbers of genomes, larger typing databases and improved mapping of environmental bacterial diversity. However, DNA-based methods for the detection of Francisella were developed with limited knowledge about genetic diversity. This, together with the high sequence identity between several Francisella species, means there is a high risk of false identification and detection of the highly virulent pathogen Francisella tularensis. Moreover, phylogenetic reconstructions using single or limited numbers of marker sequences often result in incorrect tree topologies and inferred evolutionary distances. The recent growth in publicly accessible whole-genome sequences now allows evaluation of published genetic markers to determine optimal combinations of markers that minimise both time and laboratory costs.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23009728 PMCID: PMC3575276 DOI: 10.1186/1471-2180-12-220
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Genomes sequences included in the study
| FSC200 | 16087 | |
| FSC208 | 73467 | |
| RC503 | 30637 | |
| LVS | 16421 | |
| FSC539 | 73393 | |
| OR96-246 | 30669 | |
| FTA | 20197 | |
| URFT1 | 19645 | |
| MI00-1730 | 30635 | |
| OSU18 | 17265 | |
| FSC021 | 73369 | |
| FSC022 | 19015 | |
| FSC147 | 19571 | |
| FSC148 | 73379 | |
| FSC054 | 73375 | |
| ATCC6223 | 30629 | |
| FSC033 | 19017 | |
| MA00-2987 | 30443 | |
| FSC198 | 17375 | |
| SCHUS4 (FSC237) | 9 | |
| FTE | 30119 | |
| U112 | 16088 | |
| FTG | 30447 | |
| GA99-3549 | 19019 | |
| FSC160 | 73385 | |
| FSC159 | 73383 | |
| GA99-3548 | 19573 | |
| FSC454 | 73391 | |
| FSC845 | 73171 | |
| FSC770 | 73389 | |
| FSC771 | 73447 | |
| FSC846 | 73463 | |
| FSC769 | 73397 | |
| FSC774 | 73457 | |
| FDC178 | 73465 | |
| FSC772 | 73449 | |
| FSC154 | 73381 | |
| FSC145 | 73377 | |
| ATCC25015 | 32411 | |
| FSC037 | 73371 | |
| FSC039 | 73373 | |
| ATCC25017 | 27853 |
Francisella genomes included in this study selected to represent the known diversity of Francisella: 22 strains representing the public health perspective of F. tularensis (clade 1) and 13 strains of F. noatunensis and F. philomiragia (clade 2) representing a fish farming industry and health perspective.
A list of the markers selected to represent published DNA-based markers for molecular PCR detection or phylogenetic identification targeting
| 01-16S | FTT_r04, FTT_r07, FTT_r10 | 1139 | 1311156-2294, 1378275–9413, 1771610-2748 | [ |
| 02-16 s + ItS + 23 s | FTT_r04, FTT_r07, FTT_r10 | 915 | 1311470-2371, 1378876–9490, 1771911-2825 | [ |
| 03-16 s + ItS + 23 s | FTT_r03-FTT_r04, FTT_r06-FTT_r07, FTT_r09-FTT_r10 | 948 | 1310519-1466, 1377638–8585, 1770973-1920 | [ |
| 04-16 s + ItS + 23 s | FTT_r03, FTT_r06, FTT_r09 | 925 | 1309613-10537, 1376732–7656, 1770067-991 | [ |
| 05-aroA | FTT_0588 | 650 | 608150-799 | [ |
| 06-atpA | FTT_0062 | 634 | 62762-3395 | [ |
| 07-dnaA | FTT_0001 | 618 | 303-920 | [ |
| 08-fabH | FTT_1373 | 1289 | 1418892-20155 | [ |
| 09-fopA | FTT_0583 | 886 | 599105-990 | [ |
| 10-fopA | FTT_0583 | 1068 | 599148-600215 | [ |
| 11-fopA-in | FTT_0583 | 404 | 599526-929 | [ |
| 12-fopA-out | FTT_0583 | 708 | 599428-600135 | [ |
| 13-fopA | FTT_0583 | 86 | 599767-852 | [ |
| 14-FtM19 | FTT_1472c | 250 | 1524132-381 | [ |
| 15-FtM19 | FTT_1472c | 316 | 1524066-381 | [ |
| 16-FTT0376 | FTT_0376 | 107 | 377718-824 | [ |
| 17-FTT0523 | FTT_0523 | 91 | 546620.712 | [ |
| 18-groEL | FTT_1696 | 803 | 1764659-5461 | [ |
| 19-iglC | FTT_1712c | 84 | 1792514-597 | [ |
| 20-ISFtu2b | FTT_1311 | 390 | 1335726-6115 | [ |
| 21-ISFtu2 | FTT_0099c | 97 | 103438-534c | [ |
| 22-lpnAb | FTT_0901 | 407 | 909857-10263 | [ |
| 23-lpnA | FTT_0901 | 93 | 910211-301 | [ |
| 24-lpnB | FTT_0904 | 252 | 911795-2046 | [ |
| 25-mdh | FTT_0535c | 715 | 556932-7646 | [ |
| 26-mutS | FTT_1499 | 495 | 1553224-3718 | [ |
| 27-parC | FTT_0396 | 643 | 397063-705 | [ |
| 28-pdpD | FTT_1360c, FTT_1715c | 136 | 1403503-638, 1796838-973 | [ |
| 29-pgm | FTT_0414 | 650 | 425033-682 | [ |
| 30-prfB | FTT_0191 | 376 | 207686-8061 | [ |
| 31-putA | FTT_1150c | 415 | 1165411-825 | [ |
| 32-rpoA | FTT_0350, FTT_1442c | 914 | 349619-50532 | [ |
| 33-rpoB | FTT_0144 | 262 | 156309-570 | [ |
| 34-sdhA | FTT_0074 | 223 | 75065-287 | [ |
| 35-tpiA | FTT_0080 | 484 | 83679-4162 | [ |
| 36-tpiA | FTT_0080 | 559 | 83657-4215 | [ |
| 37-trpE | FTT_1802c | 517 | 1888928-9444 | [ |
| 38-uup | FTT_0445 | 645 | 459229-873 | [ |
Amplicon locus tag, length and location in genome of F. tularensis strain SCHU S4.
b Primer sequence of primer Tuf1705 in marker 20-ISFtu2 and TUL-435 in marker 22-lpnA seem to be incorrectly specified in [56]. See [37] and [59] for the correct primer sequences.
c Insertion element present in multiple copies in reference. Only first position and gene specified.
Figure 1Overview of primer specificity. Weighted score of primer specificity calculated with penalties for mismatches and gaps, where zero indicates a perfect match. The first column of each marker represents the forward primer score and the second represents the reverse primer score. The score was calculated with PrimerProspector as follows: 3’ mismatch, 1 penalty per mismatch (length of 3’ region was set to 5), non-3’ mismatch, (0.4 penalty per mismatch), last base mismatch (penalty 3 per mismatch), non 3’ gap (penalty 1 per gap) and 3’ gap (penalty 3 per gap).
Summary of estimated amplification performance of primer pairs representing published DNA-based markers targeting
| 01-16S, 03-16S-Itr-23S, 04-16S-Itr-23S, 08-fabH, 18-groEL, 23-lpnAa, 25-mdh, 30-prfb and 35-tpiA. | |
| 05-aroA, 07-dnaA, 11-fopA-inaa, 12-fopA-outa, 13-fopAa, 14-FTM19b, 15-FTM19, 19-iglCac, 22-lpnAa, 26-mutS, 27-parCc, 31-putA, 36-tpiA, 37-trpE and 38-uup. | |
| Amplifies clade 1 but no other | 11-fopA-ina, 14-FtM19 and 15-FtM19a |
| Amplifies clade 1 as well as | 05-aroA, 07-dnaA, 12-fopA-outa, 27-parCc and 36-tpiA. |
| Amplifies clade 1 as well as | 13-fopAa, 19-iglCc, 22-lpnA, 31-putA, 37-trpE and 38-uup. |
| Amplifies clade 1 as well as | 26-mutS |
| 10-fopA | |
| 24-lpnB | |
| 02-16S-Itr-23S, 06-atpA, 09-fopA, 29-pgm, 32-rpoA, 33-rpoB and 34-sdhA. | |
| Amplifies all except | 09-fopA |
| Amplifies all except | 33-rpoB |
| Amplifies all except | 34-sdhA |
| Amplifies all except | 02-16S-Itr-23S, 29-pgm |
| Amplifies all except | 06-atpA |
| Amplifies all except | 32-rpoA |
| 16-FTT0376a, 17-FTT0523a, 20-ISFtu2b and 28-pdpDb. | |
| Amplifies only | 16-FTT0376a and 17-FTT0523a |
| Amplifies | 28-pdpDb |
| Amplifies isolates from all clade 1 species as well as | 20-ISFtu2b |
| 21-ISFtu2a |
Successful amplification was defined as having a primer score below two in both the forward and reverse primers.
a Have associated TaqMan probe which is not considered here. bDetection by variable-length amplicon which is not considered here.
cScore of F.noatunensis subsp orientalis <2.
Figure 2Whole-genome SNP phylogeny. The whole-genome phylogeny for 37 Francisella strains obtained with model averaging implemented in jModelTest using PhyML software. The removed part of the branches connecting clade 1 and 2 covers a genetic distance of 0.03.
Figure 3Single-marker phylogenies. Single-marker phylogeny of the Francisella genus: (A) highest ranked marker 08-fabH, (B) lowest ranked marker 33-rpoB, and (C) whole-genome phylogeny. Rank is based on difference in resolution between alternative and whole-genome topology.
Figure 4The impact of the number of markers on phylogenetic parameters. The effect of concatenating sequence markers on topology (of the Francisella genus) in comparison with the whole-genome tree for (A) incongruence score, (B) resolution score, and (C) average bootstrap support from 1000 replicates. The results of the optimised topology comparisons are shown as crosses.
Summary of the optimisation procedure for resolution (res) and congruence (inc) in the genus where the consensus set of markers are highlighted according to how often they are selected in the optimal partitions of markers; position 1 corresponds to the most represented marker
| | ||||||||
|---|---|---|---|---|---|---|---|---|
| 2 | res | 08-fabH | 35-tpiA | | | | | |
| | inc | 08-fabH | 35-tpiA | | | | | |
| 3 | res | 08-fabH | 35-tpiA | 24-lpnB | | | | |
| | inc | 08-fabH | 35-tpiA | 02-16 s | | | | |
| 4 | res | 08-fabH | 35-tpiA | 24-lpnB | 27-parC | | | |
| | inc | 35-tpiA | 08-fabH | 01-16S | 02-16 s | | | |
| 5 | res | 08-fabH | 35-tpiA | 24-lpnB | 27-parC | 22-lpnA | | |
| | inc | 35-tpiA | 08-fabH | 24-lpnB | 27-parC | 33-rpoB | | |
| 6 | res | 08-fabH | 24-lpnB | 35-tpiA | 27-parC | 22-lpnA | 25-mdh | |
| | inc | 35-tpiA | 08-fabH | 24-lpnB | 04-16 s | 01-16S | 33-rpoB | |
| 7 | res | 08-fabH | 35-tpiA | 24-lpnB | 26-mutS | 27-parC | 18-groEL | 22-lpnA |
| inc | 35-tpiA | 08-fabH | 01-16S | 04-16 s | 24-lpnB | 27-parC | 25-mdh |
Markers 02-16 s + ItS + 23 s and 04-16 s + ItS + 23 s are abbreviated as 02-16 s and 04-16 s, respectively.