| Literature DB >> 23094048 |
Sjoerd M Euser1, Nico J Nagelkerke, Frank Schuren, Ruud Jansen, Jeroen W Den Boer.
Abstract
BACKGROUND: Legionella, the causative agent for Legionnaires' disease, is ubiquitous in both natural and man-made aquatic environments. The distribution of Legionella genotypes within clinical strains is significantly different from that found in environmental strains. Developing novel genotypic methods that offer the ability to distinguish clinical from environmental strains could help to focus on more relevant (virulent) Legionella species in control efforts. Mixed-genome microarray data can be used to perform a comparative-genome analysis of strain collections, and advanced statistical approaches, such as the Random Forest algorithm are available to process these data.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23094048 PMCID: PMC3475688 DOI: 10.1371/journal.pone.0047437
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Prediction results for the 111 strains in the eleventh training dataset.
| Training dataset | Origin of strains | Total | |
| Clinical | Environmental | ||
| Prediction clinical | 23 | 22 | 45 |
| Prediction environmental | 1 | 65 | 66 |
| Total | 24 | 87 | 111 |
| Sensitivity | 96% | ||
| Specificity | 75% | ||
| PPV | 51% | ||
| NPV | 98% | ||
PPV = positive predictive value; NPV = negative predictive value.
Prediction results for the 111 strains in the eleventh test dataset.
| Test dataset | Origin of strains | Total | |
| Clinical | Environmental | ||
| Prediction clinical | 24 | 29 | 53 |
| Prediction environmental | 1 | 57 | 58 |
| Total | 25 | 86 | 111 |
| Sensitivity | 96% | ||
| Specificity | 66% | ||
| PPV | 45% | ||
| NPV | 98% | ||
PPV = positive predictive value; NPV = negative predictive value.
Figure 1Receiver operating characteristic (ROC) curve for the eleventh test dataset (n = 111).
This figure shows the performance of the logistic regression model with the four predictive markers (7B8, 15D6, 16E4, 33F8) for the eleventh test dataset. The proportion of the predicted clinical strains out of the truly clinical strains (true positive rate or sensitivity) is plotted against the proportion of the predicted clinical strains out of the truly environmental strains (false positive rate or 1-specificity).
Sequence location of the four predictive markers in ten available Legionella genomes.
| Legionella strains | Predictive markers | |||
| 7B8 | 15D6 | 16E4 | 33F8 | |
| Paris | n/a | n/a | 915834–916597 | 1812491–1813032 |
| Philadelphia | n/a | n/a | 823175–823938 | 1828523–1827989 |
| Lens | n/a | n/a | 895789–896552 | 1794085–1793544 |
| Corby | n/a | n/a | 931999–932762 | 1898600–1898059 |
| Alcoy | n/a | n/a | 919333–920096 | 1893679–1893138 |
| 130b | n/a | n/a | 899720–900483 | 1786224–1785683 |
| Lorraine | 713281–713837 | 2631852–2632252 | 856273–857036 | 1732233–1732774 |
| HL 060401035 | n/a | n/a | 882699–883462 | 1928802–1929343 |
| ATCC 43290 | n/a | n/a | 825518–826281 | 1744381–1744915 |
| L. longbeachae NSW150 | n/a | n/a | n/a | 2612495–2611967 |
n/a = not applicable.