| Literature DB >> 23815549 |
Jeroen W Den Boer1, Sjoerd M Euser, Nico J Nagelkerke, Frank Schuren, Sophie Jarraud, Jerome Etienne.
Abstract
BACKGROUND: Legionella is a water and soil bacterium that can infect humans, causing a pneumonia known as Legionnaires' disease. The pneumonia is almost exclusively caused by the species L. pneumophila, of which serogroup 1 is responsible for 90% of patients. Within serogroup 1, large differences in prevalence in clinical isolates have been described. A recent study, using a Dutch Legionella strain collection, identified five virulence associated markers. In our study, we verify whether these five Dutch markers can predict the patient or environmental origin of a French Legionella strain collection. In addition, we identify new potential virulence markers and verify whether these can predict better. A total of 219 French patient isolates and environmental strains were compared using a mixed-genome micro-array. The micro-array data were analysed to identify predictive markers, using a Random Forest algorithm combined with a logistic regression model. The sequences of the identified markers were compared with eleven known Legionella genomes, using BlastN and BlastX; the functionality for each of the predictive markers was checked in the literature.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23815549 PMCID: PMC3701591 DOI: 10.1186/1471-2164-14-435
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Prediction results on the origin of 219 French strains using five Dutch markers
| Prediction clinical | 89 | 48 | 137 |
| Prediction environmental | 25 | 57 | 82 |
| Total | 114 | 105 | 219 |
| Sensitivity | 78% | | |
| Specificity | 54% | | |
| PPV | 65% | | |
| NPV | 70% | ||
Abbreviation: PPV positive predictive value, NPV negative predictive value.
Prediction results for the 109 strains in the eleventh “test dataset” using four French markers
| Prediction clinical | 55 | 32 | 87 |
| Prediction environmental | 2 | 20 | 22 |
| Total | 57 | 52 | 109 |
| Sensitivity | 96% | | |
| Specificity | 38% | | |
| PPV | 63% | | |
| NPV | 91% | ||
Abbreviation: PPV, positive predictive value ; NPV, negative predictive value.
Spearman correlation coefficients of the 4 Dutch and the 4 French predictive markers for the total sample of French strains (n = 219)
| | ||||||||
| 7B8 | 1 | 0.421** | 0.212** | 0.209** | −0.183** | −0.381** | 0.416** | −0.060 |
| 15D6 | 0.421** | 1 | 0.177** | 0.462** | −0.110 | −0.432** | 0.859** | −0.199** |
| 16E4 | 0.212** | 0.177** | 1 | 0.082 | 0.073 | −0.085 | 0.228** | 0.116 |
| 33 F8 | 0.209** | 0.462** | 0.082 | 1 | −0.327** | −0.048 | 0.492** | 0.060 |
| 8D12 | −0.183** | −0.110 | 0.073 | −0.327** | 1 | 0.195** | −0.080 | 0.153* |
| 12H3 | −0.381** | −0.432** | −0.085 | −0.048 | 0.195** | 1 | −0.439** | 0.230** |
| 19D6 | 0.416** | 0.859** | 0.228** | 0.492** | −0.080 | −0.439** | 1 | −0.169* |
| 27B10 | −0.060 | −0.199** | 0.116 | 0.060 | 0.153* | 0.230** | −0.169* | 1 |
Correlation coefficients are based on the hybridization ratios of the individual markers.
* p-value < 0.05, **p-value < 0.01.
Sequence location of the four predictive markers in available genomes
| Paris | 2915168-2914552 | 2003186-2002613 | n/a | 674697-673883 |
| Philadelphia | 2812796-2812180 | 2027502-2026929 | n/a | 607471-606657 |
| Lens | 2748941-2748325 | 1985753-1985180 | n/a | 658953-658139 |
| Corby | 2990951-2990335 | 2108162-2107589 | n/a | 698878-698064 |
| Alcoy | 2927471-2926855 | 2102218-2101645 | n/a | 689774-688960 |
| 130b | 2885344-2884728 | 1975530-1974957 | n/a | 683508-682754 |
| Lorraine | 2804908-2805524 | 1928210-1928783 | 713550-714251 | 628527-629341 |
| HL 060401035 | 2910715-2911331 | 2131955-2132528 | n/a | 656754-657568 |
| L. longbeachae NSW150 | n/a | n/a | n/a | n/a |
| L. longbeachae D-4968 | n/a | n/a | n/a | n/a |
| ATCC 43290 | 2773461-2772845 | 1943894-1943321 | n/a | 611624-610810 |
n/a = not applicable.
Table of differences in hybridization ratios for the 4 Dutch and the 4 French markers between clinical isolates and environmental strains
| | ||||
|---|---|---|---|---|
| Dutch markers | | | | |
| 7B8 | 1.64 (2.94) | 2.33 (3.26) | 0.89 (2.35) | 0.50 |
| 15D6 | 0.50 (0.53) | 0.64 (0.53) | 0.34 (0.48) | 0.60 |
| 16E4 | 1.17 (0.25) | 1.23 (0.16) | 1.11 (0.30) | 0.50 |
| 33 F8 | 1.08 (0.36) | 1.07 (0.39) | 1.09 (0.34) | −0.06 |
| French markers | | | | |
| 8D12 | 0.96 (0.17) | 1.01 (0.13) | 0.91 (0.19) | 0.57 |
| 12H3 | 0.89 (0.20) | 0.84 (0.16) | 0.94 (0.23) | −0.50 |
| 19D6 | 0.64 (0.83) | 0.91 (0.87) | 0.35 (0.68) | 0.72 |
| 27B10 | 1.05 (0.14) | 1.04 (0.09) | 1.05 (0.18) | −0.11 |
Data represent mean hybridization ratios (SD). Effect size was calculated as the difference in ratio between clinical isolates and environmental strains, divided by the within group standard deviation (SD) of the ratio in the total sample. SD = standard deviation.