| Literature DB >> 24364868 |
Anthony P Underwood1, Garan Jones, Massimo Mentasti, Norman K Fry, Timothy G Harrison.
Abstract
BACKGROUND: Legionella pneumophila is an opportunistic pathogen of humans where the source of infection is usually from contaminated man-made water systems. When an outbreak of Legionnaires' disease caused by L. pneumophila occurs, it is necessary to discover the source of infection. A seven allele sequence-based typing scheme (SBT) has been very successful in providing the means to attribute outbreaks of L. pneumophila to a particular source or sources. Particular sequence types described by this scheme are known to exhibit specific phenotypes. For instance some types are seen often in clinical cases but are rarely isolated from the environment and vice versa. Of those causing human disease some types are thought to be more likely to cause more severe disease. It is possible that the genetic basis for these differences are vertically inherited and associated with particular genetic lineages within the population. In order to provide a framework within which to test this hypothesis and others relating to the population biology of L. pneumophila, a set of genomes covering the known diversity of the organism is required.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24364868 PMCID: PMC3877988 DOI: 10.1186/1471-2180-13-302
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Values of the standardised index of association and recombination to mutation ratio
| 0.193 | 1.6 | |
| 0.044 | 9.3 | |
| 0.116 | 32.5 | |
| 0.153 | 16.8 |
Figure 1A reticulate tree generated by the Neighbor-net algorithm of SplitsTree4 using the concatenated sequences of the seven SBT loci as input data.
Number of loci positive for recombination by the Sawyer’s run test and RDP suite
| 0 loci | 1 locus | |
| 3 loci | 4 loci | |
| 7 loci | 6 loci | |
| 1 locus | 2 loci |
Figure 2Clusters as determined by eBURST mapped onto a radial phylogram generated by FastTree 2. STs not assigned to a cluster (singletons in eBURST) are coloured black.
Figure 3Clusters as determined by BAPS using allelic data mapped onto a radial phylogram generated by FastTree 2.
Figure 4Clusters as determined by BAPS using linked sequence mapped onto a radial phylogram generated by FastTree 2. STs that have significant admixture are coloured black. The clusters are labelled using the lowest ST number found within the cluster.
Details of strains used in the study
| 46 | 1 | 45 | H093620212 | closest to centroid | fairly common clinical strain - assumed to be virulent | NGS mate paired Illumina | 169 | ERR315646 |
| 74* | 1 | 45 | H053260229 | second of cluster | fairly common clinical strain - assumed to be virulent | NGS 454 | | ERR315647 |
| 15* | 2 | 15 | Lens | already published | caused a big outbreak in France | GenBank (NC_006369.1) | | [ |
| 84 | 2 | 15 | H043940028 | closest available to centroid | only one available from this cluster | NGS paired end Illumina | 283 | ERR315648 |
| 47 | 3 | 16 | H063920004 | internationally significant | in top six strains that cause disease | NGS 454, paired end Illumina and mate paired Illumina | paired end 211 mate paired 227 | ERR315649 |
| 47 | 3 | 16 | Lorraine | already published | in top six strains that cause disease | GenBank(NC_018139.1) | | [ |
| 47 | 3 | 16 | LP_617 | already published | in top six strains that cause disease | EMBLBank(ERS166047) | | [ |
| 54* | 3 | 16 | H065000139 | closest to centroid | uncommon strain nothing known | NGS paired end Illumina | 161 | ERR315650 |
| 62 | 3 | 16 | H064180002 | internationally significant | in top six strains that cause disease | NGS 454 | | ERR315651 |
| 611 | 4 | 124 | H090500162 | only one in cluster | unique environmental isolate | NGS mate paired Illumina | 276 | ERR315652 |
| 87 | 5 | 17 | LC6677 | second of cluster | common serogroup 3 strain - does cause disease | NGS paired end Illumina | 490 | ERR315653 |
| 376 | 5 | 17 | RR08000760 | closest to centroid | unique environmental isolate | NGS mate paired Illumina | 235 | ERR315654 |
| 1* | 6 | 1 | Paris | already published | | GenBank (NC_006368.1) | | [ |
| 1 | 6 | 1 | LP_423 | already published | | EMBLBank(ERS166048) | | [ |
| 5 | 6 | 1 | EUL00013 (83/41091) | on an interesting branch of ST001 | only three in database - all from small outbreak in Glasgow | NGS mate paired Illumina | 304 | ERR315655 |
| 152 | 6 | 1 | H074360702 | closest to centroid | uncommon - mainly environmental | NGS mate paired Illumina | 180 | ERR315656 |
| 179 | 7 | 130 | H093380153 | closest to centroid | uncommon but causes disease | NGS paired end Illumina | 32 | ERR315657 |
| 337 | 7 | 130 | RR08000517 | second of cluster | uncommon strain appears to be phenotypically variable | NGS mate paired Illumina | 161 | ERR315658 |
| 42 | 8 | 14 | 130b (Wadsworth) | already published | in top six strains that cause disease - globally distributed. Isolated in USA in ~1980 | GenBank (FR687201.1) | | [ |
| 42 | 8 | 14 | H044540088 | internationally significant | as above but isolated in UK in 2004 - assumed to be virulent | NGS 454 | | ERR315659 |
| 44 | 8 | 14 | H100260089 | closest to centroid | similar to ST42 but not so common | NGS paired end Illumina | 346 | ERR315660 |
| 154* | 9 | 12 | LC677 4 | closest to centroid | seen in Canada and UK as a cause of nosocomial LD | NGS mate paired Illumina | 84 | ERR315661 |
| 336* | 9 | 12 | Lansing-3 (sgp15TS) | second of cluster | Representative of | NGS paired end Illumina | 150 | ERR315662 |
| 23 | 10 | 19 | H063280001 | closest to centroid | in top six strains that cause disease | NGS paired end Illumina | 265 | ERR315663 |
| 78* | 10 | 19 | LC6451 | second of cluster | unique strain - caused a major outbreak in Barrow in 2002. | NGS 454 | | ERR315664 |
| 51 | 11 | 2 | Corby | already published/closest to centroid | uncommon but well characterised - virulent in animal model and protozoa | GenBank (NC_009494.2) | | [ |
| 454 | 11 | 2 | H091960011 | second of cluster | unique environmental strain (from Roman baths in Bath) | NGS mate paired Illumina | 268 | ERR315665 |
| 578 | 11 | 2 | Alcoy | already published | responsible for a very big outbreak in Spain | GenBank (NC_014125.1) | | [ |
| 59 | 12 | 25 | H070840415 | closest to centroid | quite common environmental strain - a few cases of LD. | NGS mate paired Illumina | 246 | ERR315666 |
| 188 | 12 | 25 | H075160080 | second of cluster | no particular data | NGS paired end Illumina | 298 | ERR315667 |
| 36 | 13 | 9 | Philadelphia | already published/closest to centroid | the type strain - well characterised caused the Philadelphia outbreak. | GenBank (NC_002942.5) | | [ |
| 37 | 13 | 9 | H034680035 | internationally significant | in top six strains that cause disease | NGS 454 | | ERR315668 |
| 186 | 13 | 9 | H044500045 | second of cluster | unique clinical isolate | NGS paired end Illumina | 375 | ERR315669 |
| 34 | 14 | 34 | RR08000134 | closest available to centroid | no particular significance | NGS paired end Illumina | 301 | ERR315670 |
| 68 | 14 | 34 | H074360710 | second of cluster | no particular significance | NGS mate paired Illumina | 179 | ERR315671 |
| 707* | 15 | 71 | H091960009 | only one in cluster | unique environmental strain (from Roman baths in Bath) | NGS paired end Illumina and mate paired Illumina | Paired end 136 Mate paired 50 | ERR315672 |
Asterisks designate strains that are likely to have plasmids based on the analysis described in the methods section of this manuscript. The sequence data described can be obtained using the European Nucleotide Archive accession number listed in the table.
Figure 5BRIG blast analysis of the Legionella genomes using the genome of Corby as a reference. The strains and figure colours used were from centre to outside ST152 (CST1 mauve), ST5 (CST1 light blue), ST611 (CST124 dark blue), ST454(CST2 medium blue), ST47(CST16 leaf green), ST376 (CST17 dark green), ST46(CST45 light green), ST59 (CST25 pink), ST42(CST14 red), ST84 (CST15 purple), ST337 (CST130 mauve), ST23 (CST19 light blue), ST37 (CST9 dark blue), ST68 (CST34 medium blue), ST154 (CST12 leaf green) and ST707 (CST71 dark green). Regions showing significant variability in genomic content are boxed with black lines and labelled 1–14.
Figure 6A maximum parsimony tree based on the presence and absence of genes in the 27 genomes sequenced as part of this work and 5 additional genomes from GenBank (Alcoy, Corby, Lens, Paris, Philadelphia). The internal nodes are labelled with the bootstrap values.
Figure 7A maximum likelihood tree based on the SNP differences between all 27 genomes sequenced as part of this work and 5 additional genomes from GenBank (Alcoy, Corby, Lens, Paris, Philadelphia). Also included are four additional genomes from external sources (LP_423(ST1), Lorraine (ST47), LP_617 (ST47), Wadsworth (ST42)) used for intra ST-comparison. The internal nodes are labelled with the bootstrap values. The data for this tree can be viewed at http://purl.org/phylo/treebase/phylows/study/TB2:S15085.
Figure 8A reticulate tree generated by the Neighbor-net algorithm of SplitsTree4 using the concatenated SNPs from the genome sequences of 33 strains as input data.
Table showing admixture strains
| 3 | 47 | no | yes | ||
| 3 | 54 | no | yes | ||
| 3 | 62 | no | no | ||
| 7 | 179 | no | yes | ||
| 7 | 337 | no | no |
The clusters listed are those that show aberrant clustering on both trees derived from whole genome data. Only those clusters (cluster numbers shown in bold text) that contribute more than 0.1 of the genetic material of a strain are reported.