| Literature DB >> 29389983 |
Xiao-Yong Zhan1,2,3, Qing-Yi Zhu1,2.
Abstract
Inadequate discriminatory power to distinguish between L. pneumophila isolates, especially those belonging to disease-related prevalent sequence types (STs) such as ST1, ST36 and ST47, is an issue of SBT scheme. In this study, we developed a multilocus sequence typing (MLST) scheme based on two non-virulence loci (trpA, cca) and three virulence loci (icmK, lspE, lssD), to genotype 110 L. pneumophila isolates from various natural and artificial water sources in Guangdong province of China, and compared with the SBT. The isolates were assigned to 33 STs of the SBT and 91 new sequence types (nSTs) of the MLST. The indices of discrimination (IODs) of SBT and MLST were 0.920 and 0.985, respectively. Maximum likelihood trees of the concatenated SBT and MLST sequences both showed distinct phylogenetic relationships between the isolates from the two environments. More intragenic recombinations were detected in nSTs than in STs, and they were both more abundant in natural water isolates. We found out the MLST had a high discriminatory ability for the disease-associated ST1 isolates: 22 ST1 isolates were assigned to 19 nSTs. Furthermore, we assayed the discrimination of the MLST for 29 reference strains (19 clinical and 10 environmental). The clinical strains were assigned to eight STs and ten nSTs. The MLST could also subtype the prevalent clinical ST36 or ST47 strains: eight ST36 strains were subtyped into three nSTs and two ST47 strains were subtyped into two nSTs. We found different distribution patterns of nSTs between the environmental and clinical ST36 isolates, and between the outbreak clinical ST36 isolates and the sporadic clinical ST36 isolates. These results together revealed the MLST scheme could be used as part of a typing scheme that increased discrimination when necessary.Entities:
Mesh:
Year: 2018 PMID: 29389983 PMCID: PMC5794064 DOI: 10.1371/journal.pone.0190986
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
L. pneumophila reference clinical and environmental strain information.
| Strain | Source nature | Geographic location | Collection year | NCBI BioSample No. | GenBank accession No. | ST | nST |
|---|---|---|---|---|---|---|---|
| Thunder Bay | Clinical | Canada | N/A | SAMN02603729 | CP003730.1 | 187 | 92 |
| ATCC43290 | Clinical | USA: Denver | 1987 | SAMN02603182 | NC_016811.1 | 187 | 92 |
| ERS1434278 | Clinical | Germany | 1999 | SAMEA4535099 | NZ_LT632617.1 | 42 | 93 |
| Lorraine | Clinical | France: Lorraine | N/A | SAMEA3138425 | NC_018139.1 | 47 | 94 |
| 130b | Clinical | USA | 1980s | SAMEA2272454 | FR687201.1 | 42 | 93 |
| LP_617 | Clinical | UK | 2003 | SAMEA1487522 | NZ_FJOC01000001.1- NZ_FJOC010000074.1 | 47 | 100 |
| Lens | Clinical | France: Lens | 2003 | SAMEA3138253 | NC_006369.1 | 15 | 95 |
| Alcoy | Clinical | Spain: Alcoy | 1999 | SAMN02604292 | NC_014125.1 | 578 | 96 |
| Corby | Clinical | United Kingdom | N/A | SAMN02603241 | NC_009494.2 | 51 | 97 |
| OLDA | Clinical | USA | 1947 | SAMN05198688 | CP016030.2 | 1 | 20 |
| Paris | Clinical | France: Paris | 1987 | SAMEA3138252 | NC_006368.1 | 1 | 20 |
| C1_S | Clinical | USA: North Carolina | 2009 | SAMN05179547 | CP015932.1 | 36 | 92 |
| C2_S | Clinical | USA: Illinois | 2007 | SAMN05179997 | CP015933.1 | 36 | 92 |
| C3_O | Clinical | USA: Texas | 2006 | SAMN05180024 | CP015934.1 | 36 | 92 |
| C5_P | Clinical | USA: Ohio | 1998 | SAMN05180026 | CP015936.1 | 36 | 92 |
| C7_O | Clinical | USA: Deleware | 1994 | SAMN05180028 | CP015938.1 | 36 | 98 |
| C9_S | Clinical | USA: Indiana | 1982 | SAMN05180030 | CP015941.1 | 36 | 92 |
| C10_S | Clinical | USA: Nebraska | 1990 | SAMN05180031 | CP015944.1 | 36 | 99 |
| Philadelphia_1 | Clinical | USA: Pennsylvania | 1977 | SAMN05180044 | CP015928.1 | 36 | 92 |
| E1_P | Environmental | USA: California | 2013 | SAMN05180033 | CP015946.1 | 36 | 92 |
| E2_N | Environmental | USA: Nevada | 2012 | SAMN05180034 | CP015947.1 | 36 | 92 |
| E3_N | Environmental | USA: Texas | 2011 | SAMN05180035 | CP015949.1 | 36 | 92 |
| E4_N | Environmental | USA: Alabama | 2012 | SAMN05180036 | CP015950.1 | 36 | 92 |
| E5_N | Environmental | USA: Arkansas | 2011 | SAMN05180037 | CP015951.1 | 36 | 92 |
| E6_N | Environmental | USA: New Jersey | 2011 | SAMN0518003 | CP015953.1 | 36 | 101 |
| E7_O | Environmental | USA: Georgia | 2009 | SAMN05180039 | CP015954.1 | 36 | 92 |
| E8_O | Environmental | USA: Texas | 2006 | SAMN05180040 | CP015955.1 | 36 | 3 |
| E9_O | Environmental | USA: Illinois | 2012 | SAMN05180041 | CP015956.1 | 36 | 92 |
| E10_P | Environmental | USA: Ohio | 2007 | SAMN05180042 | CP015925.1 | 36 | 92 |
* The clinical and environmental ST36 isolates in the present study included the confirmed outbreak-associated isolates (_O), the potential outbreak isolates (_P), the sporadic isolates (_S), the non-disease-associated isolates (_N) and Philadelphia-1 isolates from USA CDC (Philadelphia_1). C1_S, C2_S, C3_O etc. indicate clinical isolates, while E1_P, E2_N, E3_N etc. indicate environmental isolates.
# Strain LP_617 only showed a set of whole genome shotgun sequences.
a Sequence type was derived from the genome sequence data.
b Strain Lens has two non-identical copies of the mompS locus (354nt) in their genome, and its ST was defined according to Moran-Gilad’s report [16].
ST distributions in the isolates from natural and artificial water sources.
| ST | Allelic profile | Natural isolates | Artificial isolates | |||
|---|---|---|---|---|---|---|
| ST1 | 1, 4, 3, 1, 1, 1, 1 | 2 | 3.39 | 20 | 39.22 | |
| ST630 | 1, 4, 3, 1, 1, 1, 10 | 4 | 6.78 | 2 | 3.92 | 0.684 |
| ST1417 | 8, 6, 34, 9, 2, 8, 209 | 2 | 3.39 | 3 | 5.88 | 0.664 |
| ST242 | 3, 10, 1, 28, 1, 9, 3 | 1 | 1.69 | 3 | 5.88 | 0.338 |
| ST1048 | 6, 10, 17, 3, 4, 14, 11 | 12 | 20.34 | 1 | 1.96 | |
| ST59 | 7, 6, 17, 3, 13, 11, 11 | 0 | 0 | 2 | 3.92 | 0.213 |
| ST739 | 12, 8, 11, 2, 10, 12, 2 | 7 | 11.86 | 0 | 0 | |
| ST1267 | 2, 6, 48, 6, 48, 5, 40 | 6 | 10.17 | 0 | 0 | |
| ST1266 | 12, 15, 11, 56, 29, 12, 34 | 2 | 3.39 | 0 | 0 | 0.496 |
| ST1785 | 2, 15, 3, 73, 29, 1, 201 | 2 | 3.39 | 0 | 0 | 0.496 |
| ST45 | 5, 1, 22, 26, 6, 10, 12, 45 | 2 | 3.39 | 0 | 0 | 0.496 |
| ST1049 | 12, 8, 11, 2, 11, 12, 4 | 2 | 3.39 | 0 | 0 | 0.496 |
| ST752 | 22, 4, 3, 1, 1, 1, 1 | 2 | 3.39 | 7 | 13.73 | 0.078 |
| ST1052 | 2, 10, 15, 28, 21, 3, 2 | 2 | 3.39 | 0 | 0 | 0.496 |
| ST1053 | 6, 16, 14, 28, 4, 14, 3 | 3 | 5.08 | 0 | 0 | 0.247 |
| ST1777 | 1, 4, 3, 1, 1, 1, 215 | 2 | 3.39 | 1 | 1.963 | 1.000 |
| ST1054 | 32, 12, 50, 6, 48, 11, 9 | 0 | 0 | 4 | 7.843 | |
| Other STs | 8 | 13.56 | 8 | 15.69 | 0.752(Chi-Square test) | |
| Total | 59 | 100 | 51 | 100 | ||
* Other STs, 16 STs including only one isolate
nST distributions in the isolates from natural and artificial water sources.
| Five-gene MLST | Natural isolates | Artificial isolates | |||
|---|---|---|---|---|---|
| nST5 | 0 | 0 | 2 | 3.92 | 0.213 |
| nST15 | 0 | 0 | 3 | 5.88 | 0.096 |
| nST17 | 0 | 0 | 2 | 3.92 | 0.213 |
| nST20 | 1 | 1.69 | 2 | 3.92 | 0.596 |
| nST35 | 2 | 2.39 | 2 | 3.92 | 1.000 |
| nST39 | 0 | 0 | 5 | 9.80 | |
| nST50 | 5 | 8.47 | 0 | 0 | 0.060 |
| nST68 | 2 | 2.39 | 0 | 0 | 0.498 |
| nST82 | 2 | 2.39 | 0 | 0 | 0.498 |
| Other nSTs | 47 | 79.66 | 35 | 68.63 | 0.185 (Chi-Square test) |
| Total | 59 | 100 | 51 | 100 | |
* Other nSTs, 82 nSTs including only one isolate
Genetic diversity of the concatenated MLST and SBT sequences in L. pneumophila isolates from natural (N) and artificial (A) water sources.
| Sequences | Strain types | Sequence, n | Sequence length | SD of Hd | SD of | SD of | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | 59 | 2876 | 0.006 | 0.00121 | 312 | 0.02335 | 0.00631 | 354 | |||||
| A | 51 | 2876 | 41 | 0.987 | 0.008 | 0.02507 | 0.00502 | 380 | 0.02937 | 0.00814 | 72.100 | 415 | |
| All | 110 | 2876 | 0.003 | 0.02763 | 0.00243 | 485 | 0.03198 | 0.00767 | 547 | ||||
| N | 59 | 2501/2498 | 0.019 | 0.00338 | 300 | 0.02585 | 0.00699 | 355 | |||||
| A | 51 | 2501 | 17 | 0.824 | 0.047 | 0.02217 | 0.00402 | 316 | 0.02808 | 0.00780 | 55.454 | 338 | |
| All | 110 | 2501/2498 | 33 | 0.928 | 0.013 | 0.02807 | 0.00252 | 369 | 0.02801 | 0.00675 | 70.118 | 429 |
H, Haplotypes,
Hd, Haplotype diversity
π, Nucleotide diversity
S, Polymorphic sites
θ, Theta (per site) from S
k, Nucleotide differences
Ƞ, Total number of mutations
Analysis of molecular variance of concatenated MLST and SBT sequences.
| Sequences | Source of variation | d.f. | Sum of squares | Variance components | Percentage of variation | F-statistics |
|---|---|---|---|---|---|---|
| Among groups | 1 | 170.327 | -1.82081 Va | -4.44 | ||
| Among populations within groups | 2 | 261.272 | 6.00777 Vb | 14.66 | ||
| Within populations | 106 | 3899.455 | 36.78731 Vc | 89.78 | ||
| Total | 109 | 4331.055 | 40.97427 | |||
| Among groups | 1 | 144.336 | -2.10141 Va | -5.78 | ||
| Among populations within groups | 2 | 247.195 | 5.82481 Vb | 16.03 | ||
| Within populations | 106 | 3456.305 | 32.60666 Vc | 89.75 | ||
| Total | 109 | 3847.836 | 36.33005 |
** P < 0.01
Fig 1Phylogenetic tree of the concatenated MLST sequences (2876 bp) of the 91 nSTs in this study.
Bootstrap support values (1000 replicates) for nodes higher than 50% are indicated next to the corresponding node. Five main groups of the branches could be found. Different color of the branches indicated distinct groups of the nSTs, and branches with the same color were clustered into a group. The blocks indicate the strains of corresponding nSTs. A indicates artificial isolates, and N indicates natural isolates.
Fig 2Phylogenetic tree of concatenated SBT sequences (2501/2498bp) of the 33 STs in this study.
Bootstrap support values (1000 replicates) for nodes higher than 50% are indicated next to the corresponding node. Five main groups of the branches could be found. Different color of the branches indicates distinct groups of the nSTs. Branches with the same color are clustered into a group. The relative size of solid circles indicates the number of isolates in the selective group; the red sector indicates artificial water isolates, while the blue sector indicates natural water isolates. The blocks indicate the isolates of the corresponding STs. A indicates artificial water isolates, and N indicates natural water isolates.
Fig 3Reticulate network tree by using the neighbor-net algorithm of SplitsTree4 using the five MLST loci concatenated alignments of the 91 nSTs.
All internal nodes represent hypothetical ancestral nSTs and edges correspond to reticulate events such as recombinations.
Fig 4Reticulate network tree by using the neighbor-net algorithm of SplitsTree4 using the seven SBT loci concatenated alignments of the 33 STs.
All internal nodes represent hypothetical ancestral STs and edges correspond to reticulate events such as recombinations.
Intragenic recombination in the 33 STs by using six different methods implemented in RDP software.
| Recombination events | Recombinant STs | Major parent | Minor parent | Detection methods implemented in RDP software | |||||
|---|---|---|---|---|---|---|---|---|---|
| RDP | GENECONV | Bootscan | Maxchi | Chimaera | SiSscan | ||||
| 1 | ST1417, ST1781, ST1782, ST1785, ST1777 | ST1788 | ST160 | Y | Y | Y | Y | Y | N |
| 2 | ST1418 | ST1051 | ST160 | Y | Y | N | Y | Y | N |
| 3 | ST160 | ST1778 | ST242 | Y | Y | N | Y | Y | N |
| 4 | ST114, ST59, ST1054 | ST1049 | ST1263 | Y | Y | Y | Y | Y | Y |
| 5 | ST93, ST242 | ST1052 | ST1049 | N | N | N | Y | Y | N |
| 6 | ST1262, ST1263 | ST172 | ST1052 | Y | Y | Y | Y | Y | N |
| 7 | ST1267 | ST1051 | ST1778 | N | Y | N | N | N | Y |
| 8 | ST1417 | ST1267 | ST1050 | Y | N | N | Y | N | Y |
| 9 | ST1263, ST1417, ST1781 | ST752 | ST1785 | Y | N | N | N | N | Y |
| 10 | ST1266 | ST1049 | ST1051 | Y | Y | Y | Y | Y | Y |
| 11 | ST1049, ST739 | ST45 | ST1778 | Y | N | N | Y | Y | N |
| 12 | ST1051, ST59 | ST1053 | ST1778 | N | Y | N | N | N | Y |
| 13 | ST1053, ST1051 | ST1050 | ST1785 | Y | N | N | N | N | Y |
* Major parent: parent contributing the larger fraction of the sequence.
# Minor parent: parent ST contributing the smaller fraction of the sequence.
$ Recombination events detected by more than two methods were shown.
a ST used to infer the existence of a missing parental sequence
b Y indicates recombination events were detected by the selected method.
c N indicates recombination events were not detected by the selected method.
Intragenic recombination in the 91 nSTs by using six different methods implemented in RDP software.
| Recombination events | Recombinant nSTs | Major parent | Minor parent | Detection methods implemented in RDP software | |||||
|---|---|---|---|---|---|---|---|---|---|
| RDP | GENECONV | Bootscan | Maxchi | Chimaera | SiSscan | ||||
| 1 | nST39 | nST6 | nST59 | Y | Y | Y | Y | Y | Y |
| 2 | nST60, nST61 | nST59 | nST6a | Y | Y | N | Y | Y | N |
| 3 | nST83, nST63, nST65, nST81, nST76, nST64, nST67, nST85, nST82, nST86 | nST72 | nST62 | Y | Y | N | Y | Y | Y |
| 4 | nST22, nST23, nST50, nST51, nST88 | nST37 | nST87 | Y | Y | Y | Y | Y | Y |
| 5 | nST71, nST72, nST67, nST73, nST74, nST77, nST78, nST79, nST83, nST86, nST87 | nST41 | nST33 | Y | Y | N | Y | N | Y |
| 6 | nST91, nST66 | nST41 | nST59 | Y | Y | N | Y | Y | Y |
| 7 | nST59 | nST70 | nST87 | N | Y | N | Y | Y | Y |
| 8 | nST70 | nST4 | nST41 | N | N | N | Y | Y | Y |
| 9 | nST59 | nST39 | nST33 | Y | Y | Y | Y | N | Y |
| 10 | nST41 | nST4 | nST49 | Y | Y | Y | Y | Y | Y |
| 11 | nST60, nST61 | nST72 | nST33 | Y | N | Y | Y | N | Y |
| 12 | nST46, nST43, nST44, nST45, nST47, nST48, nST49, nST68 | nST4 | nST33 | N | N | N | Y | Y | N |
| 13 | nST53, nST52 | nST46 | nST62 | Y | N | N | N | N | Y |
| 14 | nST91, nST66 | nST87 | nST22 | N | N | N | Y | N | Y |
* Major parent: parent contributing the larger fraction of the sequence.
# Minor parent: parent ST contributing the smaller fraction of the sequence
$ Recombination events detected by more than two methods were shown.
a nST used to infer the existence of a missing parental sequence
b Y indicates recombination events were detected by the selected method
c N indicates recombination events were not detected by the selected method.
Fig 5Phylogenetic tree and reticulate network tree of the concatenated MLST sequences (2876bp) for the 22 ST1 isolates.
A. Phylogenetic tree of the concatenated MLST sequences (2876bp) for the 22 ST1 isolates in this study. Two main groups of these isolates could be found in the tree. The blue branches indicate the natural water isolates (N), which constitute a subgroup. B. Reticulate network tree of the concatenated sequences of the MLST loci for the 22 ST1 isolates. Internal nodes and edges exist.
Fig 6Phylogenetic analysis of the concatenated MLST and SBT sequences for the nSTs and STs of the clinical strains.
Bootstrap support values (1000 replicates) for nodes higher than 50% are indicated next to the corresponding node. A. ML tree of the concatenated MLST sequences for the 10 nSTs of the 19 clinical strains. The ST36 isolates were marked red. B. ML tree of the concatenated SBT sequences for the eight STs of the 19 clinical strains.