Monica Moschioni, Morena Lo Sapio, Giovanni Crisafulli, Giulia Torricelli, Silvia Guidotti, Alessandro Muzzi, Michèle A Barocchi, Claudio Donati.
Abstract
Multi-Locus Sequence Typing (MLST) of Streptococcus pneumoniae is based on the sequence of seven housekeeping gene fragments. The analysis of MLST allelic profiles by eBURST allows the grouping of genetically related strains into Clonal Complexes (CCs), each including those genotypes with a common descent from a predicted ancestor. However, the increasing use of MLST to characterize S. pneumoniae strains has led to the identification of a large number of new Sequence Types (STs), causing the merger of formerly distinct lineages into larger CCs. An example is CC156, which displays a high level of complexity and includes strains that differ in all seven MLST loci, in capsular type and in the presence of the Pilus Islet-1 (PI-1). Detailed analysis of CC156 indicates that the identification of new STs, such as ST4945, induced the merging of formerly distinct clonal complexes. In order to discriminate the strain diversity within CC156, a recently developed typing schema, 96-MLST, was used to analyse 66 strains representative of 41 different STs. Analysis of allelic profiles by hierarchical clustering and a minimum spanning tree identified ten genetically distinct evolutionary lineages. Similar results were obtained by phylogenetic analysis of the concatenated sequences with different methods. The identified lineages are homogeneous in capsular type and PI-1 presence. ST4945 strains were unequivocally assigned to one of the lineages. In conclusion, the identification of new STs through an exhaustive analysis of pneumococcal strains from various laboratories has highlighted that potentially unrelated subgroups can be grouped into a single CC by eBURST. The analysis of additional loci, such as those included in the 96-MLST schema, will be necessary to accurately discriminate the clonal evolution of the pneumococcal population.
Year: 2013 PMID: 23593373 PMCID: PMC3625235 DOI: 10.1371/journal.pone.0061003
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
CC156 strain panel used in this study.
| Strain name | ST | Serotype/serogroup | Country | MLST alleles in common with ST4945 | MLST alleles in common with ST156 | PI-1 | Data source | Strain source | Lineage |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| | 90 | 6B | Spain | 2/7 | 1/7 | yes | GenBank:CP002176 | | f |
| | 94 | 6B | Italy | 2/7 | 1/7 | yes | This Study | Istituto Superiore di Sanità, Italy | f |
| | 124 | 14 | Canada | 4/7 | 1/7 | no | GenBank:ABZC00000000 | | d |
| | 124 | 14 | Canada | 4/7 | 1/7 | no | GenBank:ABZT00000000 | | d |
| | 124 | 14 | USA | 4/7 | 1/7 | no | GenBank:ABAD00000000 | | d |
| | 138 | 6B | USA | 3/7 | 1/7 | yes | This Study | Center for Disease Control and Prevention, USA | b |
| | 143 | 14 | Italy | 3/7 | 5/7 | yes | This Study | Istituto Superiore di Sanità, Italy | i |
| | 145 | 6B | Iceland | 5/7 | 3/7 | yes | This Study | Landspitali, National University Hospital of Iceland, Iceland | e |
| | 146 | 6B | New Zealand | 4/7 | 2/7 | yes | This Study | Center for Disease Control and Prevention, USA | e |
| | 156 | 14 | Israel | 3/7 | 7/7 | yes | This Study | Ben-Gurion University of the Negev, Israel | i |
| | 156 | 14 | Israel | 3/7 | 7/7 | yes | This Study | Ben-Gurion University of the Negev, Israel | i |
| | 156 | 11A | Israel | 3/7 | 7/7 | yes | This Study | Ben-Gurion University of the Negev, Israel | i |
| | 156 | 9V | Thailand | 3/7 | 7/7 | yes | This Study | Shoklo Malaria Research Unit, Thailand | i |
| | 156 | 9V | Thailand | 3/7 | 7/7 | yes | This Study | Shoklo Malaria Research Unit, Thailand | i |
| | 156 | 14 | Brazil | 3/7 | 7/7 | yes | This Study | Oswaldo Cruz Foundation Salvador, Brazil | i |
| | 156 | 14 | Brazil | 3/7 | 7/7 | yes | This Study | Oswaldo Cruz Foundation Salvador, Brazil | i |
| | 156 | 9V | Italy | 3/7 | 7/7 | yes | This Study | Istituto Superiore di Sanità, Italy | i |
| | 156 | 14 | Italy | 3/7 | 7/7 | yes | This Study | Istituto Superiore di Sanità, Italy | i |
| | 156 | 9V | Italy | 3/7 | 7/7 | yes | This Study | Istituto Superiore di Sanità, Italy | i |
| | 156 | 14 | Italy | 3/7 | 7/7 | yes | This Study | Istituto Superiore di Sanità, Italy | i |
| | 156 | 9V | Sweden | 3/7 | 7/7 | yes | This Study | Karolinska Institutet, Sweden | i |
| | 156 | 9V | Sweden | 3/7 | 7/7 | yes | This Study | Karolinska Institutet, Sweden | i |
| | 156 | 9V | Worldwide | 3/7 | 7/7 | yes | GenBank:ABGE00000000 | Genome Biol 11:R107 | i |
| | 162 | 9V | Brazil | 4/7 | 6/7 | yes | This Study | Oswaldo Cruz Foundation Salvador, Brazil | i |
| | 162 | 9V | Brazil | 4/7 | 6/7 | yes | This Study | Oswaldo Cruz Foundation Salvador, Brazil | i |
| | 162 | 24F | Italy | 4/7 | 6/7 | yes | This Study | Istituto Superiore di Sanità, Italy | i |
| | 162 | 24F | Italy | 4/7 | 6/7 | yes | This Study | Istituto Superiore di Sanità, Italy | i |
| | 166 | 9V | USA | 4/7 | 6/7 | yes | This Study | Center for Disease Control and Prevention, USA | i |
| | 171 | 6B | n.d. | 3/7 | 2/7 | no | This Study | University of Alabama, USA | b |
| | 172 | 23F | Israel | 1/7 | 1/7 | no | This Study | Ben-Gurion University of the Negev, Israel | a |
| | 172 | 19A | Israel | 1/7 | 1/7 | no | This Study | Ben-Gurion University of the Negev, Israel | a |
| | 172 | 23F | Israel | 1/7 | 1/7 | yes | This Study | Ben-Gurion University of the Negev, Israel | a |
| | 172 | 23F | Thailand | 1/7 | 1/7 | no | This Study | Shoklo Malaria Research Unit, Thailand | a |
| | 172 | 23F | Thailand | 1/7 | 1/7 | yes | This Study | Shoklo Malaria Research Unit, Thailand | a |
| | 173 | 23F | Poland | 2/7 | 2/7 | yes | This Study | Center for Disease Control and Prevention, USA | c |
| | 176 | 6B | Italy | 2/7 | 1/7 | yes | This Study | Istituto Superiore di Sanità, Italy | b |
| | 239 | 9V | Poland | 1/7 | 1/7 | no | This Study | National Medicine Institute, Poland | g |
| | 268 | 19A | Hungary | 1/7 | 1/7 | yes | GenBank:CP000936 | Genome Biol 11:R107 | c |
| | 273 | 6B | Greece | 4/7 | 1/7 | yes | This Study | Center for Disease Control and Prevention, USA | f |
| | 280 | 9V | Thailand | 2/7 | 1/7 | no | This Study | Shoklo Malaria Research Unit, Thailand | g |
| | 338 | 23F | Colombia | 1/7 | 1/7 | no | This Study | Center for Disease Control and Prevention, USA | a |
| | 361 | 6A | Ghana | 2/7 | 2/7 | no | This Study | Swiss Tropical Institute, Switzerland | a |
| | 385 | 6B | USA | 5/7 | 2/7 | yes | This Study | University of Alabama, USA | e |
| | 392 | 17F | USA | 6/7 | 3/7 | no | This Study | Center for Disease Control and Prevention, USA | h |
| | 440 | 23F | Italy | 5/7 | 2/7 | no | This Study | Ospedale le Scotte, Siena, Italy | h |
| | 559 | 6B | Italy | 2/7 | 1/7 | yes | This Study | Istituto Superiore di Sanità, Italy | b |
| | 602 | 23F | Poland | 4/7 | 1/7 | no | This Study | National Medicine Institute, Poland | h |
| | 642 | 9V | USA | 4/7 | 4/7 | yes | This Study | Center for Disease Control and Prevention, USA | i |
| | 671 | 14 | USA | 2/7 | 4/7 | yes | This Study | Center for Disease Control and Prevention, USA | i |
| | 789 | 14 | Uruguay | 6/7 | 2/7 | no | This Study | The Rockefeller University, New York, USA | d |
| | 847 | 19A | Kenya | 4/7 | 4/7 | yes | This Study | Kenyan Medical Research Center, Kenya | j |
| | 847 | 19A | Kenya | 4/7 | 4/7 | yes | This Study | Center for Disease Control and Prevention, USA | j |
| | 1269 | 9 | USA | 4/7 | 5/7 | yes | GenBank:ABAB00000000 | | i |
| | 1349 | 23B | Turkey | 0/7 | 0/7 | no | This Study | Center for Disease Control and Prevention, USA | a |
| | 2218 | 23F | Thailand | 2/7 | 1/7 | no | This Study | Shoklo Malaria Research Unit, Thailand | a |
| | 4404 | 6B | Thailand | 6/7 | 3/7 | no | This Study | Shoklo Malaria Research Unit, Thailand | e |
| | 4405 | 6B | Thailand | 5/7 | 2/7 | no | This Study | Shoklo Malaria Research Unit, Thailand | e |
| | 4945 | 17F | Sweden | 7/7 | 3/7 | no | This Study | Center for Disease Control and Prevention, USA | h |
| | 4945 | 17F | Egypt | 7/7 | 3/7 | no | This Study | Center for Disease Control and Prevention, USA | h |
| | 4948 | 14 | Egypt | 4/7 | 4/7 | yes | This Study | Center for Disease Control and Prevention, USA | i |
| | 4966 | 6B | Thailand | 4/7 | 3/7 | no | This Study | Center for Disease Control and Prevention, USA | b |
| | 4966 | 6C | Thailand | 4/7 | 3/7 | no | This Study | Center for Disease Control and Prevention, USA | b |
| | 4968 | 23A | Mozambique | 1/7 | 1/7 | no | This Study | Center for Disease Control and Prevention, USA | a |
| | 5420 | 6B | Thailand | 4/7 | 3/7 | no | This Study | Center for Disease Control and Prevention, USA | b |
| | 5613 | 6A | Nepal | 4/7 | 2/7 | no | This Study | Center for Disease Control and Prevention, USA | b |
| | 6214 | 6 | USA | 3/7 | 2/7 | yes | GenBank:ABAE00000000 | | e |
For each strain name, ST, serotype/serogroup, country of isolation, number of MLST alleles in common with ST156 and ST4945, data source, strain source and lineage (as identified by 96-MLST hierarchical clustering, see Figure 2) are indicated.
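The "MLST alleles in common" columns can be read as a simple count of matching loci between two seven-locus allelic profiles. The sketch below illustrates that count; the allele numbers are invented toy values, not the real profiles of ST156 or ST4945, and the function names are hypothetical.

```python
# The seven pneumococcal MLST loci; the allele numbers used below are toy values.
LOCI = ("aroE", "gdh", "gki", "recP", "spi", "xpt", "ddl")


def shared_alleles(profile_a, profile_b):
    """Count the loci at which two allelic profiles carry the same allele."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same loci")
    return sum(a == b for a, b in zip(profile_a, profile_b))


def format_shared(profile_a, profile_b):
    """Render the count in the 'n/7' style used in the table."""
    return f"{shared_alleles(profile_a, profile_b)}/{len(profile_a)}"


# Hypothetical reference and query profiles (one allele ID per locus in LOCI).
reference = (1, 2, 3, 4, 5, 6, 7)
query = (1, 2, 3, 9, 9, 9, 9)

print(format_shared(reference, query))
```

A strain scoring 7/7 against a reference has the same ST as that reference, which is why every ST156 row shows 7/7 in the ST156 column.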
Figure 2. Hierarchical clustering performed on the 96-MLST alleles identifies ten genetically distinct evolutionary lineages (a-j) within the 66 CC156 strains analyzed.
Sequences were converted into allelic profiles assigning a unique ID number to each allele. Hierarchical clustering was performed using the package Cluster v1.13.1. Distances between strains were computed using the function “Daisy” with Gower’s distance, counting the number of differences between allelic profiles. An agglomerative hierarchical clustering of the data was performed using the function “Agnes” with “average” (unweighted pair-group average method – UPGMA) method. The ten lineages identified (a-j) are indicated by coloured boxes, and numbers represent the bootstrap support. The STs of all the strains are indicated in the coloured bar.
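As a rough illustration of the procedure described in this legend, the following pure-Python sketch computes the fraction of mismatching loci between allelic profiles (which is what Gower's distance reduces to when every variable is nominal and none is missing) and then applies average-linkage (UPGMA) agglomeration. It is not the authors' R code: the strain names and profiles are toy values, the function names are hypothetical, and bootstrap support is not computed.

```python
from itertools import combinations


def mismatch_fraction(p, q):
    """Fraction of loci at which two allelic profiles differ."""
    return sum(a != b for a, b in zip(p, q)) / len(p)


def upgma(profiles):
    """Average-linkage agglomerative clustering of allelic profiles.

    profiles maps a strain name to a tuple of allele IDs.
    Returns the merge history as (members_a, members_b, height) tuples.
    """
    def average_distance(c1, c2):
        # UPGMA: mean of all pairwise distances between the two clusters.
        total = sum(
            mismatch_fraction(profiles[a], profiles[b]) for a in c1 for b in c2
        )
        return total / (len(c1) * len(c2))

    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        c1, c2 = min(
            combinations(clusters, 2), key=lambda pair: average_distance(*pair)
        )
        merges.append((c1, c2, average_distance(c1, c2)))
        clusters = [c for c in clusters if c not in (c1, c2)]
        clusters.append(tuple(sorted(c1 + c2)))
    return merges


# Toy profiles over four hypothetical loci.
toy_profiles = {
    "s1": (1, 1, 1, 1),
    "s2": (1, 1, 1, 2),
    "s3": (5, 6, 7, 8),
    "s4": (5, 6, 7, 9),
}

for left, right, height in upgma(toy_profiles):
    print(left, right, round(height, 2))
```

Cutting the resulting tree at a chosen height yields the groups; in the toy data the two pairs join at 0.25 and the two groups only merge at the maximum distance of 1.0.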
Figure 1. Graphic representation of CC156 by eBURST.
A) In the absence of ST4945, CC156 is partitioned into three different CCs by eBURST analysis. B) 32 out of the 41 CC156 STs analyzed differ in four or more alleles from the founder ST, ST156. The MLST database was accessed on 15th January 2012 and CC156 was visualized using eBURST (the eBURST algorithm was executed on a dataset comprising all the STs in the database, each represented once). A) Shadowed shapes indicate the partitioning of CC156 into distinct CCs (CC162 blue, CC124 red, CC176 green) when eBURST was executed with the same ST dataset but excluding ST4945. ST156 and ST4945 are highlighted in red, while all the other STs analysed in this study are in black. B) The STs analysed in this study are highlighted and colour coded based on the number of MLST alleles in common with the predicted founder, ST156 (colour coding is indicated in the Figure).
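The merging effect described in panel A can be illustrated with a simplified grouping rule: STs are placed in the same group when they are connected through a chain of single-locus variants (six of seven alleles shared). This is only a sketch of that linking rule, not the eBURST program itself (no founder prediction is attempted), and the ST labels and profiles are invented.

```python
def is_single_locus_variant(p, q):
    """True when two profiles differ at exactly one locus."""
    return sum(a != b for a, b in zip(p, q)) == 1


def clonal_groups(sts):
    """Group STs connected by chains of single-locus variants (union-find)."""
    parent = {name: name for name in sts}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    names = list(sts)
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if is_single_locus_variant(sts[a], sts[b]):
                parent[find(a)] = find(b)

    groups = {}
    for name in names:
        groups.setdefault(find(name), set()).add(name)
    return sorted(groups.values(), key=sorted)


# Hypothetical STs: two separate groups of single-locus variants.
toy_sts = {
    "ST-A": (1, 1, 1, 1, 1, 1, 1),
    "ST-A2": (1, 1, 1, 1, 1, 1, 2),
    "ST-B": (1, 1, 1, 1, 1, 3, 3),
    "ST-B2": (1, 1, 1, 1, 4, 3, 3),
}
# A newly identified ST that is a single-locus variant of both ST-A and ST-B.
with_bridge = dict(toy_sts, **{"ST-new": (1, 1, 1, 1, 1, 1, 3)})

print(len(clonal_groups(toy_sts)))      # groups before the new ST is added
print(len(clonal_groups(with_bridge)))  # groups after the new ST is added
```

In the toy data a single bridging profile is enough to fuse two otherwise separate groups, which mirrors the role attributed to ST4945 in the legend.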
Figure 3. Minimum Spanning Tree analysis based on 96-MLST allelic profiles identifies seven distinct lineages by imposing a maximum threshold of 75 different loci.
The Minimum Spanning Tree analysis was performed by using PHYLOVIZ on the 96-MLST alleles of the 66 strains considered in this study. The lineages identified by applying the threshold of 75/96 different loci are highlighted with shadowed shapes and named according to the lineage identification of Figure 2.
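The thresholding idea can be sketched as follows: build a minimum spanning tree over the pairwise number of differing loci, then discard tree edges above the threshold so that the remaining connected components form the lineages. This is a minimal sketch using Kruskal's algorithm, not PHYLOViZ's implementation (its tie-breaking rules are not reproduced); the profiles, the four-locus scale and the threshold of 2 are toy assumptions standing in for 96 loci and a threshold of 75.

```python
from itertools import combinations


class DisjointSet:
    """Minimal union-find over hashable items."""

    def __init__(self, items):
        self.parent = {item: item for item in items}

    def find(self, x):
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]  # path halving
            x = self.parent[x]
        return x

    def union(self, a, b):
        """Merge the sets holding a and b; return False if already joined."""
        root_a, root_b = self.find(a), self.find(b)
        if root_a == root_b:
            return False
        self.parent[root_a] = root_b
        return True


def n_differences(p, q):
    """Number of loci at which two allelic profiles differ."""
    return sum(a != b for a, b in zip(p, q))


def minimum_spanning_tree(profiles):
    """Kruskal's algorithm; returns tree edges as (strain_a, strain_b, weight)."""
    edges = sorted(
        (n_differences(profiles[a], profiles[b]), a, b)
        for a, b in combinations(sorted(profiles), 2)
    )
    forest = DisjointSet(profiles)
    tree = []
    for weight, a, b in edges:
        if forest.union(a, b):
            tree.append((a, b, weight))
    return tree


def lineages(profiles, max_diff):
    """Connected components left after dropping tree edges above max_diff."""
    forest = DisjointSet(profiles)
    for a, b, weight in minimum_spanning_tree(profiles):
        if weight <= max_diff:
            forest.union(a, b)
    groups = {}
    for name in profiles:
        groups.setdefault(forest.find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())


# Toy profiles over four hypothetical loci.
toy_profiles = {
    "s1": (1, 1, 1, 1),
    "s2": (1, 1, 1, 2),
    "s3": (1, 1, 2, 2),
    "s4": (9, 9, 9, 9),
    "s5": (9, 9, 9, 8),
}

print(lineages(toy_profiles, max_diff=2))
```

Lowering the threshold splits the tree into more components and raising it merges them, so the number of lineages reported depends directly on the cut-off chosen.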
Figure 4. ST4945 can be unequivocally assigned to one of the identified lineages.
The distribution of the 7-MLST and the 96-MLST alleles was analysed by assigning identical colours to identical alleles across the strains (white = unique alleles). Red arrows indicate ST4945 strains, while black and orange arrows indicate single and double 7-MLST locus variants of ST4945, respectively. The 96-MLST loci are listed according to their order in the genome.
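One way to reproduce the colouring rule described here is to give each allele that is shared by at least two strains a per-locus colour index and to leave alleles seen in a single strain uncoloured (white). The sketch below assumes that reading of "unique"; the strain names, profiles and function name are hypothetical.

```python
from collections import Counter


def colour_matrix(profiles):
    """Assign a per-locus colour index to shared alleles; None marks unique ones.

    profiles maps a strain name to a tuple of allele IDs, one per locus,
    with loci listed in the same order for every strain.
    """
    names = list(profiles)
    n_loci = len(profiles[names[0]])
    colours = {name: [] for name in names}
    for locus in range(n_loci):
        counts = Counter(profiles[name][locus] for name in names)
        # Only alleles carried by two or more strains receive a colour.
        shared = sorted(allele for allele, n in counts.items() if n > 1)
        colour_of = {allele: index for index, allele in enumerate(shared)}
        for name in names:
            colours[name].append(colour_of.get(profiles[name][locus]))
    return colours


# Toy profiles over two hypothetical loci.
toy_profiles = {"x": (1, 5), "y": (1, 6), "z": (2, 6)}

for strain, row in colour_matrix(toy_profiles).items():
    print(strain, row)
```

Rows that show the same colour indices across many loci correspond to strains with near-identical profiles, which is how the figure lets ST4945 strains be matched visually to a lineage.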
Figure 5. The CC156 lineages (a-j) identified with the hierarchical clustering (shadowed shapes) as defined in Figure 2 correlate with the ST distribution in the eBURST diagram and with PI-1 distribution.
STs are indicated with different colours depending on PI-1 presence/absence as indicated in the Figure legend.