| Literature DB >> 31184299 |
Andries J van Tonder1, Rebecca A Gladstone1, Stephanie W Lo1, Moon H Nahm2, Mignon du Plessis3,4, Jennifer Cornick5, Brenda Kwambana-Adams6, Shabir A Madhi7, Paulina A Hawkins8, Rachel Benisty9, Ron Dagan9, Dean Everett10, Martin Antonio6, Keith P Klugman8, Anne von Gottberg3,4, Robert F Breiman8,10, Lesley McGee11, Stephen D Bentley1,12.
Abstract
The pneumococcus produces a polysaccharide capsule, encoded by the cps locus, that provides protection against phagocytosis and determines serotype. Nearly 100 serotypes have been identified with new serotypes still being discovered, especially in previously understudied regions. Here we present an analysis of the cps loci of more than 18 000 genomes from the Global Pneumococcal Sequencing (GPS) project with the aim of identifying novel cps loci with the potential to produce previously unrecognized capsule structures. Serotypes were assigned using whole genome sequence data and 66 of the approximately 100 known serotypes were included in the final dataset. Closer examination of each serotype's sequences identified nine putative novel cps loci (9X, 11X, 16X, 18X1, 18X2, 18X3, 29X, 33X and 36X) found in ~2.6 % of the genomes. The large number and global distribution of GPS genomes provided an unprecedented opportunity to identify novel cps loci and consider their phylogenetic and geographical distribution. Nine putative novel cps loci were identified and examples of each will undergo subsequent structural and immunological analysis.Entities:
Keywords: Streptococcus pneumoniae; cps locus; pneumococcus; polysaccharide capsule; serotype
Mesh:
Substances:
Year: 2019 PMID: 31184299 PMCID: PMC6700660 DOI: 10.1099/mgen.0.000274
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.(a) World map showing the global distribution of the samples in the study dataset. (b) Bubble plot showing the 66 serotypes included in the dataset (the area of each circle is proportional to the number of samples).
Fig. 2.Identification of putative novel cps loci; plots of total SNPs against the percentage of the serotype reference covered by sequence reads are shown for nine serotypes with putative novel cps loci (highlighted in red).
Summary of putative novel cps loci identified in this study Putative novel cps loci (in alphanumerical order by related serotype).
|
Putative |
Genomes ( |
GPSC lineages ( |
Supraregions ( |
Countries ( |
Carriage ( |
Disease ( |
Collection dates |
|---|---|---|---|---|---|---|---|
|
9X |
9 |
6 |
3 |
5 |
5 |
4 |
2007–2014 |
|
11X |
44 |
5 |
3 |
4 |
43 |
1 |
2000–2017 |
|
16X |
338 |
11 |
2 |
6 |
251 |
72 |
2003–2016 |
|
18X1 |
1 |
1 |
1 |
1 |
1 |
0 |
2014 |
|
18X2 |
7 |
4 |
3 |
5 |
2 |
5 |
2006–2016 |
|
18X3 |
16 |
4 |
5 |
7 |
4 |
12 |
1999–2013 |
|
29X |
29 |
5 |
2 |
5 |
7 |
20 |
2000–2015 |
|
33X |
30 |
3 |
7 |
10 |
10 |
20 |
2003–2015 |
|
36X |
2 |
2 |
2 |
2 |
2 |
0 |
2009 |
Fig. 3.(a) Phylogeny and SNP map of serotype 9L. (b) Comparison of the putative serotype 9X sequence and the serotype 9L and 9N reference sequences.
Fig. 4.(a) Phylogeny and SNP map of serotype 11A. (b) Comparison of the putative serotype 11X sequence and the serotype 11A and 11F reference sequences. (c) Phylogeny containing the serogroup 11 references, serotype 11X and PMP1343.
Fig. 5.(a) Phylogeny and SNP map of serotype 16F. (b) Comparison of the putative serotype 16X sequence and the serotype 16F reference sequence.
Fig. 6.(a) Phylogeny and SNP map of serotype 18A. (b) Phylogeny and SNP map of serotype 18C. (c) Comparison of the putative serotype 18X sequences and the serotype 18A and 18C reference sequences.
Fig. 8.(a) Phylogeny and SNP map of serotype 29. (b) Comparison of the putative serotype 29X sequence and the serotype 29 reference sequence.
Fig. 9.(a) Phylogeny and SNP map of serotype 33F. (b) Comparison of the putative serotype 33X sequence and the serotype 33F reference sequence.
Fig. 10.(a) Phylogeny and SNP map of serotype 36. (b) Comparison of the putative serotype 36X sequence and the serotype 36 reference sequence.