| Literature DB >> 28729954 |
Carmen L Sheppard1, Georgia Kapatai1, Karen Broughton1, Ulf Schaefer2, Matthew Hannah3, David J Litt1, Norman K Fry1.
Abstract
The major virulence factor of the pneumococcus, and target for conjugate vaccines, is the polysaccharide capsule, which is usually encoded by the highly variable cps locus. Serotype 37 is an unusual pneumococcal type in which the single β-glucosyltransferase gene responsible for serotype capsule production (tts) is located outside of the capsular operon region. Using a previously described automated whole genome sequence (WGS)-based serotyping bioinformatics tool, PneumoCaT, we identified and investigated seven clinical isolates (three from blood cultures) of non-pneumococcal streptococci containing a highly homologous tts and included them in a study panel of 20 isolates which included a 11 further clinical isolates of S. pneumoniae serotype 37, a reference strain of serotype 37 and the S. pseudopneumoniae type strain BAA 960T. The seven non-pneumococcal isolates generated novel alleles at all pneumococcal MLST loci and gave low percentage similarity (<45%) to S. pneumoniae or S. pseudopneumoniae species by comparison of short sequence patterns in genomic data (k-mer analysis). The S. pseudopneumoniae BAA-960T isolate generated two novel alleles in the MLST and gave a high similarity (>99%) to the reference sequence for BAA-960T. Twelve isolates gave high similarity (>77%) to the Streptococcus pneumoniae 5652-06 serotype 19A reference genome sequence and had previously reported MLST alleles. Each of the seven clinical non-pneumococcal strains and all of the 12 S. pneumoniae possessed a β-glycosyltransferase gene (tts) with >95% similarity to the pneumococcal tts reference DNA sequence with 20-22 non-synonymous SNPs. All but two strains in which the tts gene was detected gave positive reactions for serotype 37 in slide agglutination tests with serotype 37 typing sera. Phylogenetic analysis using both SNP and MLST data showed distinct clades corresponding to strains identified as pneumococcus or non-pneumococcus by kmer WGS analysis. Extended k-mer database analysis and ribosomal MLST placed the non-pneumococcal isolates within the S. mitis group. Biochemical and bile solubility assays showed differences between the unusual isolates and S. pneumoniae. All isolates had detectable pneumolysin (ply) genes, but only those that identified as pneumococcus contained the genes for autolysin (lytA) or the ABC transporter lipoprotein A (piaA) with >80% coverage and >95% similarity. Here we report the existence of a novel group of strains distinct from S. pneumoniae, but which can express a pneumococcal serotype 37 capsular polysaccharide which can be associated with clinical disease.Entities:
Keywords: Microbial genomics; Microbial identification; Phylogenetics; Streptococci
Year: 2017 PMID: 28729954 PMCID: PMC5518733 DOI: 10.7717/peerj.3571
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Phenotypic analysis of serotype 37 isolates, with BAA-960T as a Streptococcus pseudopneumoniae reference.
| Isolate | Colony appearance | Serotype by slide agglutination | Optochin sensitivity | Bile solubility | API 32 Strep profile | API species identification | API % |
|---|---|---|---|---|---|---|---|
| PHESPD0338 | Very Mucoid | 37 | Sensitive | Soluble | 72076741110 | 88.0 | |
| PHESPD0344 | Very Mucoid | 37 | Sensitive | Soluble | 40272741110 | 99.9 | |
| PHESPD0356 | Very Mucoid | 37 | Sensitive | Soluble | 46072741100 | 77.5 | |
| PHESPD0383 | Very Mucoid | 37 | Sensitive | Soluble | 60076741110 | 99.9 | |
| PHESPV0691 | Very Mucoid | 37 | Sensitive | Soluble | 76076741110 | 87.4 | |
| PHESPV1034 | Very Mucoid | 37 | Sensitive | Soluble | 40076741100 | 99.6 | |
| PHESPV1405 | Very Mucoid | 37 | Sensitive | Soluble | 60076741110 | 99.9 | |
| PHESPD0363 | Very Mucoid | 37 | Sensitive | Soluble | 46072741100 | 77.5 | |
| PHESPV0789 | Very Mucoid | 37 | Sensitive | Soluble | 40072601100 | 97.9 | |
| PHESPV1119 | Very Mucoid | 37 | Sensitive | Soluble | 50076741100 | 99.9 | |
| PHESPD0355 | Very Mucoid | 37 | Sensitive | Soluble | 50072541100 | 99.9 | |
| SSI-37 | Very Mucoid | 37 | Sensitive | Soluble | 52076741100 | 98.3 | |
| PHESPD0357 | Small | 37 | Sensitive | Insoluble | 00132541100 | 97.3 | |
| PHENP00003 | Small | 37 | Sensitive | Indeterminate | 40036441100 | 95.3 | |
| PHENP00005 | Small | 37 | Sensitive | Indeterminate | 40016441100 | 90.6 | |
| PHENP00006 | Small | 37 | Sensitive | Indeterminate | 42052541100 | 92.9 | |
| PHENP00001 | Very Small | 37 | Sensitive | Indeterminate | 40116441120 | 94.6 | |
| PHENP00002 | Very Small | Untypeable | Sensitive | Indeterminate | 40112441110 | 95.2 | |
| PHENP00007 | Very Small | Untypeable | Sensitive | Indeterminate | 60016441100 | 96.0 | |
| BAA-960T | Rough | Not done | Sensitive | Indeterminate | 64012441140 | 98.3 |
Serotype 37 (tts gene detected) isolates with standard Streptococcus pneumoniae WGS workflow analysis results, including the S. pseudopneumoniae (BAA-960T) and SSI serotype 37 strains for reference.
| Isolate number | Isolation site | MLST Sequence type | MLST profile ( | kmerID top match reference genome | kmerID % similarity | |
|---|---|---|---|---|---|---|
| PHESPD0338 | Blood | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 77.4 | |
| PHESPD0344 | Blood | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 77.4 | |
| PHESPD0356 | Blood | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 77.3 | |
| PHESPD0383 | Blood | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 77.3 | |
| PHESPV0691 | CSF | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 77.4 | |
| PHESPV1034 | Blood | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 79.7 | |
| PHESPV1405 | Blood | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 77.5 | |
| PHESPD0363 | Blood | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 77.7 | |
| PHESPV0789 | Unknown | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 77.9 | |
| PHESPV1119 | Blood | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 77.5 | |
| PHESPD0355 | Unknown | 447 | 29, 33, 19, 1, 36, 22, 31 | + | 77.4 | |
| SSI-37 | Unknown | 7243 | 192, 34, 19, 1, 36, 22, 445 | + | 77.6 | |
| PHESPD0357 | Blood | Novel | all loci novel | + | 41.0 | |
| PHENP00003 | Sputum | Novel | all loci novel | + | 34.6 | |
| PHENP00005 | Sputum | Novel | all loci novel | + | 37.0 | |
| PHENP00006 | Sputum | Novel | all loci novel | + | 36.4 | |
| PHENP00001 | Sputum | Novel | all loci novel | + | 33.8 | |
| PHENP00002 | Blood | Novel | all loci novel | + | 33.8 | |
| PHENP00007 | Blood | Novel | all loci novel | + | 34.0 | |
| BAA-960T | Unknown | Novel | 139, 371, 345, | – | 99.8 |
Notes.
novel allele.
cerebrospinal fluid
Figure 1Electron micrographs of (A) mucoid serotype 37 pneumococcus PHESPV1034, (B) unusual serotype 37 PHENP00003, (C) BAA-960 Streptococcus pseudopneumoniae.
Scale bar 1 um.
Sites within the TTS protein sequence that show amino acid variability for 19 study panel isolates (with amino acid codes), including BLOSUM62 score for likelihood of substitution in a homologous protein.
| Isolate | Amino acid position | ||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17 | 79 | 82 | 85 | 86 | 88 | 89 | 99 | 118 | 126 | 243 | 247 | 341 | 348 | 376 | 389 | 416 | 463 | 473 | 476 | 479 | 492 | 509 | |
| S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V | |
| PHESPD0338 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| PHESPD0344 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| PHESPD0356 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| PHESPD0383 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| PHESPV0691 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| PHESPV1034 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| PHESPV1405 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| PHESPD0363 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| PHESPV0789 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| PHESPV1119 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| PHESPD0355 | S | N | H | – | – | S | S | A | V | A | K | Y | W | V | F | V | V | L | T | T | I | A | V |
| SSI-37 | S | N | H | – | – | S | S | A | V | A | K | Y | G | V | F | V | V | L | T | T | I | A | V |
| PHESPD0357 | T | K | Y | – | – | N | N | D | A | V | N | R | W | I | Y | F | L | F | I | M | F | V | L |
| PHENP00003 | T | K | Y | – | – | N | N | D | A | V | N | R | W | I | Y | F | L | F | I | M | F | V | L |
| PHENP00005 | T | K | Y | – | – | N | N | D | A | V | N | R | W | I | Y | F | L | F | I | M | F | V | L |
| PHENP00006 | T | K | Y | – | – | N | N | D | A | V | N | R | W | I | Y | F | L | F | I | M | F | V | L |
| PHENP00007 | T | K | Y | – | – | N | N | D | A | V | N | R | W | I | Y | F | L | F | I | M | F | V | L |
| PHENP00001 | T | K | Y | I | Y | N | N | D | A | V | N | R | W | I | Y | F | L | F | I | M | F | V | L |
| PHENP00002 | T | K | Y | I | Y | N | N | D | A | V | N | R | W | I | Y | F | L | F | I | M | F | V | L |
| BLOSUM62 Score | 1 | 0 | 2 | n/a | n/a | 1 | 1 | −2 | 0 | 0 | 0 | −2 | −2 | 2 | 2 | −1 | 1 | 0 | −1 | −1 | 0 | 0 | 1 |
Notes.
Amino acids insertions. All other stated amino acid positions are referenced according to the sequence in isolates that do not contain the insertions; n/a. not applicable.
Residues in with positive scores (green) substitute frequently in homologous proteins (positive BLOSUM62 score), while those in pink substitute rarely.
Further genomic analysis, extended kmerID database and rMLST.
| Isolate | PubMLST BIGSdb rMLST | Extended kmerID | |||||
|---|---|---|---|---|---|---|---|
| rST | Profile match species | No. Alleles matched | Speciation match (single | Speciation first match gene | Top match reference genome | Similarity % | |
| PHESPD0338 | 23,921 | 52 | 81.1 | ||||
| PHESPD0344 | 23,921 | 52 | 81.1 | ||||
| PHESPD0356 | 23,921 | 52 | 80.8 | ||||
| PHESPD0383 | 23,921 | 52 | 80.9 | ||||
| PHESPV0691 | 23,921 | 52 | 81.1 | ||||
| PHESPV1034 | 23,921 | 52 | 83.2 | ||||
| PHESPV1405 | 23,921 | 52 | 81.1 | ||||
| PHESPD0363 | novel | – | 51 | 81.2 | |||
| PHESPV0789 | novel | – | 51 | 81.1 | |||
| PHESPV1119 | novel | – | 50 | 81.1 | |||
| PHESPD0355 | novel | – | 51 | 81.1 | |||
| SSI-37 | novel | – | 48 | 81.3 | |||
| PHESPD0357 | novel | – | 37 | 41.3 | |||
| PHENP00003 | novel | – | 47 | 36.9 | |||
| PHENP00005 | novel | – | 39 | 37.1 | |||
| PHENP00006 | novel | – | 40 | 38.0 | |||
| PHENP00001 | novel | – | 48 | 37.1 | |||
| PHENP00002 | novel | – | 48 | 37.4 | |||
| PHENP00007 | novel | – | 52 | 37.5 | |||
| BAA-960T | 31,046 | 52 | ND | ND | |||
Notes.
no match
Not Done
Figure 2Whole genome SNP analysis maximum likelihood tree (RAxML bipartition) of study set isolates using the acapsular R6 reference.
Blue squares represent study set of S. pneumoniae isolates, purple squares the S. pseudopneumoniae type strain and red squares the study set non-pneumococcal isolates. Additional contextual data were provided by Streptococcus pneumoniae belonging to different serotypes (N = 5, blue circles), S. pseudopneumoniae (N = 6, purple circles) and S.mitis strains (N = 5, green circles).Branch length corresponds to substitutions per base. Bootstrap support shown for all branches.
Figure 3Minimum evolution tree of concatenated MLST sequences (except ddl) of study set isolates with contextual isolates.
Blue squares represent S. pneumoniae isolates, purple squares the S. pseudopneumoniae type strain and red squares non-pneumococcal isolates included in this study. Additional contextual data from other Streptococcus pneumoniae serotypes (blue circles), S. pseudopneumoniae (purple circles) and S. mitis strains (green circles).
lytA, ply and piaA gene detection, coverage and identity statistics for isolates in the study.
| Isolate | ||||||
|---|---|---|---|---|---|---|
| PHESPD0338 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPD0344 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPD0356 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPD0383 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPV0691 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPV1034 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPV1405 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPD0363 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPV0789 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPV1119 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPD0355 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| SSI-37 | 100 | 99.58 | 100 | 99.65 | 100 | 99.6 |
| PHESPD0357 | 88 | 73.25 | 100 | 96.96 | 0 | 0 |
| PHENP00003 | 85.8 | 76.28 | 100 | 97.1 | 0 | 0 |
| PHENP00005 | 92.7 | 78.16 | 100 | 95.41 | 0 | 0 |
| PHENP00006 | 93.3 | 77.32 | 100 | 95.34 | 0 | 0 |
| PHENP00001 | 78.2 | 67.92 | 100 | 97.03 | 0 | 0 |
| PHENP00002 | 86.3 | 75.03 | 100 | 96.75 | 0 | 0 |
| PHENP00007 | 87.4 | 75.44 | 100 | 96.19 | 0 | 0 |
| BAA-960T | 91.8 | 78.47 | 100 | 96.68 | 0 | 0 |