| Literature DB >> 32509595 |
Lucia Gonzales-Siles1,2,3, Roger Karlsson1,2,3, Patrik Schmidt1, Francisco Salvà-Serra1,2,3,4,5, Daniel Jaén-Luchoro1,3, Susann Skovbjerg1,2,3, Edward R B Moore1,2,3,4, Margarita Gomila5.
Abstract
Correct identifications of isolates and strains of the Mitis-Group of the genus Streptococcus are particularly difficult, due to high genetic similarity, resulting from horizontal gene transfer and homologous recombination, and unreliable phenotypic and genotypic biomarkers for differentiating the species. Streptococcus pneumoniae and Streptococcus pseudopneumoniae are the most closely related species of the clade. In this study, publicly-available genome sequences for Streptococcus pneumoniae and S. pseudopneumoniae were analyzed, using a pangenomic approach, to find candidates for species-unique gene markers; ten species-unique genes for S. pneumoniae and nine for S. pseudopneumoniae were identified. These species-unique gene marker candidates were verified by PCR assays for identifying S. pneumoniae and S. pseudopneumoniae strains isolated from clinical samples. All determined species-level unique gene markers for S. pneumoniae were detected in all S. pneumoniae clinical isolates, whereas fewer of the unique S. pseudopneumoniae gene markers were present in more than 95% of the clinical isolates. In parallel, taxonomic identifications of the clinical isolates were confirmed, using conventional optochin sensitivity testing, targeted PCR-detection for the "Xisco" gene, as well as genomic ANIb similarity analyses for the genome sequences of selected strains. Using mass spectrometry-proteomics, species-specific peptide matches were observed for four of the S. pneumoniae gene markers and for three of the S. pseudopneumoniae gene markers. Application of multiple species-level unique biomarkers of S. pneumoniae and S. pseudopneumoniae, is proposed as a protocol for the routine clinical laboratory for improved, reliable differentiation, and identification of these pathogenic and commensal species.Entities:
Keywords: S. pseudopneumoniae; Streptococcus; gene markers; identification; pangenome; pneumococcus; proteotyping
Mesh:
Year: 2020 PMID: 32509595 PMCID: PMC7248185 DOI: 10.3389/fcimb.2020.00222
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 5.293
Figure 1Schematic representation of the workflow used for identifications of unique genes for S. pneumoniae and S. pseudopneumoniae by applying a pangenome approach.
List of PCR-primers used for amplification of unique gene markers of S. pneumoniae and S. pseudopneumoniae.
| Pseudo_901 –F | ATG ACA ACT GCA AAA CTC G | This study | ||
| Pseudo_901 –R | CCA TTG ATA GCA CAA CTG AC | This study | ||
| Pseudo_902 –F | ABC-2 type transporter | TGG CTA CCC TCT AGT TAT TG | This study | |
| Pseudo_902 –R | CGA CTA CGG AAA TGT TTC TC | This study | ||
| Pseudo_231 –F | ATC AGT TCG GAC TGG AGA | This study | ||
| Pseudo_231 –R | CGA ATT AGG ATT GGG TTA CTC | This study | ||
| Pseudo_899 –F | TAG GGC AAG CTG TAT TTA CG | This study | ||
| Pseudo_899 –R | TGA CAG AGT TTG ATT CGC A | This study | ||
| Pseudo_232 –F | ACA GCC CTG TAT ATT GGT AG | This study | ||
| Pseudo_232 –R | GTG ATG TGG TGA TTT ATC CTG | This study | ||
| Pseudo_228–F | Potassium-transporting | CTG TTC AAG CCA ATG GTA G | This study | |
| Pseudo_228 –R | ACA TCG GCT TCG GGA TTG | This study | ||
| Pneumo_1011 –F | GCA AAT TAC GGT GTA AGT GCT GA | This study | ||
| Pneumo_1011 –R | TAT TGA AAG TGG TGT TGG AGT GCA | This study | ||
| Pneumo_1012 –F | AGC AGG TTC TAG TCT TGC CAT AA | This study | ||
| Pneumo_1012 –R | AAG ACC AAC AGC CAT TTC ATC AC | This study | ||
| Pneumo_1013 –F | TCC TGA TAT AAT CGG TGT CAC AAG | This study | ||
| Pneumo_1013 –R | CAG TTA CAA CAC CTA CTG GAT ATC T | This study | ||
| Pneumo_1014 –F | CAA ATG GTT GTG GGA AAT CAA CAC T | This study | ||
| Pneumo_1014 –R | CCC AGA AAG TTC TTC AAC TAG GTT A | This study | ||
| Pneumo_1961 –F | TTG GAA GGA GCT GCA AGT AAT G | This study | ||
| Pneumo_1962 –R | AAG CTT TAG ACT TGT TAG TTT CTG AG | This study | ||
| Pneumo_127 –F | Putative ABC transporter | GAT TTC CCG CTT CCA CTT TCA C | This study | |
| Pneumo_127 –R | CGA AAT AGA GTT GCC ACA GAC AT | This study | ||
| 5202_cpsA_ | GCA GTA CAG CAG TTT GTT GGA CTG ACC | Pai et al., | ||
| 3202_cpsA_R | GAA TAT TTT CAT TAT CAG TCC CAG TC | Pai et al., | ||
| 5203_ply – | ATT TCT GTA ACA GCT ACC AAC GA | Salo et al., | ||
| 3203_ply_–R | GAA TTC CCT GTC TTT TCA AAG TC | Salo et al., | ||
| 5322_lytA –F | CAA CCG TAC AGA ATG AAG CGG | Nagai et al., | ||
| 3322_lytA_–R | TTAT TCG TGC AAT ACT CGT GCG | Nagai et al., | ||
| Spne-CW-F2 | “Xisco” gene | TGA CGA TTC TAG GAA AAG ATA CAG | Salvà-Serra et al., | |
| Spne-CW-R | AGC AGG TGA CTG GTA GGT AAC | Salvà-Serra et al., |
Pangenome distribution of the genes for each species, using 70C/70S (70% Contiguously aligned/70% sequence Similarity) criteria.
| Core | 1,007 | 19.5 | 1,306 | 35.0 | 1,169 | 31.1 |
| Soft-core | 1,196 | 23.2 | 1,495 | 40.1 | 1,614 | 42.9 |
| Cloud | 2,856 | 55.4 | 1,232 | 33.0 | 1,626 | 43.2 |
| Shell | 1,101 | 21.4 | 1,004 | 26.9 | 522 | 13.9 |
| Pangenome | 5,153 | 3,731 | 3,762 | |||
Figure 2Venn diagram showing the number of shared and unique genes between S. pneumoniae. S. pseudopneumoniae and S. mitis.
Figure 3Schematic representation of the number of unique genes found at each step of the analysis for identifications of specific unique biomarkers for S. pneumoniae and S. pseudopneumoniae.
List of unique genes of S. pseudopneumoniae and S. pneumoniae.
| Pseudo_232 | WP_000847726.1 | Potassium-transporting ATPase C chain | 2,634 | 99.8 | |
| Pseudo_901 | WP_000205301.1 | Putative ABC transporter ATP-binding protein YbhF | 912 | 99.9 | |
| Pseudo_231 | WP_000787304.1 | KDP operon transcriptional regulatory protein KdpE | 727 | 99.8 | |
| Pseudo_902 | WP_000191422.1 | ABC-2 type transporter | 724 | 100.0 | |
| Pseudo_899 | WP_000912214.1 | Transcriptional regulatory protein YpdB | 708 | 99.9 | |
| Pseudo_228 | WP_001225808.1 | Sensor protein KdpC | 619 | 99.3 | |
| Pseudo_1764 | WP_023937803.1 | Hypothetical protein | 318 | 99.7 | |
| Pseudo_641 | WP_000907298.1 | Hypothetical protein | 240 | 99.7 | |
| Pseudo_1933 | NA | Hypothetical protein | 147 | 98.2 | |
| Pneumo_127 | WP_000288029.1 | Putative ABC transporter ATP-binding protein | 1,242 | 100.0 | |
| Pneumo_1011 | WP_000790743.1 | Fe(3+)-citrate-binding protein YfmC precursor | 1,026 | 98.3 | |
| Pneumo_1012 | WP_000543061.1 | Ferric enterobactin transport system permease protein FepD | 1,008 | 99.8 | |
| Pneumo_1013 | WP_001180357.1 | Putative siderophore transport system permease protein YfhA | 1,008 | 99.9 | |
| Pneumo_1014 | WP_000677520.1 | Putative siderophore transport system ATP-binding protein YusV | 795 | 99.8 | |
| Pneumo_1961 | WP_000105270.1 | HTH-type transcriptional regulator GmuR | 729 | 99.7 | |
| Pneumo_1964 | WP_000809626.1 | Lichenan-specific phosphotransferase enzyme IIB component | 310 | 99.4 | |
| Pneumo_436 | WP_001846568.1 | Hypothetical protein | 237 | 99.1 | |
| Pneumo_1362 | WP_000500399.1 | ApaLI-like restriction endonuclease | 192 | 87.6 | |
| Pneumo_1361 | WP_001262530.1 | ApaLI-like restriction endonuclease | 183 | 99.9 |
The names of the genes, the protein IDs and the proteins encoded by the genes are indicated, as well as the sizes of the genes in nucleotides and the percentages of similarity with the respective genes in the genome of the closest related strains.
NA, No annotation available.
Genotypic and phenotypic characterization of S. pneumoniae strains used for confirmation of gene markers.
| CCUG 28588T | + | + | + | + | + | + | S | S | + | 99.9 |
| CCUG 1350 | + | + | + | + | + | + | S | S | + | 98.3 |
| CCUG 6798 | + | + | + | + | + | + | S | S | + | 98.4 |
| CCUG 7206 | + | + | + | + | + | + | S | S | + | 98.3 |
| CCUG 11780 | + | + | + | + | + | + | S | S | + | 98.2 |
| CCUG 32672 | + | + | + | + | + | + | S | S | + | 98.4 |
| CCUG 33774 | + | + | + | + | + | + | S | S | + | 98.4 |
| CCUG 35180 | + | + | + | + | + | + | S | S | + | 98.4 |
| CCUG 35229 | + | + | + | + | + | + | S | S | + | 98.5 |
| CCUG 35272 | + | + | + | + | + | + | S | S | + | 98.2 |
| CCUG 35561 | + | + | + | + | + | + | S | S | + | 98.5 |
| CCUG 36618 | + | + | + | + | + | + | S | S | + | 98.4 |
| CCUG 36800 | + | + | + | + | + | + | S | S | + | 98.4 |
| CCUG 45673 | + | + | + | + | + | + | S | S | + | 98.3 |
| CCUG 63093 | + | + | + | + | + | + | S | S | + | 98.3 |
| CCUG 63665 | + | + | + | + | + | + | S | S | + | 98.4 |
| CCUG 68718 | + | + | + | + | + | + | S | S | + | 98.2 |
| CCUG 69380 | + | + | + | + | + | + | S | S | + | 98.4 |
| CCUG 69381 | + | + | + | + | + | + | S | S | + | 98.3 |
| CCUG 69382 | + | + | + | + | + | + | S | S | + | 98.3 |
ANIb values calculated with JSpeciesWS against the type strain of S. pneumoniae NCTC 7465.
Genotypic and phenotypic characterization of the 29 S. pseudopneumoniae strains used for confirmation of gene markers; ANIb similarity values calculated with JSpeciesWS against S. pseudopneumoniae type strain CCUG 49455T are indicated if the whole genome sequence was determined.
| CCUG 47366 | + | + | + | + | + | + | S | R | – | – | + | – | 98.6 |
| CCUG 48465 | + | + | + | + | + | + | S | R | – | + | + | – | * |
| CCUG 49455T | + | + | + | + | + | + | S | R | – | – | + | – | * |
| CCUG 50866 | + | + | + | + | + | + | S | R | – | – | + | – | * |
| CCUG 50867 | + | + | + | + | + | + | S | R | – | + | + | – | * |
| CCUG 50868 | + | + | + | + | – | + | S | R | – | – | + | – | 97.3 |
| CCUG 50869 | + | + | + | + | + | + | S | R | – | + | + | – | * |
| CCUG 50870 | + | + | + | + | + | + | S | R | – | – | + | – | * |
| CCUG 50871 | – | + | + | + | – | + | R | R | – | – | + | – | 98.8 |
| CCUG 61551 | – | + | + | + | – | + | S | R | – | – | + | – | 98.2 |
| CCUG 62647 | + | + | + | + | + | + | R | R | – | – | + | – | * |
| CCUG 63747 | + | + | + | + | + | + | S | R | – | – | + | – | * |
| CCUG 63793 | – | + | – | + | – | + | S | S | – | – | + | – | 97.5 |
| CCUG 64062 | – | + | + | + | – | + | S | R | – | – | + | – | 96.8 |
| CCUG 69906 | + | + | + | + | + | + | S | R | – | + | + | – | * |
| CCUG 70658 | + | + | + | + | + | + | S | R | – | + | + | – | * |
| CCUG 70988 | – | + | + | + | – | + | S | S | – | – | – | – | 97.6 |
| CCUG 71653 | – | + | – | + | – | + | S | R | – | – | + | – | 98.6 |
| CCUG 71770 | – | + | + | + | – | + | S | R | – | – | + | – | * |
| CCUG 71776 | + | + | + | + | + | + | S | R | – | – | + | – | * |
| CCUG 71942 | + | + | + | + | + | + | S | R | – | – | + | – | * |
| CCUG 71983 | + | + | + | + | + | + | S | R | – | – | + | – | * |
| CCUG 71996 | – | – | – | + | – | + | S | R | – | – | – | – | 98.5 |
| CCUG 72029 | + | + | + | + | – | + | S | R | – | – | + | – | 97.0 |
| CCUG 72040 | + | + | + | + | – | + | S | R | – | – | + | – | 96.9 |
| CCUG 72012 | + | + | + | + | + | + | S | R | – | – | + | – | * |
| CCUG 72018 | + | + | + | + | + | + | S | R | – | – | + | – | 97.3 |
| CCUG 72019 | + | + | + | + | + | – | R | R | – | – | + | – | 98.5 |
| CCUG 72028 | + | + | + | + | – | + | S | R | – | – | + | – | 96.8 |
S, sensitive; R, resistant; *Not whole genome sequenced.
List of the peptides detected from the unique genes of S. pneumoniae and S. pseudopneumoniae when analyzed using inclusion lists and targeted proteomics.
| Pseudo_228 | DIISGSQNLAPSNPELK | 17 | x | x | x | |
| DIPADLVTTSASGLDPEISPESAK | 24 | x | x | |||
| ELSLLIEENPTISIR | 15 | x | ||||
| LEEIIDKHTVTK | 12 | x | x | x | ||
| LIGSALIGQEFSSAAFLHGR | 20 | x | x | |||
| PSAIQYNTYLSEGDPSGQKR | 20 | x | x | |||
| VQKELSLLIEENPTISIR | 18 | x | ||||
| Pseudo_899 | FLNLGQAVFTFTFGK | 15 | x | |||
| Pseudo_902 | ALPFVPSSNLLR | 12 | x | |||
| Pneumo_1011 | VATIAWGNHDVALALGIVPVGFSK | 24 | x | x | ||
| ANLFDDLDGLNFEAISNSK | 19 | x | ||||
| INDADVIITYGDDK | 14 | x | ||||
| VLFTMINAADTSK | 13 | x | x | x | ||
| EISAEEANK | 9 | x | ||||
| ANYGVSADK | 9 | x | x | x | ||
| EDYDTLSK | 8 | x | x | x | ||
| IAPVAAYK | 8 | x | ||||
| GYSGITK | 7 | x | x | x | ||
| PWQTLWR | 7 | x | x | |||
| EGDELIK | 7 | x | x | x | ||
| TLEALQK | 7 | x | ||||
| ALGMEK | 6 | x | x | x | ||
| DPLLGK | 6 | x | ||||
| Pneumo_1014 | HIAILPQSPIIPESITVADLVSR | 23 | x | x | x | |
| ANVEDLANNLVEELSGGQR | 19 | x | x | |||
| DPISNSPLMIPIGK | 14 | x | x | |||
| PLEGEVLLDNK | 11 | x | ||||
| GVLPWTEEK | 9 | x | x | x | ||
| DDLEIINR | 8 | x | x | x | ||
| SINSYK | 6 | x | ||||
| Pneumo_1961 | ISEDAHSTIDSR | 12 | x | x | x | |
| Pneumo_1964 | NIHEADVILIGPQIR | 15 | x | x | x | |
| EIAGNIPVDTIDMR | 14 | x | ||||
| VLEQALAWIGEIR | 13 | x | x | |||
| DYGMMNGAK | 9 | x |
“X” indicates presence of a particular peptide of the three strains analyzed for each species.