| Literature DB >> 29298677 |
Gustavo Gámez1,2,3,4, Andrés Castro5,6, Alejandro Gómez-Mejia5,7, Mauricio Gallego5,6, Alejandro Bedoya5,6, Mauricio Camargo5, Sven Hammerschmidt7.
Abstract
BACKGROUND: In recent years, the idea of a highly immunogenic protein-based vaccine to combat Streptococcus pneumoniae and its severe invasive infectious diseases has gained considerable interest. However, the target proteins to be included in a vaccine formulation have to accomplish several genetic and immunological characteristics, (such as conservation, distribution, immunogenicity and protective effect), in order to ensure its suitability and effectiveness. This study aimed to get comprehensive insights into the genomic organization, population distribution and genetic conservation of all pneumococcal surface-exposed proteins, genetic regulators and other virulence factors, whose important function and role in pathogenesis has been demonstrated or hypothesized.Entities:
Keywords: Streptococcus Pneumoniae; Two component systems; Variome; Virulence factors
Mesh:
Substances:
Year: 2018 PMID: 29298677 PMCID: PMC5753484 DOI: 10.1186/s12864-017-4376-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Phylogenetic tree (slanted cladogram) of the pneumococcal genome / strains. By using the online NCBI Tools Genome Tree Report (ncbi.nlm.nih.gov/genome/tree/176) and the Tree Viewer 1.17.0 (ncbi.nlm.nih.gov/projects/treeview), a phylogenetic tree was constructed from the analysis by genomic BLAST of 8290 sequencing projects of pneumococci reported in the NCBI database. The topology of this slanted cladogram showed different pneumococcal lineages, where the selected set of 25 pneumococcal strains can be identified in red as external nodes (the “well-distributed” key features also highlighted in red), evidencing an optimal representation of the pneumococcal population. The overall number of sequenced pneumococcal genomes is provided for each external node. The blue lines depicted those external nodes where fully sequenced and annotated genomes are located
The study population set of 25 S. pneumoniae strains included in this study and their serotypes
| Serotype | # of Genes | NCBI Annotation | |
|---|---|---|---|
| D39 | 2 | 2069 | NC_008533.1 |
| R6 | No Capsule | 1967 | NC_003098.1 |
| TIGR4 | 4 | 2228 | NC_003028.3 |
| INV104 | 1 | 2003 | NC_017591.1 |
| AP200 | 11A | 2284 | NC_014494.1 |
| JJA | 14 | 2235 | NC_012466.1 |
| ATCC 700669 | 23F | 2224 | NC_011900.1 |
| INV200 | 14 | 2113 | NC_017593.1 |
| CGSP14 | 14 | 2276 | NC_010582.1 |
| G54 | 19F | 2186 | NC_011072.1 |
| gamPNI0373 | 1 | 2226 | NC_018630.1 |
| P1031 | 1 | 2254 | NC_012467.1 |
| SPN034156 | 3 | 1956 | NC_021006.1 |
| SPN994039 | 3 | 1974 | NC_021005.1 |
| SPN994038 | 3 | 1974 | NC_021026.1 |
| SPN034183 | 3 | 1985 | NC_021028.1 |
| OXC141 | 3 | 2037 | NC_017592.1 |
| 670-6B | 6B | 2430 | NC_014498.1 |
| A026 | 19F | 2153 | NC_022655.1 |
| Taiwan19F-14 | 19F | 2205 | NC_012469.1 |
| ST556 | 19F | 2219 | NC_017769.1 |
| TCH8431/19A | 19A | 2355 | NC_014251.1 |
| SPNA45 | 3 | 1921 | NC_018594.1 |
| 70,585 | 5 | 2323 | NC_012468.1 |
| Hungary19A 6 | 19A | 2402 | NC_010380.1 |
Function or pathogenic role of the virulence factors and two-component regulatory systems of S. pneumoniae
| Virulence Factors | Protein Name | Function and/or Pathogenic Role | |
|---|---|---|---|
| LPxTG - Proteins | BgaA | β-Galactosidase | β-Galactosidase Enzyme |
| EndoD | Endo-β-N-Acetylglucosaminidase D | Virulence | |
| PclA | Pneumococcal Collagen-Like Protein | Adherence and Invasion | |
| SpGH101 | Endo-α-N-Acetylgalactosaminidase | Virulence | |
| StrH | β-N-Acetylhexosaminidase | β-N-Acetylhexosaminidase Enzyme | |
| NanA | Neuraminidase A | Hydrolytic Enzyme, Adherence and Colonization | |
| PfbA | Plasmin- and Fibronectin-Binding Protein A | Adherence, Immune Evasion and Antiphagocytosis | |
| PrtA | Subtilysin-Like Serine Protease | Virulence | |
| PavB | Pneumococcal Adherence and Virulence Protein B | Adherence and Colonization | |
| KsgA | Dimethyladenosine Transferase | Virulence | |
| SpuA | Alkaline Amyllopullullanase | Pullullanase Enzyme and Immune Evasion | |
| HysA | Hyaluronate Lyase | Hyaluronidase Enzyme and Colonization | |
| SP_1492 | Cell Wall Surface Anchor Protein Family, Mucin-Binding Protein | Virulence | |
| ZmpA | Zinc Metalloprotease A, IgA1 | IgA1 Protease Enzyme and Colonization | |
| ZmpB | Zinc Metalloprotease B | Immune Evasion and Colonization | |
| ZmpC | Zinc Metalloprotease C | Immune Evasion and Colonization | |
| ZmpD | Zinc Metalloprotease D, IgA1 Paralog Protease | Immune Evasion and Colonization | |
| PsrP | Pneumococcal Serine-Rich Repeats Protein | Adherence | |
| RrgA | Pilus-1 Tip Protein (Adhesin) | Adherence | |
| RrgB | Pilus-1 Backbone Protein | Adherence | |
| RrgC | Pilus-1 Anchore Protein | Adherence | |
| PitA | Pilus-2 Subunit, Ancillary Protein | Adherence | |
| PitB | Pilus-2 Subunit, Backbone Protein | Adherence | |
| Choline-Binding Proteins (CBPs) | LytA | Autolysin (N-Acetyl-Muramoyl-L-Alanine Amidase) | Autolytic Enzyme, Cell Wall Digestion and Autolysis |
| LytB | Endo-β-N-Acetylglucosamidase | Immune Evasion and Colonization | |
| LytC | Lysozyme (1,4-β-N-Acetylmuramidase) | Adherence, Immune Evasion and Colonization | |
| Pce | Choline-Binding Protein E, Phosphorylcholine Estearase | Phosphorylcholine Estearase Enzyme, Adherence, Colonization and Cellular Metabolism | |
| PcpA | Pneumococcal Choline-Binding Protein A | Protection Against Lung Infection and Sepsis | |
| PspA | Pneumococcal Surface Protein A | Cellular Metabolism and Immune Evasion | |
| PspC | Pneumococcal Surface Protein C, Choline-Binding Protein A | Adherence, Immune Evasion, Colonization and Invasion | |
| CbpC | Choline-Binding Protein C | Virulence | |
| CbpD | Choline-Binding Protein D | Colonization | |
| CbpF | Choline-Binding Protein F | Virulence | |
| CbpG | Choline-Binding Protein G | Adherence and Colonization | |
| CbpI | Choline-Binding Protein I | Virulence | |
| CbpJ | Choline-Binding Protein J | Virulence | |
| SP_0667 | Pneumococcal Surface Protein (Putative Lysozyme) | Virulence | |
| Lipoproteins | GlnQ | Glutamine Transporter | Cellular Metabolism |
| PiaA | Iron-Compound ABC Transporter | Peptidil-Prolil Isomerase (PPIase) Enzyme | |
| PiuA | Iron-Compound ABC Transporter | Peptidil-Prolil Isomerase (PPIase) Enzyme | |
| PpiA | Streptococcal Lipoprotein Rotamase A, Peptidil-Prolil Isomerase (PPIases) Enzyme | Peptidil-Prolil Isomerase (PPIase) Enzyme | |
| PsaA | Peptide Permease Enzyme, Manganese ABC Transporter, Manganese-Binding Lipoprotein | Immune Evasion | |
| PpmA | Foldase Protein PrsA, Proteinase Maturation A | Adherence, Immune Evasion, Strain-Specific Colonization and Evasion of Phagocytosis | |
| AliA | Oligopeptide ABC Trasporter | Adherence | |
| PhtA | Pneumococcal Histidine Triad A | Adherence and Immune Evasion | |
| PhtB | Pneumococcal Histidine Triad B | Adherence and Immune Evasion | |
| PhtD | Pneumococcal Histidine Triad D | Adherence and Immune Evasion | |
| PhtE | Pneumococcal Histidine Triad E | Adherence and Immune Evasion | |
| Non-Classical Surface-Exposed Proteins | Eno | Enolase (2-Phosphoglycerate Dehydratase) | Glycolytic Enzyme, Adherence and Colonization |
| GAPDH | Glyceraldehyde-3-Phosphate Dehydrogenase | Glycolytic Enzyme, Adherence and Colonization | |
| HtrA | High-Temperature Requirement A, Serine Protease (Heat Shock Protein) | Serine Protease Enzyme | |
| PavA | Pneumococcal Adherence and Virulence Protein A | Adherence, Immune Evasion, Colonization and Translocation | |
| Pbp1B | Penicillin-Binding Protein 1B | Antibiotic Resistance | |
| StkP | Serine/Threonine Protein Kinase | Cellular Metabolism and Fitness | |
| Usp45 | PcsB, Secreted 45-KDa Protein | Virulence | |
| NanB | Neuraminidase B | Hydrolytic Enzyme, Adherence and Colonization | |
| PppA | Pneumococcal Protective Protein A, Non-Heme Iron-Containing Ferritine | Colonization | |
| Fic-Like | Fic-Like Cell Fillamentation Protein | Putative Cytotoxicity | |
| 6PGD | 6-Phosphogluconate Dehydrogenase | Virulence | |
| Ply | Pneumolysin | Cytolytic Toxin, Adherence, Immune Evasion, Invasion, Dissemination and Complement Activation | |
| NanC | Neuraminidase C | Virulence | |
| Regulators | RlrA | Pathogenicity Island | Virulence |
| MgrA | MgrA Family Transcriptional Regulator | Virulence | |
| MerR | MerR Family Transcriptional Regulator | Virulence | |
| PsaR | Iron-Dependent Transcriptional Regulator | Virulence | |
| Histidine Kinases (HKs) | HK01 | Sensor Histidine Kinase | Virulence |
| HK02 | Sensor Histidine Kinase, VicK | Antibiotic Resistance, Virulence and Fitness | |
| HK03 | Sensor Histidine Kinase, LiaS | Antibiotic Resistance and Stress Protection | |
| HK04 | Sensor Histidine Kinase, PnpS | Genetic Competence, Fitness, Immune Evasion | |
| HK05 | Sensor Histidine Kinase, CiaH | Antibiotic Resistance, Genetic Competence and Pathogenesis | |
| HK06 | Sensor Histidine Kinase | Colonization and Invasion | |
| HK07 | Sensor Histidine Kinase, YesM | Fitness | |
| HK08 | Sensor Histidine Kinase, SaeS | Pathogenesis and Fitness | |
| HK09 | Sensor Histidine Kinase | Virulence | |
| HK10 | Sensor Histidine Kinase, VncS | Antibiotic Resistance | |
| HK11 | Sensor Histidine Kinase | Biofilm Formation | |
| HK12 | Sensor Histidine Kinase, ComD | Genetic Competence | |
| HK13 | Sensor Histidine Kinase, BlpH | Virulence | |
| Response Regulators (RRs) | RR01 | Response Regulator | Virulence |
| RR02 | Response Regulator, VicR | Antibiotic Resistance, Virulence and Fitness | |
| RR03 | Response Regulator, LiaR | Antibiotic Resistance and Stress Protection | |
| RR04 | Response Regulator, PnpR | Genetic Competence, Fitness, Immune Evasion | |
| RR05 | Response Regulator, CiaR | Antibiotic Resistance, Genetic Competence, Pathogenesis | |
| RR06 | Response Regulator | Colonization and Invasion | |
| RR07 | Response Regulator, YesN | Fitness | |
| RR08 | Response Regulator, SaeR | Pathogenesis and Fitness | |
| RR09 | Response Regulator | Virulence | |
| RR10 | Response Regulator, VncR | Antibiotic Resistance | |
| RR11 | Response Regulator | Biofilm Formation | |
| RR12 | Response Regulator, ComE | Genetic Competence | |
| RR13 | Response Regulator, BlpR | Virulence | |
| RR14 | Response Regulator | Virulence | |
The proteins are grouped by classes, depending on their surface-exposure mechanism. The names, abreviations and function of the proteins were obtained from literature references
Fig. 2Chromosomal localization and direction of the virulence factor genes of S. pneumoniae TIGR4. Lineal representation of the pneumococcus genome. The arrows, drawn at scale, localize 62 of the 65 virulence factors and simple regulation genes considered in this study (pitA, pitB and zmpD are not present in the genome of TIGR4). Each color represents a different class of codified protein: blue = sortase-anchored proteins with an LPxTG cleavage motif; violet = choline-binding proteins (CBPs); green = lipoproteins, yellow = non-classical surface proteins (NCSP), and red = stand-alone regulators. This map was constructed using the Software SnapGene® (GSL Biotech; Available at snapgene.com)
Fig. 3Localization and direction of the two component systems genes in S. pneumoniae TIGR4. Circular representation of the pneumococcal genome. The arrows, not drawn at scale, localize the 27 genes which codifies for the proteins of the 13 two component systems +1 incomplete. Each color indicates a different class of codified protein: red = histidine kinase sensors and blue = response regulators Proteins. This map was constructed using the Software SnapGene® (GSL Biotech; Available at snapgene.com)
Distribution of the virulence factor and regulation genes of S. pneumoniae
| Virulence Factors | TIGR4 | D39 | R6 | 70,585 | Hungary19A 6 | SPN994038 | SPN994039 | SPN034183 | OXC141 | SPN034156 | ST556 | Taiwan19F-14 | TCH8431/19A | A026 | INV200 | CGSP14 | INV104 | G54 | AP200 | P1031 | gamPNI0373 | SPNA45 | ATCC 700669 | JJA | 670-6B | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LPxTG - Proteins |
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | |
|
| 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | |
|
| 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
| 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
|
| 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
|
| 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
|
| 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
| 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Choline-Binding Proteins (CBPs) |
| 1 | 1 | 1 |
|
|
|
|
|
|
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
|
|
|
|
|
|
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 |
| 0 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | |
|
| 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | |
|
| 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
| 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Lipoproteins |
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | |
|
| 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Non-Classical Surface-Exposed Proteins |
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | ||
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 1 | 1 | 1 | |
|
| 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | |
| Regulators |
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | |
|
| 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
The present table shows the absence (0) or presence (1, 2 or 3) of each considered genes in the 25 strains selected for this study. The number (1, 2 or 3) indicates the amount of copies of each gene in the genome. lytA is the only factor with more than one copy per genome. In the strain SPNA45, the gene gnd was found duplicated (2) and fused with a duplication of its neighbor gene (rr14) downstream. In the gene nanA of TIGR4 (1) a shift in its ORF was found. However, it has also been reported that NanA is expressed in this pneumoccoccal strain. Gene defective copies (genes with any alteration in their primary DNA sequences) are depicted in bold and italics: In the SPNA45 strain ply is fused with a copy of lytA, and pspA is defective in the ATCC700669 pneumococcal strain
Distribution of the genes that conform the two-component systems in S. pneumoniae
| Two-Component Systems (TCSs) | TIGR4 | D39 | R6 | 70,585 | Hungary19A 6 | SPN994038 | SPN994039 | SPN034183 | OXC141 | SPN034156 | ST556 | Taiwan19F-14 | TCH8431/19A | A026 | INV200 | CGSP14 | INV104 | G54 | AP200 | P1031 | gamPNI0373 | SPNA45 | ATCC 700669 | JJA | 670-6B | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Histidine Kinases (HKs) |
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 1 | 1 | 1 | 1 | 1 | 1 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Response Regulators (RRs) |
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| 1 | 1 | 1 | |
The table shows, the absence (0) or presence (1 or 2) of each gene considered in the 25 strains selected for this study. The number (1 or 2) indicates the amount of copies per gene in each genome. rr14 is the only gene with more than one copy per genome (2), which is actually fused with its neighbor gene (gnd) upstream. In bold and italics, three genes are observed (hk01, hk12 and rr04) in two different strains which might have some alteration (insertion or deletion) in their primary DNA and protein Sequence. The two-component system NisK-NisR of the pneumococcus is rare, of the 25 strains analyzed in this study only in the strain 70,585 was found
Analysis of the Variome of the virulence factor genes of S. pneumoniae
| Virulence Factor Genes | Length | Mutations | Variants | Analyzed Sequences | |||||
|---|---|---|---|---|---|---|---|---|---|
| Name | Locus in TIGR4 / R6 | Gene (bp) | Protein (a.a.) | Overall | Synonimous | Non-Synonimous | Allelles | Protein | |
|
|
| 957 | 318 | 208 | 154 | 54 | 25 | 20 | 42 |
|
|
| 1425 | 474 | 82 | 75 | 7 | 19 | 11 | 26 |
|
|
| 3939 | 1312 | 87 | 32 | 55 | 21 | 21 | 25 |
|
|
| 1977 | 658 | 132 | 84 | 48 | 22 | 20 | 25 |
|
|
| 4980 | 1659 | 139 | 70 | 69 | 20 | 20 | 25 |
|
|
| 3108 | 1035 | 460 | 282 | 178 | 20 | 19 | 25 |
|
|
| 6702 | 2233 | 415 | 247 | 168 | 19 | 18 | 25 |
|
|
| 5304 | 1767 | 348 | 238 | 110 | 18 | 18 | 25 |
|
|
| 765 | 254 | 40 | 22 | 18 | 17 | 17 | 25 |
|
|
| 3120 | 1039 | 56 | 28 | 28 | 20 | 16 | 25 |
|
|
| 2466 | 821 | 81 | 16 | 65 | 17 | 16 | 25 |
|
|
| 1884 | 627 | 148 | 80 | 68 | 16 | 16 | 25 |
|
|
| 1656 | 551 | 40 | 18 | 22 | 17 |
|
|
|
|
| 1347 | 448 | 49 | 28 | 21 | 18 |
|
|
|
|
| 966 | 321 | 22 | 7 | 15 | 14 |
|
|
|
|
| 1182 | 393 | 12 | 8 | 4 | 14 |
|
|
|
|
| 630 | 209 | 19 | 9 | 10 | 13 |
|
|
|
|
| 1482 | 493 | 21 | 15 | 6 | 13 |
|
|
|
|
| 804 | 267 | 16 | 8 | 8 | 12 |
|
|
|
|
| 1980 | 659 | 52 | 46 | 6 | 18 |
|
|
|
|
| 1008 | 335 | 14 | 12 | 2 | 13 |
|
|
|
|
| 930 | 309 | 53 | 41 | 12 | 13 |
|
|
|
|
| 651 | 216 | 10 | 5 | 5 | 9 |
|
|
|
|
| 1026 | 341 | 9 | 2 | 7 | 7 |
|
|
|
|
| 1179 | 392 | 9 | 6 | 3 | 8 |
|
|
|
|
| 1305 | 434 | 17 | 16 | 1 | 12 |
|
|
|
|
| 3843 | 1280 | 121 | 77 | 44 | 18 | 18 | 24 |
|
|
| 6423 | 2140 | 436 | 272 | 164 | 18 | 16 | 24 |
|
|
| 2094 | 697 | 45 | 20 | 25 | 18 | 16 | 24 |
|
|
| 2127 | 708 | 209 | 80 | 129 | 15 | 15 | 24 |
|
|
| 537 | 178 | 69 | 43 | 23 | 16 | 12 | 24 |
|
|
| 1416 | 471 | 20 | 19 | 1 | 14 |
|
|
|
|
| 2574 | 857 | (−) | (−) | (−) | 20 | 17 | 23 |
|
|
| 2520 | 839 | 604 | 331 | 273 | 17 | 17 | 23 |
|
|
| 5646 | 1881 | (−) | (−) | (−) | 15 | 15 | 23 |
|
|
| 2235 | 744 | (−) | (−) | (−) | 18 | 17 | 22 |
|
|
| 1023 | 340 | 176 | 87 | 89 | 15 | 14 | 22 |
|
|
| 666 | 221 | 36 | 8 | 28 | 14 | 14 | 22 |
|
|
| 942 | 313 | 9 | 6 | 3 | 9 | 6 | 22 |
|
|
| 3201 | 1066 | 91 | 41 | 50 | 18 | 17 | 20 |
|
|
| 1473 | 490 | 80 | 20 | 60 | 17 | 16 | 20 |
|
|
| 2082 | 693 | (−) | (−) | (−) | 19 | 19 | 19 |
|
|
| 1986 | 661 | 67 | 50 | 17 | 17 | 17 | 19 |
|
|
| 999 | 332 | 87 | 49 | 38 | 13 | 13 | 19 |
|
|
| 741 | 246 | 17 | 9 | 8 | 12 | 9 | 19 |
|
|
| 1866 | 621 | 43 | 25 | 18 | 17 | 13 | 18 |
|
|
| 609 | 202 | 18 | 6 | 12 | 11 | 11 | 18 |
|
|
| 858 | 285 | 106 | 47 | 59 | 15 | 15 | 17 |
|
|
| 6015 | 2004 | (−) | (−) | (−) | 10 | 10 | 17 |
|
|
| 999 | 332 | 87 | 52 | 35 | 10 | 9 | 15 |
|
|
| 2223 | 740 | 64 | 39 | 25 | 7 | 7 | 10 |
|
|
| 2409 | 802 | 48 | 34 | 14 | 8 | 7 | 9 |
|
|
| 2460 | 819 | 331 | 202 | 129 | 7 | 7 | 8 |
|
|
| 1023 | 340 | 57 | 29 | 28 | 7 | 7 | 8 |
|
|
| 5238 | 1745 | (−) | (−) | (−) | 6 | 5 | 7 |
|
|
| 2682 | 893 | 420 | 225 | 195 | 5 | 5 | 7 |
|
|
| 1182 | 393 | 19 | 8 | 11 | 5 | 5 | 7 |
|
|
| 1998 | 665 | 738 | 290 | 448 | 4 | 4 | 7 |
|
|
| 1530 | 509 | 0 | 0 | 0 | 2 | 2 | 7 |
|
|
| 1770 | 589 | 1 | 0 | 1 | 5 | 5 | 6 |
|
|
| 1233 | 410 | 0 | 0 | 0 | 3 | 3 | 6 |
|
|
| 14,331 | 4776 | (−) | (−) | (−) | 5 | 5 | 5 |
|
|
| 5571 | 1856 | 2 | 1 | 1 | 2 | 2 | 3 |
|
|
| 636 | 211 | 0 | 0 | 0 | 1 | 1 | 3 |
Data of the punctual genetic variability (total mutations, synonymous and nonsynonymous + allelic and protein variants) estimated for each one of the virulence factors and simple regulators genes. The analyzed sequences depend on the presence, absence or number of copies of the genes in the different strains. The size of the sequences and loci are also shown in TIGR4 and R6, pneumococcal representative strains. Factors in bold were identified as the most conserved. (−) = mutations could not be estimated for different reasons, like repetitive sequences
Analysis of the genetic variation (Variome) of the genes that conform the two-component systems in S. pneumoniae
| Virulence Factor Genes | Length | Mutations | Variants | Analyzed Sequences | |||||
|---|---|---|---|---|---|---|---|---|---|
| Name | Locus in TIGR4 / R6 | Gene (bp) | Protein (a.a.) | Overall | Synonimous | Non-Synonimous | Allelles | Protein | |
|
|
| 690 | 229 | 17 | 15 | 2 | 14 | 5 | 26 |
|
|
| 1098 | 365 | 247 | 142 | 105 | 18 | 18 | 25 |
|
|
| 1329 | 442 | 33 | 16 | 17 | 17 | 16 | 25 |
|
|
| 1332 | 443 | 33 | 23 | 10 | 17 | 12 | 25 |
|
|
| 1341 | 446 | 272 | 153 | 119 | 12 | 11 | 25 |
|
|
| 738 | 245 | 78 | 65 | 13 | 11 | 11 | 25 |
|
|
| 600 | 199 | 77 | 56 | 21 | 15 | 10 | 25 |
|
|
| 996 | 331 | 36 | 26 | 10 | 13 | 10 | 25 |
|
|
| 975 | 324 | 20 | 11 | 9 | 13 | 10 | 25 |
|
|
| 1692 | 563 | 23 | 17 | 6 | 15 | 8 | 25 |
|
|
| 678 | 225 | 13 | 10 | 3 | 13 | 8 | 25 |
|
|
| 738 | 245 | 14 | 10 | 4 | 12 | 7 | 25 |
|
|
| 708 | 235 | 49 | 17 | 32 | 9 | 6 | 25 |
|
|
| 633 | 210 | 14 | 10 | 4 | 12 | 5 | 25 |
|
|
| 657 | 218 | 15 | 10 | 5 | 11 | 5 | 25 |
|
|
| 654 | 217 | 9 | 5 | 4 | 7 | 5 | 25 |
|
|
| 699 | 232 | 10 | 6 | 4 | 10 |
|
|
|
|
| 1332 | 443 | 12 | 9 | 3 | 10 |
|
|
|
|
| 1053 | 350 | 14 | 12 | 2 | 11 |
|
|
|
|
| 675 | 224 | 6 | 5 | 1 | 11 |
|
|
|
|
| 1350 | 449 | 12 | 10 | 2 | 10 |
|
|
|
|
| 705 | 234 | 7 | 7 | 0 | 11 |
|
|
|
|
| 1335 | 444 | 11 | 10 | 1 | 9 |
|
|
|
|
| 1647 | 548 | 68 | 46 | 22 | 19 | 17 | 24 |
|
|
| 1287 | 428 | 50 | 33 | 17 | 17 | 14 | 24 |
|
|
| 753 | 250 | 11 | 8 | 3 | 10 | 3 | 24 |
|
|
| 1326 | 441 | 42 | 16 | 26 | 11 | 9 | 23 |
Data of the punctual genetic variability (total mutations, synonymous and non-synonymous + allelic and protein variants) estimated for each one of the two-component system genes. The analyzed sequences depend on the presence, absence or number of copies of the genes in the different strains. The size of the sequences and loci are also shown in TIGR4 and R6, pneumococcal representative strains. Factors in bold are the most conserved