| Literature DB >> 28081968 |
Jennifer E Cornick1, Özlem Tastan Bishop2, Feyruz Yalcin3, Anmol M Kiran4, Benjamin Kumwenda5, Chrispin Chaguza4, Shanil Govindpershad6, Sani Ousmane7, Madikay Senghore8, Mignon du Plessis6, Gerd Pluschke9, Chinelo Ebruke10, Lesley McGee11, Beutel Sigaùque12, Jean-Marc Collard7, Stephen D Bentley2, Aras Kadioglu13, Martin Antonio14, Anne von Gottberg6, Neil French13, Keith P Klugman15, Robert S Heyderman5, Mark Alderson16, Dean B Everett4.
Abstract
Serotype 1 is one of the most common causes of pneumococcal disease worldwide. Pneumococcal protein vaccines are currently being developed as an alternate intervention strategy to pneumococcal conjugate vaccines. Pre-requisites for an efficacious pneumococcal protein vaccine are universal presence and minimal variation of the target antigen in the pneumococcal population, and the capability to induce a robust human immune response. We used in silico analysis to assess the prevalence of seven protein vaccine candidates (CbpA, PcpA, PhtD, PspA, SP0148, SP1912, SP2108) among 445 serotype 1 pneumococci from 26 different countries, across four continents. CbpA (76%), PspA (68%), PhtD (28%), PcpA (11%) were not universally encoded in the study population, and would not provide full coverage against serotype 1. PcpA was widely present in the European (82%), but not in the African (2%) population. A multi-valent vaccine incorporating CbpA, PcpA, PhtD and PspA was predicted to provide coverage against 86% of the global population. SP0148, SP1912 and SP2108 were universally encoded and we further assessed their predicted amino acid, antigenic and structural variation. Multiple allelic variants of these proteins were identified, different allelic variants dominated in different continents; the observed variation was predicted to impact the antigenicity and structure of two SP0148 variants, one SP1912 variant and four SP2108 variants, however these variants were each only present in a small fraction of the global population (<2%). The vast majority of the observed variation was predicted to have no impact on the efficaciousness of a protein vaccine incorporating a single variant of SP0148, SP1912 and/or SP2108 from S. pneumoniae TIGR4. Our findings emphasise the importance of taking geographic differences into account when designing global vaccine interventions and support the continued development of SP0148, SP1912 and SP2108 as protein vaccine candidates against this important pneumococcal serotype.Entities:
Keywords: Antigenic diversity; Antigenic profiling; Multi-valent; PCV; Pneumococcal disease; Protein modelling; Structural diversity; Variant
Mesh:
Substances:
Year: 2017 PMID: 28081968 PMCID: PMC5287219 DOI: 10.1016/j.vaccine.2016.12.037
Source DB: PubMed Journal: Vaccine ISSN: 0264-410X Impact factor: 3.641
In silico PCR primers used to identify genes encoding S. pneumoniae protein vaccine candidates in a global dataset of 445 serotype 1 isolates.
| Target gene | Primer pair | NCBI reference accession | Genomic coordinates of the gene |
|---|---|---|---|
| 5′-GGAAGTCAGTATTAACTAGTTATATTAGGTT | NC_011900.1 | 896104…898620 | |
| 3′- TGTTTATTTCCTTCTATATTTTTTCTTTAAC | |||
| 5′-TCAGAATGATTAGATTTAGCTAATGGATACC | NC_011900.1 | 2108466…2110451 | |
| 3′-TTTAAATTTCCTTACATATTTATTTTCTAAT | |||
| 5′-TTCTAGCAGAAGAATTAAAGTGAGGAAAGAA | NC_011900.1 | 119801…121630 | |
| 3′-GTAAAATGAATGGAGCATATTTTATGGAGAA | |||
| 5′AAATGACTATCAGAAAAGAGGTAAATTTAGA | NC_017593.1 | 2170171…2172795 | |
| 3′AGCCGATTAAATTAAATCATGTTAAGAACAT | |||
| 5′-GGAAGTCAGTATTAACTAGTTATATTAGGTT | NC_011900.1 | 2076464…2077735 | |
| 3′-TGTTTATTTCCTTCTATATTTTTTCTTTAAC | |||
| 5′-ATGCTAATTCTTCTAAACTTGCTGGCTGTAT | NC_011900.1 | 1882330…1882620 | |
| 3′-TACCATTCATTTTAACACAAAAAAGGCTTCA | |||
| 5′-AAACTTGCTATTCTTTGGGAGGAATACACTA | NC_011900.1 | 150062…150892 | |
| 3′-TGTTTATTTCCTTCTATATTTTTTCTTTAAC |
Derived from the 31 nucleotides flanking the 3′ and 5′ of the target gene in the reference genome.
Fig. 1Global percentage distribution of seven S. pneumoniae protein vaccine candidates among serotype 1 pneumococci recovered from Asia (n = 76), Africa (n = 324), Europe (n = 28) and South America (n = 17), expressed as a percentage of the number of samples submitted from each continent.
The predicted % coverage of multi-valent protein vaccine based on combinations of two or more of CbpA, PcpA, PhtD and PspA would provide against serotype 1 pneumococci by continent and also globally.
| Protein combination | Asia (n = 76) | Africa (n = 324) | Europe (n = 28) | South America (n = 17) | Global (n = 445) |
|---|---|---|---|---|---|
| CbpA, PcpA | 86 | 75 | 96 | 82 | 79 |
| CbpA, PhtD | 86 | 75 | 96 | 82 | 78 |
| CbpA, PspA | 84 | 80 | 68 | 65 | 79 |
| PcpA, PhtD | 29 | 34 | 82 | 59 | 37 |
| PhtD, PspA | 78 | 77 | 75 | 47 | 76 |
| CbpA, PcpA, PhtD | 84 | 83 | 96 | 82 | 84 |
| CbpA, PcpA, PspA | 84 | 80 | 96 | 82 | 82 |
| PcpA, PhtD, PspA | 80 | 78 | 96 | 82 | 80 |
| CbpA, PcpA, PhtD, PspA | 86 | 85 | 96 | 82 | 86 |
Fig. 2Global percentage distribution of amino acid variants of three S. pneumoniae protein vaccine candidates; [A] SP0148, [B] SP1912 and [C] SP2108. These three protein candidates were present in all serotype 1 pneumococci within the global serotype 1 study population.
Amino acid variations in three S. pneumoniae protein vaccine candidates [A] SP0148, [2] SP1912 and [C] SP2108, among a global dataset of serotype 1 pneumococci. The tables show the amino acid residues at which substitutions have been identified when using the corresponding protein sequences from Streptococcus pneumoniae TIGR4 as a reference. ‘.’ Indicates that a residue is identical to the TIGR4 reference sequence. Residues highlighted in grey indicate regions where the amino acid variation is predicted to result in a change to the antigenicity profile of the protein. Underlined residues indicate that the amino acid variation is predicted to be deleterious to the protein. Amino acid numbering starts at the translation site (the methionine) of the TIGR4 reference sequences.
Fig. 3Comparison of the hydrophobicity plots of amino acid variants of three S. pneumoniae protein vaccine candidates [A] SP0148, [B] SP1912 and [C] SP2108. The vertical axis represents hydrophilicity values; the y axis indicates amino acid number. The region of maximal hydrophilicity in each plot is likely to be the antigenic site. The black data line represents the antigenicity values for each of the three proteins from the TIGR4 reference.
Fig. 4Cartoon representations of the protein structure of [A] SP0148 [B] SP1912 and [C] SP2108. [A] SP0148 – The locations of the variant-3 amino acid substitutions are mapped into the structure; green for the potential deleterious variations detected by PROVEAN only (also indicated by their residue numbers); purple for the non-deleterious variations. Additionally, the position of D199A variation, detected in variant 5 and agreed as deleterious by 3 SNP programs, is colored in turquoise. [B] SP1912 – Variant-1 and variant-2. The locations of the amino acid variations are mapped into the structures, and indicated by residue numbers and purple colouring. [C] SP2018 – The locations of the amino acid substitutions are mapped into the structure in two different colours; green for potential deleterious variations (also indicated by their residue numbers); purple for the rest.