| Literature DB >> 28352257 |
LingBing Zeng1, Dongliang Wang2, NiYa Hu3, Qing Zhu3, Kaishen Chen3, Ke Dong4, Yan Zhang4, YuFeng Yao5, XiaoKui Guo4, Yung-Fu Chang6, YongZhang Zhu4.
Abstract
Reverse vaccinology (RV) has been widely used for screening of surface-exposed proteins (PSEs) of important pathogens, including outer membrane proteins (OMPs), and extracellular proteins (ECPs) as potential vaccine candidates. In this study, we applied a novel RV negative strategy and a pan-genome analysis for screening of PSEs from 17 L. interrogans strains covering 11 predominately epidemic serovars and 17 multilocus typing (MLST) sequence types (STs) worldwide. Our results showed, for instance, out of a total of 633 predicted PSEs in strain 56601, 92.8% were OMPs or ECPs (588/633). Among the 17 strains, 190 core PSEs, 913 dispensable PSEs and 861 unique PSEs were identified. Of the 190 PSEs, 121 were further predicted to be highly antigenic and thus may serve as potential vaccine candidates against leptospirosis. With the exception of LipL45, OmpL1, and LigB, the majority of the 121 PSEs were newly identified antigens. For example, hypothetical proteins BatC, LipL71, and the OmpA family proteins sharing many common features, such as surface-exposed localization, universal conservation, and eliciting strong antibody responses in patients, are regarded as the most promising vaccine antigens. Additionally, a wide array of potential virulence factors among the predicted PSEs including TonB-dependent receptor, sphingomyelinase 2, leucine-rich repeat protein, and 4 neighboring hypothetical proteins were identified as potential antigenicity, and deserve further investigation. Our results can contribute to the prediction of suitable antigens as potential vaccine candidates against leptospirosis and also provide further insights into mechanisms of leptospiral pathogenicity. In addition, our novel negative-screening strategy combined with pan-genome analysis can be a routine RV method applied to numerous other pathogens.Entities:
Keywords: L. interrogans; negative selection strategy; reverse vaccinology (RV); surface-exposed proteins; vaccine candidate
Year: 2017 PMID: 28352257 PMCID: PMC5348505 DOI: 10.3389/fmicb.2017.00396
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
All information of the 17 representative strains of pathogenic .
| ST51 | Australia | Hawaii, USA | Human | 146 | |
| ST50 | Bataviae | Egypt | Human | 314 | |
| ST42 | Bataviae | Thailand | Human | 157 | |
| ST79 | Bataviae | Laos | Human | 300 | |
| ST112 | Bulgarica | India | Human | 335 | |
| ST17 | Copenhageni | Brazil | Human | 2 | |
| ST77 | Grippotyphosa | Laos | Human | 369 | |
| ST82 | Grippotyphosa | Laos | Human | 237 | |
| ST85 | Grippotyphosa | Laos | Human | 147 | |
| ST1 | Lai | China | Human | 2 | |
| ST57 | Manilae | Philippinnes | Human | 271 | |
| ST46 | Medanensis | Thailand | Human | 188 | |
| ST24 | Muenchen | Germany | Horse | 322 | |
| ST37 | Pomona | Australia | Human | 118 | |
| ST88 | Pyrogenes | Egypt | Human | 344 | |
| ST49 | Pyrogenes | Thailand | Human | 169 | |
| ST75 | Pyrogenes | Tanzania | Rodent, Mastomyssp | 667 |
Figure 1Schematic representation of the novel strategy of reverse vaccinology applied to Pathogenic . In Figure 1, L. interrogans str.56601 was selected as a representative example for elucidating the step-by-step process and predicted results of PSEs using the novel RV negative strategy. Similarly, PSEs of the other 16 representative strains of pathogenic L. interrogans were predicted following same strategy as str.56601. First of all, PSORTb3.0, CELLO, and SOSUI-GramN were used to predict subcellular localization of these proteins by majority voting strategy. Proteins predicted as CYTs and IMPs by at least two of the three bioinformatic tools were defined as consensus CYTs and IMPs and were directly removed from further study. Proteins predicted as CYTs or IMPs by only one of the three tools were labeled as non-consensus CYTs or IMPs, respectively. The remaining proteins were labeled as PSEs. Then, both the non-consensus CYTs with no signal peptides predicted by all of SignalP3.0, TatP and SecretomeP and the non-consensus IMPs with positive transmembrane structures predicted by TMHMM or Phobius were defined as Non-PSEs and removed from further study, whereas the remaining non-consensus CYTs with positive signal peptide and non-consensus IMPs with no transmembrane structure were added into the predicted PSEs. In addition, SignalP3.0, Tat and SecretomeP as well as BOMP, TMBETADISC-RBF, and LipoP were utilized to further investigate extracellular features of these PSEs. Finally, pan-genome analysis of the predicted PSEs among the 17 pathogenic L. interrogans strains identified the core, dispensable, and unique PSEs. And the core PSEs with high antigenicity values predicted by the VaxiJen server were determined as final vaccine antigen candidates. PSE, potential surface-exposed proteins; CVPSE, Conserved Vaxijen antigenicity predicted PSE.
Figure 2Subcellular localizations of these PSEs among the 17 representative strains of Pathogenic . EC, extracellular; OM, outer membrane; UN, unknown; VA, variable (proteins with multiple locations-EC or OM).
Figure 3Calculation of core- and pan-genome sizes of Pathogenic .
Figure 4Pan-genome representation for the 17 representative strains of Pathogenic Core genes of the 17 L.interrogans strains. In our L.interrogans candidates, nine strains belong to three different serovars (B–D). Show pan-genome of the three serovars themselves.
The detailed information of the final 121 PSEs with predicted high antigenicity among the 17 representative strains of pathogenic .
| LA_0022 | LIC10021 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.7022 |
| LA_0071 | LIC10064 | – | – | – | – | – | – | COG4731S | Hypothetical protein | ECPs | 0.7164 |
| LA_0074 | LIC10067 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.6286 |
| LA_0075 | LIC10068 | – | – | ↓ | – | – | – | – | Hypothetical protein | ECPs | 0.5617 |
| LA_0136 | LIC10123 | – | – | – | – | – | ↓ | COG4254S | Hypothetical protein | OMPs | 0.6461 |
| LA_0303 | LIC10260 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.6756 |
| LA_0322 | LIC10280 | – | – | – | – | – | – | – | fibronectin binding protein | OMPs | 0.7026 |
| LA_0333 | LIC10288 | – | – | – | – | – | – | COG0603R | PP-loop superfamily ATPase | OMPs | 0.6805 |
| LA_0346 | LIC10298 | – | – | – | – | – | – | COG1558N | Flagellar basal body rod protein FlgC | ECPs | 0.633 |
| LA_0357 | LIC10307 | – | – | – | – | – | – | COG0412Q | Dienelactone hydrolase family protein | OMPs | 0.6134 |
| LA_0364 | LIC10313 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 1.094 |
| LA_0410 | LIC10359 | – | – | – | – | – | – | COG2834M | Outer membrane lipoprotein-sorting protein | OMPs | 0.6661 |
| LA_0419 | LIC10368 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.7143 |
| LA_0430 | LIC10377 | – | – | – | – | ↑ | – | – | Hypothetical protein | OMPs | 0.8101 |
| LA_0505 | LIC13050 | – | – | – | – | – | ↓ | – | Hypothetical protein | OMPs | 0.8807 |
| LA_0568 | LIC13002 | – | – | – | – | ↓ | – | COG2067I | Fatty acid transport protein | OMPs | 0.6682 |
| LA_0589 | LIC12986 | – | ↑ | – | – | – | – | – | hypothetical protein | OMPs | 0.5822 |
| LA_0591 | LIC12985 | – | – | – | ↑ | ↑ | – | – | Hypothetical protein | ECPs | 0.5631 |
| LA_0663 | LIC12930 | – | – | ↑ | – | – | – | – | Hypothetical protein | ECPs | 0.5964 |
| LA_0862 | LIC12765 | – | – | ↓ | – | ↓ | ↓ | COG2077O | Thiol peroxidase | ECPs | 0.5274 |
| LA_1122 | LIC12558 | – | – | – | ↑ | ↑ | – | – | Hypothetical protein | OMPs | 0.7834 |
| LA_1159 | LIC12525 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.5873 |
| LA_1167 | LIC12519 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.765 |
| LA_1168 | LIC12518 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.6311 |
| LA_1180 | LIC12509 | – | – | – | ↑ | – | – | – | Hypothetical protein | ECPs | 0.6433 |
| LA_1192 | LIC12499 | – | – | – | ↑ | ↑ | – | – | Hypothetical protein | OMPs | 0.564 |
| LA_1356 | LIC12374 | – | – | – | – | – | – | COG4206H | TonB-dependent outer membrane receptor | OMPs | 0.6274 |
| LA_1404 | LIC12337 | – | – | – | – | – | ↑ | – | hypothetical protein | OMPs | 0.5464 |
| LA_1458 | LIC12295 | – | – | – | – | – | – | COG1134GM | ABC transporter ATP-binding protein | OMPs | 0.5095 |
| LA_1499 | LIC12259 | – | – | – | – | – | – | COG0026F | Phosphoribosylaminoimidazole carboxylase ATPase subunit | OMPs | 0.6708 |
| LA_1507 | LIC12254 | – | – | – | – | – | – | COG4775M | Hypothetical protein | OMPs | 0.5181 |
| LA_1508 | LIC12253 | – | – | – | ↑ | ↑ | ↓ | COG4775M | Hypothetical protein | ECPs | 0.599 |
| LA_1897 | LIC12002 | – | – | ↓ | – | – | – | COG0753P | Catalase | ECPs | 0.5618 |
| LA_1931 | LIC11975 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.802 |
| LA_1968 | LIC11935 | – | – | – | – | – | ↓ | – | Hypothetical protein | OMPs | 0.5621 |
| LA_2069 | LIC11846 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 1.0393 |
| LA_2105 | LIC11813 | – | – | – | – | ↑ | – | COG1886NU | Flagellar motor switch protein | OMPs | 0.5388 |
| LA_2186 | LIC11739 | – | – | – | – | – | – | COG0405E | gamma-glutamyltranspeptidase | OMPs | 0.5669 |
| LA_2272 | LIC11665 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.6501 |
| LA_2316 | LIC11625 | – | – | – | – | – | – | COG2013S | Hypothetical protein | ECPs | 0.8212 |
| LA_2377 | LIC11568 | – | – | – | – | – | – | COG2133G | Glucose/sorbosone dehydrogenase | OMPs | 0.6489 |
| LA_2498 | LIC11467 | – | – | – | – | – | – | COG0739M | M23 family metalloendopeptidase | OMPs | 0.6045 |
| LA_2538 | LIC11435 | – | – | – | – | – | – | COG5184DZ | Regulator of chromosome condensation | OMPs | 0.6316 |
| LA_2550 | LIC11424 | – | – | – | – | – | ↑ | COG0596R | Esterase/lipase | ECPs | 0.7281 |
| LA_2595 | LIC11388 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.863 |
| LA_2601 | LIC11382 | – | – | – | – | ↑ | – | – | Hypothetical protein | OMPs | 0.5082 |
| LA_2613 | LIC11370 | – | – | – | – | – | – | COG0545O | FKBP-type peptidylprolyl isomerase | OMPs | 0.6189 |
| LA_2617 | LIC11366 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.7008 |
| LA_2641 | LIC11345 | – | – | – | – | – | – | COG1886NU | Endoflagellar motor switch protein | OMPs | 0.6637 |
| LA_2672 | LIC11320 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.5604 |
| LA_2741 | LIC11271 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.715 |
| LA_2742 | LIC11270 | – | – | – | – | – | – | COG1629P | Ferrichrome-iron receptor | ECPs | 0.5877 |
| LA_2746 | LIC11268 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.7177 |
| LA_2757 | LIC11259 | – | – | – | – | ↓ | – | – | Hypothetical protein | OMPs | 0.517 |
| LA_2764 | LIC11254 | – | – | – | – | ↑ | – | COG1613P | Sulfate ABC transporter substrate-binding protein | OMPs | 0.5849 |
| LA_2796 | LIC11228 | – | – | ↓ | – | – | – | COG0768M | Transpeptidase/penicillin binding protein | OMPs | 0.6239 |
| LA_2815 | LIC11213 | – | – | – | – | – | – | COG1792M | Rod shape-determining protein MreC | OMPs | 0.6909 |
| LA_2823 | LIC11207 | – | – | – | ↑ | ↑ | – | – | Hypothetical protein | ECPs | 0.7306 |
| LA_2848 | LIC11188 | – | – | – | – | ↓ | – | – | Hypothetical protein | OMPs | 0.7231 |
| LA_2849 | LIC11187 | – | – | – | – | ↓ | – | – | Hypothetical protein | ECPs | 0.5677 |
| LA_2850 | LIC11186 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.6522 |
| LA_2854 | LIC11184 | – | – | ↓ | – | – | – | COG1749N | Flagellar hook protein FlgE | OMPs | 0.6733 |
| LA_2949 | LIC11112 | – | – | – | ↓ | ↓ | – | COG1843N | Flagellar hook assembly scaffolding protein | OMPs | 0.5626 |
| LA_2958 | LIC11103 | – | – | – | ↑ | ↑ | – | COG3144N | Flagellar protein | OMPs | 0.5731 |
| LA_2975 | LIC11087 | – | – | – | – | – | – | COG0265O | Serine protease | OMPs | 0.5597 |
| LA_2992 | LIC11074 | – | – | – | – | – | – | COG2267I | Hypothetical protein | ECPs | 0.7857 |
| LA_2993 | LIC11073 | – | – | – | – | – | – | COG1858P | Cytochrome c peroxidase | OMPs | 0.6704 |
| LA_2998 | LIC11067 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.8504 |
| LA_3026 | LIC11052 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.6387 |
| LA_3050 | LIC11040 | – | – | – | – | ↑ | – | – | Hypothetical protein | OMPs | 0.5837 |
| LA_3064 | LIC11030 | – | – | – | – | – | – | COG1664M | Cell shape determination protein | OMPs | 0.7079 |
| LA_3097 | LIC11003 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.7837 |
| LA_3138 | LIC10973 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.6481 |
| LA_3145 | LIC10968 | ↑ | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.6892 |
| LA_3150 | LIC10963 | – | – | – | – | – | – | COG1652S | Hypothetical protein | ECPs | 0.6863 |
| LA_3210 | LIC10920 | – | – | – | – | ↓ | – | – | OmpL1 | OMPs | 0.9344 |
| LA_3268 | LIC10873 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.5264 |
| LA_3303 | LIC10845 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.6679 |
| LA_3319 | LIC10833 | – | – | – | – | – | – | COG2010C | Hypothetical protein | ECPs | 0.6143 |
| LA_3410 | LIC10760 | – | – | – | – | ↑ | – | – | Hypothetical protein | ECPs | 0.9564 |
| LA_3454 | LIC10723 | – | – | – | – | – | – | COG3865S | Hypothetical protein | OMPs | 0.5181 |
| LA_3468 | LIC10714 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.5529 |
| LA_3469 | LIC10713 | – | – | – | – | – | – | COG2353S | Hypothetical protein | OMPs | 0.6583 |
| LA_3470 | LIC10712 | – | – | – | – | – | – | COG1345N | Flagellar hook-associated protein FliD | ECPs | 0.5272 |
| LA_3501 | LIC10686 | – | – | – | – | – | – | COG3487P | Hypothetical protein | OMPs | 0.57 |
| LA_3508 | LIC10683 | – | – | – | – | – | – | COG3489R | Hypothetical protein | ECPs | 0.7949 |
| LA_3571 | LIC10628 | – | – | – | – | – | – | COG3794C | Methylamine utilization protein | OMPs | 0.6392 |
| LA_3711 | LIC10520 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.7514 |
| LA_3778 | LIC10464 | ↑ | – | ↑ | – | ↑ | – | COG2885M | OmpA family protein | OMPs | 0.5249 |
| LA_3834 | LIC13066 | ↑ | – | – | – | – | – | COG3607R | Glyoxalase/bleomycin resistance protein/dioxygenase | OMPs | 0.6016 |
| LA_3838 | LIC13070 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.5302 |
| LA_3849 | LIC13076 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.6748 |
| LA_3853 | LIC13078 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.5234 |
| LA_3867 | LIC13086 | ↑ | ↑ | – | – | ↑ | – | – | Hypothetical protein | OMPs | 0.5336 |
| LA_3870 | LIC13089 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.5439 |
| LA_3881 | LIC13101 | – | – | – | – | ↑ | – | – | Hypothetical protein | OMPs | 0.7749 |
| LA_4059 | LIC13238 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.7565 |
| LA_4083 | LIC13255 | – | – | – | ↓ | ↓ | – | COG2303E | Cholesterol oxidase precursor | OMPs | 0.667 |
| LA_4144 | LIC13306 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.5584 |
| LA_4178 | LIC13334 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.5203 |
| LA_4202 | LIC13354 | – | – | – | ↓ | ↓ | – | COG4642S | Hypothetical protein | OMPs | 0.6064 |
| LA_4203 | LIC13355 | – | – | – | – | – | ↓ | COG0331I | Fatty acid synthase subunit beta | OMPs | 0.5784 |
| LA_4272 | LIC13418 | – | – | – | – | – | ↓ | – | Hypothetical protein | OMPs | 0.7046 |
| LA_4291 | LIC13434 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.5418 |
| LA_4293 | LIC13436 | – | – | – | – | – | ↓ | – | polysaccharide deacetylase | OMPs | 0.6614 |
| LA_4335 | LIC13477 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.6873 |
| LB_001 | LIC20001 | – | – | – | – | – | ↓ | – | Hypothetical protein | OMPs | 0.6864 |
| LB_056 | LIC20042 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.8468 |
| LB_072 | LIC20056 | – | – | – | – | – | – | – | Hypothetical protein | ECPs | 0.5987 |
| LB_098 | LIC20077 | – | – | – | – | – | – | – | Hypothetical Protein | ECPs | 0.5732 |
| LB_110 | LIC20087 | – | – | – | – | – | – | COG5010U | TPR repeat-containing protein | OMPs | 0.5788 |
| LB_191 | LIC20151 | – | – | – | – | – | – | – | Hypothetical protein | OMPs | 0.7302 |
| LB_192 | LIC20152 | – | – | – | – | – | – | COG1022I | Long-chain-fatty-acid–CoA ligase | OMPs | 0.6367 |
| LB_194 | LIC20153 | – | – | – | – | – | - | COG0726G | Xylanase/chitin deacetilase | OMPs | 0.6861 |
| LB_242 | LIC20185 | – | – | ↓ | – | – | – | – | Putative outermembrane protein | OMPs | 0.5456 |
| LB_258 | LIC20197 | – | – | – | – | – | – | COG4206H | Putative TonB-dependent outer membrane receptor protein | OMPs | 0.5271 |
| LB_268 | LIC20205 | – | – | – | – | – | ↓ | – | Hypothetical protein | ECPs | 0.5724 |
| LB_277 | LIC20212 | – | – | – | – | ↓ | – | COG4254S | Hypothetical protein | ECPs | 0.6918 |
| LB_279 | LIC20214 | – | – | – | – | – | ↑ | COG4870O | Cysteine protease | OMPs | 0.6034 |
| LB_280 | LIC20215 | – | – | – | – | – | – | COG2267I | Hypothetical protein | ECPs | 0.5028 |
| LA_1462 | LIC12293 | – | – | – | – | – | ↑ | COG1633S | Hypothetical protein | Unknown | 0.5146 |
Proteins were confirmed to be reactive with human serum (Lessa-Aquino et al., .
Figure 5Venn diagram detailed the unique and common PSEs among our negative-screening, Yang's positive-screening, and Pinne's OMP array results with known experimentally identified surface-exposed antigens.
Figure 6Protein-protein interaction of the potential virulence factors (LA_1761 to LA_1764) located in the 54 kb circular prophage of .