Literature DB >> 28352257

A Novel Pan-Genome Reverse Vaccinology Approach Employing a Negative-Selection Strategy for Screening Surface-Exposed Antigens against leptospirosis.

LingBing Zeng1, Dongliang Wang2, NiYa Hu3, Qing Zhu3, Kaishen Chen3, Ke Dong4, Yan Zhang4, YuFeng Yao5, XiaoKui Guo4, Yung-Fu Chang6, YongZhang Zhu4.   

Abstract

Reverse vaccinology (RV) has been widely used for screening of surface-exposed proteins (PSEs) of important pathogens, including outer membrane proteins (OMPs), and extracellular proteins (ECPs) as potential vaccine candidates. In this study, we applied a novel RV negative strategy and a pan-genome analysis for screening of PSEs from 17 L. interrogans strains covering 11 predominately epidemic serovars and 17 multilocus typing (MLST) sequence types (STs) worldwide. Our results showed, for instance, out of a total of 633 predicted PSEs in strain 56601, 92.8% were OMPs or ECPs (588/633). Among the 17 strains, 190 core PSEs, 913 dispensable PSEs and 861 unique PSEs were identified. Of the 190 PSEs, 121 were further predicted to be highly antigenic and thus may serve as potential vaccine candidates against leptospirosis. With the exception of LipL45, OmpL1, and LigB, the majority of the 121 PSEs were newly identified antigens. For example, hypothetical proteins BatC, LipL71, and the OmpA family proteins sharing many common features, such as surface-exposed localization, universal conservation, and eliciting strong antibody responses in patients, are regarded as the most promising vaccine antigens. Additionally, a wide array of potential virulence factors among the predicted PSEs including TonB-dependent receptor, sphingomyelinase 2, leucine-rich repeat protein, and 4 neighboring hypothetical proteins were identified as potential antigenicity, and deserve further investigation. Our results can contribute to the prediction of suitable antigens as potential vaccine candidates against leptospirosis and also provide further insights into mechanisms of leptospiral pathogenicity. In addition, our novel negative-screening strategy combined with pan-genome analysis can be a routine RV method applied to numerous other pathogens.

Entities:  

Keywords:  L. interrogans; negative selection strategy; reverse vaccinology (RV); surface-exposed proteins; vaccine candidate

Year:  2017        PMID: 28352257      PMCID: PMC5348505          DOI: 10.3389/fmicb.2017.00396

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


Introduction

Leptospirosis, caused by pathogenic spirochete bacteria of the genus Leptospira, is one of the most common zoonotic diseases worldwide. Leptospirosis has been recognized as an emerging disease with more than half a million patients reported annually (Adler et al., 2011). Pathogenic Leptospira spp. are transmitted mainly by direct contact with infected animals or by exposure to water or soil contaminated by the urine of infected animals (Faine, 1994). To date, more than 250 serovars have been observed in pathogenic Leptospira (Zhang et al., 2012). At the present time, available leptospira vaccines are inactivated whole cell products that provide inadequate protection against most serovars and cannot provide cross-protection against a large number of serogroups of pathogenic leptospires (Faisal et al., 2008). Therefore, there is an urgent need to develop a long-term and cross-protective vaccine set against leptospirosis. A revolutionary vaccine research strategy, reverse vaccinology (RV), was able to identify five suitable serogroup B meningococcal vaccine candidates (Pizza et al., 2000). Subsequently, RV has been widely applied to a wide range of bacterial pathogens, including Streptococcus pneumoniae, S. agalactiae, Staphylococcus aureus, Porphyromonas gingivalis, Chlamydia pneumonia, and L. interrogans (Paton and Giammarinaro, 2001; Wizemann et al., 2001; Hava and Camilli, 2002; Gamberini et al., 2005; Maione et al., 2005; Mora et al., 2005; Tettelin et al., 2005; Falugi et al., 2008; Seib et al., 2012). Generally, Gram-negative bacteria have five subcellular location sites including cytoplasm, inner membrane, outer membrane, periplasm, and extracellular space. According to RV theory, except for cytoplasmic and inner membrane proteins, proteins located in the other sites can be regarded as PSEs, and are the most suitable vaccine candidates due to their high susceptibility to antibody recognition and eliciting protective immune responses. The in silico approach of RV is a novel and integrative method that uses available bioinformatic tools in the first step of vaccine development. The currently used in silico strategy of RV is to focus only on OMPs and ECPs positively predicted by several bioinformatic tools, such as PSORTb, Cello, and P-classifier. This approach may overlook numerous unknown proteins as potential vaccine candidates because a relatively high proportion of proteins are not covered by these bioinformatic tools. For instance, the most frequently used tool, PSORTb, achieved the greatest degree of precision, but as many as 30.8% (1,140) of str.56601 proteins were not selected as potential vaccine candidates for further screening, simply due to the fact that the localization sites of these proteins were unknown. This is illustrated by the extracellular virulence factor of Bordetella pertussis-pertussis toxin, the only indispensable component of acellular pertussis vaccines, which was predicted as an “unknown” protein by PSORTB. Furthermore, OMPs, ECPs, and Periplasmic proteins (PMs) were predicted far less accurately and reliably than cytoplasmic proteins (CYTs) and inner membrane proteins (IMPs) by these frequently used bioinformatic tools, including PSORTb, Cello, Proteome Analysis, Subloc, and LOCtree (Gardy and Brinkman, 2006). The usage of these popular bioinformatic tools remains a matter for further investigation, as they may miss or exclude highly antigenic vaccine candidates. Here, in this study, a novel RV prediction method employing a negative selection strategy was developed to reliably identify potential vaccine candidates by removing CYTs and IMPs. Based on our novel RV strategy, these “unknown” proteins that are further predicted as CYTs or IMPs by multiple tools according to our criteria are excluded, and the remaining “unknown” proteins, which might be surface-exposed, are retained in the final vaccine candidates list for further screening. Thus, we can greatly reduce the risk of missing potential vaccine candidates among these “unknown” proteins predicted by one of these computational methods. Early RV efforts were focused mainly on a single genome of a pathogenic strain or species. This limited focus renders it impossible to develop a universal vaccine comprising biologically cross-protective antigens against multiple serovars, strains, or pathovars of one pathogen. To alleviate this shortcoming, pan-genome strategies have been developed to identify potential cross-protective antigens using multiple genomes of the same species, such as group B Streptococcus spp. (Maione et al., 2005). In this study, we have applied a new in silico RV negative selection strategy combining a pan-genome analysis to screen PSEs as vaccine candidates to provide a framework for future vaccine development against leptospirosis. In addition, potential virulence factors of leptospira were also further analyzed in this study. Future efforts will be targeted toward the experimental characterization of these identified PSEs in our study, as well as screening their potential as vaccine candidates in an animal model.

Materials and methods

Selection of leptospiral genome sequences

Information for leptospiral serovars and multilocus sequence typing were combined to select suitable strains of L. interrogans. Finally, the 17 representative L. interrogans strains covering 11 dominantly epidemic serovars and 17 MLST sequence types (STs) worldwide were selected. For instance, more than 90% of Chinese epidemic or outbreak strains belonged to the 11 dominant serovars (Zhang et al., 2012). The proteomes of all strains were downloaded from the Pathosystems Resource Integration Center (PATRIC) website (www.patricbrc.org) and detailed information about the selected strains is presented in Table 1.
Table 1

All information of the 17 representative strains of pathogenic .

StrainsMLSTSerovarIsolated locationHostContig
Leptospirainterrogansserovar Australia str. 2002000624ST51AustraliaHawaii, USAHuman146
LeptospirainterrogansserovarBataviae str. 2006006976ST50BataviaeEgyptHuman314
LeptospirainterrogansserovarBataviae str. L1111ST42BataviaeThailandHuman157
LeptospirainterrogansserovarBataviae str. UI 08561ST79BataviaeLaosHuman300
LeptospirainterrogansserovarBulgarica str. MallikaST112BulgaricaIndiaHuman335
LeptospirainterrogansserovarCopenhageni str. Fiocruz L1-130ST17CopenhageniBrazilHuman2
LeptospirainterrogansserovarGrippotyphosa str. UI 08368ST77GrippotyphosaLaosHuman369
LeptospirainterrogansserovarGrippotyphosa str. UI 08434ST82GrippotyphosaLaosHuman237
LeptospirainterrogansserovarGrippotyphosa str. UI 12764ST85GrippotyphosaLaosHuman147
Leptospirainterrogansserovar Lai str. 56601ST1LaiChinaHuman2
LeptospirainterrogansserovarManilae str. M001_Tn_Mutant_ParentST57ManilaePhilippinnesHuman271
LeptospirainterrogansserovarMedanensis str. UT053ST46MedanensisThailandHuman188
LeptospirainterrogansserovarMuenchen str. Brem 129ST24MuenchenGermanyHorse322
Leptospirainterrogansserovar Pomona str. PomonaST37PomonaAustraliaHuman118
LeptospirainterrogansserovarPyrogenes str. 2006006956ST88PyrogenesEgyptHuman344
LeptospirainterrogansserovarPyrogenes str. L0374ST49PyrogenesThailandHuman169
LeptospirainterrogansserovarPyrogenes str. SriLanka1ST75PyrogenesTanzaniaRodent, Mastomyssp667
All information of the 17 representative strains of pathogenic .

Predicting strategy for PSEs of L. interrogans

A novel RV approach employing a negative selection strategy was used in this work (Figure 1). At first, the three currently used bioinformatic tools, PSORTb3.0 (Yu et al., 2010), CELLO (Yu et al., 2004), and SOSUI-GramN (Imai et al., 2008), were used to predict subcellular localization of these proteins by a majority voting strategy. Proteins predicted as CYTs by at least two of the three bioinformatic tools were defined as consensus CYTs. Similarly, proteins predicted as IMPs by at least two of the three tools were defined as consensus IMPs. Proteins predicted as CYTs or IMPs by only one of the three tools were labeled as non-consensus CYTs or IMPs, respectively. The remaining proteins were labeled as PSEs. Thus, the predicted results were preliminarily divided into three groups: consensus CYTs/IMPs, non-consensus CYTs/IMPs, and PSEs. The consensus CYTs and IMPs as non-PSEs were directly removed from further study. Non-consensus CYTs and IMPs were further analyzed by combination of additional bioinformatic tools. If these non-consensus CYTs were predicted to be negative by SignalP3.0 (Bendtsen et al., 2004b), TatP (Juncker et al., 2003), and SecretomeP (Bendtsen et al., 2004a), they were removed from further analysis. Non-consensus CYTs with positive signal peptide results were retained as PSEs. Non-consensus IMPs with transmembrane structures predicted by TMHMM (Krogh et al., 2001) or Phobius (Kall et al., 2004) were also removed for further study. Non-consensus IMPs with no transmembrane structures predicted by TMHMM and Phobius were retained as PSEs. Thus, the remaining proteins classified as PSEs were categorized as follows: (1) ECPs or periplasmic proteins predicted by SignalP3.0, Tat and SecretomeP; (2) OMPs predicted by BOMP (Berven et al., 2004), TMBETADISC-RBF (Ou et al., 2008) and LipoP (Juncker et al., 2003); and (3) proteins with unknown localization. Finally, based on amino acid sequences, the antigenicity value of each PSE was predicted using the VaxiJen server with default parameter “bacteria” and the threshold of 0.5 (Doytchinova and Flower, 2007).
Figure 1

Schematic representation of the novel strategy of reverse vaccinology applied to Pathogenic . In Figure 1, L. interrogans str.56601 was selected as a representative example for elucidating the step-by-step process and predicted results of PSEs using the novel RV negative strategy. Similarly, PSEs of the other 16 representative strains of pathogenic L. interrogans were predicted following same strategy as str.56601. First of all, PSORTb3.0, CELLO, and SOSUI-GramN were used to predict subcellular localization of these proteins by majority voting strategy. Proteins predicted as CYTs and IMPs by at least two of the three bioinformatic tools were defined as consensus CYTs and IMPs and were directly removed from further study. Proteins predicted as CYTs or IMPs by only one of the three tools were labeled as non-consensus CYTs or IMPs, respectively. The remaining proteins were labeled as PSEs. Then, both the non-consensus CYTs with no signal peptides predicted by all of SignalP3.0, TatP and SecretomeP and the non-consensus IMPs with positive transmembrane structures predicted by TMHMM or Phobius were defined as Non-PSEs and removed from further study, whereas the remaining non-consensus CYTs with positive signal peptide and non-consensus IMPs with no transmembrane structure were added into the predicted PSEs. In addition, SignalP3.0, Tat and SecretomeP as well as BOMP, TMBETADISC-RBF, and LipoP were utilized to further investigate extracellular features of these PSEs. Finally, pan-genome analysis of the predicted PSEs among the 17 pathogenic L. interrogans strains identified the core, dispensable, and unique PSEs. And the core PSEs with high antigenicity values predicted by the VaxiJen server were determined as final vaccine antigen candidates. PSE, potential surface-exposed proteins; CVPSE, Conserved Vaxijen antigenicity predicted PSE.

Schematic representation of the novel strategy of reverse vaccinology applied to Pathogenic . In Figure 1, L. interrogans str.56601 was selected as a representative example for elucidating the step-by-step process and predicted results of PSEs using the novel RV negative strategy. Similarly, PSEs of the other 16 representative strains of pathogenic L. interrogans were predicted following same strategy as str.56601. First of all, PSORTb3.0, CELLO, and SOSUI-GramN were used to predict subcellular localization of these proteins by majority voting strategy. Proteins predicted as CYTs and IMPs by at least two of the three bioinformatic tools were defined as consensus CYTs and IMPs and were directly removed from further study. Proteins predicted as CYTs or IMPs by only one of the three tools were labeled as non-consensus CYTs or IMPs, respectively. The remaining proteins were labeled as PSEs. Then, both the non-consensus CYTs with no signal peptides predicted by all of SignalP3.0, TatP and SecretomeP and the non-consensus IMPs with positive transmembrane structures predicted by TMHMM or Phobius were defined as Non-PSEs and removed from further study, whereas the remaining non-consensus CYTs with positive signal peptide and non-consensus IMPs with no transmembrane structure were added into the predicted PSEs. In addition, SignalP3.0, Tat and SecretomeP as well as BOMP, TMBETADISC-RBF, and LipoP were utilized to further investigate extracellular features of these PSEs. Finally, pan-genome analysis of the predicted PSEs among the 17 pathogenic L. interrogans strains identified the core, dispensable, and unique PSEs. And the core PSEs with high antigenicity values predicted by the VaxiJen server were determined as final vaccine antigen candidates. PSE, potential surface-exposed proteins; CVPSE, Conserved Vaxijen antigenicity predicted PSE.

Bioinformatic tools used in reverse vaccinology

Subcellular localization of L. interrogans proteins was predicted by PSORTb, CELLO and SOSUI-GramN. These were classified into CYTs, IMPs, periplasmic proteins (PMs), OMPs, or ECPs. SignalP3.0, TatP, SecretomeP, LipoP, TMBETADISC-RBF, and BOMP were used for further extracellular feature prediction. String database was used for analyzing protein–protein interactions (PPI) of L. interrogans PSEs (Franceschini et al., 2013).

Pangenomic analysis of predicted PSEs among 17 leptospiral strains

Reciprocal blast with bidirectional best hit (BBH) and e-values of 10−10 were used for ortholog clustering of L. interrogans in a pan-genome analysis. Additionally, in order to avoid homologous mismatches, both the coverage and identity percent of cut-offs were set to at least 50%. The concepts of core, dispensable, and unique PSEs were used in this study according to the pan-genome classification. Core PSEs were highly conserved among all 17 strains. Dispensable PSEs and unique PSEs existed in less than 16 strains and exclusively in only one strain, respectively. Finally, these core PSEs with high antigenicity values predicted by the VaxiJen server were determined as the final vaccine antigens candidates against leptospirosis.

Results

General information of selected L. interrogans strains

A total of 17 leptospiral strains covering 11 different serovars and 17 STs were selected for analysis (Table 1). Among these strains, serovars Bataviae, Grippotyphosa, and Pyrogenes consisted of three different STs. The present study was focused mainly on those selected strains that are the most common serovars in China; further, the STs associated with evolutionary information were taken into account (Varni et al., 2013).

Prediction schema of PSEs by the negative selection method

The new combined RV strategy is illustrated by Figure 1. We chose L. interrogans str.56601 as an example. A total of 3,702 proteins were analyzed using our novel RV strategy; 2,706 consensus CYTs and IMPs, 666 non-consensus proteins, and 330 PSEs were predicted. Among these 2,706 proteins, 2,166 proteins were predicted as CYTs and 540 as IMPs by at least two of the three software (PSORTb3.0, CELLO and SOSUI-GramN). Moreover, these 666 non-consensus proteins predicted as CYT or IMP by only one of the three software were further assessed according to the following rules: For example, LA_0012 was predicted to be unknown in PSORT, OMP in Cello and CYT in SoSui-GramN, respectively; And LA_0009 was predicted to be unknown in PSORT, OMP in Cello, IMP in SoSui-GramN. A total of 398 non-consensus proteins like LA_0012 and 157 proteins like LA_0009 were subdivided as non-consensus CYTs and non-consensus IMPs, respectively. In addition, the remaining 111 non-consensus proteins like LA_0293 with unknown location in PSORTb, CYT in Cello and IMPs in SoSui-GramN, were defined as both non-consensus CYTs and IMPs. Therefore, the 666 non-consensus proteins were divided into 509 non-consensus CYTs (398 plus 111) and 268 non-consensus IMPs (157 plus 111). Among the 509 non-consensus CYTs, 311 were predicted negative using the three programs (SignalP3.0, TatP, and SecretomeP) and were removed from further analysis. There were 198 non-consensus CYTs with positive signal peptide results; these were retained as PSEs. Another 268 non-consensus IMPs were further analyzed by TMHMM (Krogh et al., 2001) or Phobius (Kall et al., 2004). One hundred and twenty-seven of these were predicted to have transmembrane structures and eliminated from further study. The remaining 141 with no transmembrane structure were retained and classified as PSEs. Finally, 303 were also predicted to be PSEs out of the 666 non-consensus proteins. Altogether, in addition to the 330 PSEs mentioned above, we predicted a total of 633 PSEs from 3,702 proteins in this study. Among them, the subcellular localization of 45 proteins was unknown and the remaining proteins were almost all predicted as OMPs or ECPs. The predicted PSEs were as high as 92.8% (588/633). The detailed information of PSEs in the remaining strains identified was shown in Figure 2.
Figure 2

Subcellular localizations of these PSEs among the 17 representative strains of Pathogenic . EC, extracellular; OM, outer membrane; UN, unknown; VA, variable (proteins with multiple locations-EC or OM).

Subcellular localizations of these PSEs among the 17 representative strains of Pathogenic . EC, extracellular; OM, outer membrane; UN, unknown; VA, variable (proteins with multiple locations-EC or OM).

Pan-genome analysis of predicted PSEs among 17 leptospiral strains

The number of predicted PSEs in the various strains of L. interrogans ranged from 600 to 780 (Figure 2). Gene accumulation curves showed that core genome size fits an exponential decay curve that reached a plateau at 11,043 proteins, whereas the pan PSE grouping fits a power law curve, suggesting the 17 leptospiral strains selected are sufficient to characterize pan core PSEs (Figure 3). Among the 1,103 leptospiral ortholog clusters, 190 core PSEs (17.2%) and 913 dispensable PSEs (82.8%) were shared by all 17 of L. interrogans strains and partly conserved among 2–16 strains, respectively. Furthermore, the pan PSEs included 861 unique PSEs that were found only in one strain. The numbers of unique PSEs in each strain range from 17 (serovar Manilae str.M001) to 103 (serovar Medanensis str.UT053). The dispensable and unique PSEs might be related to different serotypes. The detailed information of all strains and those three dependent serovars was shown in Figure 4. In the present study, our main goal was to predict potential novel protective antigens for the development of universal vaccines against leptospirosis; special attention was given to the 121 high antigenic PSEs from 190 core PSEs, including 37 ECPs, 83 OMPs, and 1 unknown protein localization (see Table 2). As more than 40% of L. interrogans proteins have been annotated as hypothetical proteins, further study of these proteins' functions is needed. Among them, only 55 were categorized into the following COG groups, including main cell wall/membrane/envelope biogenesis (9); function unknown (9); cell motility (7); inorganic ion transport and metabolism (5); general function prediction only (4); Posttranslational modification, protein turnover, chaperones (4); Carbohydrate transport and metabolism (3); Energy production and conversion (2), etc. 16 PSEs were predicted as being involved in(a) cell wall/membrane/envelope biogenesis or (b) cell motility, which are related to the classical function of PSEs (Table S1). In addition, we predicted dispensable and unique PSEs in our pan-genome analysis. For instance, there were 28 unique PSEs in str.56601 and 27 in str. Fiocruz L1-130 (Table S2).
Figure 3

Calculation of core- and pan-genome sizes of Pathogenic .

Figure 4

Pan-genome representation for the 17 representative strains of Pathogenic Core genes of the 17 L.interrogans strains. In our L.interrogans candidates, nine strains belong to three different serovars (B–D). Show pan-genome of the three serovars themselves.

Table 2

The detailed information of the final 121 PSEs with predicted high antigenicity among the 17 representative strains of pathogenic .

Str. 56601 LocusStr. L1-130 LocusOsmolarity in vivo/in vitro) (Matsunaga et al., 2007)Low iron (Lo et al., 2010)Serum (Patarakul et al., 2010)Temperature P-E (Lo et al., 2006)Temperature upshift (Lo et al., 2006)Lai/IPAV (Zhong et al., 2011)COGProductLocalizationVaxijen antigenicity
LA_0022LIC10021Hypothetical proteinOMPs0.7022
LA_0071LIC10064COG4731SHypothetical proteinECPs0.7164
LA_0074LIC10067Hypothetical proteinECPs0.6286
LA_0075LIC10068Hypothetical proteinECPs0.5617
LA_0136LIC10123COG4254SHypothetical proteinOMPs0.6461
LA_0303LIC10260Hypothetical proteinOMPs0.6756
LA_0322LIC10280fibronectin binding proteinOMPs0.7026
LA_0333LIC10288COG0603RPP-loop superfamily ATPaseOMPs0.6805
LA_0346LIC10298COG1558NFlagellar basal body rod protein FlgCECPs0.633
LA_0357LIC10307COG0412QDienelactone hydrolase family proteinOMPs0.6134
LA_0364LIC10313Hypothetical proteinOMPs1.094
LA_0410LIC10359COG2834MOuter membrane lipoprotein-sorting proteinOMPs0.6661
LA_0419LIC10368Hypothetical proteinOMPs0.7143
LA_0430LIC10377Hypothetical proteinOMPs0.8101
LA_0505LIC13050*Hypothetical proteinOMPs0.8807
LA_0568LIC13002COG2067IFatty acid transport proteinOMPs0.6682
LA_0589LIC12986hypothetical proteinOMPs0.5822
LA_0591LIC12985Hypothetical proteinECPs0.5631
LA_0663LIC12930Hypothetical proteinECPs0.5964
LA_0862LIC12765COG2077OThiol peroxidaseECPs0.5274
LA_1122LIC12558Hypothetical proteinOMPs0.7834
LA_1159LIC12525Hypothetical proteinOMPs0.5873
LA_1167LIC12519Hypothetical proteinOMPs0.765
LA_1168LIC12518Hypothetical proteinECPs0.6311
LA_1180LIC12509Hypothetical proteinECPs0.6433
LA_1192LIC12499Hypothetical proteinOMPs0.564
LA_1356LIC12374COG4206HTonB-dependent outer membrane receptorOMPs0.6274
LA_1404LIC12337hypothetical proteinOMPs0.5464
LA_1458LIC12295COG1134GMABC transporter ATP-binding proteinOMPs0.5095
LA_1499LIC12259COG0026FPhosphoribosylaminoimidazole carboxylase ATPase subunitOMPs0.6708
LA_1507LIC12254COG4775MHypothetical proteinOMPs0.5181
LA_1508LIC12253COG4775MHypothetical proteinECPs0.599
LA_1897LIC12002COG0753PCatalaseECPs0.5618
LA_1931LIC11975Hypothetical proteinOMPs0.802
LA_1968LIC11935Hypothetical proteinOMPs0.5621
LA_2069LIC11846Hypothetical proteinOMPs1.0393
LA_2105LIC11813COG1886NUFlagellar motor switch proteinOMPs0.5388
LA_2186LIC11739COG0405Egamma-glutamyltranspeptidaseOMPs0.5669
LA_2272LIC11665Hypothetical proteinOMPs0.6501
LA_2316LIC11625COG2013SHypothetical proteinECPs0.8212
LA_2377LIC11568COG2133GGlucose/sorbosone dehydrogenaseOMPs0.6489
LA_2498LIC11467COG0739MM23 family metalloendopeptidaseOMPs0.6045
LA_2538LIC11435COG5184DZRegulator of chromosome condensationOMPs0.6316
LA_2550LIC11424COG0596REsterase/lipaseECPs0.7281
LA_2595LIC11388Hypothetical proteinOMPs0.863
LA_2601LIC11382Hypothetical proteinOMPs0.5082
LA_2613LIC11370COG0545OFKBP-type peptidylprolyl isomeraseOMPs0.6189
LA_2617LIC11366Hypothetical proteinOMPs0.7008
LA_2641LIC11345COG1886NUEndoflagellar motor switch proteinOMPs0.6637
LA_2672LIC11320Hypothetical proteinOMPs0.5604
LA_2741LIC11271*Hypothetical proteinECPs0.715
LA_2742LIC11270COG1629PFerrichrome-iron receptorECPs0.5877
LA_2746LIC11268Hypothetical proteinOMPs0.7177
LA_2757LIC11259Hypothetical proteinOMPs0.517
LA_2764LIC11254COG1613PSulfate ABC transporter substrate-binding proteinOMPs0.5849
LA_2796LIC11228COG0768MTranspeptidase/penicillin binding proteinOMPs0.6239
LA_2815LIC11213COG1792MRod shape-determining protein MreCOMPs0.6909
LA_2823LIC11207Hypothetical proteinECPs0.7306
LA_2848LIC11188Hypothetical proteinOMPs0.7231
LA_2849LIC11187Hypothetical proteinECPs0.5677
LA_2850LIC11186Hypothetical proteinECPs0.6522
LA_2854LIC11184COG1749NFlagellar hook protein FlgEOMPs0.6733
LA_2949LIC11112COG1843NFlagellar hook assembly scaffolding proteinOMPs0.5626
LA_2958LIC11103COG3144NFlagellar proteinOMPs0.5731
LA_2975LIC11087COG0265OSerine proteaseOMPs0.5597
LA_2992LIC11074COG2267IHypothetical proteinECPs0.7857
LA_2993LIC11073COG1858PCytochrome c peroxidaseOMPs0.6704
LA_2998LIC11067Hypothetical proteinECPs0.8504
LA_3026LIC11052*Hypothetical proteinOMPs0.6387
LA_3050LIC11040Hypothetical proteinOMPs0.5837
LA_3064LIC11030COG1664MCell shape determination proteinOMPs0.7079
LA_3097LIC11003Hypothetical proteinOMPs0.7837
LA_3138LIC10973Hypothetical proteinECPs0.6481
LA_3145LIC10968Hypothetical proteinOMPs0.6892
LA_3150LIC10963COG1652SHypothetical proteinECPs0.6863
LA_3210LIC10920OmpL1OMPs0.9344
LA_3268LIC10873Hypothetical proteinECPs0.5264
LA_3303LIC10845Hypothetical proteinECPs0.6679
LA_3319LIC10833COG2010CHypothetical proteinECPs0.6143
LA_3410LIC10760Hypothetical proteinECPs0.9564
LA_3454LIC10723COG3865SHypothetical proteinOMPs0.5181
LA_3468LIC10714Hypothetical proteinOMPs0.5529
LA_3469LIC10713COG2353SHypothetical proteinOMPs0.6583
LA_3470LIC10712COG1345NFlagellar hook-associated protein FliDECPs0.5272
LA_3501LIC10686COG3487PHypothetical proteinOMPs0.57
LA_3508LIC10683COG3489RHypothetical proteinECPs0.7949
LA_3571LIC10628COG3794CMethylamine utilization proteinOMPs0.6392
LA_3711LIC10520Hypothetical proteinECPs0.7514
LA_3778LIC10464COG2885MOmpA family proteinOMPs0.5249
LA_3834LIC13066COG3607RGlyoxalase/bleomycin resistance protein/dioxygenaseOMPs0.6016
LA_3838LIC13070Hypothetical proteinOMPs0.5302
LA_3849LIC13076Hypothetical proteinOMPs0.6748
LA_3853LIC13078Hypothetical proteinOMPs0.5234
LA_3867LIC13086Hypothetical proteinOMPs0.5336
LA_3870LIC13089Hypothetical proteinOMPs0.5439
LA_3881LIC13101Hypothetical proteinOMPs0.7749
LA_4059LIC13238Hypothetical proteinOMPs0.7565
LA_4083LIC13255COG2303ECholesterol oxidase precursorOMPs0.667
LA_4144LIC13306Hypothetical proteinECPs0.5584
LA_4178LIC13334Hypothetical proteinECPs0.5203
LA_4202LIC13354COG4642SHypothetical proteinOMPs0.6064
LA_4203LIC13355COG0331IFatty acid synthase subunit betaOMPs0.5784
LA_4272LIC13418Hypothetical proteinOMPs0.7046
LA_4291LIC13434Hypothetical proteinOMPs0.5418
LA_4293LIC13436polysaccharide deacetylaseOMPs0.6614
LA_4335LIC13477Hypothetical proteinOMPs0.6873
LB_001LIC20001Hypothetical proteinOMPs0.6864
LB_056LIC20042*Hypothetical proteinECPs0.8468
LB_072LIC20056Hypothetical proteinECPs0.5987
LB_098LIC20077Hypothetical ProteinECPs0.5732
LB_110LIC20087*COG5010UTPR repeat-containing proteinOMPs0.5788
LB_191LIC20151Hypothetical proteinOMPs0.7302
LB_192LIC20152COG1022ILong-chain-fatty-acid–CoA ligaseOMPs0.6367
LB_194LIC20153-COG0726GXylanase/chitin deacetilaseOMPs0.6861
LB_242LIC20185Putative outermembrane proteinOMPs0.5456
LB_258LIC20197COG4206HPutative TonB-dependent outer membrane receptor proteinOMPs0.5271
LB_268LIC20205Hypothetical proteinECPs0.5724
LB_277LIC20212COG4254SHypothetical proteinECPs0.6918
LB_279LIC20214COG4870OCysteine proteaseOMPs0.6034
LB_280LIC20215COG2267IHypothetical proteinECPs0.5028
LA_1462LIC12293COG1633SHypothetical proteinUnknown0.5146

Proteins were confirmed to be reactive with human serum (Lessa-Aquino et al., .

Calculation of core- and pan-genome sizes of Pathogenic . Pan-genome representation for the 17 representative strains of Pathogenic Core genes of the 17 L.interrogans strains. In our L.interrogans candidates, nine strains belong to three different serovars (B–D). Show pan-genome of the three serovars themselves. The detailed information of the final 121 PSEs with predicted high antigenicity among the 17 representative strains of pathogenic . Proteins were confirmed to be reactive with human serum (Lessa-Aquino et al., .

Discussion

PSEs of pathogens are potential immune targets for the host immune system (Solis and Cordwell, 2011). In this study, we analyzed the PSEs of 17 leptospiral representative strains covering 11 main serovars and 17 STs, and identified potential vaccine candidates or virulence factors. Recently, we identified a total of 33 highly reliable ECPs in serovar Lai str.56601 using a newly modified protein-free medium, and 26 of them were found in predicted PSEs of str.56601 in the current study, including LipL32, LipL36, LipL48, LenC, LenE, TonB receptor, OmpA family protein, and 8 putative lipoproteins and 6 hypothetical proteins (Zeng et al., 2013). In addition, a novel L. interrogans OMP microarray was developed and contained a total of 366 predicted lipoproteins and transmembrane OMPs (Pinne et al., 2012). About 70% (239/346) of these OMPs or lipoproteins in the protein array were found in our predicted PSEs of str. Fiocruz L1-130. It has been reported that 1,026 proteins in the TX-114 OMP-enriched fraction were found from the transcriptional and translational responses to temperature shift by high-throughput liquid chromatography tandem mass spectrometry (LC/MS-MS); however, only 154 of the 1026 proteins were found in our predicted PSEs of str.56601. The significant discrepancies could be due to lower coverage of OMPs or lipoproteins within the 1,026 proteins, which comprised no more than 80 predicted or known OMPs or lipoproteins (Lo et al., 2009). In order to comprehensively evaluate the advantages and disadvantages of our negative-screening RV strategy, we further compared another three different data sets including experimentally identified 78 surface-exposed antigens or virulence factors (see Table S3). 499 PSEs of L. interrogans were identified by a positive-selection RV strategy as previously described by Yang et al. (2006) and 346 OMPs/lipoproteins of L. interrogans in the L. interrogans OMP array (Pinne et al., 2012), with our negative-screening results (See Figure 5 and Table S3). Among all 78 known surface-exposed antigens, 63, 55, and 43 were identified in the OMP array (Pinne et al., 2012), in this study and Yang's studies (Yang et al., 2006), respectively. Actually, the highest consistency between protein array result and the known surface-exposed antigens might mainly be due to more than 90% (70/78) of known antigens being located in the outer membrane. Moreover, there are 95 common OMPs/Lipoproteins among Yang's, Pinne's and our study's antigen inventory. There were 84 common OMPs/Lipoproteins between Pinne's and our study while there were only 40 proteins between Pinne's and Yang's study. Thus, for OMP/Lipoprotein, our negative RV strategy predicted more proteins than that of Yang's positive RV strategy. However, the information of extracellular proteins in pathogenic Leptospira spp. is still limited. Further, studies to identify more ECPs and to assess the prediction precision of the two different RV strategies are needed.
Figure 5

Venn diagram detailed the unique and common PSEs among our negative-screening, Yang's positive-screening, and Pinne's OMP array results with known experimentally identified surface-exposed antigens.

Venn diagram detailed the unique and common PSEs among our negative-screening, Yang's positive-screening, and Pinne's OMP array results with known experimentally identified surface-exposed antigens. In this study, pan-genome analysis showed 121 highly antigenic PSEs conserved completely among all 17 strains. Except for several known proteins, including LipL45, OmpL1, and LigB, the majority of these candidates are identified in Leptospira for the first time (Pinne et al., 2012). Among the 121 PSEs, the most promising new vaccine antigens appear to be hypothetical proteins (LA_2741), BatC (LB_056), and lipL71/LruA (LA_3097). LA_2741 and BatC were recognized in leptospirosis patients and identified as differentially reactive antigens between acute- or convalescent-phase leptospirosis patients and healthy individuals (Lessa-Aquino et al., 2013). The lipoprotein LruA, present in pathogenic L. interrogans but not in non-pathogenic L. biflexa, could induce high levels of humoral antibody responses in equine uveitis eyes and in sera of humans with leptospiral uveitis (Verma et al., 2005). Thus, these three PSEs could be worthy of further investigation as novel vaccine candidates and/or diagnostic markers for leptospirosis because of common features, including surface-exposed localization, universal conservation, and eliciting strong antibody production in patients (Verma et al., 2005). Surface-exposed proteins generally comprise a wide array of virulence factors involved in pathogen–host interactions and are responsible for causing disease. Comparing our predicted results to the previous leptospiral OMP microarray data (Pinne et al., 2012), 11 of 15 fibronectin-binding proteins were found in the predicted PSEs of str. Fiocruz L1-130, which were subdivided into four core PSEs (hypothetical protein, TonB-dependent receptor, iron-regulated lipoprotein, and OmpA family proteins) and seven dispensable PSEs (lipoprotein, Lsa66, leucine-rich repeat protein, sphingomyelinases 2 and 3; Pinne et al., 2012). All four core PSEs are involved in adherence to fibronectin during the initial attachment stage of infection and have significant potential to exhibit key roles in the pathogenesis of leptospirosis. For example, TonB-dependent receptor (LA_3468), and iron-regulated lipoprotein (LA_3469) are related to iron uptake, which is essential for pathogenic leptospires (Murray et al., 2008). In our study, iron-regulated lipoprotein (LA_3469) was confirmed to be up-regulated at 37°C as compared to 28°C and could activate the host's immune system to produce a high-level antibody response (our unpublished data), indicating this protein might have an indispensable function in the pathogenesis of L. interrogans. The dispensable PSEs sphingomyelinases Sph2 and Sph3 (LA_1029 and LA_4004) showed distinctly different conservation. It has been confirmed that Sph2 secreted as sphingomyelinase hemolysin has strong hemolytic activity against sheep erythrocytes as well as cytotoxic activity against mouse lymphocytes and macrophages (Zhang et al., 2005, 2008). Thus, Sph2 might be important as a novel virulence factor involved in leptospiral pathogenesis and might be associated with virulence differences among different leptospirosis serovars. Another dispensable PSE is the leucine-rich repeat protein (LA_3028) found exclusively in the highly pathogenic strains: str.56601 and str. Fiocruz L1-130. The leucine-rich repeat protein (LRR) has been reported frequently as a virulence factor in numerous pathogens involved in cell adhesion, invasion, and stimulation of host defense mechanisms (Kobe and Kajava, 2001; Brinster et al., 2007). The leucine-rich repeat protein was identified as a fibronectin-binding protein and it should be, at least partly, related to the high virulence of str.56601 and str. Fiocruz L1-130. The other core PSE like hypothetical protein LA_0505 predicted as a secretion protein through non-classical pathway, has been shown to bind some host extracellular matrices (such as laminin, plasma fibronectin, fibrinogen, etc.) and play an important role in adhesion of L. interrogans (Pinne et al., 2012). Interestingly, LA_0505 was found in the supernatant of L. interrogansstr. 56601 and up-regulated in vivo in our recent study (Zeng et al., 2013). Moreover, LA_0505 has a BIG domain as Ca2+-binding modules during the process of leptospirosis (Raman et al., 2010). The potential virulence factors in predicted PSEs are the four hypothetical proteins LA_1761–1764 identified here. These four PSEs are located in the 54 kb separate circular prophage of str.56601, which was inserted into the larger chromosome at the same time; however, the 54 kb prophage was absent from the genome of str. Fiocruz L1-130 (Bourhy et al., 2007). Until now, there was no experimental evidence suggesting these four proteins might be associated with the virulence of Leptospira; however, PPI analysis in the string database suggested that the four proteins interact mostly with other hypothetical proteins in the PPI network (Figure 6). LA_1762 interacts with lipoproteins LA_3730 and LA_3867, both of which were identified as putative extracellular proteins and thus were recommended as novel candidates for the development of leptospirosis vaccines (Viratyosin et al., 2008). LA_3867 was identified as one of the most strongly up-regulated genes of pathogenic L. interrogans at physiologic osmolarity as compared to low osmolarity, indicating over-expression of LA_3867 in pathogenic leptospires might be associated with transition from survival in the outside environment to infection of mammalian hosts (Matsunaga et al., 2007). Therefore, as an interacting partner of LA_3867, LA_1762 could have a crucial role in successful establishment of host infection.
Figure 6

Protein-protein interaction of the potential virulence factors (LA_1761 to LA_1764) located in the 54 kb circular prophage of .

Protein-protein interaction of the potential virulence factors (LA_1761 to LA_1764) located in the 54 kb circular prophage of .

Conclusions

A new RV negative-screening strategy combined with pan-PSE analysis was used to screen PSEs among 17 L. interrogans strains. We identified 190 core PSEs, 913 dispensable PSEs, and 861 unique PSEs. Further, antigenicity analysis finally identified 121 highly antigenic PSEs as potential vaccine candidates from the 190 core PSEs, which include several known antigens, including LipL45, OmpL1, and LigB, and the vast majority of newly identified potential vaccine candidates against leptospirosis. At the same time, we also characterized many potential virulence factors in our inventory of predicted PSEs. Our prediction results may accelerate vaccine development against leptospirosis and deepen our understanding of leptospiral virulence mechanisms. Moreover, this in silico strategy combined with pan-genome analysis could be a routine method of reverse vaccinology applied widely to similar pathogens. Further, cloning, expression, and purification of these proteins and screening of these potential vaccine candidates are needed.

Author contributions

Conceived and design the experiment: YZZ, XG, Y-FC, and YY; compartive genomic analysis: LZ and DW; predicting subcellular localization: LZ, NH, QZ, and KC; screening known surface-exposed antigens: KD and YZ; write the manuscript: LZ, XG, YZZ, Y-FC, and YY.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  47 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  A genomic island of the pathogen Leptospira interrogans serovar Lai can excise from its chromosome.

Authors:  Pascale Bourhy; Laurence Salaün; Aurélie Lajus; Claudine Médigue; Caroline Boursaux-Eude; Mathieu Picardeau
Journal:  Infect Immun       Date:  2006-11-21       Impact factor: 3.441

3.  Identification of a universal Group B streptococcus vaccine by multiple genome screen.

Authors:  Domenico Maione; Immaculada Margarit; Cira D Rinaudo; Vega Masignani; Marirosa Mora; Maria Scarselli; Hervé Tettelin; Cecilia Brettoni; Emilia T Iacobini; Roberto Rosini; Nunzio D'Agostino; Lisa Miorin; Scilla Buccato; Massimo Mariani; Giuliano Galli; Renzo Nogarotto; Vincenzo Nardi-Dei; Vincenzo Nardi Dei; Filipo Vegni; Claire Fraser; Giuseppe Mancuso; Giuseppe Teti; Lawrence C Madoff; Lawrence C Paoletti; Rino Rappuoli; Dennis L Kasper; John L Telford; Guido Grandi
Journal:  Science       Date:  2005-07-01       Impact factor: 47.728

Review 4.  Pathogenesis of leptospirosis: the influence of genomics.

Authors:  Ben Adler; Miranda Lo; Torsten Seemann; Gerald L Murray
Journal:  Vet Microbiol       Date:  2011-03-05       Impact factor: 3.293

5.  Transcriptional response of Leptospira interrogans to iron limitation and characterization of a PerR homolog.

Authors:  Miranda Lo; Gerald L Murray; Chen Ai Khoo; David A Haake; Richard L Zuerner; Ben Adler
Journal:  Infect Immun       Date:  2010-08-30       Impact factor: 3.441

6.  Global transcriptomic response of Leptospira interrogans serovar Copenhageni upon exposure to serum.

Authors:  Kanitha Patarakul; Miranda Lo; Ben Adler
Journal:  BMC Microbiol       Date:  2010-01-29       Impact factor: 3.605

7.  Feature-based prediction of non-classical and leaderless protein secretion.

Authors:  Jannick Dyrløv Bendtsen; Lars Juhl Jensen; Nikolaj Blom; Gunnar Von Heijne; Søren Brunak
Journal:  Protein Eng Des Sel       Date:  2004-04-28       Impact factor: 1.650

8.  Cytotoxic activity and probable apoptotic effect of Sph2, a sphigomyelinase hemolysin from Leptospira interrogans strain Lai.

Authors:  Yi-Xuan Zhang; Yan Geng; Jun-Wei Yang; Xiao-Kui Guo; Guo-Ping Zhao
Journal:  BMB Rep       Date:  2008-02-29       Impact factor: 4.778

9.  Big domains are novel Ca²+-binding modules: evidences from big domains of Leptospira immunoglobulin-like (Lig) proteins.

Authors:  Rajeev Raman; V Rajanikanth; Raghavan U M Palaniappan; Yi-Pin Lin; Hongxuan He; Sean P McDonough; Yogendra Sharma; Yung-Fu Chang
Journal:  PLoS One       Date:  2010-12-29       Impact factor: 3.240

10.  Genome-wide subcellular localization of putative outer membrane and extracellular proteins in Leptospira interrogans serovar Lai genome using bioinformatics approaches.

Authors:  Wasna Viratyosin; Supawadee Ingsriswang; Eakasit Pacharawongsakda; Prasit Palittapongarnpim
Journal:  BMC Genomics       Date:  2008-04-21       Impact factor: 3.969

View more
  10 in total

Review 1.  Human leptospirosis vaccines in China.

Authors:  Yinghua Xu; Qiang Ye
Journal:  Hum Vaccin Immunother       Date:  2017-12-19       Impact factor: 3.452

2.  Genome-wide identification of novel vaccine candidates for Plasmodium falciparum malaria using integrative bioinformatics approaches.

Authors:  Satarudra Prakash Singh; Deeksha Srivastava; Bhartendu Nath Mishra
Journal:  3 Biotech       Date:  2017-09-15       Impact factor: 2.406

Review 3.  Optimizing the Parametrization of Homologue Classification in the Pan-Genome Computation for a Bacterial Species: Case Study Streptococcus pyogenes.

Authors:  Erwin Tantoso; Birgit Eisenhaber; Frank Eisenhaber
Journal:  Methods Mol Biol       Date:  2022

4.  Development of Human Recombinant Leptospirosis Vaccines.

Authors:  Natasha Rodrigues de Oliveira; Thaís Larré Oliveira; Sérgio Jorge; Odir Antônio Dellagostin
Journal:  Methods Mol Biol       Date:  2022

5.  EpitoCore: Mining Conserved Epitope Vaccine Candidates in the Core Proteome of Multiple Bacteria Strains.

Authors:  Tayna S Fiuza; João P M S Lima; Gustavo A de Souza
Journal:  Front Immunol       Date:  2020-05-05       Impact factor: 7.561

Review 6.  Current trends in targeted therapy for drug-resistant infections.

Authors:  Leila Rahbarnia; Safar Farajnia; Behrooz Naghili; Vahideh Ahmadzadeh; Kamal Veisi; Roghayyeh Baghban; Sayna Toraby
Journal:  Appl Microbiol Biotechnol       Date:  2019-08-14       Impact factor: 4.813

Review 7.  Application of TonB-Dependent Transporters in Vaccine Development of Gram-Negative Bacteria.

Authors:  Jia Wang; Kun Xiong; Qu Pan; Weifeng He; Yanguang Cong
Journal:  Front Cell Infect Microbiol       Date:  2021-01-27       Impact factor: 5.293

8.  Identification of Vibrio parahaemolyticus and Vibrio spp. Specific Outer Membrane Proteins by Reverse Vaccinology and Surface Proteome.

Authors:  Wenbin Wang; Jianxin Liu; Shanshan Guo; Lei Liu; Qianyun Yuan; Lei Guo; Saikun Pan
Journal:  Front Microbiol       Date:  2021-01-28       Impact factor: 5.640

9.  Putative antigenic proteins identified by comparative and subtractive reverse vaccinology in necrotic enteritis-causing Clostridium perfringens isolated from broiler chickens.

Authors:  Ilhem Meniaï; Alexandre Thibodeau; Sylvain Quessy; Valeria R Parreira; Philippe Fravalo; Guy Beauchamp; Marie-Lou Gaucher
Journal:  BMC Genomics       Date:  2021-12-13       Impact factor: 3.969

10.  Surface proteomics and label-free quantification of Leptospira interrogans serovar Pomona.

Authors:  Teerasit Techawiwattanaboon; Praparat Thaibankluay; Chahya Kreangkaiwal; Suwitra Sathean-Anan-Kun; Prasong Khaenam; Jiradej Makjaroen; Trairak Pisitkun; Kanitha Patarakul
Journal:  PLoS Negl Trop Dis       Date:  2021-11-29
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.