Literature DB >> 17109759

In silico and microarray-based genomic approaches to identifying potential vaccine candidates against Leptospira interrogans.

Hong-Liang Yang1, Yong-Zhang Zhu, Jin-Hong Qin, Ping He, Xu-Cheng Jiang, Guo-Ping Zhao, Xiao-Kui Guo.   

Abstract

BACKGROUND: Currently available vaccines against leptospirosis are of low efficacy, have an unacceptable side-effect profile, do not induce long-term protection, and provide no cross-protection against the different serovars of pathogenic leptospira. The current major focus in leptospirosis research is to discover conserved protective antigens that may elicit longer-term protection against a broad range of Leptospira. There is a need to screen vaccine candidate genes in the genome of Leptospira interrogans.
RESULTS: Bioinformatics, comparative genomic hybridization (CGH) analysis and transcriptional analysis were used to identify vaccine candidates in the genome of L. interrogans serovar Lai strain #56601. Of a total of 4727 open reading frames (ORFs), 616 genes were predicted to encode surface-exposed proteins by P-CLASSIFIER combined with signal peptide prediction, alpha-helix transmembrane topology prediction, integral beta-barrel outer membrane protein and lipoprotein prediction, as well as by retaining the genes shared by the two sequenced L. interrogans genomes and by subtracting genes with human homologues. A DNA microarray of L. interrogans strain #56601 was constructed for CGH analysis and transcriptome analysis in vitro. Three hundred and seven differential genes were identified in ten pathogenic serovars by CGH; 1427 genes had high transcriptional levels (Cy3 signal > or = 342 and Cy5 signal > or = 363.5, respectively). There were 565 genes in the intersection between the set encoding surface-exposed proteins and the set of 307 differential genes. The number of genes in the intersection between this set of 565 and the set of 1427 highly transcriptionally active genes was 226. These 226 genes were thus identified as putative vaccine candidates. The proteins encoded by these genes are not only potentially surface-exposed in the bacterium, but also conserved in two sequenced L. interrogans. Moreover, these genes are conserved among ten epidemic serovars in China and have high transcriptional levels in vitro.
CONCLUSION: Of the 4727 ORFs in the genome of L. interrogans, 226 genes were identified as vaccine candidates by bioinformatics, CGH and transcriptional analysis on the basis of the theory of reverse vaccinology. The proteins encoded by these genes might be useful as vaccine candidates as well as for diagnosis of leptospirosis.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 17109759      PMCID: PMC1664576          DOI: 10.1186/1471-2164-7-293

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Leptospirosis is a globally important zoonotic disease caused by pathogenic Leptospira species[1]. Leptospires are thin, helically coiled, motile bacteria, classified into 17 genomospecies (including the saprophyte Leptospira biflexa and the pathogen Leptospira interrogans) on the basis of DNA-DNA hybridization studies, or serologically classified into more than two hundred pathogenic serovars on the basis of structural heterogeneity in the carbohydrate component of the lipopolysaccharide[2,3]. Currently available vaccines, based on inactivated whole bacteria or membrane preparations from pathogenic leptospires, are of low efficacy, have an unacceptable side-effect profile, require annual booster immunizations and do not confer cross-protective immunity against different serovars [4-6]. Because of these concerns, the current major focus in leptospirosis research is to discover cross-species-conserved or cross-serovar-conserved protective antigens that may elicit longer-term protection against a broad range of Leptospira[5,7]. New vaccine development strategies are thus needed for preventing this zoonosis. Reverse vaccinology, which based on the genomic approach, has been applied to some bacteria, and novel vaccine candidate sequences have been identified [8-11]. The genome projects of two Leptospira strains give us intensive knowledge on the whole genome level [12-14]. Although many efforts have been made to identify the surface-exposed proteins of leptospires, finding perfect vaccine candidate antigens that provide cross-protection against different serovars of pathogenic L. interrogans still requires further work[7,15-17]. In our current study, we identified 226 potential candidate vaccine genes against L. interrogans using in silico analysis, comparative genomic hybridization (CGH) and transcriptional analysis, based on a genome-wide DNA microarray comprising 3528 open reading frames (ORFs) derived from the original annotation of L. interrogans strain #56601. These candidate genes not only encode surface-exposed proteins of L. interrogans strain #56601, but also have high transcription levels in vitro. Moreover, the proteins encoded by these genes are conserved in two sequenced L. interrogans and ten epidemic pathogenic serovars in China.

Results

In silico analysis for identification of genes encoding surface-exposed proteins

In 4727 ORFs of L. interrogans strain #56601, 1282 proteins were predicted to be surface-exposed using P-CLASSIFIER, 654 proteins had signal peptides, 813 were predicted to have no more than four α-helices with transmembrane topology, 96 were predicted to have β-barrel topology implying that they are integral β-barrel outer membrane proteins, and 158 were predicted have a lipoprotein signal peptide using SpLiP. The number of genes in the intersection between the set of surface-exposed proteins identified by P-CLASSIFIER and the set of proteins characterized by at least one of the four characteristic topologies is 688. We calculated the similarity of proteins between serovar Lai and serovar Copenhageni as well as between serovar Lai and human (cut-off value: similarity >70% and E value = 1e-10 for two serovars, E value = 1e-10 for serovar Lai and human) using BLASTP. We found 3672 orthologs between the two serovars, and 605 proteins that are similar in serovar Lai and human. Finally, 616 genes were yielded by the bioinformatics study by retaining the orthologs between the two serovars and subtracting the genes that were similar in serovar Lai and human.

Comparative genomic hybridization

We prepared a gene chip microarray corresponding to the complete genome sequence of L. interrogans strain #56601. The chips were hybridized to labelled total DNA extracted from strain Fiocruz L1–130 and ten pathogenic serovars. On the basis of test hybridizations of strain Fiocruz L1–130 vs. the reference sample, we considered genes that gave hybridization ratios between 1.0 and 3.0 to be present in both strains and greater than 10.0 to be absent from the test strain. Ambiguous values between 3.0 and 10.0 may have been due to highly divergent genes or hybridization to paralogous genes. The CGH results revealed that 307 genes of L. interrogans strain #56601 were absent or highly divergent in at least one strain tested. After subtracting these 307 differential genes, we were left with 565 genes, which not only encode presumably surface-exposed proteins but also are conserved in the ten pathogenic serovars.

Transcriptome analysis

Microarray analysis of the mRNA extracted from in vitro grown leptospires revealed that the fluorescence signals of Cy3 and Cy5 ranged from 10.5 to 51,707 (see Figure 1); 1427 genes were expressed above the median level (Cy3 signal ≥ 342 and Cy5 signal ≥ 363.5) in the microarray and therefore as genes with high transcriptional levels. The intersection between the sets of 565 and 1427 genes contained 226 genes. Among them, 8.0% (18/226) were located extracellularly, 53.1% (120/226) in the outer membrane, 16.4% (37/226) in the periplasmic space and 22.6% (51/226) in the inner membrane according to predictions. These vaccine candidates were classified further according their gene names and clusters of orthologous groups (COGs) [18,19](Table 1, 2, 3, 4); 60.6% (137/226) of the candidates had COG annotations.
Figure 1

Identification of highly expressed genes in L. interrogans by microarray. Bacteria were grown in EMJH medium at 37°C and were collected when the culture reached mid-exponential-phase. RNA was purified and labelled with either Cy3 or Cy5 and hybridized with the microarray of L. interrogans strain #56601 (3528 genes). Transcription analysis revealed that 1427 genes were highly expressed (cy3 signal ≥ 342 and cy5 signal ≥ 363.5).

Table 1

The result of vaccine candidates according to localization sites: extracellular

geneCy3 signalCy5 signalCOGproduct
LA0074402.2574.7-hypothetical protein
LA03221,118760.5-hypothetical protein
LA04441,6991,024COG1196D COG4254Shypothetical protein
LA05873,2462,998COG1075RLactonizing lipase
LA0617672487.5-hypothetical protein
LA14331,9461,552-hypothetical protein
LA1508559.2853-putative outermembrane protein
LA1569755.3391.3COG5651Nputative lipoprotein
LA2471354.5572.5COG0457Rputative outermembrane protein
LA28231,466884-putative lipoprotein
LA2975478.8454.2-hypothetical protein
LA2992663.8535.3COG0419Lhypothetical protein
LA3210410.7847.7-hypothetical protein
LA3338899.8795.8-putative lipoprotein
LA3394392.3652-hypothetical protein
LA3779374.8431.7-hypothetical protein
LA3848395.5368.8-putative lipoprotein
LB2257981,069-hypothetical protein
Table 2

The result of vaccine candidates according to localization sites: outermembrane

geneCy3 signalCy5 signalCOGproduct
LA0049471478.2COG0840NT COG2202Taerotaxis sensor receptor, flavoprotein
LA00991,081853.5-hypothetical protein
LA01662,1491,113COG1196Dhypothetical protein
LA01781,224953.8COG0706U60Kd inner membrane protein
LA0241554.5905.7COG1999RSCO1/SenC family protein
LA0253755.8559.2COG2849Shypothetical protein
LA0272462.2734.8-hypothetical protein
LA0301787771.8COG2885Mouter membrane protein OmpA family
LA0330554397.3COG2366RPenicillin G acylase precursor
LA03391,0601,185COG0584CGlycerophosphoryl diester phosphodiesterase
LA03651,2901,959-hypothetical protein
LA0370720.7952.8-hypothetical protein
LA0378692.5560.3COG0457RTPR-repeat-containing proteins
LA03791,134957-hypothetical protein
LA04103,4513,604COG2834Mhypothetical protein
LA0423553712.8COG2931Qhypothetical protein
LA05056,3177,727COG1409Rprobable glycosyl hydrolase
LA0532736.5684.7-hypothetical protein
LA0568434.8435.3COG2067Ihypothetical protein
LA06351,131712.8-S-layer-like array protein
LA0678973.3952.5COG0840NTMethyl-accepting chemotaxis protein mcpB
LA0811478.8770.5-hypothetical protein
LA0818481.3946.8-hypothetical protein
LA0878886.7555.7COG0266LDshA protein
LA09401,4671,026-hypothetical protein
LA09572,7981,542COG1538MUouter membrane efflux protein
LA1009764.8887.7COG5009MPenicillin-binding protein 1A
LA10101,124829-putative outermembrane protein
LA1087376.5536-hypothetical protein
LA10992,3881,500COG3103Thypothetical protein
LA11002,9012,796COG1538MUouter membrane efflux protein
LA1161474.2403COG2067Ilong-chain fatty acid transport protein
LA1174615423COG0834ETamino acid ABC transporter, periplasmic amino acid-binding protein
LA1192616.3545.3-putative outermembrane protein
LA14041,377977-putative outermembrane protein
LA14121,034640.3-hypothetical protein
LA14951,9202,172-putative outermembrane protein
LA1501558.2545.7COG4775M COG5009Mhypothetical protein
LA15071,6151,747-hypothetical protein
LA1690744.7471.7COG0449Mhypothetical protein
LA17331,4181,557-hypothetical protein
LA1912873.8745-putative outermembrane protein
LA1917595.5535.3-hypothetical protein
LA1931941.51,540-putative outermembrane protein
LA1987909.3994.5-putative outermembrane protein
LA1996556.8674-hypothetical protein
LA20242,5942,079-hypothetical protein
LA20631,4631,967-hypothetical protein
LA2094548.2380.3COG1716TFHA-domain containing protein
LA21261,223979.7COG0616OUPutative signal peptide peptidase sppA
LA22151,045679.2COG1196D COG1360NChemotaxis motB protein
LA2238420464.7COG0726Gpolysaccharide deacetylase
LA2266367.3364.5-putative outermembrane protein
LA2267886.21,542COG0457Rputative outermembrane protein
LA2268971.71,074-putative outermembrane protein
LA22954,4456,689COG0532J COG4254SLipL45 protein
LA2318813.2673.8COG4775MPredicted outer membrane protein
LA2368347.8585COG1555L COG3156U COG0477GEPR COG0075Etype II secretion pathway related protein etpK-like protein
LA23751,2552,047COG1450NUGeneral secretory pathway protein D
LA2377377.5418COG0739Mpeptidase, M23/M37 family protein
LA2395847.31,736COG2815Sputative outermembrane protein
LA2413540.7381.2COG0791MProbable lipoprotein nlpC precursor
LA2464362435.7COG3225Ngliding motility protein GldG
LA24683,6536,205COG1196Dhypothetical protein
LA25101,230846COG1452Mhypothetical protein
LA25371,3291,304-hypothetical protein
LA2538624.2606.5-hypothetical protein
LA2612532.8574COG3190Nflagellar protein required for flagellar formation
LA2617656.5697.8-hypothetical protein
LA26561,128637.2COG2968Shypothetical protein
LA2664905.3867.8COG1706Nflagellar P-ring protein precursor
LA2672662.31,116-hypothetical protein
LA27411,649916.7-hypothetical protein
LA2742814.8524.2-hypothetical protein
LA27554,1752,808COG0768Mprobable penicillin-binding protein
LA27571,2131,270COG1792Mrod shape-determining protein mreC
LA28006651,591-hypothetical protein
LA2818681.7440.8-hypothetical protein
LA2857506.7516.5COG0596RPredicted hydrolase or acyltransferase, alpha/beta hydrolase superfamily
LA2949407.5460COG0265Oheat shock protein, HtrA1
LA30691,221786.3-hypothetical protein
LA3091995.5879.7-hypothetical protein
LA3118771.71,239COG0466Ohypothetical protein
LA3149608.3421.8COG1629PHemin receptor
LA3165749.2454.7COG4642Sconserved hypothetical protein with MORN repeat
LA3353432.2698.2-hypothetical protein
LA3403391.3388.8-hypothetical protein
LA3434724.7625.7COG0860MN-acetylmuramoyl-L-alanine amidase
LA3440915.5864.3COG0237Hhypothetical protein
LA3468618584.8COG1629Pprobable TonB-dependent receptor
LA3469658.2685.7COG3487Piron-reglulated protein A
LA35061,4071,266COG0840NTMethyl-accepting chemotaxis protein
LA35521,6522,900-hypothetical protein
LA36321,0281,337COG1413CPBS lyase HEAT-like repeat containing protein
LA3681463.5459-phage-related-like protein
LA37441,318840.2-hypothetical protein
LA3862537.3761.3COG0532Jhypothetical protein
LA3872385.3648.8COG0616OUPutative signal peptide peptidase sppA
LA39385891,026COG0457Rhypothetical protein
LA3970532.7398.5-hypothetical protein
LA40709,7645,630-hypothetical protein
LA42121,6781,684-hypothetical protein
LA42272,4651,953COG5621Rhypothetical protein
LA4232509.3612.2COG2982Mhypothetical protein
LA4261485.7612.5COG0451MGUDP-glucose 4-epimerase
LA42631,0121,290-hypothetical protein
LA4285726.3791.2COG3858Rhypothetical protein
LA43411,0091,146COG0739MPeptidase family M23/M37
LB0181,5491,589COG1635Hhypothetical protein
LB025371.8382.5-hypothetical protein
LB050344533.3-hypothetical protein
LB056443.5523.8COG0457RTPR-repeat-containing protein
LB061550.2769.7COG3211Rhypothetical protein
LB191344.3410COG1629P COG4771Pputative TonB-dependent outer membrane receptor protein
LB199917.3925.2COG1629Pputative outermembrane protein
LB258552.51,082COG4870OCysteine protease
LB2771,634984.3-hypothetical protein
LB2791,115804.3COG1629Phypothetical protein
LB3281,5912,672COG1360N COG2885Mouter membrane protein OmpA
LB3621,2467,69-hypothetical protein
Table 3

The result of vaccine candidates according to localization sites: periplasmic

geneCy3 signalCy5 signalCOGproduct
LA04302,6142,094COG1830Ghypothetical protein
LA00111472.22164-putative lipoprotein
LA0093963.2539.3-hypothetical protein
LA0107476466.3-hypothetical protein
LA02229,87318,863COG2885Mouter membrane protein OmpA family
LA0312526.2366.7COG0739MM23/M37 family protein
LA0413505.3544.2-hypothetical protein
LA04945511,165-hypothetical protein
LA0569404.2366.3-hypothetical protein
LA06168,8777,462COG0457Router membrane lipoprotein lipL41
LA1118610.2614.3-putative outermembrane protein
LA1136636.51,301COG2834Mhypothetical protein
LA1155534.3563.8COG1613Psulfate-binding protein precursor
LA13121,5141,070-hypothetical protein
LA14481,0901,857COG1464Pputative outermembrane protein
LA1998676700.8COG0726Gpolysaccharide deacetylase
LA2023622405COG2010Ccytochrome c
LA22082,2522,334COG3858Rhypothetical protein
LA2277609.5391.3-hypothetical protein
LA2316633.3707.2-putative outermembrane protein
LA23721,4272,257COG2165NUGeneral secretory pathway protein G
LA25311,177894.5COG1196Dhypothetical protein
LA263751,70737,602-LipL32 protein
LA2748714.5537.3COG1613PSulfate-binding protein precursor
LA2820691.3525.5-hypothetical protein
LA2950373.8661COG0265OHtrA2
LA2993349433.8-hypothetical protein
LA35071,360721.7COG2010Cputative cytochrome c
LA3535541.2659.8-hypothetical protein
LA3571607.2492.8COG2010Cputative cytochrome c
LA3576595.8594.5COG1360Nflagellar motor protein
LA37801,3651,432-hypothetical protein
LA3839664618.3COG1881RPhosphatidylethanolamine-binding family protein
LA3944507.3595.2-hypothetical protein
LA4262355515.8-hypothetical protein
LB047506.32,137COG2849Shypothetical protein
LB098735.5507.3COG0726GPredicted xylanase/chitin deacetilase
Table 4

The result of vaccine candidates according to localization sites: innermembrane

geneCy3 signalCy5 signalCOGproduct
LA0238662.5433.2COG1612Ocytochrome-c oxidase assembly factor ctaA
LA0250651.2738.8COG4956RTRAM family protein
LA0314577.2368COG0168PTrk system potassium uptake protein trkH
LA05501,353886.5COG0841VNolG efflux transporter
LA0639858.2469.7-hypothetical protein
LA0650870.7628COG0705RRhomboid family protein
LA0680530.2707.5COG0004Phypothetical protein
LA0960760.7452-hypothetical protein
LA1056702.7607.8COG0840NThypothetical protein
LA11434,0274,074COG0341UPreprotein translocase subunit SecF
LA11911,014790.7COG0840NTMethyl-accepting chemotaxis protein
LA1283902.21,162COG0845Mhypothetical protein
LA1284415.7543COG4591MLipoprotein releasing system transmembrane protein lolC
LA1321374.8860.8COG4232OCthiol:disulfide interchange protein DsbD
LA1397722.8672.3COG1033Rputative Protein export membrane protein SecD/SecF
LA1435612.2524.3COG0392Shypothetical protein
LA1451415.2435.2COG1183IPhosphatidylglycerophosphate synthase
LA14713,3607,809COG3808CPyrophosphate-energized vacuolar membrane proton pump
LA1477566.7436.8COG1519M3-deoxy-D-manno-octulosonic-acid transferase
LA1535521.5685.5-hypothetical protein
LA1554498.7398.2COG1502Ihypothetical protein
LA16954,4932,360-CrcB-like protein
LA19582,6631,551COG0526OCputative outermembrane protein
LA1979483.8657.8COG0463MPutative glycosyl transferase
LA1982342.5452COG3307Mhypothetical protein
LA2050411.3848.8COG0707MUDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase
LA225010,7429,624-Nuclease S1
LA22751,4151,071COG0586SdedA protein
LA23201,3191,496COG0811Ubiopolymer transport protein, putative
LA2604464.3448.7-hypothetical protein
LA27373,8132,157COG0204Iputative acyltransferase
LA28915,2293,140COG1055Phypothetical protein
LA30721,9701,665COG0477GEPRhypothetical protein
LA31101,2622,371COG2156Ppotassium-transporting ATPase, C chain
LA3146877523.2COG2076Phypothetical protein
LA35771,6181,198COG1291Nmotility protein A
LA35862,3481,746COG4270Shypothetical protein
LA3754667.3449.7COG0681USignal peptidase I
LA3777497.3539COG0239DProtein crcB homolog
LA38062,1162,869COG0004PProbable ammonium transporter
LA39165,5185,510-hypothetical protein
LA3926967.81,802COG0841Vtransmembrane efflux pump protein
LA40621,3262,138-hypothetical protein
LA4154638.7759COG3225Nhypothetical protein
LA41551,1401,015COG1277Rprobable permease of ABC transporter
LA4172411392.5-hypothetical protein
LA4228559.5627.8COG4174RDipeptide transport system permease protein dppB
LA4233409985.7COG1172Ghypothetical protein
LA42691,9072,240COG2207K COG0477GEPRtranscriptional regulator, AraC family
LB1742,1503,440COG0501Oheat shock protein HtpX
LB2815,0262,708COG0811Utransport protein ExbB

Discussion

Vaccines composed of whole cells or outer membrane envelope are available in some countries to prevent human leptospirosis, and clinical trials have been reported [20-23]. In view of their disadvantages, especially their inability to elicit longer-term protection against different serovars of pathogenic leptospires, efforts have been focused on developing subunit vaccines[24]. During recent years, Hap1[25] (also known as LipL32[26]), LipL41, OmpL1[27] and Lig[28,29] proteins have been identified as promising vaccine candidates for preclinical trials. The availability of complete genome sequence information for many pathogens and the development of sophisticated computer programs have led to a new paradigm in vaccine development. Now it is possible to screen potential vaccine candidate genes in a reverse manner starting from the genome. This reverse vaccinology was first applied to MenB[30] and is now applied routinely in vaccine development, as in the search for vaccines against S. pneumoniae, Streptococcus agalactiae, Staphylococcus aureus, Porphyromonas gingivalis, Chlamydia pneumoniae and other microorganisms[10]. Bioinformatics analysis is the first important strategy of reverse vaccinology. Gram-negative bacteria have five subcellular location sites: cytoplasm, inner membrane, outer membrane, periplasm and extracellular space. The surface-exposed proteins, i.e. those located in sites other than the cytoplasm, are the most suitable vaccine candidates because they are more susceptible to antibody recognition and can therefore elicit protective immune responses. Many sophisticated computer programs have been developed to predict the subcellular locations of putative proteins in the whole genome [31-33]. Analyzing the gene transcription profile using DNA microarrays provides a second vaccine candidate selection strategy in reverse vaccinology. A gene having a fluorescent signal above the median value corresponds to an expression level higher than 5–10 mRNA copies per genome[34]. Those highly expressed genes could be potential vaccine candidates[34]. Finally, other approaches such as proteomic technology can be used to screen vaccine candidates. Using combined these strategies, genes encoding potential vaccine antigens can eventually be identified. In our preliminary selection, all genes in L. interrogans strain #56601 were searched using P-CLASSIFIER, a system for predicting the subcellular locations of proteins on the basis of amino acid subalphabets and a combination of multiple support vector machines[33]. Moreover, four topologies were predicted by the corresponding programs. Proteins predicted to be surface-exposed and having any of these four topologies were screened as preliminary vaccine candidates. All proteins with more than four predicted transmembrane spanning regions were removed from the list of candidates, not only because they are likely to be completely embedded in the cell membrane and therefore inaccessible to antibodies, but also because they are difficult to express in E. coli[34]. We retained the genes shared by the two sequenced serovars and subtracted genes that had human homologues. The reason we subtracted human homologues is they are likely to cause problems of autoimmunity[35]. Finally, we narrowed the list of vaccine candidates to 616 genes in the genome of L. interrogans strain #56601. In order to explore vaccine candidates that could generate cross-protection against the diverse serovars of leptospires, we applied CGH to identify genes that are conserved among the ten pathogenic strains involved in most infections[36]. This approach allowed us to refine the vaccine candidate shortlist further by eliminating antigens that were not conserved among these serovars. The 565 vaccine candidates not only presumably surface-exposed but also conserved among the ten prevalent serovars in China were identified as the result of this approach. Transcriptome analysis was performed using DNA microarrays of L. interrogans in order to assess the transcription levels of all genes in the genome. A graph of the signal obtained for each gene gave a diagonal distribution reflecting the expression level of that gene. After subtracting genes with transcriptional levels below the median, we were left with 226 genes as vaccine candidates. Applying the theory of reverse vaccinology, 226 genes had been identified as potential vaccine candidates against L. interrogans combined bioinformatics, CGH and transcriptional analysis. Among them, 60.6% (137/226) have COG annotations; thus, nearly 40% either have an unknown function or have no COG annotation. This group of gene products offers great promise as it comprises a pool of previously unexploited vaccine targets. To evaluate our results, we compared our candidates with those identified by others. Gamberini et al. (2005) found approximately 20% potential surface proteins using in silico approach, and sixteen proteins were recognized by antibodies present in human sera[15]. However, only three of them (LA0222, LA2637 and LA2741) appear in our final set. This is not unexpected, since 206 genes encoding hypothetical or unknown proteins were selected from approximately 20% of the genome for cloning and expression. Nally et al. (2005) characterized 32 proteins in outer membrane vesicles of L. interrogans serovar Copenhageni by two-dimensional gel electrophoresis, including previously-described outer membrane proteins (OMPs); in addition, unknown, hypothetical and putative OMPs were also identified[17]. Interestingly, only two proteins (LA0222 and LA2637) are represented among the sixteen proteins found by Gamberini and co-workers. There is an overlap of eight genes between our result and that of Nally et al. (2005) (LA0222, LA0505, LA0616, LA1495, LA2024, LA2295, LA2637 and LA3091). The reasons responsible for the discrepancies among the results may be due to differing methodologies. Genomics, transcriptional profiling and proteomics have emerged in the post genomic era with potential to speed up the vaccine discovery research process. It should be pointed out that those methods have their respective advantages and limitations, and can be complementally utilized in the development of the novel vaccines. Genomics involves the use of various softwares to predict sublocalization of proteins. However, some algorithms have limited accuracy. Although transcriptome analysis uses gene chip array to measure gene expression but suffers from the fact that mRNA levels may not reflect protein levels. Expression of a transcribed gene may be regulated at the level of translation. It is believed that the proteome maps of microorganisms are important to understand cellular status at the protein level, which cannot be deciphered from genome or transcriptome analysis[37]. Proteomics of outer membrane can rapidly identify almost all proteins in outer membrane. However, some of the proteins identified in membrane preparations are in fact typical cytoplasmic proteins[10,38]. Moreover, one of the major disadvantages of subproteomic studies by 2-D gel electrophoresis and mass spectrometry is the potential for contamination via leaky fractionation or lysis[39]. Nally et al. (2005) also revealed that outer membrane vesicles contain small amounts of inner membrane or cytoplasmic proteins in their proteomic study[17]. It is worth mentioning here that mainly surface-exposed proteins such as LipL32 (LA2637)[26,40], LipL41 (LA0616)[27,40], LipL45 (LA2295)[41] and LipL21 (LA0011)[42] have higher transcriptional levels in our results; this suggests that the genes with higher transcriptional levels identified in our current research may be preferable for development as vaccine candidates. This is the first time that CGH and transcription analysis have been used to identify potential candidates for vaccines against L. interrogans. Our present work corroborates previous studies, showing the advantages of reverse vaccinology[8,11]. The next step following our present research is to verify whether the selected vaccine candidates are surface-exposed and to evaluate the protective activities of these proteins. Such studies will lead to the development of safe and effective new vaccines against leptospirosis in the future.

Conclusion

We have performed high-throughput in silico and microarray-based processes that are useful for determining potential vaccine candidates against leptospirosis. In total, 226 genes were identified in the genome of L. interrogans serovar Lai type strain #56601 using bioinformatics, CGH and transcriptional analysis. The proteins encoded by these genes are not only potentially surface-exposed in the bacterium, but also conserved in two sequenced L. interrogans. Moreover, these genes are conserved among ten epidemic serovars in China and have high transcriptional levels in vitro. These proteins might therefore be useful for vaccine candidates as well as for the diagnosis of leptospirosis. Further research, including verification that these vaccine candidates are surface-exposed and evaluation their protective activities, will aid in the study of vaccines against leptospirosis in the future.

Methods

Bacteria strains and growth condition

Ten strains of L. interrogans were used in this study (Table 5). All the strains were obtained from the Institute for Infectious Disease Control and Prevention (IIDC), Beijing, China. Leptospires were maintained by serial passages in guinea pigs for preservation of virulence and were cultured in liquid Ellinghausen-McCullough-Johnson-Harris (EMJH) medium at 28°C or 37°C with shaking under aerobic conditions. Culture conditions were then developed to ensure that only mid-exponential-phase bacterial cultures at a mean density of 106/ml were used in further experimentation. The cells were harvested by centrifugation at 10,000 g for 10 min at 4°C.
Table 5

Bacterial strains used in the study

serogroupserovarstrain
IcterohaemorrhagiaeLaiLai(56601)
CanicolaCanicolaLin
PyrogenesPyrogenes4
AutumnalisAutumnalisLin 4
AustralisAustralis65-9
PomonaPomonaLuo
GrippotyphosaLinhaiLin 6
HebdomadisHebdomadisP 7
BataviaePaidjanL 37
SejroeWolffiL 183
The L. interrogans serogroup Icterohaemorrhagiae serovar Lai type strain #56601 (strain Lai) was used to construct the DNA microarray. The genomic DNA of strain Fiocruz L1–130 was kindly provided by the Centro de Pesquisas Goncalo Moniz.

In silico analysis

Genes and protein data for human and for the sequenced L. interrogans (serovar Lai and serovar Copenhageni) were downloaded from NCBI. P-CLASSIFIER[33] was applied to predict the subcellular locations of proteins in L. interrogans strain #56601. Signal peptide prediction was carried out using SignalP 3.0[43]. α-Helix transmembrane topology prediction was carried out using TMHMM[44]. BOMP was used to predict β-barrel outer membrane proteins[45]. Putative lipoproteins were predicted by SpLiP[46]. To identify proteins orthologous between serovar Lai and serovar Copenhageni as well as between serovar Lai and human, all predicted proteins were searched against each other locally using BLASTP[47]. DNA microarrays of L. interrogans strain #56601 consisting of 3528 annotated ORFs longer than 250bp were prepared as previously described [48]. The genomic DNA of L. interrogans strain #56601 was used for reference in the double-fluorescence hybridization, and the genomic DNA of strain Fiocruz L1–130 was used as a control. A CGH microarray analysis of strain Lai and strain Fiocruz L1–130 was performed first. The qualified threshold determined in this control experiment was used to identify gene deletions in other strains. Reference or test DNA was fluorescently labelled through direct incorporation of Cy3-dCTP or Cy5-dCTP (Amersham Pharmacia Biotech) respectively by a randomly primed polymerization reaction. Unincorporated nucleotides and random primers were removed using QIAquick Nucleotide Removal columns (QIAGEN) according to the manufacturer's instructions. Hybridizations were conducted in a hybridization chamber at 42°C overnight. Slides were washed at 55°C with 1 × SSC containing 0.2% SDS for 10 min and then at 55°C with 0.1 × SSC containing 0.2% SDS for 20 min and finally at room temperature with 0.1 × SSC for 3 min. Competitive hybridization was performed twice for each strain. In the first experiment, L. interrogans strain #56601 reference DNA and the sample DNA were labelled with Cy3 and Cy5, respectively. In the second hybridization, the dyes for labelling were interchanged. Microarrays were scanned using a Chipreader laser scanner GenePix 4000B AXON (Axon Instruments, Union City, CA) according to the manufacturer's recommendations. Spot quantification, signal normalization and data visualization were performed using the programs GeneSpring 5.0.2 (Silicon Genetics) and Microsoft Excel. L. interrogans was grown in EMJH medium at 37°C under aerobic conditions for transcriptome analysis. Only mid-log-phase cultures at a mean density of 106/ml in 100 ml were used in transcriptional experiments. Total RNA was isolated from leptospires using Trizol reagent (Invitrogen) according to the manufacturer's protocol. Contaminating DNA was digested with RQ1 RNase-free DNase (Promega Corp.). The treated RNA was purified with a QIAGEN RNeasy Kit (QIAGEN). RNA (10 μg) was labelled with Cy3 by reverse transcription using Superscript α (Invitrogen). Unincorporated dye was removed using a QIAquick Nucleotide Removal Kit (QIAGEN) as specified in the manufacturer's protocol. Samples were hybridized under cover slides to the microarray slides overnight at 42°C, and then washed as usual. The hybridization slides were processed by Tiffsplit (Agilent) and data were further analyzed using Genespring software 5.0.2 and normalized using mean values combined with Microsoft Excel software. Microarrays were used to assay relative RNA abundance. Flagged spots or SN<2 spots were excluded for intrachip and interchip reproducibility analysis. We calculated the coefficients of three spots in same chip for each gene to estimate intrachip reproducibility using Microsoft Excel. The signal values from the experiments represent average mRNA abundances. As in the CGH experiments, the dyes for labelling Cy3 and Cy5 were interchanged in the second hybridization. Figure 2 is a scheme of the procedure we used to identify the vaccine candidates as described above (the numbers in parentheses are the results after the corresponding procedure step).
Figure 2

Schematic representation of general procedure to identify the vaccine candidates in the genome of L. interrogans (the numbers in parentheses are the results after the corresponding procedure step).

Authors' contributions

HLY and XKG designed the research project. HLY and YZZ carried out the bioinformatics analysis. PH and HLY completed the CGH. JHQ and HLY coordinated the transcriptome analysis. HLY and XKG drafted the manuscript. XCJ and GPZ participated in the design of the study and helped to draft the manuscript. All authors contributed to the writing and preparation of the manuscript. All authors read and approved the final manuscript.
  44 in total

1.  [The evaluation of the reactogenicity and immunogenic activity of a new concentrated inactivated leptospirosis vaccine].

Authors:  V N Ikoev; M A Gorbunov; B F Vachaev; E A Iagovkin; V F Kondratenko; Iu V Anan'ina; T I Ansimova; N I Kostina; I L Iur'eva; M G Nikitin
Journal:  Zh Mikrobiol Epidemiol Immunobiol       Date:  1999 Jul-Aug

2.  [The reactogenicity and immunogenicity of the first Cuban vaccine against human leptospirosis].

Authors:  R Martínez Sánchez; A M Obregón Fuentes; A Pérez Sierra; A Baly Gil; M Díaz González; M Baró Suárez; R Menéndez Capote; A Ruiz Pérez; G Sierra González; A U López Chávez
Journal:  Rev Cubana Med Trop       Date:  1998

3.  PSORTb v.2.0: expanded prediction of bacterial protein subcellular localization and insights gained from comparative proteome analysis.

Authors:  J L Gardy; M R Laird; F Chen; S Rey; C J Walsh; M Ester; F S L Brinkman
Journal:  Bioinformatics       Date:  2004-10-22       Impact factor: 6.937

Review 4.  A genomic perspective on protein families.

Authors:  R L Tatusov; E V Koonin; D J Lipman
Journal:  Science       Date:  1997-10-24       Impact factor: 47.728

Review 5.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

6.  Leptospiral outer membrane proteins OmpL1 and LipL41 exhibit synergistic immunoprotection.

Authors:  D A Haake; M K Mazel; A M McCoy; F Milward; G Chao; J Matsunaga; E A Wagar
Journal:  Infect Immun       Date:  1999-12       Impact factor: 3.441

7.  Evidence of cross-protection within Leptospira interrogans in an experimental model.

Authors:  C Sonrier; C Branger; V Michel; N Ruvoën-Clouet; J P Ganière; G André-Fontaine
Journal:  Vaccine       Date:  2000-08-15       Impact factor: 3.641

8.  The leptospiral major outer membrane protein LipL32 is a lipoprotein expressed during mammalian infection.

Authors:  D A Haake; G Chao; R L Zuerner; J K Barnett; D Barnett; M Mazel; J Matsunaga; P N Levett; C A Bolin
Journal:  Infect Immun       Date:  2000-04       Impact factor: 3.441

9.  Purification and proteomic analysis of outer membrane vesicles from a clinical isolate of Leptospira interrogans serovar Copenhageni.

Authors:  Jarlath E Nally; Julian P Whitelegge; Rodrigo Aguilera; Martha M Pereira; David R Blanco; Michael A Lovett
Journal:  Proteomics       Date:  2005-01       Impact factor: 3.984

10.  Comparative analysis of the LPS biosynthetic loci of the genetic subtypes of serovar Hardjo: Leptospira interrogans subtype Hardjoprajitno and Leptospira borgpetersenii subtype Hardjobovis.

Authors:  A de la Peña-Moctezuma; D M Bulach; T Kalambaheti; B Adler
Journal:  FEMS Microbiol Lett       Date:  1999-08-15       Impact factor: 2.742

View more
  15 in total

Review 1.  Leptospira: the dawn of the molecular genetics era for an emerging zoonotic pathogen.

Authors:  Albert I Ko; Cyrille Goarant; Mathieu Picardeau
Journal:  Nat Rev Microbiol       Date:  2009-10       Impact factor: 60.633

Review 2.  Secretome, surfome and immunome: emerging approaches for the discovery of new vaccine candidates against bacterial infections.

Authors:  Pratistha Dwivedi; Syed Imteyaz Alam; Rajesh Singh Tomar
Journal:  World J Microbiol Biotechnol       Date:  2016-07-27       Impact factor: 3.312

3.  Identification of conserved surface proteins as novel antigenic vaccine candidates of Actinobacillus pleuropneumoniae.

Authors:  Xiabing Chen; Zhuofei Xu; Lu Li; Huanchun Chen; Rui Zhou
Journal:  J Microbiol       Date:  2012-12-30       Impact factor: 3.422

Review 4.  The future of human DNA vaccines.

Authors:  Lei Li; Fadi Saade; Nikolai Petrovsky
Journal:  J Biotechnol       Date:  2012-09-07       Impact factor: 3.307

5.  Mining the Moraxella catarrhalis genome: identification of potential vaccine antigens expressed during human infection.

Authors:  Elizabeth A Ruckdeschel; Charmaine Kirkham; Alan J Lesse; Zihua Hu; Timothy F Murphy
Journal:  Infect Immun       Date:  2008-01-28       Impact factor: 3.441

6.  Nephele: genotyping via complete composition vectors and MapReduce.

Authors:  Marc E Colosimo; Matthew W Peterson; Scott Mardis; Lynette Hirschman
Journal:  Source Code Biol Med       Date:  2011-08-18

7.  Leptospirosis vaccines.

Authors:  Zhijun Wang; Li Jin; Alicja Wegrzyn
Journal:  Microb Cell Fact       Date:  2007-12-11       Impact factor: 5.328

8.  Comparative genomic analysis of Tropheryma whipplei strains reveals that diversity among clinical isolates is mainly related to the WiSP proteins.

Authors:  My-Van La; Nicolas Crapoulet; Pascal Barbry; Didier Raoult; Patricia Renesto
Journal:  BMC Genomics       Date:  2007-10-02       Impact factor: 3.969

9.  Genetic diversity among major endemic strains of Leptospira interrogans in China.

Authors:  Ping He; Yue-Ying Sheng; Yao-Zhou Shi; Xiu-Gao Jiang; Jin-Hong Qin; Zhi-Ming Zhang; Guo-Ping Zhao; Xiao-Kui Guo
Journal:  BMC Genomics       Date:  2007-07-01       Impact factor: 3.969

10.  The OmpA-like protein Loa22 is essential for leptospiral virulence.

Authors:  Paula Ristow; Pascale Bourhy; Flávia Weykamp da Cruz McBride; Claudio Pereira Figueira; Michel Huerre; Patrick Ave; Isabelle Saint Girons; Albert I Ko; Mathieu Picardeau
Journal:  PLoS Pathog       Date:  2007-07       Impact factor: 6.823

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.