Literature DB >> 20875156

Bioinformatics analysis of Brucella vaccines and vaccine targets using VIOLIN.

Yongqun He1, Zuoshuang Xiang.   

Abstract

BACKGROUND: Brucella spp. are Gram-negative, facultative intracellular bacteria that cause brucellosis, one of the commonest zoonotic diseases found worldwide in humans and a variety of animal species. While several animal vaccines are available, there is no effective and safe vaccine for prevention of brucellosis in humans. VIOLIN (http://www.violinet.org) is a web-based vaccine database and analysis system that curates, stores, and analyzes published data of commercialized vaccines, and vaccines in clinical trials or in research. VIOLIN contains information for 454 vaccines or vaccine candidates for 73 pathogens. VIOLIN also contains many bioinformatics tools for vaccine data analysis, data integration, and vaccine target prediction. To demonstrate the applicability of VIOLIN for vaccine research, VIOLIN was used for bioinformatics analysis of existing Brucella vaccines and prediction of new Brucella vaccine targets.
RESULTS: VIOLIN contains many literature mining programs (e.g., Vaxmesh) that provide in-depth analysis of Brucella vaccine literature. As a result of manual literature curation, VIOLIN contains information for 38 Brucella vaccines or vaccine candidates, 14 protective Brucella antigens, and 68 host response studies to Brucella vaccines from 97 peer-reviewed articles. These Brucella vaccines are classified in the Vaccine Ontology (VO) system and used for different ontological applications. The web-based VIOLIN vaccine target prediction program Vaxign was used to predict new Brucella vaccine targets. Vaxign identified 14 outer membrane proteins that are conserved in six virulent strains from B. abortus, B. melitensis, and B. suis that are pathogenic in humans. Of the 14 membrane proteins, two proteins (Omp2b and Omp31-1) are not present in B. ovis, a Brucella species that is not pathogenic in humans. Brucella vaccine data stored in VIOLIN were compared and analyzed using the VIOLIN query system.
CONCLUSIONS: Bioinformatics curation and ontological representation of Brucella vaccines promotes classification and analysis of existing Brucella vaccines and vaccine candidates. Computational prediction of Brucella vaccine targets provides more candidates for rational vaccine development. The use of VIOLIN provides a general approach that can be applied for analyses of vaccines against other pathogens and infection diseases.

Entities:  

Year:  2010        PMID: 20875156      PMCID: PMC2946783          DOI: 10.1186/1745-7580-6-S1-S5

Source DB:  PubMed          Journal:  Immunome Res        ISSN: 1745-7580


Background

Brucella is a Gram-negative, facultative intracellular bacterium that causes brucellosis in humans and animals [1]. Brucella are taxonomically placed in the alpha-2 subdivision of the class Proteobacteria. Traditionally there are six species of Brucella based on the preferential host specificity: B. melitensis (goats), B. abortus (cattle), B. suis (swine), B. canis (dogs), B. ovis (sheep) and B. neotomae (desert mice). The first four species listed in decreasing order of severity are pathogenic to humans making brucellosis a zoonotic disease. These bacteria are also amenable for use in biological warfare and bio-terrorism. Recently, two new species B. cetaceae (cetacean) and B. pinnipediae (seal) have been described [2]. Complete genome sequences of 10 Brucella strains are currently available in the NCBI RefSeq database. Four genomes from B. abortus, B. melitensis, and B. suis have been extensively analyzed [3-6]. While animal brucellosis vaccines are commercially available, there is no effective and safe human vaccine against virulent Brucella infections. Extensive studies on Brucella have recently been concentrated on understanding the mechanisms for protective Brucella immunity and the development of effective human brucellosis vaccines. VIOLIN (http://www.violinet.org) is a web-based vaccine database and analysis system. VIOLIN contains general information on microbial pathogenesis, host ranges, and host protective immunity, as well as vaccine-specific information such as vaccine type, preparation method, genetically engineered genes, and host responses in various animal models. VIOLIN contains information about 454 vaccines and vaccine candidates for 73 pathogens. VIOLIN contains many bioinformatics tools for vaccine literature mining, vaccine data analysis and integration, and vaccine target prediction. For example, VIOLIN includes Vaxmesh and Vaxpresso programs that may be used to mine vaccine literature based on MeSH controlled vocabulary and natural language processing (NLP), respectively. Dr. Yongqun He, the founder of the VIOLIN initiated and leads community-based development of the Vaccine Ontology to support vaccine integration and automated reasoning. A web-based vaccine target prediction program Vaxign available in VIOLIN is used to predict vaccine targets based on genome sequence analysis using a reverse vaccinology strategy. As of May 13, 2010, more than 2,000 Brucella vaccine-related literature papers were searchable in PubMed, and 10 Brucella genomes have been published in the NCBI RefSeq database. To support Brucella vaccine research and development, we systematically curated from the literature existing Brucella vaccine information, which are stored in VIOLIN for query and further analyses. Different VIOLIN tools are also used to analyze Brucella vaccines and predict new vaccine targets.

Results

Brucella vaccine literature mining in VIOLIN

All Brucella vaccine-related articles were downloaded from PubMed and stored in VIOLIN. Information for these articles was processed and used for varying literature mining applications in VIOLIN. For example, Vaxmesh, a MeSH-based vaccine literature visualization and mining tool in VIOLIN, was used (Figure 1). The Medical Subject Headings (MeSH; http://www.nlm.nih.gov/mesh/) is the controlled vocabulary thesaurus developed by the National Library of Medicine (NLM) to index articles deposited for the MEDLINE/PubMed database. There are over 25,000 MeSH terms organized in a hierarchical fashion based on 15 top-level categories. The MeSH hierarchical structure permits literature searching at various levels of specificity. Vaxmesh provides an interactive web interface for users to locate articles using MeSH terms in a hierarchical MeSH tree structure. Figure 1 demonstrates a MeSH hierarchy for the term “Gene Deletion”. This major MeSH term is associated with five papers in Brucella vaccine area (Figure 1A-1B). A click on the MeSH term links the program to another VIOLIN web page that reveals detailed information about each of the five papers. A web link to PubMed is also available (Figure 1C). According the MeSH indexing, those articles associated with Brucella vaccines also cover different areas such as anatomy (261 articles), physical sciences (194 articles), and geographic locations (47 articles) (Figure 1).
Figure 1

Vaxmesh analysis of (A) Visualization of MeSH hierarchy in Vaxmesh after keyword “Brucella” search; (B) The two clickable numbers next to each MeSH term links to all publications with the term as a MeSH term or a major MeSH term, respectively. A click on “5” next to the MeSH term “Gene Deletion” links to another page with detailed citation information; (C) The PubMed record is accessible after a click on an article title in (B).

Vaxmesh analysis of (A) Visualization of MeSH hierarchy in Vaxmesh after keyword “Brucella” search; (B) The two clickable numbers next to each MeSH term links to all publications with the term as a MeSH term or a major MeSH term, respectively. A click on “5” next to the MeSH term “Gene Deletion” links to another page with detailed citation information; (C) The PubMed record is accessible after a click on an article title in (B). Vaxperts is a new MeSH-based VIOLIN program that provides a literature-based social network of vaccine experts based on their publication records in PubMed. Vaxperts allows vaccine experts to find their co-authors and co-authors's co-authors of shared publications. This approach facilitates collaborative vaccine research and development. For example, a search for the keyword “Brucella” in Vaxperts resulted in the listing of 2454 authors that have contributed to at least one Brucella vaccine article. VIOLIN also contains three additional literature mining programs. These are: Vaxpresso, a natural language processing (NLP)-based vaccine literature mining program; VIOLIN Litesearch, an advanced keyword- and category-based search for vaccine literature; and Vaxlert, a literature alert program that provides periodical literature updates through Emails based on the specification of a VIOLIN user.

Brucella vaccines curated in VIOLIN

With many literature mining programs available in VIOLIN, it is possible to make manual curation of Brucella vaccine information more efficient. Brucella vaccine curation was performed using a web-based literature mining and curation system called Limix [7,8]. Limix was developed to efficiently combine semi-automatic literature mining, manual curation, and data submission. . All curated data includes references. The curated data is published in VIOLIN and available for query only after it is critically reviewed and verified by an expert. VIOLIN contains 38 curated Brucella vaccines or vaccine candidates that have been officially licensed or proven to provide protection in an animal model (Table 1). Specifically, VIOLIN includes 20 B. abortus vaccines, 16 B. melitensis vaccines, and two B. suis vaccines. Among them, four Brucella vaccines are licensed for commercial uses in cattle, sheep, goat, and pigs. All others are research vaccines which have been demonstrated to induce protection in vivo against virulent Brucella challenges at least in some laboratory models (mostly in the mouse model). In terms of vaccine types, 1, 8, 10, and 19 vaccines are bacterial vector vaccine, DNA vaccines, subunit vaccines, and live attenuated vaccines, respectively.
Table 1

Brucella vaccines curated in VIOLIN and listed in VO.

#Vaccine namesVO IDTypeLicensed
Brucella abortus vaccines

1B. abortus DNA vaccine pcDNA-SODVO_0000018DNAResearch
2B. abortus RB51VO_0000021LALicensed
3B. abortus strain 19VO_0000022LALicensed
4B. abortus DNA vaccine encoding BCSP31, SOD and L7/L12VO_0000321DNAResearch
5B. abortus subunit vaccine using L7/L12VO_0000323SubResearch
6Brucellaabortus bacA mutantVO_0000347LAResearch
7B. recombinant SurA protein vaccineVO_0000358SubResearch
8B. recombinant DnaK protein vaccineVO_0000373SubResearch
9B. abortus DNA vaccine using L7/L12 and Omp16VO_0000374DNAResearch
10B. abortus DNA vaccine encoding L7/L12 and P39VO_0000385DNAResearch
11B. abortus with znuA deletion VO_0000386LAResearch
12B. abortus porin-S-LPS VO_0000403SubResearch
13B. abortus RB51WboAVO_0000404LAResearch
14Recombinant O. anthropi 49237SODVO_0000407BTResearch
15B. abortus pcDNA-BLSVO_0000421DNAResearch
16Escheriosome delivery of B. abortus L7/L12VO_0000423SubResearch
17NPAP Brucella vaccineVO_0000450IAResearch
18B. abortus strain RB51SODVO_0000720LAResearch
19B. abortus strain 45/20VO_0000723LAResearch
20B. abortus S19 with P39 deletionVO_0000826LAResearch

Brucella melitensis vaccines

21B. melitensis Rev. 1 with bp26 and omp31 deletionsVO_0001171LAResearch
22B. melitensis strain VTRM1VO_0000300LAResearch
23B. melitensis lipopolysaccharide vaccineVO_0000311SubResearch
24B. melitensis LPS-GBOMP noncovalent complex VO_0000312SubResearch
25B. melitensis DNA vaccine encoding Omp31VO_0000325DNAResearch
26B. melitensis bp26 deletion vaccineVO_0000338LAResearch
27B. melitensis WR201VO_0000345LAResearch
28B. ovis microparticle subunit vaccineVO_0000354SubResearch
29microencapsulated B. melitensis mutant vaccineVO_0000398LAResearch
30B. melitensis Bp26 and Tf vaccineVO_0000411SubResearch
31B. melitensis P39 recombinant protein vaccineVO_0000412LAResearch
32recombinant chimera BLSOmp31VO_0000413SubResearch
33B. melitensis DNA vaccine encoding Omp31 boosted with Omp31VO_0000436DNAResearch
34B. melitensis Rev. 1 with P39 deletionVO_0000633LAResearch
35B. melitensis strain Rev. 1VO_0000710LALicensed
36Brucella DNA vaccine encoding chimera BLSOmp31VO_0001144DNAResearch

Brucella suis vaccines

37B. suis strain VTRS1VO_0000303LAResearch
38B. suis strain 2VO_0000722LALicensed

Note: The abbreviations LA, Sub, DNA, Con, IV, and BV represent live attenuated vaccine, subunit vaccine, DNA vaccine, conjugation vaccine, inactivated vaccine, and bacterial vector vaccine, respectively.

Brucella vaccines curated in VIOLIN and listed in VO. Note: The abbreviations LA, Sub, DNA, Con, IV, and BV represent live attenuated vaccine, subunit vaccine, DNA vaccine, conjugation vaccine, inactivated vaccine, and bacterial vector vaccine, respectively.

Ontology representation of Brucella vaccines

A biomedical ontology represents the consensus-based controlled vocabularies of terms and relations which are logically formulated in such a way as to promote automated reasoning. Ontologies are able to structure complex biomedical domains and relate the myriad of data to shared understanding of biomedicine. Ontologies can be used for different purposes. The Gene Ontology (GO) is a well-known example of an ontology created for the primary purpose of providing controlled and standardized terms for naming different types of biological processes, cellular components, and molecular functions [9]. This ontology allows the common representation of attributes of gene products regardless of species of origin. Creating such ontology-based annotations is highly valuable both for querying databases and analyzing high throughput data. This has a significant impact since as of August 2010, over 2,500 peer-reviewed publications are identified through a PubMed search of “Gene Ontology”, and approximately 35,000 hits are identified through a Google Scholar search using the same keywords. Ontologies can also be used for representation of encyclopedic knowledge, data exchange, and computational data analysis and reasoning. The Vaccine Ontology (VO; http://www.violinet.org/vaccineontology) is a collaborative, community-based ontology in the vaccine domain. VO can be used for vaccine data standardization, integration, and computer-assisted reasoning. VO utilizes the Basic Formal Ontology (BFO) (http://www.ifomis.org/bfo), a domain-independent ontology, as an upper level ontology. The VO was developed using the W3C standard Web Ontology Language (OWL) (http://www.w3.org/TR/owl-guide/). The latest version of VO is always available at http://purl.obolibrary.org/obo/vo.owl. In addition, VO has been listed in the OBO (Open Biomedical Ontologies) website (http://www.obofoundry.org/cgi-bin/detail.cgi?id=vaccine), and deposited in the NCBO BioPortal (http://bioportal.bioontology.org/virtual/1172). To provide a means for users to visualize the definitions and usages of VO terms and their relations, a VO Browser (http://www.violinet.org/vaccineontology/vobrowser/) was developed. As with other vaccines, Brucella vaccines in VO are asserted using single inheritance based on Brucella species. Figure 2A demonstrates the asserted hierarchy of B. abortus vaccines in VO. As an OWL document, VO also supports computational inference with an OWL reasoner, such as FACT++ [10]. For example, RB51 is asserted under Brucella abortus vaccine (Figure 2A). Since RB51 has the qualities of ‘live’ and ‘attenuated’, it is also inferred as a ‘live attenuated Brucella vaccine’ using FACT++ (Figure 2B). Figure 2 provides a screenshot of Brucella vaccines listed in VO based on computational reasoning.
Figure 2

VO hierarchy of (A) Asserted hierarchy; (B) Inferred hierarchy.

VO has been used in many applications associated with Brucella vaccines. It can be used to improve PubMed searching efficiency in the vaccine domain. A user case study would be to search “live attenuated Brucella vaccine” in PubMed. As of April 10, 2009, a direct PubMed search of this string of keywords returned 56 papers (or PubMed hits). VO includes 13 live attenuated Brucella vaccines that have the qualities of ‘live’ and ‘attenuated’. When these specific Brucella vaccine terms were also included in a PubMed search, the number of positive paper hits in PubMed increased by more than 10-fold [11]. The combination of VO with SciMiner, a literature mining program, significantly improves PubMed searching efficiency in the general vaccine domain [12]. It was also found that the application of VO dramatically increased the performance of vaccine-induced IFN- interaction networks [13]. VO hierarchy of (A) Asserted hierarchy; (B) Inferred hierarchy. Besides vaccine hierarchy, VO can also be used to represent (or model) vaccine investigation. As demonstrated in our two recent reports, vaccine protection investigation can be represented in VO by three continuous steps: vaccination, pathogen challenge, and vaccine efficacy measurement [14,15]. A measurement of vaccine efficacy can be assessed by host survival for the pathogens (e.g., Influenza virus) which kill the infected host (e.g., mouse) [14] or by pathogen colony forming units (CFU), a measurement for those pathogens (e.g., Brucella) which cannot kill infected host but exhibit diminished replication in a vaccinated host than that in unvaccinated host [15]. It is hypothesized that some parameters will play more important roles than others in determining the protection efficacy of Brucella vaccines. To test this hypothesis, the data for 151 groups of Brucella vaccine protection investigations were collected in VIOLIN from peer-reviewed literature publications and analyzed using ANOVA. Out of 16 parameters, 10 were found statistically significant (P-value <0.05) in contributing to protection based on a statistical ANOVA analysis. Examples of these parameters included vaccine strain, vaccine viability, vaccination route, vaccination dose. However, other six parameters, including IL-12 vaccine adjuvant, mouse sex, vaccination route, animal age, vaccination-challenge interval, and challenge dose, were not found statistically significant (P-value > 0.05). A careful study of this use case led to building and validating an ontology-based semantic framework to formally represent ANOVA [15]. Such an ontology-based representation of biomedical data for statistical analysis allows data consistency checking and data sharing in the Semantic Web [16].

Literature curation of Brucella protective antigens

The VIOLIN Protegen program stores protective antigens that have been verified experimentally to induce protective immunity. Protegen contains 14 protective Brucella antigens (Table 2). Among the 14 Brucella proteins, four proteins are outer membrane proteins. The other nine proteins are located in cytoplasm (5 proteins), periplasm (4 proteins), and cytoplasmic membrane (1 protein).
Table 2

Vaxign-predicted vaccine targets from B. abortus strain 2308.

#Locus TagRefSeq #SymbolTMHAdhesin Prob.Con-edHost Simil.Protein Notes
Cell Motility

1BAB2_1097YP_419224.1FlgK 00.535Xflagellar hook-associated protein FlgK
2BAB2_1098YP_419225.1FlgE 00.749Xflagellar hook protein FlgE
3BAB1_0260YP_413736.2FlgJ00.656Flagellar protein FlgJ:Mannosyl-glycoprotein endo-beta-N-acetylglucosamidase
4BAB1_1726YP_415076.110.229Xhypothetical protein

TonB-dependent Receptor Protein: Inorganic Ion Transport and Metabolism

5BAB2_0233YP_418452.100.405XTonB-dependent receptor protein
6BAB2_1150YP_419272.100.691XTonB-dependent receptor protein:Pollen allergen Poa pIX/Phl pVI, C-terminal
7BAB1_1367YP_414742.100.655XTonB-dependent receptor protein

ATP/GTP-binding Site Motif A (P-loop): Porin, Alpha proteobacteria type

8BAB1_0659YP_414101.1Omp2a 00.611Porin, alpha proteobacteria type
9BAB1_0660YP_414102.1Omp2b00.585XPorin, alpha proteobacteria type

Cell wall/membrane/envelope Biogenesis

10BAB1_0045YP_413545.100.388XBacterial surface antigen (D15)
11BAB1_0115YP_413611.100.793Xouter membrane protein, putative
12BAB1_0116YP_413612.100.58Xouter membrane protein, putative
13BAB1_0707YP_414149.100.635XOrganic solvent tolerance protein
14BAB1_0722YP_414164.1Omp2500.554XOmpA-like transmembrane domain
15BAB1_1176YP_414567.110.408XBacterial surface antigen (D15)
16BAB1_1226YP_414612.130.571XMotY protein: OmpA/MotB domain
17BAB1_1302YP_414685.1RopB10.815Xhypothetical protein
18BAB1_1579YP_414943.110.669XOmpW family
19BAB1_1639YP_414995.1Omp31-100.736XOmpA-like transmembrane domain
20BAB1_1707YP_415057.100.371XMotY protein: OmpA/MotB domain
21BAB2_0314YP_418525.110.649Xheat resistant agglutinin 1 precursor
22BAB1_0963YP_414386.100.415XOuter membrane efflux protein

Replication, Recombination and Repair

23BAB2_0636YP_418811.100.299XDNA topoisomerase I
24BAB1_0121YP_413617.100.162XXDEAD/DEAH box helicase

Lipid Transport and Metabolism

25BAB1_0967YP_414390.100.764Membrane protein involved in aromatic hydrocarbon degradation

Posttranslational Modification, Protein Turnover, Chaperones

26BAB1_1944YP_415281.110.518XPpiC-type peptidyl-prolyl cis-trans isomerase

Unknown Function

27BAB1_1705YP_415055.100.594XTPR repeat:Molluscan rhodopsin C-terminal tail
28BAB1_1854YP_415198.100.759hypothetical protein
29BAB2_0071YP_418316.100.492Xhypothetical protein
30BAB1_0069YP_413569.100.886hypothetical protein BAB1_0069
31BAB1_0897YP_414322.120.279XAntifreeze protein, type I
32BAB1_0942YP_414367.110.346XRNA-binding region RNP-1 (RNA recognition motif)

Abbreviations: TMH, transmembrane helixes; Adhesin Prob., Adhesin probability; Con-ed represents the conserved proteins among five other genomes from virulent B. abortus strain 9-941, B. melitensis strains 16M and ATCC 23457, and B. suis strains 1330 and ATCC 23445; Host Simil., similarity to human and mouse genomes.

Vaxign-predicted vaccine targets from B. abortus strain 2308. Abbreviations: TMH, transmembrane helixes; Adhesin Prob., Adhesin probability; Con-ed represents the conserved proteins among five other genomes from virulent B. abortus strain 9-941, B. melitensis strains 16M and ATCC 23457, and B. suis strains 1330 and ATCC 23445; Host Simil., similarity to human and mouse genomes. For vaccine development against Brucella infections where T cell response is critical, subcellular localization is not usually an issue since a T cell response could be directed to any protein target. Our curated results confirm that protective Brucella antigens may occur in different subcellular locations.

Prediction of potential Brucella vaccine targets

Reverse vaccinology is an emerging vaccine development approach that starts with the prediction of vaccine targets using bioinformatics screening of an entire genome of a pathogenic organism [17]. As part of VIOLIN, Vaxign is the first web-based vaccine design program that predicts vaccine targets based on reverse vaccinology [18,19]. The Vaxign computational pipeline includes the following features: subcellular localization, topology (transmembrane helices and beta barrel structure), adhesin probability, similarity to other pathogen sequences, similarity to host genome sequences (e.g., human or mouse), and MHC class I and II epitope predictions. To predict Brucella vaccine targets, all 10 sequenced Brucella genomes available in NCBI RefSeq were used for a Vaxign analysis. As with other intracellular pathogens, protection against Brucella infections requires cell-mediated immunity (CMI). Secreted pathogen proteins are likely to stimulate cytotoxic T lymphocyte (CTL) responses [20]. However, no Brucella protein has been found to be secreted in any in vitro culture in a standard culture medium. An O-sialoglycoprotein endopeptidase (Gcp; RefSeq: YP_415230.1) in B. abortus strain 2308 was identified by Vaxign to be a potential secreted protein. This protein is also conserved in the other virulent B. abortus, B. melitensis, and B. suis strains. Vaxign was used to predict Brucella outer membrane proteins (OMP) as potential vaccine targets using B. abortus strain 2308 genome [6] as the seed genome (Figure 3). Among 3034 proteins in this genome, 32 were identified as OMPs. These OMPs from B. abortus strain 2308 are listed in Table 2. Some specific groups such as cell wall/membrane/envelope biogenesis and cell motility were enriched based on the COG analysis [21]. Two proteins among the 32 OMPs contain more than one transmembrane spanning region each. These two proteins are excluded for further consideration since the presence of multiple transmembrane spanning regions may make the purification of such recombinant proteins difficult [22]. Adhesins present in microbial pathogens are essential for bacterial invasion and survival and represent possible targets for vaccine development. If only adhesins are considered, 10 out of the remaining 30 proteins have a probability < 0.51 for being an adhesin and hence were discarded. Fifteen out the remaining 20 proteins are conserved in the genomes from virulent B. abortus strain 9-941, B. melitensis strain 16M and ATCC 23457, and B. suis strains 1330 and ATCC 23445. Each of these strains is pathogenic to humans. One protein (BAB1_1944) has homology with human and mouse proteomes. Among these 14 predicted Brucella vaccine targets, Omp25 (YP_414164.1) and Omp31-1 (YP_414995.1) have been verified to be protective Brucella antigens [23,24]. The list of predicted targets also includes two flagellar hook proteins FlgE (YP_419225.1) and FlgK (YP_419224.1), one porin protein Omp2b (YP_414102.1), and two TonB-dependent receptor proteins BAB1_1367 and BAB2_1150. The roles of these potential proteins as protective Brucella antigens have not been studied. The flagellar protein FlgJ appears in B. abortus strains 2308 and 9-941, B. melitensis strain 16M, and B. suis strain ATCC 23445; however, FlgJ is absent from B. suis strain 1330 and B. microti strain CCM 4915. Brucella flagellar genes have recently been found important in Brucella survival in vivo [25]. It remains unknown whether these Brucella flagellar genes can be used for Brucella vaccine development.
Figure 3

Prediction of *, represents five genomes from virulent B. abortus strain 9-941, B. melitensis strains 16M and ATCC 23457, and B. suis strains 1330 and ATCC 23445. **, represents the genome from B. ovis strain ATCC 25840; The indicated two proteins are Omp2b (YP_414102.1) and Omp31-1 (YP_414995.1).

Prediction of *, represents five genomes from virulent B. abortus strain 9-941, B. melitensis strains 16M and ATCC 23457, and B. suis strains 1330 and ATCC 23445. **, represents the genome from B. ovis strain ATCC 25840; The indicated two proteins are Omp2b (YP_414102.1) and Omp31-1 (YP_414995.1). To develop a human Brucella vaccine, those Brucella proteins that exist in Brucella strains pathogenic to humans but are absent in Brucella strains that are non-pathogenic to humans would be ideal for vaccine development. Our studies have identified two proteins, Omp2b (YP_414102.1) and Omp31-1 (YP_414995.1), which are conserved in the above mentioned virulent B. abortus, B. melitensis, and B. suis strains that are pathogenic to humans, but absent from B. ovis that is non-pathogenic to humans. Omp2b and Omp31 are two major outer membrane proteins in B. abortus [26]. It is likely that these two proteins are critical for human-specific Brucella infections. If a human Brucella vaccine is developed, these two proteins are considered as priority antigens. A further bioinformatics analysis indicates that the porin protein Omp2b does not exist in live attenuated B. abortus vaccine strain 19, suggesting that Omp2b likely contributes to the attenuation of this mutant. Omp2b also exists in B. canis that is weakly pathogenic to humans. However, Omp31-1 does not exist in B. canis. Vaxign identified 46 Brucella periplasmic proteins that are conserved in all B. abortus, B. melitensis, and B. suis genomes and lack sequence similarity with proteins in human or mouse genomes. The values of these proteins for vaccine development also deserve further analysis. Using the same criteria (sequence conservation and dissimilarity from human or mouse proteins), Vaxign detected approximately 1,000 cytoplasmic proteins. It is impractical to individually test this high number of proteins for vaccine development. Considering only five cytoplasmic proteins have been experimentally confirmed to be protective antigens out of 1,000 conserved cytoplasmic proteins (Table 3), it is much less likely that cytoplasmic proteins serve as protective antigens compared to outer membrane and periplasmic proteins.
Table 3

Brucella protective antigens verified experimentally.

#SymbolLocus tagProtein DescriptionLocationReferences (PMIDs)
1BLSCAA86936Brucella lumazine synthaseCytoplasm11953389
2L7/L12BRURPL712XRibosomal protein L7/L12Cytoplasm8873388
3P39ABM67295sugar-binding 39-kDa proteinPeriplasm11447155
4BfrBAB2_0675Ferritin:BacterioferritinCytoplasm11447155
5Bp26BMEI0536Periplasmic immunogenic proteinPeriplasm17239499
6DnaKBruAb1_2100Molecular chaperone DnaKCytoplasm17686554
7IalBBMEI1584Invasion protein BCytoplasmic membrane17049676
8Omp16BAB1_1707Outer membrane protein MotYOuter membrane18981242
9Omp19BAB1_1930Lipoprotein Omp19Outer membrane18981242
10Omp25BMEI124925 kDa outer-membrane immunogenic protein precursorOuter membrane18981242
11Omp31BAB1_1639OmpA-like transmembrane domainOuter membrane17014873
12SodC BAB2_0535Cu/Zn superoxide dismutasePeriplasm15039330
13SurABAB1_0706Peptidyl-prolyl cis-trans isomerasePeriplasm17686554
14TigBMEI1069Trigger factorCytoplasm17239499
Brucella protective antigens verified experimentally. Vaxign also contains an epitope prediction component that can predict MHC class I and II binding epitopes [19]. The addition of epitope prediction allows further analysis for the existence of potential Brucella vaccine targets.

Other programs in VIOLIN

VIOLIN provides user-friendly web interface for users to query Brucella vaccine data in VIOLIN. For example, Vaxquery is a user-friendly web query tool to query vaccine data (Figure 4).
Figure 4

After typing the “live attenuated Brucella abortus vaccines” in the Vaxquery query bar (A), 11 Brucella vaccines were displayed (B). Two vaccines (RB51 and RB51SOD) were chosen for advanced comparison (B). After query settings were chosen (C), the results were displayed (E). All curated information has associated references which can be linked to PubMed (E).

After typing the “live attenuated Brucella abortus vaccines” in the Vaxquery query bar (A), 11 Brucella vaccines were displayed (B). Two vaccines (RB51 and RB51SOD) were chosen for advanced comparison (B). After query settings were chosen (C), the results were displayed (E). All curated information has associated references which can be linked to PubMed (E). VIOLIN VBLAST is a customized BLAST sequence similarity search program. The BLAST library in VBLAST includes those vaccine-associated genes, including protective antigens, virulent factors whose mutations lead to live attenuated vaccine development, and host protective immune factors. These vaccine-associated genes can also be queried through our Vaxgen web interface. Two VIOLIN programs Vaxjo and Vaxvec permit analysis of vaccine adjuvants and vaccine vectors. The adjuvants used for Brucella vaccine development include Complete and Incomplete Freund’s Adjuvants, CpG, Cholera toxin (CT) adjuvant, Maltose binding protein (MBP). Additionally, VIOLIN contains the information for host responses to Brucella vaccines. Animal response information can be searched through VIOLIN Vaxar (http://www.violinet.org/vaxar). Currently, annotated information for 68 host response studies of Brucella vaccines is available in Vaxar. VIOLIN contains many pages that are associated with other vaccine related topics, such as vaccine conferences, manufacturers, and useful web links.

Discussion

A large number of vaccine-related databases exist on the web. There are many government-supported vaccine databases. For example, the Centers for Disease Control and Prevention (CDC) maintain a Vaccine Information Statements (VISs) system (http://www.cdc.gov/vaccines/pubs/vis/default.htm). The Center for Biologics Evaluation and Research (CBER) under the Food and Drug Administration (FDA) regulates vaccine products and posts relevant information in their vaccine site: http://www.fda.gov/cber/vaccines.htm. There is also a Vaccine Adverse Event Reporting System (VAERS, http://vaers.hhs.gov/), co-sponsored by FDA and CDC in USA. Many agent-specific databases are also available, for example, the HIV vaccine resource (http://www3.niaid.nih.gov/research/topics/HIV/vaccines/default.htm) created by the National Institute of Allergy and Infectious Diseases (NIAID) at the National Institutes of Health (NIH). Other vaccine resources include, the Vaccine Page: http://www.vaccines.org/), the Vaccine Resource Library (PATH, http://www.path.org/vaccineresources/), and the Immunization Action Coalition (http://www.immunize.org/). These databases primarily focus on available information concerning existing licensed vaccines and vaccine regulation. VIOLIN is unique in that it stores and analyzes research data concerning commercial vaccines and vaccines under clinical trial or in early stages of development [8]. The development of the Vaccine Ontology (VO) is a community effort and involves many experts in the vaccine and biomedical ontology communities [27]. With the large number of vaccine data types and publications available, VO is developed as an efficient strategy for vaccine data standardization, retrieval, and integration. VO makes it possible for computer programs to understand various vaccine types and research data associated with different vaccines. VO will also help to ensure that data is annotated in a way that ensures comparability. Therefore, VO-based software programs can be developed to support high throughput vaccine data processing and analysis. We are currently developing a VO-based literature mining and curation program that would increase the efficacy of vaccine literature mining and manual curation. The VO-based literature mining program will also relieve the burden of continuous database updating. VO will also be used to integrate all vaccine data in VIOLIN, making vaccine information exchange more efficient. Compared to the traditional vaccine development approach that starts from a wet laboratory, reverse vaccinology begins with dry laboratory bioinformatics analysis, which makes the vaccine development more specific and efficient. Reverse vaccinology was first used by Rino Rappuoli in the development of a vaccine against serogroup B Neisseria meningitidis (MenB), the major cause of sepsis and meningitis in children and young adults [28]. Since then, this strategy has been applied to many other pathogens such as Bacillus anthracis [29], Streptococcus pneumoniae [30], and Mycobacterium tuberculosis [31]. While the criteria for vaccine prediction are known and many individual programs are available, it is still time consuming and requires expertise in these individual programs to predict vaccine targets using genome sequences. Vaxign is the first web-based automated pipeline that identifies potential vaccine targets based on the reserve vaccinology strategy [19]. Vaxign has been applied successfully to predict vaccine targets for uropathogenic E. coli [19]. This study demonstrated that Vaxign can predict novel Brucella vaccine targets. Experimental verification of many of these targets is currently under way. Vaxign also contains a program to predict immune epitopes that bind to MHC class I and II molecules in different animal species. Studies analyzing and ranking potential immune epitopes from predicted Brucella proteins are in progress. Promising epitopes will be tested in a wet laboratory. VIOLIN is also associated with other existing data resources. For example, many VIOLIN programs (e.g., Vaxign and Protegen) obtain Brucella genome sequences and share Brucella gene annotations with the web-based Pathogen-host Interaction Data Integration and Analysis System (PHIDIAS, http://www.phidias.us) [32]. PHIDIAS focuses on the analysis of pathogen-host interactions. Additionally, PHIDIAS contains the Brucella Bioinformatics Portal, a web-based portal with a special emphasis on Brucella genome annotation and literature mining [7]. PHIDIAS and BBP, also developed in our group, integrate more than 20 existing data resources. The close interaction between PHIDIAS/BBP and VIOLIN makes bioinformatics analysis of Brucella vaccines and vaccine targets more efficient. VIOLIN currently includes vaccine data for 73 pathogens. The VIOLIN methods described for Brucella vaccine analysis in this report are generic and also feasible for vaccine studies for other pathogens. It is noted that Brucella is one of the most annotated pathogens among these 73 pathogens listed in VIOLIN. The vaccine information for many pathogens is not systematically annotated to the extent of Brucella vaccines. More work and collaborations with the research experts in these pathogens are necessary to curate and analyze vaccines and vaccine candidates for these pathogens.

Conclusions

VIOLIN provides manually curated Brucella vaccine data and ontology representation of these vaccines using the Vaccine Ontology (VO). Many tools are developed in VIOLIN to support literature mining and data curation. Examples of data stored in the VIOLIN database include protective Brucella antigens and host responses induced by different Brucella vaccines. Brucella vaccine targets may be predicted using the VIOLIN Vaxign program. Various Brucella vaccine data can be queried using user-friendly web query programs in VIOLIN. The VIOLIN approach is generic and can be used for analyses of vaccines against other pathogens and infection diseases.

Methods

Literature mining of The information of all PubMed papers associated with Brucella vaccine and vaccination were downloaded from the PubMed web service. The literature contents were processed using VIOLIN literature mining pipelines [8]. The processed results are available for users to analyzed using individual VIOLIN literature mining programs. Bioinformatics curation of Brucella vaccine curation was performed on the VIOLIN web page using the Limix literature mining and curation system [7]. Limix allows data curators to submit data to the website, data reviewers to review and approve the submitted data, and eventual publication of high quality data. Specifically, a VIOLIN curator curates and compiles relevant information on vaccine information from peer-reviewed journals, books, and credible websites. The curated information is initially saved as a draft document and, when completed, is submitted to a MySQL database. The data submitted is initially invisible to the public and subject to critical review by an expert reviewer. Once approved, data becomes public and available for users to query. The database administrator manages users’ accounts and curation tasks. The VIOLIN database is routinely maintained by the database administrator. Published database content is periodically reviewed to ensure that new, pertinent information is captured. When new information is found, a curator and/or a domain expert will update the database content using the standard procedure described above. In addition, the VIOLIN team also periodically emails the authors of new vaccine research publications and encourages them to submit their data through the VIOLIN online submission system. VIOLIN also includes internally developed scripts to automatically update gene annotations based on updated records from existing databases (e.g., NCBI Gene database). VO representation of Manually curated Brucella vaccines are entered into VO by following the VO development standards [27]. The VO is edited by Protégé (http://protege.stanford.edu/). The FACT++ OWL reasoner [10] is used to obtain inferred Brucella vaccine hierarchy. Vaxign prediction of All ten Brucella genomes stored in the NCBI RefSeq database were used for prediction of Brucella vaccine targets. The genome of B. abortus strains 2308 was used as a seed genome. The other genomes include five sequenced virulent strains from three main pathogenic Brucella species: B. abortus strain 9-941), B. melitensis strains 16M and ATCC 23457, and B. suis strains 1330 and ATCC 23445. These strain are pathogenic to human. The genome of Brucella vaccine strain S19 was also included in this study for comparison purposes. The other three Brucella genomes are from B. ovis strain ATCC 25840, B. canis ATCC 23365, and B. microti strain CCM 4915. More Brucella genomes have been sequenced and available at http://www.broadinstitute.org/annotation/genome/brucella_group. Since the annotations are not yet finished and their records are not stored in the NCBI RefSeq database, these genomes were not typically used in this study. The Vaxign pipeline was executed by using the Brucella genomes as input data. The processed results were stored in the Vaxign database. The Vaxign web query interface was used to query and analyzed the predicted results. Query of All manually curated or computational processed data can be queried through various VIOLIN web pages. Selected query functions are described in detail in the body of this manuscript.

List of Abbreviations

COG: The Clusters of Orthologous Groups; GO: Gene Ontology; Limix: Literature Mining and Curation System; MeSH: Medical Subject Headings; NLM: National Library of Medicine; NCBI: National Center for Biotechnology Information; NCBO: National Center for Biomedical Ontology; OBO: Open Biomedical Ontologies; OWL: Web Ontology Language; SOD: Superoxide Dismutase; VIOLIN: Vaccine Investigation and Online Information Network; VO: Vaccine Ontology; W3C: World Wide Web Consortium.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

YH: Brucella vaccine data analysis, VIOLIN design and project manager, manuscript writing. ZX: Brucella vaccine data analysis, VIOLIN software developer and database administrator, manuscript editing.
  24 in total

Review 1.  Reverse vaccinology.

Authors:  R Rappuoli
Journal:  Curr Opin Microbiol       Date:  2000-10       Impact factor: 7.934

2.  The genome sequence of the facultative intracellular pathogen Brucella melitensis.

Authors:  Vito G DelVecchio; Vinayak Kapatral; Rajendra J Redkar; Guy Patra; Cesar Mujer; Tamara Los; Natalia Ivanova; Iain Anderson; Anamitra Bhattacharyya; Athanasios Lykidis; Gary Reznik; Lynn Jablonski; Niels Larsen; Mark D'Souza; Axel Bernal; Mikhail Mazur; Eugene Goltsman; Eugene Selkov; Philip H Elzer; Sue Hagius; David O'Callaghan; Jean-Jacques Letesson; Robert Haselkorn; Nikos Kyrpides; Ross Overbeek
Journal:  Proc Natl Acad Sci U S A       Date:  2001-12-26       Impact factor: 11.205

3.  Proteomic analysis of Brucella abortus cell envelope and identification of immunogenic candidate proteins for vaccine development.

Authors:  Joseph P Connolly; Diego Comerci; Timothy G Alefantis; Alexander Walz; Marian Quan; Ryan Chafin; Paul Grewal; Cesar V Mujer; Rodolfo A Ugalde; Vito G DelVecchio
Journal:  Proteomics       Date:  2006-07       Impact factor: 3.984

4.  The identification of two protective DNA vaccines from a panel of five plasmid constructs encoding Brucella melitensis 16M genes.

Authors:  Nicola J Commander; Stephen A Spencer; Brendan W Wren; Alastair P MacMillan
Journal:  Vaccine       Date:  2006-08-04       Impact factor: 3.641

5.  Search for potential vaccine candidate open reading frames in the Bacillus anthracis virulence plasmid pXO1: in silico and in vitro screening.

Authors:  N Ariel; A Zvi; H Grosfeld; O Gat; Y Inbar; B Velan; S Cohen; A Shafferman
Journal:  Infect Immun       Date:  2002-12       Impact factor: 3.441

6.  Modeling biomedical experimental processes with OBI.

Authors:  Ryan R Brinkman; Mélanie Courtot; Dirk Derom; Jennifer M Fostel; Yongqun He; Phillip Lord; James Malone; Helen Parkinson; Bjoern Peters; Philippe Rocca-Serra; Alan Ruttenberg; Susanna-Assunta Sansone; Larisa N Soldatova; Christian J Stoeckert; Jessica A Turner; Jie Zheng
Journal:  J Biomed Semantics       Date:  2010-06-22

7.  The Brucella suis genome reveals fundamental similarities between animal and plant pathogens and symbionts.

Authors:  Ian T Paulsen; Rekha Seshadri; Karen E Nelson; Jonathan A Eisen; John F Heidelberg; Timothy D Read; Robert J Dodson; Lowell Umayam; Lauren M Brinkac; Maureen J Beanan; Sean C Daugherty; Robert T Deboy; A Scott Durkin; James F Kolonay; Ramana Madupu; William C Nelson; Bola Ayodeji; Margaret Kraul; Jyoti Shetty; Joel Malek; Susan E Van Aken; Steven Riedmuller; Herve Tettelin; Steven R Gill; Owen White; Steven L Salzberg; David L Hoover; Luther E Lindler; Shirley M Halling; Stephen M Boyle; Claire M Fraser
Journal:  Proc Natl Acad Sci U S A       Date:  2002-09-23       Impact factor: 11.205

8.  Improved immunogenicity of a vaccination regimen combining a DNA vaccine encoding Brucella melitensis outer membrane protein 31 (Omp31) and recombinant Omp31 boosting.

Authors:  Juliana Cassataro; Carlos A Velikovsky; Laura Bruno; Silvia M Estein; Silvia de la Barrera; Raúl Bowden; Carlos A Fossati; Guillermo H Giambartolomei
Journal:  Clin Vaccine Immunol       Date:  2007-04-11

9.  Vaxign: the first web-based vaccine design program for reverse vaccinology and applications for vaccine development.

Authors:  Yongqun He; Zuoshuang Xiang; Harry L T Mobley
Journal:  J Biomed Biotechnol       Date:  2010-07-04

10.  BBP: Brucella genome annotation with literature mining and curation.

Authors:  Zuoshuang Xiang; Wenjie Zheng; Yongqun He
Journal:  BMC Bioinformatics       Date:  2006-07-16       Impact factor: 3.169

View more
  24 in total

Review 1.  Systems biology approaches to new vaccine development.

Authors:  Ann L Oberg; Richard B Kennedy; Peter Li; Inna G Ovsyannikova; Gregory A Poland
Journal:  Curr Opin Immunol       Date:  2011-05-11       Impact factor: 7.486

2.  A comprehensive proteogenomic study of the human Brucella vaccine strain 104 M.

Authors:  Xiaodong Zai; Qiaoling Yang; Kun Liu; Ruihua Li; Mengying Qian; Taoran Zhao; Yaohui Li; Ying Yin; Dayong Dong; Ling Fu; Shanhu Li; Junjie Xu; Wei Chen
Journal:  BMC Genomics       Date:  2017-05-23       Impact factor: 3.969

Review 3.  Pathogenesis and immunobiology of brucellosis: review of Brucella-host interactions.

Authors:  Paul de Figueiredo; Thomas A Ficht; Allison Rice-Ficht; Carlos A Rossetti; L Garry Adams
Journal:  Am J Pathol       Date:  2015-04-17       Impact factor: 4.307

4.  Immunogenicity of adenovirus and DNA vaccines co-expressing P39 and lumazine synthase proteins of Brucella abortus in BALB/c mice.

Authors:  Guo-Zhen Lin; Ju-Tian Yang; Suo-Cheng Wei; Shi-En Chen; Sheng-Dong Huo; Zhong-Ren Ma
Journal:  Trop Anim Health Prod       Date:  2018-02-28       Impact factor: 1.559

Review 5.  Ontology-supported research on vaccine efficacy, safety and integrative biological networks.

Authors:  Yongqun He
Journal:  Expert Rev Vaccines       Date:  2014-06-07       Impact factor: 5.217

6.  Knowledge engineering tools for reasoning with scientific observations and interpretations: a neural connectivity use case.

Authors:  Thomas A Russ; Cartic Ramakrishnan; Eduard H Hovy; Mihail Bota; Gully A P C Burns
Journal:  BMC Bioinformatics       Date:  2011-08-22       Impact factor: 3.307

7.  Screening of potential vaccine candidates against pathogenic Brucella spp. using compositive reverse vaccinology.

Authors:  Xiaodong Zai; Ying Yin; Fengyu Guo; Qiaoling Yang; Ruihua Li; Yaohui Li; Jun Zhang; Junjie Xu; Wei Chen
Journal:  Vet Res       Date:  2021-06-02       Impact factor: 3.683

8.  Meta-analysis of variables affecting mouse protection efficacy of whole organism Brucella vaccines and vaccine candidates.

Authors:  Thomas E Todd; Omar Tibi; Yu Lin; Samantha Sayers; Denise N Bronner; Zuoshuang Xiang; Yongqun He
Journal:  BMC Bioinformatics       Date:  2013-04-17       Impact factor: 3.169

9.  Genome-wide prediction of vaccine targets for human herpes simplex viruses using Vaxign reverse vaccinology.

Authors:  Zuoshuang Xiang; Yongqun He
Journal:  BMC Bioinformatics       Date:  2013-03-08       Impact factor: 3.169

Review 10.  Analyses of Brucella pathogenesis, host immunity, and vaccine targets using systems biology and bioinformatics.

Authors:  Yongqun He
Journal:  Front Cell Infect Microbiol       Date:  2012-02-01       Impact factor: 5.293

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.