MOTIVATION: Discovery of novel protective antigens is fundamental to the development of vaccines for existing and emerging pathogens. Most computational methods for predicting protein antigenicity rely directly on homology with previously characterized protective antigens; however, homology-based methods will fail to discover truly novel protective antigens. Thus, there is a significant need for homology-free methods capable of screening entire proteomes for the antigens most likely to generate a protective humoral immune response. RESULTS: Here we begin by curating two types of positive data: (i) antigens that elicit a strong antibody response in protected individuals but not in unprotected individuals, using human immunoglobulin reactivity data obtained from protein microarray analyses; and (ii) known protective antigens from the literature. The resulting datasets are used to train a sequence-based prediction model, ANTIGENpro, to predict the likelihood that a protein is a protective antigen. ANTIGENpro correctly classifies 82% of the known protective antigens when trained using only the protein microarray datasets. The accuracy on the combined dataset is estimated at 76% by cross-validation experiments. Finally, ANTIGENpro performs well when evaluated on an external pathogen proteome for which protein microarray data were obtained after the initial development of ANTIGENpro. AVAILABILITY: ANTIGENpro is integrated in the SCRATCH suite of predictors available at http://scratch.proteomics.ics.uci.edu. CONTACT: pfbaldi@ics.uci.edu
MOTIVATION: Discovery of novel protective antigens is fundamental to the development of vaccines for existing and emerging pathogens. Most computational methods for predicting protein antigenicity rely directly on homology with previously characterized protective antigens; however, homology-based methods will fail to discover truly novel protective antigens. Thus, there is a significant need for homology-free methods capable of screening entire proteomes for the antigens most likely to generate a protective humoral immune response. RESULTS: Here we begin by curating two types of positive data: (i) antigens that elicit a strong antibody response in protected individuals but not in unprotected individuals, using human immunoglobulin reactivity data obtained from protein microarray analyses; and (ii) known protective antigens from the literature. The resulting datasets are used to train a sequence-based prediction model, ANTIGENpro, to predict the likelihood that a protein is a protective antigen. ANTIGENpro correctly classifies 82% of the known protective antigens when trained using only the protein microarray datasets. The accuracy on the combined dataset is estimated at 76% by cross-validation experiments. Finally, ANTIGENpro performs well when evaluated on an external pathogen proteome for which protein microarray data were obtained after the initial development of ANTIGENpro. AVAILABILITY: ANTIGENpro is integrated in the SCRATCH suite of predictors available at http://scratch.proteomics.ics.uci.edu. CONTACT: pfbaldi@ics.uci.edu
Authors: Alan G Barbour; Algimantas Jasinskas; Matthew A Kayala; D Huw Davies; Allen C Steere; Pierre Baldi; Philip L Felgner Journal: Infect Immun Date: 2008-05-12 Impact factor: 3.441
Authors: Santiago J Carmona; Morten Nielsen; Claus Schafer-Nielsen; Juan Mucci; Jaime Altcheh; Virginia Balouz; Valeria Tekiel; Alberto C Frasch; Oscar Campetella; Carlos A Buscaglia; Fernán Agüero Journal: Mol Cell Proteomics Date: 2015-04-28 Impact factor: 5.911
Authors: Emmanuel Cornillot; Amina Dassouli; Niseema Pachikara; Lauren Lawres; Isaline Renard; Celia Francois; Sylvie Randazzo; Virginie Brès; Aprajita Garg; Janna Brancato; Joseph E Pazzi; Jozelyn Pablo; Chris Hung; Andy Teng; Adam D Shandling; Vu T Huynh; Peter J Krause; Timothy Lepore; Stephane Delbecq; Gary Hermanson; Xiaowu Liang; Scott Williams; Douglas M Molina; Choukri Ben Mamoun Journal: Transfusion Date: 2016-05-17 Impact factor: 3.157