Literature DB >> 14629874

Selecting informative data for developing peptide-MHC binding predictors using a query by committee approach.

Jens Kaae Christensen1, Kasper Lamberth, Morten Nielsen, Claus Lundegaard, Peder Worning, Sanne Lise Lauemøller, Søren Buus, Søren Brunak, Ole Lund.   

Abstract

Strategies for selecting informative data points for training prediction algorithms are important, particularly when data points are difficult and costly to obtain. A Query by Committee (QBC) training strategy for selecting new data points uses the disagreement between a committee of different algorithms to suggest new data points, which most rationally complement existing data, that is, they are the most informative data points. In order to evaluate this QBC approach on a real-world problem, we compared strategies for selecting new data points. We trained neural network algorithms to obtain methods to predict the binding affinity of peptides binding to the MHC class I molecule, HLA-A2. We show that the QBC strategy leads to a higher performance than a baseline strategy where new data points are selected at random from a pool of available data. Most peptides bind HLA-A2 with a low affinity, and as expected using a strategy of selecting peptides that are predicted to have high binding affinities also lead to more accurate predictors than the base line strategy. The QBC value is shown to correlate with the measured binding affinity. This demonstrates that the different predictors can easily learn if a peptide will fail to bind, but often conflict in predicting if a peptide binds. Using a carefully constructed computational setup, we demonstrate that selecting peptides with a high QBC performs better than low QBC peptides independently from binding affinity. When predictors are trained on a very limited set of data they cannot be expected to disagree in a meaningful way and we find a data limit below which the QBC strategy fails. Finally, it should be noted that data selection strategies similar to those used here might be of use in other settings in which generation of more data is a costly process.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 14629874     DOI: 10.1162/089976603322518803

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  8 in total

1.  Conference report--adjuvants and delivery: improving on vaccine immunogenicity highlights from the viral vaccine meeting; October 25-28, 2003; Barcelona, Spain.

Authors:  Elena Armandola
Journal:  MedGenMed       Date:  2004-01-26

2.  The peptide-binding specificity of HLA-A*3001 demonstrates membership of the HLA-A3 supertype.

Authors:  Kasper Lamberth; Gustav Røder; Mikkel Harndahl; Morten Nielsen; Claus Lundegaard; Claus Schafer-Nielsen; Ole Lund; Soren Buus
Journal:  Immunogenetics       Date:  2008-09-04       Impact factor: 2.846

3.  Prediction of supertype-specific HLA class I binding peptides using support vector machines.

Authors:  Guang Lan Zhang; Ivana Bozic; Chee Keong Kwoh; J Thomas August; Vladimir Brusic
Journal:  J Immunol Methods       Date:  2007-01-25       Impact factor: 2.303

4.  Porcine major histocompatibility complex (MHC) class I molecules and analysis of their peptide-binding specificities.

Authors:  Lasse Eggers Pedersen; Mikkel Harndahl; Michael Rasmussen; Kasper Lamberth; William T Golde; Ole Lund; Morten Nielsen; Soren Buus
Journal:  Immunogenetics       Date:  2011-07-08       Impact factor: 2.846

5.  NNAlign: a web-based prediction method allowing non-expert end-user discovery of sequence motifs in quantitative peptide data.

Authors:  Massimo Andreatta; Claus Schafer-Nielsen; Ole Lund; Søren Buus; Morten Nielsen
Journal:  PLoS One       Date:  2011-11-02       Impact factor: 3.240

Review 6.  From functional genomics to functional immunomics: new challenges, old problems, big rewards.

Authors:  Ulisses M Braga-Neto; Ernesto T A Marques
Journal:  PLoS Comput Biol       Date:  2006-07-28       Impact factor: 4.475

7.  Quantitative predictions of peptide binding to any HLA-DR molecule of known sequence: NetMHCIIpan.

Authors:  Morten Nielsen; Claus Lundegaard; Thomas Blicher; Bjoern Peters; Alessandro Sette; Sune Justesen; Søren Buus; Ole Lund
Journal:  PLoS Comput Biol       Date:  2008-07-04       Impact factor: 4.475

8.  Subtractive Proteomics and Immuno-informatics Approaches for Multi-peptide Vaccine Prediction Against Klebsiella oxytoca and Validation Through In Silico Expression.

Authors:  Qudsia Yousafi; Humaira Amin; Shabana Bibi; Rafea Rafi; Muhammad S Khan; Hamza Ali; Ashir Masroor
Journal:  Int J Pept Res Ther       Date:  2021-09-20       Impact factor: 1.931

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.