Hao Zhang1, Ole Lund, Morten Nielsen. 1. Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby 2800, Denmark.
Abstract
MOTIVATION: Receptor-ligand interactions play an important role in controlling many biological systems. One prominent example is the binding of peptides to the major histocompatibility complex (MHC) molecules controlling the onset of cellular immune responses. Thousands of MHC allelic versions exist, making determination of the binding specificity for each variant experimentally infeasible. Here, we present a method that can extrapolate from variants with known binding specificity to those where no experimental data are available. RESULTS: For each position in the peptide ligand, we extracted the polymorphic pocket residues in MHC molecules that are in close proximity to the peptide residue. For MHC molecules with known specificities, we established a library of pocket-residues and corresponding binding specificities. The binding specificity for a novel MHC molecule is calculated as the average of the specificities of MHC molecules in this library weighted by the similarity of their pocket-residues to the query. This PickPocket method is demonstrated to accurately predict MHC-peptide binding for a broad range of MHC alleles, including human and non-human species. In contrast to neural network-based pan-specific methods, PickPocket was shown to be robust both when data is scarce and when the similarity to MHC molecules with characterized binding specificity is low. A consensus method combining the PickPocket and NetMHCpan methods was shown to achieve superior predictive performance. This study demonstrates how integration of diverse algorithmic approaches can lead to improved prediction. The method may also be used for making ligand-binding predictions for other types of receptors where many variants exist.
MOTIVATION: Receptor-ligand interactions play an important role in controlling many biological systems. One prominent example is the binding of peptides to the major histocompatibility complex (MHC) molecules controlling the onset of cellular immune responses. Thousands of MHC allelic versions exist, making determination of the binding specificity for each variant experimentally infeasible. Here, we present a method that can extrapolate from variants with known binding specificity to those where no experimental data are available. RESULTS: For each position in the peptide ligand, we extracted the polymorphic pocket residues in MHC molecules that are in close proximity to the peptide residue. For MHC molecules with known specificities, we established a library of pocket-residues and corresponding binding specificities. The binding specificity for a novel MHC molecule is calculated as the average of the specificities of MHC molecules in this library weighted by the similarity of their pocket-residues to the query. This PickPocket method is demonstrated to accurately predict MHC-peptide binding for a broad range of MHC alleles, including human and non-human species. In contrast to neural network-based pan-specific methods, PickPocket was shown to be robust both when data is scarce and when the similarity to MHC molecules with characterized binding specificity is low. A consensus method combining the PickPocket and NetMHCpan methods was shown to achieve superior predictive performance. This study demonstrates how integration of diverse algorithmic approaches can lead to improved prediction. The method may also be used for making ligand-binding predictions for other types of receptors where many variants exist.
Authors: T Sturniolo; E Bono; J Ding; L Raddrizzani; O Tuereci; U Sahin; M Braxenthaler; F Gallazzi; M P Protti; F Sinigaglia; J Hammer Journal: Nat Biotechnol Date: 1999-06 Impact factor: 54.908
Authors: Nebojsa Jojic; Manuel Reyes-Gomez; David Heckerman; Carl Kadie; Ora Schueler-Furman Journal: Bioinformatics Date: 2006-07-15 Impact factor: 6.937
Authors: Guang Lan Zhang; Asif M Khan; Kellathur N Srinivasan; J Thomas August; Vladimir Brusic Journal: Nucleic Acids Res Date: 2005-07-01 Impact factor: 16.971
Authors: Thomas Trolle; Imir G Metushi; Jason A Greenbaum; Yohan Kim; John Sidney; Ole Lund; Alessandro Sette; Bjoern Peters; Morten Nielsen Journal: Bioinformatics Date: 2015-02-25 Impact factor: 6.937
Authors: Dinler A Antunes; Gustavo F Vieira; Maurício M Rigo; Samuel P Cibulski; Marialva Sinigaglia; José A B Chies Journal: PLoS One Date: 2010-04-26 Impact factor: 3.240