Massimo Andreatta (1), Ole Lund, Morten Nielsen. (1) Center for Biological Sequence Analysis, Technical University of Denmark, DK-2800 Lyngby, Denmark. massimo@cbs.dtu.dk
Abstract
MOTIVATION: Proteins recognizing short peptide fragments play a central role in cellular signaling. As a result of high-throughput technologies, peptide-binding protein specificities can be studied using large peptide libraries at dramatically lower cost and time. Interpretation of such large peptide datasets, however, is a complex task, especially when the data contain multiple receptor binding motifs, and/or the motifs are found at different locations within distinct peptides.

RESULTS: The algorithm presented in this article, based on Gibbs sampling, identifies multiple specificities in peptide data by performing two essential tasks simultaneously: alignment and clustering of peptide data. We apply the method to de-convolute binding motifs in a panel of peptide datasets with different degrees of complexity spanning from the simplest case of pre-aligned fixed-length peptides to cases of unaligned peptide datasets of variable length. Example applications described in this article include mixtures of binders to different MHC class I and class II alleles, distinct classes of ligands for SH3 domains and sub-specificities of the HLA-A*02:01 molecule.

AVAILABILITY: The Gibbs clustering method is available online as a web server at http://www.cbs.dtu.dk/services/GibbsCluster.
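To make the clustering idea concrete, the following is a minimal sketch of Gibbs-sampling clustering for the simplest case mentioned above: pre-aligned peptides of fixed length. It is not the authors' implementation. The function names (`position_counts`, `log_odds`, `gibbs_cluster`), the uniform background, the add-one pseudocount and the fixed temperature are all assumptions chosen for illustration, and alignment of variable-length peptides is not attempted.

```python
import math
import random
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def position_counts(peptides, length):
    """Per-position amino acid counts for equal-length peptides."""
    counts = [Counter() for _ in range(length)]
    for pep in peptides:
        for pos, aa in enumerate(pep):
            counts[pos][aa] += 1
    return counts


def log_odds(peptide, counts, n_members, pseudocount=1.0):
    """Log-odds of a peptide under a cluster profile vs. a uniform background.

    The pseudocount and the uniform background are illustrative assumptions.
    """
    background = 1.0 / len(AMINO_ACIDS)
    denom = n_members + pseudocount * len(AMINO_ACIDS)
    score = 0.0
    for pos, aa in enumerate(peptide):
        freq = (counts[pos][aa] + pseudocount) / denom
        score += math.log(freq / background)
    return score


def gibbs_cluster(peptides, k, n_sweeps=200, temperature=1.0, seed=0):
    """Assign each peptide to one of k clusters by Gibbs sampling.

    Each peptide is removed in turn, scored against every cluster profile
    built from the remaining peptides, and reassigned with probability
    proportional to exp(score / temperature).
    """
    rng = random.Random(seed)
    length = len(peptides[0])
    labels = [rng.randrange(k) for _ in peptides]
    for _ in range(n_sweeps):
        for i, pep in enumerate(peptides):
            scores = []
            for c in range(k):
                members = [p for j, p in enumerate(peptides)
                           if labels[j] == c and j != i]
                counts = position_counts(members, length)
                scores.append(log_odds(pep, counts, len(members)) / temperature)
            top = max(scores)  # subtract the maximum for numerical stability
            weights = [math.exp(s - top) for s in scores]
            labels[i] = rng.choices(range(k), weights=weights)[0]
    return labels
```

Calling `gibbs_cluster(peptides, k=2)` on a list of equal-length peptide strings returns one cluster label per peptide. Because the procedure is stochastic, results depend on the seed and the number of sweeps, and the cluster numbering itself is arbitrary.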