MOTIVATION: The prediction of a protein's contact map has become in recent years, a crucial stepping stone for the prediction of the complete 3D structure of a protein. In this article, we describe a methodology for this problem that was shown to be successful in CASP8 and CASP9. The methodology is based on (i) the fusion of the prediction of a variety of structural aspects of protein residues, (ii) an ensemble strategy used to facilitate the training process and (iii) a rule-based machine learning system from which we can extract human-readable explanations of the predictor and derive useful information about the contact map representation. RESULTS: The main part of the evaluation is the comparison against the sequence-based contact prediction methods from CASP9, where our method presented the best rank in five out of the six evaluated metrics. We also assess the impact of the size of the ensemble used in our predictor to show the trade-off between performance and training time of our method. Finally, we also study the rule sets generated by our machine learning system. From this analysis, we are able to estimate the contribution of the attributes in our representation and how these interact to derive contact predictions. AVAILABILITY: http://icos.cs.nott.ac.uk/servers/psp.html. CONTACT: natalio.krasnogor@nottingham.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: The prediction of a protein's contact map has become in recent years, a crucial stepping stone for the prediction of the complete 3D structure of a protein. In this article, we describe a methodology for this problem that was shown to be successful in CASP8 and CASP9. The methodology is based on (i) the fusion of the prediction of a variety of structural aspects of protein residues, (ii) an ensemble strategy used to facilitate the training process and (iii) a rule-based machine learning system from which we can extract human-readable explanations of the predictor and derive useful information about the contact map representation. RESULTS: The main part of the evaluation is the comparison against the sequence-based contact prediction methods from CASP9, where our method presented the best rank in five out of the six evaluated metrics. We also assess the impact of the size of the ensemble used in our predictor to show the trade-off between performance and training time of our method. Finally, we also study the rule sets generated by our machine learning system. From this analysis, we are able to estimate the contribution of the attributes in our representation and how these interact to derive contact predictions. AVAILABILITY: http://icos.cs.nott.ac.uk/servers/psp.html. CONTACT: natalio.krasnogor@nottingham.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Anna L Swan; Dov J Stekel; Charlie Hodgman; David Allaway; Mohammed H Alqahtani; Ali Mobasheri; Jaume Bacardit Journal: BMC Genomics Date: 2015-01-15 Impact factor: 3.969
Authors: Chen Keasar; Liam J McGuffin; Björn Wallner; Gaurav Chopra; Badri Adhikari; Debswapna Bhattacharya; Lauren Blake; Leandro Oliveira Bortot; Renzhi Cao; B K Dhanasekaran; Itzhel Dimas; Rodrigo Antonio Faccioli; Eshel Faraggi; Robert Ganzynkowicz; Sambit Ghosh; Soma Ghosh; Artur Giełdoń; Lukasz Golon; Yi He; Lim Heo; Jie Hou; Main Khan; Firas Khatib; George A Khoury; Chris Kieslich; David E Kim; Pawel Krupa; Gyu Rie Lee; Hongbo Li; Jilong Li; Agnieszka Lipska; Adam Liwo; Ali Hassan A Maghrabi; Milot Mirdita; Shokoufeh Mirzaei; Magdalena A Mozolewska; Melis Onel; Sergey Ovchinnikov; Anand Shah; Utkarsh Shah; Tomer Sidi; Adam K Sieradzan; Magdalena Ślusarz; Rafal Ślusarz; James Smadbeck; Phanourios Tamamis; Nicholas Trieber; Tomasz Wirecki; Yanping Yin; Yang Zhang; Jaume Bacardit; Maciej Baranowski; Nicholas Chapman; Seth Cooper; Alexandre Defelicibus; Jeff Flatten; Brian Koepnick; Zoran Popović; Bartlomiej Zaborowski; David Baker; Jianlin Cheng; Cezary Czaplewski; Alexandre Cláudio Botazzo Delbem; Christodoulos Floudas; Andrzej Kloczkowski; Stanislaw Ołdziej; Michael Levitt; Harold Scheraga; Chaok Seok; Johannes Söding; Saraswathi Vishveshwara; Dong Xu; Silvia N Crivelli Journal: Sci Rep Date: 2018-07-02 Impact factor: 4.379