| Literature DB >> 11473014 |
G Pollastri1, P Baldi, P Fariselli, R Casadio.
Abstract
Knowing the number of residue contacts in a protein is crucial for deriving constraints useful in modeling protein folding, protein structure, and/or scoring remote homology searches. Here we use an ensemble of bi-directional recurrent neural network architectures and evolutionary information to improve the state-of-the-art in contact prediction using a large corpus of curated data. The ensemble is used to discriminate between two different states of residue contacts, characterized by a contact number higher or lower than the average value of the residue distribution. The ensemble achieves performances ranging from 70.1% to 73.1% depending on the radius adopted to discriminate contacts (6Ato 12A). These performances represent gains of 15% to 20% over the base line statistical predictors always assigning an aminoacid to the most numerous state, 3% to 7% better than any previous method. Combination of different radius predictors further improves the performance. SERVER: http://promoter.ics.uci.edu/BRNN-PRED/.Entities:
Mesh:
Substances:
Year: 2001 PMID: 11473014 DOI: 10.1093/bioinformatics/17.suppl_1.s234
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937