S D Dunn1, L M Wahl, G B Gloor. 1. Department of Biochemistry, University of Western Ontario, London, Ontario, Canada, N6A 5C1.
Abstract
MOTIVATION: Compensating alterations during the evolution of protein families give rise to coevolving positions that contain important structural and functional information. However, a high background composed of random noise and phylogenetic components interferes with the identification of coevolving positions. RESULTS: We have developed a rapid, simple and general method based on information theory that accurately estimates the level of background mutual information for each pair of positions in a given protein family. Removal of this background results in a metric, MIp, that correctly identifies substantially more coevolving positions in protein families than any existing method. A significant fraction of these positions coevolve strongly with one or only a few positions. The vast majority of such position pairs are in contact in representative structures. The identification of strongly coevolving position pairs can be used to impose significant structural limitations and should be an important additional constraint for ab initio protein folding. AVAILABILITY: Alignments and program files can be found in the Supplementary Information.
MOTIVATION: Compensating alterations during the evolution of protein families give rise to coevolving positions that contain important structural and functional information. However, a high background composed of random noise and phylogenetic components interferes with the identification of coevolving positions. RESULTS: We have developed a rapid, simple and general method based on information theory that accurately estimates the level of background mutual information for each pair of positions in a given protein family. Removal of this background results in a metric, MIp, that correctly identifies substantially more coevolving positions in protein families than any existing method. A significant fraction of these positions coevolve strongly with one or only a few positions. The vast majority of such position pairs are in contact in representative structures. The identification of strongly coevolving position pairs can be used to impose significant structural limitations and should be an important additional constraint for ab initio protein folding. AVAILABILITY: Alignments and program files can be found in the Supplementary Information.
Authors: Benjamin Schubert; Rohan Maddamsetti; Jackson Nyman; Maha R Farhat; Debora S Marks Journal: Nat Microbiol Date: 2018-12-03 Impact factor: 17.745
Authors: Faruck Morcos; Andrea Pagnani; Bryan Lunt; Arianna Bertolino; Debora S Marks; Chris Sander; Riccardo Zecchina; José N Onuchic; Terence Hwa; Martin Weigt Journal: Proc Natl Acad Sci U S A Date: 2011-11-21 Impact factor: 11.205
Authors: Siddharth S Dey; Yuhua Xue; Marcin P Joachimiak; Gregory D Friedland; John C Burnett; Qiang Zhou; Adam P Arkin; David V Schaffer Journal: J Biol Chem Date: 2012-01-17 Impact factor: 5.157
Authors: Mary Hongying Cheng; Luca Ponzoni; Tatiana Sorkina; Ji Young Lee; She Zhang; Alexander Sorkin; Ivet Bahar Journal: Neuropharmacology Date: 2019-06-20 Impact factor: 5.250