Timothy Luciani1, John Wenskovitch2, Koonwah Chen3, David Koes4, Timothy Travers4, G Elisabeta Marai5. 1. Department of Computer Science, Brown University, Box 1910, 02912 Providence, RI, US. 2. Department of Computer Science, University of Pittsburgh, 210 South Bouquet, 15260 Pittsburgh, PA, US. 3. School of Information Science, University of Pittsburgh, 135 North Bellefield Avenue, 15260 Pittsburgh, PA, US. 4. Department of Computational and Systems Biology, University of Pittsburgh, 3501 Fifth Avenue, 15260 Pittsburgh, PA, US. 5. Department of Computer Science, University of Illinois at Chicago, 851 S. Morgan St., 60607 Chicago, IL, US.
Abstract
BACKGROUND: Knowledge of the 3D structure and functionality of proteins can lead to insight into the associated cellular processes, speed up the creation of pharmaceutical products, and develop drugs that are more effective in combating disease. METHODS: We present the design and implementation of a visual mining and analysis tool to help identify protein mutations across a family of structural models and to help discover the effect of these mutations on protein function. We integrate 3D structure and sequence information in a common visual interface; multiple linked views and a computational backbone allow comparison at the molecular and atomic levels, while a novel trend-image visual abstraction allows for the sorting and mining of large collections of sequences and of their residues. RESULTS: We evaluate our approach on the triosephosphate isomerase (TIM) family structural models and sequence data and show that our tool provides an effective, scalable way to navigate a family of proteins, as well as a means to inspect the structure and sequence of individual proteins. CONCLUSIONS: The TIM application shows that our tool can assist in the navigation of families of proteins, as well as in the exploration of individual protein structures. In conjunction with domain expert knowledge, this interactive tool can help provide biophysical insight into why specific mutations affect function and potentially suggest additional modifications to the protein that could be used to rescue functionality.
BACKGROUND: Knowledge of the 3D structure and functionality of proteins can lead to insight into the associated cellular processes, speed up the creation of pharmaceutical products, and develop drugs that are more effective in combating disease. METHODS: We present the design and implementation of a visual mining and analysis tool to help identify protein mutations across a family of structural models and to help discover the effect of these mutations on protein function. We integrate 3D structure and sequence information in a common visual interface; multiple linked views and a computational backbone allow comparison at the molecular and atomic levels, while a novel trend-image visual abstraction allows for the sorting and mining of large collections of sequences and of their residues. RESULTS: We evaluate our approach on the triosephosphate isomerase (TIM) family structural models and sequence data and show that our tool provides an effective, scalable way to navigate a family of proteins, as well as a means to inspect the structure and sequence of individual proteins. CONCLUSIONS: The TIM application shows that our tool can assist in the navigation of families of proteins, as well as in the exploration of individual protein structures. In conjunction with domain expert knowledge, this interactive tool can help provide biophysical insight into why specific mutations affect function and potentially suggest additional modifications to the protein that could be used to rescue functionality.
Authors: H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne Journal: Nucleic Acids Res Date: 2000-01-01 Impact factor: 16.971
Authors: W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler Journal: Genome Res Date: 2002-06 Impact factor: 9.043
Authors: Ursula Pieper; Narayanan Eswar; Hannes Braberg; M S Madhusudhan; Fred P Davis; Ashley C Stuart; Nebojsa Mirkovic; Andrea Rossi; Marc A Marti-Renom; Andras Fiser; Ben Webb; Daniel Greenblatt; Conrad C Huang; Thomas E Ferrin; Andrej Sali Journal: Nucleic Acids Res Date: 2004-01-01 Impact factor: 16.971
Authors: Stephen B Montgomery; Tamara Astakhova; Mikhail Bilenky; Ewan Birney; Tony Fu; Maik Hassel; Craig Melsopp; Marcin Rak; A Gordon Robertson; Monica Sleumer; Asim S Siddiqui; Steven J M Jones Journal: Genome Res Date: 2004-05 Impact factor: 9.043
Authors: Christiam Camacho; George Coulouris; Vahram Avagyan; Ning Ma; Jason Papadopoulos; Kevin Bealer; Thomas L Madden Journal: BMC Bioinformatics Date: 2009-12-15 Impact factor: 3.169
Authors: Andrew M Waterhouse; James B Procter; David M A Martin; Michèle Clamp; Geoffrey J Barton Journal: Bioinformatics Date: 2009-01-16 Impact factor: 6.937
Authors: Nansheng Chen; Todd W Harris; Igor Antoshechkin; Carol Bastiani; Tamberlyn Bieri; Darin Blasiar; Keith Bradnam; Payan Canaran; Juancarlos Chan; Chao-Kung Chen; Wen J Chen; Fiona Cunningham; Paul Davis; Eimear Kenny; Ranjana Kishore; Daniel Lawson; Raymond Lee; Hans-Michael Muller; Cecilia Nakamura; Shraddha Pai; Philip Ozersky; Andrei Petcherski; Anthony Rogers; Aniko Sabo; Erich M Schwarz; Kimberly Van Auken; Qinghua Wang; Richard Durbin; John Spieth; Paul W Sternberg; Lincoln D Stein Journal: Nucleic Acids Res Date: 2005-01-01 Impact factor: 16.971
Authors: William C Ray; R Wolfgang Rumpf; Brandon Sullivan; Nicholas Callahan; Thomas Magliery; Raghu Machiraju; Bang Wong; Martin Krzywinski; Christopher W Bartlett Journal: BMC Proc Date: 2014-08-28