Davide Sala1,2, Yuanpeng Janet Huang3,4, Casey A Cole5, David A Snyder6, Gaohua Liu3,7, Yojiro Ishida3,8, G V T Swapna3, Kelly P Brock9, Chris Sander10,11, Krzysztof Fidelis12, Andriy Kryshtafovych12, Masayori Inouye8, Roberto Tejero13, Homayoun Valafar5, Antonio Rosato1,2, Gaetano T Montelione3,4,8. 1. Magnetic Resonance Center, University of Florence, Sesto Fiorentino, Italy. 2. Department of Chemistry, University of Florence, Sesto Fiorentino, Italy. 3. Center for Advanced Biotechnology and Medicine, and Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, New Jersey. 4. Department of Chemistry and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, New York. 5. Department of Computer Science & Engineering, University of South Carolina, Columbia, South Carolina. 6. Department of Chemistry, College of Science and Health, William Paterson University, Wayne, New Jersey. 7. Nexomics Biosciences, Bordentown, New Jersey. 8. Department of Biochemistry and Molecular Biology, The Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, New Jersey. 9. Department of Systems Biology, Harvard Medical School, Boston, Massachusetts. 10. Department of Cell Biology, Harvard Medical School, Boston, Massachusetts. 11. cBio Center, Dana-Farber Cancer Institute, Boston, Massachusetts. 12. Genome Center, University of California, Davis, California. 13. Departamento de Quimica Fisica, Universidad de Valencia, Valencia, Spain.
Abstract
CASP13 has investigated the impact of sparse NMR data on the accuracy of protein structure prediction. NOESY and 15 N-1 H residual dipolar coupling data, typical of that obtained for 15 N,13 C-enriched, perdeuterated proteins up to about 40 kDa, were simulated for 11 CASP13 targets ranging in size from 80 to 326 residues. For several targets, two prediction groups generated models that are more accurate than those produced using baseline methods. Real NMR data collected for a de novo designed protein were also provided to predictors, including one data set in which only backbone resonance assignments were available. Some NMR-assisted prediction groups also did very well with these data. CASP13 also assessed whether incorporation of sparse NMR data improves the accuracy of protein structure prediction relative to nonassisted regular methods. In most cases, incorporation of sparse, noisy NMR data results in models with higher accuracy. The best NMR-assisted models were also compared with the best regular predictions of any CASP13 group for the same target. For six of 13 targets, the most accurate model provided by any NMR-assisted prediction group was more accurate than the most accurate model provided by any regular prediction group; however, for the remaining seven targets, one or more regular prediction method provided a more accurate model than even the best NMR-assisted model. These results suggest a novel approach for protein structure determination, in which advanced prediction methods are first used to generate structural models, and sparse NMR data is then used to validate and/or refine these models.
CASP13 has investigated the impact of sparse NMR data on the accuracy of protein structure prediction. n class="Chemical">NOESY and 15 N-1 H residual dipolar coupling data, typical of that obtained for 15 N,13 C-enriched, perdeuterated proteins up to about 40 kDa, were simulated for 11 CASP13 targets ranging in size from 80 to 326 residues. For several targets, two prediction groups generated models that are more accurate than those produced using baseline methods. Real NMR data collected for a de novo designed protein were also provided to predictors, including one data set in which only backbone resonance assignments were available. Some NMR-assisted prediction groups also did very well with these data. CASP13 also assessed whether incorporation of sparse NMR data improves the accuracy of protein structure prediction relative to nonassisted regular methods. In most cases, incorporation of sparse, noisy NMR data results in models with higher accuracy. The best NMR-assisted models were also compared with the best regular predictions of any CASP13 group for the same target. For six of 13 targets, the most accurate model provided by any NMR-assisted prediction group was more accurate than the most accurate model provided by any regular prediction group; however, for the remaining seven targets, one or more regular prediction method provided a more accurate model than even the best NMR-assisted model. These results suggest a novel approach for protein structure determination, in which advanced prediction methods are first used to generate structural models, and sparse NMR data is then used to validate and/or refine these models.
Authors: Antonio Rosato; Anurag Bagaria; David Baker; Benjamin Bardiaux; Andrea Cavalli; Jurgen F Doreleijers; Andrea Giachetti; Paul Guerry; Peter Güntert; Torsten Herrmann; Yuanpeng J Huang; Hendrik R A Jonker; Binchen Mao; Thérèse E Malliavin; Gaetano T Montelione; Michael Nilges; Srivatsan Raman; Gijs van der Schot; Wim F Vranken; Geerten W Vuister; Alexandre M J J Bonvin Journal: Nat Methods Date: 2009-09 Impact factor: 28.547
Authors: Debora S Marks; Lucy J Colwell; Robert Sheridan; Thomas A Hopf; Andrea Pagnani; Riccardo Zecchina; Chris Sander Journal: PLoS One Date: 2011-12-07 Impact factor: 3.240
Authors: Sergey Ovchinnikov; Lisa Kinch; Hahnbeom Park; Yuxing Liao; Jimin Pei; David E Kim; Hetunandan Kamisetty; Nick V Grishin; David Baker Journal: Elife Date: 2015-09-03 Impact factor: 8.140
Authors: Yuanpeng Janet Huang; Ning Zhang; Beate Bersch; Krzysztof Fidelis; Masayori Inouye; Yojiro Ishida; Andriy Kryshtafovych; Naohiro Kobayashi; Yutaka Kuroda; Gaohua Liu; Andy LiWang; G V T Swapna; Nan Wu; Toshio Yamazaki; Gaetano T Montelione Journal: Proteins Date: 2021-10-19
Authors: Pau Martin-Malpartida; Silvia Arrastia-Casado; Josep Farrera-Sinfreu; Rudolf Lucas; Hendrik Fischer; Bernhard Fischer; Douglas C Eaton; Susan Tzotzos; Maria J Macias Journal: Comput Struct Biotechnol J Date: 2022-04-27 Impact factor: 6.155