Literature DB >> 32558561

Bayesian Active Learning for Optimization and Uncertainty Quantification in Protein Docking.

Yue Cao1, Yang Shen1,2.   

Abstract

Ab initio protein docking represents a major challenge for optimizing a noisy and costly "black box"-like function in a high-dimensional space. Despite progress in this field, there is a lack of rigorous uncertainty quantification (UQ). To fill the gap, we introduce a novel algorithm, Bayesian active learning (BAL), for optimization and UQ of such black-box functions with applications to flexible protein docking. BAL directly models the posterior distribution of the global optimum (i.e., native structures) with active sampling and posterior estimation iteratively feeding each other. Furthermore, it uses complex normal modes to span a homogeneous, Euclidean conformation space suitable for high-dimensional optimization and constructs funnel-like energy models for quality estimation of encounter complexes. Over a protein-docking benchmark set and a CAPRI set including homology docking, we establish that BAL significantly improves against starting points from rigid docking and refinements by particle swarm optimization, providing a top-3 near-native prediction for one third targets. Quality assessment empowered with UQ leads to tight quality intervals with half range around 25% of the actual interface root-mean-square deviation and confidence level at 85%. BAL's estimated probability of a prediction being near-native achieves binary classification AUROC at 0.93 and area under the precision recall curve over 0.60 (compared to 0.50 and 0.14, respectively, by chance), which also improves ranking predictions. This study represents the first UQ solution for protein docking, with rigorous theoretical frameworks and comprehensive empirical assessments.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32558561      PMCID: PMC7429362          DOI: 10.1021/acs.jctc.0c00476

Source DB:  PubMed          Journal:  J Chem Theory Comput        ISSN: 1549-9618            Impact factor:   6.006


  25 in total

Review 1.  Convergence and combination of methods in protein-protein docking.

Authors:  Sandor Vajda; Dima Kozakov
Journal:  Curr Opin Struct Biol       Date:  2009-03-25       Impact factor: 6.809

2.  Flexible protein docking refinement using pose-dependent normal mode analysis.

Authors:  Vishwesh Venkatraman; David W Ritchie
Journal:  Proteins       Date:  2012-06-18

3.  Improved flexible refinement of protein docking in CAPRI rounds 22-27.

Authors:  Yang Shen
Journal:  Proteins       Date:  2013-10-17

4.  Predicting protein conformational changes for unbound and homology docking: learning from intrinsic and induced flexibility.

Authors:  Haoran Chen; Yuanfei Sun; Yang Shen
Journal:  Proteins       Date:  2016-12-05

5.  Interactome3D: adding structural details to protein networks.

Authors:  Roberto Mosca; Arnaud Céol; Patrick Aloy
Journal:  Nat Methods       Date:  2012-12-16       Impact factor: 28.547

6.  Flexible Protein-Protein Docking with SwarmDock.

Authors:  Iain H Moal; Raphael A G Chaleil; Paul A Bates
Journal:  Methods Mol Biol       Date:  2018

7.  Efficient flexible backbone protein-protein docking for challenging targets.

Authors:  Nicholas A Marze; Shourya S Roy Burman; William Sheffler; Jeffrey J Gray
Journal:  Bioinformatics       Date:  2018-10-15       Impact factor: 6.937

8.  SwarmDock and the use of normal modes in protein-protein docking.

Authors:  Iain H Moal; Paul A Bates
Journal:  Int J Mol Sci       Date:  2010-09-28       Impact factor: 5.923

9.  Estimation of Uncertainties in the Global Distance Test (GDT_TS) for CASP Models.

Authors:  Wenlin Li; R Dustin Schaeffer; Zbyszek Otwinowski; Nick V Grishin
Journal:  PLoS One       Date:  2016-05-05       Impact factor: 3.240

10.  Refinement of protein-protein complexes in contact map space with metadynamics simulations.

Authors:  Erik Pfeiffenberger; Paul A Bates
Journal:  Proteins       Date:  2018-10-30
View more
  6 in total

1.  Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment.

Authors:  Marc F Lensink; Guillaume Brysbaert; Théo Mauri; Nurul Nadzirin; Sameer Velankar; Raphael A G Chaleil; Tereza Clarence; Paul A Bates; Ren Kong; Bin Liu; Guangbo Yang; Ming Liu; Hang Shi; Xufeng Lu; Shan Chang; Raj S Roy; Farhan Quadir; Jian Liu; Jianlin Cheng; Anna Antoniak; Cezary Czaplewski; Artur Giełdoń; Mateusz Kogut; Agnieszka G Lipska; Adam Liwo; Emilia A Lubecka; Martyna Maszota-Zieleniak; Adam K Sieradzan; Rafał Ślusarz; Patryk A Wesołowski; Karolina Zięba; Carlos A Del Carpio Muñoz; Eiichiro Ichiishi; Ameya Harmalkar; Jeffrey J Gray; Alexandre M J J Bonvin; Francesco Ambrosetti; Rodrigo Vargas Honorato; Zuzana Jandova; Brian Jiménez-García; Panagiotis I Koukos; Siri Van Keulen; Charlotte W Van Noort; Manon Réau; Jorge Roel-Touris; Sergei Kotelnikov; Dzmitry Padhorny; Kathryn A Porter; Andrey Alekseenko; Mikhail Ignatov; Israel Desta; Ryota Ashizawa; Zhuyezi Sun; Usman Ghani; Nasser Hashemi; Sandor Vajda; Dima Kozakov; Mireia Rosell; Luis A Rodríguez-Lumbreras; Juan Fernandez-Recio; Agnieszka Karczynska; Sergei Grudinin; Yumeng Yan; Hao Li; Peicong Lin; Sheng-You Huang; Charles Christoffer; Genki Terashi; Jacob Verburgt; Daipayan Sarkar; Tunde Aderinwale; Xiao Wang; Daisuke Kihara; Tsukasa Nakamura; Yuya Hanazono; Ragul Gowthaman; Johnathan D Guest; Rui Yin; Ghazaleh Taherzadeh; Brian G Pierce; Didier Barradas-Bautista; Zhen Cao; Luigi Cavallo; Romina Oliva; Yuanfei Sun; Shaowen Zhu; Yang Shen; Taeyong Park; Hyeonuk Woo; Jinsol Yang; Sohee Kwon; Jonghun Won; Chaok Seok; Yasuomi Kiyota; Shinpei Kobayashi; Yoshiki Harada; Mayuko Takeda-Shitaka; Petras J Kundrotas; Amar Singh; Ilya A Vakser; Justas Dapkūnas; Kliment Olechnovič; Česlovas Venclovas; Rui Duan; Liming Qiu; Xianjin Xu; Shuang Zhang; Xiaoqin Zou; Shoshana J Wodak
Journal:  Proteins       Date:  2021-09-13

2.  Machine learning-assisted elucidation of CD81-CD44 interactions in promoting cancer stemness and extracellular vesicle integrity.

Authors:  Tujin Shi; Yang Shen; Nurmaa K Dashzeveg; Huiping Liu; Erika K Ramos; Chia-Feng Tsai; Yuzhi Jia; Yue Cao; Megan Manu; Rokana Taftaf; Andrew D Hoffmann; Lamiaa El-Shennawy; Marina A Gritsenko; Valery Adorno-Cruz; Emma J Schuster; David Scholten; Dhwani Patel; Xia Liu; Priyam Patel; Brian Wray; Youbin Zhang; Shanshan Zhang; Ronald J Moore; Jeremy V Mathews; Matthew J Schipma; Tao Liu; Valerie L Tokars; Massimo Cristofanilli
Journal:  Elife       Date:  2022-10-04       Impact factor: 8.713

3.  ICAM1 initiates CTC cluster formation and trans-endothelial migration in lung metastasis of breast cancer.

Authors:  Rokana Taftaf; Xia Liu; Salendra Singh; Yuzhi Jia; Nurmaa K Dashzeveg; Andrew D Hoffmann; Lamiaa El-Shennawy; Erika K Ramos; Valery Adorno-Cruz; Emma J Schuster; David Scholten; Dhwani Patel; Youbin Zhang; Andrew A Davis; Carolina Reduzzi; Yue Cao; Paolo D'Amico; Yang Shen; Massimo Cristofanilli; William A Muller; Vinay Varadan; Huiping Liu
Journal:  Nat Commun       Date:  2021-08-11       Impact factor: 17.694

4.  Energy-based graph convolutional networks for scoring protein docking models.

Authors:  Yue Cao; Yang Shen
Journal:  Proteins       Date:  2020-03-16

Review 5.  Advances to tackle backbone flexibility in protein docking.

Authors:  Ameya Harmalkar; Jeffrey J Gray
Journal:  Curr Opin Struct Biol       Date:  2020-12-23       Impact factor: 7.786

6.  Learning to Optimize in Swarms.

Authors:  Yue Cao; Tianlong Chen; Zhangyang Wang; Yang Shen
Journal:  Adv Neural Inf Process Syst       Date:  2019-12
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.