Literature DB >> 35656889

Sparse group selection and analysis of function-related residue for protein-state recognition.

Fangyun Bai1, Kin Ming Puk2, Jin Liu3, Hongyu Zhou4, Peng Tao4, Wenyong Zhou1, Shouyi Wang2.   

Abstract

Machine learning methods have helped to advance wide range of scientific and technological field in recent years, including computational chemistry. As the chemical systems could become complex with high dimension, feature selection could be critical but challenging to develop reliable machine learning based prediction models, especially for proteins as bio-macromolecules. In this study, we applied sparse group lasso (SGL) method as a general feature selection method to develop classification model for an allosteric protein in different functional states. This results into a much improved model with comparable accuracy (Acc) and only 28 selected features comparing to 289 selected features from a previous study. The Acc achieves 91.50% with 1936 selected feature, which is far higher than that of baseline methods. In addition, grouping protein amino acids into secondary structures provides additional interpretability of the selected features. The selected features are verified as associated with key allosteric residues through comparison with both experimental and computational works about the model protein, and demonstrate the effectiveness and necessity of applying rigorous feature selection and evaluation methods on complex chemical systems.
© 2022 Wiley Periodicals LLC.

Entities:  

Keywords:  classification; feature selection; function-related residues; protein states; sparse group lasso

Mesh:

Substances:

Year:  2022        PMID: 35656889      PMCID: PMC9248267          DOI: 10.1002/jcc.26937

Source DB:  PubMed          Journal:  J Comput Chem        ISSN: 0192-8651            Impact factor:   3.672


  53 in total

Review 1.  Is allostery an intrinsic property of all dynamic proteins?

Authors:  K Gunasekaran; Buyong Ma; Ruth Nussinov
Journal:  Proteins       Date:  2004-11-15

2.  Correlating allostery with rigidity.

Authors:  A J Rader; Stephen M Brown
Journal:  Mol Biosyst       Date:  2010-11-08

3.  A Selective Review of Group Selection in High-Dimensional Models.

Authors:  Jian Huang; Patrick Breheny; Shuangge Ma
Journal:  Stat Sci       Date:  2012       Impact factor: 2.901

4.  Identifying key residues for protein allostery through rigid residue scan.

Authors:  Robert Kalescky; Jin Liu; Peng Tao
Journal:  J Phys Chem A       Date:  2014-12-11       Impact factor: 2.781

5.  Allosteric sites can be identified based on the residue-residue interaction energy difference.

Authors:  Xiaomin Ma; Yifei Qi; Luhua Lai
Journal:  Proteins       Date:  2015-08

6.  Combining protein sequence, structure, and dynamics: A novel approach for functional evolution analysis of PAS domain superfamily.

Authors:  Zheng Dong; Hongyu Zhou; Peng Tao
Journal:  Protein Sci       Date:  2017-11-02       Impact factor: 6.725

7.  Computation of conformational coupling in allosteric proteins.

Authors:  Brian A Kidd; David Baker; Wendy E Thomas
Journal:  PLoS Comput Biol       Date:  2009-08-28       Impact factor: 4.475

8.  t-Distributed Stochastic Neighbor Embedding Method with the Least Information Loss for Macromolecular Simulations.

Authors:  Hongyu Zhou; Feng Wang; Peng Tao
Journal:  J Chem Theory Comput       Date:  2018-10-09       Impact factor: 6.006

Review 9.  Principles of allosteric interactions in cell signaling.

Authors:  Ruth Nussinov; Chung-Jung Tsai; Jin Liu
Journal:  J Am Chem Soc       Date:  2014-12-15       Impact factor: 15.419

10.  Application of wavelet transform for PDZ domain classification.

Authors:  Khaled Daqrouq; Rami Alhmouz; Ahmed Balamesh; Adnan Memic
Journal:  PLoS One       Date:  2015-04-10       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.