Literature DB >> 33017626

iAMY-SCM: Improved prediction and analysis of amyloid proteins using a scoring card method with propensity scores of dipeptides.

Phasit Charoenkwan1, Sakawrat Kanthawong2, Chanin Nantasenamat3, Md Mehedi Hasan4, Watshara Shoombuatong5.   

Abstract

Fast, accurate identification and characterization of amyloid proteins at a large-scale is essential for understating their role in therapeutic intervention strategies. As a matter of fact, there exist only one in silico model for amyloid protein identification using the random forest (RF) model in conjunction with various feature types namely the RFAmy. However, it suffers from low interpretability for biologists. Thus, it is highly desirable to develop a simple and easily interpretable prediction method with robust accuracy as compared to the existing complicated model. In this study, we propose iAMY-SCM, the first scoring card method-based predictor for predicting and analyzing amyloid proteins. Herein, the iAMY-SCM made use of a simple weighted-sum function in conjunction with the propensity scores of dipeptides for the amyloid protein identification. Cross-validation results indicated that iAMY-SCM provided an accuracy of 0.895 that corresponded to 10-22% higher performance than that of widely used machine learning models. Furthermore, iAMY-SCM achieving an accuracy of 0.827 as evaluated by an independent test, which was found to be comparable to that of RFAmy and was approximately 9-13% higher than widely used machine learning models. Furthermore, the analysis of estimated propensity scores of amino acids and dipeptides were performed to provide insights into the biophysical and biochemical properties of amyloid proteins. As such, this demonstrates that the proposed iAMY-SCM is efficient and reliable in terms of simplicity, interpretability and implementation. To facilitate ease of use of the proposed iAMY-SCM, a user-friendly and publicly accessible web server at http://camt.pythonanywhere.com/iAMY-SCM has been established. We anticipate that that iAMY-SCM will be an important tool for facilitating the large-scale prediction and characterization of amyloid protein.
Copyright © 2020 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Amyloid protein; Classification; Machine learning; Propensity score; Protein function; Scoring card method

Mesh:

Substances:

Year:  2020        PMID: 33017626     DOI: 10.1016/j.ygeno.2020.09.065

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  11 in total

1.  StackHCV: a web-based integrative machine-learning framework for large-scale identification of hepatitis C virus NS5B inhibitors.

Authors:  Aijaz Ahmad Malik; Warot Chotpatiwetchkul; Chuleeporn Phanus-Umporn; Chanin Nantasenamat; Phasit Charoenkwan; Watshara Shoombuatong
Journal:  J Comput Aided Mol Des       Date:  2021-10-08       Impact factor: 3.686

Review 2.  Protein Design: From the Aspect of Water Solubility and Stability.

Authors:  Rui Qing; Shilei Hao; Eva Smorodina; David Jin; Arthur Zalevsky; Shuguang Zhang
Journal:  Chem Rev       Date:  2022-08-03       Impact factor: 72.087

3.  SCMRSA: a New Approach for Identifying and Analyzing Anti-MRSA Peptides Using Estimated Propensity Scores of Dipeptides.

Authors:  Phasit Charoenkwan; Sakawrat Kanthawong; Nalini Schaduangrat; Pietro Li'; Mohammad Ali Moni; Watshara Shoombuatong
Journal:  ACS Omega       Date:  2022-09-01

4.  AMYPred-FRL is a novel approach for accurate prediction of amyloid proteins by using feature representation learning.

Authors:  Phasit Charoenkwan; Saeed Ahmed; Chanin Nantasenamat; Julian M W Quinn; Mohammad Ali Moni; Pietro Lio'; Watshara Shoombuatong
Journal:  Sci Rep       Date:  2022-05-11       Impact factor: 4.996

5.  PUP-Fuse: Prediction of Protein Pupylation Sites by Integrating Multiple Sequence Representations.

Authors:  Firda Nurul Auliah; Andi Nur Nilamyani; Watshara Shoombuatong; Md Ashad Alam; Md Mehedi Hasan; Hiroyuki Kurata
Journal:  Int J Mol Sci       Date:  2021-02-20       Impact factor: 5.923

6.  A novel sequence-based predictor for identifying and characterizing thermophilic proteins using estimated propensity scores of dipeptides.

Authors:  Phasit Charoenkwan; Warot Chotpatiwetchkul; Vannajan Sanghiran Lee; Chanin Nantasenamat; Watshara Shoombuatong
Journal:  Sci Rep       Date:  2021-12-10       Impact factor: 4.379

Review 7.  Large-scale comparative review and assessment of computational methods for phage virion proteins identification.

Authors:  Muhammad Kabir; Chanin Nantasenamat; Sakawrat Kanthawong; Phasit Charoenkwan; Watshara Shoombuatong
Journal:  EXCLI J       Date:  2022-01-03       Impact factor: 4.068

8.  IRC-Fuse: improved and robust prediction of redox-sensitive cysteine by fusing of multiple feature representations.

Authors:  Md Mehedi Hasan; Md Ashad Alam; Watshara Shoombuatong; Hiroyuki Kurata
Journal:  J Comput Aided Mol Des       Date:  2021-01-04       Impact factor: 3.686

9.  PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features.

Authors:  Andi Nur Nilamyani; Firda Nurul Auliah; Mohammad Ali Moni; Watshara Shoombuatong; Md Mehedi Hasan; Hiroyuki Kurata
Journal:  Int J Mol Sci       Date:  2021-03-08       Impact factor: 5.923

10.  SCMTHP: A New Approach for Identifying and Characterizing of Tumor-Homing Peptides Using Estimated Propensity Scores of Amino Acids.

Authors:  Phasit Charoenkwan; Wararat Chiangjong; Chanin Nantasenamat; Mohammad Ali Moni; Pietro Lio'; Balachandran Manavalan; Watshara Shoombuatong
Journal:  Pharmaceutics       Date:  2022-01-04       Impact factor: 6.321

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.