Literature DB >> 35286461

ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning.

Shihu Jiao1, Zheng Chen2,3, Lichao Zhang4, Xun Zhou5, Lei Shi6.   

Abstract

Autophagy plays an important role in biological evolution and is regulated by many autophagy proteins. Accurate identification of autophagy proteins is crucially important to reveal their biological functions. Due to the expense and labor cost of experimental methods, it is urgent to develop automated, accurate and reliable sequence-based computational tools to enable the identification of novel autophagy proteins among numerous proteins and peptides. For this purpose, a new predictor named ATGPred-FL was proposed for the efficient identification of autophagy proteins. We investigated various sequence-based feature descriptors and adopted the feature learning method to generate corresponding, more informative probability features. Then, a two-step feature selection strategy based on accuracy was utilized to remove irrelevant and redundant features, leading to the most discriminative 14-dimensional feature set. The final predictor was built using a support vector machine classifier, which performed favorably on both the training and testing sets with accuracy values of 94.40% and 90.50%, respectively. ATGPred-FL is the first ATG machine learning predictor based on protein primary sequences. We envision that ATGPred-FL will be an effective and useful tool for autophagy protein identification, and it is available for free at http://lab.malab.cn/~acy/ATGPred-FL , the source code and datasets are accessible at https://github.com/jiaoshihu/ATGPred .
© 2022. The Author(s), under exclusive licence to Springer-Verlag GmbH Austria, part of Springer Nature.

Entities:  

Keywords:  Autophagy protein; Computational prediction; Feature representation learning scheme; Feature selection; Support vector machine

Mesh:

Substances:

Year:  2022        PMID: 35286461     DOI: 10.1007/s00726-022-03145-5

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  61 in total

Review 1.  Logistic regression and artificial neural network classification models: a methodology review.

Authors:  Stephan Dreiseitl; Lucila Ohno-Machado
Journal:  J Biomed Inform       Date:  2002 Oct-Dec       Impact factor: 6.317

2.  The evolutionarily conserved domain of Beclin 1 is required for Vps34 binding, autophagy and tumor suppressor function.

Authors:  Norihiko Furuya; Jie Yu; Maya Byfield; Sophie Pattingre; Beth Levine
Journal:  Autophagy       Date:  2005-04-17       Impact factor: 16.016

3.  Prediction of bio-sequence modifications and the associations with diseases.

Authors:  Chunyan Ao; Liang Yu; Quan Zou
Journal:  Brief Funct Genomics       Date:  2020-12-14       Impact factor: 4.241

4.  Identify origin of replication in Saccharomyces cerevisiae using two-step feature selection technique.

Authors:  Fu-Ying Dao; Hao Lv; Fang Wang; Chao-Qin Feng; Hui Ding; Wei Chen; Hao Lin
Journal:  Bioinformatics       Date:  2019-06-01       Impact factor: 6.937

5.  THANATOS: an integrative data resource of proteins and post-translational modifications in the regulation of autophagy.

Authors:  Wankun Deng; Lili Ma; Ying Zhang; Jiaqi Zhou; Yongbo Wang; Zexian Liu; Yu Xue
Journal:  Autophagy       Date:  2018       Impact factor: 16.016

Review 6.  Machine learning techniques for protein function prediction.

Authors:  Rosalin Bonetta; Gianluca Valentino
Journal:  Proteins       Date:  2019-11-14

Review 7.  Machine intelligence in peptide therapeutics: A next-generation tool for rapid disease screening.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Med Res Rev       Date:  2020-01-10       Impact factor: 12.944

Review 8.  Autophagy as a cell death and tumor suppressor mechanism.

Authors:  Devrim Gozuacik; Adi Kimchi
Journal:  Oncogene       Date:  2004-04-12       Impact factor: 9.867

9.  CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors:  Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

10.  CoV-Spectrum: Analysis of Globally Shared SARS-CoV-2 Data to Identify and Characterize New Variants.

Authors:  Chaoran Chen; Sarah Nadeau; Michael Yared; Philippe Voinov; Ning Xie; Cornelius Roemer; Tanja Stadler
Journal:  Bioinformatics       Date:  2021-12-25       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.