Literature DB >> 33834199

A comprehensive review of the imbalance classification of protein post-translational modifications.

Lijun Dou1, Fenglong Yang1, Lei Xu2, Quan Zou3.   

Abstract

Post-translational modifications (PTMs) play significant roles in regulating protein structure, activity and function, and they are closely involved in various pathologies. Therefore, the identification of associated PTMs is the foundation of in-depth research on related biological mechanisms, disease treatments and drug design. Due to the high cost and time consumption of high-throughput sequencing techniques, developing machine learning-based predictors has been considered an effective approach to rapidly recognize potential modified sites. However, the imbalanced distribution of true and false PTM sites, namely, the data imbalance problem, largely effects the reliability and application of prediction tools. In this article, we conduct a systematic survey of the research progress in the imbalanced PTMs classification. First, we describe the modeling process in detail and outline useful data imbalance solutions. Then, we summarize the recently proposed bioinformatics tools based on imbalanced PTM data and simultaneously build a convenient website, ImClassi_PTMs (available at lab.malab.cn/∼dlj/ImbClassi_PTMs/), to facilitate the researchers to view. Moreover, we analyze the challenges of current computational predictors and propose some suggestions to improve the efficiency of imbalance learning. We hope that this work will provide comprehensive knowledge of imbalanced PTM recognition and contribute to advanced predictors in the future.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  data imbalance; machine learning; performance evaluation; post-translational modifications; sequence analysis

Year:  2021        PMID: 33834199     DOI: 10.1093/bib/bbab089

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  2 in total

1.  Forkhead Box Protein P3 (FOXP3) Represses ATF3 Transcriptional Activity.

Authors:  Chiung-Min Wang; William Harry Yang; Leticia Cardoso; Ninoska Gutierrez; Richard Henry Yang; Wei-Hsiung Yang
Journal:  Int J Mol Sci       Date:  2021-10-22       Impact factor: 5.923

2.  DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins.

Authors:  Feifei Cui; Shuang Li; Zilong Zhang; Miaomiao Sui; Chen Cao; Abd El-Latif Hesham; Quan Zou
Journal:  Comput Struct Biotechnol J       Date:  2022-04-26       Impact factor: 6.155

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.