Literature DB >> 33527980

Sequence representation approaches for sequence-based protein prediction tasks that use deep learning.

Feifei Cui1, Zilong Zhang1, Quan Zou2.   

Abstract

Deep learning has been increasingly used in bioinformatics, especially in sequence-based protein prediction tasks, as large amounts of biological data are available and deep learning techniques have been developed rapidly in recent years. For sequence-based protein prediction tasks, the selection of a suitable model architecture is essential, whereas sequence data representation is a major factor in controlling model performance. Here, we summarized all the main approaches that are used to represent protein sequence data (amino acid sequence encoding or embedding), which include end-to-end embedding methods, non-contextual embedding methods and embedding methods that use transfer learning and others that are applied for some specific tasks (such as protein sequence embedding based on extracted features for protein structure predictions and graph convolutional network-based embedding for drug discovery tasks). We have also reviewed the architectures of various types of embedding models theoretically and the development of these types of sequence embedding approaches to facilitate researchers and users in selecting the model that best suits their requirements.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  deep learning; end-to-end learning; protein sequence embedding; sequence representation; transfer learning

Year:  2021        PMID: 33527980     DOI: 10.1093/bfgp/elaa030

Source DB:  PubMed          Journal:  Brief Funct Genomics        ISSN: 2041-2649            Impact factor:   4.241


  9 in total

1.  iFeatureOmega: an integrative platform for engineering, visualization and analysis of features from molecular sequences, structural and ligand data sets.

Authors:  Zhen Chen; Xuhan Liu; Pei Zhao; Chen Li; Yanan Wang; Fuyi Li; Tatsuya Akutsu; Chris Bain; Robin B Gasser; Junzhou Li; Zuoren Yang; Xin Gao; Lukasz Kurgan; Jiangning Song
Journal:  Nucleic Acids Res       Date:  2022-05-07       Impact factor: 19.160

2.  DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins.

Authors:  Feifei Cui; Shuang Li; Zilong Zhang; Miaomiao Sui; Chen Cao; Abd El-Latif Hesham; Quan Zou
Journal:  Comput Struct Biotechnol J       Date:  2022-04-26       Impact factor: 6.155

3.  PIPENN: Protein Interface Prediction from sequence with an Ensemble of Neural Nets.

Authors:  Bas Stringer; Hans de Ferrante; Sanne Abeln; Jaap Heringa; K Anton Feenstra; Reza Haydarlou
Journal:  Bioinformatics       Date:  2022-02-12       Impact factor: 6.937

4.  Identification of plant vacuole proteins by exploiting deep representation learning features.

Authors:  Shihu Jiao; Quan Zou
Journal:  Comput Struct Biotechnol J       Date:  2022-06-08       Impact factor: 6.155

5.  iKcr_CNN: A novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss.

Authors:  Lijun Dou; Zilong Zhang; Lei Xu; Quan Zou
Journal:  Comput Struct Biotechnol J       Date:  2022-06-16       Impact factor: 6.155

6.  Identify Bitter Peptides by Using Deep Representation Learning Features.

Authors:  Jici Jiang; Xinxu Lin; Yueqi Jiang; Liangzhen Jiang; Zhibin Lv
Journal:  Int J Mol Sci       Date:  2022-07-17       Impact factor: 6.208

7.  Knowledge structure and emerging trends in the application of deep learning in genetics research: A bibliometric analysis [2000-2021].

Authors:  Bijun Zhang; Ting Fan
Journal:  Front Genet       Date:  2022-08-23       Impact factor: 4.772

8.  Identification of Helicobacter pylori Membrane Proteins Using Sequence-Based Features.

Authors:  Mujiexin Liu; Hui Chen; Dong Gao; Cai-Yi Ma; Zhao-Yue Zhang
Journal:  Comput Math Methods Med       Date:  2022-01-12       Impact factor: 2.238

Review 9.  Bioinformatics Research on Drug Sensitivity Prediction.

Authors:  Yaojia Chen; Liran Juan; Xiao Lv; Lei Shi
Journal:  Front Pharmacol       Date:  2021-12-09       Impact factor: 5.810

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.