Literature DB >> 31504150

Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning.

Jiajun Hong1,2, Yongchao Luo2, Yang Zhang2,3, Junbiao Ying2, Weiwei Xue3, Tian Xie1, Lin Tao1, Feng Zhu1,2.   

Abstract

Functional annotation of protein sequence with high accuracy has become one of the most important issues in modern biomedical studies, and computational approaches of significantly accelerated analysis process and enhanced accuracy are greatly desired. Although a variety of methods have been developed to elevate protein annotation accuracy, their ability in controlling false annotation rates remains either limited or not systematically evaluated. In this study, a protein encoding strategy, together with a deep learning algorithm, was proposed to control the false discovery rate in protein function annotation, and its performances were systematically compared with that of the traditional similarity-based and de novo approaches. Based on a comprehensive assessment from multiple perspectives, the proposed strategy and algorithm were found to perform better in both prediction stability and annotation accuracy compared with other de novo methods. Moreover, an in-depth assessment revealed that it possessed an improved capacity of controlling the false discovery rate compared with traditional methods. All in all, this study not only provided a comprehensive analysis on the performances of the newly proposed strategy but also provided a tool for the researcher in the fields of protein function annotation.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Keywords:  annotation accuracy; deep learning; false discovery rate; prediction stability; protein function prediction

Year:  2020        PMID: 31504150      PMCID: PMC7412958          DOI: 10.1093/bib/bbz081

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  105 in total

1.  A scalable method for integration and functional analysis of multiple microarray datasets.

Authors:  Curtis Huttenhower; Matt Hibbs; Chad Myers; Olga G Troyanskaya
Journal:  Bioinformatics       Date:  2006-09-27       Impact factor: 6.937

2.  Update of TTD: Therapeutic Target Database.

Authors:  Feng Zhu; BuCong Han; Pankaj Kumar; XiangHui Liu; XiaoHua Ma; Xiaona Wei; Lu Huang; YangFan Guo; LianYi Han; ChanJuan Zheng; YuZong Chen
Journal:  Nucleic Acids Res       Date:  2009-11-20       Impact factor: 16.971

3.  How Does Chirality Determine the Selective Inhibition of Histone Deacetylase 6? A Lesson from Trichostatin A Enantiomers Based on Molecular Dynamics.

Authors:  Yang Zhang; Jun Biao Ying; Jia Jun Hong; Feng Cheng Li; Ting Ting Fu; Feng Yuan Yang; Guo Xun Zheng; Xiao Jun Yao; Yan Lou; Yunqing Qiu; Wei Wei Xue; Feng Zhu
Journal:  ACS Chem Neurosci       Date:  2019-02-26       Impact factor: 4.418

4.  Global protein function prediction from protein-protein interaction networks.

Authors:  Alexei Vazquez; Alessandro Flammini; Amos Maritan; Alessandro Vespignani
Journal:  Nat Biotechnol       Date:  2003-05-12       Impact factor: 54.908

5.  UniProt: the universal protein knowledgebase.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

6.  Gene2vec: gene subsequence embedding for prediction of mammalian N 6-methyladenosine sites from mRNA.

Authors:  Quan Zou; Pengwei Xing; Leyi Wei; Bin Liu
Journal:  RNA       Date:  2018-11-13       Impact factor: 4.942

7.  Comparison of FDA Approved Kinase Targets to Clinical Trial Ones: Insights from Their System Profiles and Drug-Target Interaction Networks.

Authors:  Jingyu Xu; Panpan Wang; Hong Yang; Jin Zhou; Yinghong Li; Xiaoxu Li; Weiwei Xue; Chunyan Yu; Yubin Tian; Feng Zhu
Journal:  Biomed Res Int       Date:  2016-07-28       Impact factor: 3.411

8.  Therapeutic target database update 2016: enriched resource for bench to clinical drug target and targeted pathway information.

Authors:  Hong Yang; Chu Qin; Ying Hong Li; Lin Tao; Jin Zhou; Chun Yan Yu; Feng Xu; Zhe Chen; Feng Zhu; Yu Zong Chen
Journal:  Nucleic Acids Res       Date:  2015-11-17       Impact factor: 16.971

9.  ASAP: a machine learning framework for local protein properties.

Authors:  Nadav Brandes; Dan Ofer; Michal Linial
Journal:  Database (Oxford)       Date:  2016-10-02       Impact factor: 3.451

10.  The Pfam protein families database in 2019.

Authors:  Sara El-Gebali; Jaina Mistry; Alex Bateman; Sean R Eddy; Aurélien Luciani; Simon C Potter; Matloob Qureshi; Lorna J Richardson; Gustavo A Salazar; Alfredo Smart; Erik L L Sonnhammer; Layla Hirsh; Lisanna Paladin; Damiano Piovesan; Silvio C E Tosatto; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  25 in total

1.  Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework.

Authors:  Fuyi Li; Jinxiang Chen; Zongyuan Ge; Ya Wen; Yanwei Yue; Morihiro Hayashida; Abdelkader Baggag; Halima Bensmail; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

2.  Improving language model of human genome for DNA-protein binding prediction based on task-specific pre-training.

Authors:  Hanyu Luo; Wenyu Shan; Cheng Chen; Pingjian Ding; Lingyun Luo
Journal:  Interdiscip Sci       Date:  2022-09-22       Impact factor: 3.492

3.  RBP-TSTL is a two-stage transfer learning framework for genome-scale prediction of RNA-binding proteins.

Authors:  Xinxin Peng; Xiaoyu Wang; Yuming Guo; Zongyuan Ge; Fuyi Li; Xin Gao; Jiangning Song
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

4.  Understanding the mutational frequency in SARS-CoV-2 proteome using structural features.

Authors:  Puneet Rawat; Divya Sharma; Medha Pandey; R Prabakaran; M Michael Gromiha
Journal:  Comput Biol Med       Date:  2022-06-07       Impact factor: 6.698

5.  SAResNet: self-attention residual network for predicting DNA-protein binding.

Authors:  Long-Chen Shen; Yan Liu; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2021-09-02       Impact factor: 11.622

6.  nhKcr: a new bioinformatics tool for predicting crotonylation sites on human nonhistone proteins based on deep learning.

Authors:  Yong-Zi Chen; Zhuo-Zhi Wang; Yanan Wang; Guoguang Ying; Zhen Chen; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

7.  NOREVA: enhanced normalization and evaluation of time-course and multi-class metabolomic data.

Authors:  Qingxia Yang; Yunxia Wang; Ying Zhang; Fengcheng Li; Weiqi Xia; Ying Zhou; Yunqing Qiu; Honglin Li; Feng Zhu
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

8.  Predicting Functions of Uncharacterized Human Proteins: From Canonical to Proteoforms.

Authors:  Ekaterina Poverennaya; Olga Kiseleva; Anastasia Romanova; Mikhail Pyatnitskiy
Journal:  Genes (Basel)       Date:  2020-06-21       Impact factor: 4.096

9.  Databases for the targeted COVID-19 therapeutics.

Authors:  Yunxia Wang; Fengcheng Li; Ying Zhang; Ying Zhou; Ying Tan; Yuzong Chen; Feng Zhu
Journal:  Br J Pharmacol       Date:  2020-09-28       Impact factor: 8.739

Review 10.  Systems biology approaches integrated with artificial intelligence for optimized metabolic engineering.

Authors:  Mohamed Helmy; Derek Smith; Kumar Selvarajoo
Journal:  Metab Eng Commun       Date:  2020-10-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.