Identification of Protein Subcellular Localization With Network and Functional Embeddings.

Xiaoyong Pan, Hao Li, Tao Zeng, Zhandong Li, Lei Chen, Tao Huang, Yu-Dong Cai.

Abstract

The functions of proteins are largely determined by their subcellular localizations. Many computational methods for predicting protein subcellular localization have been proposed, but they still leave room for improvement, particularly in how proteins are represented. In this study, we present an embedding-based method for predicting the subcellular localization of proteins. We first learn functional embeddings of KEGG/GO terms, which are then used to represent proteins. Next, we learn network embeddings of proteins from a protein-protein network. The functional and network embeddings are combined into novel protein representations for constructing the final classification model. On our benchmark dataset of 4,861 proteins from 16 locations, the best model achieves a Matthews correlation coefficient of 0.872, outperforming multiple conventional methods.
Copyright © 2021 Pan, Li, Zeng, Li, Chen, Huang and Cai.
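
As an illustration of the pipeline summarized in the abstract, the following is a minimal sketch of two of its steps: joining a functional embedding and a network embedding into a single feature vector, and scoring predictions with a multi-class Matthews correlation coefficient. It assumes that the combination is a plain concatenation and that the reported score is the K-category generalization of the coefficient; the abstract confirms neither. The function names and the toy labels are hypothetical and are not taken from the paper.

```python
from collections import Counter
from math import sqrt


def combine_embeddings(functional, network):
    """Concatenate two per-protein vectors (assumed combination strategy)."""
    return list(functional) + list(network)


def multiclass_mcc(y_true, y_pred):
    """K-category Matthews correlation coefficient from label sequences."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    s = len(y_true)                                   # total samples
    c = sum(1 for t, p in zip(y_true, y_pred) if t == p)  # correct predictions
    true_counts = Counter(y_true)                     # occurrences of each true class
    pred_counts = Counter(y_pred)                     # occurrences of each predicted class
    labels = set(true_counts) | set(pred_counts)
    cov_tp = c * s - sum(true_counts[k] * pred_counts[k] for k in labels)
    cov_tt = s * s - sum(n * n for n in true_counts.values())
    cov_pp = s * s - sum(n * n for n in pred_counts.values())
    denom = sqrt(cov_tt * cov_pp)
    # By convention, return 0.0 when the coefficient is undefined
    # (for example, when every prediction falls in one class).
    return cov_tp / denom if denom else 0.0


if __name__ == "__main__":
    # Hypothetical toy data: three locations, five proteins.
    truth = ["nucleus", "cytoplasm", "nucleus", "membrane", "cytoplasm"]
    guess = ["nucleus", "cytoplasm", "cytoplasm", "membrane", "cytoplasm"]
    print(combine_embeddings([0.1, 0.2], [0.3]))
    print(round(multiclass_mcc(truth, guess), 3))
```

For two classes this formula reduces to the usual binary Matthews correlation coefficient, so the same function can score both binary and 16-location settings.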

Keywords: KEGG pathway; functional embedding; gene ontology; network embedding; protein subcellular localization

Year: 2021; PMID: 33584818; PMCID: PMC7873866; DOI: 10.3389/fgene.2020.626500

Source DB: PubMed; Journal: Front Genet; ISSN: 1664-8021; Impact factor: 4.599


