Literature DB >> 33989156

Identifying Protein Subcellular Locations With Embeddings-Based node2loc.

Xiaoyong Pan, Lei Chen, Min Liu, Zhibin Niu, Tao Huang, Yu-Dong Cai.   

Abstract

Identifying protein subcellular locations is an important topic in protein function prediction. Interacting proteins may share similar locations. Thus, it is imperative to infer protein subcellular locations by taking protein-protein interactions (PPIs)into account. In this study, we present a network embedding-based method, node2loc, to identify protein subcellular locations. node2loc first learns distributed embeddings of proteins in a protein-protein interaction (PPI)network using node2vec. Then the learned embeddings are further fed into a recurrent neural network (RNN). To resolve the severe class imbalance of different subcellular locations, Synthetic Minority Over-sampling Technique (SMOTE)is applied to artificially synthesize proteins for minority classes. node2loc is evaluated on our constructed human benchmark dataset with 16 subcellular locations and yields a Matthews correlation coefficient (MCC)value of 0.800, which is superior to baseline methods. In addition, node2loc yields a better performance on a Yeast benchmark dataset with 17 locations. The results demonstrate that the learned representations from a PPI network have certain discriminative ability for classifying protein subcellular locations. However, node2loc is a transductive method, it only works for proteins connected in a PPI network, and it needs to be retrained for new proteins. In addition, the PPI network needs be annotated to some extent with location information. node2loc is freely available at https://github.com/xypan1232/node2loc.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 33989156     DOI: 10.1109/TCBB.2021.3080386

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  12 in total

1.  Identifying COVID-19 Severity-Related SARS-CoV-2 Mutation Using a Machine Learning Method.

Authors:  Feiming Huang; Lei Chen; Wei Guo; Xianchao Zhou; Kaiyan Feng; Tao Huang; Yudong Cai
Journal:  Life (Basel)       Date:  2022-05-28

2.  Identifying In Vitro Cultured Human Hepatocytes Markers with Machine Learning Methods Based on Single-Cell RNA-Seq Data.

Authors:  ZhanDong Li; FeiMing Huang; Lei Chen; Tao Huang; Yu-Dong Cai
Journal:  Front Bioeng Biotechnol       Date:  2022-05-30

3.  Identifying Functions of Proteins in Mice With Functional Embedding Features.

Authors:  Hao Li; ShiQi Zhang; Lei Chen; Xiaoyong Pan; ZhanDong Li; Tao Huang; Yu-Dong Cai
Journal:  Front Genet       Date:  2022-05-16       Impact factor: 4.772

4.  SortPred: The first machine learning based predictor to identify bacterial sortases and their classes using sequence-derived information.

Authors:  Adeel Malik; Sathiyamoorthy Subramaniyam; Chang-Bae Kim; Balachandran Manavalan
Journal:  Comput Struct Biotechnol J       Date:  2021-12-14       Impact factor: 7.271

5.  iMPT-FDNPL: Identification of Membrane Protein Types with Functional Domains and a Natural Language Processing Approach.

Authors:  Wei Chen; Lei Chen; Qi Dai
Journal:  Comput Math Methods Med       Date:  2021-10-11       Impact factor: 2.238

6.  Identification of Novel Lung Cancer Driver Genes Connecting Different Omics Levels With a Heat Diffusion Algorithm.

Authors:  Fei Yuan; Xiaoyu Cao; Yu-Hang Zhang; Lei Chen; Tao Huang; ZhanDong Li; Yu-Dong Cai
Journal:  Front Cell Dev Biol       Date:  2022-01-26

7.  Identifying luminal and basal mammary cell specific genes and their expression patterns during pregnancy.

Authors:  Zhan Dong Li; Xiangtian Yu; Zi Mei; Tao Zeng; Lei Chen; Xian Ling Xu; Hao Li; Tao Huang; Yu-Dong Cai
Journal:  PLoS One       Date:  2022-04-29       Impact factor: 3.752

8.  Identification of uveitis-associated functions based on the feature selection analysis of gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment scores.

Authors:  Shiheng Lu; Hui Wang; Jian Zhang
Journal:  Front Mol Neurosci       Date:  2022-09-08       Impact factor: 6.261

9.  Identification of methylation signatures associated with CAR T cell in B-cell acute lymphoblastic leukemia and non-hodgkin's lymphoma.

Authors:  Jiwei Song; FeiMing Huang; Lei Chen; KaiYan Feng; Fangfang Jian; Tao Huang; Yu-Dong Cai
Journal:  Front Oncol       Date:  2022-08-11       Impact factor: 5.738

10.  Comparative Study on Feature Selection in Protein Structure and Function Prediction.

Authors:  Wenjing Yi; Ao Sun; Manman Liu; Xiaoqing Liu; Wei Zhang; Qi Dai
Journal:  Comput Math Methods Med       Date:  2022-10-11       Impact factor: 2.809

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.