Literature DB >> 34337652

PScL-HDeep: image-based prediction of protein subcellular location in human tissue using ensemble learning of handcrafted and deep learned features with two-layer feature selection.

Matee Ullah1, Ke Han2, Fazal Hadi3, Jian Xu2, Jiangning Song4, Dong-Jun Yu2.   

Abstract

Protein subcellular localization plays a crucial role in characterizing the function of proteins and understanding various cellular processes. Therefore, accurate identification of protein subcellular location is an important yet challenging task. Numerous computational methods have been proposed to predict the subcellular location of proteins. However, most existing methods have limited capability in terms of the overall accuracy, time consumption and generalization power. To address these problems, in this study, we developed a novel computational approach based on human protein atlas (HPA) data, referred to as PScL-HDeep, for accurate and efficient image-based prediction of protein subcellular location in human tissues. We extracted different handcrafted and deep learned (by employing pretrained deep learning model) features from different viewpoints of the image. The step-wise discriminant analysis (SDA) algorithm was applied to generate the optimal feature set from each original raw feature set. To further obtain a more informative feature subset, support vector machine-based recursive feature elimination with correlation bias reduction (SVM-RFE + CBR) feature selection algorithm was applied to the integrated feature set. Finally, the classification models, namely support vector machine with radial basis function (SVM-RBF) and support vector machine with linear kernel (SVM-LNR), were learned on the final selected feature set. To evaluate the performance of the proposed method, a new gold standard benchmark training dataset was constructed from the HPA databank. PScL-HDeep achieved the maximum performance on 10-fold cross validation test on this dataset and showed a better efficacy over existing predictors. Furthermore, we also illustrated the generalization ability of the proposed method by conducting a stringent independent validation test.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  bioimage analysis; deep learned features; feature selection; handcrafted features; protein subcellular location

Mesh:

Substances:

Year:  2021        PMID: 34337652      PMCID: PMC8574991          DOI: 10.1093/bib/bbab278

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  44 in total

1.  Support vector machine approach for protein subcellular localization prediction.

Authors:  S Hua; Z Sun
Journal:  Bioinformatics       Date:  2001-08       Impact factor: 6.937

2.  MSVM-RFE: extensions of SVM-RFE for multiclass gene selection on DNA microarray data.

Authors:  Xin Zhou; David P Tuck
Journal:  Bioinformatics       Date:  2007-05-01       Impact factor: 6.937

3.  Multilabel learning via random label selection for protein subcellular multilocations prediction.

Authors:  Xiao Wang; Guo-Zheng Li
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2013 Mar-Apr       Impact factor: 3.710

4.  iPreny-PseAAC: Identify C-terminal Cysteine Prenylation Sites in Proteins by Incorporating Two Tiers of Sequence Couplings into PseAAC.

Authors:  Yan Xu; Zu Wang; Chunhui Li; Kuo-Chen Chou
Journal:  Med Chem       Date:  2017       Impact factor: 2.745

5.  Multiclass cancer diagnosis using tumor gene expression signatures.

Authors:  S Ramaswamy; P Tamayo; R Rifkin; S Mukherjee; C H Yeang; M Angelo; C Ladd; M Reich; E Latulippe; J P Mesirov; T Poggio; W Gerald; M Loda; E S Lander; T R Golub
Journal:  Proc Natl Acad Sci U S A       Date:  2001-12-11       Impact factor: 11.205

Review 6.  An Unprecedented Revolution in Medicinal Chemistry Driven by the Progress of Biological Science.

Authors:  Kuo-Chen Chou
Journal:  Curr Top Med Chem       Date:  2017       Impact factor: 3.295

7.  A framework for the automated analysis of subcellular patterns in human protein atlas images.

Authors:  Justin Newberg; Robert F Murphy
Journal:  J Proteome Res       Date:  2008-04-25       Impact factor: 4.466

8.  Metadata management for high content screening in OMERO.

Authors:  Simon Li; Sébastien Besson; Colin Blackburn; Mark Carroll; Richard K Ferguson; Helen Flynn; Kenneth Gillen; Roger Leigh; Dominik Lindner; Melissa Linkert; William J Moore; Balaji Ramalingam; Emil Rozbicki; Gabriella Rustici; Aleksandra Tarkowska; Petr Walczysko; Eleanor Williams; Chris Allan; Jean-Marie Burel; Josh Moore; Jason R Swedlow
Journal:  Methods       Date:  2015-10-22       Impact factor: 3.608

9.  Bioimage-Based Prediction of Protein Subcellular Location in Human Tissue with Ensemble Features and Deep Networks.

Authors:  Guang-Hui Liu; Bei-Wei Zhang; Gang Qian; Bin Wang; Bo Mao; Isabelle Bichindaritz
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2020-12-08       Impact factor: 3.710

10.  Automated analysis and reannotation of subcellular locations in confocal images from the Human Protein Atlas.

Authors:  Jieyue Li; Justin Y Newberg; Mathias Uhlén; Emma Lundberg; Robert F Murphy
Journal:  PLoS One       Date:  2012-11-30       Impact factor: 3.240

View more
  2 in total

1.  PSL-LCCL: a resource for subcellular protein localization in liver cancer cell line SK_HEP1.

Authors:  Fang Huang; Xia Tang; Bo Ye; Songfeng Wu; Keyue Ding
Journal:  Database (Oxford)       Date:  2022-02-04       Impact factor: 4.462

2.  Immune ULBP1 is Elevated in Colon Adenocarcinoma and Predicts Prognosis.

Authors:  Guo-Tian Ruan; Hai-Lun Xie; Li-Chen Zhu; Yi-Zhong Ge; Lin Yan; Cun Liao; Yi-Zhen Gong; Han-Ping Shi
Journal:  Front Genet       Date:  2022-02-08       Impact factor: 4.599

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.