Literature DB >> 22591473

A sequence-based approach for predicting protein disordered regions.

Tao Huang1, Zhi-Song He, Wei-Ren Cui, Yu-Dong Cai, Xiao-He Shi, Le-Le Hu, Kuo-Chen Chou.   

Abstract

Protein disordered regions are associated with some critical cellular functions such as transcriptional regulation, translation and cellular signal transduction, and they are responsible for various diseases. Although experimental methods have been developed to determine these regions, they are time-consuming and expensive. Therefore, it is highly desired to develop computational methods that can provide us with this kind information in a rapid and inexpensive manner. Here we propose a sequence-based computational approach for predicting protein disordered regions by means of the Nearest Neighbor algorithm, in which conservation, amino acid factor and secondary structure status of each amino acid in a fixed-length sliding window are taken as the encoding features. Also, the feature selection based on mRMR (maximum Relevancy Minimum Redundancy) is applied to obtain an optimal 51-feature set that includes 39 conservation features and 12 secondary structure features. With the optimal 51 features, our predictor yielded quite promising MCC (Mathew's correlation coefficients): 0.371 on a rigorous benchmark dataset tested by 5-fold cross-validation and 0.219 on an independent test dataset. Our results suggest that conservation and secondary structure play important roles in intrinsically disordered proteins.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 22591473     DOI: 10.2174/0929866511320030002

Source DB:  PubMed          Journal:  Protein Pept Lett        ISSN: 0929-8665            Impact factor:   1.890


  9 in total

1.  Discriminating between deleterious and neutral non-frameshifting indels based on protein interaction networks and hybrid properties.

Authors:  Ning Zhang; Tao Huang; Yu-Dong Cai
Journal:  Mol Genet Genomics       Date:  2014-09-24       Impact factor: 3.291

2.  REGULATOR: a database of metazoan transcription factors and maternal factors for developmental studies.

Authors:  Kai Wang; Hiroki Nishida
Journal:  BMC Bioinformatics       Date:  2015-04-10       Impact factor: 3.169

3.  Novel candidate key drivers in the integrative network of genes, microRNAs, methylations, and copy number variations in squamous cell lung carcinoma.

Authors:  Tao Huang; Jing Yang; Yu-Dong Cai
Journal:  Biomed Res Int       Date:  2015-02-23       Impact factor: 3.411

4.  A novel approach for predicting disordered regions in a protein sequence.

Authors:  Meijing Li; Seong Beom Cho; Keun Ho Ryu
Journal:  Osong Public Health Res Perspect       Date:  2014-07-01

5.  iPhos-PseEn: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier.

Authors:  Wang-Ren Qiu; Xuan Xiao; Zhao-Chun Xu; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-08-09

6.  A novel method of predicting protein disordered regions based on sequence features.

Authors:  Tong-Hui Zhao; Min Jiang; Tao Huang; Bi-Qing Li; Ning Zhang; Hai-Peng Li; Yu-Dong Cai
Journal:  Biomed Res Int       Date:  2013-04-22       Impact factor: 3.411

7.  Signal propagation in protein interaction network during colorectal cancer progression.

Authors:  Yang Jiang; Tao Huang; Lei Chen; Yu-Fei Gao; Yudong Cai; Kuo-Chen Chou
Journal:  Biomed Res Int       Date:  2013-03-20       Impact factor: 3.411

8.  iMethyl-PseAAC: identification of protein methylation sites via a pseudo amino acid composition approach.

Authors:  Wang-Ren Qiu; Xuan Xiao; Wei-Zhong Lin; Kuo-Chen Chou
Journal:  Biomed Res Int       Date:  2014-05-22       Impact factor: 3.411

Review 9.  Digested disorder: Quarterly intrinsic disorder digest (January/February/March, 2013).

Authors:  Vladimir N Uversky
Journal:  Intrinsically Disord Proteins       Date:  2013-04-01
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.