Literature DB >> 20887256

Analysis and prediction of RNA-binding residues using sequence, evolutionary conservation, and predicted secondary structure and solvent accessibility.

Tuo Zhang1, Hua Zhang, Ke Chen, Jishou Ruan, Shiyi Shen, Lukasz Kurgan.   

Abstract

Identification and prediction of RNA-binding residues (RBRs) provides valuable insights into the mechanisms of protein-RNA interactions. We analyzed the contributions of a wide range of factors including amino acid sequence, evolutionary conservation, secondary structure and solvent accessibility, to the prediction/characterization of RBRs. Five feature sets were designed and feature selection was performed to find and investigate relevant features. We demonstrate that (1) interactions with positively charged amino acids Arg and Lys are preferred by the egatively charged nucleotides; (2) Gly provides flexibility for the RNA binding sites; (3) Glu with negatively charged side chain and several hydrophobic residues such as Leu, Val, Ala and Phe are disfavored in the RNA-binding sites; (4) coil residues, especially in long segments, are more flexible (than other secondary structures) and more likely to interact with RNA; (5) helical residues are more rigid and consequently they are less likely to bind RNA; and (6) residues partially exposed to the solvent are more likely to form RNA-binding sites. We introduce a novel sequence-based predictor of RBRs, RBRpred, which utilizes the selected features. RBRpred is comprehensively tested on three datasets with varied atom distance cutoffs by performing both five-fold cross validation and jackknife tests and achieves Matthew's correlation coefficient (MCC) of 0.51, 0.48 and 0.42, respectively. The quality is comparable to or better than that for state-of-the-art predictors that apply the distancebased cutoff definition. We show that the most important factor for RBRs prediction is evolutionary conservation, followed by the amino acid sequence, predicted secondary structure and predicted solvent accessibility. We also investigate the impact of using native vs. predicted secondary structure and solvent accessibility. The predictions are sufficient for the RBR prediction and the knowledge of the actual solvent accessibility helps in predictions for lower distance cutoffs.

Mesh:

Substances:

Year:  2010        PMID: 20887256     DOI: 10.2174/138920310794109193

Source DB:  PubMed          Journal:  Curr Protein Pept Sci        ISSN: 1389-2037            Impact factor:   3.272


  15 in total

1.  DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.

Authors:  Fuyi Li; Jinxiang Chen; André Leier; Tatiana Marquez-Lago; Quanzhong Liu; Yanze Wang; Jerico Revote; A Ian Smith; Tatsuya Akutsu; Geoffrey I Webb; Lukasz Kurgan; Jiangning Song
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

2.  JAK2-binding long noncoding RNA promotes breast cancer brain metastasis.

Authors:  Shouyu Wang; Ke Liang; Qingsong Hu; Ping Li; Jian Song; Yuedong Yang; Jun Yao; Lingegowda Selanere Mangala; Chunlai Li; Wenhao Yang; Peter K Park; David H Hawke; Jianwei Zhou; Yan Zhou; Weiya Xia; Mien-Chie Hung; Jeffrey R Marks; Gary E Gallick; Gabriel Lopez-Berestein; Elsa R Flores; Anil K Sood; Suyun Huang; Dihua Yu; Liuqing Yang; Chunru Lin
Journal:  J Clin Invest       Date:  2017-11-13       Impact factor: 14.808

3.  GENN: a GEneral Neural Network for learning tabulated data with examples from protein structure prediction.

Authors:  Eshel Faraggi; Andrzej Kloczkowski
Journal:  Methods Mol Biol       Date:  2015

Review 4.  Prediction of RNA binding proteins comes of age from low resolution to high resolution.

Authors:  Huiying Zhao; Yuedong Yang; Yaoqi Zhou
Journal:  Mol Biosyst       Date:  2013-10

5.  Accurate single-sequence prediction of solvent accessible surface area using local and global features.

Authors:  Eshel Faraggi; Yaoqi Zhou; Andrzej Kloczkowski
Journal:  Proteins       Date:  2014-09-25

6.  Exploiting a reduced set of weighted average features to improve prediction of DNA-binding residues from 3D structures.

Authors:  Yi Xiong; Junfeng Xia; Wen Zhang; Juan Liu
Journal:  PLoS One       Date:  2011-12-08       Impact factor: 3.240

7.  ccPDB: compilation and creation of data sets from Protein Data Bank.

Authors:  Harinder Singh; Jagat Singh Chauhan; M Michael Gromiha; Gajendra P S Raghava
Journal:  Nucleic Acids Res       Date:  2011-12-01       Impact factor: 16.971

8.  NEAT1 is essential for metabolic changes that promote breast cancer growth and metastasis.

Authors:  Mi Kyung Park; Li Zhang; Kyung-Won Min; Jung-Hyun Cho; Chih-Chen Yeh; Hyesu Moon; Daniel Hormaechea-Agulla; Hyejin Mun; Seungbeom Ko; Ji Won Lee; Sonali Jathar; Aubrey S Smith; Yixin Yao; Nguyen Thu Giang; Hong Ha Vu; Victoria C Yan; Mary C Bridges; Antonis Kourtidis; Florian Muller; Jeong Ho Chang; Su Jung Song; Shinichi Nakagawa; Tetsuro Hirose; Je-Hyun Yoon; Min Sup Song
Journal:  Cell Metab       Date:  2021-12-07       Impact factor: 27.287

9.  Protein-RNA interface residue prediction using machine learning: an assessment of the state of the art.

Authors:  Rasna R Walia; Cornelia Caragea; Benjamin A Lewis; Fadi Towfic; Michael Terribilini; Yasser El-Manzalawy; Drena Dobbs; Vasant Honavar
Journal:  BMC Bioinformatics       Date:  2012-05-10       Impact factor: 3.169

Review 10.  Computational Prediction of RNA-Binding Proteins and Binding Sites.

Authors:  Jingna Si; Jing Cui; Jin Cheng; Rongling Wu
Journal:  Int J Mol Sci       Date:  2015-11-03       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.