Literature DB >> 15039428

Classification of nuclear receptors based on amino acid composition and dipeptide composition.

Manoj Bhasin1, Gajendra P S Raghava.   

Abstract

Nuclear receptors are key transcription factors that regulate crucial gene networks responsible for cell growth, differentiation, and homeostasis. Nuclear receptors form a superfamily of phylogenetically related proteins and control functions associated with major diseases (e.g. diabetes, osteoporosis, and cancer). In this study, a novel method has been developed for classifying the subfamilies of nuclear receptors. The classification was achieved on the basis of amino acid and dipeptide composition from a sequence of receptors using support vector machines. The training and testing was done on a non-redundant data set of 282 proteins obtained from the NucleaRDB data base (1). The performance of all classifiers was evaluated using a 5-fold cross validation test. In the 5-fold cross-validation, the data set was randomly partitioned into five equal sets and evaluated five times on each distinct set while keeping the remaining four sets for training. It was found that different subfamilies of nuclear receptors were quite closely correlated in terms of amino acid composition as well as dipeptide composition. The overall accuracy of amino acid composition-based and dipeptide composition-based classifiers were 82.6 and 97.5%, respectively. Therefore, our results prove that different subfamilies of nuclear receptors are predictable with considerable accuracy using amino acid or dipeptide composition. Furthermore, based on above approach, an online web service, NRpred, was developed, which is available at www.imtech.res.in/raghava/nrpred.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15039428     DOI: 10.1074/jbc.M401932200

Source DB:  PubMed          Journal:  J Biol Chem        ISSN: 0021-9258            Impact factor:   5.157


  65 in total

1.  GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors.

Authors:  Manoj Bhasin; G P S Raghava
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  Computational prediction of human proteins that can be secreted into the bloodstream.

Authors:  Juan Cui; Qi Liu; David Puett; Ying Xu
Journal:  Bioinformatics       Date:  2008-08-12       Impact factor: 6.937

3.  Incorporating significant amino acid pairs and protein domains to predict RNA splicing-related proteins with functional roles.

Authors:  Justin Bo-Kai Hsu; Kai-Yao Huang; Tzu-Ya Weng; Chien-Hsun Huang; Tzong-Yi Lee
Journal:  J Comput Aided Mol Des       Date:  2014-01-19       Impact factor: 3.686

4.  Development of a biomarker database toward performing disease classification and finding disease interrelations.

Authors:  Shaikh Farhad Hossain; Ming Huang; Naoaki Ono; Aki Morita; Shigehiko Kanaya; Md Altaf-Ul-Amin
Journal:  Database (Oxford)       Date:  2021-03-11       Impact factor: 3.451

5.  iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Authors:  Zhen Chen; Pei Zhao; Chen Li; Fuyi Li; Dongxu Xiang; Yong-Zi Chen; Tatsuya Akutsu; Roger J Daly; Geoffrey I Webb; Quanzhi Zhao; Lukasz Kurgan; Jiangning Song
Journal:  Nucleic Acids Res       Date:  2021-06-04       Impact factor: 16.971

6.  Dipeptide analysis of p53 mutations and evolution of p53 family proteins.

Authors:  Qiang Huang; Long Yu; Arnold J Levine; Ruth Nussinov; Buyong Ma
Journal:  Biochim Biophys Acta       Date:  2013-04-10

7.  Prediction and classification of aminoacyl tRNA synthetases using PROSITE domains.

Authors:  Bharat Panwar; Gajendra P S Raghava
Journal:  BMC Genomics       Date:  2010-09-22       Impact factor: 3.969

8.  Kinome-wide interaction modelling using alignment-based and alignment-independent approaches for kinase description and linear and non-linear data analysis techniques.

Authors:  Maris Lapins; Jarl Es Wikberg
Journal:  BMC Bioinformatics       Date:  2010-06-22       Impact factor: 3.169

9.  BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches.

Authors:  Bin Liu; Xin Gao; Hanyu Zhang
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

10.  LncRNA-Encoded Short Peptides Identification Using Feature Subset Recombination and Ensemble Learning.

Authors:  Siyuan Zhao; Jun Meng; Yushi Luan
Journal:  Interdiscip Sci       Date:  2021-07-25       Impact factor: 2.233

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.