Literature DB >> 21993537

Fuzzy clustering of physicochemical and biochemical properties of amino acids.

Indrajit Saha1, Ujjwal Maulik, Sanghamitra Bandyopadhyay, Dariusz Plewczynski.   

Abstract

In this article, we categorize presently available experimental and theoretical knowledge of various physicochemical and biochemical features of amino acids, as collected in the AAindex database of known 544 amino acid (AA) indices. Previously reported 402 indices were categorized into six groups using hierarchical clustering technique and 142 were left unclustered. However, due to the increasing diversity of the database these indices are overlapping, therefore crisp clustering method may not provide optimal results. Moreover, in various large-scale bioinformatics analyses of whole proteomes, the proper selection of amino acid indices representing their biological significance is crucial for efficient and error-prone encoding of the short functional sequence motifs. In most cases, researchers perform exhaustive manual selection of the most informative indices. These two facts motivated us to analyse the widely used AA indices. The main goal of this article is twofold. First, we present a novel method of partitioning the bioinformatics data using consensus fuzzy clustering, where the recently proposed fuzzy clustering techniques are exploited. Second, we prepare three high quality subsets of all available indices. Superiority of the consensus fuzzy clustering method is demonstrated quantitatively, visually and statistically by comparing it with the previously proposed hierarchical clustered results. The processed AAindex1 database, supplementary material and the software are available at http://sysbio.icm.edu.pl/aaindex/ .

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21993537      PMCID: PMC3397137          DOI: 10.1007/s00726-011-1106-9

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  32 in total

1.  AAindex: amino acid index database.

Authors:  S Kawashima; M Kanehisa
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  A novel method for similarity analysis and protein sub-cellular localization prediction.

Authors:  Bo Liao; Benyou Liao; Xingming Sun; Qingguang Zeng
Journal:  Bioinformatics       Date:  2010-09-08       Impact factor: 6.937

3.  Use of amino acid composition to predict epitope residues of individual antibodies.

Authors:  Shinji Soga; Daisuke Kuroda; Hiroki Shirai; Masato Kobori; Noriaki Hirayama
Journal:  Protein Eng Des Sel       Date:  2010-03-19       Impact factor: 1.650

4.  Protein classification using texture descriptors extracted from the protein backbone image.

Authors:  Loris Nanni; Jian-Yu Shi; Sheryl Brahnam; Alessandra Lumini
Journal:  J Theor Biol       Date:  2010-03-20       Impact factor: 2.691

5.  AAIndexLoc: predicting subcellular localization of proteins based on a new representation of sequences using amino acid indices.

Authors:  E Tantoso; Kuo-Bin Li
Journal:  Amino Acids       Date:  2007-12-28       Impact factor: 3.520

6.  Variable context Markov chains for HIV protease cleavage site prediction.

Authors:  Hasan Oğul
Journal:  Biosystems       Date:  2009-03-24       Impact factor: 1.973

7.  Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins.

Authors:  K Tomii; M Kanehisa
Journal:  Protein Eng       Date:  1996-01

8.  Use of fuzzy clustering technique and matrices to classify amino acids and its impact to Chou's pseudo amino acid composition.

Authors:  D N Georgiou; T E Karakasidis; J J Nieto; A Torres
Journal:  J Theor Biol       Date:  2008-11-12       Impact factor: 2.691

9.  AMS 3.0: prediction of post-translational modifications.

Authors:  Subhadip Basu; Dariusz Plewczynski
Journal:  BMC Bioinformatics       Date:  2010-04-28       Impact factor: 3.169

10.  AAindex: amino acid index database, progress report 2008.

Authors:  Shuichi Kawashima; Piotr Pokarowski; Maria Pokarowska; Andrzej Kolinski; Toshiaki Katayama; Minoru Kanehisa
Journal:  Nucleic Acids Res       Date:  2007-11-12       Impact factor: 16.971

View more
  22 in total

1.  Protein-protein interaction site prediction in Homo sapiens and E. coli using an interaction-affinity based membership function in fuzzy SVM.

Authors:  Brijesh Kumar Sriwastava; Subhadip Basu; Ujjwal Maulik
Journal:  J Biosci       Date:  2015-10       Impact factor: 1.826

2.  AMS 4.0: consensus prediction of post-translational modifications in protein sequences.

Authors:  Dariusz Plewczynski; Subhadip Basu; Indrajit Saha
Journal:  Amino Acids       Date:  2012-05-04       Impact factor: 3.520

3.  RF-Hydroxysite: a random forest based predictor for hydroxylation sites.

Authors:  Hamid D Ismail; Robert H Newman; Dukka B Kc
Journal:  Mol Biosyst       Date:  2016-07-19

4.  Computational identification of ubiquitination sites in Arabidopsis thaliana using convolutional neural networks.

Authors:  Xiaofeng Wang; Renxiang Yan; Yong-Zi Chen; Yongji Wang
Journal:  Plant Mol Biol       Date:  2021-02-01       Impact factor: 4.076

5.  Some theoretical aspects of reprogramming the standard genetic code.

Authors:  Kuba Nowak; Paweł Błażej; Małgorzata Wnetrzak; Dorota Mackiewicz; Paweł Mackiewicz
Journal:  Genetics       Date:  2021-05-17       Impact factor: 4.562

6.  Prediction of DNA-Binding Protein-Drug-Binding Sites Using Residue Interaction Networks and Sequence Feature.

Authors:  Wei Wang; Yu Zhang; Dong Liu; HongJun Zhang; XianFang Wang; Yun Zhou
Journal:  Front Bioeng Biotechnol       Date:  2022-04-20

7.  CarSPred: a computational tool for predicting carbonylation sites of human proteins.

Authors:  Hongqiang Lv; Jiuqiang Han; Jun Liu; Jiguang Zheng; Ruiling Liu; Dexing Zhong
Journal:  PLoS One       Date:  2014-10-27       Impact factor: 3.240

8.  Prediction of protein-protein interaction sites by means of ensemble learning and weighted feature descriptor.

Authors:  Xiuquan Du; Shiwei Sun; Changlin Hu; Xinrui Li; Junfeng Xia
Journal:  J Biol Res (Thessalon)       Date:  2016-07-04       Impact factor: 1.889

9.  Consensus classification of human leukocyte antigen class II proteins.

Authors:  Indrajit Saha; Giovanni Mazzocco; Dariusz Plewczynski
Journal:  Immunogenetics       Date:  2012-11-16       Impact factor: 2.846

10.  PPIcons: identification of protein-protein interaction sites in selected organisms.

Authors:  Brijesh K Sriwastava; Subhadip Basu; Ujjwal Maulik; Dariusz Plewczynski
Journal:  J Mol Model       Date:  2013-06-02       Impact factor: 1.810

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.