Literature DB >> 1304347

An optimization approach to predicting protein structural class from amino acid composition.

C T Zhang1, K C Chou.   

Abstract

Proteins are generally classified into four structural classes: all-alpha proteins, all-beta proteins, alpha + beta proteins, and alpha/beta proteins. In this article, a protein is expressed as a vector of 20-dimensional space, in which its 20 components are defined by the composition of its 20 amino acids. Based on this, a new method, the so-called maximum component coefficient method, is proposed for predicting the structural class of a protein according to its amino acid composition. In comparison with the existing methods, the new method yields a higher general accuracy of prediction. Especially for the all-alpha proteins, the rate of correct prediction obtained by the new method is much higher than that by any of the existing methods. For instance, for the 19 all-alpha proteins investigated previously by P.Y. Chou, the rate of correct prediction by means of his method was 84.2%, but the correct rate when predicted with the new method would be 100%! Furthermore, the new method is characterized by an explicable physical picture. This is reflected by the process in which the vector representing a protein to be predicted is decomposed into four component vectors, each of which corresponds to one of the norms of the four protein structural classes.

Mesh:

Substances:

Year:  1992        PMID: 1304347      PMCID: PMC2142205          DOI: 10.1002/pro.5560010312

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  33 in total

1.  Isolation and characterization of the ZWF1 gene of Saccharomyces cerevisiae, encoding glucose-6-phosphate dehydrogenase.

Authors:  I Nogae; M Johnston
Journal:  Gene       Date:  1990-12-15       Impact factor: 3.688

2.  A prediction of tertiary structures of peptide by the Monte Carlo simulated annealing method.

Authors:  H Kawai; T Kikuchi; Y Okamoto
Journal:  Protein Eng       Date:  1989-11

3.  Codon usage tabulated from the GenBank genetic sequence data.

Authors:  K Wada; S Aota; R Tsuchiya; F Ishibashi; T Gojobori; T Ikemura
Journal:  Nucleic Acids Res       Date:  1990-04-25       Impact factor: 16.971

4.  Energy of stabilization of the right-handed beta alpha beta crossover in proteins.

Authors:  K C Chou; G Némethy; M Pottle; H A Scheraga
Journal:  J Mol Biol       Date:  1989-01-05       Impact factor: 5.469

5.  Prediction of protein structural class by discriminant analysis.

Authors:  P Klein
Journal:  Biochim Biophys Acta       Date:  1986-11-21

6.  Glucose 6-phosphate dehydrogenase from Leuconostoc mesenteroides. Kinetic studies.

Authors:  C Olive; M E Geroch; H R Levy
Journal:  J Biol Chem       Date:  1971-04-10       Impact factor: 5.157

7.  Structural principles of the globular organization of protein chains. A stereochemical theory of globular protein secondary structure.

Authors:  V I Lim
Journal:  J Mol Biol       Date:  1974-10-05       Impact factor: 5.469

8.  Sequence and genetic organization of a Zymomonas mobilis gene cluster that encodes several enzymes of glucose metabolism.

Authors:  W O Barnell; K C Yi; T Conway
Journal:  J Bacteriol       Date:  1990-12       Impact factor: 3.490

9.  Isolation of human glucose-6-phosphate dehydrogenase (G6PD) cDNA clones: primary structure of the protein and unusual 5' non-coding region.

Authors:  M G Persico; G Viglietto; G Martini; D Toniolo; G Paonessa; C Moscatelli; R Dono; T Vulliamy; L Luzzatto; M D'Urso
Journal:  Nucleic Acids Res       Date:  1986-03-25       Impact factor: 16.971

10.  cDNA sequence of adrenodoxin reductase. Identification of NADP-binding sites in oxidoreductases.

Authors:  I Hanukoglu; T Gutfinger
Journal:  Eur J Biochem       Date:  1989-03-15
View more
  38 in total

1.  Prediction of the transmembrane regions of beta-barrel membrane proteins with a neural network-based predictor.

Authors:  I Jacoboni; P L Martelli; P Fariselli; V De Pinto; R Casadio
Journal:  Protein Sci       Date:  2001-04       Impact factor: 6.725

2.  Cascaded multiple classifiers for secondary structure prediction.

Authors:  M Ouali; R D King
Journal:  Protein Sci       Date:  2000-06       Impact factor: 6.725

3.  iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition.

Authors:  Hao Lin; En-Ze Deng; Hui Ding; Wei Chen; Kuo-Chen Chou
Journal:  Nucleic Acids Res       Date:  2014-10-31       Impact factor: 16.971

4.  Prediction of protein structural classes using hybrid properties.

Authors:  Wenjin Li; Kao Lin; Kaiyan Feng; Yudong Cai
Journal:  Mol Divers       Date:  2008-10-25       Impact factor: 2.943

5.  Prediction of protein folding types from amino acid composition by correlation angles.

Authors:  K C Chou
Journal:  Amino Acids       Date:  1994-10       Impact factor: 3.520

6.  A time-series-based feature extraction approach for prediction of protein structural class.

Authors:  Ravi Gupta; Ankush Mittal; Kuldip Singh
Journal:  EURASIP J Bioinform Syst Biol       Date:  2008

7.  iN6-methylat (5-step): identifying DNA N6-methyladenine sites in rice genome using continuous bag of nucleobases via Chou's 5-step rule.

Authors:  Nguyen Quoc Khanh Le
Journal:  Mol Genet Genomics       Date:  2019-05-04       Impact factor: 3.291

8.  Classification of Homo sapiens gene behavior using linear discriminant analysis fused with minimum entropy mapping.

Authors:  Joyshri Das; Soma Barman Mandal
Journal:  Med Biol Eng Comput       Date:  2021-02-17       Impact factor: 2.602

Review 9.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

10.  Semi-supervised protein subcellular localization.

Authors:  Qian Xu; Derek Hao Hu; Hong Xue; Weichuan Yu; Qiang Yang
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.