Relations of the numbers of protein sequences, families and folds.

C T Zhang.

Abstract

The relations among the numbers of protein sequences, families and folds have been studied theoretically. It is found that the number of families is related to the natural logarithm of the number of sequences. The logarithmic relation should not be changed regardless of what value of the homology threshold is applied in the protein sequence comparison routines. To study the relation between the numbers of families and folds, the degenerate degree of a fold has been introduced. The degenerate degree of a fold is the number of protein families which adopt the same fold. The distribution of the degenerate degrees of folds has been found to be very likely exponential. Based on the distribution, the average degenerate degree d is calculated. The number of folds is simply equal to that of families divided by the average degenerate degree of folds. It is shown that d is an increasing function of time. The current value of d is about 2. It will continue to increase and reach the value of at least 3.3 in some years. By using the above result, the numbers of protein folds for four species have been estimated. In particular, the number of folds for human proteins is estimated to be ≤5200.
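
The two relations stated in the abstract can be illustrated with a minimal Python sketch. It is not the paper's code. The abstract only says that the number of families is "related to" the natural logarithm of the number of sequences, so the linear form `a * ln(N) + b` and the coefficients `a` and `b` are assumptions made for illustration. The function names and the family count of 10000 used below are likewise hypothetical and are not taken from the paper.

```python
import math


def families_from_sequences(n_sequences: float, a: float, b: float) -> float:
    """Assumed logarithmic relation: families = a * ln(sequences) + b.

    The linear-in-ln form and the coefficients a, b are illustrative
    assumptions; the abstract does not give the exact expression.
    """
    if n_sequences <= 0:
        raise ValueError("n_sequences must be positive")
    return a * math.log(n_sequences) + b


def estimate_folds(n_families: float, mean_degeneracy: float) -> float:
    """Number of folds = number of families / average degenerate degree d."""
    if mean_degeneracy <= 0:
        raise ValueError("mean_degeneracy must be positive")
    return n_families / mean_degeneracy


if __name__ == "__main__":
    hypothetical_families = 10_000  # made-up input, not a value from the paper
    for d in (2.0, 3.3):  # the two values of d quoted in the abstract
        folds = estimate_folds(hypothetical_families, d)
        print(f"d = {d}: about {folds:.0f} folds")
```

Because the fold count is the family count divided by d, a larger d gives a smaller fold estimate for the same number of families.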

Year:  1997        PMID: 9342141     DOI: 10.1093/protein/10.7.757

Source DB:  PubMed          Journal:  Protein Eng        ISSN: 0269-2139


  5 in total

1.  Protein folds and protein folding. (Review)

Authors:  R Dustin Schaeffer; Valerie Daggett
Journal:  Protein Eng Des Sel       Date:  2010-11-03       Impact factor: 1.650

2.  Fold designability, distribution, and disease.

Authors:  Philip Wong; Dmitrij Frishman
Journal:  PLoS Comput Biol       Date:  2006-05-05       Impact factor: 4.475

3.  Protein folds as synapomorphies of the tree of life.

Authors:  Martin Romei; Guillaume Sapriel; Pierre Imbert; Théo Jamay; Jacques Chomilier; Guillaume Lecointre; Mathilde Carpentier
Journal:  Evolution       Date:  2022-07-13       Impact factor: 4.171

4.  Exploring dynamics of protein structure determination and homology-based prediction to estimate the number of superfamilies and folds.

Authors:  Ruslan I Sadreyev; Nick V Grishin
Journal:  BMC Struct Biol       Date:  2006-03-20

5.  Visualisation and graph-theoretic analysis of a large-scale protein structural interactome.

Authors:  Dan Bolser; Panos Dafas; Richard Harrington; Jong Park; Michael Schroeder
Journal:  BMC Bioinformatics       Date:  2003-10-08       Impact factor: 3.169
