| Literature DB >> 26962440 |
Jeffrey Skolnick1, Mu Gao1, Hongyi Zhou1.
Abstract
Native proteins perform an amazing variety of biochemical functions, including enzymatic catalysis, and can engage in protein-protein and protein-DNA interactions that are essential for life. A key question is how special are these functional properties of proteins. Are they extremely rare, or are they an intrinsic feature? Comparison to the properties of compact conformations of artificially generated compact protein structures selected for thermodynamic stability but not any type of function, the artificial (ART) protein library, demonstrates that a remarkable number of the properties of native-like proteins are recapitulated. These include the complete set of small molecule ligand-binding pockets and most protein-protein interfaces. ART structures are predicted to be capable of weakly binding metabolites and cover a significant fraction of metabolic pathways, with the most enriched pathways including ancient ones such as glycolysis. Native-like active sites are also found in ART proteins. A small fraction of ART proteins are predicted to have strong protein-protein and protein-DNA interactions. Overall, it appears that biochemical function is an intrinsic feature of proteins which nature has significantly optimized during evolution. These studies raise questions as to the relative roles of specificity and promiscuity in the biochemical function and control of cells that need investigation.Entities:
Keywords: Enzymatic active sites; Protein structure; biochemical function; native proteins; protein conformation; protein interactions
Year: 2016 PMID: 26962440 PMCID: PMC4765716 DOI: 10.12688/f1000research.7374.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Top 18 most enriched pathways by FINDSITE comb.
| Pathway | Enrichment factor | Number of proteins
|
|---|---|---|
| Insulin signaling pathway | 77.5 | 67 |
| Alcoholism | 68.0 | 5 |
| Amphetamine addiction | 68.0 | 46 |
| Cocaine addiction | 68.0 | 42 |
| Huntington's disease | 68.0 | 36 |
| Amyotrophic lateral sclerosis (ALS) | 68.0 | 29 |
| GABAergic synapse | 68.0 | 28 |
| Taurine and hypotaurine metabolism | 68.0 | 27 |
| Proximal tubule bicarbonate reclamation | 67.3 | 14 |
| HMG-CoA reductase inhibitors | 59.8 | 4 |
| Galactose metabolism | 56.8 | 20 |
| Phosphotransferase system (PTS) | 54.1 | n.a. |
| Glycerolipid metabolism | 48.7 | 20 |
| Butirosin and neomycin biosynthesis | 45.9 | 3 |
| Glyoxylate and dicarboxylate metabolism | 45.9 | 18 |
| Alanine, aspartate, and glutamate metabolism | 45.6 | 27 |
| Glycolysis/gluconeogenesis | 39.7 | 30 |
| Retrograde endocannabinoid signaling | 39.6 | 23 |
Figure 1. Cumulative fraction of enzymes whose active sites match pocket residues in ( A) other classes of enzymes in native structures with different first two digit Enzyme Commission (EC) numbers, ( B) in non-enzymes, and ( C) in ART structures. For each target enzyme, we count the number of alternative enzyme classes that contain at least a hit by the target enzyme at various root-mean-square-deviation (RMSD) cut-offs.
For a subset of ART proteins containing active site residues and geometrics, % of pockets that have a significant p-value to the native enzymatic pocket.
| Native enzyme | Enzyme Commission
| % (number) of ART
| Number of
| Template modeling
|
|---|---|---|---|---|
| 2a8yB-ART1
| 2.4.2.28 | 0% (0/11,680) | 25 | 0.25 |
| 2cstA-ART1 | 2.6.1.1 | 0.33% (39/11,680) | 14 | 0.23 |
| 2fniA-ART1
| 2.6.1.92 | 0 (0/11,680) | 45 | 0.25 |
| 1im5A-ART1 | 3.5.1.19 | 0.027% (3/11,520) | 17 | 0.27 |
| 1kopB-ART1 | 4.2.1.1 | 0.21% (24/11,680) | 26 | 0.28 |
| 1nu3A-ART1 | 3.3.2.8 | 0.38% (82/211,440) | 21 | 0.27 |
| 1oygA-ART1
| 2.4.1.10 | 0.0% (4/11,680) | 32 | 0.23 |
| 1oygA-ART2 | 2.4.1.10 | 0.043% (5/11,680) | 32 | 0.17 |
| 1sd1A-ART1 | 2.4.2.28 | 1.4% (167/11,680) | 31 | 0.23 |
| 1sd1A-ART2
| 2.4.2.28 | 0 (0/11,520) | 31 | 0.27 |
| 1w23B-ART1
| 2.6.1.52 | 0% (0/221,280) | 10 | 0.19 |
| 1xffA-ART1 | 2.6.1.16 | 1.8% (211/11,680) | 28 | 0.34 |
| 1yxhA-ART1 | 3.1.1.4 | 0.45% (52/11,520) | 34 | 0.35 |
| 2z2xA-ART1 | 3.4.21.62 | 0% (0/11,680) | 27 | 0.26 |
aNo pockets matched even without the active site residue matching restraint imposed.
b8/221,280 pockets match without the active site residue matching restraint imposed.
Figure 2. Root-mean-square-deviation (RMSD) of native enzymatic to ART pockets versus the number of pocket residues aligned.
The number of residues in the native active site pocket, N, is shown in the figure legend.
Figure 3. Artificial protein-protein, protein-DNA complexes.
( A) Statistics of putative artificial protein-protein complexes. Joint probability density of interaction energy E [43] and the p-value of the interface similarity (IS)-score [42, 44] between an artificial complex and its corresponding native template. Darker blue indicates higher density, with the 100 lowest density spots represented by grey spheres. A vertical/horizontal dashed line is placed at E = -15 (a cut-off for high likelihood of interaction) and P = 1×10 -3. ( B) Protein-binding propensity scores (>0 implies favorable binding) of native protein-protein interfaces versus putatively attractive ( E <-15) and repulsive ( E >10) artificial protein-protein interfaces. ( C) Example of an ART protein-protein complex. The complex was built by superimposing two artificial structures (cyan and orange) onto a native dimeric template (Protein Data Bank [PDB] code 2f4m, chain A and B, colored in green and purple). Interface alignment according to iAlign [42]. Both structures are shown in line representations, with the non-interfacial regions of the native template shown in transparent mode for clarity. ( D) Statistics of artificial DNA-protein complexes. Joint probability density of DNA-protein interaction energy, E [46], and the interfacial template modeling (TM)-score [22] between an ART protein and its corresponding native template. A vertical/horizontal dashed line is placed at E = -10 and iTM-score = 0.4. ( E) DNA-binding propensity scores (>0 implies favorable binding) of native DNA-protein interfaces versus putatively attractive ( E <-10) and repulsive ( E >10) artificial DNA-protein interfaces. ( F) Example of an artificial DNA-protein complex. The complex was built by superimposing the ART structure (red) onto a native template (PDB code 1akh, the native protein and DNA are colored in green and cyan, respectively).