| Literature DB >> 28142195 |
Sen Yao1,2,3,4,5, Robert M Flight3,4,5, Eric C Rouchka1,2, Hunter N B Moseley3,4,5.
Abstract
Metalloproteins bind and utilize metal ions for a variety of biological purposes. Due to the ubiquity of metalloprotein involvement throughout these processes across all domains of life, how proteins coordinate metal ions for different biochemical functions is of great relevance to understanding the implementation of these biological processes. Toward these ends, we have improved our methodology for structurally and functionally characterizing metal binding sites in metalloproteins. Our new ligand detection method is statistically much more robust, producing estimated false positive and false negative rates of ∼0.11% and ∼1.2%, respectively. Additional improvements expand both the range of metal ions and their coordination number that can be effectively analyzed. Also, the inclusion of additional quality control filters has significantly improved structure-function Spearman correlations as demonstrated by rho values greater than 0.90 for several metal coordination analyses and even one rho value above 0.95. Also, improvements in bond-length distributions have revealed bond-length modes specific to chemical functional groups involved in multidentation. Using these improved methods, we analyzed all single metal ion binding sites with Zn, Mg, Ca, Fe, and Na ions in the wwPDB, producing statistically rigorous results supporting the existence of both a significant number of unexpected compressed angles and subsequent aberrant metal ion coordination geometries (CGs) within structurally known metalloproteins. By recognizing these aberrant CGs in our clustering analyses, high correlations are achieved between structural and functional descriptions of metal ion coordination. Moreover, distinct biochemical functions are associated with aberrant CGs versus nonaberrant CGs. Proteins 2017; 85:885-907.Entities:
Keywords: 3D structure; bidentation; compressed angle; structural bioinformatics; structure-function relationship
Mesh:
Substances:
Year: 2017 PMID: 28142195 PMCID: PMC5389913 DOI: 10.1002/prot.25257
Source DB: PubMed Journal: Proteins ISSN: 0887-3585
Figure 1Structure of canonical CG models. For each structure, the magenta ball represents the metal ion center, and the white balls represent the binding ligands. The three‐letter code for each CG is shown in parenthesis. The major CG names are shown in red with their minor CGs follow in the same row. The minor CGs can often be viewed as missing ligands from their corresponding majors ones. CGs are also separated by lines according to their ligand numbers. [Color figure can be viewed at wileyonlinelibrary.com]
Numbers of Metalloproteins in wwPDB as of Feb 2015
| Metal | Number of PDB entries | Number of total metal sites | Metal | Number of PDB entries | Number of total metal sites |
|---|---|---|---|---|---|
| Zn | 9360 | 26,788 | Pb | 48 | 152 |
| Mg | 9145 | 53,896 | Gd | 42 | 197 |
| Ca | 7762 | 24,335 | Tl | 40 | 261 |
| Fe | 6359 | 27,514 | Rb | 37 | 153 |
| Na | 4888 | 16,527 | Sm | 33 | 111 |
| Mn | 2266 | 8138 | Ir | 31 | 48 |
| K | 1673 | 5306 | Pr | 22 | 55 |
| Cu | 1134 | 4397 | Rh | 20 | 46 |
| Ni | 935 | 2252 | Eu | 19 | 61 |
| Co | 915 | 2087 | Pd | 19 | 85 |
| Cd | 758 | 4289 | Ag | 18 | 75 |
| Hg | 528 | 1923 | Os | 14 | 33 |
| Pt | 191 | 629 | Lu | 13 | 56 |
| Mo | 176 | 664 | Ho | 12 | 35 |
| Al | 158 | 351 | Tb | 11 | 32 |
| V | 120 | 364 | Cr | 9 | 21 |
| Ba | 118 | 311 | Ga | 8 | 10 |
| Sr | 118 | 3551 | La | 8 | 18 |
| Ru | 99 | 134 | Sb | 5 | 10 |
| Cs | 88 | 393 | Ce | 4 | 7 |
| W | 76 | 1443 | Er | 2 | 6 |
| Yb | 72 | 177 | In | 2 | 3 |
| Au | 64 | 322 | Bi | 1 | 1 |
| Y | 53 | 202 | Dy | 1 | 30 |
| Li | 52 | 88 | Total | 47,527 | 187,587 |
Figure 2The workflow of metalloprotein CG analysis. The gray ribbons identify specific steps of the overall analysis as described in the methods.
Derived Distance Cutoffs and Parameters for Defining the Coordination Shell of the Five Most Prevalent Metalloproteins in Different Steps
| Step 1 | Step 2 | Step 3 | Step 5 | Step 6 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Metal | Atomic radius (pm) | Initial distance upper cutoff (Å) | The most abundant element | Bond length mean of the most abundant element (Å) | Bond length standard deviation of the most abundant element (Å) | Carbon mean peak (Å) | Element included | Updated distance upper cutoff (Å) | IA small angle removal cutoff (Å) | Random forest cutoff (degrees) |
| Zn | 135 | 3.20 | S | 2.340 | 0.152 | 3.071 | S, O, N | 2.782 | 68 | 60/70 |
| Mg | 150 | 3.35 | O | 2.350 | 0.368 | 3.067 | O, N | 2.892 | 65 | 58/68 |
| Ca | 180 | 3.65 | O | 2.481 | 0.271 | 3.432 | O | 3.092 | 60 | 55/65 |
| Fe | 140 | 3.25 | N | 2.063 | 0.134 | 3.081 | S, O, N | 2.639 | 68 | 63/73 |
| Na | 180 | 3.65 | O | 2.697 | 0.369 | 3.568 | O | 3.317 | 60 | 50/65 |
Figure 3The updated upper Zn bond‐length cutoff for generating final bond‐length statistics. The gray histogram is the detected Zn‐S bond lengths based on canonical CGs. The blue histogram is the fictional Zn‐C bond lengths based on canonical CGs. The red line is the upper bond‐length cutoff used for calculating final bond‐length statistics: (Zn‐S mean + 1 Zn‐S standard deviation + the Zn‐C mode)/2. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 4(A) Zn‐S bond length as a function of crystallographic resolution. (B) Scatter plot of bond length standard deviations versus average crystallographic resolutions by bond type. Symbols show smoothed values for individual metal‐ligand bond types. Symbols are specific to the ligand element. Symbols and regression lines are color‐coded by metal. The magenta‐color line is the combined regression with an r 2 of 0.692 and a p‐value of 5.5 − 10−59. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 5Average normalized bond length deviation histograms of five most abundant metalloproteins. The cutoff 0.91 was derived from the clear bimodal separation in Na, and was applied to all metals, which is represented as the red line in each sub‐graph. Bidentation means that two of the binding ligands come from the same molecule or residue. 3+ multidentation means that three or more of the binding ligands come from the same molecule or residue. They are the contributing factor to the shoulders in each histogram. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 6Bond length distributions and statistics of all bond types involving elements O, N, and S, which have >5% occurrence.
6‐angle Space for All CGs in Figure 1. [Color table can be viewed at wileyonlinelibrary.com]
| CG | Largest | Ordered middle angles, with smallest‐middle, 33‐quantile‐middle, 66‐quantile‐middle, largest‐middle positions are in red | Smallest opposite |
|---|---|---|---|
| 4‐ligand: | |||
| Tet | 109.5 |
| 109.5 |
| Bva | 120 |
| 90 |
| Bvp | 180 |
| 120 |
| Pyv | 180 |
| 90 |
| Spl | 180 |
| 180 |
| 5‐ligand: | |||
| Tbp | 180 |
| 120 |
| Spy | 180 |
| 90 |
| Tpv | 131.8 |
| 70.6 |
| 6‐ligand: | |||
| Oct | 180 |
| 90 |
| Pva | 144 |
| 72 |
| Pvp | 180 |
| 72 |
| Tpr | 131.8 |
| 70.6 |
| 7‐ligand: | |||
| Pbp | 180 |
| 72 |
| Hva | 180 |
| 60 |
| Hvp | 180 |
| 60 |
| Sav | 143.6 |
| 70.5 |
| 8‐ligand: | |||
| Hbp | 180 |
| 60 |
| Sqa | 143.6 |
| 70.5 |
CG abbreviations are based on Figure 1.
Figure 7Chemical functional group and multidentation specific bond length modes. On the left is the overall bidentation short and long arms for each metal and some specific functional groups that contributing to the overall bond length histogram. On the right is a breakdown of the most abundant functional groups in the bidentation and multidentation, as they often exhibit distinct modes. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 8Bond length modification by two different filters: average bond length deviation (left) and X‐ray crystallography resolution (right). The average deviation filter can detect potential misassigned metal ion, and removes the skewed long tails. The resolution filter removes the whole spectrum proportionally and leaves a much smaller number of data for analysis. [Color figure can be viewed at wileyonlinelibrary.com]
Ligand Counts and Error Rates by Metal
| Metal | Number of metal clusters | Number of usable metal sites (>3‐ ligand) | Number of unusable metal sites (<=3‐ligand) | 4‐ligand | 5‐ligand | 6‐ligand | 7‐ligand | 8‐ligand | 9‐ligand | Total | Estimated Ligand Detection Error rate | Nonredundant set |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Zn | 572 | 21,257 | 4959 | 11,380 | 2365 |
|
| ‐ | ‐ | 14,497 | 0.000443 | 4800 |
| Mg | 691 | 29,859 | 23,346 | 3595 | 2941 |
|
|
| ‐ | 12,281 | 0.002113 | 2813 |
| Ca | 196 | 21,057 | 3082 | 918 | 1490 | 4485 | 5399 |
|
| 13,568 | 0.001760 | 4080 |
| Fe | 11,287 | 14,990 | 1237 | 1071 | 3929 |
|
| ‐ | ‐ | 10,806 | 0.000057 | 2370 |
| Na | 240 | 11,475 | 4812 | 703 | 1557 | 1840 | 186 |
| ‐ | 4303 | 0.000000 | 1184 |
| Overall | 0.001128 | |||||||||||
Highest coordination number considered valid for the given metal.
Coordination numbers considered erroneous and thus used in ligand detection error estimation.
Figure 9Smallest angle distributions for the five most abundant metalloproteins. The left histograms show smallest angle propensities broken down by coordination number and metal. The right histograms show smallest angle propensities broken down by ligand type and metal. bi is short for bidentation, which means that the two atoms composing the smallest angle are from the same residue or molecule. aa is short for amino acid, which means that the composing ligands are the 20 standard amino acids. Similarly, nonaa means that at least one of the composing ligands is not the 20 standard amino acids. H2O‐aa means that one of the composing ligand is the 20 standard amino acids and the other is water. And H2O‐both means that both of the composing ligands are water molecules.
Figure 10Scatter plot of the structure‐function Spearman's rank correlation coefficient (rho) as a function of data size for real datasets. On the left panel, red points represent structure‐function correlation coefficients for datasets combined by metal. Green points represent structure‐function correlation coefficients for datasets combined by coordination number. Blue points represent structure‐function correlation coefficients of datasets specific to the metal and coordination number. On the right panel is the same graph with individual metal and coordination number identified. A data size cutoff of 600 is shown as a black line on both panel. For data points with a size smaller than 600, the correlation rho and p‐value are not reliable. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 11The structure‐function Spearman's rank correlation coefficient (rho) as a function of data size for subsampled zinc 4‐ligand datasets. The average rho was calculated for k = 6, 10, and 13 respectively based on 21 independent subsamplings of the original zinc 4‐ligand dataset at each specific dataset size. [Color figure can be viewed at wileyonlinelibrary.com]
Optimal k and Corresponding rho and p‐value for Each Metal and Ligand Number
| Metal | Ligand number | Group | Size | Optimal k | Functionally mapped n_lig | Rho |
| Category |
|---|---|---|---|---|---|---|---|---|
| Zn | 4 | normal | 3300 | 10 | 4 | 0.8976 | 0.0000 | single |
| Zn | 4 | compressed | 94 | ‐ | ‐ | ‐ | ‐ | single |
| Zn | 4 | combined | 3398 | 11 | 4 | 0.9053 | 0.0000 | single |
|
| 5 | normal | 584 | 8 | 2 | 0.6420 | 0.0003 | single |
|
| 5 | compressed | 518 | 7 | 2 | 0.7494 | 0.0001 | single |
| Zn | 5 | combined | 1103 | 7 | 2 | 0.8831 | 0.0000 | single |
|
| 6 | normal | 150 | 6 | 2 | 0.6935 | 0.0041 | single |
|
| 6 | compressed | 128 | ‐ | ‐ | ‐ | ‐ | single |
|
| 6 | combined | 298 | 7 | 3 | 0.8303 | 0.0056 | single |
| Zn | combined | normal | 4034 | 7 | 2 | 0.8545 | 0.0000 | combineNumLig |
| Zn | combined | compressed | 741 | 10 | 3 | 0.3953 | 0.0072 | combineNumLig |
| Zn | combined | combined | 4800 | 9 | 1 | 0.6785 | 0.0000 | combineNumLig |
|
| 4 | normal | 280 | 7 | 1 | 0.2956 | 0.1933 | single |
|
| 4 | compressed | 44 | ‐ | ‐ | ‐ | ‐ | single |
|
| 4 | combined | 326 | 13 | 3 | 0.6215 | 0.0026 | single |
|
| 5 | normal | 530 | 5 | 1 | 0.6848 | 0.0351 | single |
| Mg | 5 | compressed | 74 | ‐ | ‐ | ‐ | ‐ | single |
| Mg | 5 | combined | 608 | 21 | 3 | 0.3553 | 0.0001 | single |
| Mg | 6 | normal | 1665 | 5 | 1 |
| 0.0000 | single |
|
| 6 | compressed | 173 | 8 | 1 | 0.2403 | 0.2180 | single |
| Mg | 6 | combined | 1843 | 6 | 1 | 0.9321 | 0.0000 | single |
| Mg | combined | normal | 2477 | 11 | 1 | 0.6732 | 0.0000 | combineNumLig |
|
| combined | compressed | 319 | 6 | 2 | 0.7964 | 0.0006 | combineNumLig |
| Mg | combined | combined | 2813 | 7 | 3 | 0.6299 | 0.0028 | combineNumLig |
|
| 4 | normal | 293 | 5 | 4 | 0.0857 | 0.9194 | single |
| Ca | 4 | compressed | 88 | ‐ | ‐ | ‐ | ‐ | single |
|
| 4 | combined | 391 | 5 | 1 | 0.2242 | 0.5367 | single |
|
| 5 | normal | 369 | 7 | 3 | 0.5857 | 0.0061 | single |
|
| 5 | compressed | 181 | 6 | 4 | 0.2824 | 0.3078 | single |
|
| 5 | combined | 575 | 5 | 1 | 0.4303 | 0.2180 | single |
| Ca | 6 | normal | 776 | 6 | 3 | 0.7000 | 0.0049 | single |
|
| 6 | compressed | 401 | 8 | 2 | 0.3645 | 0.0572 | single |
| Ca | 6 | combined | 1241 | 8 | 1 | 0.7630 | 0.0000 | single |
|
| 7 | normal | 335 | 8 | 3 | 0.3361 | 0.0809 | single |
| Ca | 7 | compressed | 1077 | 10 | 1 | 0.4929 | 0.0007 | single |
| Ca | 7 | combined | 1518 | 11 | 1 | 0.3527 | 0.0086 | single |
|
| 8 | normal | 80 | ‐ | ‐ | ‐ | ‐ | single |
|
| 8 | compressed | 123 | 4 | 3 | 0.8857 | 0.0333 | single |
|
| 8 | combined | 350 | 10 | 4 | 0.5710 | 0.0003 | single |
| Ca | combined | normal | 1853 | 6 | 2 | 0.8643 | 0.0000 | combineNumLig |
| Ca | combined | compressed | 1870 | 10 | 4 | 0.8664 | 0.0000 | combineNumLig |
| Ca | combined | combined | 4080 | 13 | 4 | 0.8176 | 0.0000 | combineNumLig |
|
| 4 | normal | 184 | 5 | 1 | 0.6688 | 0.0345 | single |
| Fe | 4 | compressed | 38 | ‐ | ‐ | ‐ | ‐ | single |
|
| 4 | combined | 222 | 5 | 1 | 0.5273 | 0.1228 | single |
|
| 5 | normal | 533 | 7 | 4 | 0.9605 | 0.0000 | single |
|
| 5 | compressed | 111 | 4 | 3 | 0.8286 | 0.0583 | single |
| Fe | 5 | combined | 644 | 10 | 4 | 0.8327 | 0.0000 | single |
| Fe | 6 | normal | 1349 | 7 | 1 | 0.9000 | 0.0000 | single |
|
| 6 | compressed | 149 | 4 | 2 | 0.8286 | 0.0583 | single |
| Fe | 6 | combined | 1503 | 7 | 1 | 0.8377 | 0.0000 | single |
| Fe | combined | normal | 2066 | 7 | 1 | 0.6571 | 0.0016 | combineNumLig |
|
| combined | compressed | 298 | 6 | 1 | 0.7571 | 0.0016 | combineNumLig |
| Fe | combined | combined | 2370 | 6 | 1 | 0.7571 | 0.0016 | combineNumLig |
|
| 4 | normal | 212 | 10 | 1 | 0.6049 | 0.0000 | single |
|
| 4 | compressed | 25 | 10 | 2 | 0.4926 | 0.0006 | single |
| Na | 4 | combined | 240 | ‐ | ‐ | ‐ | ‐ | single |
|
| 5 | normal | 360 | 7 | 3 | 0.7239 | 0.0002 | single |
|
| 5 | compressed | 37 | 10 | 1 | 0.3321 | 0.0258 | single |
|
| 5 | combined | 406 | ‐ | ‐ | ‐ | ‐ | single |
|
| 6 | normal | 362 | 7 | 2 | 0.7636 | 0.0001 | single |
|
| 6 | compressed | 82 | 6 | 3 | 0.8036 | 0.0005 | single |
|
| 6 | combined | 471 | ‐ | ‐ | ‐ | ‐ | single |
| Na | combined | normal | 946 | 7 | 3 | 0.8481 | 0.0000 | combineNumLig |
|
| combined | compressed | 173 | 5 | 4 | 0.7939 | 0.0098 | combineNumLig |
| Na | combined | combined | 1184 | 8 | 2 | 0.6085 | 0.0034 | combineNumLig |
| combined | 4 | normal | 4269 | 6 | 3 | 0.9071 | 0.0000 | combineMetal |
|
| 4 | compressed | 289 | 5 | 3 | 0.6606 | 0.0440 | combineMetal |
| combined | 4 | combined | 4577 | 7 | 4 | 0.7610 | 0.0001 | combineMetal |
| combined | 5 | normal | 2376 | 9 | 1 | 0.8077 | 0.0000 | combineMetal |
| combined | 5 | compressed | 921 | 7 | 4 | 0.7753 | 0.0001 | combineMetal |
| combined | 5 | combined | 3336 | 7 | 3 | 0.7831 | 0.0000 | combineMetal |
| combined | 6 | normal | 4302 | 6 | 1 | 0.9464 | 0.0000 | combineMetal |
| combined | 6 | compressed | 933 | 12 | 1 | 0.5146 | 0.0000 | combineMetal |
| combined | 6 | combined | 5356 | 9 | 1 | 0.9019 | 0.0000 | combineMetal |
|
| 7 | normal | 347 | 10 | 3 | 0.5428 | 0.0001 | combineMetal |
| combined | 7 | compressed | 1133 | 12 | 3 | 0.5049 | 0.0000 | combineMetal |
| combined | 7 | combined | 1613 | 12 | 1 | 0.4860 | 0.0000 | combineMetal |
|
| 8 | normal | 82 | 6 | 2 | 0.4857 | 0.3556 | combineMetal |
|
| 8 | compressed | 125 | 4 | 3 | 0.8857 | 0.0333 | combineMetal |
|
| 8 | combined | 360 | 10 | 2 | 0.4951 | 0.0021 | combineMetal |
| combined | combined | normal | 11376 | 7 | 4 | 0.7312 | 0.0002 | combinedAll |
| combined | combined | compressed | 3401 | 7 | 3 | 0.7013 | 0.0006 | combinedAll |
| combined | combined | combined | 15247 | 8 | 4 | 0.8396 | 0.0000 | combinedAll |
The structure‐function correlation rho and associated p‐value is not reliable because of the low count of the data.
Highest correlation rho value for all structure‐function analyses with data counts above 600.
Figure 12Three examples of structural versus functional dendrograms and the characteristic χ 2 probabilities of k‐means clusters. All dendrogram pairs show high similarity between each other, and also match their highest probability canonical CG model descriptions. (A, D) 4‐ligand normal combined metalloproteins. (B, E) 5‐ligand normal combined metalloproteins. (C, F) 6‐ligand normal combined metalloproteins. Similar graphs for the other metalloproteins can be found in the Supporting Information material. [Color figure can be viewed at wileyonlinelibrary.com]
Instances of Largest Size Aberrant Clusters of the Compressed Group for Different Metals. The Complete Cluster Information can be Found in Supporting Information Material
| Metal | Ligand Number | Cluster Number | Size | Angle 1 | Angle 2a | Angle 3a | Angle 4a | Angle 5a | Angle 6a | Tet | Bva | Bvp | Spv | Spl |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Combined | 4 | 7 | 79 | 142.5 ± 12.1 | 56 ± 3.7 | 87.6 ± 8.1 | 99.1 ± 7.4 | 107 ± 8 | 101.3 ± 11 | 0.027 | 0.034 | 0.024 | 0.048 | 0 |
| Tbp | Spy | Tpv | ||||||||||||
| Zn | 5 | 3 | 128 | 148.5 ± 4.6 | 56.5 ± 2.8 | 93.1 ± 3.2 | 101.1 ± 3.4 | 134.6 ± 3.8 | 102.3 ± 3.2 | 0.091 | 0.002 | 0.112 | ||
| Ca | 5 | 6 | 59 | 160.2 ± 5.8 | 51.8 ± 3 | 81.8 ± 4.7 | 91.1 ± 4.5 | 144.5 ± 5.4 | 79.6 ± 7.3 | 0.01 | 0.007 | 0.083 | ||
| Fe | 5 | 1 | 41 | 151.1 ± 8.1 | 58 ± 3.5 | 91.1 ± 3.6 | 101.9 ± 3.7 | 134.2 ± 7.7 | 100.4 ± 6.8 | 0.008 | 0 | 0.004 | ||
| Combined | 5 | 2 | 202 | 156.5 ± 6.8 | 56 ± 3.4 | 90.4 ± 4.2 | 102.8 ± 3.5 | 124.1 ± 4.9 | 105.7 ± 5.1 | 0.095 | 0 | 0.064 | ||
| Oct | Tpr | Pvp | Pva | |||||||||||
| Mg | 6 | 8 | 41 | 175.6 ± 2.2 | 84.3 ± 2.7 | 90.2 ± 1.2 | 96.6 ± 2.1 | 158.7 ± 2.9 | 58.8 ± 1.9 | 0.342 | 0 | 0 | 0.166 | |
| Ca | 6 | 3 | 75 | 161.3 ± 4 | 72.9 ± 3.7 | 84.1 ± 3.1 | 104.7 ± 5.6 | 155.2 ± 2.8 | 51.3 ± 2.7 | 0.01 | 0.071 | 0.063 | 0.072 | |
| Fe | 6 | 2 | 54 | 172.9 ± 3.6 | 81.3 ± 3.5 | 90.2 ± 1.7 | 97.4 ± 2.2 | 164.6 ± 4.6 | 60.6 ± 3.5 | 0.025 | 0 | 0 | 0.004 | |
| Combined | 6 | 2 | 107 | 174.3 ± 3.1 | 82.3 ± 3.2 | 90.2 ± 1.8 | 97.2 ± 2.4 | 157.5 ± 2.9 | 58.4 ± 2.9 | 0.177 | 0.003 | 0 | 0.11 |
Angle positions are based on the 6‐angle space description.
CG abbreviations are based on Figure 1.
Instances of Highly Aberrant Clusters of the Compressed Group for Different Metals. The Complete Cluster Information can be Found in Supporting Information Material
| Metal | Ligand Number | Cluster Number | Size | Angle 1 | Angle 2 a | Angle 3 a | Angle 4 a | Angle 5 a | Angle 6 a | Tet | Bva | Bvp | Spv | Spl |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Combined | 4 | 2 | 41 | 155.4 ± 10.8 | 54.9 ± 5.6 | 79.2 ± 9.1 | 107.3 ± 13.4 | 134.5 ± 10.8 | 90.5 ± 15.5 | 0 | 0.013 | 0 | 0.001 | 0 |
| Tbp | Spy | Tpv | ||||||||||||
| Zn | 5 | 5 | 61 | 164.2 ± 5.7 | 56.8 ± 4.7 | 87.3 ± 4.1 | 104.5 ± 3.6 | 123.6 ± 5.9 | 103.8 ± 5.7 | 0.038 | 0.001 | 0.022 | ||
| Ca | 5 | 3 | 14 | 144 ± 8.4 | 56.1 ± 6.9 | 74.2 ± 5.9 | 86 ± 7 | 126.7 ± 7.6 | 62.6 ± 8.5 | 0.001 | 0.008 | 0.017 | ||
| Fe | 5 | 3 | 22 | 147.4 ± 4.5 | 59.2 ± 6.1 | 87.2 ± 6 | 96 ± 4.3 | 141 ± 4.7 | 78.9 ± 8.5 | 0 | 0 | 0 | ||
| Combined | 5 | 5 | 121 | 156.1 ± 8 | 53.1 ± 4.3 | 80.6 ± 5.7 | 95.5 ± 9 | 141.6 ± 7.2 | 75.2 ± 6 | 0.006 | 0.006 | 0.063 | ||
| Oct | Tpr | Pvp | Pva | |||||||||||
| Mg | 6 | 4 | 13 | 166.8 ± 4.5 | 61.2 ± 5.8 | 87.1 ± 3.7 | 104 ± 5.8 | 155.5 ± 6.1 | 63 ± 4.4 | 0 | 0 | 0 | 0 | |
| Ca | 6 | 4 | 45 | 169 ± 5.2 | 51 ± 3.1 | 84.6 ± 3.7 | 102.2 ± 4.9 | 159 ± 5.9 | 73.5 ± 6.5 | 0.012 | 0.032 | 0.004 | 0.067 | |
| Fe | 6 | 4 | 21 | 162.4 ± 8 | 59.8 ± 3.2 | 87.8 ± 3 | 104.8 ± 4.6 | 155 ± 7.9 | 70.2 ± 6.8 | 0 | 0 | 0 | 0.001 | |
| Combined | 6 | 10 | 88 | 168.3 ± 4.3 | 52.7 ± 4.8 | 85.8 ± 4 | 104.9 ± 4.9 | 155.6 ± 6.5 | 72 ± 5.7 | 0.015 | 0.03 | 0.003 | 0.048 |
Angle positions are based on the 6‐angle space description.
CG abbreviations are based on Figure 1.
Figure 13PDB structure and electron density maps of examples from clusters listed in Table VI. Aberrant CG structures are shown in balls and sticks, featured by bidentated compressed angles. These structures are also supported by their fitness to the electron density maps. All structures were generated in LiteMol Viewer 54, with 2F o – F c at 1.5σ and F o – F c at −3σ (red) and 3σ (green), except for panel E with 2F o – F c at 1.01 σ. Metal ions are put at the center of each subgraph with larger size, where Zn is represented as light blue, Fe as purple, and Mg and Ca as green. The cluster identifier, PDB metal site ID, and its resolutions are as follows: A, 5‐ligand Zn, cluster 3, 2B13.B.401, resolution 1.55 Å; B, 5‐ligand Ca, cluster 6, 3RYD.C.267, resolution 2.37 Å; C, 5‐ligand Fe, cluster 1, 4AM4.A.1161, resolution 1.68 Å; D, 6‐ligand Mg, cluster 8, 3ETH.A.402, resolution 1.60 Å; E, 6‐ligand Ca, cluster 3, 4P99.B.509 resolution 1.80 Å; F, 6‐ligand Fe, cluster 2, 2GYQ.B.404, resolution 1.40 Å. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 14PDB structure and electron density maps of examples from clusters listed in Table VII. Aberrant CG structures are shown in balls and sticks, featured by bidentated compressed angles. These structures are also supported by their fitness to the electron density maps. All structures were generated in LiteMol Viewer 54, with 2F o – F c at 1.5σ and F o – F c at −3σ (red) and 3σ (green), except for panel C with 2F o – F c at 1.02 σ. Metal ions are put at the center of each subgraph with larger size, where Zn is represented as light blue, Fe as purple, and Mg and Ca as green. The cluster identifier, PDB metal site ID, and its resolution are as follows: A, 5‐ligand Zn, cluster 5, 2R2D.A.277, resolution 1.75 Å; B, 5‐ligand Ca, cluster 3, 3HR4.H.202, resolution 2.50 Å; C, 5‐ligand Fe, cluster 3, 2VZB.B.6204; resolution 2.30 Å; D, 6‐ligand Mg, cluster 4, 3CVJ.C.243, resolution 2.00 Å; E, 6‐ligand Ca, cluster 4, 1LHV.A.401, resolution 2.00 Å; F, 6‐ligand Fe, cluster 4, 3DHI.A.601, resolution 1.68 Å. [Color figure can be viewed at wileyonlinelibrary.com]
Enriched GO Terms, Having Corrected p‐value ≤0.05 and Consistent Enrichment Patterns in combineLig and Individual Ligand Enrichments
| Normal | Compress | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Id | Description | Type | IPR.group | Consistent | p | padjust | metal | perc | sig | p | padjust | metal | perc | sig |
|
| Cellular biosynthetic process | BP | BO | TRUE | 4.419 e −08 | 6.595 e −06 | zn | 0.4264 | TRUE | 1.000 e +00 | 1.000 e +00 | mg | 0.3297 | FALSE |
|
| Pyridine‐containing compound metabolic process | BP | BU | TRUE | 1.518 e −03 | 4.483 e− 02 | mg | 0.4430 | TRUE | 9.996 e −01 | 1.000 e +00 | mg | 0.5000 | FALSE |
|
| Cellular nitrogen compound metabolic process | BP | CJ | TRUE | 8.436 e −11 | 2.359 e −08 | zn | 0.4413 | TRUE | 1.000 e +00 | 1.000 e +00 | mg | 0.3211 | FALSE |
|
| Monovalent inorganic cation transmembrane transporter activity | MF | DE | TRUE | 1.273 e −03 | 3.796 e− 02 | fe | 0.4242 | TRUE | 9.997 e −01 | 1.000 e +00 | ca | 0.7500 | FALSE |
|
| Reactive oxygen species metabolic process | BP | F | TRUE | 2.465 e −04 | 9.952 e −03 | fe | 0.4821 | TRUE | 9.999 e −01 | 1.000 e +00 | zn | 1.0000 | FALSE |
|
| Organonitrogen compound biosynthetic process | BP | G | TRUE | 2.530 e −05 | 1.320 e −03 | mg | 0.3933 | TRUE | 1.000 e +00 | 1.000 e +00 | mg | 0.4865 | FALSE |
|
| Organic acid biosynthetic process | BP | GL | TRUE | 9.988 e −04 | 3.090 e −02 | mg | 0.3543 | TRUE | 9.996 e −01 | 1.000 e +00 | mg | 0.3077 | FALSE |
|
| Small molecule biosynthetic process | BP | GL | TRUE | 1.993 e −05 | 1.108 e −03 | mg | 0.4000 | TRUE | 1.000 e +00 | 1.000 e +00 | mg | 0.3333 | FALSE |
|
| Carboxylic acid biosynthetic process | BP | GL | TRUE | 9.988 e −04 | 3.090 e −02 | mg | 0.3543 | TRUE | 9.996 e −01 | 1.000 e +00 | mg | 0.3077 | FALSE |
|
| Regulation of cellular process | BP | GN | TRUE | 2.047 e −06 | 1.459 e −04 | zn | 0.4454 | TRUE | 1.000 e +00 | 1.000 e +00 | zn | 0.4545 | FALSE |
|
| Response to stimulus | BP | GN | TRUE | 2.192 e −06 | 1.530 e −04 | mg | 0.3825 | TRUE | 1.000 e +00 | 1.000 e +00 | ca | 0.4860 | FALSE |
|
| Regulation of biological quality | BP | CQ | TRUE | 9.999 e −01 | 1.000 e +00 | fe | 0.2778 | FALSE | 1.661 e− 04 | 6.807 e −03 | fe | 0.3878 | TRUE |
|
| Sulfuric ester hydrolase activity | MF | FV | TRUE | 9.999 e −01 | 1.000 e +00 | ca | 0.3333 | FALSE | 6.402 e −04 | 2.187 e −02 | ca | 0.4286 | TRUE |
|
| Ion transport | BP | I | TRUE | 1.000 e +00 | 1.000 e +00 | mg | 0.3889 | FALSE | 7.213 e −08 | 6.392 e −06 | ca | 0.4364 | TRUE |