| Literature DB >> 35639953 |
A A Aptekmann1,2, J Buongiorno3, D Giovannelli2,4,5, M Glamoclija6, D U Ferreiro7, Y Bromberg1.
Abstract
MOTIVATION: Metal binding proteins have a central role in maintaining life processes. Nearly one-third of known protein structures contain metal ions that are used for a variety of needs, such as catalysis, DNA/RNA binding, protein structure stability, etc. Identifying metal binding proteins is thus crucial for understanding the mechanisms of cellular activity. However, experimental annotation of protein metal binding potential is severely lacking, while computational techniques are often imprecise and of limited applicability.Entities:
Year: 2022 PMID: 35639953 PMCID: PMC9272798 DOI: 10.1093/bioinformatics/btac358
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.931
mebipred performance versus MetalDetector2
| Precision(%) | Recall(%) | |||
|---|---|---|---|---|
| Ligand |
| MetalDetector2 |
| |
| Zn | 817 | 63 | 90 | 70 |
| Fe(Heme) | 234 | 67 | 93 | 77 |
| Fe(Fe-S) | 202 | 68 | 97 | 67 |
| Cu | 87 | 57 | 96 | 64 |
Note:We report Heme and Fe-S performance separately although both methods predict Fe binding without further specification.
Fig. 1.mebipred outperforms BLAST in identifying metal-binding proteins and peptides. (A) At all cutoffs, mebipred (MBP; dashed line) is more precise than BLAST (solid line). For example, at the default cutoff (score = 0.4; black dot) it achieves 71% precision at 75% recall, as compared to 29% precision attained by BLAST at a similar recall. (B) mebipred also outperforms BLAST in identifying the metal-binding propensity of proteins from their 50 amino acid fragments. For example, for half of the fragments, it attains 67% accuracy, as compared to 35% attained by BLAST
mebipred performance across metals
| ANN | AUROC | AUPRC | Prec | Rec | F1 |
|---|---|---|---|---|---|
| metal-binding | 0.91 | 0.83 | 0.71 | 0.75 | 0.73 |
| Fe | 0.95 | 0.95 | 0.96 | 0.91 | 0.94 |
| Ca | 0.86 | 0.91 | 0.91 | 0.77 | 0.83 |
| Na | 0.83 | 0.83 | 0.86 | 0.68 | 0.76 |
| K | 0.91 | 0.91 | 0.88 | 0.84 | 0.86 |
| Mg | 0.82 | 0.82 | 0.79 | 0.8 | 0.8 |
| Mn | 0.91 | 0.91 | 0.89 | 0.83 | 0.86 |
| Cu | 0.97 | 0.97 | 0.98 | 0.92 | 0.95 |
| K | 0.91 | 0.91 | 0.88 | 0.84 | 0.86 |
| Co | 0.85 | 0.85 | 0.89 | 0.71 | 0.79 |
| Ni | 0.91 | 0.86 | 0.84 | 0.67 | 0.75 |
| Zn | 0.9 | 0.92 | 0.95 | 0.7 | 0.8 |
AUCs, F1, Precision, and Recall are reported at a cutoff = 0.4 for the first (metal-binding, MB) tier and at cutoff = 0.5 for the second tier of per ion mebipred predictions; in both cases, the default cutoffs are established via F1max. At the cutoff = 0.5, the first tier model attains 0.92 precision at 0.26 recall (see Supplementary Data: AUPRCTraining_folds for per-fold performance).
mebipred accuracy versus other methods
| Ligand |
|
| MIB ( |
|
|---|---|---|---|---|
| Sequence | Sequence | Structure | Sequence | |
| Ca | 74.8 | 75.4 | 94.1 | 86.7 |
| Co | 83 | 85.3 | 94.7 | 86.2 |
| Cu | 96.3 | 78.1 | 95.3 | 87.2 |
| Fe2 | 91.3 | 75.6 | 95.1 | 89.2 |
| Fe3 | 87.8 | 74 | 94.9 | 89.2 |
| K | 80.3 | – | – | 74.0 |
| Mg | 75.3 | 74 | 94.6 | 75.6 |
| Mn | 83.2 | 68.8 | 95.0 | 89.7 |
| Na | 79.4 | 79.4 | – | 84.5 |
| Ni | – | 90.7 | 94.7 | 79.2 |
| Zn | 83 | 69 | 94.8 | 82.2 |
Fig. 2.Prediction of metal-binding in Black Sea microbiomes. The points on the graph indicate the relative abundance of ion-binding proteins (left y-axis) predicted from metagenomic samples collected at different depths of the Black Sea (x-axis). The black line represents the Euclidean distance (right y-axis) between the vectors of predicted abundances at sequential depths; line markers are placed between the depth measurements in each comparison. Samples show a phase transition (large Euclidean distance) at the photosynthetic limit (60–100 m) (Callieri ; Gorlenko )
Fig. 3.Differential abundance of metal-binding proteins across environments. Each bar represents the relative abundance of predicted metal-binding proteins (y-axis) in a metagenomic sample (four per environment; x-axis). Concentration of these proteins per environment (column colors and sizes) is similar within and is different across environments, suggesting signature metal ion preferences