| Literature DB >> 31823742 |
Shan Wang1, Xiuzhen Hu2, Zhenxing Feng1, Xiaojin Zhang1, Liu Liu1, Kai Sun1, Shuang Xu1.
Abstract
BACKGROUND: In many important life activities, the execution of protein function depends on the interaction between proteins and ligands. As an important protein binding ligand, the identification of the binding site of the ion ligands plays an important role in the study of the protein function.Entities:
Keywords: Binding site; Ion ligand; SMO algorithm; Sequence information
Mesh:
Substances:
Year: 2019 PMID: 31823742 PMCID: PMC6905020 DOI: 10.1186/s12860-019-0237-9
Source DB: PubMed Journal: BMC Mol Cell Biol ISSN: 2661-8850
Benchmark datasets of the sequence segment with length 17
| Ion ligand | Protein chains | Positive set | Negative set |
|---|---|---|---|
| NO2− | 22 | 98 | 8144 |
| CO32− | 62 | 316 | 22,766 |
| SO42− | 303 | 2125 | 99,729 |
| PO43− | 339 | 2168 | 112,279 |
| Zn2+ | 1428 | 6408 | 405,113 |
| Cu2+ | 117 | 485 | 33,948 |
| Fe2+ | 92 | 382 | 29,345 |
| Fe3+ | 217 | 1057 | 68,829 |
| Ca2+ | 1237 | 6789 | 396,957 |
| Mg2+ | 1461 | 5212 | 480,307 |
| Mn2+ | 459 | 2124 | 156,625 |
| Na+ | 78 | 489 | 27,408 |
| K+ | 57 | 535 | 18,777 |
| Co2+ | 194 | 875 | 55,050 |
Fig. 1Violin plot of positive and negative segments of amino acid composition of SO42−
Fig. 2The position conservation of positive and negative amino acid in SO42−. Note: the left figure indicates the position conservation in positive sequence segments and the right figure indicates the position conservation in negative sequence segments
The hydropathy characteristic of amino acid
| Classification | Amino Acids | Classification | Amino Acids |
|---|---|---|---|
| strongly hydrophilic | R,D,E,N,Q,K,H | Proline | P |
| strongly hydrophobic | L,I,V,A,M,F | Glycine | G |
| weakly hydrophilic | S,T,Y,W | Cysteine | C |
Fig. 3The results of SO42−‘s evaluation index under different window sizes. Note: (a) MCC values of SO42− under different window sizes; (b) Acc values of SO42− under different window sizes; (c) Sn values of SO42− under different window sizes; (d) Sp values of SO42− under different window sizes
Recognition results of ion binding sites based on component information
| Ligand | Sn (%) | Sp (%) | Acc (%) | MCC |
|---|---|---|---|---|
| NO2− | 65.3 | 57.1 | 61.2 | 0.225 |
| CO32− | 68.7 | 67.7 | 68.2 | 0.364 |
| SO42− | 68.9 | 67.2 | 68.0 | 0.360 |
| PO43− | 67.8 | 63.8 | 65.8 | 0.435 |
| Zn2+ | 75.9 | 81.6 | 78.8 | 0.576 |
| Cu2+ | 73.8 | 80.6 | 77.2 | 0.546 |
| Fe2+ | 77.0 | 80.1 | 78.5 | 0.571 |
| Fe3+ | 75.1 | 78.1 | 76.6 | 0.533 |
| Ca2+ | 68.1 | 72.5 | 70.3 | 0.406 |
| Mg2+ | 66.7 | 74.8 | 70.8 | 0.417 |
| Mn2+ | 72.5 | 77.3 | 74.9 | 0.498 |
| Na+ | 72.6 | 70.8 | 71.7 | 0.434 |
| K+ | 75.0 | 75.7 | 75.3 | 0.507 |
| Co2+ | 71.7 | 74.3 | 73.0 | 0.460 |
Recognition results of ion binding sites based on position conservation information
| Ligand | Sn (%) | Sp (%) | Acc (%) | MCC |
|---|---|---|---|---|
| NO2− | 73.5 | 61.2 | 67.4 | 0.350 |
| CO32− | 67.4 | 78.5 | 72.9 | 0.462 |
| SO42− | 74.2 | 71.8 | 73.0 | 0.460 |
| PO43− | 75.3 | 79.4 | 77.4 | 0.548 |
| Zn2+ | 91.6 | 87.2 | 89.4 | 0.789 |
| Cu2+ | 84.5 | 88.9 | 86.7 | 0.735 |
| Fe2+ | 90.1 | 79.6 | 84.8 | 0.700 |
| Fe3+ | 80.9 | 85.7 | 83.3 | 0.667 |
| Ca2+ | 69.4 | 75.7 | 72.5 | 0.451 |
| Mg2+ | 69.5 | 76.5 | 73.0 | 0.461 |
| Mn2+ | 78.1 | 84.4 | 81.2 | 0.626 |
| Na+ | 65.8 | 73.2 | 69.5 | 0.392 |
| K+ | 73.8 | 58.7 | 66.3 | 0.329 |
| Co2+ | 75.7 | 84.1 | 79.9 | 0.600 |
Comparison results with SVM
| Ligand | Method | Sn (%) | Sp (%) | Acc (%) | MCC |
|---|---|---|---|---|---|
| Zn2+ | OUR’S | 94.2 | 84.2 | 89.2 | 0.789 |
| SMO | 94.8 | 83.7 | 89.3 | 0.790 | |
| SVM | 99.8 | 99.5 | 99.7 | 0.993 | |
| Cu2+ | OUR’S | 91.3 | 86.8 | 89.0 | 0.782 |
| SMO | 90.3 | 88.9 | 89.6 | 0.792 | |
| SVM | 95.5 | 97.1 | 96.3 | 0.926 | |
| Fe2+ | OUR’S | 90.1 | 81.9 | 86.0 | 0.722 |
| SMO | 89.3 | 82.5 | 85.9 | 0.719 | |
| SVM | 91.9 | 90.7 | 91.3 | 0.826 | |
| Fe3+ | OUR’S | 86.2 | 85.5 | 85.9 | 0.717 |
| SMO | 85.5 | 86.0 | 85.8 | 0.715 | |
| SVM | 86.9 | 88.7 | 87.8 | 0.756 | |
| Ca2+ | OUR’S | 68.8 | 75.3 | 72.1 | 0.443 |
| SMO | 69.5 | 75.4 | 72.5 | 0.450 | |
| SVM | 71.3 | 79.1 | 74.8 | 0.502 | |
| Mg2+ | OUR’S | 71.1 | 73.1 | 72.1 | 0.442 |
| SMO | 70.0 | 72.3 | 71.2 | 0.423 | |
| SVM | 76.6 | 73.9 | 75.3 | 0.505 | |
| Mn2+ | OUR’S | 82.0 | 83.9 | 83.0 | 0.659 |
| SMO | 80.3 | 83.3 | 81.8 | 0.636 | |
| SVM | 82.1 | 84.4 | 83.2 | 0.664 | |
| Na+ | OUR’S | 68.9 | 74.0 | 71.0 | 0.430 |
| SMO | 70.8 | 71.8 | 71.3 | 0.425 | |
| SVM | 82.2 | 76.2 | 79.4 | 0.586 | |
| K+ | OUR’S | 71.6 | 64.5 | 68.0 | 0.362 |
| SMO | 74.2 | 62.6 | 68.4 | 0.371 | |
| SVM | 77.3 | 83.2 | 80.3 | 0.607 | |
| Co2+ | OUR’S | 75.3 | 86.4 | 80.9 | 0.621 |
| SMO | 75.1 | 86.2 | 80.6 | 0.616 | |
| SVM | 80.8 | 85.1 | 83.0 | 0.660 | |
| NO2− | OUR’S | 80.6 | 88.8 | 84.7 | 0.696 |
| SMO | – | – | – | – | |
| SVM | – | – | – | – | |
| CO32− | OUR’S | 79.4 | 81.6 | 80.5 | 0.611 |
| SMO | – | – | – | – | |
| SVM | – | – | – | – | |
| SO42− | OUR’S | 75.6 | 72.2 | 74.0 | 0.478 |
| SMO | – | – | – | – | |
| SVM | – | – | – | – | |
| PO43− | OUR’S | 76.2 | 78.0 | 77.1 | 0.542 |
| SMO | – | – | – | – | |
| SVM | – | – | – | – |