| Literature DB >> 29145826 |
Sérgio E D Dias1,2, Ana Mafalda Martins1, Quoc T Nguyen1,2, Abel J P Gomes3,4.
Abstract
BACKGROUND: Protein cavities play a key role in biomolecular recognition and function, particularly in protein-ligand interactions, as usual in drug discovery and design. Grid-based cavity detection methods aim at finding cavities as aggregates of grid nodes outside the molecule, under the condition that such cavities are bracketed by nodes on the molecule surface along a set of directions (not necessarily aligned with coordinate axes). Therefore, these methods are sensitive to scanning directions, a problem that we call cavity ground-and-walls ambiguity, i.e., they depend on the position and orientation of the protein in the discretized domain. Also, it is hard to distinguish grid nodes belonging to protein cavities amongst all those outside the protein, a problem that we call cavity ceiling ambiguity.Entities:
Keywords: Cavity detection; Gaussian kernel function; GaussianFinder; Pocket detection
Mesh:
Substances:
Year: 2017 PMID: 29145826 PMCID: PMC5691400 DOI: 10.1186/s12859-017-1913-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The protein 1A7X with 2155 atoms: (a) the inner surface; (b) the outer surface; (c) the inner surface with 4 out of 10 cavity locations determined by GaussianFinder (in red) and their homologous cavity locations set by the PDBsum ground truth (in blue)
Fig. 2Flowchart of the GaussianFinder method
Fig. 3Gaussian surfaces and cavity locations determined by GaussianFinder (in red) and their homologous cavity locations set by the PDBsum ground truth (in blue) of: (a) the protein 1B2L with 1969 atoms and 2 out of 7 cavities; (b) the protein 1A58 with 1365 atoms and 3 out of 7 cavities; (c) the protein 148L with 1323 atoms and 4 out of 7 cavities
Performance of benchmarking detection methods for apo proteins in terms of: (d) distance (FN) false negatives to PDBsum ground-truth cavity centers; (TP) true positives; (FP) false positives; (TN) true negatives; (S ) sensitivity; (S ) specificity; (a) accuracy; (r ) ratio of detected ground-truth cavities; and (C ) cumulative number of undetected ground-truth cavities
| GaussianFinder | ConCavity | POCASA | SURFNET | PASS | GHECOM | Fpocket | 3V | KVFinder | |
|---|---|---|---|---|---|---|---|---|---|
|
| 7100 | 1512 | 2310 | 1737 | 4915 | 4562 | 5869 | 3751 | 4578 |
|
| 188 | 107 | 176 | 1227 | 884 | 255 | 195 | 417 | 476 |
|
| 227 | 127 | 226 | 520 | 117 | 380 | 207 | 291 | 305 |
|
| 182 | 203 | 297 | 432 | 157 | 413 | 257 | 148 | 193 |
|
| 7697 | 1949 | 3009 | 3014 | 6073 | 5610 | 6528 | 4607 | 5552 |
|
| 1033 | 2103 | 863 | 902 | 325 | 581 | 878 | 1017 | 979 |
|
| 2045 | 860 | 1697 | 1247 | 3086 | 2540 | 2827 | 1782 | 1889 |
|
| 393 | 207 | 289 | 216 | 44 | 500 | 148 | 428 | 467 |
|
| 0.951 | 0.904 | 0.912 | 0.933 | 0.993 | 0.918 | 0.978 | 0.915 | 0.922 |
|
| 0.664 | 0.290 | 0.663 | 0.580 | 0.905 | 0.814 | 0.763 | 0.636 | 0.659 |
|
| 0.872 | 0.549 | 0.803 | 0.792 | 0.961 | 0.883 | 0.901 | 0.815 | 0.837 |
|
| 0.944 | 0.239 | 0.369 | 0.369 | 0.745 | 0.688 | 0.801 | 0.565 | 0.681 |
|
| 453 | 6201 | 5141 | 5136 | 2077 | 2540 | 1622 | 3543 | 2598 |
Performance of benchmarking detection methods for holo proteins in terms of: (d) distance (FN) false negatives to PDBsum ground-truth cavity centers; (TP) true positives; (FP) false positives; (TN) true negatives; (S ) sensitivity; (S ) specificity; (a) accuracy; (r ) ratio of detected ground-truth cavities; and (C ) cumulative number of undetected ground-truth cavities
|
| GaussianFinder | ConCavity | POCASA | SURFNET | PASS | GHECOM | Fpocket | 3V | KVFinder |
|---|---|---|---|---|---|---|---|---|---|
|
| 16081 | 3133 | 5234 | 3668 | 11738 | 10438 | 14063 | 9174 | 12493 |
|
| 366 | 239 | 338 | 2574 | 2100 | 571 | 432 | 703 | 719 |
|
| 410 | 296 | 419 | 1063 | 281 | 789 | 406 | 811 | 813 |
|
| 334 | 488 | 609 | 925 | 360 | 932 | 504 | 349 | 418 |
|
| 17191 | 4156 | 6600 | 8230 | 14479 | 12730 | 15405 | 12049 | 14443 |
|
| 2460 | 2155 | 2083 | 1806 | 634 | 1278 | 2151 | 1564 | 1941 |
|
| 3231 | 916 | 2673 | 1559 | 4080 | 4968 | 3423 | 2476 | 2725 |
|
| 440 | 362 | 214 | 227 | 207 | 658 | 391 | 511 | 553 |
|
| 0.975 | 0.919 | 0.969 | 0.973 | 0.986 | 0.951 | 0.975 | 0.959 | 0.963 |
|
| 0.568 | 0.298 | 0.562 | 0.463 | 0.866 | 0.795 | 0.614 | 0.612 | 0.584 |
|
| 0.876 | 0.668 | 0.801 | 0.828 | 0.957 | 0.901 | 0.881 | 0.875 | 0.873 |
|
| 0.963 | 0.233 | 0.369 | 0.461 | 0.811 | 0.713 | 0.863 | 0.675 | 0.809 |
|
| 659 | 13694 | 11250 | 9620 | 3371 | 5120 | 2445 | 5801 | 3407 |
Fig. 4Cumulative cavity percentage (100. r ) of various detection methods in function of the distance d to ground-truth geometric centers for: (a) apo structures; and (b) holo structures
Fig. 5GaussianFinder on GPU: (a) experimental time performance; (b) experimental memory space occupancy