| Literature DB >> 20845954 |
Jacob D Durrant1, J Andrew McCammon.
Abstract
As high-throughput biochemical screens are both expensive and labor intensive, researchers in academia and industry are turning increasingly to virtual-screening methodologies. Virtual screening relies on scoring functions to quickly assess ligand potency. Although useful for in silico ligand identification, these scoring functions generally give many false positives and negatives; indeed, a properly trained human being can often assess ligand potency by visual inspection with greater accuracy. Given the success of the human mind at protein-ligand complex characterization, we present here a scoring function based on a neural network, a computational model that attempts to simulate, albeit inadequately, the microscopic organization of the brain. Computer-aided drug design depends on fast and accurate scoring functions to aid in the identification of small-molecule ligands. The scoring function presented here, used either on its own or in conjunction with other more traditional functions, could prove useful in future drug-discovery efforts.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20845954 PMCID: PMC2964041 DOI: 10.1021/ci100244v
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 4.956
Figure 1Schematic of a simple neural network. All neural networks have an input layer, through which information about the system to be analyzed is passed, and an output layer, which encodes the results of the analysis. Optional hidden layers receive input from the input layer and transmit it to the output layer, allowing for even more complex behavior.
Figure 2Accuracy of protein−ligand complex characterization. The x axis shows the size of the training set, and the y axis shows the percent accuracy. Each data point represents the average accuracy of 10 independent neural networks with one hidden layer of five neurodes. Error bars represent standard deviations. In blue are shown the accuracies with which the various networks were able to characterize the binding constants of the protein−ligand complexes in their respective training sets. In green are shown the accuracies with which the various networks were able to characterize the binding constants of the complexes in their respective validation sets. In purple is shown the likelihood that a given protein−ligand complex has a Kd value less than 25 μM given that the network predicts high-affinity binding (i.e., the true-positive rate when the respective validation sets were analyzed). In red is shown the likelihood that a given protein−ligand complex has a binding affinity greater than 25 μM given that the network predicts poor binding (i.e., the true-negative rate when the respective validation sets were analyzed).
Figure 3Average score (N) over 24 networks as a function of the experimentally measured Kd value. To facilitate visualization, the data were ordered by log10(Kd) value. Moving averages of both the log10(Kd) values and the associated N values were calculated over 100 points. This data-averaged function (shown in black) crosses the x axis at 25 μM [log10(25 × 10−6) = −4.60, shown as a dotted line]. Individual, unaveraged data points are shown in gray.
Results of a Small Virtual Screen against Influenza N1 Neuraminidase Used to Test the Novel Neural-Network Scoring Functiona
| top 10 | EF | rankZMR | rankOMR | rankPMR | |
|---|---|---|---|---|---|
| Vina | 2/3 | 6.9 | 10 | 55 | 5 |
| NN1 | 2/3 | 6.9 | 24 | 4 | 1 |
| NN2 | 1/3 | 3.4 | 16 | 31 | 1 |
| NN3 | 3/3 | 10.3 | 8 | 9 | 2 |
| 3/3 | 10.3 | 7 | 10 | 2 |
Five scoring functions compared: AutoDock Vina score, predictions of the top three individual neural networks (NN1, NN2, and NN3, respectively), and average prediction of the top 24 networks (N).
Number of known inhibitors that ranked in the top ten for each scoring function.
Enrichment factor when the top 10 ligands were considered.
Rank of the known inhibitor zanamivir.
Rank of the known inhibitor oseltamivir.
Rank of the known inhibitor peramivir.
Results of a Small Virtual Screen against Tb REL1 Used to Test the Novel Neural-Network Scoring Functiona
| top 10 | EF | rankATP | rank | rank | |
|---|---|---|---|---|---|
| Vina | 2/3 | 6.9 | 8 | 18 | 9 |
| NN1 | 1/3 | 3.4 | 99 | 2 | 25 |
| NN2 | 1/3 | 3.4 | 2 | 16 | 24 |
| NN3 | 2/3 | 6.9 | 7 | 1 | 69 |
| 2/3 | 6.9 | 8 | 5 | 27 |
Five scoring functions compared: AutoDock Vina score, predictions of the top three individual neural networks (NN1, NN2, and NN3, respectively), and average prediction of the top 24 networks (N).
Number of known inhibitors that ranked in the top 10 for each scoring function.
Enrichment factor when the top 10 ligands were considered.
Rank of the known inhibitor ATP.
Rank of the known inhibitor V1.
Rank of the known inhibitor S5.