| Literature DB >> 31392432 |
Joel Wahl1, Joel Freyss2, Modest von Korff3, Thomas Sander3.
Abstract
The <span class="Chemical">Platinumn> dataset of protein-bound ligand conformations was used to benchmark the ability of the MMFF94s force field to generate bioactive conformations by minimization of randomly generated conformers. Torsion angle parameters that generally caused wrong geometries were reparameterized by conducting dihedral scans using ab initio calculations at the <span class="Gene">MP2 level. This reparameterization resulted in a systematic improvement of generated conformations.Entities:
Keywords: Conformer generation; Force field parameterization; MMFF94s
Year: 2019 PMID: 31392432 PMCID: PMC6686419 DOI: 10.1186/s13321-019-0371-6
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Fig. 1Example representations of torsion fragments. Example 1 refers to the dihedral angle of the rotation around a C–C bond, whereas C1 is part of a delocalized system, neighboring two C-atoms with two neighbors each (n2C). C2 is a non-aromatic C (!aC) bound to a nitrogen atom. In example 2, we again refer to the rotation around a C1–C2 bond. C1 is a non-aromatic carbon, bound to a non-aromatic carbon with less than 4 neighboring atoms (C!a,n<4; sp2 or sp hybridized). C2 is an amide carbonyl carbon. Example 3 is a combination of Examples 1 and 2, where C1 is part of a delocalized system, neighboring two Cn2 atoms, and C2 is an amide carbonyl carbon
Summed up ratio of bioactive molecules recovered with varying RMSD thresholds for different force fields
| RMSD | Platinum Diverse Dataset 2017_01 (2581 molecules) | ||
|---|---|---|---|
| MMFF94s | OPLS3 | MM2 | |
| < 0.5 Å | 0.53 | 0.55 | 0.43 |
| < 1.0 Å | 0.89 | 0.90 | 0.81 |
| < 1.5 Å | 0.98 | 0.97 | 0.92 |
| < 2.0 Å | 0.99 | 0.99 | 0.96 |
| Failure rate (%) | 0.8 | 0.8 | 9.7 |
Note that the original Platinum Diverse Dataset 2017_01 consists of 2859 molecules. The 2581 molecules correspond to the subset that could successfully be processed by all three force fields. The failure rate indicates the percentage of the structure of the total dataset that could not be processed by the force field
Fig. 2Torsional fragments for which the MMFF94s-minimized structures exhibited systematic deviation of the central dihedral angle compared to the experimental structures
Fig. 3Torsional energy profiles for various substructures with problematic torsion angles for the MMFF94s. The four atoms defining the scanned torsion angles are indicated by asterisks. Conformational energies are given in kJ/mol. These structures are not present as ligands in the platinum dataset. Torsion histograms from the CSD with relative frequencies were added for several compounds
Newly derived torsion parameters obtained by fitting the torsional profiles to the PES obtained from torsion scans conducted at the MP2 level
| TT-I-J-K-L | A-B-C-D | V1, V2, V3 |
|---|---|---|
| 0-0-10-37-0 | *-(NC=O)-CB-* | 0.0, 2.7, 0.0 |
| 0-3-10-64-66 | (C=ON)-(NC=O)-C5B-N5B | − 0.31, − 5.82, 0.0 |
| 0-28-10-64-66 | (HNCO)-(NC=O)-C5B-N5B | 0.00, 1.29, 0.00 |
| 0-3-10-64-63 | (C=ON)-(NC=O)-C5B-C5A | 0.00, 7.97, 0.00 |
| 0-28-10-64-63 | (HNCO)-(NC=O)-C5B-C5A | 0.00, 7.97, 0.00 |
| 0-3-10-64-64 | (C=ON)-(NC=O)-C5B-C5B | 2.47, − 3.26, 0.00 |
| 0-28-10-64-64 | (HNCO)-(NC=O)-C5B-C5B | 0.00, 4.76, 0.00 |
| 0-3-10-64-65 | (C=ON)-(NC=O)-C5B-N5A | − 0.76, 8.14, 0.00 |
| 0-28-10-64-65 | (HNCO)-(NC=O)-C5B-N5A | 0.00, 5.27, 0.00 |
| 0-44-63-10-3 | STHI-C5A)-(NC=O)-(C=ON) | − 11.5, 6.60, 0.00 |
| 0-44-63-10-28 | STHI-C5A)-(NC=O)-(HNCO) | 0.00, 3.34, 0.00 |
Atom types: 0 = wild card, 3 = carbonyl carbon, 10 = amide nitrogen, 28 = amide hydrogen, 37 = aromatic carbon, 44 = thiophene sulfur, 63 = alpha carbon, 64 = beta carbon, 65 = alpha aromatic heterocyclic 5-ring nitrogen, 66 = beta aromatic heterocyclic 5-ring nitrogen. TT corresponds to the torsion type index [18]
Newly derived torsional parameters for phenylpyrroles
| TT-I-J-K-L | V1, V2, V3 |
|---|---|
| 1-0-37-39-0 | 0.0, 2.6, 0.0 |
Summed up ratios of bioactive molecules recovered with different RMSD values
| RMSD | Phenylpyrroles (63 molecules) | |||||
|---|---|---|---|---|---|---|
| MMFF94s | MMFF94s new | OPLS3 | MMFF94s | MMFF94s new | OPLS3 | |
| < 0.5 Å | 0.34 | 0.43 | 0.41 | 0.52 | 0.59 | 0.52 |
| < 1.0 Å | 0.85 | 0.89 | 0.89 | 0.87 | 0.90 | 0.83 |
| < 1.5 Å | 0.98 | 0.98 | 0.98 | 1.00 | 0.97 | 0.98 |
| < 2.0 Å | 0.99 | 0.98 | 0.99 | 1.00 | 0.98 | 1.00 |
Fig. 4Plot of RMSDs between the closest low-energy conformer and the bioactive conformation for the original MMFF94s against the MMFF94s with the new parameters (MMFF94s_new) for 2833 structures from the Platinum Diverse Dataset. Data points where the unsigned difference between the two RMSD values is less than 0.1 are colored orange, data points where the MMFF94s_new performs better are shown in blue, whereas the grey data points indicate a better performance by the original MMFF94s