| Literature DB >> 31965404 |
Gustav Olanders1, Hiba Alogheli1, Peter Brandt2, Anders Karlén3.
Abstract
Macrocycles represent an important class of medicinally relevant small molecules due to their interesting biological properties. Therefore, a firm understanding of their conformational preferences is important for drug design. Given the importance of macrocycle-protein modelling in drug discovery, we envisaged that a systematic study of both classical and recent specialized methods would provide guidance for other practitioners within the field. In this study we compare the performance of the general, well established conformational analysis methods Monte Carlo Multiple Minimum (MCMM) and Mixed Torsional/Low-Mode sampling (MTLMOD) with two more recent and specialized macrocycle sampling techniques: MacroModel macrocycle Baseline Search (MD/LLMOD) and Prime macrocycle conformational sampling (PRIME-MCS). Using macrocycles extracted from 44 macrocycle-protein X-ray crystallography complexes, we evaluated each method based on their ability to (i) generate unique conformers, (ii) generate unique macrocycle ring conformations, (iii) identify the global energy minimum, (iv) identify conformers similar to the X-ray ligand conformation after Protein Preparation Wizard treatment (X-rayppw), and (v) to the X-rayppw ring conformation. Computational speed was also considered. In addition, conformational coverage, as defined by the number of conformations identified, was studied. In order to study the relative energies of the bioactive conformations, the energy differences between the global energy minima and the energy minimized X-rayppw structures and, the global energy minima and the MCMM-Exhaustive (1,000,000 search steps) generated conformers closest to the X-rayppw structure, were calculated and analysed. All searches were performed using relatively short run times (10,000 steps for MCMM, MTLMOD and MD/LLMOD). To assess the performance of the methods, they were compared to an exhaustive MCMM search using 1,000,000 search steps for each of the 44 macrocycles (requiring ca 200 times more CPU time). Prior to our analysis, we also investigated if the general search methods MCMM and MTLMOD could also be optimized for macrocycle conformational sampling. Taken together, our work concludes that the more general methods can be optimized for macrocycle modelling by slightly adjusting the settings around the ring closure bond. In most cases, MCMM and MTLMOD with either standard or enhanced settings performed well in comparison to the more specialized macrocycle sampling methods MD/LLMOD and PRIME-MCS. When using enhanced settings for MCMM and MTLMOD, the X-rayppw conformation was regenerated with the greatest accuracy. The, MD/LLMOD emerged as the most efficient method for generating the global energy minima.Entities:
Keywords: Conformational sampling; Drug design; Macrocycles
Mesh:
Substances:
Year: 2020 PMID: 31965404 PMCID: PMC7036058 DOI: 10.1007/s10822-020-00277-2
Source DB: PubMed Journal: J Comput Aided Mol Des ISSN: 0920-654X Impact factor: 3.686
Fig. 1A graphical summary of the study design
Structures of the Macrocycles in the Tautomer/Ionization States Used for Conformational Analysis
The PDB code of the complex structure are shown to the lower left, whereas the ligand name (when given) and the ligand code are shown to the upper left and lower right, respectively. Macrocycles that were included in the subset are marked with “subset” to the upper right.
Characteristics of the Full Data set Consisting of 44 Macrocycles
| Property | Average | Median | Minimum | Maximum |
|---|---|---|---|---|
| PDB resolution (Å) | 1.88 | 1.88 | 0.95 | 2.50 |
| Ring size | 17 | 16 | 11 | 29 |
| #Torsional angles sampleda | 25 | 23 | 8 | 47 |
| Molecular weight | 571 | 538 | 280 | 1041 |
| DonorHBb | 2.5 | 2.0 | 0 | 9.3 |
| AcceptHBc | 12.0 | 11.2 | 5.3 | 26.9 |
| QPlogPo/wd | 2.7 | 2.8 | − 2.6 | 6.8 |
| PSAe | 142 | 124 | 71 | 411 |
All descriptors were calculated using QikProp, except for ring size and the number of torsional angles sampled, which were calculated by hand. For 2XYT descriptors were calculated using Instant JChem [83]
aNumber of torsional angles sampled during the MCMM and MTLMOD conformational searches
bNumber of hydrogen bond donors
cNumber of hydrogen bond acceptors
dCalculated octanol/water partition coefficient
ePolar surface area
Comparing different strategies to generate non-biased starting conformers
| RMSD (Å)a | |||
|---|---|---|---|
| Conformer | < 1 Å | 1 Å–2 Å | > 2 Å |
| Energy minimized X-rayppw ligand | 40 | 5 | 0 |
| Starting conformer | 0 | 7 | 38 |
| SMILES conformer | 4 | 15 | 26 |
aRMSD for the conformer identified with the lowest RMSD value to the X-rayppw ligand. The conformers are, dependent on their calculated RMSD values, divided into three different groups with RMSD values: below 1 Å, between 1–2 Å, and greater than 2 Å
Computational times used in the conformational analysis of the ten macrocycles in the diverse subset using two different minimization methods
| Conformational search method | Energy minimization method | Computational timea |
|---|---|---|
| MCMMb | PRCGc | 856 |
| MCMMb | TNCGd | 544 |
| MTLMODe | PRCGc | 4109 |
| MTLMODe | TNCGd | 551 |
aThe sum total of computational time (minutes) consumption for conformational analysis of ten macrocycles
bMonte Carlo Multiple Minimum
cPolak-Ribiere Conjugated Gradient
dTruncated Newton Conjugated Gradient
eMixed torsional/Low-mode
Summary of conformational analysis settings and results of the 10 macrocycles in the diverse subset
| Method | Ring opening | Ring closure distance (å) | No. confa | No. unique ring confb | CPU time (min)c | Global energy minimum found for no. Macrocyclesd | Best fit conformation RMSD (Å)e | ||
|---|---|---|---|---|---|---|---|---|---|
| < 1 Å | 1 Å–2 Å | > 2 Å | |||||||
| MCMM | Standard | 0.5–2.5 | 40,507 | 2482 | 666 | 4 | 7 | 0 | 3 |
| MCMM | Standard | 0.1–5.0 | 40,946 | 5158 | 646 | 3 | 8 | 1 | 1 |
| MCMM | Standard | 0–100 | 36,034 | 9326 | 813 | 5 | 7 | 3 | 0 |
| MCMM | Moved | 0.5–2.5 | 41,538 | 2811 | 614 | 3 | 6 | 2 | 2 |
| MCMM-Enhanced | Moved | 0–100 | 38,037 | 9402 | 800 | 5 | 7 | 3 | 0 |
| MTLMOD | Standard | 0.5–2.5 | 29,417 | 4082 | 714 | 4 | 6 | 1 | 3 |
| MTLMOD-Enhanced | Moved | 0–100 | 32,489 | 7574 | 704 | 3 | 8 | 2 | 0 |
aThe sum total of conformers generated
bThe sum total of unique ring conformations generated
cThe sum total of computational time (minutes) used for conformational analysis
dNumber of macrocycles where the lowest energy conformer was identified or a conformer with an energy difference no greater than 1 kJ mol−1
eRMSD for the conformer identified with the lowest RMSD value to the X-ray ligand after protein preparation treatment (X-rayppw). The conformers are, dependent on their RMSD values, divided into three different groups with RMSD values: below 1 Å, between 1 Å–2 Å, and greater than 2 Å
Summary of the conformational analysis settings for the evaluated methods and literature protocols
| Method | Number of search steps | Energy window (kcal mol−1) | Torsion sampling optiona | Elimination of redundant conformationsb | Ring closure distance (Å)c | Placement of ring openingd | Energy minimization methode | Maximum energy minimization iterations | Energy minimization threshold (kJ Å−1 mol−1) | Force field | Solvent |
|---|---|---|---|---|---|---|---|---|---|---|---|
| MCMM defaultf | 1000 | 5.02 | Intermediate | AD 0.5 Å | 0.5–2.5 | Standard | PRCG | 2500 | 0.05 | OPLS-3 | water |
| MTLMOD defaultf | 1000 | 5.02 | Intermediate | AD 0.5 Å | 0.5–2.5 | Standard | PRCG | 2500 | 0.05 | OPLS-3 | water |
| MD/LLMOD defaultf | 5000 MD, 5000 LLMOD | 10 | Enhanced | RMSD 0.75 Å | NAg | NAg | NAVh | 50,000 | 0.01 | OPLS-3 | water |
| MD/LLMOD | 5000 MD, 5000 LLMOD | 15.01 | Extended | RMSD 0.75 Å | NAg | NAg | NAVh | 50,000 | 0.01 | OPLS-3 | water |
| MCMM | 10,000 | 15.01 | Extended | AD 0.5 Å | 0.5–2.5 | Standard | TNCG | 50,000 | 0.05 | OPLS-3 | water |
| MCMM-Enhanced | 10,000 | 15.01 | Extended | AD 0.5 Å | 0 – 100 | Stereocenters avoided | TNCG | 50,000 | 0.05 | OPLS-3 | water |
| MCMM-Exhaustive | 1,000,000 | 15.01 | Extended | AD 0.5 Å | 0–100 | Stereocenters avoided | TNCG | 50,000 | 0.05 | OPLS-3 | water |
| MTLMOD | 10,000 | 15.01 | Extended | AD 0.5 Å | 0.5–2.5 | Standard | TNCG | 50,000 | 0.05 | OPLS-3 | water |
| MTLMOD-Enhanced | 10,000 | 15.01 | Extended | AD 0.5 Å | 0–100 | Stereocenters avoided | TNCG | 50,000 | 0.05 | OPLS-3 | water |
| PRIME-MCS | Spinroot 10i | 100 | Peptide bonds | Torsional fingerprint | NAg | NAg | TNCG | Chain minimizationj | 0.04 | OPLS-2005 | vacuum |
| CF-MTLMODk | 10,000 (400 RotStep)l | 15 | Intermediate | RMSD 0.25 Å | 0.5–2.5 | Standard | PRCG | 3000 | 0.05 | OPLS-2005 | water |
| CF-MD/LLMODk | 5000 MD, 5000 LLMOD | 15 | Enhanced | RMSD 0.25 Å | NAg | NAg | NAVh | 50,000 | 0.01 | OPLS-2005 | water |
| CF-LowModeMD MOEk | 10,000 | 15 | NAg | RMSD 0.25 Å | NAg | NAg | NAVh | 500 | 0.021 | MMFF94x | water |
aIntermediate—Sample C–N and C–O single bonds other than in standard amides and esters; Enhanced—Sample all C–N and C–O single bonds; Extended—Sample all C–N and C–O single bonds and C = N and N = N double bonds. Sampling of peptide bonds are allowed (“peptide bonds”)
bAtom deviation (AD): A conformation is unique if one (or several) of the defined atoms deviates more than specified, from the compared conformations after superposition. Root Mean Square Deviation (RMSD): A conformation is unique if the RMSD value between two conformations exceeds the specified value. Torsional Fingerprint: Two conformations are considered redundant if they have identical torsional fingerprints
cRe-close ring system if the ring closure atoms are within the defined distance range
dPlacement of the macrocyclic ring opening bond could be adjacent to a stereocenter using the automatic setup (standard)
eEnergy minimization method. Polak-Ribiere Conjugated Gradient (PRCG). Truncated Newton Conjugated Gradient (TNCG)
fDefault refers to the predefined values in Schrödinger
gNot Applicable
hNot Available
iSpinroot 1, and 10 generating up to 100 and 1000 conformations, respectively
jChain energy minimization, starts with a conjugate gradient followed by a Truncated Newton minimization
kEnhanced settings presented by I-Chen and Foloppe
lLimits the total number of search steps as a function of the number of rotatable bond. Only active if multiple ligands are sampled simultaneously
Summary of conformational analysis results for the full data set of 44 macrocycles
| Method | No. confa | No. unique ring confb | Computational time (min)c | Global energy minimum found for no. macrocyclesd | Best fit conformation RMSD (Å)e | RMSDRING (Å)f | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| < 1 Å | 1 Å–2 Å | > 2 Å | < 0.5 Å | 0.5 Å–1 Å | > 1 Å | |||||
| Energy minimized X-rayppw ligand | NAg | NAg | NAg | NAg | 40 | 5 | 0 | 45 | 1 | 0 |
| Starting conformer | NAg | NAg | NAg | NAg | 0 | 7 | 38 | 3 | 19 | 24 |
| SMILES conformer | NAg | NAg | NAg | NAg | 4 | 15 | 26 | 5 | 25 | 16 |
| MCMMh | ||||||||||
| Average | 155,296 | 18,950 | 3740 | 48% (63/132) | 37 | 6 | 2 | 38 | 8 | 0 |
| Max | 159,973 | 20,274 | 4138 | NAg | 42 | 0 | 3 | 42 | 2 | 2 |
| Min | 150,558 | 17,840 | 3340 | NAg | 32 | 13 | 0 | 35 | 11 | 0 |
| MTLMODh | ||||||||||
| Average | 117,490 | 23,367 | 4125 | 48% (64/132) | 35 | 8 | 2 | 39 | 6 | 1 |
| Max | 121,147 | 25,015 | 4468 | NAg | 39 | 5 | 1 | 42 | 1 | 3 |
| Min | 113,512 | 21,804 | 3806 | NAg | 31 | 10 | 4 | 37 | 9 | 0 |
| MCMM-Enhancedh | ||||||||||
| Average | 149,831 | 49,324 | 4642 | 52% (68/132) | 40 | 5 | 0 | 44 | 2 | 0 |
| Max | 155,304 | 52,181 | 5022 | NAg | 41 | 4 | 0 | 45 | 1 | 0 |
| Min | 144,227 | 46,906 | 4325 | NAg | 35 | 10 | 0 | 40 | 6 | 0 |
| MTLMOD-Enhancedh | ||||||||||
| Average | 134,396 | 41,040 | 4182 | 50% (66/132) | 36 | 9 | 0 | 45 | 1 | 0 |
| Max | 137,988 | 42,589 | 4515 | NAg | 41 | 3 | 1 | 46 | 0 | 0 |
| Min | 130,954 | 39,443 | 3932 | NAg | 32 | 13 | 0 | 41 | 5 | 0 |
| MD/LLMODi | ||||||||||
| Average | 45,917 | 19,189 | 4163 | 55% (24/44) | 31 | 11 | 3 | 38 | 8 | 0 |
| Max | NA | NA | NA | NAg | NA | NA | NA | NA | NA | NA |
| Min | NA | NA | NA | NAg | NA | NA | NA | NA | NA | NA |
| PRIME-MCSh | ||||||||||
| Average | 31,118 | 24,953 | 9791 | NAg | 24 | 19 | 2 | 35 | 11 | 0 |
| Max | 31,286 | 25,122 | 9804 | NAg | 24 | 19 | 2 | 37 | 9 | 0 |
| Min | 30,952 | 24,795 | 9779 | NAg | 22 | 21 | 2 | 34 | 12 | 0 |
| MCMM-Exhaustive | 7,528,356 | 967,844 | 925,026 | 45 | 44 | 1 | 0 | 46 | 0 | 0 |
aThe sum total of conformers generated
bThe sum total of unique ring conformations
cThe sum total of computational time (minutes) used for conformational analysis
dNumber of macrocycles where the lowest energy conformer was identified or a conformer with an energy difference not greater than 1 kJ mol−1 and an RMSD below 0.1 Å to the global energy conformer using all heavy atoms
eRMSD for the conformer identified with the lowest RMSD value to the X-ray ligand after protein preparation treatment (X-rayppw). The conformers are, dependent on their RMSD values, divided into three different groups with RMSD values: below 1 Å, between 1–2 Å, and greater than 2 Å
fRMSDRING for the conformer identified with the lowest RMSDRING value to the heavy ring atoms in the X-ray ligand after protein preparation treatment. The conformers are, dependent on their RMSD values, divided into three different groups with RMSD values: below 0.5 Å, between 0.5–1 Å, and greater than 1 Å. 3BXR consist of two macrocyclic rings, therefore 46 (instead of 45) RMSDRING values are presented
gNot Applicable
hRun three time using different seeds.
iMD/LLMOD was run one time
Fig. 2Relationship between rotatable bonds defined as the number of torsion angles sampled per macrocycle and the number of times the global energy minima were identified using the different methods (13 runs per macrocycle using 6 different methods)
Fig. 3All heavy-atoms RMSD. The mean RMSD value was used for those methods that were run more than one time. The cumulative performance describing how successful the methods are at generating a conformer close to the X-rayppw conformation is shown. The performance is benchmarked against the energy minimized X-rayppw conformations (shown by the pink line near the bottom) and an exhaustive MCMM run (1,000,000 search steps shown by the purple line at the bottom)
Fig. 4Ring atoms RMSD. The mean RMSD value was used for those methods that were run more than one time. The cumulative performance describing how successful the methods are at generating a conformer close to the X-rayppw conformation is shown. The performance is benchmarked against the energy minimized X-rayppw conformations (shown by the pink line near the bottom) and an exhaustive MCMM run (1,000,000 search steps shown by the purple line at the bottom)
Summary of the X-ray accuracy reported in the literature
| Method | Authors | Data set (total no. structures/PDB structures) | % Below 1 Å RMSDa | Median RMSD (Å)b | Median RMSDRING (Å)c |
|---|---|---|---|---|---|
| MD/LLMOD | Chen and Foloppe [ | Chen and Foloppe (30/30) | |||
| CF-MTLMODe | 79 | NAd | NAd | ||
| MOE LowModeMD | 72 | NAd | NAd | ||
| Stochastic search | 53 | NAd | NAd | ||
| MD/LLMOD | Watts et al. [ | Watts et al. (150/67) | |||
| MD/LLMOD | Sindhikara et al. [ | Watts et al. (208/60) 60 PDB structures from Watts et al. | |||
| PRIME-MCS | NAd | 1.49 (PDB) | 0.40 (PDB) | ||
| MOE LowModeMD | NAd | 1.69 (PDB) | 0.41 (PDB) | ||
| Molecular dynamics simulation (24 ns) | NAd | 1.89 (PDB) | 0.56 (PDB) | ||
| BRIKARD | Coutsias et al. [ | Coutsias et al. (67/39) | NAd | NAd | 0.47 (all), 0.42 (PDB) |
| CF-MD/LLMODe | NAd | NAd | 0.54 (all), 0.47 (PDB) | ||
| MD/LLMOD | |||||
| CF-LowModeMDe | NAd | NAd | 0.64 (all), 0.54 (PDB) | ||
| PLOP | Wang et al. [ | Wang et al. (37/12) | NAd | NAd | 0.25 (70% below 0.5 Å) |
| MD/LLMOD | NAd | ||||
| CF-MTLMODe | Cleves and Jain [ | Chen and Foloppe (30/30) | NAd | NAd | NAd |
| CF-LowModeMDe | NAd | NAd | NAd | ||
| ForceGen | NAd | NAd | NAd | ||
| MCMM | Current work 2019 | Alogheli and Watts et al. (44/44) 31 PDB structures from Watts el al. | 78 | 0.58 | 0.16 |
| MCMM-Enhanced | 84 | 0.58 | 0.16 | ||
| MTLMOD-Enhanced | 89 | 0.59 | 0.17 | ||
| MTLMOD | 78 | 0.77 | 0.18 | ||
| MD/LLMOD | |||||
| PRIME-MCS spinroot 30 | 51 | 0.98 | 0.27 |
In all the publications above (exception of ForceGen), the MD/LLMOD method (shown in bold) has been included, which allows it to serve as a reference method
aPercent of macrocycles in the data set that the methods successfully generated a conformer below 1 Å RMSD to the X-ray conformation using all heavy atoms
bMedian RMSD using all heavy atoms
cMedian RMSD using all only the heavy atoms in the macrocyclic ring
dNot Applicable
eOptimized settings presented by Chen and Foloppe
Fig. 5Number of generated conformers for the ten macrocycles in the diverse subset within: a 15 kcal mol−1; b 10 kcal mol−1; and c 5 kcal mol−1 from the lowest energy conformer using 1,000,000 search steps in total. The discontinuities in the lines are due to elimination of high energy conformers when a new “global energy minimum” is generated during the search. The line for 1S22 in plot (B) do not reach 1 million steps because only up to 100,000 conformers within 10 kcal mol−1 are registered in the .log-file