Application of ESMACS binding free energy protocols to diverse datasets: Bromodomain-containing protein 4

Wright, David W.; Wan, Shunzhou; Meyer, Christophe; van Vlijmen, Herman; Tresadern, Gary; Coveney, Peter V.

doi:10.1038/s41598-019-41758-1


Article
Open access
Published: 12 April 2019


Scientific Reports volume 9, Article number: 6017 (2019)


Abstract

As the application of computational methods in drug discovery pipelines becomes more widespread it is increasingly important to understand how reproducible their results are and how sensitive they are to choices made in simulation setup and analysis. Here we use ensemble simulation protocols, termed ESMACS (enhanced sampling of molecular dynamics with approximation of continuum solvent), to investigate the sensitivity of the popular molecular mechanics Poisson-Boltzmann surface area (MMPBSA) methodology. Using the bromodomain-containing protein 4 (BRD4) system bound to a diverse set of ligands as our target, we show that robust rankings can be produced only through combining ensemble sampling with multiple trajectories and enhanced solvation via an explicit ligand hydration shell.


Introduction

The discovery and design of novel drugs is immensely expensive, with one study putting the cost of each new therapeutic molecule that reaches the clinic at US$1.8 billion¹. A variety of computational approaches, in particular binding free energy calculations which rely on physics-based molecular dynamics (MD) simulations, have been developed², and blind tests show that many have considerable predictive potential^3,4. In this context, recent developments in algorithms and hardware that have reduced the cost and time of these computational approaches have increased their appeal to the pharmaceutical industry^5,6,7,8,9. With commercial approaches that claim accuracy of below 1 kcal mol⁻¹ now on the market¹⁰, it is becoming increasingly important to understand the accuracy of, and the uncertainties inherent in, different approaches¹¹. These concerns echo wider interest in the scientific community in the lack of reproducible results in the published literature^12,13.

One of the most common computational binding affinity prediction techniques is molecular mechanics Poisson–Boltzmann surface area (MMPBSA)¹⁴. This is an approximate post-processing end-state method, which uses continuum solvent models to reduce the computational cost of obtaining results. The speed and ease of setup (compared to rigorous free energy calculations) make MMPBSA an attractive candidate for use throughout the drug discovery pipeline. However, results are often seen to be system dependent and are widely perceived to be less accurate than those obtained from more expensive and theoretically rigorous approaches (such as free energy perturbation, FEP, and thermodynamic integration, TI)^2,15. Furthermore, the term MMPBSA as used in the literature covers a wide range of variants which incorporate different sampling strategies (for example, all ligand conformers can be drawn from simulation of the complex or from independent runs) and differing solvation and entropy terms. Our previous work has demonstrated that MMPBSA analysis of single simulations is highly unreliable, with calculations initiated from the same structures varying by up to 12 kcal mol⁻¹ for small molecules bound to HIV-1 protease and even more for flexible ligands binding to MHC^16,17. This served as the inspiration for our ESMACS (enhanced sampling of molecular dynamics with approximation of continuum solvent) protocols, which use ensemble simulations that have been shown to produce results with reproducible uncertainties of less than 2 kcal mol⁻¹ for a range of systems^9,16,18. In this work we seek to assess the performance of the approach on a challenging dataset containing a highly varied set of ligands which interact with water in the protein binding site. We assess the impact on protocol performance of multiple trajectory sampling, ligand parameterization, inclusion of explicit water molecules and a recently developed approach to calculating the entropic contribution to the binding free energy.

The target of our investigation is the bromodomain-containing protein 4 (BRD4). Bromodomains are a major and rapidly evolving focus for the pharmaceutical industry with inhibitors targeting them having shown promising pre-clinical efficacy in pathologies ranging from cancer to inflammation. BRD4, in particular, has recently become something of a benchmark system for free energy calculations^15,19,20,21, including for those based on MMPBSA²².

Computational Methods

The principle behind the ESMACS family of protocols is that many short simulations provide better sampling than single long simulations, facilitating the rapid and reproducible calculation of binding affinities using variations of MMPBSA. The ESMACS simulation and analysis workflow has been automated using the Binding Affinity Calculator (BAC)²³ which we have recently enhanced using Radical Cybertools^24,25 to create HTBAC²⁶. The goal of HTBAC is to provide a programmable interface to create computational pipelines built from selected software tools and services, and execute them on remote resources. It automates much of the complexity of running and marshalling the molecular dynamics simulations, as well as collecting and analyzing data.

Our ESMACS protocols are flexible, allowing for the analysis to be tailored to the target system. Previous targets we have studied include small molecule inhibitors of HIV proteins^18,27,28, kinases^8,29 and larger more flexible ligands such as peptides which bind to MHC¹⁷. In all these studies correlation coefficients of better than 0.7 were obtained. MMPBSA is most commonly used to assess binding affinities from a single trajectory of a protein bound to its target ligand but in this work we explore the influence of protein and ligand flexibility using independent trajectories.

Free energy of binding computations

When two reactants combine at constant temperature and pressure the binding affinity is characterized by the change in Gibbs free energy, ΔG. MMPBSA is an endpoint free energy calculation; in such methods ΔG is calculated using:

$${\rm{\Delta }}G=\langle {G}_{complex}\rangle -\langle {G}_{receptor}\rangle -\langle {G}_{ligand}\rangle ,$$

(1)

where $\langle {G}_{complex}\rangle $, $\langle {G}_{receptor}\rangle $ and $\langle {G}_{ligand}\rangle $ are the average values of the Gibbs free energy for the complex, receptor (protein) and ligand respectively.

Sampling of the complex and its two components can be performed independently, or conformations of the receptor and ligand can be extracted from simulation of the complex. The latter approach is more commonly used due to its improved convergence behaviour, a consequence of cancellation between the noisy terms describing the internal energy of the ligand, receptor and complex³⁰. However, recent work has indicated that adaptation energies associated with confining the receptor and ligand in a complex can differ significantly even for closely related complexes⁹. Here we investigate a range of ESMACS protocols incorporating different component sampling strategies. When both the receptor and ligand contributions are computed from the complex trajectory we designate this a “1traj protocol”. When all three derive from independent trajectories we refer to this as the “3traj protocol”, and when only one of the receptor or ligand contributions does so, a “2traj protocol”. A suffix (either -fl or -fr, for flexible ligand and receptor respectively) is added to the protocol name to signify which component is derived from the independent simulation. Additional variants involve the use of the average receptor contribution across the complex simulations for all comparable ligands, which is indicated with an -ar (averaged receptor) suffix in the protocol name. A summary of all of the protocols, describing from which simulation component data is obtained, is given in Table 1. It should be noted that the statistical performance of each of the protocol pairs 1traj-ar and 2traj-fr, and 2traj-ar and 3traj, is the same, as the receptor contribution in all cases is constant. Consequently, we do not analyze the 3traj or 2traj-fr protocols explicitly.
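The protocol definitions above can be sketched in a few lines of Python. This is only an illustration of how Eq. (1) is assembled under each naming scheme, assuming per-frame free energies are already available as plain lists; the dictionary keys, the function name and the toy numbers are hypothetical and are not part of BAC, HTBAC or MMPBSA.py.

```python
from statistics import mean

# Origin of the receptor and ligand terms for each protocol, as described in
# the text: "C" = complex ensemble, "R"/"L" = separate receptor/ligand
# ensembles, "avgC" = receptor energy averaged over the complex simulations
# of all comparable ligands. The complex term always comes from "C".
PROTOCOLS = {
    "1traj":    {"receptor": "C",    "ligand": "C"},
    "2traj-fl": {"receptor": "C",    "ligand": "L"},
    "2traj-fr": {"receptor": "R",    "ligand": "C"},
    "3traj":    {"receptor": "R",    "ligand": "L"},
    "1traj-ar": {"receptor": "avgC", "ligand": "C"},
    "2traj-ar": {"receptor": "avgC", "ligand": "L"},
}


def binding_free_energy(protocol, energies, receptor_average=None):
    """Evaluate Eq. (1) for one ligand under the named protocol.

    energies: dict of per-frame free energies (kcal/mol) with the
        hypothetical keys 'complex', 'receptor_C', 'ligand_C',
        'receptor_R' and 'ligand_L'.
    receptor_average: constant receptor energy used by the -ar variants.
    """
    source = PROTOCOLS[protocol]
    g_complex = mean(energies["complex"])
    if source["receptor"] == "avgC":
        g_receptor = receptor_average
    else:
        g_receptor = mean(energies["receptor_" + source["receptor"]])
    g_ligand = mean(energies["ligand_" + source["ligand"]])
    return g_complex - g_receptor - g_ligand


# Toy numbers, purely illustrative.
example = {
    "complex": [-110.0, -112.0],
    "receptor_C": [-60.0, -62.0],
    "ligand_C": [-20.0, -22.0],
    "receptor_R": [-63.0, -65.0],
    "ligand_L": [-24.0, -26.0],
}
for name in PROTOCOLS:
    print(name, binding_free_energy(name, example, receptor_average=-62.0))
```

Because the -ar variants and the independent-receptor variants both subtract a receptor term that is the same for every ligand, they shift all predictions by a constant and leave rankings unchanged, which is why only one member of each pair needs to be analysed.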

Table 1 Summary of the origin of component contributions in 6 ESMACS protocols indicating whether they come from the ensemble of simulations run for the complex (C) or separate ensembles performed for the receptor (R) and ligands (L).


The binding free energy change calculated by MMPBSA (${\rm{\Delta }}{G}_{MMPBSA}$) can be broken down into a number of components:

$${\rm{\Delta }}{G}_{MMPBSA}={\rm{\Delta }}{G}_{ele}^{MM}+{\rm{\Delta }}{G}_{vdW}^{MM}+{\rm{\Delta }}{G}_{{\rm{i}}{\rm{n}}{\rm{t}}}^{MM}+{\rm{\Delta }}{G}_{nonpol}^{sol}+{\rm{\Delta }}{G}_{pol}^{sol},$$

(2)

where ${\rm{\Delta }}{G}_{ele}^{MM}$, ${\rm{\Delta }}{G}_{vdW}^{MM}$ and ${\rm{\Delta }}{G}_{{int}}^{MM}$ are the electrostatic, van der Waals and the internal bonded contributions to the molecular mechanics free energy difference, respectively, and ${\rm{\Delta }}{G}_{pol}^{sol}$ and ${\rm{\Delta }}{G}_{nonpol}^{sol}$ are the polar and non-polar solvation terms, respectively.

The MMPBSA.py³¹ program, provided as part of the AmberTools 14 package³², was used in the evaluation of all components of the MMPBSA calculation. The electrostatic free energy of solvation, ${\rm{\Delta }}{G}_{pol}^{sol}$, is the part of the calculation described by the Poisson-Boltzmann (PB) calculation. Default values were used for the PB calculation (grid spacing of 0.5 Å, internal and external dielectric constants of 1 and 80, respectively). The non-polar solvation free energy is calculated from the solvent accessible surface area using the traditional one component method (specified using inp = 1 in the input file). In this approach the surface tension, γ, is set to 0.00542 kcal mol⁻¹ Å⁻² and the off-set, β, to 0.92 kcal mol⁻¹. The fill ratio parameter was set to 4.0, which does not impact the results but ensures the stability of the calculations. For calculations in which explicit water molecules were incorporated as part of the receptor, the closest N molecules to the ligand were chosen for inclusion.
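To make the one component non-polar term concrete, the following sketch evaluates it for a complex, receptor and ligand using the γ and β values quoted above, assuming the usual linear form γ × SASA + β. The surface areas are hypothetical placeholders rather than values from this study, and the function names are illustrative.

```python
GAMMA = 0.00542  # surface tension, kcal mol^-1 A^-2 (value quoted in the text)
BETA = 0.92      # off-set, kcal mol^-1 (value quoted in the text)


def nonpolar_solvation(sasa):
    """One component non-polar solvation free energy for a single species.

    sasa: solvent accessible surface area in square angstroms.
    """
    return GAMMA * sasa + BETA


def delta_nonpolar(sasa_complex, sasa_receptor, sasa_ligand):
    """Non-polar contribution to the binding free energy (kcal/mol)."""
    return (nonpolar_solvation(sasa_complex)
            - nonpolar_solvation(sasa_receptor)
            - nonpolar_solvation(sasa_ligand))


# Hypothetical surface areas: binding buries 400 square angstroms in total.
print(delta_nonpolar(sasa_complex=7000.0, sasa_receptor=6800.0, sasa_ligand=600.0))
```

Since the off-set appears once with a positive sign and twice with a negative sign, the difference reduces to γ × ΔSASA − β, so the off-set shifts every ligand by the same amount and does not affect rankings.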

Entropic contribution to binding free energies

A variety of options are available to incorporate entropic contributions to ΔG. The most common approach is normal mode analysis^33,34 but it can require similar computational effort to the underlying simulations in order to obtain converged results¹⁸. Consequently, here we explore the use of a more computationally efficient alternative proposed by Duan et al.³⁵. In their formulation the “variational entropy” can be derived from the fluctuations of the receptor-ligand interaction energy, E^inter. This energy can be calculated using components of the MMPBSA calculation:

$${E}^{{inter}}={G}_{ele}^{MM}+{G}_{vdW}^{MM}.$$

(3)

The fluctuation in interaction energy is then given by:

$${\rm{\Delta }}{E}^{{inter}}={E}^{{inter}}-\langle {E}^{{inter}}\rangle ,$$

(4)

where angle brackets indicate an ensemble average. This is then used to compute the entropic contribution to binding via:

$$-T{\rm{\Delta }}{S}_{{var}}={k}_{B}T\,\mathrm{ln}\,\langle {e}^{\beta {\rm{\Delta }}{E}^{{inter}}}\rangle $$

(5)

where k_B is the Boltzmann constant and $\beta =1/{k}_{B}T$.
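Equations (3) to (5) can be evaluated directly from a series of per-frame interaction energies. The sketch below is one possible implementation, assuming the energies are supplied in kcal mol⁻¹ as a plain list and using 300 K (the simulation temperature) as the default; the function name is illustrative, and the shift by the largest exponent is a standard numerical safeguard rather than something specified in the text.

```python
import math

K_B = 0.0019872041  # Boltzmann constant in kcal mol^-1 K^-1


def variational_entropy(e_inter, temperature=300.0):
    """Return -T*dS_var (kcal/mol) from per-frame interaction energies.

    e_inter: sequence of E_inter values (Eq. 3), one per snapshot.
    """
    beta = 1.0 / (K_B * temperature)
    average = sum(e_inter) / len(e_inter)
    # Eq. (4): fluctuation of each frame about the ensemble average,
    # multiplied by beta to give the exponent in Eq. (5).
    exponents = [beta * (e - average) for e in e_inter]
    # Eq. (5): k_B*T*ln<exp(beta*dE)>, computed with the largest exponent
    # factored out so that exp() cannot overflow.
    largest = max(exponents)
    total = sum(math.exp(x - largest) for x in exponents)
    log_mean = largest + math.log(total / len(exponents))
    return log_mean / beta


# Illustrative energies only: a wider spread gives a larger penalty.
print(variational_entropy([-50.0, -50.0, -50.0]))
print(variational_entropy([-52.0, -50.0, -48.0]))
```

Because the logarithm of an average of exponentials is never smaller than the average exponent (which is zero here), the term is always non-negative: it can only make the predicted binding less favourable, and it grows rapidly when a few frames have interaction energies far above the mean.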

Simulation setup

Ensembles of 25 replica MD simulations were conducted using the package NAMD 2.11³⁶ for each system (complex, receptor or ligand) studied. All simulations were conducted using the protocol incorporated into BAC²³. We have previously shown that the use of 25 replica ensembles provides a good balance of computational cost and calculation uncertainty for a number of varied systems^8,17,18.

Each system was minimized with all heavy protein atoms restrained at their initial positions (with a restraining force constant of 4 kcal mol⁻¹ Å⁻²). Initial velocities were then generated independently for each replica from a Maxwell–Boltzmann distribution at 50 K. Each system was virtually heated to 300 K over 60 ps and subsequently maintained at this temperature using a thermostat (employing a coupling coefficient of 1 ps⁻¹), during which time the restraints applied during minimization were retained. Once the system reached the correct temperature the pressure was maintained at 1 bar using a Berendsen barostat (with a pressure coupling constant of 0.1 ps). Subsequent to the heating, a series of equilibration runs, totaling 2 ns, were conducted, during which the restraints on heavy atoms were gradually reduced. The restraints were reduced in ten 100 ps steps, with the force constant halved after each step. Finally, 4 ns production simulations were executed with snapshots output for analysis every 100 ps. A 2 fs time step was used for all MD simulation steps.

The workflow of the ESMACS protocols is shown in Fig. 1. For each system run through the 1traj protocol an ensemble of independent NAMD simulations is executed, each consisting of four steps. The first is minimization (min), which is followed by two equilibration steps (labelled eq1 and eq2 respectively). In eq1 the system is heated while restraints are applied to heavy atoms. In eq2 restraints are gradually reduced before free simulation is undertaken. After 2 ns of aggregate equilibration the 4 ns production phase is initiated. It is the production trajectory which is analysed by MMPBSA.py. A script is then run to aggregate these results from the ensemble of simulations and compute values of ΔG_MMPBSA along with bootstrap statistics. In multiple trajectory approaches a second ensemble of ligand-only simulations is conducted and fed into the aggregation and bootstrapping script. Full simulation details are provided in the main text.
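The restraint schedule and the amount of data each ensemble produces follow from simple arithmetic, sketched below. One detail is an assumption: the first 100 ps step is taken to use the full 4 kcal mol⁻¹ Å⁻² force constant, with halving applied from the second step onwards; the text does not state whether the first step already uses the halved value. Function and variable names are illustrative.

```python
def restraint_schedule(initial_k=4.0, n_steps=10, step_ps=100):
    """Return (start time in ps, force constant) for each equilibration step.

    Assumes the first step uses initial_k and the force constant is halved
    after every step (an interpretation of the protocol, see lead-in).
    """
    return [(i * step_ps, initial_k / 2 ** i) for i in range(n_steps)]


for start_ps, k in restraint_schedule():
    print(f"{start_ps:4d} ps  k = {k:.5f} kcal/mol/A^2")

# Snapshots available for analysis, from the production settings in the text.
PRODUCTION_PS = 4000      # 4 ns production run
OUTPUT_INTERVAL_PS = 100  # snapshot every 100 ps
N_REPLICAS = 25

frames_per_replica = PRODUCTION_PS // OUTPUT_INTERVAL_PS
frames_per_ensemble = frames_per_replica * N_REPLICAS
print(frames_per_replica, frames_per_ensemble)
```

Under these settings each replica contributes roughly 40 snapshots and each 25 replica ensemble roughly 1000, which is the sample that MMPBSA.py and the bootstrap analysis operate on.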

Experimental Datasets

This study investigates a combination of BRD4 ligand binding datasets which have been the subjects of earlier studies. The first, previously studied by Aldeghi et al.¹⁵ using a combination of FEP based absolute binding free energy and MMPBSA techniques, contains a diverse set of 11 ligands which will be referred to as the diverse (DIV) dataset. The second was recently studied by our group in collaboration with GlaxoSmithKline⁹ (using a combination of ESMACS and ensemble thermodynamic integration approaches) and contains 16 ligands, all based on a single tetrahydroquinoline (THQ) template (consequently we identify this as the THQ dataset). The compounds were selected to represent a range of chemical functionality and binding affinities, despite their shared scaffold. The first 11 compounds are labeled 1 to 11 according to the scheme used by Aldeghi et al.¹⁵; the THQ based ligands are labeled THQ1 to THQ16 (the numbers correspond to those used in Wan et al.⁹). The chemical structures of the first 11 compounds and the THQ scaffold are shown in Fig. 2. The groups found at positions R1 to R4 in the THQ based ligands are detailed in Fig. 3 and Table 2. Ligand 4 was parameterized with a charge of +1. Compounds THQ10 to THQ12 and THQ16 are positively charged (+1), and compounds THQ13 to THQ15 are negatively charged (−1).

Table 2 Composition of the ligands of the THQ dataset.


Experimental binding free energies (ΔG_expt) for the first dataset were obtained from a combination of SPR, Alphascreen and Isothermal Titration Calorimetry (ITC) experiments¹⁵, whereas those for the THQ dataset are derived from IC₅₀ values from FRET⁹. These techniques are very different from one another and will necessarily introduce varying levels of uncertainty into the data they provide. The divergence in the origin of the measurements is representative of the sources of experimental data to which free energy calculations are typically compared. This, alongside the lack of rigorously derived uncertainty estimates in the experimental data, must be borne in mind when assessing protocol performance. In Table 3 we provide the full experimental binding affinities for both the diverse (DIV) and tetrahydroquinoline scaffold (THQ) datasets.

Table 3 Experimental binding affinities for both the diverse (DIV) and tetrahydroquinoline scaffold (THQ) datasets.

Structural models

The ligands from both datasets were simulated bound to the two BRD4 structural models based on PDBs 2OSS and 4BJX respectively (these are the initial structures used in Aldeghi et al.¹⁵ and Wan et al.⁹). The former represents the apo BRD4 and the latter the protein bound to a THQ based ligand. The secondary structure of both models is very similar (see Fig. 4a) and the RMSD between the two structures is 0.44 Å. All crystallographic water molecules were retained, including four which are conserved in the binding site of both models. The poses of the ligands in the DIV dataset were extracted from crystal structures (PDBs: 3U5J, 3U5L, 4OGI, 4OGJ, 3MXF, 4MR3, 4MR4, 3SVG, 4J0R and 4HBV), except for one ligand, labelled 10, which was modeled (based on PDB 3SVG) and docked into 2OSS as two conformers. These are the same two conformers used in Aldeghi et al.¹⁵, differing by a 180° flip of the trifluorophenyl moiety. The modelled poses were aligned and copied into the 4BJX based models. Poses of the THQ ligands were based on that of I-BET726 as found in the 4BJX structure.

System setup, including the creation of a water box and addition of neutralizing ions, was performed using AmberTools 17^37,38. The majority of simulations were conducted using protein parameters taken from the standard Amber force field for bioorganic systems (ff14SB)³⁹. Reproducibility studies of the THQ ligands were conducted using an earlier version of the forcefield, ff99SBildn⁴⁰.

Drug parameters were produced using the general Amber force field (GAFF)⁴¹. The majority of the simulations presented here employ ligands prepared using the Gaussian/RESP protocol. In this approach, Gaussian 98⁴² was used to perform geometry optimization of the inhibitor with 6-31G** basis functions, and the restrained electrostatic potential (RESP) procedure was used to calculate the partial atomic charges. Reproducibility studies of the DIV dataset were conducted using AM1-BCC⁴³ derived charges. All charge assignment and input file generation was performed in the Antechamber component of AmberTools.

Statistics and uncertainties

All statistics presented use their standard definitions with the exception of the mean unsigned error (MUE). It is well known that MMPBSA results have a significant offset from experimental values (typically of the order of 15 to 25 kcal mol⁻¹) due to a range of factors, in particular the neglect of entropic contributions^33,34. Consequently we present values corrected for the systematic (mean signed) error and designate them cMUE.
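A minimal sketch of such a corrected error measure is given below. It assumes that the correction consists of subtracting the mean signed error from every individual error before taking absolute values; the text does not spell out the exact formula, so this is one plausible reading, and the function name and numbers are illustrative.

```python
def corrected_mue(calculated, experimental):
    """Mean unsigned error after removing the mean signed error (kcal/mol).

    Assumed definition: cMUE = mean(|error_i - mean(error)|).
    """
    errors = [c - e for c, e in zip(calculated, experimental)]
    mean_signed_error = sum(errors) / len(errors)
    return sum(abs(err - mean_signed_error) for err in errors) / len(errors)


# Illustrative values: predictions offset by about 19 kcal/mol from experiment.
calc = [-30.0, -28.0, -25.0]
expt = [-10.0, -9.0, -7.0]
print(corrected_mue(calc, expt))
```

With this definition a set of predictions that differs from experiment only by a constant shift has a cMUE of zero, so the metric reflects scatter about the systematic offset rather than the offset itself.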

We compute uncertainties for all metrics through bootstrapping analysis. This method involves resampling with replacement the N input data points (in this case, the replica averages of ΔG_MMPBSA) to provide a new bootstrap sample also containing N data points. This process is repeated many times (in our case 5000 times) and the statistic of interest of each bootstrap population calculated. The standard deviation of these values provides an estimate of the uncertainty associated with an average derived from a given sample; this is what is quoted as the bootstrap error measure of our statistics. For correlation coefficients samples are drawn from the overall averages for each ligand paired with the relevant experimental value. In addition to this metric, when making a direct comparison of specific correlation coefficients we will also quote 95% confidence intervals. These intervals are calculated by sorting the bootstrap sample distribution of correlation coefficients and taking the values falling at the 2.5 and 97.5 percentiles.
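The bootstrap procedure described above can be sketched using only the Python standard library. The code assumes replica averages and per-ligand values are supplied as plain lists; the function names are illustrative, the percentile lookup uses a simple index into the sorted samples, and resamples in which every value is identical are skipped because a correlation coefficient is undefined for them (a practical choice, not something stated in the text).

```python
import random
from statistics import mean, stdev


def bootstrap_error(replica_means, n_boot=5000, seed=0):
    """Standard deviation of the mean over bootstrap resamples."""
    rng = random.Random(seed)
    n = len(replica_means)
    boots = [mean(rng.choices(replica_means, k=n)) for _ in range(n_boot)]
    return stdev(boots)


def ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = shared
        i = j + 1
    return result


def spearman(x, y):
    """Spearman rank coefficient as the Pearson correlation of the ranks."""
    rx, ry = ranks(x), ranks(y)
    mx, my = mean(rx), mean(ry)
    sxy = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sxx = sum((a - mx) ** 2 for a in rx)
    syy = sum((b - my) ** 2 for b in ry)
    return sxy / (sxx * syy) ** 0.5


def bootstrap_spearman(calculated, experimental, n_boot=5000, seed=0):
    """Return (bootstrap error, (2.5th, 97.5th percentile)) for r_s."""
    rng = random.Random(seed)
    n = len(calculated)
    samples = []
    while len(samples) < n_boot:
        # Resample ligands with replacement, keeping each calculated value
        # paired with its experimental value.
        idx = rng.choices(range(n), k=n)
        xs = [calculated[i] for i in idx]
        ys = [experimental[i] for i in idx]
        if len(set(xs)) < 2 or len(set(ys)) < 2:
            continue  # correlation undefined for a constant resample
        samples.append(spearman(xs, ys))
    samples.sort()
    lower = samples[int(0.025 * n_boot)]
    upper = samples[int(0.975 * n_boot) - 1]
    return stdev(samples), (lower, upper)
```

Fixing the random seed makes the quoted uncertainties reproducible between runs, which is in keeping with the emphasis on reproducibility elsewhere in this work.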

Results

Here we evaluate the performance of a range of ESMACS protocols in reproducing the experimental rankings across the full diverse ligand dataset, the robustness of this ranking to choices in system setup and the influence of non-standard MMPBSA components.

Standard ESMACS Performance and Robustness to Initial Structure Variation

Comparison of the results of all ESMACS protocols across the full DIV + THQ dataset shows a distinct trend in which inclusion of the receptor average energy considerably improves the predictions obtained for both initial protein models. In both cases 1traj results have a Spearman rank coefficient, r_s, of 0.46 [CI: 0.16–0.84 for both] which improves to 0.66 [CI: 0.50–0.94]/0.60 [CI: 0.40–0.91] (2OSS/4BJX) when both ligand and receptor flexibility are accounted for in the 2traj-ar protocol. In the DIV dataset better ranking can be obtained using receptor flexibility alone, but in order to obtain good rankings for THQ both additional contributions are required. This is the same behaviour observed in the simulation results for the THQ dataset in Wan et al.⁹; however the overall ranking is worse (the original study obtained an r_s of 0.78 [CI: 0.53–0.92]), primarily due to the stronger predicted binding affinity for the experimentally least potent drug, THQ16, in the present study.

The improvement between 1traj and 2traj-ar is illustrated in Fig. 5, which shows that outliers are moved closer to the overall trend line (particularly apparent for the DIV ligands 3, 4 and 5 which were also outliers in Aldeghi et al.¹⁵). These three ligands have similar experimental binding energies but a difference of 15 kcal mol⁻¹ in 1traj and 10 kcal mol⁻¹ in 2traj-ar is seen in ΔG_MMPBSA. The ranking improvement is larger for the THQ ligands than the DIV dataset, with the 1traj results exhibiting little if any correlation with experiment. The main THQ outliers in the 1traj results are THQ12, THQ13 and THQ9. The first two are moved closer to the trend in the 2traj-ar results but THQ9 remains more negative than might be expected. Another feature of the 2traj-ar data here is that greater separation is seen between the results obtained from the two BRD4 structures for THQ12, THQ13 and, most markedly, THQ16. This is in contrast to nearly all other ligands, where the values obtained from simulations with either model are well within the error margin, many sitting on top of one another in Fig. 5.

It can also be seen in Table 4 that the impact of the incorporation of receptor ‘strain’ in the 1traj-ar and 2traj-ar protocols is different in the DIV and THQ subsets. In the 4BJX simulations the DIV rankings are notably worse than those from 1traj, whilst they are fairly similar in the 2OSS case. For THQ, by contrast, we find that accounting for both receptor and ligand flexibility is necessary to obtain a good ranking in both cases. Overall the results from the 2OSS structure are better than those from 4BJX. However, it should be noted that the ΔG_MMPBSA values obtained from the two structural models for all drugs using the 1traj protocol agree within error (see Fig. 5a).

Table 4 Performance of different MMPBSA based ESMACS protocols in reproducing experimental binding free energies, measured by mean unsigned error (MUE), Pearlman’s predictive index (PI), Pearson’s correlation coefficient (r) and Spearman’s rank coefficient (r_s).


Robustness of Ranking to Parameterization

Two of the key decisions in ligand binding free energy calculations are the choices of the forcefield and how small molecules are parameterized. For simulations using Amber forcefields the choice of procedures for ligand preparation is usually whether to use AM1-BCC or Gaussian/RESP based protocols to determine atom charges in combination with the GAFF general purpose forcefield parameters. Following the choice in Wan et al.⁹ we used Gaussian/RESP for the majority of simulations in this work, but to evaluate the influence of this we re-ran the DIV dataset in the 2OSS model using the AM1-BCC methodology. Figure 6a shows that the ΔG_MMPBSA values for the large majority of the ligands are highly correlated between the two schemes (within 1–2 kcal mol⁻¹). This and the similar correlation with experiment (shown in Table 5) indicates that our results are robust with respect to this choice.

Table 5 Performance of different MMPBSA based ESMACS protocols in reproducing experimental binding free energies using the AM1-BCC method to parameterize ligands.


The Wan et al.⁹ study employed the Amber ff99SBildn forcefield for the protein, whilst in this study we have used ff14SB. In general the results obtained for all ligands are consistent but two ligands at either end of the rankings, THQ9 and THQ15, differ significantly as shown in Fig. 6b. The ranking performance with ff99SBildn is described in Table 6. Comparing these with the results for ff14SB (the THQ subset values in Table 4) shows that ff99SBildn provides better results, especially for 2traj-ar in the 4BJX model (r_s of 0.80 compared to 0.46). There are many factors which may cause this difference but one we identified was the possibility that the balance between direct and water mediated interactions might be altered by modifications to the amino acid side chain parameters. This in part motivated our investigation of the impact of including explicit water molecules in the receptor component of our calculations (see the following section).

Table 6 Performance of different MMPBSA based ESMACS protocols in reproducing experimental binding free energies using the ff99SBildn protein forcefield.


Inclusion of Explicit Water

Aldeghi et al.²² found that the inclusion of explicit water molecules as part of the receptor in MMPBSA calculations improved the correlation with experiment in the DIV dataset. Here we explore whether this finding is reproducible using ensemble simulations and is robust to the addition of THQ ligands to the dataset under investigation. We use the same strategy in selecting water molecules for inclusion as the previous work, namely using the closest N to the ligand in each frame of the simulation trajectory.
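The per-frame selection of the closest N water molecules can be illustrated with a short sketch. It assumes that the distance from a water molecule to the ligand is measured from the water oxygen to its nearest ligand atom, which is one common convention but is not specified in the text; the coordinates and the function name are hypothetical.

```python
import math


def closest_waters(ligand_atoms, water_oxygens, n):
    """Indices of the n water molecules nearest to the ligand in one frame.

    ligand_atoms, water_oxygens: lists of (x, y, z) coordinates in angstroms.
    Distance is taken from each water oxygen to its nearest ligand atom
    (an assumed convention, see lead-in).
    """
    def distance_to_ligand(oxygen):
        return min(math.dist(oxygen, atom) for atom in ligand_atoms)

    order = sorted(range(len(water_oxygens)),
                   key=lambda i: distance_to_ligand(water_oxygens[i]))
    return order[:n]


# Hypothetical single-frame coordinates.
ligand = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
waters = [(5.0, 0.0, 0.0), (1.0, 1.0, 0.0), (0.0, 0.0, -2.0), (10.0, 10.0, 10.0)]
print(closest_waters(ligand, waters, n=2))
```

Because the selection is repeated for every frame, the identity of the retained water molecules can change along the trajectory while their number stays fixed, which keeps the receptor definition consistent in size across all snapshots.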

We found a large difference in the impact of explicit water molecules between the combined DIV + THQ and DIV alone datasets. The correlations within the THQ dataset do not benefit from the inclusion of the additional water molecules in any protocol. For the combined dataset we find that including up to around 5 explicit water molecules improves the rankings for all protocols (see Fig. 7a). After 50 water molecules are included, 1traj performance drops to show no significant correlation with experiment and is only slightly improved as more molecules are added. A similar pattern is observed for the 1traj-ar and 2traj-ar results although, after the initial improvements, performance is more stable until 100 water molecules are included, when an even sharper fall off is observed.

For the DIV dataset, as shown in Fig. 7b, the improvements are yet more marked. The biggest improvement is seen in the 1traj results. Furthermore, the MUE for these rankings does not increase as more water is added near peak performance: 3.38/3.54 for 0 and 3.08/3.08 for 5 water molecules (2OSS/4BJX). In line with the results of Aldeghi et al.²² the peak performance has ${r}_{s} > 0.9$; however, unlike in the previous work, here we see this with 2 water molecules included, with a decline after 5 (as opposed to a peak at 20 and consistent performance thereafter). A number of factors could impact this including our use of ensembles of 4 ns trajectories (compared to single 16 ns runs) and Gaussian/RESP charges (as opposed to AM1-BCC). Overall though, it is important to retain at least four of the conserved water molecules in the binding site for the ESMACS calculations in order to obtain consistently good rankings across datasets. Moreover, the impact of adding water molecules differs between runs initiated with different starting structures, as shown in Table 7.

Table 7 Performance of 1traj and 2traj-ar MMPBSA based ESMACS protocols in reproducing experimental binding free energies incorporating different numbers of explicit water molecules.


The combined DIV + THQ 4BJX 1traj ranking remains consistent, with no improvement, as the first 5 water molecules are incorporated, whereas in 2OSS the ranking improves from an r_s of 0.46 [CI: 0.16–0.84] to 0.54 [CI: 0.16–0.84] after the first two water molecules are included. In 1traj-ar the 4BJX results improve rapidly from a lower baseline whilst those from 2OSS remain consistent until 5 water molecules are added, at which point the results from both structures give an r_s of around 0.6. A similar pattern is seen in 2traj-ar, but with the peak performance at 2 water molecules of 0.70 [CI: 0.54–0.97]/0.67 [CI: 0.49–0.96] (2OSS/4BJX) as shown in Table 7. The increase in MUE which accompanies the improvement in correlation indicates that the effects are not uniform across all ligands. Marginal gains in correlation coefficient should not be overemphasized (as can be seen in Table 7, improvements are often within error); we rather wish to draw attention to the trend that inclusion of water molecules likely to be involved in mediating stable ligand-protein interactions improves (or at least does not degrade) calculation performance. The most important observation is that the addition of explicit water molecules improves the reproducibility of the ranking when using different starting models.

Variational Entropy

Accounting correctly, and computationally efficiently, for the entropic component of binding free energies remains a challenge for MMPBSA based computations. Here we investigated the effect of the variational entropy technique on the ranking performance of different ESMACS protocols. In all cases the variational entropy was computed using the fluctuations from the 1traj simulations. As shown in Table 8 the inclusion of this term results in a reduction in the performance of all protocols in simulations based on both initial models. Furthermore, the incorporation of explicit water molecules into the receptor reduces performance to an even greater extent. Looking in more detail we see that some compounds suffer a deterioration in prediction whilst others manifest an improvement. For instance, Fig. 8 shows that the three DIV outliers 3, 4 and 5 are closer to the trend line than in Fig. 5, whereas THQ12 and THQ13 are more poorly predicted. The entropic term is based on the variation in interaction energy during the complex simulation. As this variation is measured relative to the average, it captures properties of the interaction energy surface. For molecules such as 6, 8, 9 and 10 that have few degrees of freedom, the interaction energy surface is likely to be steep, with small changes in conformation or translations leading to a rapid loss of interaction energy. Meanwhile, larger more flexible compounds such as 4 and 5 (which has a flexible benzhydryl core) can adapt to conformational changes of the receptor and maintain a favourable interaction energy, leading to a flatter potential surface. The results suggest that this entropic term is suited to the latter but not the former examples. Correctly capturing entropic contributions is key to obtaining truly reliable rankings in diverse datasets and further work in this area is required. Also, components of the MMPBSA calculation (particularly the surface area term) incorporate some entropic contributions and such double counting may account at least in part for the poor performance of variational entropy here.

Table 8 Performance of 1traj and 2traj-ar MMPBSA based ESMACS protocols in reproducing experimental binding free energies incorporating both variational entropy and different numbers of explicit water molecules.

Discussion

In summary, we have investigated the influence of different analysis choices on the results of ensemble MMPBSA-based free energy calculations. Our tests are based on two datasets which cover common computational chemistry challenges: one is a set of related ligands and the other a highly diverse set of ligands with differing binding modes. In order to obtain successful rankings across the two datasets we found it necessary to incorporate receptor and ligand strains. Using the 2traj-ar ESMACS protocol we obtained Spearman correlations of between 0.60 [CI: 0.46–0.91] and 0.66 [CI: 0.50–0.94] for two different starting structures despite differences in charge and scaffold in the ligands. The lower confidence bounds of both these estimates are comparable to the average correlation coefficient from the 1traj protocol, 0.46 [CI: 0.16–0.84], suggesting the result is statistically significant despite the relatively modest size of the dataset (which contains a total of 27 ligands). It should be noted that the increase in computational cost is minimal here, as the only additional simulations required are of the ligand (which are much smaller than those of either the complex or the receptor), with the receptor energy replaced by a constant. Hence, for prospective day-to-day applications, we recommend accounting for both ligand and receptor strain through independent ligand simulations and either further simulation of the apo receptor (as in the 3traj ESMACS protocol) or the use of an average value for the receptor energies (2traj-ar).
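The difference between the 1traj and 2traj-ar combinations can be sketched as follows. This is one plausible reading of the description above, not the authors' implementation. It assumes that the 2traj-ar constant is the mean of the per-ligand average receptor energies taken from the complex simulations. All function names and inputs are hypothetical.

```python
from statistics import mean


def dg_1traj(g_complex, g_receptor, g_ligand):
    """1traj: complex, receptor and ligand energies (kcal/mol) all come
    from frames of the same complex trajectory."""
    return mean(g_complex) - mean(g_receptor) - mean(g_ligand)


def averaged_receptor_energy(receptor_energies_by_ligand):
    """Assumed definition of the 2traj-ar constant: the mean over ligands
    of each ligand's average receptor energy."""
    return mean(mean(v) for v in receptor_energies_by_ligand.values())


def dg_2traj_ar(g_complex, g_receptor_constant, g_free_ligand):
    """2traj-ar: ligand energies come from an independent simulation of the
    free ligand, and the receptor term is replaced by a single constant."""
    return mean(g_complex) - g_receptor_constant - mean(g_free_ligand)
```

Because the receptor constant is shared by every ligand, it shifts all predicted values equally and leaves the ranking to be determined by the complex and free-ligand terms. The only extra simulations needed are therefore those of the ligands.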

A key consideration in the use of binding free energy calculations in real world (industrial or clinical) settings is the reproducibility of the results. Other considerations include computational cost and calculation stability. ESMACS protocols offer advantages in both these regards as they make use of relatively simple and fast classical MD simulations, rather than the many parallel simulations of intermediate states required in alchemical calculations of absolute binding free energies⁴⁴. We have shown that the results obtained in this study are robust to changing the ligand charge generation protocol (to use AM1-BCC instead of Gaussian/RESP) and the force field used to parameterize the protein (from Amber ff14SB to ff99SBildn). The use of ensemble simulations is the key to obtaining this reproducibility, as individual replicas in ensembles varied by as much as 15 kcal mol⁻¹ (which is in line with our own and other groups' previous results^9,16,18,45). Despite this, performance differences were found for all initial protocols when simulations were initiated from different crystal structures.
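Why averaging over an ensemble matters when single replicas can differ by many kcal mol⁻¹ can be illustrated with a short sketch. It assumes a simple bootstrap over per-replica free energies to estimate the uncertainty of the ensemble mean. The function name, the number of resamples and the fixed seed are illustrative choices rather than details taken from this study.

```python
import random
from statistics import mean, stdev


def ensemble_estimate(replica_dgs, n_boot=2000, seed=0):
    """Return (ensemble mean, bootstrap standard error) in kcal/mol.

    `replica_dgs` holds one binding free energy estimate per replica.
    Replicas are resampled with replacement and the spread of the
    resampled means is taken as the uncertainty of the ensemble mean.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    n = len(replica_dgs)
    boot_means = [mean(rng.choices(replica_dgs, k=n)) for _ in range(n_boot)]
    return mean(replica_dgs), stdev(boot_means)
```

For a set of replicas spanning a wide range, the bootstrap error of the mean is much smaller than the spread between individual replicas, which is the sense in which the ensemble result is reproducible while any single replica is not.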

This observation, along with the fact that some ligands which have very similar experimental binding energies were widely separated even using protocols which accounted for receptor flexibility (1traj-ar and 2traj-ar), prompted us to investigate potential enhancements of the pure MMPBSA protocol. Specifically, we looked at the inclusion of an explicit ligand hydration shell in the receptor and variational entropy, which had previously been investigated for single replica simulations by Aldeghi et al.²² (though they also only investigated what we would term “1traj” calculations). These additional components capture chemical and physical features of the system neglected by MMPBSA but at minimal computational cost, a key consideration for practical binding affinity calculation applications. The entropy term reduced extreme outliers but at the expense of decreased overall ranking performance. This observation replicates that obtained by Aldeghi et al.²² for the DIV dataset bound to BRD4, although they found the term improved results for sensitivity based datasets including multiple proteins. When fewer than five water molecules were incorporated into the receptor our rankings were improved, with the best ranking across the full dataset obtained using this in combination with the 2traj-ar protocol. The most important observation of our work, however, is that the inclusion of these bound water molecules considerably reduced the performance difference between simulations initiated from models based on different crystal structures. A criticism of continuum-based methods is that they are incapable of capturing the effect of crucial water molecules, possible activity cliffs, etc., that are now a well understood feature of structure-activity relationship (SAR) landscapes and medicinal chemistry lead optimization. Here it is shown again how this challenge can be met with the simple inclusion of explicit water molecules.
Future work should address how to consider this in prospective application scenarios and in a wider range of protein targets.
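One simple way to build an explicit ligand hydration shell for a single frame is to keep the water molecules nearest to the ligand and treat them as part of the receptor. The sketch below assumes waters are ranked by the minimum distance from the water oxygen to any ligand atom. This selection rule, the function name and the coordinate format (tuples of x, y, z in Å) are assumptions made for illustration.

```python
import math


def closest_waters(ligand_atoms, water_oxygens, n_waters):
    """Return indices of the `n_waters` water oxygens nearest the ligand.

    Distance to the ligand is taken as the minimum distance from the
    water oxygen to any ligand atom (an assumed criterion).
    """
    def min_dist(oxygen):
        return min(math.dist(oxygen, atom) for atom in ligand_atoms)

    ranked = sorted(
        range(len(water_oxygens)),
        key=lambda i: min_dist(water_oxygens[i]),
    )
    return ranked[:n_waters]
```

Applying the selection frame by frame lets the identity of the retained waters change along the trajectory while the number included stays fixed, so the number of waters can be varied as a single protocol parameter.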

The improved performance observed for the diverse datasets in this study is presumably due to the capture of interactions between the ligand and the closest of the four conserved water molecules also found in the binding site. This observation is in line with other work in which system-dependent numbers of water molecules were found to improve rankings^46,47,48,49 and the broader phenomenon of the impact of crucial water molecules on SAR landscapes. Incorporation of the water molecules was highly effective in differentiating the ligands with diverse binding modes but less effective in the set of related THQ-scaffold based compounds. The fact that our observations fit a general pattern, and that the level of explicit water hydration which improves results is similar to the number of conserved water molecules, suggests that the approach can be applied more generally.

Overall we have shown that, for a diverse set of ligands, in order to deliver reproducible results from ESMACS (MMPBSA) calculations it is necessary to account for receptor and ligand strain and account explicitly for water molecules bound alongside ligands. Essential to obtaining these results is the use of ensemble simulations to generate meaningfully quantified uncertainties.
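Ranking quality in this Discussion is quoted as a Spearman correlation with a confidence interval. As a minimal sketch of how such a figure could be produced, the code below computes the Spearman coefficient and a percentile bootstrap interval by resampling ligands with replacement. The percentile bootstrap, the 95% level, the number of resamples and all names are assumptions for illustration and may differ from the procedure actually used.

```python
import random


def ranks(values):
    """1-based ranks, with tied values given their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            result[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return result


def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def bootstrap_ci(x, y, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the Spearman coefficient."""
    rng = random.Random(seed)
    n = len(x)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        xs = [x[i] for i in idx]
        ys = [y[i] for i in idx]
        if len(set(xs)) < 2 or len(set(ys)) < 2:
            continue  # correlation is undefined for a constant resample
        stats.append(spearman(xs, ys))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper
```

With a dataset of a few tens of ligands the resulting interval is typically wide, which is why comparing the lower bound of one protocol with the point estimate of another, as done above, is a more cautious test than comparing point estimates alone.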

Data Availability

Simulation input topologies and coordinates (alongside ligand parameters) for all protein-ligand systems and collated MMPBSA results are made available via Zenodo, 10.5281/zenodo.1484050. Trajectories are available from the corresponding author on reasonable request.

References

Paul, S. M. et al. How to improve R&D productivity: the pharmaceutical industry’s grand challenge. Nature Reviews Drug Discovery 9, 203–214, https://www.nature.com/articles/nrd3078 (2010).
Mobley, D. L. & Klimovich, P. V. Perspective: Alchemical free energy calculations for drug discovery. The Journal of Chemical Physics 137, 230901, https://doi.org/10.1063/1.4769292 (2012).
Mey, A. S. J. S., Jiménez, J. J. & Michel, J. Impact of domain knowledge on blinded predictions of binding energies by alchemical free energy calculations. J. Comput.-Aided Mol. Des., https://doi.org/10.1007/s10822-017-0083-9 (2017).
Yin, J. et al. Overview of the SAMPL5 host–guest challenge: Are we doing better? J. Comput.-Aided Mol. Des. 31, 1–19, https://doi.org/10.1007/s10822-016-9974-4 (2017).
Ganesan, A., Coote, M. L. & Barakat, K. Molecular dynamics-driven drug discovery: leaping forward with confidence. Drug Discovery Today 22, 249–269, http://www.sciencedirect.com/science/article/pii/S1359644616304147 (2017).
Pérez-Benito, L., Keränen, H., van Vlijmen, H. & Tresadern, G. Predicting binding free energies of PDE2 inhibitors. The difficulties of protein conformation. Sci. Rep. 8, https://doi.org/10.1038/s41598-018-23039-5 (2018).
Keränen, H. et al. Acylguanidine beta secretase 1 inhibitors: A combined experimental and free energy perturbation study. J. Chem. Theory Comput. 13, 1439–1453, https://doi.org/10.1021/acs.jctc.6b01141 (2017). PMID: 28103438.
Wan, S. et al. Evaluation and characterization of Trk kinase inhibitors for the treatment of pain: Reliable binding affinity predictions from theory and computation. Journal of Chemical Information and Modeling 57, 897–909, https://doi.org/10.1021/acs.jcim.6b00780 (2017). PMID: 28319380.
Wan, S. et al. Rapid and reliable binding affinity prediction of bromodomain inhibitors: a computational study. J. Chem. Theory Comput. (2016).
Wang, L. et al. Accurate and Reliable Prediction of Relative Ligand Binding Potency in Prospective Drug Discovery by Way of a Modern Free-Energy Calculation Protocol and Force Field. Journal of the American Chemical Society 137, 2695–2703, https://doi.org/10.1021/ja512751q (2015).
Sherborne, B. et al. Collaborating to improve the use of free-energy and other quantitative methods in drug discovery. J. Comput.-Aided Mol. Des. 30, 1139–1141, https://doi.org/10.1007/s10822-016-9996-y (2016).
Baker, M. 1,500 scientists lift the lid on reproducibility. Nature 533, 452–454, https://doi.org/10.1038/533452a (2016).
Ioannidis, J. P. A. Why Most Published Research Findings Are False. PLoS Med. 2, e124, https://doi.org/10.1371/journal.pmed.0020124 (2005).
Kollman, P. A. et al. Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. Acc. Chem. Res. 33, 889–897 (2000).
Aldeghi, M., Heifetz, A., Bodkin, M. J., Knapp, S. & Biggin, P. C. Accurate calculation of the absolute free energy of binding for drug molecules. Chem. Sci. 7, 207–218 (2016).
Wright, D. W., Hall, B. A., Kenway, O. A., Jha, S. & Coveney, P. V. Computing clinically relevant binding free energies of HIV-1 protease inhibitors. J. Chem. Theory Comput. 10, 1228–1241 (2014).
Wan, S., Knapp, B., Wright, D. W., Deane, C. M. & Coveney, P. V. Rapid, precise, and reproducible prediction of peptide–MHC binding affinities from molecular dynamics that correlate well with experiment. J. Chem. Theory Comput. 11, 3346–3356 (2015).
Sadiq, S. K., Wright, D. W., Kenway, O. A. & Coveney, P. V. Accurate ensemble molecular dynamics binding free energy ranking of multidrug-resistant HIV-1 proteases. J. Chem. Inf. Model. 50, 890–905, https://doi.org/10.1021/ci100007w (2010).
Aldeghi, M., Heifetz, A., Bodkin, M. J., Knapp, S. & Biggin, P. C. Predictions of Ligand Selectivity from Absolute Binding free Energy Calculations. J. Am. Chem. Soc. 139, 946–957, https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5253712/ (2017).
Mobley, D. L. & Gilson, M. K. Predicting Binding Free Energies: Frontiers and Benchmarks. Annu. Rev. Biophys. 46, 531–558, https://doi.org/10.1146/annurev-biophys-070816-033654 (2017).
Mobley, D. L. & Slochower, D. Mobleylab/Benchmarksets: Version 1.2, https://zenodo.org/record/839047 (2017).
Aldeghi, M., Bodkin, M. J., Knapp, S. & Biggin, P. C. Statistical Analysis on the Performance of Molecular mechanics Poisson–Boltzmann Surface Area versus Absolute Binding free Energy Calculations: Bromodomains as a Case Study. J. Chem. Inf. Model. 57, 2203–2221, https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5615372/, https://doi.org/10.1021/acs.jcim.7b00347 (2017).
Sadiq, S. K. et al. Automated Molecular Simulation Based Binding Affinity Calculator for Ligand-Bound HIV-1 Proteases. J. Chem. Inf. Model. 48, 1909–1919, https://doi.org/10.1021/ci8000937 (2008).
Balasubramanian, V., Treikalis, A., Weidner, O. & Jha, S. Ensemble Toolkit: Scalable and Flexible Execution of Ensembles of Tasks. arXiv:1602.00678 [cs], http://arxiv.org/abs/1602.00678, ArXiv: 1602.00678 (2016).
Merzky, A., Turilli, M., Maldonado, M., Santcroos, M. & Jha, S. Using Pilot Systems to Execute Many Task Workloads on Supercomputers. arXiv:1512.08194 [cs], http://arxiv.org/abs/1512.08194, ArXiv: 1512.08194 (2015).
Dakka, J. et al. High-throughput Binding Affinity Calculations at Extreme Scales. arXiv:1712.09168 [cs], http://arxiv.org/abs/1712.09168, ArXiv: 1712.09168 (2017).
Wright, D. W. & Coveney, P. V. Resolution of Discordant HIV-1 Protease Resistance Rankings Using Molecular Dynamics Simulations. J. Chem. Inf. Model. 51, 2636–2649, https://doi.org/10.1021/ci200308r (2011).
Hall, B. A., Wright, D. W., Jha, S. & Coveney, P. V. Quantized water access to the HIV-1 protease active site as a proposed mechanism for cooperative mutations in drug affinity. Biochemistry (Mosc.) 51, 6487–6489 (2012).
Wan, S. & Coveney, P. V. Rapid and accurate ranking of binding affinities of epidermal growth factor receptor sequences with selected lung cancer drugs. J. R. Soc. Interface 8, 1114–1127, https://doi.org/10.1098/rsif.2010.0609 (2011).
Hou, T., Wang, J., Li, Y. & Wang, W. Assessing the performance of the MM/PBSA and MM/GBSA methods. 1. The accuracy of binding free energy calculations based on molecular dynamics simulations. J. Chem. Inf. Model. 51, 69–82, https://doi.org/10.1021/ci100275a (2011).
Miller, B. R. III et al. MMPBSA.py: an efficient program for end-state free energy calculations. J. Chem. Theory Comput. 8, 3314–3321 (2012).
Case, D. A. et al. Amber 14. (University of California, San Francisco, 2014).
Genheden, S., Kuhn, O., Mikulskis, P., Hoffmann, D. & Ryde, U. The normal-mode entropy in the MM/GBSA method: effect of system truncation, buffer region, and dielectric constant. J. Chem. Inf. Model. 52, 2079–2088 (2012).
Wang, C., Greene, D., Xiao, L., Qi, R. & Luo, R. Recent Developments and Applications of the MMPBSA Method. Frontiers in Molecular Biosciences 4, https://doi.org/10.3389/fmolb.2017.00087/full (2018).
Duan, L., Liu, X. & Zhang, J. Z. Interaction entropy: A new paradigm for highly efficient and reliable computation of protein–ligand binding free energy. Journal of the American Chemical Society 138, 5722–5728, https://doi.org/10.1021/jacs.6b02682, PMID: 27058988 (2016).
Phillips, J. C. et al. Scalable molecular dynamics with NAMD. J. Comput. Chem. 26, 1781–1802, https://doi.org/10.1002/jcc.20289 (2005).
Case, D. A. et al. The Amber biomolecular simulation programs. J. Comput. Chem. 26, 1668–1688, https://doi.org/10.1002/jcc.20290 (2005).
Case, D. et al. Amber 17. (University of California, San Francisco, 2017).
Maier, J. A. et al. ff14SB: improving the accuracy of protein side chain and backbone parameters from ff99SB. J. Chem. Theory Comput. 11, 3696–3713 (2015).
Hornak, V. et al. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins: Struct., Funct., Bioinf. 65, 712–725, https://doi.org/10.1002/prot.21123 (2006).
Wang, J., Wolf, R. M., Caldwell, J. W., Kollman, P. A. & Case, D. A. Development and testing of a general Amber force field. J. Comput. Chem. 25, 1157–1174, https://doi.org/10.1002/jcc.20035 (2004).
Frisch, M. J. et al. Gaussian 98 (Gaussian, Inc., 1998).
Jakalian, A., Jack, D. B. & Bayly, C. I. Fast, efficient generation of high-quality atomic charges. AM1-BCC model: II. Parameterization and validation. J. Comput. Chem. 23, 1623–1641, https://doi.org/10.1002/jcc.10128 (2002).
Bhati, A. P., Wan, S., Hu, Y., Sherborne, B. & Coveney, P. V. Uncertainty Quantification in Alchemical Free Energy Methods. J. Chem. Theory Comput. 14, 2867–2880, https://doi.org/10.1021/acs.jctc.7b01143 (2018).
Genheden, S. & Ryde, U. A comparison of different initialization protocols to obtain statistically independent molecular dynamics simulations. J. Comput. Chem. 32, 187–195, https://doi.org/10.1002/jcc.21546 (2011).
Zhu, Y.-L., Beroza, P. & Artis, D. R. Including explicit water molecules as part of the protein structure in MM/PBSA calculations. J. Chem. Inf. Model. 54, 462–469, https://doi.org/10.1021/ci4001794, PMID: 24432790 (2014).
Maffucci, I. & Contini, A. Explicit ligand hydration shells improve the correlation between MM-PB/GBSA binding energies and experimental activities. J. Chem. Theory Comput. 9, 2706–2717, https://doi.org/10.1021/ct400045d, PMID: 26583864 (2013).
Genheden, S. et al. Accurate predictions of nonpolar solvation free energies require explicit consideration of binding-site hydration. Journal of the American Chemical Society 133, 13081–13092, https://doi.org/10.1021/ja202972m, PMID: 21728337 (2011).
Wong, S., Amaro, R. E. & McCammon, J. A. MM-PBSA captures key role of intercalating water molecules at a protein–protein interface. Journal of Chemical Theory and Computation 5, 422–429, https://doi.org/10.1021/ct8003707, PMID: 19461869 (2009).

Acknowledgements

The authors thank the EU H2020 projects ComPat (http://www.compat-project.eu/, Grant No. 671564), CompBioMed (http://www.compbiomed.eu/, Grant No. 675451) and VECMA (http://www.vecma.eu/, Grant No. 800925), NSF Award (https://www.nsf.gov/pubs/2017/nsf17542/nsf17542.htm, Award No. NSF 1713749), the MRC Medical Bioinformatics project (MR/L016311/1), and funding from the UCL Provost. We made use of the BlueWaters supercomputer at the National Center for Supercomputing Applications of the University of Illinois at Urbana–Champaign (https://bluewaters.ncsa.illinois.edu), access to which was made available through the aforementioned NSF award. We acknowledge the Leibniz Supercomputing Centre for providing access to SuperMUC (https://www.lrz.de/services/compute/) and the very able assistance of its scientific support staff. Additional calculations were conducted using an award of computer time on the Titan machine provided by the US Department of Energy’s Innovative and Novel Computational Impact on Theory and Experiment (INCITE) program (through the INSPIRE project). This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.

Author information

Authors and Affiliations

Centre for Computational Science, Department of Chemistry, University College London, London, WC1H 0AJ, United Kingdom
David W. Wright, Shunzhou Wan & Peter V. Coveney
Janssen Research & Development, Turnhoutseweg 30, B-2340, Beerse, Belgium
Christophe Meyer, Herman van Vlijmen & Gary Tresadern

Contributions

D.W.W. performed and analyzed simulations and wrote the main manuscript text. S.W. performed additional simulations and analysis. The study was designed by C.M., H.v.V., G.T., D.W.W. and P.V.C. All authors contributed to and reviewed the manuscript.

Corresponding author

Correspondence to Peter V. Coveney.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

LaTeX Supplementary File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wright, D.W., Wan, S., Meyer, C. et al. Application of ESMACS binding free energy protocols to diverse datasets: Bromodomain-containing protein 4. Sci Rep 9, 6017 (2019). https://doi.org/10.1038/s41598-019-41758-1

Received: 15 November 2018
Accepted: 08 March 2019
Published: 12 April 2019
DOI: https://doi.org/10.1038/s41598-019-41758-1
