The role of water in host-guest interaction

Rizzi, Valerio; Bonati, Luigi; Ansari, Narjes; Parrinello, Michele

doi:10.1038/s41467-020-20310-0

Published: 04 January 2021

Nature Communications volume 12, Article number: 93 (2021)

Abstract

One of the main applications of atomistic computer simulations is the calculation of ligand binding free energies. The accuracy of these calculations depends on the force field quality and on the thoroughness of configuration sampling. Sampling is an obstacle in simulations due to the frequent appearance of kinetic bottlenecks in the free energy landscape. Very often this difficulty is circumvented by enhanced sampling techniques. Typically, these techniques depend on the introduction of appropriate collective variables that are meant to capture the system’s slow degrees of freedom. In ligand binding, water has long been known to play a key role, but its complex behaviour has proven difficult to fully capture. In this paper we combine machine learning with physical intuition to build a non-local and highly efficient water-describing collective variable. We use it to study a set of host-guest systems from the SAMPL5 challenge. We obtain highly accurate binding free energies and good agreement with experiments. The role of water during the binding process is then analysed in some detail.

Introduction

Host–guest interactions regulate the working of proteins and have been intensively studied^1,2. Atomistic simulations have been widely used^3,4,5,6,7 to calculate key parameters like ligand affinity and residence time, and to gain a microscopic understanding of how protein–ligand binding works. The accuracy of these simulations depends on two key aspects: the quality of the model used to describe the interatomic interactions and the thoroughness of the statistical sampling^8,9. In this work, we will focus only on the latter and we will show that sampling can be much improved if the role of water in the binding–unbinding processes is duly taken into account.

Binding processes take place on a timescale that is unreachable with current computer resources; thus, the use of enhanced sampling methods is mandatory. We will frame our discussion in the context of Metadynamics (MetaD)^10,11,12 or, more precisely, of its most recent evolution, the on-the-fly probability-enhanced sampling method (OPES)¹³. OPES, like MetaD and many other methods^14,15,16, relies on the identification of suitable order parameters or collective variables (CVs). In these methods^14,17, the CV distribution is made to follow a preassigned law. This allows CV fluctuations to be amplified in a controlled way. For such methods to work in an accurate and efficient manner, the CVs must be able to describe the slow degrees of freedom of the system. Here we will identify one such powerful CV of general applicability aimed at describing the role of water in the ligand-binding process.

Water is expected to play an important role since, upon entering the binding site, the ligand has to shed its solvation shell in total or in part, while the water that originally was in the binding site has to rearrange and negotiate its way out of the binding cavity. Not surprisingly, much effort has been devoted to the role of water in ligand–host binding^18,19,20,21,22,23,24. In the context of enhanced sampling, many attempts have been made at capturing the role of water in a CV, leading to an improvement in binding free energy estimations^5,25,26,27,28. We show here that there is room for a further decisive step as none of these water-related CVs has been able to describe accurately the highly non-local changes in water structure that take place during binding, both in the vicinity of the ligand and in and around the binding pocket.

In order to succeed in our endeavour, we rely on a combination of physical considerations and modern machine learning (ML) techniques. In particular, we use a method that we have recently developed, which goes under the name of Deep Linear Discriminant Analysis (Deep-LDA)²⁹. Deep-LDA builds efficient CVs from the equilibrium fluctuations of a large set of descriptors, expressing them as a neural network (NN). In this context, the choice of descriptors is essential and we appeal to our physical understanding to introduce one such set that is capable of characterising not only the ligand solvation shell but also the water structure inside and outside the binding cavity. After building such a CV, we use it in OPES for accelerating the sampling of binding–unbinding events.

We measure the performance of our approach on a set of test systems taken from the SAMPL5 competition^30,31,32 and study the interaction of six ligands with an octa-acid calixarene host (OAMe) (see Fig. 1). We choose this system because, despite its relative simplicity, it retains most of the key features of a biologically relevant protein–ligand system. Very recently, a closely related system has been used to investigate how water flows in and out of the system in the absence of a ligand³³. Furthermore, the host’s symmetry simplifies the analysis, and a comparison can be made to existing theoretical calculations³². The choice to perform simulations on a system with a standard set of simulation parameters allows our results to be compared to a range of different techniques, among which are the attach-pull-release method³⁴, alchemical protocols³⁵, and metadynamics³⁶.

**Fig. 1: Sketch of the octa-acid host OAMe with the funnel restraint geometry and the guest molecules from the SAMPL5 challenge.**

Results

Collective variables from equilibrium fluctuations with Deep-LDA

In this work, we are mainly interested in computing the free energy difference ΔG between the bound state (B) in which the ligand sits in the lowest free energy binding pose and the unbound state (U) where the ligand is solvated in water and free to diffuse. In order to obtain a CV able to capture water behaviour, we use the recently developed machine learning Deep-LDA method²⁹.

Deep-LDA is a non-linear evolution of the time-honoured Linear Discriminant Analysis (LDA) classification method³⁷. In LDA, one takes two sets of data, in our case the configurations visited in short unbiased simulations in B and U, and defines a set of N_d descriptors d that are able to distinguish between the two. The aim of LDA is to find the linear combination of descriptors s = w^T d that best separates the two sets of data, w being an N_d-dimensional vector.

To this effect, one calculates for each set of data the vectors of the average descriptor values μ_B, μ_U, and their covariance matrices S_B, S_U. With these quantities, one then computes the so-called Fisher’s ratio:

$${\mathcal{J}}({\boldsymbol{w}})=\frac{{{\boldsymbol{w}}}^{\mathrm{T}}{{\boldsymbol{S}}}_{{\rm{b}}}{\boldsymbol{w}}}{{{\boldsymbol{w}}}^{\mathrm{T}}{{\boldsymbol{S}}}_{{\rm{w}}}{\boldsymbol{w}}},\tag{1}$$

where one has defined the within-scatter matrix S_w = S_B + S_U and the between-scatter matrix ${{\boldsymbol{S}}}_{{\rm{b}}}=({{\boldsymbol{\mu }}}_{{\rm{B}}}-{{\boldsymbol{\mu }}}_{{\rm{U}}}){({{\boldsymbol{\mu }}}_{{\rm{B}}}-{{\boldsymbol{\mu }}}_{{\rm{U}}})}^{\mathrm{T}}$. The w that maximises this ratio is the direction that optimally discriminates the two states and gives the best-separated projection of the data in the one-dimensional s space. The variable thus obtained has been shown to perform well as a CV in many cases, especially if one uses its Harmonic LDA variant^38,39.
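As an illustration of the linear step only, the following minimal sketch maximises the Fisher ratio of Eq. (1) for two synthetic data sets. For two classes the maximiser is known in closed form, w ∝ S_w⁻¹(μ_B − μ_U), which is what the code uses. The data, the function names, and the use of plain Python lists are illustrative assumptions; this is not the code used in the paper.

```python
import random

def mean_vector(samples):
    """Average value of each descriptor over a set of configurations."""
    n = len(samples)
    return [sum(x[i] for x in samples) / n for i in range(len(samples[0]))]

def covariance_matrix(samples):
    """Sample covariance matrix of the descriptors."""
    mu = mean_vector(samples)
    d = len(mu)
    cov = [[0.0] * d for _ in range(d)]
    for x in samples:
        for i in range(d):
            for j in range(d):
                cov[i][j] += (x[i] - mu[i]) * (x[j] - mu[j])
    return [[c / (len(samples) - 1) for c in row] for row in cov]

def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x

def within_scatter(data_b, data_u):
    """S_w = S_B + S_U."""
    cov_b, cov_u = covariance_matrix(data_b), covariance_matrix(data_u)
    return [[p + q for p, q in zip(rb, ru)] for rb, ru in zip(cov_b, cov_u)]

def lda_direction(data_b, data_u):
    """Unit vector w proportional to S_w^{-1} (mu_B - mu_U)."""
    diff = [p - q for p, q in zip(mean_vector(data_b), mean_vector(data_u))]
    w = solve_linear(within_scatter(data_b, data_u), diff)
    norm = sum(c * c for c in w) ** 0.5
    return [c / norm for c in w]

def fisher_ratio(w, data_b, data_u):
    """J(w) of Eq. (1); w^T S_b w reduces to (w . (mu_B - mu_U))^2."""
    diff = [p - q for p, q in zip(mean_vector(data_b), mean_vector(data_u))]
    s_w = within_scatter(data_b, data_u)
    num = sum(wi * di for wi, di in zip(w, diff)) ** 2
    den = sum(w[i] * s_w[i][j] * w[j]
              for i in range(len(w)) for j in range(len(w)))
    return num / den

# Synthetic (hypothetical) data: two descriptors, the first one very noisy.
rng = random.Random(0)
data_b = [[rng.gauss(0.0, 2.0), rng.gauss(0.0, 0.2)] for _ in range(500)]
data_u = [[rng.gauss(1.0, 2.0), rng.gauss(1.0, 0.2)] for _ in range(500)]
w = lda_direction(data_b, data_u)
print("LDA direction:", w)
print("Fisher ratio along w:", fisher_ratio(w, data_b, data_u))
```

In this toy case both descriptors shift by the same amount between the two states, but the second fluctuates far less, so the LDA direction weights it much more heavily than the first.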

In Deep-LDA, a similar paradigm applies with the key difference that LDA is performed on a non-linear transformation of the descriptors. The non-linearity is introduced by a neural network (NN) (see Fig. 2) whose input is the set of N_d descriptors d and whose outputs are the N_h components of the last hidden layer h. LDA is performed on the components of h, so that, after determining the corresponding S_w and S_b, the NN is optimised by maximising ${\mathcal{J}}({\boldsymbol{w}})$, that is, by using its negative as the loss function. At convergence, one determines the weights of the NN and the N_h-dimensional optimal vector w that produces the Deep-LDA projection:

$$s={{\boldsymbol{w}}}^{\mathrm{T}}{\mathbf{h}}.\tag{2}$$

**Fig. 2: Schematics of the Deep-LDA architecture used in this work.**

Deep-LDA is a powerful classifier that tends to compress the data into very sharp distributions which are unsuitable for enhanced sampling applications. To address this issue, we smooth the distributions by applying the following cubic transformation s_w = s + s³, in the spirit of what was done in ref. ⁴⁰. The CV thus obtained will be used to describe the water behaviour in our simulations.
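The effect of the cubic transformation can be checked with a few numbers. The sketch below simply evaluates s_w = s + s³ and its derivative 1 + 3s²; the function names and the sample values of s are illustrative assumptions. Because the derivative is never smaller than one, the map preserves the ordering of the states, and because it grows with |s|, a narrow peak located away from s = 0 is spread over a wider range of s_w.

```python
def smooth_cv(s):
    """Cubic transformation s_w = s + s**3 quoted in the text."""
    return s + s ** 3

def smooth_cv_derivative(s):
    """d s_w / d s = 1 + 3 s**2, which is >= 1 for every s."""
    return 1.0 + 3.0 * s ** 2

# Hypothetical values of the raw Deep-LDA output s.
for s in (-2.0, -0.5, 0.0, 0.5, 2.0):
    print(f"s = {s:+.1f}  s_w = {smooth_cv(s):+.3f}  "
          f"local stretch = {smooth_cv_derivative(s):.2f}")
```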

Including water in the model

The choice of the descriptors d is of paramount importance since it encodes the physics that we want to describe. In our case, we are interested in capturing the role of water in the binding process. To this effect, we choose two sets of points around which we compute the water coordination number. One set is located on the ligand, while the other one is fixed along the host’s axis z at regular intervals (see Fig. 1 and the Supplementary Methods).

The first set of coordination numbers {L_i} describes water solvation around the ligand and is similar in spirit to the ligand solvation variables that have been used in the past^5,28. The second one {V_i} is aimed instead at capturing the water arrangement inside and outside the binding pocket without any explicit reference to the ligand. It is essential that the descriptors capture all the water molecules that contribute to the host and the guest solvation. Missing some of them would create an incomplete picture of solvation, which in turn would lead to Deep-LDA classification errors and ineffective bias.

The set of descriptors {L_i, V_i} gives information on the structure of water and its non-local changes on a small to medium length scale during the binding–unbinding process. Its effectiveness does not lie in the individual action of each descriptor but in its collective capability to capture the many-body concerted movements of the host, guest, and water molecules. The use of these descriptors is one of the elements of novelty in our approach and one of the keys to its success.
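To make the idea of a water coordination number around a point concrete, here is a minimal sketch. It assumes a rational switching function of the kind commonly used in PLUMED, with exponents 6 and 12; the actual functional form and parameters used in the paper are given in the Supplementary Methods and are not reproduced here. The probe points, the cutoff r0, and the water coordinates are hypothetical, and periodic boundary conditions are ignored for brevity.

```python
import math

def switching(r, r0, n=6, m=12):
    """Smooth counting function: close to 1 for r << r0, close to 0 for r >> r0."""
    x = r / r0
    if abs(x - 1.0) < 1e-8:
        return n / m  # limit of the ratio at r = r0
    return (1.0 - x ** n) / (1.0 - x ** m)

def water_coordination(point, water_oxygens, r0):
    """Differentiable count of the water oxygens within roughly r0 of a point."""
    return sum(switching(math.dist(point, o), r0) for o in water_oxygens)

# Hypothetical probe points at regular intervals along the host axis z (nm).
probe_points = [(0.0, 0.0, 0.2 * i) for i in range(4)]

# Hypothetical water oxygen positions (nm).
water_oxygens = [(0.0, 0.0, 0.05), (0.1, 0.0, 0.45), (0.0, 0.1, 0.60)]

descriptors = [water_coordination(p, water_oxygens, r0=0.25) for p in probe_points]
for i, value in enumerate(descriptors, start=1):
    print(f"V_{i} = {value:.3f}")
```

A smooth switching function is used instead of a hard cutoff so that the descriptors, and any CV built from them, have well-defined derivatives with respect to the atomic positions, which a bias potential needs in order to produce forces.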

Binding free energies from enhanced sampling simulations

We perform OPES simulations to estimate the binding free energies of all the six ligands of Fig. 1. We use the Deep-LDA CV s_w together with a second CV s_z, which is the projection of the ligand centre of mass on the binding axis z. In the ligand-binding context, using the latter is a natural choice^5,36 as it has a clear physical interpretation and helps in distinguishing B from U. Furthermore, we employ a funnel-like restraint potential⁴ to encourage the ligand to find its way back to the binding site once it is out in the solution. The entropic correction to the free energy due to the funnel restriction can be calculated analytically (see Eq. 4 in the Supplementary Methods) and is taken into account when computing the binding free energies ΔG. We refer the interested reader to the Supplementary Methods for further details.

The combined use of these two CVs leads to an efficient sampling, which is reflected in a high number of binding–unbinding events per unit time (see for example Supplementary Fig. 18). We notice a clear improvement over a more standard set of CVs³⁶, namely s_z itself, and the cosine of the angle θ between the binding axis z and the ligand orientation (see Supplementary Fig. 17). The introduction of a water-based CV in enhanced sampling simulations allows the system to reach a regime where it diffuses without hysteresis from one metastable state to another, yielding a high accuracy in estimating ensemble averages of physical quantities. This makes it possible to significantly reduce the error bars without having to increase the computational time relative to what is reported in the literature³⁴.

Performing enhanced sampling simulations allows retrieving the equilibrium distribution P(s) of any collective variable s¹⁴. Here we focus on the free energy surface (FES), defined as ${\rm{FES}}(s)=-{k}_{{\rm{B}}}T\,\mathrm{log}\,P(s)$, where k_B is the Boltzmann constant and T is the temperature of the system. In the context of ligand binding, it is customary to look at the FES as a function of the host–guest distance s_z. For each of the six ligands, we compute the FES and estimate the errors with a block average analysis. We report these results in Fig. 3, in which we also assess the robustness of the Deep-LDA CV by showing the results corresponding to three different rounds of Deep-LDA training.
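The relation FES(s) = −k_B T log P(s) can be sketched with a weighted histogram. In the example below the samples are synthetic and the weights are all equal to one; in a biased simulation each frame would carry a reweighting factor instead, whose form depends on the method and is not shown here. The function name, the binning, and the choice of units (kcal mol⁻¹) are illustrative assumptions.

```python
import math
import random

KB = 0.0019872041  # Boltzmann constant in kcal mol^-1 K^-1

def free_energy_surface(samples, weights, s_min, s_max, n_bins, temperature):
    """FES(s) = -kB T log P(s) on a regular grid, shifted so its minimum is 0."""
    width = (s_max - s_min) / n_bins
    hist = [0.0] * n_bins
    for s, w in zip(samples, weights):
        k = int((s - s_min) / width)
        if 0 <= k < n_bins:  # samples outside [s_min, s_max) are discarded
            hist[k] += w
    total = sum(hist)
    kt = KB * temperature
    # Bins that were never visited get an infinite free energy.
    fes = [-kt * math.log(h / total) if h > 0 else math.inf for h in hist]
    shift = min(fes)
    return [f - shift for f in fes]

# Synthetic (hypothetical) samples: a populated state near s = 0.5
# and a less populated one near s = 2.0.
rng = random.Random(1)
samples = [rng.gauss(0.5, 0.1) for _ in range(3000)]
samples += [rng.gauss(2.0, 0.3) for _ in range(1000)]
weights = [1.0] * len(samples)

fes = free_energy_surface(samples, weights, 0.0, 3.0, 30, 300.0)
for k, f in enumerate(fes):
    print(f"s = {0.05 + 0.1 * k:.2f}  FES = {f:.2f} kcal/mol")
```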

**Fig. 3: Free energy surfaces projected along the host–guest distance.**

We then report the binding free energies ΔG corrected for the presence of the funnel in Table 1. In Fig. 4 we compare them with experimental values and theoretical calculations performed on the same model but with different sampling techniques^34,35,36. We assess the quality of our estimates through the metrics used in the SAMPL5 overview paper³² and obtain a root-mean-squared error of 0.68 kcal mol⁻¹, a Pearson coefficient of determination of 0.93, a linear regression slope of 1.21, and a Kendall correlation coefficient of 0.87. With some exceptions, we are in line with the SAMPL5 results (see Fig. 4 and Supplementary Tables 1 and 2). However, the error bars are significantly reduced over the whole set of ligands investigated.
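For readers who want to reproduce this kind of comparison, the sketch below computes the four statistics named above for a pair of data sets. The numbers are hypothetical placeholders, not the values of Table 1. The conventions are also assumptions: the slope is taken from a least-squares fit of calculated against experimental values, the coefficient of determination is the squared Pearson correlation, and the Kendall coefficient is the tau-a variant, which assumes no ties.

```python
import math

def rmse(calc, expt):
    """Root-mean-squared error between calculated and experimental values."""
    return math.sqrt(sum((c - e) ** 2 for c, e in zip(calc, expt)) / len(calc))

def slope_and_r2(x, y):
    """Least-squares slope of y against x and squared Pearson correlation."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / sxx, sxy ** 2 / (sxx * syy)

def kendall_tau(x, y):
    """Kendall tau-a: (concordant - discordant) pairs over all pairs."""
    n = len(x)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (x[i] - x[j]) * (y[i] - y[j])
            score += (prod > 0) - (prod < 0)
    return score / (n * (n - 1) / 2)

# Hypothetical binding free energies in kcal/mol (not the paper's data).
experiment = [-5.0, -4.0, -6.0, -7.0]
calculated = [-5.5, -4.5, -6.5, -7.5]

slope, r2 = slope_and_r2(experiment, calculated)
print("RMSE:", rmse(calculated, experiment))
print("slope:", slope)
print("R^2:", r2)
print("Kendall tau:", kendall_tau(experiment, calculated))
```

A constant offset between calculation and experiment, as in these placeholder numbers, affects only the RMSE: the slope and both correlation measures are insensitive to it.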

Table 1 Binding free energies.

**Fig. 4: Comparison of the binding free energies with experiments and other calculations.**

To test the generality of our procedure, we investigate the interaction of the six ligands with the OAH host, which was also studied in the SAMPL5 challenge. The results agree with those reported in refs. ^34,35,36, and we provide a complete report in Supplementary Figs. 31–57 and Tables 9–16. As a further check of our method and of the role of water, we also perform simulations of the host OAMe with the six ligands using the TIP4P/EW water model⁴¹ instead of the TIP3P model⁴². While the binding/unbinding process is unchanged, we find that the binding free energies depend on the water model chosen. Modulo a shift of about 1.3 kcal mol⁻¹, the two sets of results correlate reasonably well with one another and with the experiments. For a quantitative assessment of these statements, see Supplementary Figs. 59–78 and Tables 19–26. This change can possibly be attributed to a different solubility of the ligand in the two water models and to a different host–water interaction.

The case of G4

The use of the Deep-LDA CV s_w allows us to obtain not only accurate binding free energies but also a detailed insight into water behaviour during the binding process. We illustrate here the case of G4, the guest that exhibits the most complex behaviour, and refer the interested reader to Supplementary Figs. 5–30 and Tables 3–8 for a detailed analysis of all the other ligands.

In Fig. 5 we show the FES of G4 and the cylindrically averaged water density in the metastable states. We find that the system presents two binding poses B and B1. The lowest free energy binding pose B is the same as the one found in the experiments and contains no water. Our simulations reveal a second binding pose B1 that differs from B by the presence of a water molecule at the centre of the cavity. This second pose is ≈ 2 k_BT higher in free energy and thus it is occupied with a much lower probability.

**Fig. 5: Binding FES of ligand G4 with a study of the water presence in the visited states.**

When the ligand exits the pocket, before being fully solvated, it can pass through two intermediate short-lived states I and I1. In I, the cavity is dry and the ligand is free to rotate in front of the cavity entrance. In I1, the ligand again sits in front of the host entrance, but its rotation favours configurations in which the ligand bromine atom points towards the cavity, forming a linear arrangement where a water molecule at the centre of the cavity is bridged by another water molecule to the Br⁻ anion (see Supplementary Fig. 21). We underline that neither B1 nor I and I1 were part of the Deep-LDA training.

The ability of the Deep-LDA CV s_w to capture the non-local water structural changes is the main reason behind our capability to study the system’s FES and its metastable states at this level of detail. For instance, the use of CVs that concentrate solely on the position of the ligand with respect to the binding site such as s_z alone would clearly lead to an incomplete picture. In fact, B and B1 (and similarly I and I1) cannot be distinguished properly by s_z and, without the presence of a bias changing the cavity’s solvation, the limiting timescale of the simulations would be the water movement in and out of the pocket. Furthermore, local CVs that only describe the average ligand solvation can only partially take into account these non-local effects.

Analysis of the role of water

We can gain a deeper insight into the role of water by investigating the dependence of the Deep-LDA CV on the {L_i, V_i} descriptors. This can be done by analysing the descriptors’ relevance in the action of s_w and, for doing that, we use the derivative ranking method illustrated in ref. ²⁹. Here, we separate the role of the descriptors in the bound and unbound states and we report the results of ligand G4 in Fig. 6 (see Supplementary Fig. 22 for analysis over all the G4 metastable states).
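The idea behind a derivative-based ranking can be sketched as follows. The exact definition used in ref. ²⁹ is not reproduced here; the version below is an assumed simple variant in which the relevance of descriptor i is the average of |∂s/∂d_i| over the configurations of one state, normalised so that the relevances sum to one. The derivatives are taken by central finite differences, and the toy CV and the configurations are hypothetical stand-ins for the trained network and the simulation data.

```python
def descriptor_relevance(cv, configurations, h=1e-5):
    """Normalised mean absolute derivative of the CV w.r.t. each descriptor."""
    n_d = len(configurations[0])
    totals = [0.0] * n_d
    for d in configurations:
        for i in range(n_d):
            up, down = list(d), list(d)
            up[i] += h
            down[i] -= h
            # Central finite difference for d(cv)/d(d_i) at this configuration.
            totals[i] += abs(cv(up) - cv(down)) / (2.0 * h)
    norm = sum(totals)
    return [t / norm for t in totals]

def toy_cv(d):
    """Hypothetical stand-in for a trained CV with two input descriptors."""
    return 3.0 * d[0] + d[1] ** 2

# Hypothetical configurations belonging to a single metastable state.
state_configurations = [[0.0, 1.0], [0.0, -1.0]]

relevance = descriptor_relevance(toy_cv, state_configurations)
print("relative weights:", relevance)
```

Evaluating the ranking separately on the configurations of each state, as done in the text for B and U, shows how a non-linear CV can rely on different descriptors in different regions of configuration space.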

**Fig. 6: Descriptors’ relative weights for guest G4.**

In both states B and U, the weights are distributed over a wide range of descriptors, pointing to the fact that the Deep-LDA CV is able to capture the complex non-local action of water. However, different descriptors act in different ways in the two states. In the B state, the descriptors V₄, V₅, which are linked to the water molecules that reside in the proximity of the host’s entrance, have more weight. This indicates that the fluctuations in this part of the water system need to be amplified for the ligand to exit.

In contrast, in the U state, the descriptors that gain more weight are L₄, which measures the solvation around the bromine atom of the ligand, and V₁, V₂, which control the quantity of water contained in the binding cavity. Fluctuations towards the dry state of the cavity need to occur for the ligand to bind. Such fluctuations can occur with a small but not negligible probability also in the apo state, that is, for the host in the absence of a ligand (see Supplementary Fig. 2). Even larger fluctuations have been observed experimentally in ref. ³³ in a related system. We expect these fluctuations to be an important part of the reaction process in many host–guest systems.

The non-local action of the Deep-LDA CV is thus reflected in the relevance given to different water-based descriptors, depending on whether the system is in the bound or unbound state. When enhancing the sampling of this CV, this non-locality determines a collective motion of water that encourages the occurrence of binding/unbinding events.

Discussion

We have shown that, even in the relatively simple systems studied here, a complex and subtle reorganisation of water structure takes place and our strategy is able to capture it. Our calculations offer a powerful analysis tool and lead to accurate binding free energies.

Often, in the paper, we have underlined the efficiency of our method. However, this was not done in a spirit of competition with the SAMPL5 participants who, by the way, did not have the benefit of knowing the results beforehand. Our aim was instead to uncover and describe the role of water through the design and the application of an effective CV. In a scheme like MetaD, the efficiency of a CV is measured by its ability to capture the physics of the problem, hence our insistence on efficiency.

Having reduced the sampling error this much on a commonly used model, we might even be tempted to claim that the remaining discrepancies with respect to experiments can be blamed mainly on the inaccuracy of the force field. It would be interesting in this respect to investigate the force field limitations and how the inclusion of effects like polarisation could bring the results closer to experiments. The method is very robust and defines a protocol that can be naturally applied to larger and more complex systems. In fact, the sampling proficiency of our method will prove even more crucial in complex scenarios where a large number of water molecules can be trapped in multiple pocket locations.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The simulation inputs were taken from https://github.com/michellab/Sire-SAMPL5. We perform the simulations with GROMACS 2019.4⁴³ using the GAFF force field⁴⁴ with RESP charges⁴⁵ and the TIP3P water model⁴². For enhanced sampling, we use a custom version of the PLUMED plugin 2.5.4⁴⁶ where we include OPES¹³ and the PyTorch library 1.4⁴⁷. More details can be found in the Supplementary Methods. Simulation data are available on the Materials Cloud Archive at https://doi.org/10.24435/materialscloud:p3-1x.

Code availability

All the inputs and instructions to reproduce the results presented in this manuscript are deposited in the PLUMED-NEST repository at plumID:20.025. A tutorial about the Deep-LDA training can be found at this link.

References

1. Michel, J. & Essex, J. W. Prediction of protein-ligand binding affinity by free energy simulations: assumptions, pitfalls and expectations. J. Computer-Aided Mol. Des. 24, 639–658 (2010).
2. Mobley, D. L. & Gilson, M. K. Predicting binding free energies: frontiers and benchmarks. Annu. Rev. Biophys. 46, 531–558 (2017).
3. Bronowska, A. In Thermodynamics - Interaction Studies - Solids, Liquids and Gases Vol. i (InTech, 2011) https://www.intechopen.com/books/thermodynamics-interaction-studies-solids-liquids-and-gases/thermodynamics-of-ligand-protein-interactions-implications-for-molecular-design.
4. Limongelli, V., Bonomi, M. & Parrinello, M. Funnel metadynamics as accurate binding free-energy method. Proc. Natl Acad. Sci. USA 110, 6358–6363 (2013).
5. Tiwary, P., Mondal, J. & Berne, B. J. How and when does an anticancer drug leave its binding site? Sci. Adv. 3, e1700014 (2017).
6. Evans, R. et al. Combining machine learning and enhanced sampling techniques for efficient and accurate calculation of absolute binding free energies. J. Chem. Theory Comput. 16, 4641–4654 (2020).
7. Limongelli, V. Ligand binding free energy and kinetics calculation in 2020. WIREs Comput. Mol. Sci. 10, 1–32 (2020).
8. Mobley, D. L. Let’s get honest about sampling. J. Computer-Aided Mol. Des. 26, 93–95 (2012).
9. Rizzi, A. et al. The SAMPL6 SAMPLing challenge: assessing the reliability and efficiency of binding free energy calculations. J. Computer-Aided Mol. Des. 34, 601–633 (2020).
10. Laio, A. & Parrinello, M. Escaping free-energy minima. Proc. Natl Acad. Sci. USA 99, 12562–12566 (2002).
11. Barducci, A., Bussi, G. & Parrinello, M. Well-tempered metadynamics: a smoothly converging and tunable free-energy method. Phys. Rev. Lett. 100, 020603 (2008).
12. Bussi, G. & Laio, A. Using metadynamics to explore complex free-energy landscapes. Nat. Rev. Phys. 2, 200–212 (2020).
13. Invernizzi, M. & Parrinello, M. Rethinking metadynamics: from bias potentials to probability distributions. J. Phys. Chem. Lett. 11, 2731–2736 (2020).
14. Valsson, O., Tiwary, P. & Parrinello, M. Enhancing important fluctuations: rare events and metadynamics from a conceptual viewpoint. Annu. Rev. Phys. Chem. 67, 159–184 (2016).
15. Tiwary, P. & van de Walle, A. In Multiscale Materials Modeling for Nanomechanics Chap. 6, 195–221 (Springer, 2016) https://link.springer.com/chapter/10.1007%2F978-3-319-33480-6_6.
16. Debnath, J. & Parrinello, M. Gaussian mixture-based enhanced sampling for statics and dynamics. J. Phys. Chem. Lett. 11, 5076–5080 (2020).
17. Invernizzi, M., Piaggi, P. M. & Parrinello, M. Unified approach to enhanced sampling. Phys. Rev. X 10, 041034 (2020).
18. Ladbury, J. E. Just add water! The effect of water on the specificity of protein-ligand binding sites and its potential application to drug design. Chem. Biol. 3, 973–980 (1996).
19. Ewell, J., Gibb, B. C. & Rick, S. W. Water inside a hydrophobic cavitand molecule. J. Phys. Chem. B 112, 10272–10279 (2008).
20. Abel, R., Young, T., Farid, R., Berne, B. J. & Friesner, R. A. Role of the active-site solvent in the thermodynamics of factor Xa ligand binding. J. Am. Chem. Soc. 130, 2817–2831 (2008).
21. Wang, L., Berne, B. J. & Friesner, R. A. Ligand binding to protein-binding pockets with wet and dry regions. Proc. Natl Acad. Sci. USA 108, 1326–1330 (2011).
22. Mahmoud, A. H., Masters, M. R., Yang, Y. & Lill, M. A. Elucidating the multiple roles of hydration for accurate protein-ligand binding prediction via deep learning. Commun. Chem. 3, 19 (2020).
23. Bergazin, T. D. et al. Enhancing water sampling of buried binding sites using nonequilibrium candidate Monte Carlo. J. Computer-Aided Mol. Des. (2020).
24. Ben-Shalom, I. Y. et al. Accounting for the central role of interfacial water in protein-ligand binding free energy calculations. J. Chem. Theory Comput. (2020).
25. Limongelli, V. et al. Sampling protein motion and solvent effect during ligand binding. Proc. Natl Acad. Sci. USA 109, 1467–1472 (2012).
26. Casasnovas, R., Limongelli, V., Tiwary, P., Carloni, P. & Parrinello, M. Unbinding kinetics of a p38 MAP kinase type II inhibitor from metadynamics simulations. J. Am. Chem. Soc. 139, 4780–4788 (2017).
27. Brotzakis, Z. F., Limongelli, V. & Parrinello, M. Accelerating the calculation of protein-ligand binding free energy and residence times using dynamically optimized collective variables. J. Chem. Theory Comput. 15, 743–750 (2019).
28. Pérez-Conesa, S., Piaggi, P. M. & Parrinello, M. A local fingerprint for hydrophobicity and hydrophilicity: from methane to peptides. J. Chem. Phys. 150, 204103 (2019).
29. Bonati, L., Rizzi, V. & Parrinello, M. Data-driven collective variables for enhanced sampling. J. Phys. Chem. Lett. 2998–3004 (2020).
30. Bannan, C. C. et al. Blind prediction of cyclohexane-water distribution coefficients from the SAMPL5 challenge. J. Computer-Aided Mol. Des. 30, 927–944 (2016).
31. Sullivan, M. R., Sokkalingam, P., Nguyen, T., Donahue, J. P. & Gibb, B. C. Binding of carboxylate and trimethylammonium salts to octa-acid and TEMOA deep-cavity cavitands. J. Computer-Aided Mol. Des. 31, 21–28 (2017).
32. Yin, J. et al. Overview of the SAMPL5 host-guest challenge: are we doing better? J. Computer-Aided Mol. Des. 31, 1–19 (2017).
33. Barnett, J. W. et al. Spontaneous drying of non-polar deep-cavity cavitand pockets in aqueous solution. Nat. Chem. 12, 589–594 (2020).
34. Yin, J., Henriksen, N. M., Slochower, D. R. & Gilson, M. K. The SAMPL5 host-guest challenge: computing binding free energies and enthalpies from explicit solvent simulations by the attach-pull-release (APR) method. J. Computer-Aided Mol. Des. 31, 133–145 (2017).
35. Bosisio, S., Mey, A. S. & Michel, J. Blinded predictions of host-guest standard free energies of binding in the SAMPL5 challenge. J. Computer-Aided Mol. Des. 31, 61–70 (2017).
36. Bhakat, S. & Söderhjelm, P. Resolving the problem of trapped water in binding cavities: prediction of host-guest binding free energies in the SAMPL5 challenge by funnel metadynamics. J. Computer-Aided Mol. Des. 31, 119–132 (2017).
37. Welling, M. Fisher linear discriminant analysis. Technical Reports (Department of Computer Science, University of Toronto, 2005).
38. Mendels, D., Piccini, G. & Parrinello, M. Collective variables from local fluctuations. J. Phys. Chem. Lett. 9, 2776–2781 (2018).
39. Capelli, R. et al. Chasing the full free energy landscape of neuroreceptor/ligand unbinding by metadynamics simulations. J. Chem. Theory Comput. 15, 3354–3361 (2019).
40. Bjelobrk, Z. et al. Naphthalene crystal shape prediction from molecular dynamics simulations. CrystEngComm 21, 3280–3288 (2019).
41. Horn, H. W. et al. Development of an improved four-site water model for biomolecular simulations: TIP4P-Ew. J. Chem. Phys. 120, 9665–9678 (2004).
42. Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W. & Klein, M. L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926–935 (1983).
43. Abraham, M. J. et al. GROMACS: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1–2, 19–25 (2015).
44. Wang, J., Wolf, R. M., Caldwell, J. W., Kollman, P. A. & Case, D. A. Development and testing of a general amber force field. J. Comput. Chem. 25, 1157–1174 (2004).
45. Bayly, C. I., Cieplak, P., Cornell, W. & Kollman, P. A. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges: the RESP model. J. Phys. Chem. 97, 10269–10280 (1993).
46. Tribello, G. A., Bonomi, M., Branduardi, D., Camilloni, C. & Bussi, G. PLUMED 2: new feathers for an old bird. Computer Phys. Commun. 185, 604–613 (2014).
47. Paszke, A. et al. Automatic differentiation in PyTorch. Adv. Neural Inf. Process. Syst. 32, 8024–8035 (2019).

Acknowledgements

We acknowledge the Swiss National Science Foundation Grant Nr. 200021_169429/1 and the European Union Grant Nr. ERC-2014-AdG-670227/VARMET for funding. This research was also supported by the NCCR MARVEL, funded by the Swiss National Science Foundation. The simulations were performed on the ETH Euler cluster. Many people helped us during the process of developing and writing this article. We give our sincere thanks to Sergio Pérez, Pablo Piaggi, Riccardo Capelli, Michele Invernizzi, Zoran Bjelobrk, Sandro Bottaro, Yue-Yu Zhang, Tarak Karmakar, Jayashrita Debnath, and Paolo Carloni. We also express our gratitude to the SAMPL challenges organisers for their precious initiative.

Author information

Authors and Affiliations

Department of Chemistry and Applied Biosciences, ETH Zurich, 8092, Zurich, Switzerland
Valerio Rizzi, Narjes Ansari & Michele Parrinello
Facoltà di Informatica, Istituto di Scienze Computazionali, Università della Svizzera Italiana, Via G. Buffi 13, 6900, Lugano, Switzerland
Valerio Rizzi, Luigi Bonati, Narjes Ansari & Michele Parrinello
Department of Physics, ETH Zurich, 8092, Zurich, Switzerland
Luigi Bonati
Italian Institute of Technology, Via Morego 30, 16163, Genova, Italy
Michele Parrinello

Contributions

V.R. performed the simulations. V.R., L.B., N.A., and M.P. discussed the results and reviewed the manuscript.

Corresponding author

Correspondence to Michele Parrinello.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks David Mobley and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Rizzi, V., Bonati, L., Ansari, N. et al. The role of water in host-guest interaction. Nat Commun 12, 93 (2021). https://doi.org/10.1038/s41467-020-20310-0

Received: 23 June 2020
Accepted: 23 November 2020
Published: 04 January 2021
DOI: https://doi.org/10.1038/s41467-020-20310-0
