Configurational space discretization and free energy calculation in complex molecular systems

Wang, Kai; Long, Shiyang; Tian, Pu

doi:10.1038/srep22217

Download PDF

Article
Open access
Published: 14 March 2016

Configurational space discretization and free energy calculation in complex molecular systems

Kai Wang¹,
Shiyang Long¹ &
Pu Tian^1,2

Scientific Reports volume 6, Article number: 22217 (2016) Cite this article

972 Accesses
1 Citations
1 Altmetric
Metrics details

Subjects

Abstract

We sought to design a free energy calculation scheme with the hope of saving cost for generating dynamical information that is inherent in trajectories. We demonstrated that snapshots in a converged trajectory set are associated with implicit conformers that have invariant statistical weight distribution (ISWD). Since infinite number of sets of implicit conformers with ISWD may be created through independent converged trajectory sets, we hypothesized that explicit conformers with ISWD may be constructed for complex molecular systems through systematic increase of conformer fineness and tested the hypothesis in lipid molecule palmitoyloleoylphosphatidylcholine (POPC). Furthermore, when explicit conformers with ISWD were utilized as basic states to define conformational entropy, change of which between two given macrostates was found to be equivalent to change of free energy except a mere difference of a negative temperature factor and change of enthalpy essentially cancels corresponding change of average intra-conformer entropy. By implicitly taking advantage of entropy enthalpy compensation and forgoing all dynamical information, constructing explicit conformers with ISWD and counting thermally accessible number of which for interested end macrostates is likely to be an efficient and reliable alternative end point free energy calculation strategy.

Efficient sampling of high-dimensional free energy landscapes using adaptive reinforced dynamics

Article 24 December 2021

Quantum simulation of exact electron dynamics can be more efficient than classical mean-field methods

Article Open access 10 July 2023

Sampling of the conformational landscape of small proteins with Monte Carlo methods

Article Open access 23 October 2020

Introduction

For two arbitrary macrostates A and B visited in a set of converged molecular dynamics (MD) simulation trajectories, the free energy difference may be expressed as:

with being observed number of snapshots in macrostate , being Boltzmann constant and being the temperature. However, if a converged MD trajectory set was generated for the sole purpose of calculating free energy differences between interested macrostate pairs, all dynamical information contained would have been discarded. One question we sought to answer is that if there is a way to save computational cost used for generating dynamical information by designing a free energy calculation method without explicit utilization of trajectories. A rarely discussed fact is that each snapshot represents an implicit microscopic volume (termed conformer hereafter) in configurational space (see Fig. 1a). More importantly, eq. (1) implies that, in a set of converged trajectories, implicit conformers associated with snapshots have invariant statistical weight distribution (ISWD) across the whole configurational space (see Fig. 1c). Therefore, one way to answer our original question is to accomplish the following two tasks: i) to construct a set of configurational-space-filling (Let the volume of the whole configurational space of a -atom molecular system being , for a set of conformers each has a non-overlapping volume v_i(i = 1, 2, ·, M), if , then this set of conformers are configurational-space-filling, see Fig. 1b for a schematic representation) explicit conformers, with thermally accessible ones among which have the property of ISWD (or a sufficiently good approximation of it) and ii) to design an efficient method to count such conformers that are thermally accessible in given macrostates. To be concise, we use “explicit conformers with ISWD (ECISWD)” to represent “configurational-space-filling explicit conformers, with thermally accessible ones among which have the property of ISWD (or a sufficiently good approximation of it)” hereafter. For two arbitrary macrostates and that have and (Note that both are functions of potential energy) thermally accessible conformers, denoting corresponding average statistical weight of conformers as and , the change of free energy between these two macrostates may be written as:

For ECISWD, , therefore:

It was demonstrated that sequential Monte Carlo (SMC) in combination with importance sampling^1,2 may rapidly count the number of explicit conformers that are thermally accessible. Therefore, the hinging issue is to construct a set of ECISWD. We set to address this issue and accompanying implications in this study.

Hypothesis on ECISWD

Conformers associated with MD snapshots are implicit with no information available for their shapes or sizes, we consequently may not directly learn from MD trajectories. One principal consideration for defining ECISWD is sufficient fineness since statistical weight of complex molecular systems are in general exponentially different for different macrostates³, very coarse conformers are associated with the possibility that the heavist conformer in the statistically most dominant macrostate weighs more than the total of all other macrotates, hence rendering ISWD impossible. Better uniformity is another factor to consider for the same reason. It is noted that ISWD holds for each set of implicit conformers associated with snapshots of corresponding independent and converged MD trajectory set. Therefore, infinite number of ways exist for constructing sets of implicit conformers with ISWD for a given complex molecular systems. Based on this thought, we hypothesized that any set of sufficiently fine and uniform conformers should approximately have the property of ISWD and we may consequently define ECISWD through systematically increasing their fineness according to our convenience.

This hypothesis is immediately disproved by a simple double well system shown in Fig. 2. With increasingly different between two wells and , regardless of the fineness for any uniformly defined conformers, the statistical weight distribution of which in two macrostates will be increasingly different. The only way to achieve sufficiently good approximate ISWD is to construct conformers that were properly weighted by , the potential energy surface that we do not know a priori in a real complex molecular system. Nonetheless, complex molecular systems are very different from a double well system. As shown in Fig. 2, if we divide macrostates and into and (e.g. ) conformers, is consistently higher in than in in terms of conformer average and within each conformer is essentially a constant. Such situation is unlikely, if ever possible, to occur in a complex molecular system. With large number of degrees of freedom (DOFs), tight packing and steep van der waals repulsive core of constituting atoms, potential energy may vary significantly within a microscopic volume of configurational space. Therefore, we think that competitions among large number of DOFs may render construction of ECISWD an achievable task and the above mentioned hypothesis may well be valid for complex molecular systems.

Sufficiently well-converged MD trajectory sets of specific molecular systems provide ideal test grounds for ISWD property of given explicit conformers based on the following two arguments. Firstly, trajectory sets are generated by known force fields and therefore no convolution of force fields inaccuracy and experimental error exists as in the case of comparing computational results with experimental ones; Secondly, we may arbitrarily partition configurational space visited in a trajectory set and a hypothesis tested for arbitrarily given partitions should remain true for the whole configurational space. This is an important logic since traversing configurational space for complex molecular systems is practically impossible. The symbolic equivalence between eqs (1) and (3) suggests that for a set of ECISWD, if we assign each snapshot in a trajectory set to a corresponding conformer and utilizing eqs (1) and (3) respectively to calculate free energy changes for arbitrarily selected pairs of macrostates, differences in results caused by different conformer definitions (between a given explicit conformer set and the implicit one associated with snapshots) should decrease with increasing size of trajectory set and essentially disappear for a fully converged trajectory set, the reason is that free energy difference between two arbitrarily given macrostates does not depend on the way it is calculated. Conversely, if statistical weight distribution of a set of explicit conformers is widely different in different part of the configurational space, the corresponding differences in results would increase with increasing size of trajectory set and saturate for a fully converged trajectory set since the largest possible error is limited by the number of available snapshots in any trajectory sets that are not fully converged. Both complete disappearance of differences resulted from eqs (1) and (3) for the case of ECISWD and full saturation of differences resulted from these two equations for the case of explicit conformers without ISWD will be extremely difficult to observe for complex molecular systems due to excessive amount of data needed. Nonetheless, the trend should be equivalently informative as long as the largest trajectory set is sufficiently well-converged.

We chose lipid POPC to carry out such tests based on the fact that large MD trajectory sets are available for this molecule. Specifically, we firstly extracted MD trajectories of POPC from trajectories of M2 muscarinic acetylcholine receptor study⁴. Three increasingly larger trajectory sets, TSA1, TSA2 and TSA3 were constructed with smaller trajectory sets being subsets of larger ones. Secondly, we defined four different sets of conformers, which were denoted as CONF1 through CONF4 (see Fig. 3 and Table 1) respectively, with CONF1 being the finest and CONF4 being the coarsest. Thirdly, we used backbone dihedrals as order parameters to construct macrostates through projection operations. Finally, number of conformers () were calculated for each macrostate of the given combination of trajectory set and definition of conformers (see Methods for details).

Table 1 Detailed list of comprising atoms of the 43 torsions utilized in defining conformers and macrostates for POPC.

Full size table

With the above given definitions of conformers, macrostates and trajectory sets, we calculated for all pairs of macrostates on each combination of conformer definition and trajectory set according to eq. (1) (denoted as ) and eq. (3) (denoted as ) respectively and their differences were denoted as (see Methods for details), which essentially measures differences between our constructed set of explicit conformers and implicit conformers associated with snapshots. Distributions of and cumulative probability density (CPD) of its absolute values for the four sets of explicit conformers (CONF1 through CONF4) are shown in Fig. 4. Firstly, for CONF2 through CONF4 (Fig. 4b–d), distribution of is significantly broader for larger trajectory set. Secondly, it is noted that the range of horizontal axis is widely different for these three sets of conformers (ranging from less than 0.1 to a few ). For a given trajectory set, dramatically broader distribution of is observed for coarser conformer definitions. Correspondingly, CPD plots of (Fig. 4f–h) exhibit the extent of errors more directly. These observations match our expectation for coarse conformers that do not have sufficiently good approximation of ISWD. Finally and most importantly, for CONF1 (Fig. 4a), distribution of is narrower for larger trajectory set and is significantly narrower than that of all other conformers (Fig. 4b–d), the CPD plot (Fig. 4e) shows the differences among trajectory sets more clearly. Therefore, conformers in set CONF1 match our expectation for ECISWD. The observation of the behavior for CONF1 through CONF4 suggest that, as hypothesized, we may define a set of ECISWD through systematic increase of conformer fineness. Regarding the uniformity of conformers, we equally partitioned each torsional DOF into three torsional states since we have no better information a priori to divide otherwise. To test further the hypothesis that any sufficiently fine conformers should have similarly good approximation of ISWD, we defined a few more different set of conformers with similar fineness to CONF1 through CONF4 respectively and similar observations were made (see Fig. 5). On different trajectory sets of POPC with similar size to TSA1 through TSA3, similar observations were made (data not shown). It is noted that regardless of conformer definition and trajectory set size, distributions of is approximately symmetric with the mode at zero (Fig. 4a–d, Fig. S1a–d and Fig. S2a–d), this is inevitable since selection of start and end macrostate is arbitrary and consistent in calculating both and .

For coarser explicit conformers without ISWD, deviations from ISWD are expected to occur in the heaviest macrostates, where larger probability for occurrence of excessively heavy conformers would cause uneven distribution of statistical weight. Again, such deviations are expected to be larger for larger trajectory sets (and eventually saturate for a fully converged trajectory set). To this end, we plotted vs for all constructed macrostates in Fig. 6 for CONF1 and CONF4. Indeed, deviations occur for the heaviest macrostates and are larger for larger trajectory set for CONF4 (Fig. 6b,d,f). Perfect scaling was observed for CONF1 (Fig. 6a,c,e) as expected.

Conformational entropy based on ECISWD

Typical molecular systems in chemical, materials and biological studies, when treated quantum mechanically, present intractable complexity. Classical (continuous) representation of atomic DOFs, however, presents an awkward situation for the definition of microstates and entropy⁵. Correspondingly, density of states of classical systems may be determined only up to a multiplicative factor⁶. The term “conformational entropy”, despite its widespread usage, has no well established definition available for major complex biomolecular systems. Explicit conformers with ISWD, despite its system dependence and the fact that infinite number of specific definitions exist for each given complex molecular systems, may be utilized as basic states for defining conformational entropy in an abstract and general sense for any complex molecular systems and we explore this idea and its implications in this section.

It is well established in the informational theory field⁷ that for a given static distribution with well-defined basic states, entropy may be constructed by arbitrary division of the whole system into subparts.

with , and being properly normalized:

is the global informational entropy and s are local informational entropies, it is noted that such division may be carried out recursively. We may similarly construct both local entropies of macrostates (say and ) and global entropy for the given molecular system based on a set of explicit conformers:

is the probability of the th conformer in the global configurational space, is the probability of the th conformer in macrostate . is the intra-conformer entropy of the th conformer in the global configurational space. is the intra-conformer entropy for the th conformer in macrostate . , and are number of thermally accessible conformers in the full configurational space, in macrostate and in macrostate respectively. Again, , and are properly normalized:

The first terms on the right hand side of eqs (8, 9 and 10) describe distributions of conformer statistical weights within a macrostate or within the whole configurational space and is referred to as “conformational entropy” (), the second terms are averages of the intra-conformer entropies of corresponding conformers and are denoted . We may rewrite and in the following form:

With a simple algebraic manipulation shown below:

Conformational entropy of macrostate () is divided into two terms. The first term is the Boltzmann entropy (or ideal gas entropy, denoted as ) based on the number of conformers. The second term represents deviation from the Boltzmann entropy (denoted as ). It is the product of the Boltzmann constant and the Kullback-Leibler divergence⁸ between the actual probability distribution of conformer statistical weights in macrostate () and the uniform distribution (). may be rewritten as:

Similarly, denote probability distribution of conformer statistical weights in macrostate as and the corresponding uniform distribution as , we have:

For ECISWD, if we denote the corresponding ISWD with a continuous probability density , then and . Denote the continuous uniform distribution as , we have:

Note that (eq. 26) is equivalent to (eq. 3) except a mere difference of a negative temperature factor. reflect the difference between two KL divergences, which correspond to distances between the statistical weight distribution of conformers in macrostate and the uniform distribution. The advantage of utilizing ECISWD for defining conformational entropy is the generality by concealing system specific molecular structural information in specific definition of conformers. Additionally, when difference of conformational entropy is taken between two arbitrary macrostates, deviation of the unknown ISWD from the uniform distribution is cancelled and we need only to deal with the number of conformers. Based on the same logic as in the case of free energy analysis, with increasingly larger subsets of a sufficiently well-converged MD trajectory set, we expect to observe systematic decrease of calculated for arbitrarily defined macrostate pairs as long as ECISWD are basic states of conformational entropy. Conversely, we expect to observe systematic increase of when explicit conformers with widely variant statistical weight distributions are basic states of conformational entropy. To this end, we took the same trajectory sets, definition of conformers and macrostates as in the analysis of and calculated corresponding based on eqs (20) and (22) for each macrostate pair. Both distributions of and corresponding CPD of its absolute value were shown in Fig. 7. As expected and consistent with free energy analysis as shown in Fig. 4, trend of based on conformers in set CONF1 (Fig. 7a,e) matches our expectation for that of ECISWD, while trends of based on conformers in sets CONF2 through CONF4 (Fig. 7b–d,f–h) match our expectation for that of conformers with variant statistical weight distribution, with coarser conformers and larger trajectory sets correspond to wider distributions of .

Entropy enthalpy compensation

In canonical ensemble, we have:

with being the change of potential energy between the two macrostates and . Let and substitute eqs 12, 15, 19, 21 and 25 into eq. (27), we have:

While the derivation is carried out in canonical ensemble, it should be applicable for many isobaric-isothermal processes (e.g. many biomolecular systems under physiological conditions or routine experimental conditions) where change of the term is negligible. Note that eq. 28 is the intriguing entropy-enthalpy compensation (EEC) phenomena (when the term is negligible), which had long been an enigma^{9,10,11,12,13} and has attracted a revival of interest due to its critical relevance in protein-ligand interactions^{14,15,16,17,18,19,20,21,22,23,24,25}. Careful statistical analysis confirm that EEC does exist to various extent in many protein-ligand interaction systems after experimental errors are effectively removed¹⁹. For a given molecular system, once we have constructed a set of ECISWD, eqs (3) and (28) state that change of molecular interactions does not necessarily cause change of free energy, which depends on relative number of thermally accessible ECISWD in end macrostates and local effects from change of molecular interactions will be cancelled almost completely by corresponding change of average intra-conformer entropy. Note that correlation of neither signs nor magnitudes between and is implied. Therefore, depending upon signs and magnitudes of and (we neglect the term here), this theory is compatible with molecular processes driven by enthalpy, entropy or both and various extent of observed EEC. When , perfect EEC would be observed; when and (or ), a seemingly entropy driven (and a reverse entropy limited) process would be observed; when and (or ), depending upon the sign of , a seemingly enthalpy or entropy-enthalpy jointly driven (and a reverse enthalpy or entropy-enthalpy jointly limited) process would be observed. The fundamental new perspective provided by eqs (3, 26 and 28) is that EEC is directly related to local redistribution of microstates in configurational space, while change of free energy and conformational entropy reflect the collective thermal accessibility of relevant macrostates. System complexity is essential for construction of ECISWD as demonstrated by our initial discussions on the double well model. Consistently, robustness of approximations in eqs (3) and (26) corresponds to the near-perfect cancellation of change of intra-conformer entropy and change of enthalpy as reflected by eq (28). Without sufficient number of complex and heterogeneous microstates within each conformer, it is hard to imagine how such EEC occur. Along the same lines, a simple Morse potential type of protein-ligand interaction model was not found to allow significant EEC²². Based on the widespread observation of strong EEC effect in many molecular systems, it was suggested²² that any attempt to calculate the change of free energy as a sum of its enthalpic and entropic contributions is likely to be unreliable. The proposed conformer counting strategy (eq. 3) implicitly utilizes EEC by completely avoiding direct calculation of and , which is expensive and error prone.

Conclusions

In summary, we presented the idea that snapshots in a converged MD trajectory set map directly to implicit thermally accessible conformers with ISWD. Based on the thought that infinite number of ways exist for defining implicit conformers with ISWD for a given molecular system, we hypothesized that any sufficiently fine set of conformers should have sufficiently good approximate ISWD. This hypothesis, while being disproved by a double well potential, tested successfully on extensive MD trajectories of lipid POPC. We think that competition of many DOFs, each allowed to vary significantly in both potential energy and spatial position within a conformer, constitutes the foundation for the observed validity of the hypothesis. Considering the moderate complexity of lipid POPC, it is likely that the hypothesis holds for complex molecular systems in general. This is a useful demonstration of the idea that “More is different”²⁶. Active research is undergoing in our group toward defining ECISWD for more biomolecular systems (e.g. protein-ligand, protein-protein interaction and protein-nucleic acid interactions systems with explicit or implicit solvation). Furthermore, when ECISWD are utilized as basic states for definition of conformational entropy, change of which between two macrostates was found to be equivalent with corresponding change of free energy except a mere difference of a negative temperature factor. Meanwhile, change of potential energy between two macrostates was found to cancel corresponding change of average intra-conformer entropy. This finding suggests that EEC is inherently a local phenomenon in configurational space and is likely universal in complex molecular systems. While providing an alternative perspective to the long-standing enigmatic EEC, this result is consistent with different extent of EEC observed for both enthalpy driven and entropy driven molecular processes in conventional sense where change of enthalpy is compared with change of total entropy. Counting thermally accessible ECISWD (eq. 3) is a natural extension of the population based free energy formula (eq. 1), which is only useful posterior to a converged simulation. However, eq. 3 effectively utilizes EEC implicitly through separation of entropy into conformational entropy based on ECISWD and intra-conformer entropy and renders direct utilization of SMC and importance sampling possible for rapid free energy difference estimation^1,2. In accordance with “no free lunch theorem”²⁷, this expected gain in efficiency pays the price of all dynamical and pathway information associated with converged trajectories.

Methods

Definition of trajectory sets

Trajectory sets TSA1, TSA2 and TSA3 are constructed from snapshots of POPC collected in simulation condition A in the supplementary Table 2 of the GPCR simulation study⁴. There were totally 34143653 snapshots, which collectively amount to (). Five subsets, with collective length (CL) being , , , and respectively, were available for this simulation condition. We take the first six trajectories out of the total 66 trajectories of the first subset as TSA1, which has a CL of . The first subset () was taken as TSA2 and the union of all subsets was taken as TSA3 ().

Definition of conformers

To define conformers, we first take a given set of torsional DOFs (Fig. 3), with each being divided into three equally sized torsional states with boundaries at , and and subsequently utilize their unique combinations as conformers. The whole configurational space is therefore divided into conformers. Sets CONF1 through CONF4 divide the configurational space into , , and conformers respectively. Two structural states (i.e. snapshots) of a POPC molecule belong to the same conformer if and only if they share the same torsional state for each selected torsional DOF. Apparently, infinite number of ways exist to define set of conformers with similar fineness and uniformity.

Defintion of macrostates and corresponding number of conformers within each of which

To prepare macrostates, all snapshots in a given trajectory set were projected onto a selected backbone dihedral that was partitioned into 20 18°-windows, snapshots fall within each of which constitute an observed macrostate. Such projections were performed for each of 43 dihedrals (Fig. 3) and we have collectively 860 macrostates for each given combination of trajectory set and conformer definition. Apparently, macrostates based on the same dihedral angle do not overlap, while those based on different dihedral angles may overlap to different extent. To assign each snapshots to its belonging conformer and calculate for the th macrostates, torsional states for the selected torsional DOFs were encoded into bit vectors and the radix sort algorithm²⁸ was utilized. Take CONF1 and TSA1 as an example, all snapshots from TSA1 that satisfy the criteria for the th macrostate are binned into the possible conformers and total number of non-empty bins is the , which is subsequently utilized in eq. (3) to calculate explicit-conformer-based free energy difference as . For each specific combination of conformer definition and trajectory set, () are also used in constructing Fig. 6.

Calculation of δΔF

For a given pair of macrostates (, ) under specific definition of conformer and trajectory set, we first calculate and according to eqs (1) and (3) and we subsequently calculate . With 860 macrostates, runs from 2 through 860 and runs from 1 through for each , we calculated for 369370 macrostate pairs. Distribution of these 369370 values were plotted in Figs 4a–d and 5a–d. Since we care more the magnitude of free energy differences than their signs, we calculated distribution of and CPD of which is plotted in Figs 4e–h and 5e–h. The area under curves (AUC) for CPD curves provide clearer description of extent of differences between based on implicit conformers with ISWD (eq. 1) and based on specific definition of explicit conformers (eq. 3), with larger AUC corresponds to smaller differences.

Calculation of δΔS

For the th macrostate under specific definition of conformer and trajectory set, we first calculated according to eq. (20) (or eq. (22)), with runs from 1 through 860. Subsequently, for each macrostate pair , is calculated, with runs from 2 through 860 and runs from 1 to for each . We therefore had 369370 values for each specific combination of trajectory set and definition of conformers. Probability distribution of these values are plotted in Fig. 7a–d and CPD of their absolute values in Fig. 7e–h, similar to plotting of distributions and CPD of their absolute values in Figs 4 and 5. AUC for CPD curves of describes extent of differences between change of ideal gas entropy based on number of conformers (eq. 26) and observed change of conformational entropy , with being defined in eq. (13) or eq. (16). Again, larger AUC corresponds to smaller difference.

Additional Information

How to cite this article: Wang, K. et al. Configurational space discretization and free energy calculation in complex molecular systems. Sci. Rep. 6, 22217; doi: 10.1038/srep22217 (2016).

References

Zhang, J., Chen, R., Tang, C. & Liang, J. Origin of scaling behavior of protein packing density: A sequential Monte Carlo study of compact long chain polymers. J. Chem. Phys. 118, 6102–9 (2003).
Article CAS ADS Google Scholar
Zhang, J. & Liu, J. S. On side-chain conformational entropy of proteins. PLoS Comput. Biol. 2, e168 (2006).
Article ADS Google Scholar
Skilling, J. Nested Sampling for Bayesian Computations. Bayesian Anal. 1, 833–860 (2006).
Article MathSciNet Google Scholar
Dror, R. O. et al. Structural basis for modulation of a G-protein-coupled receptor by allosteric drugs. Nature 503, 295–9 (2013).
Article CAS ADS Google Scholar
Wehrl, A. General properties of entropy. Rev. Mod. Phys. 50, 221–260 (1978).
Article ADS MathSciNet Google Scholar
Chipot, C. & Pohorille, A. Free Energy Calculations, Theory and Applications in Chemistry and Biology (Springer, Berlin Heidelherg New York, 2007).
Shannon, C. A mathematical theory of communication. Bell Syst. Tech. J. 27, 379–423 (1948).
Article MathSciNet Google Scholar
Kullback, S. & Leibler, R. A. On information and sufficiency. Ann. Math. Statist. 22, 79–86 (1951).
Article MathSciNet Google Scholar
Lumry, R. & Rajender, S. Enthalpy-entropy compensation phenomena in water solutions of proteins and small molecules: a ubiquitous property of water. Biopolymers 9, 1125–227 (1970).
Article CAS Google Scholar
Imai, K. & Yonetani, T. Thermodynamic Studies of Oxygen Equilibrium of Hemoglobin. J. Biol. Chem. 250, 7093–7098 (1976).
Google Scholar
Grunwald, E. & Steel, C. Solvent Reorganization and Thermodynamic Enthalpy-Entropy Compensation. J. Am. Chem. Soc. 117, 5687–5692 (1995).
Article CAS Google Scholar
Gallicchio, E., Kubo, M. M. & Levy, R. M. Entropy - Enthalpy compensation in solvation and ligand binding revisited. J. Am. Chem. Soc. 120, 4526–4527 (1998).
Article CAS Google Scholar
Liu, L. & Guo, Q.-x. Isokinetic Relationship, Isoequilibrium Relationship and Enthalpy Entropy Compensation. Chem. Rev. 101, 673–695 (2001).
Article CAS Google Scholar
Ford, D. M. Enthalpy-entropy compensation is not a general feature of weak association. J. Am. Chem. Soc. 127, 16167–70 (2005).
Article CAS Google Scholar
Krishnamurthy, V. M., Bohall, B. R., Semetey, V. & Whitesides, G. M. The paradoxical thermodynamic basis for the interaction of ethylene glycol, glycine and sarcosine chains with bovine carbonic anhydrase II: an unexpected manifestation of enthalpy/entropy compensation. J. Am. Chem. Soc. 128, 5802–12 (2006).
Article CAS Google Scholar
Starikov, E. B. & Nordén, B. Enthalpy-entropy compensation: a phantom or something useful? J. Phys. Chem. B 111, 14431–5 (2007).
Article CAS Google Scholar
Ward, J. M., Gorenstein, N. M., Tian, J., Martin, S. F. & Post, C. B. Constraining binding hot spots: NMR and molecular dynamics simulations provide a structural explanation for enthalpy-entropy compensation in SH2-ligand binding. J. Am. Chem. Soc. 132, 11058–70 (2010).
Article CAS Google Scholar
Liu, G., Gu, D., Liu, H., Ding, W. & Li, Z. Enthalpy-entropy compensation of ionic liquid-type Gemini imidazolium surfactants in aqueous solutions: a free energy perturbation study. J. Colloid. Interf. Sci. 358, 521–6 (2011).
Article CAS ADS Google Scholar
Olsson, T. S. G., Ladbury, J. E., Pitt, W. R. & Williams, M. a. Extent of enthalpy-entropy compensation in protein-ligand interactions. Protein Sci. 20, 1607–1618 (2011).
Article CAS Google Scholar
Ferrante, A. & Gorski, J. Enthalpy-entropy compensation and cooperativity as thermodynamic epiphenomena of structural flexibility in ligand-receptor interactions. J. Mol. Biol. 417, 454–67 (2012).
Article CAS Google Scholar
Starikov, E. B. & Nordén, B. Entropy-enthalpy compensation as a fundamental concept and analysis tool for systematical experimental data. Chem. Phys. Lett. 538, 118–120 (2012).
Article CAS ADS Google Scholar
Chodera, J. D. & Mobley, D. L. Entropy-enthalpy compensation: role and ramifications in biomolecular ligand recognition and design. Annu. Rev. Biophys. 42, 121–42 (2013).
Article CAS Google Scholar
Breiten, B. et al. Water Networks Contribute to Enthalpy/Entropy Compensation in Protein-Ligand Binding. J. Am. Chem. Soc. 135, 15579–15584 (2013).
Article CAS Google Scholar
Tidemand, K. D., Scho, C., Holm, R., Westh, P. & Peters, G. H. Computational Investigation of Enthalpy-Entropy Compensation in Complexation of Glycoconjugated Bile Salts with β Cyclodextrin and Analogs. J. Chem. Phys. 118, 10889–10897 (2014).
Article CAS Google Scholar
Ahmad, M., Helms, V., Lengauer, T. & Kalinina, O. V. Enthalpy-Entropy Compensation upon Molecular Conformational Changes. J. Chem. Theory Comput. 11, 1410–8 (2015).
Article CAS Google Scholar
Anderson, P. More is different. Science 177, 393–6 (1972).
Article CAS ADS Google Scholar
Wolpert, D. & Macready, W. No free lunch theorems for optimization. IEEE T. Evolut. Comput. 1, 67–81 (1997).
Article Google Scholar
Cormen, T. H., Leiserson, C. E., Rivest, R. L. & Stein, C. Introduction to Algorithms (MIT Press and McGraw-Hill, 2009), 3rd edn.

Download references

Acknowledgements

This research was supported by National Natural Science Foundation of China under grant number 31270758 and by the Research fund for the doctoral program of higher education under grant number 20120061110019. Computational resources were partially supported by High Performance Computing Center of Jilin University, China. We thank DE Shaw Research for providing trajectory sets. We thank Zhonghan Hu for insightful discussions.

Author information

Authors and Affiliations

College of Life Science, Jilin University, 2699 Qianjin Street, Changchun, 130012, China
Kai Wang, Shiyang Long & Pu Tian
MOE Key Laboratory of Molecular Enzymology and Engineering, Jilin University, 2699 Qianjin Street, Changchun, 130012, China
Pu Tian

Authors

Kai Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shiyang Long
View author publications
You can also search for this author in PubMed Google Scholar
Pu Tian
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.T. designed the whole study and wrote the manuscript. K.W. performed data collection and analysis. S.L. wrote partial codes for data analysis and assisted in data analysis.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Wang, K., Long, S. & Tian, P. Configurational space discretization and free energy calculation in complex molecular systems. Sci Rep 6, 22217 (2016). https://doi.org/10.1038/srep22217

Download citation

Received: 06 December 2015
Accepted: 10 February 2016
Published: 14 March 2016
DOI: https://doi.org/10.1038/srep22217

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Configurational space discretization and free energy calculation in complex molecular systems

Subjects

Abstract

Similar content being viewed by others

Efficient sampling of high-dimensional free energy landscapes using adaptive reinforced dynamics

Quantum simulation of exact electron dynamics can be more efficient than classical mean-field methods

Sampling of the conformational landscape of small proteins with Monte Carlo methods

Introduction

Hypothesis on ECISWD

Conformational entropy based on ECISWD

Entropy enthalpy compensation

Conclusions

Methods

Definition of trajectory sets

Definition of conformers

Defintion of macrostates and corresponding number of conformers within each of which

Calculation of δΔF

Calculation of δΔS

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Supplementary Information

Rights and permissions

About this article

Cite this article

Comments

Search

Quick links

Subjects

Abstract

Similar content being viewed by others

Efficient sampling of high-dimensional free energy landscapes using adaptive reinforced dynamics

Quantum simulation of exact electron dynamics can be more efficient than classical mean-field methods

Sampling of the conformational landscape of small proteins with Monte Carlo methods

Introduction

Hypothesis on ECISWD

Conformational entropy based on ECISWD

Entropy enthalpy compensation

Conclusions

Methods

Definition of trajectory sets

Definition of conformers

Defintion of macrostates and corresponding number of conformers within each of which

Calculation of δΔF

Calculation of δΔS

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links