Finding multiple reaction pathways via global optimization of action

Lee, Juyong; Lee, In-Ho; Joung, InSuk; Lee, Jooyoung; Brooks, Bernard R.

doi:10.1038/ncomms15443

Download PDF

Article
Open access
Published: 26 May 2017

Finding multiple reaction pathways via global optimization of action

Juyong Lee¹,
In-Ho Lee^2,3,
InSuk Joung^3,4,
Jooyoung Lee^3,4 &
…
Bernard R. Brooks¹

Nature Communications volume 8, Article number: 15443 (2017) Cite this article

5700 Accesses
29 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Global searching for reaction pathways is a long-standing challenge in computational chemistry and biology. Most existing approaches perform only local searches due to computational complexity. Here we present a computational approach, Action-CSA, to find multiple diverse reaction pathways connecting fixed initial and final states through global optimization of the Onsager–Machlup action using the conformational space annealing (CSA) method. Action-CSA successfully overcomes large energy barriers via crossovers and mutations of pathways and finds all possible pathways of small systems without initial guesses on pathways. The rank order and the transition time distribution of multiple pathways are in good agreement with those of long Langevin dynamics simulations. The lowest action folding pathway of FSD-1 is consistent with recent experiments. The results show that Action-CSA is an efficient and robust computational approach to study the multiple pathways of complex reactions and large-scale conformational changes.

Accurate transition state generation with an object-aware equivariant elementary reaction diffusion model

Article 15 December 2023

Computing the committor with the committor to study the transition state ensemble

Article 05 June 2024

A computer algorithm to discover iterative sequences of organic reactions

Article 12 January 2022

Introduction

Finding multiple plausible reaction pathways between two end states is a long-standing challenge in computational sciences¹. One of the common approaches is to perform long-time molecular dynamics (MD) simulations. Despite recent advances in the MD methodologies and computational technologies, this approach suffers from a timescale problem. Many biological reactions such as protein folding and protein conformational transitions occur in the microsecond or millisecond ranges, which are still hard to be performed even with the fastest computers available today. Also, MD simulations starting from one end state are not guaranteed to reach the other end state of interest especially considering the inaccuracies of current force fields. Thus, developing an efficient computational method to find multiple possible reaction pathways connecting two end states can serve as the ultimate and practical solution of the challenge. Although several such methods have been suggested^{1,2,3,4,5,6,7,8}, exploring and producing multiple reaction pathways of a complex system remains a challenge. The objective of this work is to present a method that can efficiently explore and produce multiple reaction pathways connecting two end states. Other approaches using a conformational driving force do not sample alternatives^9,10. Methods that are robust, such as transition path sampling^2,3, are very expensive to use for complex systems in the presence of multiple steps and barriers.

Various chain-of-states methods have been suggested based on the assumption that a dominant transition pathway between two states follows the minimum energy pathway^11,12,13. The limitations of these methods are that they do not consider the dynamics of a system and find only the nearest local minimum solution from a given initial pathway^1,9. Alternative methods based on the principle of least action have been suggested^{5,14,15,16,17,18,19,20}. Passerone and Parrinello suggested the action-derived molecular dynamics (ADMD) method based on the combination of classical action and a penalty term that conserves the total energy of a system^18,19. To enhance the convergence of ADMD calculations, Lee et al.^20,21,22,23 introduced a kinetic energy penalty term based on the equipartition theorem. Although the ADMD approaches yield physically relevant pathways, they have two practical limitations^20,24: (a) they strongly depend on the initial guesses of a pathway; and (b) they cannot identify the relative dominance of multiple pathways because the classical principle of least action is an extremum principle²⁵.

For diffusive processes, the second problem can be avoided by using the Onsager-Machlup (OM) action S_OM^{15,26,27,28,29,30,31}. Onsager and Machlup showed that the relative probability to observe a pathway with an OM action of S is proportional to where k_B is the Boltzmann constant and T is a temperature. Thus the most dominant pathway corresponds to the one that minimizes S_OM and the same result can be obtained by solving the Fokker–Planck equation^7,8,32. This property recasts the problem of finding multiple pathways into a global search and optimization problem. However, finding multiple low action pathways is a challenging task because the minimization of S_OM requires the second derivatives of a potential function, which are computationally expensive, at best, and wholly unavailable for many quantum mechanical energy surfaces.

In this work, we propose an efficient computational method, Action-CSA, that finds multiple low OM action pathways without second derivative calculations. For global search and optimization of a pathway space, we used an efficient global optimization method called conformational space annealing (CSA), which is based on a combination of genetic algorithm, simulated annealing, and Monte Carlo with minimization^33,34. CSA has been demonstrated to be extremely efficient in solving various global optimization problems including finding low energy conformations of Lennard–Jones clusters³⁵, protein structure prediction^{34,35,36,37,38,39,40}, community detection in networks^41,42,43, and designing the first-ever direct bandgap silicon and carbon allotropes^44,45,46,47. CSA is the most robust method available in CHARMM^48,49 for generating low energy conformations of peptides. We extend the CSA approach to examine pathways, preserving all features that make it robust and efficient, by applying it to sets of entire pathways represented as a chain-of-states.

Action-CSA efficiently explores the pathway space regardless of the heights of energy barriers via crossovers and mutations of pathways. Without calculating the second derivatives of a potential energy, multiple diverse pathways with low OM action were obtained by combining local optimization of pathways using classical action and selection of pathways using the OM action. From benchmark simulations using alanine dipeptide, our method finds multiple transition pathways, which are consistent with long-time Langevin dynamics (LD) simulations. The rank order statistics and transition time distributions of the multiple pathways are in good agreement with those of the LD results. For the conformational change of hexane from the all-gauche(−) to all-gauche(+) states, Action-CSA finds all possible transition pathways. Also, the lowest action folding pathway of FSD-1 is consistent with recent experiments reported after the submission of this work. These results demonstrate that Action-CSA searches multiple reaction pathways including the most dominant one in an efficient and robust way.

Results

Conformational change of alanine dipeptide

A comparison of Action-CSA and LD simulations demonstrate that Action-CSA finds multiple possible pathways and correctly identifies the most probable one. Eight different pathways were identified for the C7_eq→C7_ax transition by clustering all pathways sampled from the Action-CSA simulations (Fig. 1a). From the S_OM values obtained with different transition times (Fig. 1b), it is clear that the pathway that crosses barrier B has the lowest S_OM value along all transition times tested indicating that it is the most probable pathway regardless of the transition time. This is consistent with the 500 μs LD simulation results (Table 1). From the LD simulations, 1,350 transitions starting from C7_eq to C7_ax were observed. They were clustered by finding the nearest neighbor from the eight pathways obtained by Action-CSA. From the clustering, the pathway crossing barrier B was identified as the most probable one with all transition times considered. This demonstrates that Action-CSA correctly identified the minimum OM action pathway and that it matched the most dominant pathway observed in the LD simulations.

**Figure 1: Conformational transition pathways of alanine dipeptide.**

Table 1 The frequencies of transition pathways of alanine dipeptide from C7_eq to C7_ax observed from 500 μs Langevin dynamics simulations.

Full size table

In addition, it is also identified that Action-CSA simulations provide information on the transition times of various pathways. Until t<0.8 ps, the pathway that crosses barrier C (Path2) has the second lowest S_OM and the lowest S_OM value was observed at 0.4 ps. These are consistent with the LD results in which all 118 transitions that crossed barrier C occurred within 1.1 ps and their most probable transition time was 0.7 ps (the inset of Fig. 1b). However, when t>0.8 ps, Path3, which passes the fully extended conformation region (Φ, Ψ)=(−180°, 180°) and barrier A and B becomes the pathway with the second lowest S_OM. From the LD simulations, when t>0.9 ps, 25 pathways similar to Path3 were identified, which makes them the second dominant pathway. These results demonstrate that the profile of S_OM values is consistent with the distributions of transition times obtained from the LD simulations. Note that the most probable transition times observed from the LD simulations are longer than the minimum action transition times obtained from the CSA simulations. This is because high-frequency motions due to thermal fluctuations are filtered out in the minimum action pathways^1,15,16. This means that the dwell time is well filtered out in the simulation, where a physically sufficient sampling time is assumed.

Conformational transition of hexane

The second example is finding multiple low-lying pathways for the conformational change of hexane from the all-gauche(−) state (g−g−g−) to the all-gauche(+) state (g+g+g+). We assessed the sampling ability of Action-CSA by investigating the diversity of sampled pathways. If it is assumed that dihedral angles do not cross the highest energy barrier around the cis state, all possible transition pathways can be enumerated (Supplementary Table 2). For the transition under this assumption, there exist 44 possible pathways in total, excluding cases where a torsional barrier is crossed multiple times. If the symmetries of dihedral angles and the atomic order are considered, these 44 pathways can be reduced to 14 pathway types.

We repeated the Action-CSA calculation of the transition 40 times by using 200 initial pathways consisting of 100 replicas and a transition time of 3 ps. In all 40 simulations, the 6 lowest action pathways, CC+, CC−, TC+, TC−, CM+ and CM−, were found in a robust fashion. The highest-action pathway, MXM, was found in nine simulations, and the other seven higher-action pathways were found in at least 29 simulations. On average, a single CSA simulation sampled 12 out of 14 unique path types and 26 out of 44 possible pathways. These results show that Action-CSA can sample a number of lowest action pathways including the most dominant one. The majority of the remaining pathways with higher actions can also be found with a tendency that lower action-value pathways are more frequently found. We note that the sampling ability of Action-CSA can be further improved by increasing the bank size. The potential energy landscape of the CC+ pathway corresponding to the least S_OM shows that hexane crosses six energy barriers (Fig. 2). It should be noted that the fraction of possible pathways found in a given Action-CSA simulation depends on the number of replicas and the transition time. This example represents typical use, and not a best case scenario.

**Figure 2: The lowest OM action pathway of conformational transition of hexane.**

Folding pathway of mini protein FSD-1

The third example is finding the folding pathway of FSD-1, a 28-residue mini-protein that has been widely investigated as a model system for studying the protein folding problem^{22,50,51,52,53,54}. Folding pathways of FSD-1 from the fully extended conformation to the native structure were represented by using 100 replicas, a total folding time of 10 ps, and a temperature of 300 K. The protein was represented by the PARAM19 force field⁵⁵ and solvation effects were considered by the FACTS implicit solvent model⁵⁶. This calculation required approximately 160 h with 72 Haswell cores and a diverse set of about twenty low action pathways were generated.

The lowest OM action folding pathway is consistent with a recent experiment⁵⁷ published after the submission of this work, where the early formation of C-terminal α-helix is observed to be followed by the concurrent formation of the β-hairpin and hydrophobic contacts. A comparison of the root mean square deviation values indicates that the α-helix approaches to the native structure earlier than the β-hairpin. Afterward, the folding of β-hairpin and the formation of hydrophobic core occur concurrently (Fig. 3a). The potential energy landscape of the FSD-1 folding shows that the potential energy decreases quickly after the 80 step suggesting that this step may be the transition state of folding (Fig. 3b). The conformation at the 80 step shows that the α-helix is almost fully formed while the C-terminal region is not folded yet and the hydrophobic core is partially exposed.

**Figure 3: The lowest action folding pathway of FSD-1.**

After our manuscript was submitted, it came to our attention that Meuzelaar and co-workers reported that the folding of FSD-1 occurs via an intermediate state where only the α-helix is formed⁵⁷. After this intermediate state, the β-hairpin and hydrophobic contacts form. The pathway was determined by combining temperature-dependent UV circular dichroism, Fourier transform infrared spectroscopy, two-dimensional infrared spectroscopy, and temperature-jump transient-IR spectroscopy. This folding mechanism shows good agreement with the dominant folding pathway identified in this study. This agreement strongly indicates that our method can serve as a powerful tool to study the folding mechanism of a protein with atomic details. We note that additional sub-optimal folding pathways were also obtained, where the second lowest OM action pathway suggested a different pathway: the β-hairpin folds first, followed by the concurrent formation of α-helix and hydrophobic contacts (Supplementary Fig. 1).

Discussion

The goal of Action-CSA is to search multiple diverse pathways of low OM action values in a fast and efficient fashion, rather than sampling a specific physical ensemble. Throughout an Action-CSA calculation, the OM action is used to identify the relative probabilities of multiple trial pathways obtained by performing crossover and mutation operations followed by local minimization using the classical action. Ideally, we should have performed the local minimization using the OM action, which was not feasible due to the high cost of calculating second derivatives. Instead, we performed one-point evaluation of the OM action after the minimization. Here we assumed that the CSA selection procedure using the OM action drives the CSA population to low OM action basins. A similar approach was used in designing the first-ever direct bandgap silicon and carbon allotropes^44,45,46,47, where the local optimization was performed in terms of enthalpy but the selection was done by the band gap property. Thus, an Action-CSA calculation yields a set of low OM action pathways, but they do not correspond to any physical ensembles. Action-CSA results can be used as the starting point for existing pathway sampling methods, such as transition pathway sampling³ or the replica-exchange pathway sampling^30,31, which aim to generate canonical ensembles. In addition, our method can be used to find low potential energy pathways or multiple Newtonian pathways via selection using the height of potential energy barrier or the Gauss action^58,59 instead of the OM action.

We note that the OM action depends on the friction parameter. As friction approaches to zero, the Langevin dynamics, described by the OM action, converges to the Newtonian dynamics, described by the classical action. Thus, when friction is small, we expect that pathways sampled with the classical action will be close to those sampled with the OM action. However, when friction is large, this assumption may not hold. In such cases, one should minimize the OM action directly, which will be computationally much more expensive than the current scheme because the analytic gradients of the OM action require Hessian calculations (equation (4)).

Action-CSA has three unique characteristics compared with existing path sampling methods^{2,3,4,29,60,61,62}: (a) the use of the bank of diverse pathways, (b) the generation of new trial pathways by swapping and mutating of pathway segments followed by local optimization using classical action with total energy restraint, and (c) the selection of pathways with higher probabilities by using the OM action. By maintaining a diverse bank population, one can perform more extensive search of the pathway space, and is less reliant on the initial pathway chosen. The crossover and mutations of pathways followed by local optimization also facilitate extensive search of pathway space because those operations generate new pathways by overcoming large energy barriers, which is the major limitation of MD-based approaches. Last, the combined use of the classical action for local minimization and the OM action for selection is computationally relatively efficient, and allowed us to find multiple low OM action pathways without performing the computationally expensive Hessian calculation. Combined, this computational efficiency thus allows investigation of larger/more complicated systems.

Elber et al.¹⁶ optimized the Gauss action using simulated annealing (SA) to find the folding pathway of C-peptide, which is a 16-residue long peptide forming a helical conformation. The sampling efficiency of SA depends on its annealing schedule, the maximum temperature set during the annealing, and the heights of energy barriers. Since SA is based on molecular dynamics, which is history-dependent, the probability to find the global minimum of a system depends on the initial state. Therefore, it may take enormously long time to sample the entire pathway space when the degrees of freedom are large, and/or the energy landscape is highly rugged. In addition to these limitations, SA using the Gauss action requires computationally expensive Hessian calculation of the potential energy. In the work by Fujisaki et al.^30,31, the ensemble of pathways was sampled using the replica exchange molecular dynamics (REMD) and the OM action. Although REMD is known to be superior to SA in terms of its sampling efficiency, it also suffers from similar limitations of SA. Due to these limitations, only relatively simple model systems, Bolhuis’ two-dimensional potential⁶³ and a coarse-grained self-avoiding polymer with one bead type and three interaction terms⁶⁴, were studied.

In conclusion, we demonstrated that efficient global optimization of Onsager-Machlup action reveals multiple transition pathways including the most dominant one successfully. In this work, we introduced a computational method that samples multiple possible pathways and provides information on the relative dominance of them via efficient global optimization of Onsager-Machlup action using the CSA method. The advantages of our method over existing pathway sampling methods are in the fact that its sampling efficiency is independent of the quality of initial guesses on pathways; only the calculation of first derivatives is required; and its sampling ability is not limited by the existence of high energy barriers separating pathways, which is a major limiting factor of previous MD-based pathway sampling methods in exploring the pathway space^{7,8,16,28,30,31,32}. Also, it is identified that the profile of minimum Onsager-Machlup actions found with different transition time parameters provides kinetic information on multiple pathways. In terms of implementation, Action-CSA calculation is massively parallel because the local optimization of each trial pathway is independent of each other. Thus, pathway samplings for larger systems are possible with the help of a large computer cluster system. We anticipate that the Action-CSA method will be used as a first-step exploration for complex reactions and large-scale conformational changes due to its low cost and robust nature. Results from Action-CSA can be used as the starting point for many other methods.

Methods

Classical and Onsager-Machlup actions

Here, we briefly review the theoretical background behind Action-CSA. If a system with N atoms with a potential energy V follows the overdamped Langevin dynamics,

where x is a 3N dimensional mass-weighted coordinate vector, γ is collision frequency, and R is a Gaussian random force, the relative probability of finding a final state x_f at a time t from an initial state x_i via diffusive trajectories x(t′) is determined by using the path integral approach and OM action S_OM[x(t′)]^26,27:

where indicates that the integration runs over all possible pathways x(t′). This relationship suggests that if the S_OM values of all physically accessible pathways are obtained, one can determine the relative populations of multiple pathways. Thus, S_OM is a proper target objective function of global optimization. The generalized OM action of a pathway x(t) is defined^26,27,65,66:

where ΔV=V(x_f)−V(x_i). In the original formula of action derived by Onsager and Machlup, the last term of equation (3) was absent^26,27. It was shown that, for the purposes of reweighting and sampling diffusive pathways, two OM actions with and without the Hessian term are equivalent. However, for the purpose of finding the most probable trajectory motif, the term should be considered because it represents the entropic corrections connected with fluctuations and the neighborhood of a given trajectory motif, which is also represented as a tube around the motif^66,67. Note that the minimization of S_OM using analytic local minimization algorithms requires analytic third derivatives. This makes the direct global optimization of S_OM hard to be applied to detect transition pathways of biomolecules with all-atom force fields due to the complexity of implementation and high computational cost. For numerical calculations based on a chain-of-states representation, the OM action should be discretized. The method uses the second-order discretization of the symmetric OM formula, which uses only gradients for S_OM calculations⁶⁸:

where P+1 is the number of replicas, Δt is a time step between successive replicas, and t=PΔt is the total transition time. This formula is more efficient than the direct implementation of equation (3) since it requires only the first derivatives of V to evaluate S_OM.

Global action optimization

Here, we describe the application of CSA to optimize S_OM. In general, a pathway is represented as a chain of P−1 replicas with N atoms for each replica leading to 3N(P−1) total degrees of freedom. Each replica is represented by a sequence of 3N−6 internal dihedral angles and 6 net translational/rotational degrees of freedom. An Action-CSA calculation starts with a set of random pathways on a pathway space. Subsequently, the actions of the random pathways are locally optimized.

As stated previously, direct minimization of S_OM using analytic gradients is computationally challenging. For a computationally feasible local action optimization, we optimized a pathway using a modified action from ADMD instead of using S_OM. The discretized classical action is defined:

Physically accessible pathways correspond to the stationary points of S_classical. Finding such pathways is a computationally difficult task because S_classical is not bounded; S_classical can be minimized or maximized, and the stationary points of S_classical can be minima, maxima or saddle points. Another practical problem is that the total energies of pathways satisfying the stationary condition δS_classical=0 may not be conserved¹⁸. To find pathways that satisfy the principle of least action and conserve total energies, a modified action with a penalty term restraining total energy was suggested¹⁸:

where E is a targeted total energy of a system, μ_A and μ_E are the weighting parameters of the classical action, and the restraint term for energy conservation. The minimization of Θ[x(t); E] requires only the first derivatives of V.

The set of locally optimized initial random pathways using Θ[x(t);E] is called the first bank. The first bank remains the same throughout the optimization and is used as the reservoir of partially optimized pathways to enhance the diversity of pathway search. A copy of the first bank is generated and called a bank. The pathways in the bank are updated during a calculation while the size of the bank is kept constant. By using the pathways included in the first bank and the bank, new trial pathways are generated by crossover and mutation (random perturbation) operations. For a crossover operation, two pathways, a seed pathway from the bank and a random pathway either from the bank or the first bank, are selected and random parts of two selected pathways are swapped. For a random perturbation, a certain number of degrees of freedom of a seed pathway, up to 5% of total degrees of freedom, are randomly changed. The generated trial pathways are locally optimized by minimizing Θ[x(t);E] to remove any possible artifacts generated by the crossover and the mutation operations. After local optimizations, the bank is updated by comparing the S_OM values of the existing pathways and the new ones instead of Θ[x(t);E].

A key feature of CSA is a sophisticated bank-update procedure that prevents a search being trapped in local minima during the optimization and keeps the diversity of the bank. For a newly obtained configuration, a pathway in this work, α, the pathway separation distances D between α and the existing ones in the bank are calculated. If the distance between α and its closest neighbor is less than a cutoff distance D_cut, only the better configuration in terms of the objective function, S_OM in this work, is selected. If D>D_cut, α is considered a new configuration and it replaces the worst configuration in the bank if it is better. At initial stages of a calculation, D_cut is kept large for wider sampling. As the calculation proceeds, it gradually decreases for a refined search near the global minimum. The bank keeps updating until no better configuration is found. In this work, a distance between two pathways was measured by the Fréchet distance⁶⁹. More details on a general CSA procedure are described elsewhere^{33,34,35,37,39,40}.

Action-CSA simulation

To verify that Action-CSA successfully finds multiple pathways and allows one to determine the rank order of the pathways based on their optimized S_OM values, we applied our method to investigate the conformational transition of alanine dipeptide from C7_eq to C7_ax in the vacuum. Here, we used the polar hydrogen representation in the PARAM19 force field⁵⁵ and the dielectric constant was set to 1.0 (ref. 70). We performed Action-CSA simulations with various transition times, t in equations (4) and (6), ranging from 0.2 to 2.0 ps with an interval of 0.1 ps. The numbers of replicas were adjusted with t to keep the time step between successive replicas Δt=5 fs. All simulations were performed at temperature T=350 K with a collision frequency γ=1.0 ps⁻¹. The reference total energy E in equation (5) was obtained by adding the initial potential energy V(x_i)=−43.3 kcal mol⁻¹ and a kinetic energy of 12.5 kcal mol⁻¹ estimated by 3Nk_BT/2 with the number of atoms N=12. The weighting parameters μ_A and μ_E in equation (5) were set to −1.0 and 1.0, respectively. For comparison purposes, we performed 5,000 independent 100 ns LD simulations of alanine dipeptide under the same condition amounting to 500 μs LD simulations and counted the number of the C7_eq→C7_ax transitions. An Action-CSA calculation requires 10 adjustable parameters, and they are listed in Supplementary Table 1. The parameters for the calculations presented in this study were not extensively optimized. Rigorous optimization of the parameters is out of the scope of this study, and requires a series of subsequent benchmark studies.

Data availability

The Action-CSA code is freely available for academic, government and nonprofit use as a part of the CHARMM molecular dynamics package (http://charmm.chemistry.harvard.edu/). All relevant data are available from the authors upon request.

Additional information

How to cite this article: Lee, J. et al. Finding multiple reaction pathways via global optimization of action. Nat. Commun. 8, 15443 doi: 10.1038/ncomms15443 (2017).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Elber, R. Perspective: computer simulations of long time dynamics. J. Chem. Phys. 144, 060901 (2016).
Article ADS Google Scholar
Dellago, C., Bolhuis, P. G., Csajka, F. S. & Chandler, D. Transition path sampling and the calculation of rate constants. J. Chem. Phys. 108, 1964–1977 (1998).
Article CAS ADS Google Scholar
Bolhuis, P. G., Chandler, D., Dellago, C. & Geissler, P. L. Transition path sampling: throwing ropes over rough mountain passes, in the dark. Annu. Rev. Phys. Chem. 53, 291–318 (2002).
Article CAS ADS Google Scholar
Wales, D. J. Discrete path sampling. Mol. Phys. 100, 3285–3305 (2002).
Article CAS ADS Google Scholar
Bai, D. & Elber, R. Calculation of point-to-point short-time and rare trajectories with boundary value formulation. J. Chem. Theory Comput. 2, 484–494 (2006).
Article CAS Google Scholar
Carr, J. M. & Wales, D. J. Folding pathways and rates for the three-stranded β²-sheet peptide beta3s using discrete path sampling. J. Phys. Chem. B 112, 8760–8769 (2008).
Article CAS Google Scholar
Faccioli, P., Sega, M., Pederiva, F. & Orland, H. Dominant pathways in protein folding. Phys. Rev. Lett. 97, 108101 (2006).
Article CAS ADS Google Scholar
Beccara, S. a., Skrbic, T., Covino, R. & Faccioli, P. Dominant folding pathways of a WW domain. Proc. Natl Acad. Sci. USA 109, 2330–2335 (2012).
Article CAS ADS Google Scholar
Elber, R. Simulations of allosteric transitions. Curr. Opin. Struct. Biol. 21, 167–172 (2011).
Article CAS Google Scholar
Schlitter, J., Engels, M. & Krüger, P. Targeted molecular dynamics: a new approach for searching pathways of conformational transitions. J. Mol. Graph. 12, 84–89 (1994).
Article CAS Google Scholar
Czerminski, R. & Elber, R. Self-avoiding walk between 2 fixed-points as a tool to calculate reaction paths in large molecular-systems. Int. J. Quantum Chem. 186, 167–186 (1990).
Article Google Scholar
Henkelman, G., Uberuaga, B. P. & Jónsson, H. Climbing image nudged elastic band method for finding saddle points and minimum energy paths. J. Chem. Phys. 113, 9901–9904 (2000).
Article CAS ADS Google Scholar
Weinan, E., Ren, W. & Vanden-Eijnden, E. String method for the study of rare events. Phys. Rev. B 66, 052301 (2002).
ADS Google Scholar
Gillilan, R. E. & Wilson, K. R. Shadowing, rare events, and rubber bands - a variational verlet algorithm for molecular-dynamics. J. Chem. Phys. 97, 1757–1772 (1992).
Article CAS ADS Google Scholar
Olender, R. & Elber, R. Calculation of classical trajectories with a very large time step: formalism and numerical examples. J. Chem. Phys. 105, 9299 (1996).
Article CAS ADS Google Scholar
Elber, R., Meller, J. & Olender, R. Stochastic path approach to compute atomically detailed trajectories: application to the folding of C peptide. J. Phys. Chem. B 103, 899–911 (1999).
Article CAS Google Scholar
Elber, R. & Shalloway, D. Temperature dependent reaction coordinates. J. Chem. Phys. 112, 5539 (2000).
Article CAS ADS Google Scholar
Passerone, D. & Parrinello, M. Action-derived molecular dynamics in the study of rare events. Phys. Rev. Lett. 87, 108302 (2001).
Article CAS ADS Google Scholar
Passerone, D., Ceccarelli, M. & Parrinello, M. A concerted variational strategy for investigating rare events. J. Chem. Phys. 118, 2025–2032 (2003).
Article CAS ADS Google Scholar
Lee, I.-H., Lee, J. & Lee, S. Kinetic energy control in action-derived molecular dynamics simulations. Phys. Rev. B 68, 064303 (2003).
Article ADS Google Scholar
Lee, I.-H., Kim, S.-Y. & Lee, J. Dynamic folding pathway models of α-helix and β-hairpin structures. Chem. Phys. Lett. 412, 307–312 (2005).
Article CAS ADS Google Scholar
Lee, I.-H., Kim, S.-Y. & Lee, J. Folding models of mini-protein FSD-1. J. Phys. Chem. B 116, 6916–6922 (2012).
Article CAS Google Scholar
Lee, I.-H., Kim, S.-Y. & Lee, J. Transition pathway and its free-energy profile: a protocol for protein folding simulations. Int. J. Mol. Sci. 14, 16058–16075 (2013).
Article Google Scholar
Crehuet, R. & Field, M. J. Comment on ‘Action-derived molecular dynamics in the study of rare events’. Phys. Rev. Lett. 90, 089801 author reply 089802 (2003).
Article ADS Google Scholar
Goldstein, H., Poole, C. & Safko, J. Classical Mechanics Addison Wesley (2002).
Onsager, L. & Machlup, S. Fluctuations and irreversible processes. Phys. Rev. 91, 1505–1512 (1953).
Article CAS ADS MathSciNet Google Scholar
Machlup, S. & Onsager, L. Fluctuations and irreversible process. II. Phys. Rev. 91, 1512–1515 (1953).
Article CAS ADS Google Scholar
Eastman, P., Grønbech-Jensen, N. & Doniach, S. Simulation of protein folding by reaction path annealing. J. Chem. Phys. 114, 3823–3841 (2001).
Article CAS ADS Google Scholar
Zuckerman, D. M. & Woolf, T. B. Efficient dynamic importance sampling of rare events in one dimension. Phys. Rev. E 63, 016702 (2000).
Article ADS Google Scholar
Fujisaki, H., Shiga, M. & Kidera, A. Onsager-Machlup action-based path sampling and its combination with replica exchange for diffusive and multiple pathways. J. Chem. Phys. 132, 134101 (2010).
Article ADS Google Scholar
Fujisaki, H., Shiga, M., Moritsugu, K. & Kidera, A. Multiscale enhanced path sampling based on the Onsager-Machlup action: application to a model polymer. J. Chem. Phys. 139, 054117 (2013).
Article ADS Google Scholar
Sega, M., Faccioli, P., Pederiva, F., Garberoglio, G. & Orland, H. Quantitative protein dynamics from dominant folding pathways. Phys. Rev. Lett. 99, 118102 (2007).
Article CAS ADS Google Scholar
Lee, J., Scheraga, H. & Rackovsky, S. New optimization method for conformational energy calculations on polypeptides: conformational space annealing. J. Comput. Chem. 18, 1222–1232 (1997).
Article CAS Google Scholar
Lee, J., Liwo, A. & Scheraga, H. Energy-based de novo protein folding by conformational space annealing and an off-lattice united-residue force field: application to the 10-55 fragment of staphylococcal protein A and to apo calbindin D9K. Proc. Natl Acad. Sci. USA 96, 2025–2030 (1999).
Article CAS ADS Google Scholar
Lee, J., Lee, I.-H. & Lee, J. Unbiased global optimization of Lennard-Jones clusters for N≤201 using the conformational space annealing method. Phys. Rev. Lett. 91, 080201 (2003).
Article ADS Google Scholar
Joo, K. et al. High accuracy template based modeling by global optimization Keehyoung. Proteins 69, 83–89 (2007).
Article CAS Google Scholar
Joo, K., Lee, J., Kim, I., Lee, S. J. & Lee, J. Multiple sequence alignment by conformational space annealing. Biophys. J. 95, 4813–4819 (2008).
Article CAS ADS Google Scholar
Lee, J., Joo, K., Kim, S.-Y. & Lee, J. Re-examination of structure optimization of off-lattice protein AB models by conformational space annealing. J. Comput. Chem. 29, 2479–2484 (2008).
Article CAS Google Scholar
Joo, K. et al. All-atom chain-building by optimizing MODELLER energy function using conformational space annealing. Proteins 75, 1010–1023 (2009).
Article CAS Google Scholar
Lee, J. et al. De novo protein structure prediction by dynamic fragment assembly and conformational space annealing. Proteins 79, 2403–2417 (2011).
Article CAS Google Scholar
Lee, J., Gross, S. P. & Lee, J. Modularity optimization by conformational space annealing. Phys. Rev. E 85, 056702 (2012).
Article ADS Google Scholar
Lee, J. & Lee, J. Hidden information revealed by optimal community structure from a protein-complex bipartite network improves protein function prediction. PLoS ONE 8, e60372 (2013).
Article CAS ADS Google Scholar
Lee, J., Gross, S. P. & Lee, J. Improved network community structure improves function prediction. Sci. Rep. 3, 2197 (2013).
Article ADS Google Scholar
Lee, I. H., Oh, Y. J., Kim, S., Lee, J. & Chang, K. J. Ab initio materials design using conformational space annealing and its application to searching for direct band gap silicon crystals. Comput. Phys. Commun. 203, 110–121 (2016).
Article CAS ADS Google Scholar
Lee, I. H., Lee, J., Oh, Y. J., Kim, S. & Chang, K. J. Computational search for direct band gap silicon crystals. Phys. Rev. B 90, 115209 (2014).
Article ADS Google Scholar
Oh, Y. J., Lee, I.-H., Kim, S., Lee, J. & Chang, K. J. Dipole-allowed direct band gap silicon superlattices. Sci. Rep. 5, 18086 (2015).
Article CAS ADS Google Scholar
Oh, Y. J., Kim, S., Lee, I. H., Lee, J. & Chang, K. J. Direct band gap carbon superlattices with efficient optical transition. Phys. Rev. B 93, 085201 (2016).
Article ADS Google Scholar
Brooks, B. R. et al. CHARMM: the biomolecular simulation program. J. Comput. Chem. 30, 1545–1614 (2009).
Article CAS Google Scholar
Joo, K. et al. Protein structure determination by conformational space annealing using NMR geometric restraints. Proteins 83, 2251–2262 (2015).
Article CAS Google Scholar
Jang, S., Shin, S. & Pak, Y. Molecular dynamics study of peptides in implicit water: ab initio folding of beta-hairpin, beta-sheet, and beta beta alpha-motif. J. Am. Chem. Soc. 124, 4976–4977 (2002).
Article CAS Google Scholar
Lei, H., Dastidar, S. G. & Duan, Y. Folding transition-state and denatured-state ensembles of FSD-1 from folding and unfolding simulations. J. Phys. Chem. B 110, 22001–22008 (2006).
Article CAS Google Scholar
Wu, C. & Shea, J. E. On the origins of the weak folding cooperativity of a designed bba Ultrafast Protein FSD-1. PLoS Comput. Biol. 6, e1000998 (2010).
Article Google Scholar
Feng, J. A., Kao, J. & Marshall, G. R. A second look at mini-protein stability: Analysis of FSD-1 using circular dichroism, differential scanning calorimetry, and simulations. Biophys. J. 97, 2803–2810 (2009).
Article CAS ADS Google Scholar
Sadqi, M., de Alba, E., Pérez-Jiménez, R., Sanchez-Ruiz, J. M. & Muñoz, V. A designed protein as experimental model of primordial folding. Proc. Natl Acad. Sci. USA 106, 4127–4132 (2009).
Article CAS ADS Google Scholar
Neria, E., Fischer, S. & Karplus, M. Simulation of activation free energies in molecular systems. J. Chem. Phys. 105, 1902–1921 (1996).
Article CAS ADS Google Scholar
Haberthür, U. & Caflisch, A. FACTS: fast analytical continuum treatment of solvation. J. Comput. Chem. 29, 701–715 (2008).
Article Google Scholar
Meuzelaar, H., Panman, M. R., van Dijk, C. N. & Woutersen, S. Folding of a zinc-finger ββα-motif investigated using two-dimensional and time-resolved vibrational spectroscopy. J. Phys. Chem. B 120, 11151–11158 (2016).
Article CAS Google Scholar
Lanczos, C. The Variational Principles of Mechanics Dover Publications (1970).
Elber, R. In Computer Simulations in Condensed Matter Systems: from Materials to Chemical Biology Vol. 1 (eds Ferrario, M., Ciccotti, G. & Binder, K.) 435–451 (Springer, 2006).
Huber, G. A. & Kim, S. Weighted-ensemble Brownian dynamics simulations for protein association reactions. Biophys. J. 70, 97–110 (1996).
Article CAS ADS Google Scholar
Faradjian, A. K. & Elber, R. Computing time scales from reaction coordinates by milestoning. J. Chem. Phys. 120, 10880 (2004).
Article CAS ADS Google Scholar
Allen, R. J., Warren, P. B. & Ten Wolde, P. R. Sampling rare switching events in biochemical networks. Phys. Rev. Lett. 94, 018104 (2005).
Article ADS Google Scholar
Bolhuis, P. G. Rare events via multiple reaction channels sampled by path replica exchange. J. Chem. Phys. 129, 114108 (2008).
Article ADS Google Scholar
Toan, N. M., Marenduzzo, D., Cook, P. R. & Micheletti, C. Depletion effects and loop formation in self-avoiding polymers. Phys. Rev. Lett. 97, 178302 (2006).
Article CAS ADS Google Scholar
Hunt, K. L. C. & Ross, J. Path integral solutions of stochastic equations for nonlinear irreversible processes: The uniqueness of the thermodynamic Lagrangian. J. Chem. Phys. 75, 976 (1981).
Article CAS ADS MathSciNet Google Scholar
Adib, A. B. Stochastic actions for diffusive dynamics: reweighting, sampling, and minimization. J. Phys. Chem. B 112, 5910–5916 (2008).
Article CAS Google Scholar
Haas, K. R., Yang, H. & Chu, J. W. Trajectory entropy of continuous stochastic processes at equilibrium. J. Phys. Chem. Lett. 5, 999–1003 (2014).
Article CAS Google Scholar
Miller, T. F. & Predescu, C. Sampling diffusive transition paths. J. Chem. Phys. 126, 144102 (2007).
Article ADS Google Scholar
Alt, H. & Godau, M. Computing the fréchet distance between two polygonal curves. Int. J. Comput. Geom. Appl. 05, 75–91 (1995).
Article Google Scholar
Loncharich, R. J., Brooks, B. R. & Pastor, R. W. Langevin dynamics of peptides: the frictional dependence of isomerization rates of N-acetylalanyl-N′-methylamide. Biopolymers 32, 523–535 (1992).
Article CAS Google Scholar

Download references

Acknowledgements

We acknowledge helpful discussions with Attila Szabo and Richard Pastor. The authors wish to acknowledge Steven Gross for his critical reading of the manuscript. Juyong Lee and B.R.B. were supported by the Intramural Research Program of the NIH, NHLBI under Project No. Z01 HL001051-20. I.-H.L., I.J. and Jooyoung Lee were supported by the National Research Foundation of Korea (NRF) under Grant No. 2008-0061987 funded by the Korea government (MEST). I.-H.L. was also supported by Samsung Science and Technology Foundation under Grant No. SSTF-BA1401-08. Computational resources and services used in this work were provided by the LoBoS cluster of the National Institutes of Health.

Author information

Authors and Affiliations

Laboratory of Computational Biology, National Heart, Lung, and Blood Institute (NHLBI), National Institutes of Health (NIH), Bethesda, 20892, Maryland, USA
Juyong Lee & Bernard R. Brooks
Center for Materials Genome, Korea Research Institute of Standards and Science, Daejeon, 34113, Republic of Korea
In-Ho Lee
Center for In Silico Protein Science, School of Computational Science, Korea Institute for Advanced Study, Seoul, 02455, Republic of Korea
In-Ho Lee, InSuk Joung & Jooyoung Lee
School of Computational Sciences, Korea Institute for Advanced Study, Seoul, 02455, Republic of Korea
InSuk Joung & Jooyoung Lee

Authors

Juyong Lee
View author publications
You can also search for this author in PubMed Google Scholar
In-Ho Lee
View author publications
You can also search for this author in PubMed Google Scholar
InSuk Joung
View author publications
You can also search for this author in PubMed Google Scholar
Jooyoung Lee
View author publications
You can also search for this author in PubMed Google Scholar
Bernard R. Brooks
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Juyong Lee conceived and designed the study, designed and implemented the Action-CSA algorithm, performed the simulations, analysed the results and wrote the paper. I.-H.L. contributed to the implementation of the Action-CSA algorithm. I.J. contributed to the design of the Action-CSA algorithm. Jooyoung Lee supervised the study, contributed to the design of the Action-CSA algorithm, analysed the results and wrote the paper. B.R.B. supervised the study, contributed to the design of the Action-CSA algorithm, analysed the results and wrote the paper. All authors were involved in manuscript editing.

Corresponding authors

Correspondence to Juyong Lee, Jooyoung Lee or Bernard R. Brooks.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Tables and Supplementary Figure (PDF 72 kb)

Peer Review File (PDF 309 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Lee, J., Lee, IH., Joung, I. et al. Finding multiple reaction pathways via global optimization of action. Nat Commun 8, 15443 (2017). https://doi.org/10.1038/ncomms15443

Download citation

Received: 09 December 2016
Accepted: 24 March 2017
Published: 26 May 2017
DOI: https://doi.org/10.1038/ncomms15443

This article is cited by

MolFinder: an evolutionary algorithm for the global optimization of molecular properties and the extensive exploration of chemical space using SMILES
- Yongbeom Kwon
- Juyong Lee
Journal of Cheminformatics (2021)
Global optimization of ~ 1 nm MoS2 and CaCO3 nanoparticles
- Jiří Hostaš
- Alain Tchagang
- Dennis R. Salahub
Theoretical Chemistry Accounts (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.