Abstract
Intra-molecular energy transport between distant functional sites plays important roles in allosterically regulating the biochemical activity of proteins. How to identify the specific intra-molecular signaling pathway from protein tertiary structure remains a challenging problem. In the present work, a non-equilibrium dynamics method based on the elastic network model (ENM) was proposed to simulate the energy propagation process and identify the specific signaling pathways within proteins. In this method, a given residue was perturbed and the propagation of energy was simulated by non-equilibrium dynamics in the normal modes space of ENM. After that, the simulation results were transformed from the normal modes space to the Cartesian coordinate space to identify the intra-protein energy transduction pathways. The proposed method was applied to myosin and the third PDZ domain (PDZ3) of PSD-95 as case studies. For myosin, two signaling pathways were identified, which mediate the energy transductions form the nucleotide binding site to the 50 kDa cleft and the converter subdomain, respectively. For PDZ3, one specific signaling pathway was identified, through which the intra-protein energy was transduced from ligand binding site to the distant opposite side of the protein. It is also found that comparing with the commonly used cross-correlation analysis method, the proposed method can identify the anisotropic energy transduction pathways more effectively.
Similar content being viewed by others
Introduction
Energy transport through specific pathways in proteins enables the communication between different parts of the molecule, which is thought to play important roles in many functional processes of proteins. Especially, the signaling between the distant allosteric and active sites may facilitate the long-range allosteric regulation of protein functions1,2,3,4,5. However, how to effectively identify the energy transport pathway between remote sites in protein tertiary structure remains a challenging problem in the research field of protein structure-function relationship.
Molecular dynamics (MD) simulation is an effective method to investigate the dynamical properties of proteins in atomistic detail6,7, which has been used to explore the signaling pathway in protein structures8,9. Ghosh and Vishveshwara performed MD simulations for methionyl-tRNA synthetase and the cross-correlation of motions between different residues was evaluated, which were then combined with the protein structure network analysis to elucidate the communication pathways between distant functional sites in the protein10. Kong and Karplus proposed a method, named as interaction correlation analysis, to identify the signaling pathways of proteins, in which the interaction energy correlations between all residue pairs were calculated based on the equilibrium MD simulations11. Ishikura and Yamato developed a theoretical method based on the long-time equilibrium MD simulations to analyze the pathways of inter-residue energy flow between different regions in proteins. In this method, the energy conductivity was evaluated by the time-correlation function of the inter-residue energy flux, which was numerically computed from the long MD simulation trajectories12. However, strictly speaking, the process of intra-protein energy transport is a non-equilibrium behavior, and therefore many studies utilized non-equilibrium MD simulations to investigate the transfer of energy in proteins. Ota and Agard proposed a non-equilibrium MD simulation method, called as anisotropic thermal diffusion, to detect the intramolecular signaling pathways. In their method, the probed residues or atoms are coupled to a higher temperature, serving as the energy source, whereas the temperature couplings for other atoms in the protein are turned off. Then, non-equilibrium MD simulations were performed and the signaling pathways were observed by detecting the propagation of the root mean square deviation for the residues in the protein13. Sharp and Skinner developed a pump-probe MD simulation method, in which the selected residues or atoms were excited with a set of oscillating forces and the energy transduction pathways were revealed by using Fourier transform of the atomic motions in the protein14. Stock group developed several methods of non-equilibrium MD simulations to study energy transport in molecules, where the selected atoms were perturbed from the ground-state to the excited-state to mimic the ultraviolet excitation and infrared vibration pumping experiments, or the selected atoms were heated by higher temperature to generate an energy flow in molecules. Then, based on the non-equilibrium MD simulation trajectories, a non-equilibrium correlation function was used to evaluate the cumulative response of the residues to the energy source4,15,16,17,18. However, for the equilibrium and non-equilibrium MD simulations, the signals are usually submerged in the noise, and some subtle analysis methods are required to achieve a better signal-to-noise ratio, such as simulation at the super-low temperature, comparison with the control simulations, and so on.
Besides the MD simulation, another commonly used method to investigate the dynamical properties of proteins is the normal mode analysis of the elastic network model (ENM)19,20,21,22. ENM describes the protein structure as an elastic network, in which the interactions between residues are simplified as springs. Because the inter-residue interactions are harmonic, the partition function and the thermodynamical properties of the protein can be analytically calculated. Many studies have shown that such a simple model is sufficient to account for many experimental data, such as the X-ray crystallographic temperature factors, the order parameters derived from NMR relaxation experiments, and the protection factors for H/D exchange experiments23,24. ENM can also reproduce MD simulation data but does not require the high computational cost of MD simulation23,24,25. ENM has been proved in many applications to be a simple yet effective method for investigating large-scale conformational transitions, allosteric motions, equilibrium fluctuation of residues, decomposition of domains for proteins, as well as identification of key residues and refinement of low-resolution structural data19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34. However, at present, most of the application studies for ENM are focus on exploring the dynamical properties of the equilibrium state. In the present work, a non-equilibrium dynamics method was proposed based on ENM to simulate the energy transduction, which is a non-equilibrium process, in proteins.
In our method, the selected residue was perturbed by assigning a displacement from its equilibrium position, which serves as the energy source. After that, the propagation of the perturbation was simulated by non-equilibrium dynamics in the normal modes space of ENM. The simulation results were then transformed from the normal modes space to the Cartesian coordinate space to monitor the intra-protein energy transduction process. The functionally key residues involved in energy propagation were identified as those dynamically coupled with the perturbed residue, which form the signaling pathway in the protein. In the present work, the proposed method was applied to myosin and the third PDZ domain of PSD-95 as case studies. Myosin is an important allosteric protein that transforms chemical energy provided by the hydrolysis ATP into force and movement along actin35,36. In this study, a residue in the ATP binding site was perturbed and the specific signaling pathway responsible for the allosteric regulation of the protein functions was identified. PSD-95, as a member of the PDZ domain protein family, plays important roles in regulating protein-protein interactions associated with synapses of the central nervous system37. It has been found that the residue His372 is responsible for the specificity of ligand binding11,13,14,38. In this work, the residue His372 was perturbed and then the signaling pathway from this residue to the remote functional sites was revealed by our proposed method. Our prediction results are consistent with the available experimental and simulation data.
Methods
Gaussian network model
Gaussian network model (GNM)19,21,22,23,24, one type of the ENM, describes the protein tertiary structure as an elastic network, in which each residue is simplified as a node represented by its Cα atom and the interactions between residues are simplified as springs. If the distance between two residues is less than the cutoff value (7.0 Å is adopted in this work), these two residues are considered to have interactions between them, and thus they are connected by a spring. Otherwise, there is no spring between them. In this model, the force constant is identical for all the springs. Considering all the harmonic interactions between residues, the potential energy of the system can be written as
where γ is the force constant of the springs, ΔR ij is the displacement of the distance between the ith and jth residues, {ΔR} is the N-dimensional column vector of the fluctuations of all the residues, N is the number of residues in the protein, the superscript T denotes the transpose of the column vector, and Γ is the N × N dimensional Kirchhoff matrix whose elements are expressed as
where R ij is the distance between the ith and jth residues, and r c is the cutoff distance whose value is set to be 7.0 Å.
The normal modes of the motion for the system can be calculated by decomposing the Kirchhoff matrix Γ as
where U is an orthonormal matrix whose columns are the eigenvectors of the Kirchhoff matrix Γ, and Λ is a diagonal matrix of the eigenvalues Λ ii (1 ≤ i ≤ N) of the matrix Γ. The value of the first eigenvalue is zero, and correspondingly the first eigenvector represents the overall translational motion of the system. In the present work, only the intra-protein motions are meaningful, therefore, the first eigenvector with zero eigenvalue is removed from the calculation.
According to the theory of statistical physics, the mean-square fluctuation of the ith residue is given by
The cross-correlation fluctuations between the ith and jth residues can be calculated by
The normalized cross-correlation between the ith and jth residues is given by
The GNM-based non-equilibrium dynamics method for monitoring the intra-protein energy transport pathway
According to Newton’s law, the dynamical equation of the system can be written as
where M is a diagonal matrix whose diagonal elements are the mass of each residue, R is the position vector of the residues in the protein, and ∇ is the gradient operator. In ENM, the mass is set to be identical for all the residues, which is represented as m, then the dynamical eq. (7) can be further expressed as
Then, the solution of the dynamical eq. (8) is
where U k is the kth eigenvector of the matrix Γ, \({\omega }_{k}=\sqrt{\frac{\gamma }{m}{{\rm{\Lambda }}}_{kk}}\) represent the angular frequency of the kth normal mode, and A k and φ k are respectively the amplitude and initial phase of the kth normal mode, which are determined by the initial condition of the system.
When the selected residue, supposed as the mth residue, was perturbed as the energy source, a displacement from the equilibrium position is assigned to this residue. The amplitude of the displacement is set to 1. Then, the initial column vector of the residue fluctuations at the time t = 0 can be written as
where the mth element of the fluctuation vector ΔR is 1, and all the other elements are 0. In eq. (9), it is assumed that all the φ k = 0, then the initial fluctuation vector can be projected onto all the normal modes as
The initial amplitude for each normal mode can be determined as
The time-evolution of each normal mode is \({U}_{k}{A}_{k}\,\cos \,{\omega }_{k}t={U}_{k}{U}_{mk}\,\cos \,{\omega }_{k}t\). Considering all the normal modes, the residue fluctuations as a function of time can be calculated by
It should be noted that here \({\rm{\Delta }}R(t)={({\rm{\Delta }}{R}_{1}(t),{\rm{\Delta }}{R}_{2}(t),\cdots ,{\rm{\Delta }}{R}_{N}(t))}^{T}\), which is the N-dimensional column vector of the fluctuations of all the residues. In this work, the energy transport pathway was investigated by monitoring the time-evolution residue fluctuations, and the residues with relative large fluctuation are identified as the key residues involved in energy transport.
Data availability
All data generated or analysed during this study are included in this published article, and the source code of our method is available from the corresponding author upon request.
Results and Discussion
The signaling pathways in myosin revealed by the GNM-based non-equilibrium dynamics method
Myosin is an ATP-driven motor protein that transforms the chemical energy of ATP into mechanical force and movement along actin. Myosin plays essential roles in a wide range of biological processes, such as muscle contraction, cell division, shape changing of the cell, gating of the force-activated ion channel, transporting of the vesicles, structural changing of neurons, and so on35,36,39,40. The structure of myosin is composed four subdomains, i.e., the N-terminal subdomain, the upper (U50) and lower (L50) 50 kDa subdomains, and the converter subdomain (also called the lever arm), as shown in Fig. 1. X-ray crystallographic and biochemical experiments for different conformations of myosin have indicated that there exist allosteric communications linking the nucleotide binding site, the actin binding cleft, and the movements of the lever arm35,36,40,41,42,43. The crystal structures of different myosin states have shown that without the binding of ATP, myosin binds strongly to actin with a closed 50 kDa cleft between the catalytic site and the actin binding site, and the lever arm is in the down position. The binding of ATP to the catalytic site results in the opening of 50 kDa cleft and the release of myosin from actin. Then, the hydrolysis of ATP induces the lever arm moving from its down position to up position42,43,44,45. The binding and hydrolysis of ATP is allosteric coupled with the open-closed conformational transition of 50 kDa cleft and the movement of the lever arm. In the present work, the proposed GNM-based non-equilibrium dynamics method was used to investigate how the signaling propagates from the nucleotide binding site to the distant regions in myosin. The structure of the near-rigor state of the Dictyostelium discoideum myosin (PDB code: 1MMA) was used in our calculation.
The residue Asn127 in the ATP binding site, which is directly involved in the nucleotide binding, was chosen as the perturbation site. According to our proposed method introduced in the “Methods” section, an initial displacement was assigned to this residue as the energy source, and then the energy propagation process was investigated by the ENM-based non-equilibrium dynamics calculation. The time-evolution of the amplitude of residue fluctuations for the system in response to the perturbation is shown in Fig. 2A. It is found that there are several clusters of residues that have relative large amplitude of fluctuations during energy propagation, which implies that these residues are involved in the signaling transduction. Figure 2B displays the residues with squared fluctuation larger than 0.01, along with the time when these residues are excited. From Fig. 2A,B, it is found that there are 56 energy-excited residues that distinctly involved in the intra-protein energy transduction. In order to display the results more intuitively, these 56 key residues involved in energy transduction are mapped onto the tertiary structure of myosin, which are exhibited in yellow color, as shown in Fig. 3. It is found that these excited residues are mainly located on the β strands and the associated connecting loops in the transducer subdomain. Considering that the functional sites, i.e. the 50 kDa cleft and the converter, are located at the top and bottom of the transducer, the energy propagation between β strands and between different loops (i.e. along the direction perpendicular to the β strands of the transducer) is thought to be responsible for the transduction of the functional signals. Therefore, in the present study, the excited residues located on the same β strand or loop are grouped into a residue cluster, and we only focused on the signaling pathway between different clusters. According to this definition, these 56 excited residues can be grouped into 11 clusters centered at residues Met91 (including Asp90 and Met91), Arg109 (including Asn105, Arg109 and Tyr110), Ile115 (including Ile115-Ser119), Val124 (including Leu121-Arg131), Gly179 (including Ser174-Leu176 and Thr178-Lys185), Asn188 (including Asn188, Thr189 and Ile193), Ser236 (including Asn233 and Ser236), Gly457(including Asp454-Gly457), Phe652 (including His651-Val653 and Cys655-Asn659), Lys666 (including Leu663-Lys666), and Gln675 (including Val671, Val672 and Gln675). These clusters are marked by the numbers 1–11 in Fig. 2A,B, and the central residues for the residue clusters are marked by red balls in Fig. 3. The residues of clusters 1–11 are respectively located on the loop at the interface between the N-terminal subdomain and the lever arm (cluster 1), helix 5 (cluster 2), β1 (cluster 3), β2 (cluster 4), β4 and the connecting P-loop (cluster 5), helix 9 connected to the perturbation residue (cluster 6), β6 and the connecting switch I (cluster 7), β5 and the connecting switch II (cluster 8), β3 (cluster 9), the loop connecting to SH1 helix (cluster 10), and SH1 helix in the converter subdomain (cluster 11), as shown in the enlarged sub-graph in Fig. 3. β1–β6, as well as switch I and II, belong to the so-called transducer of myosin, in which β5 and β6 are located at the 50 kDa cleft. It is found that many of these energy-excited residues are distant from the energy source Asn127 whereas many residues nearer the energy source are not excited, which implies that the energy can be allosterically transduced to the remote functional site through specific pathways. Our calculation results show that the transducer, the SH1 helix, and the bottom of the 50 kDa cleft are involved in the transfer of the energy from ATP binding site, which form the energy transduction pathway. Experimental studies have also indicated that the transducer region is crucial for the chemo-mechanical transduction, where the binding and hydrolysis of ATP results in the mechanical distortion of the transducer and then the conformational change of the transducer induces the opening of 50 kDa cleft and the swing of the lever arm35,42,46,47. There exist distinct allosteric couplings among the nucleotide binding site, the 50 kDa cleft and the converter subdomain. Our calculation results are well consistent with the experimental observations.
In order to investigate how the perturbation energy propagates among these clusters of residues, the sequence of energy excitation of these residue clusters was obtained from Fig. 2B. For each residue cluster, the time when any residue in the cluster was firstly excited was considered to be the excitation time of that cluster. As shown in Fig. 2B, the excitation times for the 11 residue clusters are 13.420 (cluster 1), 7.327 (cluster 2), 0.591 (cluster 3), 0.001 (cluster 4), 0.590 (cluster 5), 7.931 (cluster 6), 8.543 (cluster 7), 3.052 (cluster 8), 0.553 (cluster 9), 3.385 (cluster 10) and 6.150 (cluster 11), respectively. The unit of time is \(\sqrt{\frac{m}{\gamma }}\). According to the chronological order of the excitation of these residue clusters, two pathways of energy transduction can be identified, as indicated by arrows in Fig. 3. One pathway is Asn127 → β2 (cluster 4) → β3 (cluster 9) → β4 and the connecting P-loop (cluster 5) → β5 and the connecting switch II (cluster 8) → β6 and the connecting switch I (cluster 7), as denoted by red arrows in Fig. 3. Along this pathway, the energy was transferred from the ATP binding site to the distant 50 kDa cleft, which may contribute to the regulation of the actin binding and dissociation. The other signaling pathway is Asn127 → β1 (cluster 3) → the loop connecting to SH1 helix (cluster 10) → SH1 helix in the converter subdomain (cluster 11) → helix 5 (cluster 2) → the loop at the interface between the N-terminal subdomain and the lever arm (cluster 1), as indicated by blue arrows in Fig. 3. Through this pathway the perturbation energy at the ATP binding site is transduced to the converter subdomain, which may facilitate the produce of force and movement of the lever arm. Many experimental studies have indicated that the transducer region in myosin mediates the allosteric couplings among the nucleotide binding site, the 50 kDa cleft and the converter subdomain35,36,40,41,42,43,46,47. However, the detailed signaling pathways among these sites are still largely unknown. Our study provided the signaling pathways and the associated key residues for the allosteric communications between the functional sites of myosin. It should be noted that the excitation time for the residues within the cluster span a broad range, as shown in Fig. 2B, which implies that it takes a relative long time for the energy transduction within residue cluster. Here the residue clusters mainly corresponds to the secondary structures in the protein. Our calculation results indicate that the signaling transduction within the secondary structure is usually slower than the signaling between the secondary structures.
It should be mentioned that in our study the initial phase of each normal mode is set to zero, which means that the initial velocity of each normal mode is zero and thus the initial velocity of the perturbed residue is zero. We also investigated whether the initial velocity of the perturbed residue has influences on the energy transduction pathway. Therefore, both an initial displacement and an initial velocity are assigned to the perturbed residue. It should be noted that the initial velocity will result in non-zero initial phases of the normal modes. The detailed theoretical derivation of our method for non-zero initial velocity is given in the Supplementary material. The calculation results are displayed in Supplemental Fig. 1 and Supplemental Table 1 in the Supplementary material, which were compared with the case of zero initial velocity. It is found that the initial velocity has little influences on the identification of the excited residues as well as the excitation times of these key residues. The same energy transduction pathways were obtained both for the zero and non-zero initial velocity of the perturbed residue. (see the detailed discussions provided in the Supplementary material).
The signaling pathways in the third PDZ domain of PSD-95 revealed by the GNM-based non-equilibrium dynamics method
The third PDZ domain (PDZ3) of PSD-95 (PDB code: 1BE9) plays an important role in mediating the signaling of the central nervous system, which can bind to the C-terminus of ion channels and form signaling complexes at cellular membranes37,48,49. The PDZ3 domain of PSD-95 has 97 residues, which is composed of six β strands and three α helices, as shown in Fig. 4. Comparing with the canonical PDZ domain that has six β strands and two α helices, α3 in the C-terminus is unique to PDZ3. Several theoretical and experimental studies have indicated that the allosteric signaling can be transferred in the PDZ3 domain from the ligand binding site to the remote region through specific residue-residue interactions13,14,50. In the ligand binding site, the residue His372 has proved by experiments to be responsible for the specificity of ligand binding, which directly interacts with the ligand through hydrogen bonds11,13,14,37,38. In this study, the residue His372 was perturbed as the energy source, and then the transduction pathway of the energy was predicted by using the proposed ENM-based non-equilibrium dynamics method.
The time-evolution of the amplitude of residue fluctuations for the system in response to the perturbation of His372 is shown in Fig. 5A. The residues with fluctuation amplitude larger than 0.015, along with the time when these residues are excited, are displayed in Fig. 5B. It is found that there are 32 energy-excited residues that involved in intra-protein energy transduction. Figure 6 displays these 32 residues mapped onto the tertiary structure of PDZ3 domain, in which the locations of these energy-excited residues are shown in yellow color. The residues located on the same β strand, loop or α helix are grouped into one cluster, and we only focused on the energy transduction pathway between different clusters. According to their location on the protein structure, these 32 excited residues are grouped into 5 clusters, i.e., Phe325, Ile328, Gly329, Gly330, Glu331 (cluster 1), Gly333, Glu334, Gly335, Ile336, Ser339, Ile341, Leu342 (cluster 2), Gln358, Ile359, Leu360, Ser361 (cluster 3), Asp366, Leu367, Arg368, Asn369 (cluster 4), and Ala370, Ser371, His372, Glu373, Gln374, Ala375, Ala376, Ile377, Ala378, Lys380 Asn381, Gly383 (cluster 5). The cluster number is marked in Fig. 5A,B, and the central residues of these clusters are highlighted by red balls and labeled by the cluster numbers in Fig. 6. These residue clusters are located at the βB (cluster 1), the βC (cluster 2), the βD (cluster 3), the βE and the connecting loop between βE and α2 (cluster 4), α2 along with the loop between α2 and βF (cluster 5). The sequence of energy excitation of these residue clusters was obtained from Fig. 5B to investigate how the perturbation energy propagates among these clusters of residues. The excitation times for the 5 residue clusters are 0.667 (cluster 1), 2.719 (cluster 2), 13.056 (cluster 3), 2.380 (cluster 4) and 0.001 (cluster 5), respectively, as marked by red circles in Fig. 6. The unit of time is \(\sqrt{\frac{m}{\gamma }}\). According to the chronological sequence of residue excitations, the signaling pathway can be identified as His372 → βB → the βE and the connecting loop between βE and α2 → βC → βD, as indicated by red arrows in Fig. 6. Through this pathway, the intra-protein energy was transduced from ligand binding site to the opposite side of the protein. Several computational13,14,50 and experimental studies50,51,52,53,54 have showed that the dynamic coupling of the residues between different β strands in the core of PDZ domain plays critical roles for the transduction of allosteric signals. Our predicted signaling pathway is consistent with the results obtained from the evolutionary analysis of sequences, MD simulations, and the mutation experiments.
The anisotropy of the intra-protein signaling propagation
Our calculation results discussed in the above section show that both for myosin and PDZ3 domain many residues distant from the energy source were excited, whereas the residues nearer to the energy source may not be excited. The result indicates that the energy transduction is anisotropic. In order to more intuitively illustrate the anisotropy of the energy propagation in myosin, the amplitude of residue fluctuation versus the shortest path length of that residue to the energy source is shown in the left sub-graph of Fig. 7A. The shortest path length for each residue to the energy source was calculated by the Dijkstra algorithm55. If the energy propagation is isotropic, the residues with the same shortest path length should have similar fluctuation amplitude and the fluctuation amplitude should decrease along with the increase of the shortest path length. However, the left sub-graph in Fig. 7A shows that the residues with the same shortest path length span a wide range of fluctuation amplitude, and the anti-correlation between the residue fluctuation amplitude and its shortest path length is not pronounced. This result implies that the intra-protein energy propagation is distinctly anisotropic in myosin. The upper sub-graph in Fig. 7B displays the fluctuation amplitude of the residues as a function of their distance to the energy source for myosin. If the energy diffusion is isotropic, an inverse relationship between the fluctuation amplitude of the residues and their distance to the energy source should be observed, in which the residue fluctuation decreases with the increase of the distance. However, the inverse relationship is not appeared in the upper sub-graph of Fig. 7B. These above results indicate that the intra-protein energy propagation is anisotropy and there exists specific signaling pathways between different regions in myosin, which may facilitate the allosteric regulation of protein functions. The similar calculation results were obtained for the PDZ3 domain (data not shown).
Comparison with the results obtained by the normalized cross-correlation analysis of the equilibrium dynamics
The cross-correlation analysis of the residue fluctuations in the equilibrium state is a commonly used method to investigate the dynamical couplings between different regions in protein tertiary structure11,56,57,58. In this work, the ENM-based normalized cross-correlation analysis method was also applied for myosin to illustrate the difference between this method and our proposed method. According to eq. (6) discussed in the “Methods” section, the normalized cross-correlation between each residue and the residue Asn127 in myosin was calculated. The first 10% residues that are mostly dynamically coupled with Asn127 were mapped on to the tertiary structure of myosin, as shown by yellow color in Fig. 8. Comparing Fig. 8 with Fig. 3, it is found that the residues identified by the cross-correlation analysis method distribute more isotropically around Asn127, which indicates that this method does not work well in detecting the anisotropic signaling pathway. In order to illustrate the isotropic property of the cross-correlation analysis method, the normalized cross-correlation between each residue and Asn127 versus its shortest path length to Asn127 is displayed in the right sub-graph of Fig. 7A. The normalized cross-correlation between each residue and Asn127 versus its distance to Asn127 is displayed in the lower sub-graph of Fig. 7B. This figure exhibits that the value of cross-correlation decreases almost linearly as the distance increases, implying that the cross-correlation values distribute isotropically. Comparing the sub-graphs in Fig. 7A,B, it is found that the key residues identified by our method distribute more anisotropically than the cross-correlation analysis method. These results imply that our method is more effective than the cross-correlation analysis method in revealing the anisotropic signaling pathway. For example, our method revealed that the perturbation energy in Asn127 transfers through β3 and β4 to the remote regions in β5 and β6, however β5 and β6 cannot be identified by the cross-correlation analysis method, as shown in Figs 3 and 8. The similar results were obtained for the PDZ3 domain of PSD-95 (data not shown). From a physical standpoint, our method is a non-equilibrium dynamics method theoretically applicable to study the non-equilibrium process of intra-protein energy transduction. Whereas, the cross-correlation between residue fluctuations only reflects the equilibrium dynamical properties, which may not suitable for detecting the signaling in the non-equilibrium process.
Conclusion
Intra-molecular signaling between distant functional sites plays important roles in many functional processes of proteins. In the present work, a non-equilibrium dynamics method based on ENM was proposed to simulate the energy propagation process and identify the specific signaling pathways within proteins. In this method, a given residue was perturbed as the energy source, and then the propagation of the perturbation energy was calculated by non-equilibrium dynamics in the normal modes space of ENM. Then, the calculation results were transformed from the normal modes space to the Cartesian coordinate space to reveal the intra-protein energy transduction pathways.
Using the proposed method, the long-range signaling pathways between distant functional sites in myosin and PDZ3 domain of PSD-95 were successfully identified. For myosin, two signaling pathways were obtained: one is Asn127 → β2 → β3 → β4 → β5 → β6, and the other is Asn127 → β1 → the loop connecting to SH1 helix → SH1 helix → helix 5 → the loop at the interface between the N-terminal subdomain and the lever arm. The first pathway connects the nucleotide binding site to the distant 50 kDa cleft, which may contribute to the allosteric coupling between the binding of ATP the dissociation of actin. The second pathway mediates the energy transduction from the ATP binding site to the converter subdomain, which may facilitate the produce of force and movement of the lever arm. For the PDZ3 domain of PSD-95, the specific signaling pathway was identified as His372 → βB → the βE and the connecting loop between βE and α2 → βC → βD, through which the intra-protein energy was transduced from ligand binding site to the distant opposite side of the protein.
The calculation results of our method for myosin were compared with those obtained by the cross-correlation analysis method. It is found that the residues dynamically coupled with Asn127 identified by the cross-correlation analysis method are isotropically distributed around Asn127. The cross-correlation analysis method can reveal the equilibrium dynamical properties of proteins, but it is failed to identify the anisotropic pathways for the non-equilibrium energy transduction. In contrast, our method as a non-equilibrium dynamics method can effectively identify the long-range signaling pathways, and the pathways are highly anisotropic. Our method is generally applicable for different protein systems, and this study provides a starting point for the investigations of other proteins.
It should be mentioned that compared with the conventional MD simulation method, our proposed GNM-based non-equilibrium dynamics method is rigorous in theoretical deduction and simple in calculation. In the conventional MD method, the motion of atoms in protein system is simulated by numerically integrating Newton’s motion equations, and the dynamic properties of the system are calculated based on a large number of samplings in the conformational space. MD simulation method is time-consuming and easy to trap in local optimum, especially for larger protein systems. Besides that, in MD simulations the signals may be submerged in the noise and some subtle analysis methods usually need to be well-designed to improve the signal-to-noise ratio. In our proposed GNM-based method, the dynamic properties of protein systems can be calculated analytically, without long-time numerically simulations. Therefore, our method is computationally efficient and can be easily applied to large-size protein systems.
References
Leitner, D. & Straub, J. Proteins: energy, heat and signal flow (Taylor & Francis, 2009).
Smock, R. G. & Gierasch, L. M. Sending signals dynamically. Science 324, 198–203 (2009).
Li, G., Magana, D. & Dyer, R. B. Anisotropic energy flow and allosteric ligand binding in albumin. Nat. Commun. 5, 3100 (2014).
Buchenberg, S., Leitner, D. M. & Stock, G. Scaling rules for vibrational energy transport in globular proteins. J. Phys. Chem. Lett. 7, 25–30 (2016).
Weber, J. K., Shukla, D. & Pande, V. S. Heat dissipation guides activation in signaling proteins. Proc. Natl. Acad. Sci. USA 112, 10377–10382 (2015).
Van Gunsteren, W. F., Weiner, P. K. & Wilkinson, A. J. Computer simulation of biomolecular systems: Theoretical and experimental applications (Springer, 1997).
Karplus, M. & McCammon, J. A. Molecular dynamics simulations of biomolecules. Nat. Struct. Biol. 9, 646–652 (2002).
Feher, V. A., Durrant, J. D., Wart, A. T. V. & Amaro, R. E. Computational approaches to mapping allosteric pathways. Curr. Opin. Struct. Biol. 25, 98–103 (2014).
Guo, J. & Zhou, H. X. Protein allostery and conformational dynamics. Chem. Rev. 11, 6503–6515 (2016).
Ghosh, A. & Vishveshwara, S. A study of communication pathways in methionyl-tRAN synthetase by molecular dynamics simulations and structure network analysis. Proc. Natl. Acad. Sci. USA 104, 15711–15716 (2007).
Kong, Y. & Karplus, M. Signaling pathways of PDZ2 domain: A molecular dynamics interaction correlation analysis. Proteins 74, 145–154 (2009).
Ishikura, T. & Yamato, T. Energy transfer pathways relevant for long-range intramolecular signaling of photosensory protein revealed by microscopic energy conductivity analysis. Chem. Phys. Lett. 432, 533–537 (2006).
Ota, N. & Agard, D. A. Intramolecular signaling pathways revealed by modeling anisotropic thermal diffusion. J. Mol. Biol. 351, 345–354 (2005).
Sharp, K. & Skinner, J. J. Pump-probe molecular dynamics as a tool for studying protein motion and long range coupling. Proteins 65, 347–361 (2006).
Nguyen, P. H., Park, S. & Stock, G. Nonequilirium molecular dynamics simulation of the energy transport through a peptide helix. J. Chem. Phys. 132, 025102 (2010).
Nguyen, P. H., Derreumaux, P. & Stock, G. Energy flow and long-range correlations in guanine-binding riboswitch: A nonequilibrium molecular dynamics study. J. Phys. Chem. 113, 9340–9347 (2009).
Kobus, M., Nguyen, P. H. & Stock, G. Coherent vibrational energy transfer along a peptide helix. J. Chem. Phys. 134, 124518 (2011).
Nguyen, P. H. & Stock, G. Nonequilibrium molecular dynamics simulation of a photoswitchable peptide. Chem. Phys. 323, 36–44 (2006).
Haliloglu, T., Bahar, I. & Erman, B. Gaussian dynamics of folded proteins. Phys. Rev. Lett. 79, 3090–3093 (1997).
Atilgan, A. R. et al. Anisotropy of fluctuation dynamics of proteins with an elastic network model. Biophys. J. 80, 505–515 (2001).
Li., H., Chang, Y. Y., Yang, L. W. & Bahar, I. iGNM 2.0: the Gaussian network model database for bimolecular structural dynamics. Nucleic. Acids. Res. 44, D415–422 (2016).
Bastolla, U. Computing protein dynamics from protein structure with elastic network models. WIREs comput. Mol. Sci. 4, 488–503 (2014).
Yang, L. W. et al. iGNM: A database of protein functional motions based on Gaussian network model. Bioinformatics 21, 2978–2987 (2005).
Rader, A. J., Chennubhotla, C., Yang, L. W. & Bahar, I. The Gaussian network model: Theory and applications in normal mode analysis. In Theory and applications to biological and chemical systems math & comp biology series. (Taylor & Francis Group, 2006).
Fuglebakk, E., Reuter, N. & Hinsen, K. Evaluation of protein elastic network models based on an analysis of collective motions. J. Chem. Theory Comput. 9, 5618–5628 (2013).
Su, J. G. et al. Analysis of domain movements in glutamine-binding protein with simple models. Biophys. J. 92, 1326–1335 (2007).
Su, J. G., Xu, X. J., Li, C. H., Chen, W. Z. & Wang, C. X. Identification of key residues for protein conformational transition using elastic network model. J. Chem. Phys. 135, 174101 (2011).
Su, J. G. et al. Identification of functionally key residues in AMPA receptor with a thermodynamic method. J. Phys. Chem. B 117, 8689–8696 (2013).
Su, J. G. et al. Prediction of allosteric sites on protein surface with an elastic-network-model-based thermodynamic method. Phys. Rev. E 90, 022719 (2014).
Su, J. G. et al. Conformational motions and functionally key residues for vitamin B12 transporter BtuCD-BtuF revealed by elastic network model with a function-related internal coordinate. Inter. J. Mol. Sci. 16, 17933–17951 (2015).
Su, J. G. et al. Analysis of conformational motions and related key residue interactions responsible for a specific function of proteins with elastic network model. J. Biomol. Struct. Dyn. 34, 560–571 (2016).
López-Blanco, J. R. & Chacón, P. Structural modeling from electron microscopy data. WIREs Comput. Mol. Sci. 5, 62–81 (2015).
Polles, G. et al. Mechanical and assembly units of viral capsids identified via quasi-rigid domain decomposition. PLoS Compt. Biol. 9, e1003331 (2013).
Na, H. & Song, G. Predicting the functional motions of p97 using symmetric normal modes. Proteins 84, 1823–1835 (2016).
Preller, M. & Manstein, D. J. Myosin structure, allostery, and mechano-chemistry. Structure 21, 1911–1922 (2013).
Gulick, A. M., Bauer, C. B., Thoden, J. B. & Rayment, I. X-ray structures of the MgADP, MgATPgammaS, and MgAMPPNP complexes of the Dictyostelium discoideum myosin motor domain. Biochemistry 36, 11619–11628 (1997).
Doyle, D. A. et al. Crystal structures of a complexed and peptide-free membrane protein-binding domain: molecular basis of peptide recognition by PDZ. Cell 85, 1067–1076 (1996).
Van Ham, M. & Hendriks, W. PDZ domains glue and guide. Mol. Biol. Rep. 30, 69–82 (2003).
Kneussel, M. & Wagner, W. Myosin motors at neuronal synapses: drivers of membrane transport and actin dynamics. Nat. Rev. Neurosci. 14, 233–247 (2013).
Masters, T. A., Kendrick-Jones, J. & Buss, F. Myosins: domain organization, motor properties, physiological roles and cellular functions. In The action cytoskeleton (Springer, 2016).
Houdusse, A., Szent-Gyorgyi, A. G. & Cohen, C. Three conformational states of scallop myosin S1. Proc. Natl. Acad. Sci. USA 97, 11238–11243 (2000).
Coureux, P. D., Sweeney, H. L. & Houdusse, A. Three myosin V structures delineate essential features of chemo-mechanical transduction. EMBO J. 23, 4527–4537 (2004).
Reubold, T. F., Eschenburg, S., Becker, A., Kull, F. J. & Manstein, D. J. A structural model for actin-induced nucleotide release in myosin. Nature Struct. Biol. 10, 826–830 (2003).
Tang, S. et al. Predicting allosteric communication in myosin via a pathway of conserved residues. J. Mol. Biol. 373, 1361–1373 (2007).
Zheng, W. & Brooks, B. Identification of dynamical correlations within the myosin motor domain by the normal mode analysis of an elastic network model. J. Mol. Biol. 346, 745–759 (2005).
Houdusse, A. & Sweeney, H. L. How myosin generates force on actin filaments. Trends Biochem. Sci. 41, 989–997 (2016).
Wulf, S. F., Ropars, V., Fujita-Becker, S., Oster, M. & Hofhaus, G. Force-producing ADP state of myosin bound to actin. Proc. Natl. Acad. Sci. USA 113, E1844–E1852 (2016).
Kornau, H. C., Schenker, L. T., Kennedy, M. B. & Seeburg, P. H. Domain interaction between NMDA receptor subunits and the postsynaptic density protein PSD-95. Science 269, 1737–1740 (1995).
Kim, E. & Sheng, M. PDZ domain proteins of synapses. Nat. Rev. Neurosci. 5, 771–781 (2004).
Lockless, S. W. & Ranganathan, R. Evolutionarily conserved pathways of energetic connectivity in protein families. Science 286, 295–299 (1999).
Fuentes, E. J., Der, C. J. & Lee, A. L. Ligand-dependent dynamics and intramolecular signaling in a PDZ domain. Journal Of Molecular Biology 335, 1105–1115 (2004).
Liu, X., Shepherd, T. R., Murray, A. M., Xu, Z. & Fuentes, E. J. The Structure of the Tiam1 PDZ Domain/Phospho-Syndecan1 Complex Reveals a Ligand Conformation that Modulates Protein Dynamics. Structure 21, 342–354 (2013).
Peterson, F. C., Penkert, R. R., Volkman, B. F. & Prehoda, K. E. Cdc42 regulates the Par-6 PDZ domain through an allosteric CRIB-PDZ transition. Molecular Cell 13, 665–676 (2004).
Zhang, J. et al. Crystallographic and Nuclear Magnetic Resonance Evaluation of the Impact of Peptide Binding to the Second PDZ Domain of Protein Tyrosine Phosphatase 1E. Biochemistry 49, 9280–9291 (2010).
Dijkstra, E. W. A note on two problems in connection with graphs. Numerische Mathematik 1, 269–271 (1959).
Ma, X., Meng, H. & Lai, L. Motions of allosteric and orthosteric ligand-binding sites in proteins are highly correlated. J. Chem. Inf. Model. 56, 1725–1733 (2016).
Gasper, P. M., Fuglestad, B., Komives, E. A., Markwick, P. R. L. & McCammon, J. A. Allosteric networks in thrombin distinguish procoagulant vs. anticoagulant activities. Proc. Natl. Acad. Sci. USA 109, 21216–21222 (2012).
McClendon, C. L., Friedland, G., Mobley, D. L., Amirkhani, H. & Jacobson, M. P. Quantifying correlations between allosteric sites in thermodynamic ensembles. J. Chem. Theory Comput. 5, 2486–2502 (2009).
Acknowledgements
This work was supported by the National Natural Science Foundation of China (No. 11704328 and 31671577), the Program for the Outstanding Young Talents of Hebei Province (No. 201317), and the Basic Research Project of Yanshan University (No. 16LGA010).
Author information
Authors and Affiliations
Contributions
J.G.S., J.Z.Z. and Q.M.L. conceived the project; J.G.S. and Y.D.W. derived the algorithms; W.B.W. and J.J.D. wrote the program code and conducted the simulations; W.B.W., Y.L. and J.Z. performed the data analysis; J.G.S. and J.Z.Z. wrote the manuscript; Q.M.L. reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, W.B., Liang, Y., Zhang, J. et al. Energy transport pathway in proteins: Insights from non-equilibrium molecular dynamics with elastic network model. Sci Rep 8, 9487 (2018). https://doi.org/10.1038/s41598-018-27745-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-018-27745-y
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.