Article | Open | Published:

# Understanding the differences of the ligand binding/unbinding pathways between phosphorylated and non-phosphorylated ARH1 using molecular dynamics simulations

## Introduction

ADP-ribosylation is a reversible post-translational modification (PTM) of specific target proteins (such for GTP-binding or ATP-binding proteins) and DNA1. The extent and duration of mono-ADP-ribosylation govern the activities of ADP-ribosylhydrolases (ARHs) and poly-ADP-ribosylglycohydrolases (PARGs), the latter of which reverse the transfer reaction by hydrolyzing the protein-ADP-ribose bonds and/or ADP-ribose–ADP-ribose bonds (see Figure S1) (i.e. the release of ADP-ribose from the acceptor molecule), and is catalyzed by ADP-ribosylhydrolases (ARHs) or poly-ADP-ribosylglycohydrolases (PARGs)1,2,3.

The ARH gene family comprises ARH1, ARH2, ARH34 and one PARG5. ARH1 has been shown as an Arg-specific mono-ADP-ribosyl hydrolase that can de-ADP-ribosylate many different Arg-ADP-ribosylated target proteins6,7,8. Neither the substrate specificity nor the enzymatic activity of ARH2 is known7. ARH3 is indicated to transfer to de-ADP-ribosylate poly-ADP-ribosylated substrates in mitochondria and to hydrolyse O-acetyl-ADP-ribose, a side product of sirtuin-dependent deacetylation7,9,10,11.

Until now, no structural information is available for ADP-ribosylhydrolases. To date, only the three-dimensional structures of human and mouse ARH31,7,9,12 and human ARH1 (PDB code 3HFW) have been determined.

Protein PTM by the attachment of chemical groups is used by cells from all kingdoms of life and occurs in several forms. Through the reversible covalent modification of specific amino acid side chains, enzyme activity or other protein function can be switched on and off as a rapid response to environmental stimuli. ADP-ribosylation is a PTM existing in two distinct forms, mono-and poly-ADP-ribosylation, resulting from the attachment of a single ADP-ribose moiety or a polymeric ADP-ribose chain structure, respectively, to the protein. However, the effect of the phosphorylation on ARH1 is still unknown. How do these four phosphorylated residues induce the structural changes upon the ligand binding? Do these phosphorylated residues affect the ligand binding and unbinding pathway?

Molecular dynamics (MD) simulation is a computer simulation method for studying the physical movements of macromolecules such as proteins13. MD has been applied mostly in chemical physics, material science and modeling of biomolecules atomic scales. These days, a new approach induces unbinding of ligands based on the ligand−receptor interactions and modeling of long-time movement of biomolecules, named steered molecular dynamics (SMD) simulations14,15,16. SMD can reveal structural changes in a protein at the atomic level for the ligand binding (unbinding) pathway through tunnel at a time scale inaccessible to standard MD. In this study, we employ MD and SMD to study structural changes induced by phosphorylation and their impact on protein binding (unbinding) pathway of ADP-ribose from ARH1 ligand-binding domain (LBD) in phosphorylated and non-phosphorylated ARH1.

The ligand binding mode and unbinding pathways of phosphorylated and non-phosphorylated ARH1 were revealed in this work, and hence provide the associated unbinding mechanism of ADP-ribose from phosphorylated and non-phosphorylated ARH1, respectively. In this case, for the two systems of ADP-RA in complex with phosphorylated and non-phosphorylated ARH1, 200 ns conventional MD simulations were applied respectively. After that, based on the 3D structure of X-ray, the possible ligand unbinding pathways were explored by using two SMD simulations, and the relevant unbinding mechanism was elucidated by analyzing the unbinding trajectories.

## Materials and Methods

### System Preparation

The starting structure of human ARH1 (the non-phosphorylated type) was taken from Protein Data Bank17 (PDB code 3HFW) and the phosphorylation on tyrosine was added into the phosphorylated type. The initial models of ADP-RA and ADP-ribose were downloaded from ChemSpider18. Geometric optimization of ADP-ribose and ADP-RA were performed at the B3LYP/6-31G* level using Gaussian09 [www.gaussian.com].

### Molecular docking

Autodock Vina19 was selected for docking. For the macromolecules, Kollman charges and AD4 type atoms were assigned, while hydrogen atoms were also merged into associated heavy atoms. In addition, grid box was added using AutoDock Tools program. Moreover, the docking method could reproduce the binding pose of the co-crystallized ligands in the used crystal structure.

### Conventional MD Simulations and analysis

The conventional MD simulations were performed using the NAMD software20 with CHARMM27 all-force field parameters20,21 support. For the macromolecules (human ARH1) and ligands (ADP-RA), generalized CHARMM27 all-force field parameters20,21 were applied. The initial models were constructed with crystallization water molecules. After that the resulting model were solvated in a cubic periodic box with TIP3P22 water. Counter ions (Na+, Cl) neutralized the systems. The resulting models were solvated with TIP3P22 water in a cubic periodic boundary conditions. The distance between the periodic boundary conditions and the closest protein atom was set to 10.0 Å. Prior to the MD simulation, each system was energetically minimized through the steepest descent algorithm with 50,000 steps to avoid steric clashes or improper geometries. After the minimization, an isothermal-isobaric (NPT) simulation was run by weak coupling to a bath of constant pressure (P0 = 1 bar, coupling time = 2.0 ps). In our study, the constant temperature control was based on Langevin dynamics23 with a damping coefficient (gamma) of 1.0 ps. The full-system periodic electrostatics were calculated by using the particle-mesh Ewald (PME) algorithm24. The MD simulation was carried out for 200 ns for the two protein-ligand complexes under the normal temperature (300 K) and pressure (1 bar), using a temperature coupling time constant of 0.1 ps and a pressure coupling time constant of 2.0 ps. And three replications for two complexes have been simulated in this study (Figures S2S3).

### Principal Component Analysis and Free Energy Landscape Analysis

Principal component analysis (PCA)25,26,27 is an effective and useful technique to reduce or simplify large and complicated movements of long trajectories generated by MD simulations. The goal of PCA is to produce a transformed set of variables, z1, z2,…, zp. The indices zi are called the principal components (PCs). Free Energy Landscape (FEL)28,29,30 is used to describe the energies of sets of macromolecules conformations. In detail, the first two principal components (PC1, PC2) of motions were displayed in the FEL map, projecting the trajectories on their first two principal components of motion.

### SMD Simulations

To reveal the human ARH1 processes underlying the unbinding of ADP-ribose, the center of mass of the ligands were forced to pull out it along a predefined direction using NAMD 2.10b1 version software and the CHARMM27 all-atom force field20,21. The direction of pulling was defined by the two points, the first point was the location of the active site, i.e. the center between the Cα atoms of Gly-101 and Ser-264, while another point was the centroid of phenyl ring coordinate in the ligand (ADP-ribose). Commonly implemented using a constant force or a constant velocity, SMD is useful for exploring possible permeation pathways through a macromolecule. In our study, the constant-velocity ensemble SMD simulations were performed. And the spring does not move in time accessible SMD simulation, a constant of 0.5 kcal· mol−1 ·A−2 was performed to stretch the imaginary atom from the central mass of SMD atom with constant velocity. The two ligand-receptor systems were performed in 6 ns SMD simulation. And three replications for two complexes have been simulated in this study (Figures S4S5).

### Potential of mean force calculations

The substrate pathway can be characterized by the substrate’s free energy along it, known as the potential of mean force (PMF)31,32,33, which dictates the speed of transport and the selectivity. The adaptive biasing force (ABF)34 was performed to generate quasi-equilibrium trajectories from which the PMF of the NAMD software can be deduced. In our study, for the two complexes, two different reaction coordinates exist, which are the distances between the substrates’ centers of mass and Mg2+ toward the opening of the pathway. The reaction coordinate was subdivided into equally spaced windows, and each was simulated for 6 ns. Finally, all the separated simulations belonging to the same reaction coordinate were combined into a single PMF, while the upper and lower boundaries were changed so that they encompass the entire range.

### MM-GBSA calculations

The Molecular Mechanics/Generalized Born Surface Area (MM/GBSA) calculation35 was performed to calculate the binding free energies of ADP-RA with phosphorylated and non-phosphorylated ARH1, respectively. Amber 14 package was used to performed 2 ns with Amber ff99 force field parameter36. The binding free energy ΔGbind estimated as follow:

$$\begin{array}{rcl}{{\rm{\Delta }}{\rm{G}}}_{{\rm{bind}}} & = & {{\rm{G}}}_{{\rm{complex}}}-{{\rm{G}}}_{{\rm{recptor}}}-{{\rm{G}}}_{{\rm{ligand}}}\\ {{\rm{\Delta }}{\rm{G}}}_{{\rm{bind}}} & = & {{\rm{\Delta }}{\rm{E}}}_{{\rm{gas}}}+{{\rm{\Delta }}{\rm{G}}}_{{\rm{sol}}}-{\rm{T}}{\rm{\Delta }}{\rm{S}}\\ {{\rm{\Delta }}{\rm{G}}}_{{\rm{sol}}} & = & {\rm{\Delta }}{\rm{GGB}}+{{\rm{\Delta }}{\rm{G}}}_{{\rm{SA}}}\\ {{\rm{\Delta }}{\rm{G}}}_{{\rm{SA}}} & = & {\rm{\gamma }}\cdot {\rm{\Delta }}\mathrm{SASA}\end{array}$$

where ΔEgas is known as the gas-phase interaction energy, including the van der Waals and the electrostatic energy, and Gsol is the solvation free energy, including the polar (ΔGGB) and non-polar (ΔGSA) energies. TΔS is the change of conformational entropy, which was neglected in the present study due to the low prediction accuracy and the expensive computational cost. In this study, the atomic radii were set to the default bondi values; the GB model was calculated by GB-OBC1 and the radii were set to the mbondi values. The solute dielectric constant was set to the default (ε = 1) for both of the polar solvation energies (∆G GB), and we used 80 for the exterior dielectric constants. The LCPO method was applied and the value of γ was 0. 007237.

### Data Availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

All data generated or analysed during this study are included in this published article (and its Supplementary Information files).

## Results and Discussion

### Conventional MD simulations

In the study of ADP-RA with phosphorylated and non-phosphorylated ARH1 complexes conformations. The substrate (ADP-RA) was docked in ARH1 with AutoDock Vina. The compartment between the docked ligand conformation (blue) and the reference conformation in the crystal structure (red) located in the active site are shown in Fig. 1. The RMSD of the substrate (ADP-RA) is 1.923, which is similar to the RMSD of the reference ligand (=1.659). The root-mean-square (RMSD) analyses of protein backbone atoms and the ligand atomic coordinates were performed to measure the structure stability of the complexes. Figure 2(a), the RMSD variations of the two systems on 200 ns time scales’ MD simulation indicate that the complex atomic coordinates and the initial structures are similar, and the RMSD values of the two systems converge to 1.1 and 1.2 Å, respectively, which show the two systems are all stable.

The root-mean-square fluctuation (RMSF) of the backbone atoms of ADP-RA with phosphorylated and non-phosphorylated ARH1 complexes were analyzed to measure the mobility of the protein residues. Figure 2(b) shows that the large fluctuations of residues mainly occur in the ADP-RA with non-phosphorylated ARH1 complex connecting the regular secondary structure elements. Residues G100, G113, S124, Y211, S241, S248 and S264 have higher score in the ADP-RA non-phosphorylated ARH1 complex. The score is the Cα RMSF value. For the residue E25, the values which are 1.41 Å in the non-phosphorylated type and 1.42 Å in the phosphorylated type are similar. For the residues G100, G113, S124, Y211, S241, S248 and S264 in the non-phosphorylated type, the values which are 1.68 Å (G100), 1.99 Å (G113), 1.82 Å (S124), 1.41 Å (Y211), 1.30 Å (S241), 1.47 Å (S248), 1.09 Å (S264) are higher than the values in the phosphorylated type, which are 1.47 Å (G100), 1.17 Å (G113), 0.49 Å (S124), 0.72 Å (Y211), 1.08 Å (S241), 1.10 Å (S248), 0.82 Å (S264) respectively, which exhibit a high mobility during MD simulations. Figure 2(c) shows the radius of gyration (Rg) for the non-phosphorylated ARH1 (blue) and phosphorylated ARH1 (purple).

The mean Rg (The mean radius of gyration of residues during MD simulations was computed to test the flexibility of conformation quantitatively of protein) for the non-phosphorylated ARH1 is about 1.93 Å, while for the phosphorylated ARH1 is about 1.92 Å. The result in Fig. 2(c) shows that in the phosphorylated type, the ARH1 complex is less flexibility than in the non-phosphorylated type.

Time-dependent solvent-accessible surface area (SASA) has also been calculated for the ensemble of structures from the simulations Fig. 2(d). The solvent-accessible surface area of residues to find the points on a sphere that are exposed to solvent. we carried out the analysis of tunnels in protein (ARH1) by CAVER software. The results were shown in Figure S6 and Tables S1 and S2, indicate that Y19 and E25 are the residues in the tunnel-lining. Moreover Y19 is the phosphorylated site in the protein (ARH1) and E25 is able to form the intermediate α helix on the time scales simulation. So we selected these two residues to perform SASA calculation. The SASA of the hydrophilic residues (Y19 and E25) were performed as shown in Fig. 2(d). The area contributions for the non-phosphorylated type and the phosphorylated type show that the difference of the two residues folding degrees. For the phosphorylated type, the area contributions of Y19 and E25 are larger than in the non-phosphorylated complex, show the more active of two residues in the phosphorylated state. After a period of 200 ns, among the active pocket residues, the four phosphorylated residues and eight residues which have a higher RMSF score in MD simulations, Y19 and E25 have some changes in the phosphorylated ARH1.

Subsequently, the secondary structure contents were also analyzed and the corresponding data was shown in Fig. 3(a–d), Figure S7 and Table 1. It can be seen that the α-helix content of non-phosphorylated ARH1 in E25 is 1.06% while in the phosphorylated ARH1 is about 25.88% (Table 1). The results of secondary structure analysis indicate that the short intermediate α helix of phosphorylated ARH1 during 200 ns MD simulations represents the ordered structure which may help the binding of ADP-RA to ARH1. Seen from Table S3 and Table S4, a hydrogen bond (Y19 and E25) disappear in the phosphorylated state, which may be useful to ADP binding.

Seen from Figure S8, S124, D302, G100 and S264 are key residues in ADP-ribose binding pocket, which can make hydrogen bonds with ADP-ribose. The hydrogen bond occupancy between ADP-RA and residues of ARH1 were listed in Table 2. From Table 2, it can be seen that, the hydrogen bond occupancy between ADP-RA and D302 and G100 had not a large change, while the hydrogen bond occupancy between ADP-RA and S124 and S264 increased in the phosphorylated type. To qualify the rotation of S124 and S264, the variations of the mainchain of S124 and S264 (see Figure S9) were measured during the MD simulation, as shown in Fig. 4(a–f). The torsion angles of two residues all have a significant change non-phosphorylated ARH1, which indicate the two residues are disordered in the non-phosphorylated ARH1 type. The swing of ADP-RA may force S124 and S264 to undergo a displacement and thence to increase the hydrogen bond occupancy between ADP-RA and ARH in the phosphorylated type (Table 2), which can stabilize the complex.

Reliable free energy calculations based on molecular dynamics (MD) simulations can provide the missing links between experimental binding affinities and 3D structures of protein-ligand complexes. When ligands differ by larger substitutes, or receptors differ by more drastic mutations (e.g., to alanine, alanine scanning calculation), it can be used to test the performance of the residues. Alanine scanning38 is a direct way of mutagenesis with A replacing the special residues. In this study, alanine scanning calculation was used to S124 and S264, and the results were listed in Table 3 and Figure S10. The results show that in the non-phosphorylated/ phosphorylated type, the binding energies of the residues (S124 and S264) are lower than the energies of the mutations (S124A and S264A), thus S124 and S264 have more close interactions with the ADP-RA than mutations (to alanine). Moreover, comparing the binding energy of the phosphorylated type with ADP-RA complex to the non-phosphorylated type, the calculation shows that the binding energies of the residues (S124 and S264) in the phosphorylated are lower than in the non-phosphorylated, which can help the binding to the ADP-RA.

According to the previous reports, the cation–π interaction is regarded as a key interaction in the protein structure39. As demonstrated in Table 4, the cation–π interactions in ARH1 were calculated. R119 and Y211 show the strongest cation–π interactions in ARH1 as calculated by using the Realistic Electrostatics program39,40. Figure 5(a) and (b) indicate that most of the distances between R119 and Y211 (from the cation to the centroid of the aromatic ring) are more than 5 Å in the phosphorylated ARH1-ADP-RA, showing that the interaction between R119 and Y211 is more stable in the phosphorylated ARH1 system. For demonstrated the relation between the less flexibility of the residue in the simulation with the binding of ADP-RA. We supplied the distances between R119 and Y211 (from the cation to the centroid of the aromatic ring) on 200 ns time scales. As shown in Fig. 5, the residues R119 and Y211 are located near to the loop of the active substrate binding pocket. These structural changes near to the loop of the binding pockets in the phosphorylated ARH1 may useful to substrate binding. In additional, from Tables S3 and S4, hydrogen bonds among Y205 become more in the phosphorylated state than that of non- phosphorylated state. Hydrogen bonds may be important for protein stability, which are useful to ADP binding.

To sum up, the phosphorylations on ARH1 lead three structural changes as follow: (1) E25 is more frequently in the α-helix conformationin the phosphorylated type with ADP-RA complex (51.56%) than that of the non-phosphorylated complex (2.12%); (2) S124 and S264 become less flexible in the phosphorylated type with the ADP-RA complex, which is helpful for the two residues formhydrogen bonds with ADP-RA; (3) Y211 is also less flexible in the phosphorylated type with ADP-RA complex, and it is useful for the stability of cation-π interaction of Y211-R119. The significance of difference between the phosphorylated and non-phosphorylated proteins lay in the phosphorylation of four residues can disrupt or form some H-bonds and lead to conformational changes in the active pocket, which may help the ligand binding and Mg2+ coordination, and thus can affect the catalytic efficiency of ARH1.

### Principal Component Analysis

The Free Energy Landscape (FEL), whose representation is achieved by projecting the trajectories on their first two principal components of motion41,42,43,44 are shown in Fig. 6(a) and (b). The first two principal components of motion are the considered reaction-coordinates. From this analysis, the minimum frequency value45, or the probability of the most likely conformation, and the number of minima over a case dependent threshold of frequency46.

The conformational transitions of the E25 are reflected in different regions of the free energy landscape (FEL), which is depicted on the basis of the projection of the first two principal components of the Cα trajectory (Fig. 6(a) and (b)). As shown in Fig. 6(a), the lowest-energy region of the FEL ofthe phosphorylated ARH1-ADP-RA is achieved after 26 ns, while the non-phosphorylated ARH1-ADP-RA complex does not become stable until 34 ns (Fig. 6(b)). In addition, the low-energy zone of the phosphorylated ARH1-ADP-RA is larger than that of the non-phosphorylated ARH1-ADP-RA, which suggests that the non-phosphorylated ARH1 protein undergoes a relatively long transition stage to reach the equilibrium. The dynamic conformational changes are driven by modifications (in the phosphorylated ARH1-ADP-RA complex, E25 located at α helix, while in the non-phosphorylated ARH1-ADP-RA complex, E25 located in the loop), mainly phosphorylation, to the four tyrosine residues.

The first few eigenvalues correspond to the concerted motions, and their amplitudes decrease to reach a large number of constrained but more localized fluctuations47. The first two principal components account for 37.36 and 42.11% of the motion observed in 200 ns of the trajectories for non-phosphorylated ARH1-ADP-RA, and phosphorylated ARH1-ADP-RA, respectively (Table 5), which reveals that the properties of the motions described by the first few principal components are different to non-phosphorylated ARH1-ADP-RA, and phosphorylated ARH1-ADP-RA as the magnitudes of the eigenvalues are higher for phosphorylated ARH1-ADP-RA, suggesting that phosphorylated ARH1 is the most affected by the presence of ADP-RA.

### SMD study

SMD45 can provide qualitative insights into the interactions and conformational changes between the ligand and its surrounding residues when inducing unbinding of ligand along the pathway on the molecular dynamics simulation. The samples for the same unbinding pathway are more extensive. In this study of SMD simulation, we performed the representative force profiles of the ligand, as showed here, Fig. 7(a) and (b) indicate the force profiles for the ADP-ribose specific unbinding from the binding pocket of phosphorylated ARH1 and non-phosphorylated ARH1. As demonstrated here, in Fig. 7(a), the time-dependent external pulling force applied to ADP-ribose with non-phosphorylated ARH1 increased linearly in the initial stage of the SMD simulation. Afterward, at around 700 ps, the pulling force reached a peak value of just about 2621 pN. At this moment, ADP-ribose was about to leave the active site of the non-phosphorylated ARH1. The value of the external force then rapidly decreased to zero and featured minor fluctuations near zero. Figure 7(a) indicates that the pulling force exerted on ADP-ribose with phosphorylated ARH1 increased directly in the first stage on the time scale SMD simulation until around 736 ps, at this time the pulling force reached a value of 1965 pN. The differences of the pulling force reflect the conformational changes ADP-ribose in the solvent, describing its full dissociation from the binding pocket of the protein.

At the beginning for the X-ray structure and thence used for SMD, ADP-ribose had literally tight interactions with its surrounding residues including S124, H165, S264, G101, G100, G127, G128, E25, D302, S270 and S269 by the strong hydrogen bonding and the cation-π interaction with Y263 at t = 0 ps with non-phosphorylated ARH1-ADP-ribose complex (see Figure S11). After ~700 ps for the non-phosphorylated ARH1-ADP-ribose complex, some hydrogen bonds were broken thus the interactions of the ligand with non-phosphorylated ARH were weakened. At this stage, S269, S124, and D302 formed hydrogen bonds with ADP-ribose (Fig. 8(a) and (b). The cation-π interaction between ADP-ribose and Tyr263 during SMD are shown in Fig. 9(a) and (b). Interestingly, although the ligand had a ~4.0 Å fluctuation align to its initial position, the strong hydrogen bonding with S264 with ADP-ribose still existed (t = 900 ps; see Figure S11) and disappeared at ~960 ps.

At the beginning for the X-ray structure and thence used for SMD, ADP-ribose had literally tight interactions with its surrounding residues including S124, H125, G127, G128, G101, G100, D55, A98, B302, S269 and S264, and the cation-π interaction with Y263 at t = 0 ps with phosphorylated ARH1-ADP-ribose complex (see Figure S11). After ~736 ps for the non-phosphorylated ARH1-ADP-ribose complex, S124, H125, G101, A98, D302, and S269 formed hydrogen bonds with ADP-ribose (Fig. 8(c) and (d). Compared with ADP-ribose with phosphorylated ARH1 and non-phosphorylated ARH1 type, it can be seen that E25 does not appeared at the unbinding pathway in the phosphorylated ARH1 and cannot hinder ADP-ribose leaving.

The average values of Fmax and Fsum based on ten SMD trajectories are presented in Table 6. The Fsum value is calculated based on the saved data points, for avoiding deviation of its real energy, an arbitrary unit is performed in the average values calculation44. An arbitrary unit is given to avoid deviation of its real energy. In this study of the average values, either Fmax or Fsum supports phosphorylated ARH1, which is favorable for ADP-ribose to be detached from ARH1.

### PMF and MM-GBSA calculations

In the following we performed the reconstruction of a potential of mean force (PMF) from SMD data.Based on the SMD trajectories, we separated a reaction coordinate into 12 equal sections. As shown in Fig. 10(a) and (b), the free energy differences for the two approaches of ligand-receptor complexes in PMF method were calculated. From the PMF calculated profiles, something interesting about the dissociation from the binding pocket of the ARH1 were revealed. At the initial stage, when the ligand depart from the initial conformational position, the free energy curve displayed a linearly increase, and this free energy demonstrated the interaction between the ligand and the binding pocket residues. The differences of free energy curve between non-phosphorylated with phosphorylated complexes are indicated in Fig. 10, for ADP with non-phosphorylated ARH1 in Fig. 10(a), the initial conformation of the inhibitor was in the considerable stable binding mode with the ARH1. Conversely, in Fig. 10(b), with the departure of the inhibitor from the protein binding pocket, the free energy value rapidly increased, which can be due to the interaction of the inhibitor with the active site residues. The free energy central barrier of the inhibitor unbinding process for the phosphorylated ARH1 is around 16.61 kcal·mol−1 while for the non-phosphorylated ARH1 it is around 18.14 kcal· mol−1. Thus, non-phosphorylated ARH1 is easy for ADP-ribose to be taken off.

The binding free energy calculated from the MM-GBSA is shown in Table 7, the results list ligand affinity with enzyme. For testing the binding free energy of ARH1-ADP-ribose complex, we extracted a representative comformation and a peak-timing structure on SMD simulation time scales. As shown in Table 7, the binding energies of the non-phosphorylated ARH1-ADP-ribose are −11.05 kcal·mol−1 (representative ARH1-ADP-ribose) and −18.33 kcal·mol−1 (ARH1-ADP-ribose-Peak) while for the phosphorylated ARH1-ADP-ribose are −23.57 kcal·mol−1 (representative ARH1-ADP-ribose) and −24.00 kcal·mol−1 (ARH1-ADP-ribose-Peak). The phosphorylated ARH1-ADP-ribose has lower binding energies, thus the ADP-ribose binding to ARH1 is more stable.

## Conclusion

ADP-ribosylation is a reversible and covalent PTM. Until now, the mechanism of PTM of ARH1 and phosphorylationinduced the ligands binding and unbinding processions is still poorly understood. MD simulations were used and the results indicate that E25 is more frequently in the α-helix conformationin the phosphorylated with ADP-RA complex than that of the non-phosphorylated complex, S124 and S264 become less flexible in the phosphorylated with ADP-RA complex, and Y211 is also less flexible in the phosphorylated with ADP-RA complex, in which these changes are useful for ADP-RA to bind the phosphorylated ARH1. The possible unbinding pathways of ADP-ribose from non-phosphorylated and phosphorylated ARH1 were explored by using SMD simulations. The results have showed phosphorylated ARH1 has more ordered structure than that of non-phosphorylated type, which can induce ligands to make more stable interactions ether in the SMD simulations. Our results will be useful for further studying PTM-induced ARHs structural change study.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## References

1. 1.

Mueller-Dieckmann, C., Kernstock, A. S., Mueller-Dieckmann, B., J., Weiss, C. M. S. & B, K. N. Structure of mouse ADP-ribosylhydrolase 3 (mARH3). Acta Crystallographica Section F 64, 156–162 (2008).

2. 2.

Koch-Nolte, F. & Haag, F. Mono(Adp-Ribosyl)Transferases and Related Enzymes in Animal Tissues. (Springer US, 1997).

3. 3.

Moss, J., Zolkiewska, A. & Okazaki, I. ADP-ribosylarginine hydrolases and ADP-ribosyltransferases. Partners in ADP-ribosylation cycles. (Springer US, 1997).

4. 4.

Glowacki, G. et al. The family of toxin-related ecto-ADP-ribosyltransferases in humans and the mouse. Protein Science 11, 1657–1670 (2002).

5. 5.

Koch-Nolte, F., Kernstock, S., Mueller-Dieckmann, C., Weiss, M. S. & Haag, F. Mammalian ADP-ribosyltransferases and ADP-ribosylhydrolases. Frontiers in Bioscience A Journal & Virtual Library 13, 6716–6729 (2008).

6. 6.

Moss, J. et al. Molecular and Immunological Characterization of ADP-ribosylarginine Hydrolases. (Springer New York, 1992).

7. 7.

Muellerdieckmann, C. et al. The structure of human ADP-ribosylhydrolase 3 (ARH3) provides insights into the reversibility of protein ADP-ribosylation. Proceedings of the National Academy of Sciences 103, 15026–15031 (2006).

8. 8.

Kernstock, S., Koch-Nolte, F., Mueller-Dieckmann, J., Weiss, M. S. & Mueller-Dieckmann, C. Cloning, expression, purification and crystallization as well as X-ray fluorescence and preliminary X-ray diffraction analyses of human ADP-ribosylhydrolase 1. Acta Crystallographica Section F 65, 529–532 (2009).

9. 9.

Oka, S., Kato, J. & Moss, J. Identification and characterization of a mammalian 39-kDa poly(ADP-ribose) glycohydrolase. Journal of Biological Chemistry 281, 705–713 (2006).

10. 10.

Kumpel, B. M., Sibley, K., Jackson, D. J., White, G. & Soothill, P. W. Ultrastructural localization of glycoprotein IIIa (GPIIIa, beta 3 integrin) on placental syncytiotrophoblast microvilli: implications for platelet alloimmunization during pregnancy. Molecular & Cellular Biology 28, 814–824 (2008).

11. 11.

Ono, T., Kasamatsu, A., Oka, S. & Moss, J. The 39-kDa poly(ADP-ribose) glycohydrolase ARH3 hydrolyzes O-acetyl-ADP-ribose, a product of the Sir2 family of acetyl-histone deacetylases. Proceedings of the National Academy of Sciences of the United States of America 103, 16687–16691 (2006).

12. 12.

Kuper, J. et al. On the routine use of soft X-rays in macromolecular crystallography. Part IV. Efficient determination of anomalous substructures in biomacromolecules using longer X-ray wavelengths. Acta Crystallographica 63, 366–380 (2007).

13. 13.

Alder, B. J. & Wainwright, T. E. Studies in Molecular Dynamics. I. General Method. Journal of Chemical Physics 31, 459–466 (1959).

14. 14.

Izrailev, S. et al. Steered Molecular. Dynamics. 4, 39–65 (1999).

15. 15.

Isralewitz, B., Gao, M. & Schulten, K. Steered molecular dynamics and mechanical functions of proteins. Current Opinion in Structural Biology 11, 224–230 (2001).

16. 16.

Wang, D. et al. Exploration of the chlorpyrifos escape pathway from acylpeptide hydrolases using steered molecular dynamics simulations. Journal of Biomolecular Structure & Dynamics 1–13 (2015).

17. 17.

Berman, H. M. et al. The Protein Data Bank. Genetica 106, 149–158 (2000).

18. 18.

Ayers, M. ChemSpider: The Free Chemical Database. Reference Reviews 26, 45–46 (2012).

19. 19.

Trott, O. & Olson, A. J. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of Computational Chemistry 31, 455–461 (2010).

20. 20.

Best, R. B. et al. Optimization of the Additive CHARMM All-Atom Protein Force Field Targeting Improved Sampling of the Backbone ϕ, ψ and Side-Chain χ1 and χ2 Dihedral Angles. Journal of Chemical Theory & Computation 8, 3257–3273 (2012).

21. 21.

Jr, M. A., Feig, M. & Rd, B. C. Extending the treatment of backbone energetics in protein force fields: limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. Journal of Computational Chemistry 25, 1400–1415 (2004).

22. 22.

Harrach, M. F. & Drossel, B. Structure and dynamics of TIP3P, TIP4P, and TIP5P water near smooth and atomistic walls of different hydroaffinity. Journal of Chemical Physics 140, 3393–3393 (2014).

23. 23.

Schlick, T. Molecular Modeling and Simulation. (Springer, 2002).

24. 24.

Darden, T., York, D. & Pedersen, L. Particle mesh Ewald: An Nṡlog(N) method for Ewald sums in large systems. Journal of Chemical Physics 98, 10089–10092 (1993).

25. 25.

Kume, A. et al. Exploring high-affinity binding properties of octamer peptides by principal component analysis of tetramer peptides. Journal of Bioscience & Bioengineering (2016).

26. 26.

Ouaray, Z., Elsawy, K. M., Lane, D. P., Essex, J. W. & Verma, C. Reactivation of mutant p53: Constraints on mechanism highlighted by principal component analysis of the DNA binding domain. Proteins-structure Function & Bioinformatics (2016).

27. 27.

Ernst, M., Sittel, F. & Stock, G. Contact- and distance-based principal component analysis of protein dynamics. Journal of Chemical Physics 143, 12B640_641–7760 (2015).

28. 28.

Iida, S. et al. Variation of free-energy landscape of the p53 C-terminal domain induced by acetylation: Enhanced conformational sampling. Journal of Computational Chemistry (2016).

29. 29.

Tripathi, S., Srivastava, G. & Sharma, A. Molecular dynamics simulation and free energy landscape methods in probing L215H, L217R and L225M βI-tubulin mutations causing paclitaxel resistance in cancer cells. Biochemical & Biophysical Research Communications 476, 273–279 (2016).

30. 30.

Adrien Nicolaï, P. D. & Senet, Patrick. Decipher the Mechanisms of Protein Conformational Changes Induced by Nucleotide Binding through Free-Energy Landscape Analysis: ATP Binding to Hsp70. Plos Computational Biology 9, e1003379 (2013).

31. 31.

Xu, X., Kanduč, M., Wu, J. & Dzubiella, J. Potential of mean force and transient states in polyelectrolyte pair complexation. Journal of Chemical Physics 145, 407–1009 (2016).

32. 32.

Zhang, Y., Huang, X., Han, K., Zheng, F. & Zhan, C. G. Free energy profiles of cocaine esterase-cocaine binding process by molecular dynamics and potential of mean force simulations. Chemico-biological interactions (2016).

33. 33.

Niu, Y., Li, S., Pan, D., Liu, H. & Yao, X. Computational study on the unbinding pathways of B-RAF inhibitors and its implication for the difference of residence time: insight from random acceleration and steered molecular dynamics simulations. Physical Chemistry Chemical Physics Pccp 18, 5622–5629 (2016).

34. 34.

Darve, E. & Pohorille, A. Adaptive biasing force method for scalar and vector free energy calculations. Journal of Chemical Physics 128, 144120–144120 (2008).

35. 35.

Genheden, S. & Ryde, U. The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities. Expert Opinion on Drug Discovery 10, 449–461 (2015).

36. 36.

Samudravijaya, K. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins Structure Function & Bioinformatics 65, 712–725 (2006).

37. 37.

Hou, T., Wang, J., Li, Y. & Wang, W. Assessing the performance of the molecular mechanics/Poisson Boltzmann surface area and molecular mechanics/generalized Born surface area methods. II. The accuracy of ranking poses generated from docking. Journal of Computational Chemistry 32, 866–877 (2011).

38. 38.

Kollman, I. M. P. A. Computational alanine scanning to probe protein-protein interactions: A novel approach to evaluate binding free energies. Journal of the American Chemical Society 121, 8133–8143 (1999).

39. 39.

Wang, F., Wan, H., Hu, J. P. & Chang, S. Molecular dynamics simulations of wild type and mutants of botulinum neurotoxin A complexed with synaptic vesicle protein 2C. Molecular Biosystems 11, 223–231 (2015).

40. 40.

Gromiha, M. M. & Suwa, M. Structural analysis of residues involving cation-pi interactions in different folding types of membrane proteins. International Journal of Biological Macromolecules 35, 55–62 (2005).

41. 41.

Papaleo, E., Mereghetti, P., Fantucci, P., Grandori, R. & Gioia, L. D. Free-energy landscape, principal component analysis, and structural clustering to identify representative conformations from molecular dynamics simulations: The myoglobin case. Journal of Molecular Graphics & Modelling 27, 889–899 (2009).

42. 42.

Qian, M. et al. Structural Basis of Fullerene Derivatives as Novel Potent Inhibitors of Protein Tyrosine Phosphatase 1B: Insight into the Inhibitory Mechanism through Molecular Modeling Studies. Journal of Chemical Information & Modeling (2016).

43. 43.

Chiappori, F., Merelli, I., Milanesi, L. & Rovida, E. Exploring the role of the phospholipid ligand in endothelial protein C receptor: a molecular dynamics study. Proteins-structure Function & Bioinformatics 78, 2679–2690 (2010).

44. 44.

Chiappori, F., Mattiazzi, L., Milanesi, L. & Merelli, I. A novel molecular dynamics approach to evaluate the effect of phosphorylation on multimeric protein interface: the αB-Crystallin case study. BMC Bioinformatics 17, 225–234 (2016).

45. 45.

Meijles, D. N., Fan, L. M., Howlin, B. J. & Li, J. M. Molecular Insights of p47phox Phosphorylation Dynamics in the Regulation of NADPH Oxidase Activation and Superoxide Production. Journal of Biological Chemistry 289, 22759–22770 (2014).

46. 46.

Jarmuła, A., Frączyk, T., Cieplak, P. & Rode, W. Mechanism of influence of phosphorylation on serine 124 on a decrease of catalytic activity of human thymidylate synthase. Bioorganic & Medicinal Chemistry 18, 3361–3370 (2010).

47. 47.

Burendahl, S., Danciulescu, C. & Nilsson, L. Ligand unbinding from the estrogen receptor: A computational study of pathways and ligand specificity. Proteins-structure Function & Bioinformatics 77, 842–856 (2009).

## Acknowledgements

This work was supported by the Major Scientific Research Projects of Jilin Province [20140203025NY], the Natural Science Foundation of Jilin Province [3B812C201465], and by the US National Institutes of Health (GM100701). All the simulations were conducted at the High Performance Computing Center of Jilin University.

## Author information

### Affiliations

1. #### Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Science, Jilin University, 2699 Qianjin Street, Changchun, 130012, China

• Jingxuan Zhu
• , Yishuo Lv
•  & Weiwei Han
2. #### Department of Electric Engineering and Computer Science, C.S. Bond Life Sciences Center, University of Missouri, Columbia, Missouri, 65211, USA

• Xiaosong Han
• , Dong Xu
•  & Weiwei Han
3. #### College of Computer Science and Technology Jilin University, 2699 Qianjin Street, Changchun, 130012, China

• Xiaosong Han
•  & Dong Xu

### Contributions

W.H. and J.Z. conceived and designed research; J.Z. performed MD and SMD simulation; J.Z., Y.L., X.H., D.X.and W.H. analyzed data; W.H., D.X. and J.Z. wrote the paper, and all authors approved the article.

### Competing Interests

The authors declare that they have no competing interests.

### Corresponding authors

Correspondence to Dong Xu or Weiwei Han.