Mimicking titration experiments with MD simulations: A protocol for the investigation of pH-dependent effects on proteins

Socher, Eileen; Sticht, Heinrich

doi:10.1038/srep22523

Download PDF

Article
Open access
Published: 03 March 2016

Mimicking titration experiments with MD simulations: A protocol for the investigation of pH-dependent effects on proteins

Eileen Socher¹ &
Heinrich Sticht¹

Scientific Reports volume 6, Article number: 22523 (2016) Cite this article

7816 Accesses
20 Citations
Metrics details

Subjects

An Erratum to this article was published on 03 May 2016

This article has been updated

Abstract

Protein structure and function are highly dependent on the environmental pH. However, the temporal or spatial resolution of experimental approaches hampers direct observation of pH-induced conformational changes at the atomic level. Molecular dynamics (MD) simulation strategies (e.g. constant pH MD) have been developed to bridge this gap. However, one frequent problem is the sampling of unrealistic conformations, which may also lead to poor pK_a predictions. To address this problem, we have developed and benchmarked the pH-titration MD (pHtMD) approach, which is inspired by wet-lab titration experiments. We give several examples how the pHtMD protocol can be applied for pK_a calculation including peptide systems, Staphylococcus nuclease (SNase), and the chaperone HdeA. For HdeA, pHtMD is also capable of monitoring pH-dependent dimer dissociation in accordance with experiments. We conclude that pHtMD represents a versatile tool for pK_a value calculation and simulation of pH-dependent effects in proteins.

Validating the CHARMM36m protein force field with LJ-PME reveals altered hydrogen bonding dynamics under elevated pressures

Article Open access 28 June 2021

Structure determination of high-energy states in a dynamic protein ensemble

Article 02 March 2022

Multisecond ligand dissociation dynamics from atomistic simulations

Article Open access 10 June 2020

Introduction

Solution pH can have a drastic effect on protein structure and function, which has been exploited by nature to trigger a large variety of physiological processes. For example, some bacteria are able to survive the acidic conditions in the stomach of their host by using acid-activated chaperones which protect substrate proteins upon binding¹. In viruses, some of the fusion proteins that mediate cell entry have been described to act pH-dependently^2,3. Other proteins in vertebrates undergo pH changes during their maturation on the way through the endoplasmic reticulum and the Golgi apparatus⁴. In plants, the simultaneous closure of water channels has been observed as a response to changing pH values during flooding⁵.

On a molecular level, changes in the pH value affect the protonation state of several types of amino acids, including aspartate, glutamate, histidine, lysine, cysteine, and tyrosine. The addition or removal of a proton always changes the charge of the respective amino acid side chain, thereby affecting the charge distribution within the protein, which may lead to conformational changes. For instance, these structural alterations can trigger changes in protein activity, ligand binding properties, or the oligomerization state.

However, due to the temporal or spatial resolution of experimental approaches, it is extremely difficult to observe pH-induced conformational changes in proteins directly at the atomic level. Also the generation of structural data at different pH values, for instance with X-ray crystallography or NMR spectroscopy, underlies different restrictions and is technically very demanding. To mention only a few general limitations, proteins mostly do not crystallize at very different pH values and NMR spectroscopy is limited to small proteins.

At this point, molecular dynamics (MD) simulations, which start from experimentally determined structures, can help investigate the effect of pH changes on an atomic level and on picosecond to microsecond time scales. One hallmark of conventional MD simulations is the fact that an initially assigned protonation state cannot be changed during the simulation. This “constant protonation” approach results in some drawbacks for studying pH-dependent effects⁶: (1) Assigning the right protonation states for the titratable groups in the protein requires knowledge of their pK_a values, (2) if any of these pK_a values are near the solvent pH there may be no single protonation state that adequately represents the ensemble of protonation states appropriate at that pH, and (3) the invariable protonation states decouple the dynamic dependence of pK_a and protonation state on conformation.

To avoid these problems, the constant pH molecular dynamics (CpHMD) approach was developed^6,7. One widespread implementation, for example in the AMBER software suite, performs Monte Carlo sampling of the Boltzmann distribution of protonation states interspersed in the molecular dynamics simulation⁸. Thereby, the solution pH is set as an external variable determining the distribution of the different protonation states, which are modeled by different charge sets⁸.

CpHMD has become a popular method to study the pH-dependence of protein⁹ and peptide¹⁰ structures or to calculate the pK_a values of titratable residues^6,11. However, a comparison between calculated and experimentally determined pK_a values frequently revealed significant differences indicating that unrealistic protein conformations are sampled^11,12. Recent approaches to reduce this problem are constant pH replica exchange molecular dynamics (pH-REMD) simulations^13,14 and the explicit consideration of the solvent^12,15.

As an alternative approach, we have devised a modified procedure, which is inspired by wet-lab titration experiments. This pH-titration MD (pHtMD) relies on the overall concept of CpHMD, but performs a consecutive series of MD simulations with small pH changes, which allows a smooth adaption of the structure to the solvent pH (Fig. 1).

**Figure 1: Workflow of pHtMD simulations.**

The rationale for suggesting this titration concept was the following: Conventional CpHMD usually runs a set of simulations at different pH values that are fixed at the beginning of each simulation and may differ significantly from the pH at which the structure was determined (e.g. pH 3 simulation using a pH 8 structure as a template). CpHMD thus requires a rapid adaptation of the structure to different pH values, which may cause the sampling of unrealistic conformations thereby producing inaccurate pK_a values.

To address this problem, we have developed and benchmarked the pHtMD approach as described in the present manuscript. First, we demonstrate that our approach is capable of accurately reproducing the pK_a values of small model peptides. Application of the approach to the staphylococcal nuclease (SNase) mutant Δ+PHS demonstrates that the pHtMD method performs better than conventional CpHMD methods for the prediction of pK_a values for this system. For the dimeric HdeA protein, pHtMD predicts pK_a values with a similar accuracy as recent pH-REMD simulations and is also capable of monitoring dimer dissociation at low pH values in accordance with experiments. We conclude that pHtMD represents a versatile tool for the calculation of pK_a values and the simulation of pH-dependent motions in proteins.

Results

Prediction of pK_a values for model peptides

The energy function for titratable amino acids in AMBER has been designed in that fashion that the experimentally observed pK_a values are reproduced, when simulations are performed according to the standard CpHMD approach implemented in AMBER. Therefore, the first aim of our work was to ensure that simulations according to the pHtMD protocol are still consistent with the AMBER parametrization and do not result in unrealistic pK_a values. To address this point, Ace-X-Nme model peptides were built for Asp, Glu, His, Cys, Tyr, and Lys residues. To eliminate effects on the amino acid side chain pK_a values by charged termini, the terminal ends were capped with N-terminal acetyl (ACE) and C-terminal N-methylamine groups (NME). The simulations of these model compounds started with deprotonated states at higher pH values and were slowly decreased, so that each titratable side chain became protonated. By plotting the deprotonated fraction over the pH range, titration curves can be fitted (Fig. 2a–f). The difference between the calculated pK_a values and the reference pK_a values of the side chains (implemented in AMBER) is maximally 0.05. The Hill coefficients are between 0.99 and 1.02. All in all, the findings for the model compounds are in good agreement with the expected results.

**Figure 2: Titration curves for side chains in the Ace-X-Nme model compounds.**

The pentapeptide Ala-Cys-Phe-Cys-Ala (ACFCA) had been designed as a model system by Swails et al.¹² for the development of pH replica exchange MD (pH-REMD) simulation methods. The peptide contains two charged termini as well as two titratable cysteines. Due to the charge, the termini interact electrostatically with the neighboring cysteines, thereby influencing their pK_a values (Fig. 3a).

**Figure 3: Titration curves for the cysteines in the ACFCA pentapeptide.**

The ACFCA pentapeptide was simulated according to the pHtMD strategy described in Methods. First, the peptide was titrated from pH 7 to pH 11 within 200 ns (Fig. 3b, left half). As expected, the titration curves of the cysteines are shifted to the left or right with respect to the titration curve of the Ace-Cys-Nme model compound, due to the influence of the charged termini of the pentapeptide. The titration curve of Cys2 is shifted to a lower pK_a value of 8.14 because of electrostatic interactions with the positively charged N-terminus, whereas the pK_a value of Cys4 is increased to 9.41 due to the vicinity of the negatively charged C-terminus (Fig. 3b). In a second step we checked for hysteresis by titrating the peptides back from pH 11 to pH 7 (Fig. 3b, right half). The pK_a values obtained from both titrations for the cysteines differ by ≤0.1 units. To allow a comparison to other approaches, a pH-REMD simulation in implicit solvent was performed using a total of six replicas at the following pH values: 7.1, 8.1, 8.3, 8.7, 8.9, and 9.9 (Fig. 3c). These pH values were adapted from the pH-REMD study by Swails et al.¹² that was performed for the same peptide in explicit solvent.

There is a good agreement between both simulation strategies and the determined pK_a values differ by less than 0.1 units. In addition, the values obtained are also close to those reported by Swails et al. for pH-REMD simulations in explicit solvent (cysteine pK_a values of 8.16 and 9.37, respectively)¹². An additional feature of pHtMD is the larger number of data points generated compared to conventional CpHMD or pH-REMD simulations (see Fig. 3b vs. 3c). This makes curve fitting less dependent on the inaccuracies of individual data points.

Thus we conclude that for small model compounds the performance of pHtMD is similar to that of other state-of-the-art methods. The next simulations investigated whether pHtMD also gives good pK_a predictions for globular proteins.

Prediction of pK_a values for globular proteins

The staphylococcal nuclease (SNase) is a highly charged protein¹¹, which contains 149 residues and carries no disulfide bonds. To test our simulation protocol, we investigated the SNase Δ+PHS mutant in the pH range between 8 and 2. This mutant represents a hyperstable, acid-resistant SNase with five substitutions (G50F, V51N, P117G, H124L, and S128A) and a deletion of residues 44–49¹⁶ (Fig. 4a). For the simulations, all Asp, Glu, His, Tyr, and Lys residues of the crystal structure were set as titrating residues. Representative titration curves are shown in Fig. 4b–d and the number of transitions observed as well as the solvent exposure of the respective group is shown.

**Figure 4: Titration of acidic side chains in the staphylococcal nuclease mutant Δ+PHS and of the His15 side chain in HEWL.**

Glu52 (Fig. 4b) is a residue that is highly solvent exposed between pH 2 and 7 and a sufficiently large number of transitions between the different protonation states is observed around the pK_a value thus providing a good basis for the fitting of the titration curve and calculation of the pK_a value. In contrast to Glu52, Glu67 (Fig. 4c) is significantly buried at pH < 4, which results in a low number of transitions observed. Consequently, a reliable calculation of the deprotonated fraction is not possible for the respective data points, which also hampers pK_a value calculation. The inclusion of all data points for curve fitting yields a pK_a value of 4.38 for Glu67 (gray line in Fig. 4c). Excluding the data at which the carboxyl group is exposed <30% from the fitting procedure (see Methods) leads to the new fitted curve (green line). The resulting pK_a value of 4.21 is now closer to the experimentally determined pK_a value of 3.76¹⁶.

Glu67 is also a good example to demonstrate that the correlation between experiment and prediction can be improved by running multiple independent pHtMD experiments. For SNase Δ+PHS a total of six simulations were performed and resulted in pK_a values of 4.21, 3.85, 3.78, 3.52, 3.64, and 4.05. The average value of 3.84 (Table 1) is in better agreement with the experimental pK_a value of 3.76 than the results from the individual simulations.

Table 1 Predicted and experimental pK_a values of the SNase mutant Δ+PHS.

Full size table

The exclusion of data points collected for significantly buried side chain conformations proved to be beneficial for those residues that are only buried for part of the simulation time. However, this approach cannot be applied to residues that are buried for most of the simulation time. This is, for example, the case for the three aspartates of the active site (Asp19, Asp21, Asp40) and for Asp83 (Fig. 4d). Consequently, no pK_a value was calculated for these residues. The problem that no reliable pK_a values can be obtained for buried residues showing a low number of transitions between different protonation states has already been noted in previous CpHMD¹¹ studies. The experimental study conducted by Castañeda et al. showed that neither Asp77 nor Asp83 titrated between pH 2 and pH 9 and that the oxygen atoms of these residues were less than 10% solvent-accessible¹⁶. All pK_a values obtained from the pHtMD approach are summarized in Table 1 and compared to experimental and alternative computational approaches. Inspection of the average deviation between experimental and predicted pK_a values reveals that our approach performs better for SNase Δ+PHS than a previous conventional CpHMD simulation¹¹ or PROPKA pK_a predictions based on the static crystal structure.

The lack of consideration of the protein dynamics could also be the reason why the predicted pK_a values of PROPKA differ sometimes more than one pH unit from the experimental values (e.g Glu73, Glu101, and His121). For these residues, the pHtMD simulation protocol predicted more precise pK_a values (pK_a differences: <0.6 pH units). The improvement compared to the conventional CpHMD simulation most likely results from a more accurate sampling of the conformational space, which was already identified as a key prerequisite for the prediction of reliable pK_a values by Williams et al.¹¹.

In our study of SNase above, pK_a values were obtained from single pHtMD experiments that covered a pH range from 8 to 2. Next, we investigated whether two pHtMD simulations may be combined to cover a broader pH range. As an example we selected hen egg-white lysozyme (HEWL), which contains a single histidine residue (Fig. 4e). Two pHtMD simulations were performed either covering a pH range from 7 to 2 or from 7 to 12. The rationale for choosing pH 7 was to start at a pH value for which an experimentally determined protein structure is available as starting point of the simulations. Figure 4f shows that the two resulting datasets can be combined to cover the entire pH range from 2 to 12.

The strategy above was repeated four times and resulted in an average pK_a value of 5.69, which is close to the pK_a values found for His15 in experiments: 5.68–5.74¹⁷ or 5.5 ± 0.2¹⁸. The pHtMD prediction is also closer to the experimental values than predictions from PROPKA (pK_a = 6.31) or from conventional CpHMD simulations (pK_a = 6.45⁶).

Prediction of pH-dependent motions in globular proteins

HdeA, the small acid stress chaperone, is important for Escherichia coli to survive in the acid environment of the host stomach because it binds other periplasmic proteins and prevents their aggregation. At neutral pH, HdeA forms homodimers, which dissociate into disordered monomers when the pH decreases^19,20. We have selected HdeA as a model system to investigate whether pHtMD can also monitor larger-scale structural changes in addition to predicting pK_a values. The pHtMD simulations started with the HdeA dimer, for which a crystal structure is available, and simulated a slow decrease of the pH from 7 to 2 within 250 ns.

The pK_a values of all aspartates and glutamates were calculated and are summarized in Table 2. The average deviation of 0.27 from the experimentally determined pK_a values is smaller than that resulting from PROPKA predictions (average deviation =0.52). This better performance of pHtMD indicates the requirement of an accurate conformational sampling, which is particularly relevant for proteins such as HdeA that undergo large conformational rearrangements upon pH changes.

Table 2 Predicted and experimental pK_a values of HdeA.

Full size table

As an alternative approach to obtain more realistic pK_a values, pH-REMD has been recently performed for HdeA by Ahlstrom et al.²¹. The results from this approach are also given in Table 2 for comparison. Depending on the oligomerization state chosen for simulation (monomer vs. dimer) pH-REMD results in average deviations of 0.16 and 0.33 pH units from the experimental pK_a values; this is in the same range as for our pHtMD protocol.

We also investigated whether pHtMD does not only allow to calculate pK_a values but also is able to monitor pH-dependent HdeA dissociation. In fact, it was possible to observe the dissociation process and the transition from a well-folded dimeric structure into disordered monomers (Fig. 5a). Unfolding and subsequent dissociation occurs in the pH range from 3.5 to 2 (Fig. 5a), which is also reflected by the gradual decrease of intermolecular contacts (Fig. 5b). At pH 2.2 all intermolecular contacts are lost. Since unfolding and dissociation occur over a rather broad pH range, they are covered by >50 ns of the simulation time. This is also reflected in the gradual changes of the protonation states of the titrating groups (Fig. 6), which allow curve fitting and calculations of the pK_a values. The structural properties of HdeA monitored during the pHtMD simulation are also in line with NMR spectroscopic data indicating that HdeA remains dimeric from pH 6 down to pH 3²² and that an unfolded monomeric conformation is present at pH 2.5²⁰.

**Figure 5: pH-dependent dissociation of the HdeA dimer of *Escherichia coli*.**

**Figure 6: Titration curves of acidic residues.**

The analysis of the pHtMD simulation also allows extracting additional properties of the system such as the overall charge as a function of the pH (Fig. 5c). In the respective plot, the overall charge is calculated from the protonation state of the ionizable groups after every nanosecond of simulation resulting in 250 data points. The result approximates the pH range in which a particular net charge is observed. The respective plot can be directly calculated based on the protonation states observed and does not rely on an interpretation of pK_a values that may be difficult to obtain for buried side chains (see Fig. 4c,d above for examples).

Discussion

In the present work, we have devised the pHtMD protocol, which is based on the CpHMD approach, to calculate pK_a values and to simulate pH-dependent effects on proteins. The concept of pHtMD aims to perform a consecutive series of MD simulations with small pH changes in a defined direction to allow a smooth adaption of the structure to the solvent pH. This differs from the traditional CpHMD approach, in which a set of simulations is run at different pH values, which may differ significantly from the pH at which the structure was determined (e.g. pH 2 simulation using a pH 8 structure as a template). Consequently, the structure needs to abruptly adapt to the lower pH, which may cause the sampling of unrealistic conformations thus producing inaccurate pK_a values.

pHtMD was applied to model peptides and globular proteins to assess the potential range of applications: SNase Δ+PHS represents a protein system that exhibits strongly shifted pK_a values, and HdeA is a protein that undergoes subunit dissociation at low pH. For the SNase and HdeA systems, the pK_a of the titratable groups was predicted with an average deviation of 0.45 and 0.27, respectively (Tables 1 and 2). For HdeA, this accuracy is within the range observed for recent pK_a predictions using the pH-REMD strategy²¹. In addition, the average errors of pHtMD are also within the estimated uncertainty of 0.5 pH units for pK_a values determined by NMR experimental techniques¹⁸. As one key feature of pHtMD, a large pH range can be scanned within one simulation and no prior pK_a value estimations of individual residues is required for the setup. Similar to other CpHMD-based protocols, pHtMD has problems to predict reliable pK_a values for highly buried residues that do not titrate within a physiological pH range. However, due to their strongly shifted pK_a values, these residues do not function as triggers of physiologically relevant pH-induced conformational changes. Therefore, the inability to calculate pK_a values of those residues does not critically affect the applicability of the pHtMD approach to investigate pH-dependent effects on protein structure that occur in a physiological pH range.

For HdeA it was possible to monitor the dissociation process and the pH range for dissociation is in close agreement with the experiment^19,22. As one advantage, the pHtMD method does not require a priori experimental knowledge of whether dissociation occurs and there is no need for an explicit consideration of dimeric and monomeric states in separate simulations. Instead, the dissociation process can be monitored within one single pHtMD run and does not require including additional assumptions for the setup of the simulation.

Similar to other simulation protocols, several repetitions of each pHtMD simulation allows increasing accuracy of the results, which is particularly important for systems with partially buried titratable residues such as SNase Δ+PHS. Multiple simulations reduce the risk that an unrealistic conformational sampling within a single simulation might result in a wrong prediction of pK_a values.

For protein systems that exhibit large conformational changes within a narrow pH-range, the currently used rate of pH change (0.02 units per nanosecond) might be too high. We therefore suggest to use smaller rates of change if there is evidence for such motions, e.g. from unusual titration curves or from experimental data.

We always suggest to start the pHtMD at a physiological pH value for which an experimental structure is available. Starting at extreme pH values may work for peptides, which exhibit a nonglobular structure that is rather independent of pH. However, it may fail for globular proteins that frequently undergo unfolding under such conditions. This can be illustrated by considering HdeA, which exists in a monomeric state at pH 2. A pHtMD experiment starting at pH 2 would require to simulate the association of two unfolded monomers into a correctly folded dimer, which is not yet feasible by MD simulations.

A further increase in the accuracy of pHtMD simulations may be achieved by an explicit consideration of the solvent in the future. Explicit solvent has been recently used in conjunction with pH-REMD and resulted in an improved prediction of pK_a values^12,15. Simulations in explicit solvent should also allow studying the role of ions, especially of those ions bound to the protein, or of a membrane environment. The concept of pHtMD is appropriate for a combination with explicit solvent; however, extensive benchmarking will be required to determine suitable simulation parameters.

Furthermore, we assume that the principle of our simulation approach can be easily transferred to other simulation programs and is not restricted to the Amber software suite, because pHtMD is based on the well-established CpHMD approach^6,23,24,25, which is also implemented in other molecular dynamics software packages such as GROMACS²⁶ or CHARMM^27,28.

Methods

Starting structure generation

The structures of the Ace-X-Nme model compounds (X = Asp, Glu, His, Cys, Tyr, Lys) and the Ala-Cys-Phe-Cys-Ala (ACFCA) model peptide were built using the tleap module of Amber 12⁸.

The structures of the proteins investigated were retrieved from the protein data bank²⁹: SNase Δ+PHS (3BDC¹⁶), HEWL (1AKI³⁰), and dimeric HdeA (1BG8³¹). If the terminal residues did not represent the natural termini, an acetyl group (ACE) was added at the N-terminus and/or an N-methylamine group (NME) was added at the C-terminus with PyMOL version 1.5 (The PyMOL Molecular Graphics System, Schrödinger, LLC, http://www.pymol.org/). Titrating Asp, Glu and His residues were renamed to AS4, GL4 and HIP, respectively, to allow for their proper treatment as titratable groups within AMBER. After these preparation steps, missing hydrogens were added, disulfide bonds were defined and the topology file and the coordinate file were created with tleap. Finally, the cpin file was created with the cpinutil.py program from the Amber 12 molecular dynamics package. In the cpin file all possible titrating residues were defined as titrating.

General setup and parameters of the MD simulation

The structure preparation was followed by two rounds of minimization, each comprising 2,500 steps of steepest descent followed by 2,500 steps of conjugate gradient minimization. In the first round, the hydrogen atoms were minimized while all heavy atoms were restrained with a constant force of 10 kcal/(mol∙Å²) to their initial positions. In the second round, no restrains were used to allow minimization of the entire system.

After minimization, the system was gradually heated from 10 K to 310 K within 0.1 ns while the protein was restrained with a constant force of 5 kcal/(mol∙Å²). The constant pH in the implicit solvent method was switched on and the solvent pH was set to a starting pH of either 7 or 8 for the protein simulations. The salt concentration was set to 0.1 M (based on Debye-Hückel), bonds involving hydrogen atoms were constrained with the SHAKE algorithm and a time step of 2 fs was used. Also the Langevin dynamics with a collision frequency of 2 ps⁻¹ was used. The correctness of the protonation state was checked through Monte Carlo sampling of the Boltzmann distribution of protonation states as implemented in AMBER. In the AMBER setup the solution pH is given as an external parameter⁸ and the residues which were set as titrating can change their protonation state, while the protonation states for non-titrating residues are fixed. The different protonation states of those titrating residues were realized by different charge sets and the protonation state changes through the change of these partial charges on the atoms of the titrating residues⁸. Therefore, the titrating residues sample a Boltzmann distribution of protonation states using Monte Carlo and between the Monte Carlo steps the system is simulated with standard molecular dynamics⁸. In our simulation setup, the protonation check was performed at each time step of the molecular dynamics simulation. In the following equilibration phase (0.4 ns) the temperature was maintained at 310 K and the restraints were reduced to the alpha carbon atoms of the protein. All other parameters remained unchanged. In the production stage that was used for pK_a value calculation and analysis of protein dynamics, no restraints were used.

All molecular dynamics simulations were carried out with Amber12 or Amber14³² using the ff99SB³³ force field. For all parts of the simulation the generalized Born solvent model igb = 2 and a cutoff of 30.0 Å were used for non-bonded interactions. In contrast to the simulations of proteins, the simulations of the model compounds were carried out at 300 K instead of 310 K. Intermolecular contacts and solvent accessibility were analyzed using the cpptraj³⁴ module of Amber 12. Visualization was done with VMD 1.9.1³⁵ and plots were created with gnuplot 4.6 (http://www.gnuplot.info/).

Implementation of the pH titrating molecular dynamics (pHtMD) concept and pK_a value calculation

Our concept of pHtMD was implemented in the following fashion: Each protein simulation started at a neutral pH value and lasted for 1 ns. After each nanosecond at constant pH, the pH value of the solvent was slightly lowered or elevated and a further nanosecond was simulated using the final coordinates and velocities from the previous simulation round as an input. Using this procedure, a large number (>100) of consecutive 1-ns MD simulations were performed, slightly changing the solvent pH value in a defined direction, thus resembling a classical wet-lab titration experiment.

In order to determine appropriate rates for the pH change during the pHtMD, model calculations using four different rates were performed: 0.01 pH units/ns, 0.02 pH units/ns, 0.1 pH units/ns and 0.1 pH units/2 ns (data not shown). The rate of 0.02 pH units/ns was selected since it yielded a larger number of data points for pK_a value calculation compared to the faster rates, thereby facilitating the fitting procedure of the titration curves. The slower rate of 0.01 pH units/ns did not result in further improvement and was therefore not used due to its higher computational costs. A rate of 0.02 pH units/ns means that 50 ns of dynamics were calculated to achieve a pH change of one unit resulting in 100–300 ns of simulation time for the systems studied.

Each of the 100–300 pHtMD trajectories of 1 ns length was then used to calculate the pK_a values in the same fashion as described previously for independent CpHMD simulations that were performed for distinct solution pH values¹¹. For each titratable group, the deprotonated fraction (f_deprot) was used to calculate its pK_a value according to equation (1):

Fits according to this equation allow the estimation of pK_a values at the midpoint of titration and of the Hill coefficient n, which describes the cooperativity of various titrating groups³⁶.

Post-processing of the f_deprot values was performed for buried side chains, which frequently exhibit a poor sampling of different protonation states: If the solvent accessible surface area (SASA) of a titratable side chain was <30% of the maximum SASA of this side chain, the respective data point was excluded from further analysis. Reference SASA values were deduced from the MD simulations of the Ace-X-Nme model compounds. The subsequent fitting procedure according to the equation above was then done only with the remaining data points.

To allow for a comparison with one commonly used pK_a predicting method, pK_a values of the side chains of aspartates, glutamates, and histidines were also predicted with PROPKA^37,38 (http://propka.ki.ku.dk/).

Additional Information

How to cite this article: Socher, E. and Sticht, H. Mimicking titration experiments with MD simulations: A protocol for the investigation of pH-dependent effects on proteins. Sci. Rep. 6, 22523; doi: 10.1038/srep22523 (2016).

Change history

03 May 2016
A correction has been published and is appended to both the HTML and PDF versions of this paper. The error has not been fixed in the paper.

References

Hong, W. Wu, Y. E. Fu, X. & Chang, Z. Chaperone-dependent mechanisms for acid resistance in enteric bacteria. Trends Microbiol. 20, 328–335 (2012).
Article CAS Google Scholar
Roche, S. Bressanelli, S. Rey, F. A. & Gaudin, Y. Crystal Structure of the Low-pH Form of the Vesicular Stomatitis Virus Glycoprotein G. Science (Washington, DC, US) 313, 187–191 (2006).
Article CAS ADS Google Scholar
White, J. & Helenius, A. pH-dependent fusion between the Semliki Forest virus membrane and liposomes. Proc. Natl. Acad. Sci. USA 77, 3273–3277 (1980).
Article CAS ADS Google Scholar
Thomson, C. A. & Ananthanarayanan, V. S. Structure-function studies on Hsp47: pH-dependent inhibition of collagen fibril formation in vitro . Biochem. J. 349 Pt 3, 877–883 (2000).
Article CAS Google Scholar
Tournaire-Roux, C. et al. Cytosolic pH regulates root water transport during anoxic stress through gating of aquaporins. Nature (London, UK) 425, 393–397 (2003).
Article CAS ADS Google Scholar
Mongan, J. Case, D. A. & McCammon, J. A. Constant pH Molecular Dynamics in Generalized Born Implicit Solvent. J. Comput. Chem. 25, 2038–2048 (2004).
Article CAS Google Scholar
Mongan, J. & Case, D. A. Biomolecular simulations at constant pH. Curr. Opin. Struct. Biol. 15, 157–163 (2005).
Article CAS Google Scholar
Case, D. A. et al. AMBER 12 (University of California, San Francisco, 2012).
Chu, W.-T. et al. Constant pH molecular dynamics (CpHMD) and molecular docking studies of CquiOBP1 pH-induced ligand releasing mechanism. J. Mol. Model. 19, 1301–1309 (2013).
Article CAS Google Scholar
Vila-Viçosa, D. Teixeira, V. H. Santos, H. A. F. & Machuqueiro, M. Conformational Study of GSH and GSSG Using Constant-pH Molecular Dynamics Simulations. J. Phys. Chem. B 117, 7507–7517 (2013).
Article Google Scholar
Williams, S. L. Blachly, P. G. & McCammon, J. A. Measuring the successes and deficiencies of constant pH molecular dynamics: A blind prediction study. Proteins: Struct. Funct. Bioinf. 79, 3381–3388 (2011).
Article CAS Google Scholar
Swails, J. M. York, D. M. & Roitberg, A. E. Constant pH Replica Exchange Molecular Dynamics in Explicit Solvent Using Discrete Protonation States: Implementation, Testing, and Validation. J. Chem. Theory Comput. 10, 1341–1352 (2014).
Article CAS Google Scholar
Meng, Y. & Roitberg, A. E. Constant pH replica exchange molecular dynamics in biomolecules using a discrete protonation model. J. Chem. Theory Comput. 6, 1401–1412 (2010).
Article CAS Google Scholar
Dashti, D. S. Meng, Y. & Roitberg, A. E. pH-Replica Exchange Molecular Dynamics In Proteins Using A Discrete Protonation Method. J. Phys. Chem. B 116, 8805–8811 (2012).
Article Google Scholar
Lee, J. Miller, B. T. Damjanović, A. & Brooks, B. R. Enhancing Constant-pH Simulation in Explicit Solvent with a Two-Dimensional Replica Exchange Method. J. Chem. Theory Comput. 11, 2560–2574 (2015).
Article CAS Google Scholar
Castañeda, C. A. et al. Molecular determinants of the pKa values of Asp and Glu residues in staphylococcal nuclease. Proteins: Struct. Funct. Bioinf. 77, 570–588 (2009).
Article Google Scholar
Demchuk, E. & Wade, R. C. Improving the Continuum Dielectric Approach to Calculating pKas of Ionizable Groups in Proteins. J. Phys. Chem. 100, 17373–17387 (1996).
Article CAS Google Scholar
Webb, H. et al. Remeasuring HEWL pKa values by NMR spectroscopy: Methods, analysis, accuracy, and implications for theoretical pKa calculations. Proteins: Struct. Funct. Bioinf. 79, 685–702 (2011).
Article CAS Google Scholar
Gajiwala, K. S. & Burley, S. K. HDEA, a Periplasmic Protein that Supports Acid Resistance in Pathogenic Enteric Bacteria. J. Mol. Biol. 295, 605–612 (2000).
Article CAS Google Scholar
Hong, W. et al. Periplasmic Protein HdeA Exhibits Chaperone-like Activity Exclusively within Stomach pH Range by Transforming into Disordered Conformation. J. Biol. Chem. 280, 27029–27034 (2005).
Article CAS Google Scholar
Ahlstrom, L. S. Law, S. M. Dickson, A. & Brooks, Charles L 3rd. Multiscale Modeling of a Conditionally Disordered pH-Sensing Chaperone. J. Mol. Biol. 427, 1670–1680 (2015).
Article CAS Google Scholar
Garrison, M. A. & Crowhurst, K. A. NMR-monitored titration of acid-stress bacterial chaperone HdeA reveals that Asp and Glu charge neutralization produces a loosened dimer structure in preparation for protein unfolding and chaperone activation. Protein Sci. 23, 167–178 (2014).
Article CAS Google Scholar
Baptista, A. M. Martel, P. J. & Petersen, S. B. Simulation of Protein Conformational Freedom as a Function of pH: Constant-pH Molecular Dynamics Using Implicit Titration. Proteins: Struct. Funct. Genet. 27, 523–544 (1997).
Article CAS Google Scholar
Baptista, A. M. Teixeira, V. H. & Soares, C. M. Constant-pH molecular dynamics using stochastic titration. J. Chem. Phys. 117, 4184 (2002).
Article CAS ADS Google Scholar
Lee, M. S. Salsbury, F. R. JR & Brooks, C. L. 3. Constant-pH Molecular Dynamics Using Continuous Titration Coordinates. Proteins: Struct. Funct. Bioinf. 56, 738–752 (2004).
Article CAS Google Scholar
Donnini, S. Tegeler, F. Groenhof, G. & Grubmuller, H. Constant pH Molecular Dynamics in Explicit Solvent with lambda-Dynamics. J. Chem. Theory Comput. 7, 1962–1978 (2011).
Article CAS Google Scholar
Khandogin, J. & Brooks, C. L. 3. Constant pH molecular dynamics with proton tautomerism. Biophys. J. 89, 141–157 (2005).
Article CAS Google Scholar
Khandogin, J. & Brooks, C. L. 3. Toward the Accurate First-Principles Prediction of Ionization Equilibria in Proteins. Biochemistry 45, 9363–9373 (2006).
Article CAS Google Scholar
Berman, H. M. et al. The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000).
Article CAS ADS Google Scholar
Artymiuk, P. J. Blake, C. C. F. Rice, D. W. & Wilson, K. S. The Structures of the Monoclinic and Orthorhombic Forms of Hen Egg-White Lysozyme at 6 Å Resolution. Acta Crystallogr. Sect. B: Struct. Crystallogr. Cryst. Chem. 38, 778–783 (1982).
Article Google Scholar
Yang, F. Gustafson, K. R. Boyd, M. R. & Wlodawer, A. Crystal structure of Escherichia coli HdeA. Nat. Struct. Biol. 5, 763–764 (1998).
Article CAS Google Scholar
Case, D. A. et al. AMBER 14 (University of California, San Francisco, 2014).
Hornak, V. et al. Comparison of Multiple Amber Force Fields and Development of Improved Protein Backbone Parameters. Proteins: Struct. Funct. Bioinf. 65, 712–725 (2006).
Article CAS Google Scholar
Roe, D. R. & Cheatham, T. E., III PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data. J. Chem. Theory Comput. 9, 3084–3095 (2013).
Article CAS Google Scholar
Humphrey, W. Dalke, A. & Schulten, K. VMD: Visual Molecular Dynamics. J. Mol. Graphics 14, 33–38 (1996).
Article CAS Google Scholar
Onufriev, A. Case, D. A. & Ullmann, G. M. A Novel View of pH Titration in Biomolecules. Biochemistry 40, 3413–3419 (2001).
Article CAS Google Scholar
Olsson, M. H. M. Søndergaard, C. R. Rostkowski, M. & Jensen, J. H. PROPKA3: Consistent Treatment of Internal and Surface Residues in Empirical pKa Predictions. J. Chem. Theory Comput. 7, 525–537 (2011).
Article CAS Google Scholar
Søndergaard, C. R. Olsson, M. H. M. Rostkowski, M. & Jensen, J. H. Improved Treatment of Ligands and Coupling Effects in Empirical Calculation and Rationalization of pKa Values. J. Chem. Theory Comput. 7, 2284–2295 (2011).
Article Google Scholar
Bashford, D. Case, D. A. Dalvit, C. Tennant, L. & Wright, P. E. Electrostatic Calculations of Side-Chain pKa Values in Myoglobin and Comparison with NMR Data for Histidines. Biochemistry. Biochemistry 32, 8045–8056 (1993).
Article CAS Google Scholar
McNutt, M. Mullins, L. S. Raushel, F. M. & Pace, C. N. Contribution of Histidine Residues to the Conformational Stability of Ribonuclease T1 and Mutant Glu-58-Ala. Biochemistry 29, 7572–7576 (1990).
Article CAS Google Scholar
Stryer, L. Biochemie. 4th ed. (Spektrum Akad. Verl. Heidelberg [u.a.], 1996).
Fitch, C. A. et al. Experimental pKa Values of Buried Residues: Analysis with Continuum Methods and Role of Water Penetration. Biophys. J. 82, 3289–3304 (2002).
Article CAS ADS Google Scholar

Download references

Acknowledgements

The authors thank the HPC group of the Regional Computing Center Erlangen (RRZE) for computational resources and the Deutsche Forschungsgemeinschaft (SFB796, project A2) for funding. Furthermore, the authors would like to thank Victoria Jackiw (Language Center, Univ. Erlangen-Nürnberg) for reading the manuscript.

Author information

Authors and Affiliations

Division of Bioinformatics, Institute of Biochemistry, Friedrich-Alexander-University Erlangen-Nürnberg (FAU), Fahrstraße 17, Erlangen, 91054, Germany
Eileen Socher & Heinrich Sticht

Authors

Eileen Socher
View author publications
You can also search for this author in PubMed Google Scholar
Heinrich Sticht
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.S. and H.S. conceived the project; E.S. performed the computer simulations and evaluated the data; H.S. supervised the project; E.S. and H.S. wrote the manuscript.

Corresponding author

Correspondence to Heinrich Sticht.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Socher, E., Sticht, H. Mimicking titration experiments with MD simulations: A protocol for the investigation of pH-dependent effects on proteins. Sci Rep 6, 22523 (2016). https://doi.org/10.1038/srep22523

Download citation

Received: 24 September 2015
Accepted: 15 February 2016
Published: 03 March 2016
DOI: https://doi.org/10.1038/srep22523

This article is cited by

A study on the secondary structure of the metalloregulatory protein CueR: effect of pH, metal ions and DNA
- Ria K. Balogh
- Eszter Németh
- Béla Gyurcsik
European Biophysics Journal (2021)
Identification of a unique Ca2+-binding site in rat acid-sensing ion channel 3
- Zhicheng Zuo
- Rachel N. Smith
- Eric B. Gonzales
Nature Communications (2018)
Critical test of isotropic periodic sum techniques with group-based cut-off schemes
- Takuma Nozawa
- Kenji Yasuoka
- Kazuaki Z. Takahashi
Scientific Reports (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.