A quantum mechanical computational method for modeling electrostatic and solvation effects of protein

Wang, Xianwei; Li, Yang; Gao, Ya; Yang, Zejin; Lu, Chenhui; Zhu, Tong

doi:10.1038/s41598-018-23783-8

Download PDF

Article
Open access
Published: 03 April 2018

A quantum mechanical computational method for modeling electrostatic and solvation effects of protein

Xianwei Wang ORCID: orcid.org/0000-0003-4471-426X¹,
Yang Li²,
Ya Gao ORCID: orcid.org/0000-0002-4391-430X³,
Zejin Yang¹,
Chenhui Lu⁴ &
…
Tong Zhu^5,6

Scientific Reports volume 8, Article number: 5475 (2018) Cite this article

5441 Accesses
10 Citations
Metrics details

Subjects

Abstract

An efficient computational approach for modeling protein electrostatic is developed according to static point-charge model distributions based on the linear-scaling EE-GMFCC (electrostatically embedded generalized molecular fractionation with conjugate caps) quantum mechanical (QM) method. In this approach, the Electrostatic-Potential atomic charges are obtained from ab initio calculation of protein, both polarization and charge transfer effect are taken into consideration. This approach shows a significant improvement in the description of electrostatic potential and solvation energy of proteins comparing with current popular molecular mechanics (MM) force fields. Therefore, it has gorgeous prospect in many applications, including accurate calculations of electric field or vibrational Stark spectroscopy in proteins and predicting protein-ligand binding affinity. It can also be applied in QM/MM calculations or electronic embedding method of ONIOM to provide a better electrostatic environment.

A Novel Method for Calculation of Molecular Energies and Charge Distributions by Thermodynamic Formalization

Article Open access 30 December 2019

Non-bonded force field model with advanced restrained electrostatic potential charges (RESP2)

Article Open access 03 April 2020

The Impact of Electron Correlation on Describing QM/MM Interactions in the Attendant Molecular Dynamics Simulations of CO in Myoglobin

Article Open access 22 May 2020

Introduction

Electrostatic interaction plays a central role in many molecular processes in biological molecules^1,2,3,4,5, including protein folding⁶, protein-ligand binding^7,8, protein-protein interaction⁹, electron transfer¹⁰, enzyme reaction^11,12, ion channels^13,14, etc. The molecular electrostatic potential (MEP) has been widely used to characterize inter-and intramolecular electrostatic interactions. A great deal of progress has been made over the past decades in the development of rigorous and practical methods for accurate description of the MEP of proteins^{9,15,16,17,18,19,20,21,22}.

The point charge model adopted in standard MM force field is widely used in simulations of electrostatic properties of proteins, including introducing the solvation effect in simulations of proteins by incorporating with implicit solvent model^23,24, modeling electrostatic potential of protein^9,25, electric field at the active site of protease^26,27,28 and vibrational Stark spectroscopy in protein^29,30, providing the electrostatic environment as background charges in QM/MM calculations^31,32,33 or electronic embedding method of ONIOM^34,35,36, etc. Although MM force fields have made great success in studying thermodynamic and kinetic properties of biomolecules, there are fundamental limitations in their applications. The atomic charges of each kind of amino acid in standard MM fore field are obtained from ab initio QM calculations of small model systems^{21,22,37,38,39}. For instance, the atomic charges used in Amber99SB force field are obtained from gas phase HF/6-31 G* QM calculations of small peptides³⁷. The atomic charges are static and fixed, and therefore do not contain quantum mechanical information (polarization and charge transfer effect) of a particular protein structure. It is well known that the local electrostatic environment inside a folded protein is inhomogeneous due to the specific organization of charged and polar groups. The electron-density distribution of each amino acid in a particular protein electrostatic environment is specific due to polarization and charge transfer effect. Many previous works have demonstrated that the simulations based on these standard MM force fields are incapable of giving a quantitative comparison and interpretation of experimental observables^{9,21,22,26,27,28}.

To overcome the fundamental deficiency of the fixed charge model used in the standard MM force fields and describe electrostatic environment of proteins accurately, many efforts have been made to develop a new generation of polarizable force field^16,40. Introducing excess parameterizations (such as induced dipole, etc.) in current standard MM force fields is a common method^41,42,43. However, parametrization often makes the applications of the polarizable force field much more complicated than that of standard MM force field regarding the accuracy and validity of the underlying theoretical models used to derive polarizable force field remain. To accurately account for polarization and charge transfer effects without introducing excess parameterizations, it is usually necessary to adopt first-principles electronic structure methods. However, it is still not practical to apply standard quantum mechanical methods for describing the full inhomogeneous electrostatic environment of the proteins⁴⁴. The major limitation of QM methods is the scaling problem. The Hartree-Fock (HF) and density functional theory (DFT) scales as O(N³) (N denotes the size of the system). The scaling of post-HF methods is O(N⁵) for second-order Moller-Plesset perturbation theory (MP2) and O(N⁶) for the coupled-cluster (CC) method that includes single and double excitations (CCSD), respectively.

To overcome the scaling limitation of the applications of rigorous electronic-structure methods in large systems, various linear-scaling methods have been developed over the past decade⁴⁵. Among the existing linear-scaling QM approaches, the fragmentation approach is one of the highly efficient and powerful methods. The fragmentation approach is on the basis of the “chemical locality” of most large molecular systems, which assumes that the local region of the large system is only weakly influenced by the atoms that are far away from this region. Based on this chemical intuition, the system is divided into many individual subsystems (fragments) and subsequently the properties of the whole system can be obtained by taking a linear combination of the properties of these fragments. Over the past decade, many fragmentation QM methods have been proposed^46,47, including the fragment molecular orbital (FMO) method⁴⁸, the systematic fragmentation method (SFM)⁴⁹, the molecular tailoring approach (MTA)⁵⁰, the molecular fractionation with conjugate caps (MFCC) method^21,51, the adjustable density matrix assembler (ADMA) method⁵², the electrostatically embedded many-body (EE-MB) expansion approach⁵³, the explicit polarizatioin (X-Pol) potential⁵⁴.

With the goal of obtaining more accurate electronic structure properties of proteins, we have proposed a linear-scaling QM method termed EE-GMFCC (electrostatically embedded generalized molecular fractionation with conjugate caps method)⁵⁵. In the calculations of total energies of proteins, the EE-GMFCC shows only a few kcal/mol deviation from the corresponding full system (FS) results at the levels of the HF, DFT and MP2 method. With respect to expensive conventional QM methods, the EE-GMFCC greatly reduces the computational cost of QM calculations, extending the applicability of the rigorous QM methods to proteins with any number of atoms (up to thousands of atoms or more). The EE-GMFCC method is linear-scaling with a low prefactor. The relative independence of the QM calculations of the fragments in the EE-GMFCC method makes it suitable for implementation of parallelization. The applications of the EE-GMFCC method have been extended to perform structural optimization of proteins⁵⁶ and molecular dynamics simulations with high level ab initio electronic structure theories⁵⁷.

In this paper, based on accurate electronic structure calculations of proteins with the EE-GMFCC method, an electrostatic potential (ESP) atomic charges computational approach was developed and used in the calculations of electrostatic potentials and the solvation energies of proteins. By comparison with the results of FS QM calculations, the capabilities of the new charge model are demonstrated and new physical insights obtained from accurate description of protein electrostatic properties are discussed.

Method

EE-GMFCC method

The EE-GMFCC method is initially developed for calculating protein energy (see refs^51,55. for more details). Here, we just give a brief review. In the framework of EE-GMFCC method, a protein is decomposed into a number of individual fragments in the unit of amino acid by cutting through the peptide bond as illustrated in Fig. 1. A pair of conjugate caps (concaps) is inserted at the cutting location to mimic the local chemical environment of the original protein to the cutoff fragments (see Fig. 1A). Two-body terms for the interaction energy between non-neighboring residues that are close in space are also introduced to capture short-range quantum effect (see Fig. 1B). All the fragment calculations are embedded in the electrostatic field of the point charges representing the remaining atoms in the protein. The point charge model is taken from the Amber94 force field. Hydrogen atoms are used to saturate the dangling bonds. The total energy of a protein (with N amino acids) using the EE-GMFCC method can be expressed as

$$E=\sum _{i=2}^{N-1}\tilde{E}({{\rm{Cap}}}_{i-1}^{\ast }{{\rm{A}}}_{i}{{\rm{Cap}}}_{i+1})-\sum _{i=2}^{N-2}\tilde{E}({{\rm{Cap}}}_{i}^{\ast }{{\rm{Cap}}}_{i+1})-\sum _{\begin{array}{c}i,j > i+2\\ |{R}_{i}-{R}_{j}|\le \lambda \end{array}}({\tilde{E}}_{ij}-{\tilde{E}}_{i}-{\tilde{E}}_{j})-{E}_{{\rm{DC}}}$$

(1)

where i and j represent the residue number and $\tilde{E}$ denotes the sum of the self-energy of the fragment and the interaction energy between the fragment and background charges of the remaining system. ${\tilde{E}}_{ij}-{\tilde{E}}_{i}-{\tilde{E}}_{j}$ represents the two-body QM interaction energy between residues i and j whose closest distance is less than a predefined threshold $\,{\rm{\lambda }}$. ${E}_{{\rm{DC}}}$ is the interaction energy doubly counted in the first three terms of Eq. (1) and is approximated by the pairwise charge-charge interactions. The complete definition of the $\,{E}_{{\rm{DC}}}$ can be found in refs^51,55.

Charge determination

To reproduce MEP as well as possible, the ESP fitting method is employed to determine the atomic charges. Based on the computational scheme of the EE-GMFCC method, the atomic charge of atom k in a protein can be obtained by the following equation:

$${q}_{k}=\sum _{i=2}^{N-1}{q}_{k}({{\rm{Cap}}}_{i-1}^{\ast }{{\rm{A}}}_{i}{{\rm{Cap}}}_{i+1})-\sum _{i=2}^{N-2}{q}_{k}({{\rm{Cap}}}_{i}^{\ast }{{\rm{Cap}}}_{i+1})-\sum _{\begin{array}{c}i,j > i+2\\ |{R}_{i}-{R}_{j}|\le \lambda \end{array}}({q}_{k}({{\rm{A}}}_{i}{{\rm{A}}}_{j})-{q}_{k}({{\rm{A}}}_{i})-{q}_{k}({{\rm{A}}}_{j}))$$

(2)

where $\,{q}_{k}({{\rm{Cap}}}_{i-1}^{\ast }{{\rm{A}}}_{i}{{\rm{Cap}}}_{i+1})$ and ${q}_{k}({{\rm{Cap}}}_{i}^{\ast }{{\rm{Cap}}}_{i+1})$ denote the ESP charge of atom k obtained from the quantum mechanical calculations of the fragment ${{\rm{Cap}}}_{i-1}^{\ast }{{\rm{A}}}_{i}{{\rm{Cap}}}_{i+1}$ and concap ${{\rm{Cap}}}_{i}^{\ast }{{\rm{Cap}}}_{i+1}$ respectively. Since atom k may be assigned to different fragments (there is overlap of neighboring fragments, see Fig. 1) and concaps, ${q}_{k}$ would be counted more than once after sum calculation of the first term of Eq. (2), while the double counting can be just deducted by subtracting the atom k^,s charge in concaps. To avoid introducing unnatural excess charge in the process of charge fitting, the charges of link-atoms (hydrogen atom) of concap are constrained to have the same value with that of corresponding hydrogen atom in the fragment with penalty function in the RESP fitting, e.g., the charge of the hydrogen atom using as a link-atom of ${{\rm{Cap}}}_{i+1}$ in the concap ${{\rm{Cap}}}_{i}^{\ast }{{\rm{Cap}}}_{i+1}$ is constrained to have the same value with the link-atom of ${{\rm{Cap}}}_{i+1}$ in the fragment ${{\rm{Cap}}}_{i-1}^{\ast }{{\rm{A}}}_{i}{{\rm{Cap}}}_{i+1}$. This is essential and makes sense, because the local chemical environment of the corresponding hydrogen atoms in the fragment and concap is similar. The third term in Eq. (2) is used to capture quantum-mechanical two-body effect between nonsequentially connected residues that are spatially close.

The validity of the obtained charges from the EE-GMFCC method (termed EE-GMFCC-CHG) is tested with two protein systems (PDB id: 1BHI and 2KCF) using HF and DFT (M06-2×) methods with 6-31 G* basis set. The geometries of the two proteins were optimized with Amber99SB³⁸ force field using Sander module of the Amber program⁵⁸ in order to remove bad contacts prior to subsequent ab initio calculations. The calculated electrostatic potential using the EE-GMFCC-CHG were compared to that of the FS QM calculations. All ab initio calculations were performed using the Gaussian 09 program⁵⁹.

Solvent effect

In the continuum-solvent model, the solvent is represented as a continuous polarizable medium with dielectric constant ${\rm{\varepsilon }}$ and solute (protein) is encapsulated in a cavity with charge density ${\rm{\rho }}({\bf{r}})$ embedded in the medium. The solute polarizes the surrounding dielectric medium and creates a reaction potential which acts back to polarize the solute until equilibrium is reached. According to the classical electrostatic theory, the reaction potential acting on the solute can be effectively represented by that of induced charges on the surface of the cavity. In the Polarized continuum model (PCM), the solvation effect is modeled by discretizing the induced charges on the cavity surface and iteratively solving the quantum chemistry equation for the solute in the field of surface charges⁶⁰. The PCM is a popular continuum-solvent method for incorporation of solvent effect in quantum mechanical calculations of small molecules. However, although the PCM method was generalized to model the solvation effect of large proteins based on linear-scaling quantum mechanical methods^61,62,63, it still has limitations, e.g., many discrete surface charges will be required in the PCM which makes the solution of linear equation difficult computationally and the effect of ion concentrations is not included in the PCM method. In this work, by combining with continuum-solvent model based on the Poisson-Boltzmann (PB) equation, the EE-GMFCC-CHG will be used to model the electrostatic solvation effects of large proteins. The PB method has two advantages relative to the PCM method in modeling the solvation effects of large proteins. (1) The induced charges are obtained by numerically solving the PB equation which avoids the solution of large linear equations and improves the computation speed. (2) The PB equation incorporates the effect of ion concentrations which gives a better description of the real-environment of proteins.

In combinatorial point-charge fitting approach of EE-GMFCC-CHG and PB equation, partial charges of atoms of the protein generated from the EE-GMFCC quantum mechanical calculations were passed to the PB solver Delphi²⁴ to derive the induced surface charges on the dielectric boundary. The dielectric solute/solvent boundary was defined by Amber van der Waals radii for protein molecule with a probe radius of 1.4 Å. The solvent and internal dielectric constants are set to 80 and unity respectively. The obtained surface charges are added to the background charges to the next EE-GMFCC quantum-chemical calculations to generated new partial charges of the protein. The partial charges of the protein and induced surface charges generated by Delphi polarize each other until converge was reached. This process was iterated until the corrected reaction field energy calculated with Delphi converged and its variations were smaller than a certain criterion. Usually, the criterion was reached within five iterations.

The capability of EE-GMFCC-CHG in predicting the relative electrostatic solvation energy was demonstrated using 20 different conformations of a small protein (PDB id: 2I9M) generated from a 2 ns MD simulation. The MD simulation was performed with Amber99SB force field³⁷ and TIP3P water model to handle the protein and solvent (the specific process of the MD simulation is the same as that in ref.⁶³). The conformations were selected from the trajectory every 100 ps. MD simulations were performed with the Amber 12 program⁵⁸.

Results and Discussion

EE-GMFCC-CHG Reproduces ab initio Electrostatic Potential

Correct description of the electrostatic is vital in accurate prediction of molecular interactions in biological systems. Although the standard non-polarizable force fields (such as Amber, CHARMM, etc.) have achieved successes in simulating many of the macroscopic properties of proteins, it is expected to have difficulties in giving accurate prediction of properties that are more sensitive to the local electrostatic environment. This originates from the fact that the point-charge model of standard force fields is mean-field-like and it does not contain protein-specific quantum mechanical information such as polarization effect, charge transfer effect, etc. The EE-GMFCC-CHG derives from the quantum-chemical calculations that are performed using the EE-GMFCC fragmentation methods. To account for the protein polarization, the calculations of all molecular fragments are in the field created by point charges of the remaining system. By treatment of nonsequentially connected residues that are spatially close with generalized concaps (Gconcaps), polarization and charge transfer effect are accurately included. EE-GMFCC-CHG should reproduce ab initio electrostatics of proteins better than standard MM force field.

We have calculated the electrostatic potential of the two real three-dimensional proteins (PDB id: 1BHI and 2KCF) that contain 591 and 571 atoms respectively. The secondary structure of the protein 1BHI is a mixture of ${\rm{\alpha }}$-helix and ${\rm{\beta }}$-sheet, while 2KCF contains ${\rm{\beta }}$-sheet primarily. Since the electrostatic potential near a molecule are location-dependent, they will become very large near the nuclei in many cases. So the electrostatic potentials at grid points which are from 2.5 to 4.5 Å away from the closest atom in the protein were calculated. The three-dimensional protein structures of 1BHI and 2KCF and the grid points are shown in Fig. 2. The calculated molecular electrostatic potential with the FS QM method is chosen as benchmark and are compared to the corresponding results obtained according to the EE-GMFCC-CHG and Amber99SB force field approach.

The benchmark ab initio calculations are performed at HF/6-31 G* and M06-2×/6-31 G* level of theory respectively. The correlations between the benchmark QM calculations and the EE-GMFCC-CHG and Amber99SB force field approach are shown in Fig. 2. From panels (A) and (C) in Fig. 2, one can see that the calculated electrostatic potential using Amber99SB force field shows large root-mean-square deviation (RMSD) of MEP from FS calculations at the HF/6-31 G* level. While the obtained MEPs based on the EE-GMFCC-CHG are in excellent agreement with the results from FS calculations. There is an order of magnitude improvement in RMSD for the EE-GMFCC-CHG method as compared to Amber99SB force field. These results demonstrate that although the charge model used in the Amber99SB force field was obtained by fitting the gas-phase electrostatic potential of small peptides calculated at the HF/6-31 G* level, it can still not reproduce accurate electrostatic properties of protein due to lack of the polarization and charge transfer effect. Because of introducing polarization and charge transfer effect (including the charge transfer effect of sequentially and nonsequentially connected residues) by rigorous ab initio quantum chemistry calculations in the charge fitting process of EE-GMFCC-CHG, EE-GMFCC-CHG shows a significant improvement in describing the electrostatic environment with respect to Amber99SB force field.

Since the electron correlation is not included in the HF method, the calculated molecular dipole moment with the HF method is usually overestimated. The electrostatic properties of proteins predicted using the HF method are not accurate enough. While the DFT method such as M06-2× functional can give much better prediction in electrostatics of proteins. To test the accuracy of the EE-GMFCC-CHG, the correlation between the calculated MEP based on the EE-GMFCC-CHG and FS QM calculations for the two proteins (1BHI and 2KCF) at the M06-2×/6-31 G* level are plotted in panels (B) and (D) of Fig. 2. Similar to the results based on the HF method, the calculated MEP of the two proteins base on the EE-GMFCC-CHG shows very small RMSD from the corresponding FS QM calculations and also shows about an order of magnitude improvement in RMSD as compared to the results of Amber99SB force field.

To investigate the role in reproducing the ab initio QM electrostatic potential of introducing the two-body effect (quantum mechanical polarization and charge transfer effect) for nonsequentially connected residues that are spatially close, we plot the evolution of RMSD of MEP over the distance threshold ${\rm{\lambda }}$ as reference to the FS QM results in Fig. 3. Figure 3 demonstrates that the RMSD are all close to convergence at $\,{\rm{\lambda }}$ = 4 Å. The closest non-neighboring fragment appears when ${\rm{\lambda }}\,\,$is about 1.7–1.9 Å for the two globular proteins. The RMSDs of MEP based on the EE-GMFCC-CHG at the HF/6-31 G* level are about 2.2 × 10⁻³ au. and 1.5 × 10⁻³ au. (see panel (A) and (C) in Fig. 3 when the distance threshold $\,{\rm{\lambda }}\,\,$is less than 1.7 Å) for 1BHI and 2KCF in the case that the two-body effect is not introduced. Compared with Amber99SB force field, the error is reduced by about 5.7 × 10⁻³ au. (about 72% relative to the result based on Amber99SB force field) and 5.2 × 10⁻³ au. (about 78% relative to the result based on Amber99SB force field) respectively. The RMSDs of MEP using the EE-GMFCC-CHG without introducing two-body effect are also reduced by about 70% for 1BHI and 76% for 2KCH at M06-2×/6-31 G* level. The results indicate that the polarization effect and charge transfer from neighboring residues play a significant role in accurate description of protein electrostatic. From Fig. 3, one can see that the introduction of the two-body effect can reduce the RMSD of MEP obviously. The RMSDs are reduced to 9.4 × 10⁻³ au. (HF) and 1.04 × 10⁻³ au. (M06-2×) for 1BHI and 9.25 × 10⁻³ au. (HF) and 1.08 × 10⁻³ au. (M06-2×) for 2KCF when $\,{\rm{\lambda }}\,\,$is set to 4.0 Å which indicates that the two-body QM correction of vicinal non-neighboring residues is crucial to reproducing accurate electrostatic properties of proteins. It is worth noting that the RMSDs are markedly reduced when distance threshold ${\rm{\lambda }}\,\,$is increased from 1.7 to 2.5 Å. The RMSDs are almost flat when $\,{\rm{\lambda }}\,\,$is increased from 2.7 to 4.0 Å which suggests that the quantum mechanical effect of non-neighboring residues is local and a smaller distance threshold $\,{\rm{\lambda }}\,$(such as 2.7 Å) is still appropriate for the two globular proteins. Adoption of an appropriate distance threshold $\,{\rm{\lambda }}\,\,$for introducing two-body effect in fitting EE-GMFCC-CHG is essential for reducing computational cost.

Electrostatic solvation energy calculation using the EE-GMFCC-CHG

Most of biological processes occur in solution. The solvent effect plays important roles in mediating biological processes such as protein-ligand, protein-protein interaction and protein folding. During folding process, the conformation of a protein changes dramatically from random coil to its functional three-dimensional structure. The interplay between the protein and solvent seriously affects the pathway of protein folding. Accurate calculations of solvent energies of proteins are significant for revealing the roles of solvent effect in these biological processes. The relative electrostatic solvent energies of 20 different conformations of a real protein 2I9M are calculated by combining the EE-GMFCC-CHG with the PB implicit solvent model and compared with the calculated results using the FS ab initio calculations. The solvent effect is introduced by iteratively introduction of the induced surface charges as background charges in the QM calculations. The relative electrostatic solvation energies of 20 conformations of protein 2I9M are shown in Fig. 4. For comparison, the calculated results based on Amber99SB force field are also shown in Fig. 4. Protein 2I9M is a prototype of $\,{\rm{\alpha }}$-helix polypeptide which has been studied in previous theoretical protein folding works⁶. Our 2 ns molecular dynamic (MD) simulation of protein 2I9M with Amber99SB force field shows that its native conformation is not stable and unfolding occurs. The representative three-dimensional structures of three different conformations selected from the MD simulation are presented in Fig. 4. One can see that the conformation of the protein 2I9M changes dramatically in the 2 ns MD simulation. This is because the Amber99SB force field is considered to disfavor the $\,{\rm{\alpha }}$-helix structure. As a result, it is worth studying the free energy profile of protein 2I9M with more sophisticated methods. Figure 4 shows that the electrostatic solvation energies of the 20 different conformations undergo large fluctuation between −530 and −330 kcal/mol. The calculated electrostatic solvation energies with the EE-GMFCC-CHG are in excellent agreement with the FS calculations with RMSD of 1.3 kcal/mol, in contrast, the results obtained from Amber99SB force field show much larger deviations with RMSD of 10.6 kcal/mol which has about 1 order of magnitude larger than that of the EE-GMFCC-CHG. Furthermore, the deviation of calculated absolute electrostatic solvation energy between EE-GMFCC-CHG and FS QM calculations ranges from −0.3 to −5.5 kcal/mol, see Table S1 of the Supporting Information. While the deviation of the results based on Amber99SB force field from FS QM calculations ranges from 56 to 93 kcal/mol. In addition, the electrostatic solvation energies calculated by the EE-GMFCC-CHG are all lower than the FS results. The mean unsigned error (MUE) of the EE-GMFCC-CHG is 2.79 kcal/mol which is much smaller than that of Amber99SB force field. It clearly shows that the errors from Amber99SB force field calculations are significantly larger than those calculated by the EE-GMFCC-CHG. The comparison shows that the including quantum mechanical information is very important for predicting accurate electrostatic solvation energies of proteins.

To further demonstrate the capability of the EE-GMFCC-CHG in reproducing the solvation energy with DFT exchange-correlation functionals, the electrostatic solvation energies of the 20 different conformations of protein 2I9M are calculated using M06-2× method with the 6-31 G* basis set and shown in Fig. 5. One can see that the relative electrostatic energies evaluated using the EE-GMFCC-CHG agree well with that of the FS QM calculations with RMSD of 1.3 kcal/mol. The calculated electrostatic solvation energies of the 20 different conformations of protein 2I9M at M06-2×/6-31 G* level ranges from −310 to −500 kcal/mol which is a little higher than the results obtained at HF/6-31 G* level, see Table S2 of the Supporting Information. The deviation of calculated absolute electrostatic solvation energy between the EE-GMFCC-CHG and the FS QM calculations is also small which ranges from −0.3 to −4.7 kcal/mol. Similar to the results of the HF method, the electrostatic solvation energies calculated by the EE-GMFCC-CHG with DFT (M06-2×/6-31 G*) method are also all lower than the FS results. The results demonstrate that EE-GMFCC-CHG could reproduce the solvation energy of the protein well with both HF and DFT method.

Conclusions

In this work, we developed a charge model termed EE-GMFCC-CHG for accurately modeling the molecular electrostatic potential of proteins. The EE-GMFCC-CHG is obtained by fitting the ESP calculated from accurate electronic structure calculation of protein. Therefore, it contains almost all quantum effects (the polarization and charge transfer effects) of a specific structure of a protein. The EE-GMFCC method is computationally efficient and linear-scaling. The individual QM calculations of all fragments can be carried out in parallel. In reproducing MEP of protein, EE-GMFCC-CHG gives an excellent agreement with full system ab initio QM method and shows a significant improvement relative to popular MM force field. By analysis of RMSDs (reference to the full system QM results) of the MEP calculated from the EE-GMFCC-CHG with different two-body distance threshold $\,{\rm{\lambda }}$, we have shown that accounting for the polarization and charge transfer effect over the sequently-neighboring residues is very important for accurately reproducing the ab initio QM electrostatic potential of proteins, following by the quantum mechanical two-body effects between the non-neighboring residues in close contact.

By combining the EE-GMFCC-CHG with the implicit water model based on the PB equation, we developed a quantum chemical method for modeling the solvation effect of proteins. With respect to popular MM force fields, the EE-GMFCC-CHG-PB method could take the polarization effect between solute and solvent into consideration by iteratively introducing protein surface induced charges in the fitting process and therefore it shows a significant improvement in the description of electrostatic solvation energetics of proteins. The error of EE-GMFCC-CHG in modeling relative/absolute electrostatic solvation energies of protein is very small with reference to the full system QM calculation. The EE-GMFCC-CHG-PB method will thus be a useful tool for modeling electrostatic solvation energetics of solvated proteins.

References

Perutz, M. Electrostatic effects in proteins. Science 201, 1187–1191 (1978).
Article ADS CAS PubMed Google Scholar
Štrajbl, M., Shurki, A. & Warshel, A. Converting conformational changes to electrostatic energy in molecular motors: The energetics of ATP synthase. Proc. Natl. Acad. Sci. USA 100, 14834–14839 (2003).
Article ADS PubMed PubMed Central Google Scholar
Warshel, A. & Russell, S. T. Calculations of electrostatic interactions in biological systems and in solutions. Q. Rev. Biophys. 17, 283–422 (1984).
Article CAS PubMed Google Scholar
Honig, B. & Nicholls, A. Classical electrostatics in biology and chemistry. Science-New York Then Washington-, 1144–1144 (1995).
Duan, Y. & Kollman, P. A. Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. Science 282, 740–744 (1998).
Article ADS CAS PubMed Google Scholar
Duan, L. L., Mei, Y., Zhang, D., Zhang, Q. G. & Zhang, J. Z. Folding of a helix at room temperature is critically aided by electrostatic polarization of intraprotein hydrogen bonds. J. Am. Chem. Soc. 132, 11159–11164 (2010).
Article CAS PubMed Google Scholar
Cho, A. E., Guallar, V., Berne, B. J. & Friesner, R. Importance of accurate charges in molecular docking: quantum mechanical/molecular mechanical (QM/MM) approach. J. Comput. Chem. 26, 915–931 (2005).
Article CAS PubMed PubMed Central Google Scholar
Gräter, F., Schwarzl, S. M., Dejaegere, A., Fischer, S. & Smith, J. C. Protein/ligand binding free energies calculated with quantum mechanics/molecular mechanics. J. Phys. Chem. B 109, 10474–10483 (2005).
Article PubMed Google Scholar
Gascon, J. A., Leung, S. S., Batista, E. R. & Batista, V. S. A self-consistent space-domain decomposition method for QM/MM computations of protein electrostatic potentials. J. Chem. Theor. Comput. 2, 175–186 (2006).
Article CAS Google Scholar
Gunner, M., Nicholls, A. & Honig, B. Electrostatic potentials in Rhodopseudomonas viridis reaction centers: implications for the driving force and directionality of electron transfer. J. Phys. Chem. 100, 4277–4291 (1996).
Article CAS Google Scholar
Warshel, A. et al. Electrostatic basis for enzyme catalysis. Chem. Rev. 106, 3210–3235 (2006).
Article CAS PubMed Google Scholar
Fried, S. D., Bagchi, S. & Boxer, S. G. Extreme electric fields power catalysis in the active site of ketosteroid isomerase. Science 346, 1510–1514 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Aqvist, J. & Luzhkov, V. Ion permeation mechanism of the potassium channel. Nature 404, 881 (2000).
Article ADS CAS PubMed Google Scholar
Bliznyuk, A. A., Rendell, A. P., Allen, T. W. & Chung, S.-H. The potassium ion channel: Comparison of linear scaling semiempirical and molecular mechanics representations of the electrostatic potential. J. Phys. Chem. B 105, 12674–12679 (2001).
Article CAS Google Scholar
van der Vaart, A., Bursulaya, B. D., Brooks, C. L. & Merz, K. M. Are many-body effects important in protein folding? J. Phys. Chem. B 104, 9554–9563 (2000).
Article Google Scholar
Halgren, T. A. & Damm, W. Polarizable force fields. Curr. Opin. Struct. Biol. 11, 236–242 (2001).
Article CAS PubMed Google Scholar
Roux, B. & Bernèche, S. On the potential functions used in molecular dynamics simulations of ion channels. Biophys. J. 82, 1681 (2002).
Article CAS PubMed PubMed Central Google Scholar
Rick, S. W. & Stuart, S. J. Potentials and algorithms for incorporating polarizability in computer simulations. Reviews in computational chemistry 18, 89–146 (2002).
CAS Google Scholar
Ponder, J. W. & Case, D. A. Force fields for protein simulations. Advances in protein chemistry 66, 27–85 (2003).
Article CAS PubMed Google Scholar
Cieplak, P., Caldwell, J. & Kollman, P. Molecular mechanical models for organic and biological systems going beyond the atom centered two body additive approximation: aqueous solution free energies of methanol and N‐methyl acetamide, nucleic acid base, and amide hydrogen bonding and chloroform/water partition coefficients of the nucleic acid bases. J. Comput. Chem. 22, 1048–1057 (2001).
Article CAS Google Scholar
Ji, C. & Mei, Y. Some practical approaches to treating electrostatic polarization of proteins. Accounts. Chem. Res. 47, 2795–2803 (2014).
Article CAS Google Scholar
Ji, C., Mei, Y. & Zhang, J. Z. Developing polarized protein-specific charges for protein dynamics: MD free energy calculation of pK a shifts for Asp 26/Asp 20 in Thioredoxin. Biophys. J. 95, 1080–1088 (2008).
Article CAS PubMed PubMed Central Google Scholar
Sharp, K. A. & Honig, B. Electrostatic interactions in macromolecules: theory and applications. Annu. Rev. Biophys. Biophys. Chem. 19, 301–332 (1990).
Article CAS PubMed Google Scholar
Rocchia, W. et al. Rapid grid‐based construction of the molecular surface and the use of induced surface charge to calculate reaction field energies: Applications to the molecular systems and geometric objects. J. Comput. Chem. 23, 128–137 (2002).
Article CAS PubMed Google Scholar
Leverentz, H. R., Maerzke, K. A., Keasler, S. J., Siepmann, J. I. & Truhlar, D. G. Electrostatically embedded many-body method for dipole moments, partial atomic charges, and charge transfer. Phys. Chem. Chem. Phys. 14, 7669–7678 (2012).
Article CAS PubMed Google Scholar
Wang, X., He, X. & H., Z. J. Z. Predicting mutation-induced Stark shifts in the active site of a protein with a polarized force field. J. Phys. Chem. A 117, 6015–6023 (2013).
Article CAS PubMed Google Scholar
Wang, X. & Zhang, J. Z. H. & X., H. Quantum mechanical calculation of electric fields and vibrational Stark shifts at active site of human aldose reductase. J. Chem. Phys. 143, 184111 (2015).
Article ADS PubMed Google Scholar
Wang, X., He, X. & Zhang, J. Chapter Three-Accurate Calculation of Electric Fields Inside Enzymes. Methods Enzymol. 578, 45–72 (2016).
Article CAS PubMed Google Scholar
Kraft, M. L., Weber, P. K., Longo, M. L., Hutcheon, I. D. & Boxer, S. G. Phase separation of lipid membranes analyzed with high-resolution secondary ion mass spectrometry. Science 313, 1948–1951 (2006).
Article ADS CAS PubMed Google Scholar
Fried, S. D., Bagchi, S. & Boxer, S. G. Measuring electrostatic fields in both hydrogen-bonding and non-hydrogen-bonding environments using carbonyl vibrational probes. J. Am. Chem. Soc. 135, 11181–11192 (2013).
Article CAS PubMed PubMed Central Google Scholar
Senn, H. M. & Thiel, W. QM/MM methods for biomolecular systems. Angewandte Chemie International Edition 48, 1198–1229 (2009).
Article CAS PubMed Google Scholar
Gao, J. & Xia, X. A priori evaluation of aqueous polarization effects through Monte Carlo QM-MM simulations. Science-New York Then Washington-, 631-631 (1992).
Gao, J., Amara, P., Alhambra, C. & Field, M. J. A generalized hybrid orbital (GHO) method for the treatment of boundary atoms in combined QM/MM calculations. J. Phys. Chem. A 102, 4714–4721 (1998).
Article CAS Google Scholar
Vreven, T., Morokuma, K., Farkas, Ö., Schlegel, H. B. & Frisch, M. J. Geometry optimization with QM/MM, ONIOM, and other combined methods. I. Microiterations and constraints. J. Comput. Chem. 24, 760–769 (2003).
Article CAS PubMed Google Scholar
Vreven, T. et al. Combining quantum mechanics methods with molecular mechanics methods in ONIOM. J. Chem. Theor. Comput. 2, 815–826 (2006).
Article CAS Google Scholar
Chung, L. W. et al. The ONIOM method and its applications. Chem. Rev. 115, 5678–5796 (2015).
Article CAS PubMed Google Scholar
Bayly, C. I., Cieplak, P., Cornell, W. & Kollman, P. A. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges: the RESP model. J. Phys. Chem. 97, 10269–10280 (1993).
Article CAS Google Scholar
Cornell, W. D. et al. A second generation force field for the simulation of proteins, nucleic acids, and organic molecules. J. Am. Chem. Soc. 117, 5179–5197 (1995).
Article CAS Google Scholar
Duan, Y. et al. A point‐charge force field for molecular mechanics simulations of proteins based on condensed‐phase quantum mechanical calculations. J. Comput. Chem. 24, 1999–2012 (2003).
Article CAS PubMed Google Scholar
Kaminski, G. A. et al. Development of a polarizable force field for proteins via ab initio quantum chemistry: first generation model and gas phase tests. J. Comput. Chem. 23, 1515–1531 (2002).
Article CAS PubMed PubMed Central Google Scholar
Wang, J. et al. Development of polarizable models for molecular mechanical calculations II: induced dipole models significantly improve accuracy of intermolecular interaction energies. J. Phys. Chem. B 115, 3100–3111 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ren, P. & Ponder, J. W. Polarizable atomic multipole water model for molecular mechanics simulation. J. Phys. Chem. B 107, 5933–5947 (2003).
Article CAS Google Scholar
Ren, P. & Ponder, J. W. Temperature and pressure dependence of the AMOEBA water model. J. Phys. Chem. B 108, 13427–13437 (2004).
Article CAS Google Scholar
Szabo, A. & Ostlund, N. S. Modern quantum chemistry: introduction to advanced electronic structure theory. (Courier Corporation, 2012).
Gordon, M. S., Fedorov, D. G., Pruitt, S. R. & Slipchenko, L. V. Fragmentation methods: a route to accurate calculations on large systems. Chem. Rev 112, 632–672 (2012).
Article CAS PubMed Google Scholar
Raghavachari, K. & Saha, A. Accurate composite and fragment-based quantum chemical models for large molecules. Chem. Rev. 115, 5643–5677 (2015).
Article CAS PubMed Google Scholar
Collins, M. A. & Bettens, R. P. Energy-based molecular fragmentation methods. Chem. Rev. 115, 5607–5642 (2015).
Article CAS PubMed Google Scholar
Fedorov, D. G., Asada, N., Nakanishi, I. & Kitaura, K. The use of many-body expansions and geometry optimizations in fragment-based methods. Accounts. Chem. Res. 47, 2846–2856 (2014).
Article CAS Google Scholar
Collins, M. A., Cvitkovic, M. W. & Bettens, R. P. The combined fragmentation and systematic molecular fragmentation methods. Accounts. Chem. Res. 47, 2776–2785 (2014).
Article CAS Google Scholar
Sahu, N. & Gadre, S. R. Molecular tailoring approach: a route for ab initio treatment of large clusters. Accounts. Chem. Res. 47, 2739–2747 (2014).
Article CAS Google Scholar
He, X., Zhu, T., Wang, X., Liu, J. & Zhang, J. Z. Fragment quantum mechanical calculation of proteins and its applications. Accounts. Chem. Res. 47, 2748–2757 (2014).
Article CAS Google Scholar
Mezey, P. G. Fuzzy Electron Density Fragments in Macromolecular Quantum Chemistry, Combinatorial Quantum Chemistry, Functional GroupAnalysis, and Shape–Activity Relations. Accounts. Chem. Res. 47, 2821–2827 (2014).
Article CAS Google Scholar
Wang, B. et al. Quantum mechanical fragment methods based on partitioning atoms or partitioning coordinates. Accounts. Chem. Res. 47, 2731–2738 (2014).
Article CAS Google Scholar
Gao, J. et al. Explicit polarization: A quantum mechanical framework for developing next generation force fields. Accounts. Chem. Res. 47, 2837–2845 (2014).
Article CAS Google Scholar
Wang, X., Liu, J., Zhang, J. Z. & He, X. Electrostatically embedded generalized molecular fractionation with conjugate caps method for full quantum mechanical calculation of protein energy. J. Phys. Chem. A 117, 7149–7161 (2013).
Article CAS PubMed Google Scholar
Liu, J., Zhang, J. Z. & He, X. Fragment quantum chemical approach to geometry optimization and vibrational spectrum calculation of proteins. Phys. Chem. Chem. Phys. 18, 1864–1875 (2016).
Article CAS PubMed Google Scholar
Liu, J., Zhu, T., Wang, X., He, X. & Zhang, J. Z. Quantum fragment based ab initio molecular dynamics for proteins. J. Chem. Theor. Comput. 11, 5897–5905 (2015).
Article CAS Google Scholar
Case, D. A. et al. The Amber biomolecular simulation programs. J. Comput. Chem. 26, 1668–1688 (2005).
Article CAS PubMed PubMed Central Google Scholar
Frisch, M. J. et al. Gaussian 09, revision B.01; Gaussian, Inc.: Wallingford, CT, (2010).
Miertuš, S., Scrocco, E. & Tomasi, J. Electrostatic interaction of a solute with a continuum. A direct utilizaion of ab initio molecular potentials for the prevision of solvent effects. Chem. phys. 55, 117–129 (1981).
Article ADS Google Scholar
Fedorov, D. G., Kitaura, K., Li, H., Jensen, J. H. & Gordon, M. S. The polarizable continuum model (PCM) interfaced with the fragment molecular orbital method (FMO). J. Comput. Chem. 27, 976–985 (2006).
Article CAS PubMed Google Scholar
Mei, Y., Ji, C. & Zhang, J. Z. A new quantum method for electrostatic solvation energy of protein. J. Chem. Phys. 125, 094906 (2006).
Article ADS PubMed Google Scholar
Jia, X. et al. An improved fragment-based quantum mechanical method for calculation of electrostatic solvation energy of proteins. J. Chem. Phys. 139, 12B604_601 (2013).
Article Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Grants No. 21703206, 11547164), Natural Science Foundation of Shandong (Grant No. ZR2016BB13), Natural Science Foundation of Zhejiang (Grant No. LY17B030008).

Author information

Authors and Affiliations

College of Science, Zhejiang University of Technology, Hangzhou, Zhejiang, 310023, China
Xianwei Wang & Zejin Yang
School of Information Science and Engineering, Shandong Agricultural University, Taian, 271018, China
Yang Li
College of Fundamental Studies, Shanghai University of Engineering Science, Shanghai, 201620, China
Ya Gao
College of Mechanical Engineering, Shanghai University of Engineering Science, Shanghai, 201620, China
Chenhui Lu
College of Chemistry and Molecular Engineering, East China Normal University, Shanghai, 200062, China
Tong Zhu
YU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, 200062, China
Tong Zhu

Authors

Xianwei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yang Li
View author publications
You can also search for this author in PubMed Google Scholar
Ya Gao
View author publications
You can also search for this author in PubMed Google Scholar
Zejin Yang
View author publications
You can also search for this author in PubMed Google Scholar
Chenhui Lu
View author publications
You can also search for this author in PubMed Google Scholar
Tong Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

In this works, Xianwei Wang prepared all the figures and wrote the main manuscript text. Tong Zhu provided ideas. Yang Li, Ya Gao, Chenhui Lu and Zejin Yang helped with data analysis and all authors reviewed the manuscript. These authors contributed equally to this work.

Corresponding authors

Correspondence to Xianwei Wang or Tong Zhu.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, X., Li, Y., Gao, Y. et al. A quantum mechanical computational method for modeling electrostatic and solvation effects of protein. Sci Rep 8, 5475 (2018). https://doi.org/10.1038/s41598-018-23783-8

Download citation

Received: 04 January 2018
Accepted: 19 March 2018
Published: 03 April 2018
DOI: https://doi.org/10.1038/s41598-018-23783-8

This article is cited by

The Impact of Electron Correlation on Describing QM/MM Interactions in the Attendant Molecular Dynamics Simulations of CO in Myoglobin
- Xianwei Wang
- Chenhui Lu
- Maoyou Yang
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.