A new structural arrangement in proteins involving lysine NH₃⁺ group and carbonyl

Rogacheva, Olga N.; Izmailov, Sergei A.; Slipchenko, Lyudmila V.; Skrynnikov, Nikolai R.

doi:10.1038/s41598-017-16584-y

Published: 27 November 2017

Scientific Reports volume 7, Article number: 16402 (2017)

Abstract

Screening of the Protein Data Bank led to the identification of a recurring structural motif in which the lysine NH₃⁺ group interacts with a backbone carbonyl. This interaction is characterized by a linear atom arrangement, with the carbonyl O atom positioned on the three-fold symmetry axis of the NH₃⁺ group (angle C^ε-N^ζ-O close to 180°, distance N^ζ-O ca. 2.7-3.0 Å). Typically, this linear arrangement coexists with three regular hydrogen bonds formed by the lysine NH₃⁺ group (angle C^ε-N^ζ-acceptor atom close to 109°, distance N^ζ-acceptor atom ca. 2.7-3.0 Å). Our DFT calculations using a polarizable continuum environment suggest that this newly identified linear interaction makes an appreciable contribution to the protein’s energy balance, up to 2 kcal/mol. In the context of protein structure, linear interactions play a role in capping the C-termini of α-helices and 3₁₀-helices. Of note, a linear interaction involving a conserved lysine is consistently found in the P-loop of numerous NTPase domains, where it stabilizes the substrate-binding conformation of the P-loop. The linear NH₃⁺ – carbonyl interaction is an interesting example of an ion-dipole interaction, a class that has so far received little attention compared to ion-ion interactions (salt bridges) and dipole-dipole interactions (hydrogen bonds), but that nevertheless represents a distinctive element of protein architecture.

Introduction

Regular hydrogen bonds between lysine charged ε-ammonium group and uncharged acceptors, as well as salt bridges between lysine and anionic residues, are well documented in the literature^1,2,3,4. Yet inspection of the Protein Data Bank (PDB) finds many examples of an unusual interaction between lysine NH₃ ⁺ and backbone carbonyl group, which is clearly different from a hydrogen bond. One example of such interaction is illustrated in Fig. 1A. Characteristically, carbonyl oxygen atom is positioned on the symmetry axis of NH₃ ⁺ group, equidistant from the three ammonium protons. The angle C^ε-N^ζ-O in this arrangement is close to 180°, in contrast to hydrogen bonds where the corresponding angle is near 109°. Note also that this linear arrangement co-exists with three conventional hydrogen bonds centered on NH₃ ⁺ group and directed toward other protein sites or toward crystallographic water (indicated by dashed lines in Fig. 1A).

In order to determine how typical this arrangement is, we have screened the Protein Data Bank for NH₃⁺ – carbonyl pairs. Only high- and medium-resolution x-ray structures have been included in the analysis, resulting in a set of 47,388 unique structures (see Methods). The statistics are presented in Fig. 1B in the form of a map showing the density of states as a function of the N^ζ-O distance (r) and the C^ε-N^ζ-O angle (θ). In this map we observe the dominant cluster, corresponding to a regular NH₃⁺ – carbonyl hydrogen bond, which is centered at 2.8 Å, ca. 109° (cluster 1). At the same time we observe a distinct additional cluster (cluster 2), which corresponds to the configurations illustrated in Fig. 1A, with θ approaching 180° and a distance of ca. 2.9 Å (essentially the same distance as in hydrogen bonds). The number of such linear configurations is significant: within the red contour line delimiting cluster 2 we find 11,719 examples. For comparison, cluster 1 contains 208,708 instances of hydrogen bonds originating on lysine NH₃⁺ groups. To put these numbers into perspective, bear in mind that each NH₃⁺ group can simultaneously form three hydrogen bonds, but only one linear interaction with a carbonyl.
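As a minimal sketch, the two map coordinates (r, θ) can be computed from the Cartesian positions of the C^ε, N^ζ and O atoms as shown below. The function names and the cut-offs in `rough_label` are illustrative assumptions; they are not the contour lines that delimit clusters 1 and 2 in Fig. 1B.

```python
import math

def angle_deg(a, b, c):
    """Return the angle a-b-c in degrees, with the vertex at b."""
    v1 = [a[i] - b[i] for i in range(3)]
    v2 = [c[i] - b[i] for i in range(3)]
    dot = sum(x * y for x, y in zip(v1, v2))
    norm = math.hypot(*v1) * math.hypot(*v2)
    # Clamp to [-1, 1] to guard against rounding error before acos.
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))

def r_theta(ce, nz, o):
    """Return (r, theta): the NZ-O distance and the CE-NZ-O angle."""
    return math.dist(nz, o), angle_deg(ce, nz, o)

def rough_label(r, theta, r_max=3.2, theta_linear=150.0):
    """Crude label using hypothetical cut-offs (assumed for illustration)."""
    if r > r_max:
        return "no contact"
    return "linear-like" if theta >= theta_linear else "hydrogen-bond-like"
```

Applied to every lysine N^ζ and carbonyl O pair in a structure, `r_theta` yields the points from which a density map of this kind could be accumulated.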

While PDB contains multiple examples of the linear arrangement involving NH₃ ⁺ group and carbonyl, it is also desirable to present “true negative” controls. To this end, we have compiled analogous (r, θ) density maps for the pairs (i) NH₃ ⁺ – backbone amide NH and (ii) NH₃ ⁺ – side-chain methylene groups. The results clearly demonstrate that the cluster 2, corresponding to the linear interaction, is non-existent in both cases, see Fig. S2. This observation confirms that the linear placement of NH₃ ⁺ and carbonyl is not an energy-neutral artefact of protein packing, but rather a meaningful interaction.

In order to characterize this newly observed linear interaction between NH₃ ⁺ and carbonyl group from an energetic standpoint we turned to density functional theory (DFT) calculations. A 2-molecule model system consisting of methylammonium ion and N-methylacetamide was constructed toward this goal (see Fig. 2H). The calculations were conducted using the program Q-Chem 4.3⁵ in conductor-like polarizable continuum water solvent C-PCM⁶ (see Methods for details). The relaxed potential energy scans have been performed over the parameter space (r, θ). The obtained two-dimensional energy map is shown in Fig. 2A; potential energy slices through the point corresponding to the geometry of interest, r = 2.7 Å, θ = 180°, are shown in Fig. 2B,C.

As can be seen from Fig. 2A, the linear geometry corresponds to a saddle point on the potential energy surface. The linear interaction (Fig. 2H) offers 2 kcal/mol stabilizing energy, whereas the hydrogen bond (Fig. 2G) offers ca. 5.5 kcal/mol. Figure 2C illustrates the scenario where the linear geometry is smoothly transformed into the hydrogen-bonded geometry while maintaining the overall planar arrangement. The same effect can be achieved by performing the rotation in an orthogonal direction; we have investigated this case separately and found that the dependence of energy on θ remains similar (see Fig. S3a). Other degrees of freedom also proved to be largely inconsequential. For instance, rotating N-methylacetamide about the pivot on O atom does not change the energy, see Fig. S3c,d. We conclude that the linear interaction NH₃ ⁺ – carbonyl is essentially fully parameterized by the two coordinates, r and θ, and its energetics is nearly completely characterized by the data in Fig. 2A–C. Of note, standard hydrogen bond shows a significant dependence on the donor-acceptor distance as well as two characteristic angles⁷, i.e. has more sophisticated directionality properties than the newly described linear interaction.

According to Fig. 2C, the energy difference between the NH₃ ⁺ – carbonyl hydrogen bond (stronger interaction) and NH₃ ⁺ – carbonyl linear arrangement (weaker interaction) amounts to ca. 3.5 kcal/mol. On the other hand, comparing the populations of the two respective clusters in Fig. 1B points toward the free energy difference of 1.7 kcal/mol (if one assumes that PDB coordinates are representative of the structures at physiological temperature). Given that the data in Fig. 2C pertain to the simple model system, whereas the data in Fig. 1B are characteristic of the elaborate protein architectures, one should not expect a quantitative agreement between the above two estimates. In fact, it is rather satisfying that the two values are qualitatively consistent with each other.
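The population-based estimate can be checked approximately with a two-state Boltzmann relation, ΔG = −RT ln(N₂/N₁), using the cluster counts quoted above. This is only a sketch: the temperature of 298.15 K and the use of raw counts without any further correction are assumptions, since the exact procedure is not spelled out in the text.

```python
import math

R_KCAL = 0.0019872041  # gas constant in kcal/(mol K)

def free_energy_difference(n_minor, n_major, temperature_k=298.15):
    """Free energy of the minor state relative to the major one, in kcal/mol."""
    return -R_KCAL * temperature_k * math.log(n_minor / n_major)

# Cluster 2 (linear) versus cluster 1 (hydrogen bond), counts from Fig. 1B.
dg = free_energy_difference(11719, 208708)
print(f"Estimated free energy difference: {dg:.2f} kcal/mol")
```

With these inputs the result falls close to the 1.7 kcal/mol quoted in the text; raising the assumed temperature to 310 K changes it only slightly.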

Now recall that in the context of protein structure linear interaction NH₃ ⁺ – carbonyl does not usually occur in isolation, but rather in combination with regular hydrogen bonds (see Fig. 1A). Specifically, among all examples of this linear interaction in the PDB-extracted set of high-quality protein structures, 61.4% of geometries feature three or more hydrogen bonds centered on the NH₃ ⁺ group (this statistic includes bifurcated hydrogen bonds). Additionally, 20.5% of geometries have two hydrogen bonds. Apparently, the truly stable configuration involves NH₃ ⁺ group engaged in three hydrogen bonds plus a linear interaction with an eligible carbonyl group, as illustrated in Fig. 1A. In this manner lysine ε-ammonium group fully utilizes its “bonding” potential. However, we may not always see all three hydrogen-bonded partners in the crystal structures – for example, if they include water molecules that have not been resolved in crystallographic model. Furthermore, in some cases lysine NH₃ ⁺ groups form hydrogen bonds with symmetry-related molecules in the crystal lattice (not accounted for in our analyses). Alternatively, steric constraints encountered in the structure may prevent one or two hydrogen bonds from forming. While acknowledging such “incomplete” arrangements, we focus on the “complete” configuration such as shown in Fig. 1A.

To investigate such a more extended system, we have constructed a 5-molecule model based on the coordinates of the crystal structure 4RLZ, see Fig. 1A. This model includes one water and two formamide molecules, imitating the hydrogen-bonded partners of the lysine NH₃⁺ group. Instead of N-methylacetamide, which was used above to imitate a peptide plane, we now use a smaller formamide molecule. This is necessary because the bulkier N-methylacetamide tends to pack against the hydrogen-bonded ligands, which to some degree obscures the effect of the linear interaction. Using this PDB-extracted geometry as a starting point, we have conducted energy scans similar to the ones previously carried out for the 2-molecule model. In doing so, we have fixed the coordinates of the methylammonium ion and its three hydrogen-bonded ligands, while the formamide molecule responsible for the linear interaction has been moved by varying r and θ. Hence, we model the hydrogen-bonded Lys side chain as a part of the static protein matrix and treat its linear interaction with the carbonyl as a weak perturbation.

The results of the energy calculations using 5-molecule model are summarized in Fig. 2D–F. Obviously, the energy surface representative of the linear interaction has changed from the saddle to the well, see Fig. 2D. This is understandable since the three hydrogen-bonding sites are now all occupied and the linear arrangement is the only energetically favorable arrangement that remains available to the fourth NH₃ ⁺ ligand. Importantly, the “energy well” seen in Fig. 2D broadly corresponds to the cluster 2 in the PDB-based density map Fig. 1B.

It is worth noting that the favorable energetic effect of the linear interaction is similar to the one obtained for the 2-molecule model, cf. Fig. 2B and Fig. 2E. In other words, adding three hydrogen bonds to the model has little effect on the energy of the linear interaction. More accurately, the energy of the linear interaction has been slightly attenuated, 1.8 vs. 2.0 kcal/mol. In principle, this is a reasonable outcome given that hydrogen bonds and linear interactions are mainly electrostatic in nature, see below, and therefore are expected to compete against each other (it is clear, however, that the small energy difference of 0.2 kcal/mol should not be overinterpreted considering the different makeup of the two models). We conclude that 2 kcal/mol is a sound estimate of the energy associated with the linear interaction, including the situation where NH₃⁺ is hydrogen-bonded. Bear in mind, however, that this result does not directly reflect the contribution of the linear interaction to protein stability. In order to evaluate such a contribution, one would need to take into consideration the lysine desolvation penalty, as well as potential transient contacts made by the NH₃⁺ group in the disordered protein state, which is generally far from trivial^8,9.

In essence, lysine NH₃ ⁺ group, which makes three hydrogen bonds, can further improve its favorable binding energy by forming an axial interaction with a carbonyl group. This additional interaction is worth ca. one half of the hydrogen-bond energy. As such it is a meaningful complement to the lysine side-chain energy balance. The Protein Data Bank contains many examples of this “3 + 1” interaction scheme; the geometries found in the PDB are consistent with the energy minimum predicted in our model calculations (cf. blue vertical lines in Fig. 2E,F).

One important question concerns the nature of the linear interaction – is it mainly electrostatic, or does it have a partial covalent character, similar to hydrogen bonds? To address this question we have used the Natural Bond Orbital (NBO) analysis¹⁰. The calculations were conducted for geometry-optimized 2-molecule system methylammonium ion – formamide, and also for the formamide dimer that was employed as a model for the conventional hydrogen bond. As it turns out, linear interaction indeed has partially covalent character – mainly due to the donor-acceptor interactions involving lone electron pairs of the carbonyl oxygen and the antibonding orbitals associated with C^ε-N^ζ, N^ζ-H^ζ1, N^ζ-H^ζ2, and N^ζ-H^ζ3 bonds. However, the corresponding NBO stabilization energies are very small, an order of magnitude smaller than the corresponding values for the standard hydrogen bond (see Table S2 and Fig. S4). Therefore we conclude that the linear interaction has only minimal covalent character, but rather is dominated by electrostatics.

An interesting additional insight can be obtained from the analyses of NMR nuclear spin-spin couplings, which may exist despite the very weak covalency of the linear interaction^11,12. Specifically, we are interested in “through-space” coupling between ¹⁵N^ζ and carbonyl ¹³C’. The DFT calculations using 2-molecule model predict the coupling constant of −1.4 Hz for the system in vacuum and −0.7 Hz for the system in aqueous PCM solvent. For the 5-molecule model based on x-ray coordinates both vacuum and PCM calculations predict the value −0.4 Hz. This is similar in magnitude to J-coupling constants across backbone hydrogen bonds that have been successfully measured for a number of small proteins^13,14. It is well known, however, that NH₃ ⁺ groups typically suffer from rapid solvent exchange, which makes them a difficult target for such measurements¹⁵.

Since linear lysine-carbonyl interaction is a recurring motif in proteins, it is interesting to place this interaction in the context of protein structure. Typically, this interaction occurs near the protein surface, although we have also identified a number of exceptions where it is found in the protein core, see Fig. 3A. Addressing this latter situation, we can draw a parallel to buried salt bridges. In the low-dielectric-constant environment of protein interior, salt-bridging side chains form strong electrostatic interactions; however, the resulting favorable enthalpy is almost completely offset by a large desolvation penalty^16,17. A similar compensation effect can be expected for linear lysine-carbonyl interactions that are sequestered in the protein core.

Parsing of the PDB-extracted dataset reveals that 89% of all linear interactions occur within one protein chain, while 11% are interchain contacts. Linear interactions can also occur in the form of crystal contacts between symmetry-related molecules in the crystal lattice. Considering secondary-structure preferences, lysine residues involved in linear interactions are no different from the general population of lysines, i.e. they show no special preferences. On the other hand, their counterpart carbonyl groups do have distinctive conformational preferences: they strongly disfavor β-sheets (as expressed by a factor of 0.26) and, to a lesser extent, α-helices (0.61), while showing a moderate preference for coil (1.27) and a more substantial preference for turns (2.06).
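Preference factors of this kind are commonly computed as a ratio of two fractions: the fraction of interacting carbonyls that fall into a given secondary-structure class, divided by the fraction of all carbonyls in that class. The sketch below assumes this common definition, which may differ in detail from the one given in the caption of Fig. S6, and the counts in the usage example are purely hypothetical.

```python
def propensity(n_class_subset, n_subset, n_class_all, n_all):
    """Ratio of class frequencies: subset of interest versus full population.

    Values above 1 indicate a preference for the class, values below 1
    indicate that the class is disfavored. Assumed definition, see lead-in.
    """
    return (n_class_subset / n_subset) / (n_class_all / n_all)

# Hypothetical toy counts: 20 of 100 interacting carbonyls are in turns,
# while 1,000 of 10,000 carbonyls overall are in turns.
example = propensity(20, 100, 1000, 10000)
```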

The case of α-helices deserves a special discussion. Starting from N-terminus of an α-helix, all carbonyl groups are engaged in canonical CO ··· HN hydrogen bonds – except for three carbonyls at the C-terminal end of the helix which lack HN counterparts in a standard helical arrangement¹⁸. Although technically possible, lysine side chain rarely forms linear interaction with a hydrogen-bonded carbonyl in the N-terminal portion of the helix. On the other hand, lysine side chains often connect to one of the eligible carbonyls at the C-terminal end of α-helices. As it turns out, the last residue in α-helix is a favored site to form linear interaction with the corresponding propensity 1.9, while next-to-last residue has even stronger preference, 3.6. This leads us to suggest that linear interactions have a role in C-terminal helix capping. Similar observations can be made for 3₁₀ helices (see Fig. S6 for additional information). In relative terms, helix-capping motifs involving linear NH₃ ⁺ – carbonyl interactions are rare. In our PDB-extracted dataset only 0.4% of all α-helices are capped in this manner (note that capping motifs involving side chains are generally infrequent in the C-termini of α-helices¹⁹). However, on the absolute scale this translates into 2,122 unique examples of linear interaction occurring as a part of the C-terminal helix cap (see Fig. 3B for illustration). Evidently, in each of these instances linear interaction can have a role in stabilizing the C-terminal end of the helix and thereby could exert influence on protein stability and function.

Not too many linear interactions originate from carbonyl groups in β-strands. For those carbonyls that are engaged in canonical CO ··· HN hydrogen bonds, linear interactions are possible, but exceedingly rare. In contrast, those carbonyls that are located in the outer strands and are oriented outward are more likely to form linear interactions. There are examples of linear interaction protecting the outer strand of β-sheet (see Fig. 3C) whereby the presence of charged ε-ammonium group over the outer edge of the sheet prevents it from connecting with a β-sheet from another protein molecule and thus avoids unwanted dimerization²⁰. In a number of cases linear interaction involves free carbonyl groups of the last (C-terminal) residue in β-strand, apparently providing stabilization akin to the helix capping and reducing terminal fraying in the β-sheet.

Finally, it is not surprising that linear interactions are often encountered in turns. Indeed, turns provide an “open” topology, where several hydrogen-bond acceptors can converge on one point to engage lysine side-chain NH₃ ⁺ group. In addition, a free carbonyl is frequently available to form an axial interaction and thus complete the arrangement (illustrated in Fig. 3D).

Although relatively rare, linear interactions are expected to be of consequence in many specific proteins or protein complexes. For example, illustrated in Fig. 3E is the linear interaction at the interface between the doubly-acetylated histone H4 tail and the bromodomain from the transcription coactivator CBP²¹. In this case, the H4 peptide contributes a lysine side chain, which is accommodated by the C-terminal turn of helix B from the bromodomain, creating a full complement of interactions for the lysine ε-ammonium group, i.e. three hydrogen bonds (two toward backbone carbonyls and one toward the ordered water molecule) plus a linear interaction with the carbonyl in the −2 position from the C-terminal end of the helix. Of note, the binding of H4 to bromodomains is typically transient, with affinities ranging from tens to hundreds of micromolar²². In this situation, the linear interaction shown in Fig. 3E is expected to provide a meaningful contribution to the binding affinity. Indeed, it has been observed that the lysine residue at hand is one of the two key binding residues²¹. The recognition of acetylated H4 by bromodomains activates gene transcription; this mechanism has been identified as a potential target for epigenetic cancer therapy, leading to an intense search for competitive inhibitors^23,24.

An even more interesting example of linear interaction is found in the structure of the well-known oncoprotein H-Ras p21. This protein belongs to the ubiquitous P-loop NTPase fold and therefore contains the signature P-loop sequence GxxxxGK(S/T). The conserved lysine residue in this motif is particularly important for NTP loading and hydrolysis: its ε-ammonium group forms hydrogen bonds with γ- and β-phosphates, as well as carbonyl of the conserved glycine residue at the beginning of the P-loop. As it turns out, in the Ras proteins the consensus lysine K16 can additionally establish linear interaction with the carbonyl from the second residue in the P-loop, A11. This arrangement, illustrated in Fig. 3F, features a pair of consecutive residues, where conserved glycine G10 forms hydrogen bond with the lysine NH₃ ⁺ group and the next residue A11 forms linear interaction with the same group. Of note, the hydrogen bond is shifted slightly away from its standard orientation (C^ε-N^ζ-O angle 93°), making additional room for the linear interaction (C^ε-N^ζ-O angle 151°).
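The signature sequence GxxxxGK(S/T) maps directly onto a regular expression, which makes it easy to locate the conserved lysine and the residue in position 2 of the P-loop. The sketch below is illustrative only: the function name is hypothetical, and the test fragment is assumed to be the N-terminal segment of human H-Ras, chosen because its numbering reproduces G10, A11 and K16 as quoted above.

```python
import re

# G, any four residues, G, K, then S or T.
P_LOOP = re.compile(r"G.{4}GK[ST]")

def find_p_loops(sequence):
    """Return non-overlapping P-loop matches with 1-based residue numbers."""
    hits = []
    for match in P_LOOP.finditer(sequence):
        start = match.start() + 1  # convert to 1-based numbering
        hits.append({
            "motif": match.group(),
            "start": start,          # first conserved glycine
            "position2": start + 1,  # carbonyl donor for the linear contact
            "lysine": start + 6,     # conserved lysine (position 7)
        })
    return hits

# Assumed N-terminal fragment of H-Ras, used here only as a test string.
hras_fragment = "MTEYKLVVVGAGGVGKSALTIQ"
hits = find_p_loops(hras_fragment)
```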

In the Protein Data Bank we have identified 159 structures of Ras and Ras-like proteins featuring linear interaction between the conserved lysine in position 7 and residue in position 2 within the P-loop (in other Ras-family structures the interaction falls just outside the boundaries of the cluster 2, Fig. 1B). Furthermore, we found multiple structures of other P-loop NTPases that feature the same characteristic linear interactions: adenylate kinases, e.g. 1ZIN²⁵, mitochondrial ATP synthases, e.g. 2JDI²⁶, elongation factors, e.g. 4LBW²⁷, myosin and kinesin motor domains, e.g. 1LVK²⁸ and 2ZFI²⁹, etc. The P-loop NTPase fold is by far the most common protein fold in eukaryotes; many of its functions, such as genome replication and translation, are indispensable for life³⁰. Evidently, the linear interaction discussed above contributes toward stabilization of the P-loop in a requisite conformation, which is highly important for the catalytic activity of NTPase³¹.

One may wonder whether the linear interaction occurs only among NH₃⁺ – carbonyl pairs in proteins, or whether similar motifs may also arise in other contexts. The analysis of the PDB-derived dataset showed that the oxygen atom of crystallographic water is also often found near the symmetry axis of the ε-ammonium group. Such water oxygens form a distinct cluster on the density map akin to cluster 2 in Fig. 1B, but centered somewhat farther away from the N^ζ atom (see Fig. S7a). Furthermore, it turns out that hydroxyl oxygen atoms from Ser, Thr and Tyr side chains are also frequently localized in the area of cluster 2 (see Fig. S7b). These examples suggest that an sp³ oxygen can interact with the NH₃⁺ group in a manner similar to an sp² oxygen; the details of the former interaction await future investigation.

In addition, we have identified several examples of a linear arrangement involving the lysine NH₃⁺ group and various sites in nucleic acids, e.g. the carbonyl of a nitrogenous base (PDB ID 1C9S³²) or a ribose hydroxyl (PDB ID 1SDS³³). However, the statistics for such pairs are insufficient, which makes it difficult to decide whether these examples represent a reproducible structural motif.

In this connection it is also interesting to note that carboxylic oxygens belonging to the Asp and Glu side chains do not produce a clear-cut cluster that can be associated with the linear interaction (see Fig. S7c). It appears that COO⁻ can usually force its way into a more favorable hydrogen-bonded position by displacing a weaker acceptor, such as carbonyl. As a result, COO⁻ ··· NH₃⁺ salt bridges almost invariably occur in the form of hydrogen bonds, while the linear arrangement for this ion pair is rare and does not manifest itself as a distinct structural motif (a more nuanced discussion would require knowledge of the carboxyl protonation status).

In conclusion, the lysine NH₃⁺ – carbonyl linear interaction provides an interesting example of a previously unidentified structural motif. It can successfully complement hydrogen bonds, providing a meaningful addition to the stabilization energy. This motif finds some distinctive uses in protein architecture, e.g. in helix capping; it also turns out to be an integral part of important protein sites, such as the P-loop in nucleoside triphosphate hydrolases. Other examples of apparent linear interaction between NH₃⁺ and various polar moieties also exist and warrant further investigation.

Methods

PDB analyses

A subset of high-quality protein x-ray structures has been extracted from the Protein Data Bank on 06.09.2017 using the following criteria: resolution is 2.0 Å or better and R_free is 0.25 or better. Further analysis was restricted to the first model from each PDB file. All atoms with alternate conformations have been ignored. The program HBPLUS³⁴ has been used to identify hydrogen bonds and the program STRIDE³⁵ has been used for secondary structure classification. We have also used other selection criteria: (i) resolution is 1.5 Å or better, R_free is 0.20 or better and/or (ii) sequence identity between any two structures in the subset is 90% or lower. The results from these additional analyses are presented in Fig. S1. The propensity of linear interactions toward certain elements of secondary structure is calculated as explained in the caption of Fig. S6.
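The quality filter described above amounts to two threshold tests per structure. A minimal sketch is given below; the `Entry` record, the function name and the example entries are hypothetical and stand in for whatever metadata source is actually used.

```python
from dataclasses import dataclass

@dataclass
class Entry:
    pdb_id: str        # placeholder identifiers in the example below
    resolution: float  # in Angstrom; lower is better
    r_free: float      # lower is better

def passes_quality_filter(entry, max_resolution=2.0, max_r_free=0.25):
    """Keep entries whose resolution and R_free are at or below the cut-offs."""
    return entry.resolution <= max_resolution and entry.r_free <= max_r_free

# Hypothetical entries, used only to exercise the filter.
entries = [
    Entry("aaaa", 1.8, 0.22),
    Entry("bbbb", 2.4, 0.21),
    Entry("cccc", 1.4, 0.19),
]
main_set = [e.pdb_id for e in entries if passes_quality_filter(e)]
# The stricter alternative from criterion (i): 1.5 Angstrom and R_free of 0.20.
strict_set = [e.pdb_id for e in entries if passes_quality_filter(e, 1.5, 0.20)]
```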

Quantum chemistry calculations

All calculations were performed using Q-Chem unless explicitly stated otherwise (see below). To calculate the energies of the 2-molecule model system methylammonium ion – N-methylacetamide, the following procedure has been adopted. The initial geometry has been generated as illustrated in Fig. 2G,H for a given setting of r and θ. For each of the three methyls as well as the NH₃⁺ group we generated two different rotamers differing by 60°, resulting in a total of 16 different starting geometries. Each of these geometries has been optimized using the B3LYP functional^36,37 with the 6-31G(d) basis set^38,39 while keeping the coordinates of the C^ε, N^ζ and O atoms fixed. The resulting refined models were used to conduct single-point energy calculations employing the ωB97X-D functional⁴⁰ with the cc-pVQZ basis set⁴¹. The lowest energy (among the 16 models) was chosen for plotting in Fig. 2A–C. This tactic has allowed for consistently good optimization of proton coordinates. To model a polar environment on the protein surface, all calculations were carried out using the conductor-like C-PCM solvent⁶.
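The count of 16 starting geometries follows from four rotatable groups (three methyls plus the NH₃⁺ group) with two rotamers each, 2⁴ = 16. The sketch below enumerates these combinations and picks the lowest energy among them; the rotor labels and the helper names are hypothetical, and no quantum chemistry call is made here.

```python
from itertools import product

# Hypothetical labels: the methyl of methylammonium, the two methyls of
# N-methylacetamide, and the ammonium group itself.
ROTORS = ("CH3_ammonium", "CH3_acetyl", "CH3_amide_N", "NH3")
ROTAMER_OFFSETS_DEG = (0.0, 60.0)  # two rotamers differing by 60 degrees

def starting_geometries():
    """Return one {rotor: offset} mapping per rotamer combination."""
    return [
        dict(zip(ROTORS, combo))
        for combo in product(ROTAMER_OFFSETS_DEG, repeat=len(ROTORS))
    ]

def lowest_energy(energies):
    """Return (index, energy) of the minimum among the computed energies."""
    index = min(range(len(energies)), key=energies.__getitem__)
    return index, energies[index]

geometries = starting_geometries()
```

In a scan over (r, θ), such an enumeration would be repeated at every grid point, with the minimum over the 16 optimized models retained for the energy map.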

A similar protocol has been used to calculate the energies in the 5-molecule model system, Fig. 2D–F. In this case, the coordinates of all atoms belonging to the hydrogen-bonded ligands of NH₃⁺ have been fixed according to the crystal structure 4RLZ. The N-methylacetamide molecule has been replaced in this model with the smaller formamide molecule; the effect of this replacement on the calculated energy is modest (cf. Table S1). J-coupling constants have been computed for optimized geometries at the same level of DFT as the energies using the Mixed option⁴² in the Gaussian program⁴³. NBO (Natural Bond Orbital) analysis was performed on the geometry-optimized model using the NBO 5.0 program⁴⁴ (available as a part of the Q-Chem package) at the B3LYP/6-31G(d) level.

Data availability statement

All data are available from the authors upon request.

References

Bass, M. B., Hopkins, D. F., Jaquysh, W. A. N. & Ornstein, R. L. A method for determining the positions of polar hydrogens added to a protein structure that maximizes protein hydrogen bonding. Proteins: Struct. Funct. Genet. 12, 266–277 (1992).
Smith, J. S. & Scholtz, J. M. Energetics of polar side-chain interactions in helical peptides: Salt effects on ion pairs and hydrogen bonds. Biochemistry 37, 33–40 (1998).
Zandarashvili, L. & Iwahara, J. Temperature dependence of internal motions of protein side-chain NH₃ ⁺ groups: insight into energy barriers for transient breakage of hydrogen bonds. Biochemistry 54, 538–545 (2015).
Donald, J. E., Kulp, D. W. & DeGrado, W. F. Salt bridges: Geometrically specific, designable interactions. Proteins: Struct. Funct. Bioinf. 79, 898–915 (2011).
Shao, Y. H. et al. Advances in molecular quantum chemistry contained in the Q-Chem 4 program package. Mol. Phys. 113, 184–215 (2015).
Cossi, M., Rega, N., Scalmani, G. & Barone, V. Energies, structures, and electronic properties of molecules in solution with the C-PCM solvation model. J. Comput. Chem. 24, 669–681 (2003).
Morozov, A. V., Kortemme, T., Tsemekhman, K. & Baker, D. Close agreement between the orientation dependence of hydrogen bonds observed in protein structures and quantum mechanical calculations. Proc. Natl. Acad. Sci. USA 101, 6946–6951 (2004).
Sheinerman, F. B. & Honig, B. On the role of electrostatic interactions in the design of protein-protein interfaces. J. Mol. Biol. 318, 161–177 (2002).
Xiao, S. F. et al. Rational modification of protein stability by targeting surface sites leads to complicated results. Proc. Natl. Acad. Sci. USA 110, 11337–11342 (2013).
Reed, A. E., Curtiss, L. A. & Weinhold, F. Intermolecular interactions from a natural bond orbital, donor-acceptor viewpoint. Chem. Rev. 88, 899–926 (1988).
Arnold, W. D. & Oldfield, E. The chemical nature of hydrogen bonding in proteins via NMR: J-couplings, chemical shifts, and AIM theory. J. Am. Chem. Soc. 122, 12835–12841 (2000).
Grzesiek, S., Cordier, F., Jaravine, V. & Barfield, M. Insights into biomolecular hydrogen bonds from hydrogen bond scalar couplings. Prog. NMR Spectrosc. 45, 275–300 (2004).
Cordier, F. & Grzesiek, S. Direct observation of hydrogen bonds in proteins by interresidue ^h3J_NC’ scalar couplings. J. Am. Chem. Soc. 121, 1601–1602 (1999).
Cornilescu, G. et al. Correlation between ^3hJ_NC’ and hydrogen bond length in proteins. J. Am. Chem. Soc. 121, 6275–6279 (1999).
Zandarashvili, L., Li, D. W., Wang, T. Z., Bruschweiler, R. & Iwahara, J. Signature of mobile hydrogen bonding of lysine side chains from long-range ¹⁵N-¹³C scalar J-couplings and computation. J. Am. Chem. Soc. 133, 9192–9195 (2011).
Waldburger, C. D., Schildbach, J. F. & Sauer, R. T. Are buried salt bridges important for protein stability and conformational specificity? Nat. Struct. Biol. 2, 122–128 (1995).
Kumar, S. & Nussinov, R. Salt bridge stability in monomeric proteins. J. Mol. Biol. 293, 1241–1255 (1999).
Kabsch, W. & Sander, C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983).
Aurora, R. & Rose, G. D. Helix capping. Protein Sci. 7, 21–38 (1998).
Richardson, J. S. & Richardson, D. C. Natural β-sheet proteins use negative design to avoid edge-to-edge aggregation. Proc. Natl. Acad. Sci. USA 99, 2754–2759 (2002).
Plotnikov, A. N. et al. Structural insights into acetylated-histone H4 recognition by the bromodomain-PHD finger module of human transcriptional coactivator CBP. Structure 22, 353–360 (2014).
Filippakopoulos, P. et al. Histone recognition and large-scale structural analysis of the human bromodomain family. Cell 149, 214–231 (2012).
Asangani, I. A. et al. Therapeutic targeting of BET bromodomain proteins in castration-resistant prostate cancer. Nature 510, 278–282 (2014).
Jain, A. K. & Barton, M. C. Bromodomain histone readers and cancer. J. Mol. Biol. 429, 2003–2010 (2017).
Berry, M. B. & Phillips, G. N. Crystal structures of Bacillus stearothermophilus adenylate kinase with bound Ap₅A, Mg²⁺ Ap₅A, and Mn²⁺ Ap₅A reveal an intermediate lid position and six coordinate octahedral geometry for bound Mg²⁺ and Mn²⁺. Proteins: Struct. Funct. Genet. 32, 276–288 (1998).
Bowler, M. W., Montgomery, M. G., Leslie, A. G. W. & Walker, J. E. Ground state structure of F₁-ATPase from bovine heart mitochondria at 1.9 Å resolution. J. Biol. Chem. 282, 14238–14242 (2007).
Groftehauge, M. K. et al. Identifying ligand-binding hot spots in proteins using brominated fragments. Acta Crystallogr. Sect. F Struct. Biol. Cryst. Commun. 69, 1060–1065 (2013).
Bauer, C. B., Kuhlman, P. A., Bagshaw, C. R. & Rayment, I. X-ray crystal structure and solution fluorescence characterization of Mg·2′(3′)-O-(N-methylanthraniloyl) nucleotides bound to the Dictyostelium discoideum myosin motor domain. J. Mol. Biol. 274, 394–407 (1997).
Nitta, R., Okada, Y. & Hirokawa, N. Structural model for strain-dependent microtubule activation of Mg-ADP release from kinesin. Nat. Struct. Mol. Biol. 15, 1067–1075 (2008).
Koonin, E. V., Wolf, Y. I. & Aravind, L. Protein fold recognition using sequence profiles and its application in structural genomics. Adv. Protein Chem. 54, 245–275 (2000).
Saraste, M., Sibbald, P. R. & Wittinghofer, A. The P-loop - a common motif in ATP- and GTP-binding proteins. Trends Biochem. Sci. 15, 430–434 (1990).
Antson, A. A. et al. Structure of the trp RNA-binding attenuation protein, TRAP, bound to RNA. Nature 401, 235–242 (1999).
Hamma, T. & Ferre-D’Amare, A. R. Structure of protein L7Ae bound to a K-turn derived from an archaeal box H/ACA sRNA at 1.8 Å resolution. Structure 12, 893–903 (2004).
McDonald, I. K. & Thornton, J. M. Satisfying hydrogen-bonding potential in proteins. J. Mol. Biol. 238, 777–793 (1994).
Frishman, D. & Argos, P. Knowledge-based protein secondary structure assignment. Proteins: Struct. Funct. Genet. 23, 566–579 (1995).
Becke, A. D. Density-functional thermochemistry. III. The role of exact exchange. J. Chem. Phys. 98, 5648–5652 (1993).
Stephens, P. J., Devlin, F. J., Chabalowski, C. F. & Frisch, M. J. Ab initio calculation of vibrational absorption and circular dichroism spectra using density functional force fields. J. Phys. Chem. 98, 11623–11627 (1994).
Hehre, W. J., Ditchfield, R. & Pople, J. A. Self-consistent molecular orbital methods. XII. Further extensions of Gaussian-type basis sets for use in molecular orbital studies of organic molecules. J. Chem. Phys. 56, 2257–2262 (1972).
Hariharan, P. C. & Pople, J. A. Influence of polarization functions on molecular orbital hydrogenation energies. Theor. Chim. Acta 28, 213–222 (1973).
Chai, J. D. & Head-Gordon, M. Long-range corrected hybrid density functionals with damped atom-atom dispersion corrections. Phys. Chem. Chem. Phys. 10, 6615–6620 (2008).
Dunning, T. H. Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogen. J. Chem. Phys. 90, 1007–1023 (1989).
Deng, W., Cheeseman, J. R. & Frisch, M. J. Calculation of nuclear spin-spin coupling constants of molecules with first and second row atoms in study of basis set dependence. J. Chem. Theory Comput. 2, 1028–1037 (2006).
Frisch, M. J. et al. Gaussian 16 (Wallingford, CT, 2016).
Weinhold, F. & Landis, C. R. Natural bond orbitals and extensions of localized bonding concepts. Chem. Educ. Res. Pract. 2, 91–104 (2001).
Liu, W. et al. A unique human norovirus lineage with a distinct HBGA binding interface. PLoS Pathog. 11, e1005025 (2015).
Greiner, W., Neise, L. & Stoecker, H. Thermodynamics and Statistical Mechanics. (Springer-Verlag, 1995).
Benini, S. et al. The crystal structure of Sporosarcina pasteurii urease in a complex with citrate provides new hints for inhibitor design. J. Biol. Inorg. Chem. 18, 391–399 (2013).
Casteleijn, M. G. et al. Functional role of the conserved active site proline of triosephosphate isomerase. Biochemistry 45, 15483–15494 (2006).
Kajander, T., Lehtio, L., Schlomann, M. & Goldman, A. The structure of Pseudomonas P51 Cl-muconate lactonizing enzyme: Co-evolution of structure and dynamics with the dehalogenation function. Protein Sci. 12, 1855–1864 (2003).
Pai, E. F. et al. Refined crystal structure of the triphosphate conformation of H-ras p21 at 1.35 Å resolution: implications for the mechanism of GTP hydrolysis. EMBO J. 9, 2351–2359 (1990).


Acknowledgements

This work was supported by the Russian Science Foundation grant 15-14-20038. We acknowledge the Computer Center resource at St. Petersburg State University.

Author information

Authors and Affiliations

Laboratory of Biomolecular NMR, St. Petersburg State University, St. Petersburg, 199034, Russia
Olga N. Rogacheva, Sergei A. Izmailov & Nikolai R. Skrynnikov
Department of Chemistry, Purdue University, West Lafayette, IN, 47907, USA
Lyudmila V. Slipchenko & Nikolai R. Skrynnikov
Department of General Pathology and Pathophysiology, Institute of Experimental Medicine, St. Petersburg, 197376, Russia
Olga N. Rogacheva

Contributions

All authors designed research. S.A.I. and O.N.R. conducted PDB analyses. O.N.R., S.A.I. and L.V.S. conducted quantum chemistry calculations. O.N.R., S.A.I. and N.R.S. wrote the manuscript.

Corresponding author

Correspondence to Nikolai R. Skrynnikov.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Rogacheva, O.N., Izmailov, S.A., Slipchenko, L.V. et al. A new structural arrangement in proteins involving lysine NH₃ ⁺ group and carbonyl. Sci Rep 7, 16402 (2017). https://doi.org/10.1038/s41598-017-16584-y

Received: 18 September 2017
Accepted: 14 November 2017
Published: 27 November 2017
DOI: https://doi.org/10.1038/s41598-017-16584-y
