A closer look into the α-helix basin

Haimov, Boris; Srebnik, Simcha

doi:10.1038/srep38341

Download PDF

Article
Open access
Published: 05 December 2016

A closer look into the α-helix basin

Boris Haimov¹ &
Simcha Srebnik^1,2

Scientific Reports volume 6, Article number: 38341 (2016) Cite this article

6088 Accesses
20 Citations
2 Altmetric
Metrics details

Subjects

Abstract

α-Helices are the most abundant structures found within proteins and play an important role in the determination of the global structure of proteins and their function. Representation of α-helical structures with the common (φ, ψ) dihedrals, as in Ramachandran maps, does not provide informative details regarding the helical structure apart for the abstract geometric meaning of the dihedrals. We present an alternative coordinate system that describes helical conformations in terms of residues per turn (ρ) and angle (ϑ) between backbone carbonyls relative to the helix direction through an approximate linear transformation between the two coordinates system (φ, ψ and ρ, ϑ). In this way, valuable information on the helical structure becomes directly available. Analysis of α-helical conformations acquired from the Protein Data Bank (PDB) demonstrates that a conformational energy function of the α-helix backbone can be harmonically approximated on the (ρ, ϑ) space, which is not applicable to the (φ, ψ) space due to the diagonal distribution of the conformations. The observed trends of helical conformations obtained from the PDB are captured by four conceptual simulations that theoretically examine the effects of residue bulkiness, external electric field, and externally applied mechanical forces. Flory’s isolated pair hypothesis is shown to be partially correct for α-helical conformations.

Extended experimental inferential structure determination method in determining the structural ensembles of disordered protein states

Article Open access 09 June 2020

Exo-chirality of the α-helix

Article Open access 14 August 2024

Sampling of the conformational landscape of small proteins with Monte Carlo methods

Article Open access 23 October 2020

Introduction

The middle of the 20^th century is considered to be the genesis of structural biology. During this period, Pauling and coworkers discovered the two most fundamental structures found within proteins: the α-helix and the β-sheet^1,2,3. This has led to a leap in our understanding of protein structure⁴ and function⁵, and later in folding prediction⁶ and de novo design⁷. According to Pauling and coworkers, α-helices have a well-defined structure with constant displacement distances between nitrogen (N) and alpha carbon (Cα) of N-Cα = 1.47 Å, between Cα and carbonyl carbon (C) of Cα-C = 1.53 Å, and C-N = 1.32 Å. The bend angles are fixed at Cα-C-N = 117°, C-N-Cα = 120°, N-Cα-C = 110°, and planar Cα-C-N-Cα = 180° dihedral angle. There are 3.7 amino acid (AA) residues per turn, with a translation of 1.47 Å per residue along the α-helical axis, and a hydrogen bond (HB) between the carbonyl group of every i^th residue to the amide group of i + 4^th residue with an optimal distance of 2.72 Å between the H-bonded oxygen and nitrogen backbone atoms.

Nearly a decade after the discovery of the α-helix, a systematic tool was developed by Ramachandran and coworkers⁸ for the analysis of the backbone conformation of polypeptides, namely the Ramachandran map. The Ramachandran map allows for distinguishing between regions of similar backbone conformations of polypeptide chains⁹. The Ramachandran map is a plot of dihedral angles φ and ψ, where φ is the C-N-Cα-C dihedral angle and ψ is the N-Cα-C-N dihedral angle. An important variation of the Ramachandran analysis is the neighbor dependent analysis. Neighbor dependent studies allow to take into account the effects of the neighbors on the (φ, ψ) propensity of a given AA^10,11,12. One of the interesting conclusions drawn from Ramachandran’s pioneering study was that more than half of the regions on the Ramachandran map are sterically inaccessible. The understanding of the inaccessible and accessible Ramachandran regions is crucial for the understanding of the distribution of the empirically determined backbone conformations.

The Ramachandran map opened the doors for a wide variety of empirical and theoretical conformational studies of α-helical polypeptides. Empirical determination of α-helix structures demonstrated a diagonal distribution of the (φ, ψ) pairs^13,14. Since the α-helix basin is found near a sterically inaccessible region near the origin and is distributed along a diagonal, it was believed that the cause for the observed distributions is its location on the Ramachandran map¹³. Earlier studies carried out by Scheraga and coworkers^15,16,17,18, also pointed out the relevance of the diagonals when dealing with α-helical conformations. A later study revealed that geometrical constraints of α-helical HBs are likely to be the reason for the observed diagonal distributions¹⁴. Further empirical studies^19,20 showed that the different AAs are found in different proportions within α-helices due to the energetic cost for inclusion of some given AA within the α-helix. The latter allowed the definition of α-helix propensities for the different AAs. Accordingly, MET, ALA, LEU, GLU (E), and LYS (K) (or shortly MALEK in one-letter AA codes) are the AAs with the highest α-helix propensity while PRO and GLY are with the lowest.

Following these advances, many efforts have been invested in the study of α-helices, however, little was done to understand the conformations of α-helices within the α-helix basin. Thus, the purpose of this study is to provide a deeper look into the different conformations of α-helices.

Results

We wish to find a mathematical relation between (φ, ψ) and (ρ, ϑ) coordinate systems, where (φ, ψ) are the commonly used dihedral angles, ρ is the number of residues per single turn of the α-helix, and ϑ is the angle between a backbone carbonyl (CO) normal of the given residue relative to the normal of the α-helix direction (ϑ is positive for CO normal pointing outwards from the α-helix center) as illustrated in Fig. 1a. The relation between ρ and ϑ as a function of (φ, ψ) is shown in Fig. 1c and d, respectively. The analytical determination of the α-helix basin was done by using the HB alignment score S, such that regions with S > 0 are treated as the α-helix basin in this study. Figure 1b illustrates the evaluated score S as a function of (φ, ψ), and presents visually the shape and the location of the α-helix basin. Next, we derived a linear transformation from (φ, ψ) coordinates to (ρ, ϑ) coordinates with the following set of equations:

where the errors of ρ and ϑ are Δρ < 0.2 [Res/Turn] and Δϑ < 2.4° for S > 0. Figure 1e presents the alignment score S on the resulting (ρ, ϑ) space.

In this study we distinguish between 400 naturally occurring AA pairs which we name as transitions. By naming pairs as transitions we actually emphasize the importance of directionality when dealing with polypeptides. On the (φ, ψ) space a transitional conformation is the (φ, ψ) pair describing the conformation of the transition from AA_X to AA_Y. For a transition advancing from N to C terminus with the following backbone atoms: N_X-Cα_X-C_X-N_Y-Cα_Y-C_Y, we define the transitional ψ = ψ_X−>Y dihedral as N_X-Cα_X-C_X-N_Y and the transitional φ = φ_X−>Y dihedral as C_X-N_Y-Cα_Y-C_Y. This naming convention is used to describe the conformation of the α-helix backbone. Figure 2a presents the distribution of all the sampled ALA−>ALA transitions on the (φ, ψ) coordinates system. It is clearly observed that the (φ, ψ) distribution is along the φ + ψ = const diagonal, which is a result of the HBs along the α-helix backbone^14,21 and the sterically inaccessible regions near the α-helix basin^13,14. Figure 2b presents the typical distribution of all the sampled ALA−>ALA transitions on the (ρ, ϑ) coordinates system. By observing Fig. 2b we immediately conclude that ALA−>ALA transitions found in PDB helices contain an average of 3.6 [Res/Turn] and a ϑ angle of about 12° relative to the helix direction normal in agreement to previous reports^22,23,24,25. The latter is a validation of the (φ, ψ to ρ, ϑ) transformation applied on the PDB data, presented in Equation 1. All other transitions demonstrate very similar distributions to those of ALA−>ALA with the exception of PRO and GLY that will be discussed later in this study. The difference between the different transitions is in the mean value of the specific distribution. An interesting result presented in Fig. 2b is the symmetrical Gaussian-like fluctuations on the alternative coordinates system. The fluctuations of the measured values are the result of measurement, thermal, and other sources of noise. The symmetrical fluctuation pattern allows drawing the following conclusion: by focusing on some circular contour on Fig. 2b, the green one for instance, we deduce that the energy to cause a shift of ~0.8 [Res/Turn] is the same as the energy to cause a shift of ~15° of the ϑ angle. The implication of this observation will be further discussed within the Heterogeneous transitions section.

The mean value of each of the 400 naturally occurring transitions is displayed in Fig. 2c for every one of the 4 filtering levels. The inset of Fig. 2c shows the mean value of all of these transitions for the given filtering level. We clearly observe the migration of the transitional (φ, ψ) pairs towards higher φ values and lower ψ values along the φ + ψ = const diagonal, which suggests that for helices with better aligned HBs we should expect higher φ values and lower ψ values. This raises the question of how much we can increase φ and decrease ψ along the φ + ψ = const diagonal such that the polypeptide will remain in its α-helical shape. The answer is found within Equation 3 (in Methods) that shows that the optimum of the HB alignment equation is found on the line φ + ψ = −107.8° at (φ_M, ψ_M) = (−49.7°, −58.1°). Figure 2d presents the same transitions as in Fig. 2c but on the (ρ, ϑ) coordinates system. A clear migration towards regions of high alignment score is observed with increasing filtering level. Furthermore, a very weak change is observed from filtering level 2 and level 3 which suggests that the maximally aligned HB conformation at (ρ, ϑ) ~ (3.62 [Res/Turn], 0°) is hard to reach. In Fig. 3 we show that the conformational optimum of (ρ, ϑ) = (3.63 [Res/Turn], 9.3°) is a direct result of HB interactions on the order of 4 [kCal/mole]. Having the distributions of the 400 naturally occurring transitions, we divide the analysis into two parts: 20 homogeneous (same AAs) and 380 heterogeneous (different AAs) transitions analysis.

Homogeneous transitions

Figure 4 presents the mean (ρ, ϑ) values of the 20 possible homogeneous transitions at the four filtering levels, the higher the level the better aligned the backbone of the helix. MALEK are known as the AAs with the highest helical propensities²⁰, and as indeed may be observed in Fig. 4, MALEK are closely spread around the center of each filtering level, especially in filtering Level 0 and Level 1. This suggests that MALEK are helically well aligned AAs and practically dictate the location of the mean helix conformation. It is clearly observed that GLY transitions are always the leftmost with the lowest amount of residues per turn and PRO transitions are always at the bottom with lowest ϑ. GLY and PRO have a clear tendency to stay away from the overall mean helical behavior (which is denoted by the + symbol on every plot), possibly due to the high energetic cost to include GLY and PRO within α-helices^19,20. However, GLY and PRO differ in their local effect on helix geometry. GLY strongly decreases the number of residues per turn and keeps ϑ above the overall average as is observed in all filtering levels. PRO, on the other hand, keeps the number of residues per turn above the overall mean and strongly decreases ϑ, as observed in all the filtering levels.

Interestingly, the basic AAs ARG (R) and LYS (K) demonstrate a conformational behavior that is very close to the overall mean conformation in all the filtering levels. The basic AA HIS, demonstrates the highest amount of residues per turn among all the basic AAs and the propensity to stay above the overall mean amount of residues per turn in all the filtering levels. Similarly, the polar uncharged AAs SER, THR, and GLN (Q) are always with less residues per turn than the overall mean, while the amount of residues per turn for ASN (N) is always above the overall mean.

Both ILE and VAL demonstrate a strong propensity to lower ϑ. These AAs are the only residues with two carbons at the γ position, which raises the question whether they are the reason for the observed lower ϑ angle. To assert this premise we performed Conceptual Simulations 1 and 2 as described in the Methods section and depicted in Fig. 5. The results of the inflated virtual residue in Fig. 5a shows that bulkiness near the helix backbone strongly decrease ϑ and also decrease ρ. Thus, we may deduce that since VAL is less bulky than ILE it decreases ϑ less as predicted by the conceptual simulations.

THR is a polar and uncharged AA that differs from VAL by the oxygen atom on the γ position, otherwise VAL and THR are sterically very similar and might be expected to behave similarly. As may be observed in Fig. 4, THR demonstrates considerably lower ϑ angle than the other uncharged AAs (SER, ASN (N), and GLN (Q)) in all the filtering levels, with the exception of GLN (Q) in Level 0. The latter may be because Level 0 may contain non-α-helical conformations, and because of the polar nature of the THR and GLN (Q) residues. Previous reports suggest that the bulky residues stabilize the α-helix HBs and shield them from surrounding water molecules^23,25. Our observations of bulkiness in the vicinity of the backbone for ILE, VAL, and in some cases for THR confirm that the shielding of the α-helix backbone occurs via the decrease of ϑ, since increased values of ϑ suggest the existence of nearby water molecules that destabilize the α-helical HBs²². The exceptions of THR in the lower filters might be explained by its polar nature.

The results for Conceptual Simulation 2 that focus on the effect of increasing distance of the virtual residue, presented in Fig. 5b, suggest that almost no conformational changes take place when the residue is inflated up to the critical value σ_C = 6 Å. Above σ_C we observe a decrease in the number of residues per turn while ϑ remains nearly constant. By focusing on the most bulky AAs PHE (F), TYR (Y), and TRP (W) (in increasing order of bulkiness, respectively) in Fig. 4 we indeed observe a decrease in the amount of residues per turn with the increase of bulkiness, as is confirmed by the conceptual simulation. In addition, two mismatches are evident: (1) FYW are expected to be to the left of MALEK but observed with an increased ρ (shifted rightwards in Fig. 4), and (2) FYW are expected to demonstrate a slight decrease of ϑ with the increase of bulkiness but demonstrates an increase of ϑ with the increase of bulkiness. The reason for the observed mismatches might be the naïve nature of the performed conceptual simulation that does not take into account interactions other than steric.

Since α-helices are not isolated structures and probably interact with the surrounding environment, we performed two additional simulations to understand the dependence of the helical structure on the surrounding forces. On the atomic scale, charged particles may induce local effects similar to those applied by an electric field^26,27,28. Thus, we tested the effect of an applied electric field as illustrated in Fig. 5c. As clearly observed from the resulting plot, the electric field promotes a change of ϑ, and has a negligible effect on the amount of residues per turn, or literally the change in ϑ is proportional to the magnitude of the applied electric field. The small but still present change in the ρ axis is due to the unequal magnitudes of the partial charges of the carbonyl oxygens (−0.5) and amide hydrogens (+0.33). For the hypothetical case where both of the partial charges were with the same absolute magnitude we would expect no dependency of the electric potential on the ρ axis.

In the last simulation shown in Fig. 5d we measure the contribution of an external stretching force applied only on Cα’s of the helix backbone. The plot shows the contribution of the external stretching force to the total conformational energy of the α-helix backbone. As clearly observed, stretching forces encourage the change of the number of residues per turn with a negligible change of ϑ, or literally the change in ρ is proportional to the magnitude of the externally applied force. The observed behavior demonstrates practically no change in ϑ since change in ϑ results in energetically unfavorable misalignment of the HBs. In both conceptual simulations 3 and 4, changing the sign of the electric field (direction of the applied force) will result in exactly the opposite change of ϑ (ρ).

Heterogeneous transitions

A heterogeneous transition is a transition AA₁−>AA₂ where AA₁ ≠ AA₂ with a total of possible 380 heterogeneous transitions for the 20 common AAs. Flory’s isolated pair hypothesis (IPH)²⁹ which was shown to be generally incorrect^30,31,32,33 states that the (φ, ψ) pair is independent of its neighbors. Interestingly, IPH was never discussed in detail specifically for the α-helix basin, probably because of the difficulties distinguishing between one helix and another. Our approach of analyzing conformations on the (ϑ, ρ) space allows studying the differences even between very similar α-helical conformations and will be used test the validity of IPH for the α-helix basin. Before approaching IPH we will focus on another important question which is directly related to IPH: Can we predict the conformational behavior of a heterogeneous transition AA₁−>AA₂ from the arithmetic mean of the homogeneous transitions: AA₁−>AA₁ and AA₂−>AA₂? To perform the comparison between a measured conformation to some predicted conformation we calculate the energetic cost of the predicted conformation relative to the measured conformation. We deduced earlier that the energy required to shift ~0.8 [Res/Turn] is the same energy required to shift ~15° of the ϑ angle for the green contour visually presented in Fig. 2b. To find more precise values of the shift, we repeated the latter calculation and found that 50% of all the measured transitions are confined within (Δϑ, Δρ) ≈ (13.5, 0.8), (10.5, 0.6), (9.6, 0.6) and (8.7, 0.5), for the four filtering levels 0–3, respectively, with an average of (〈Δϑ〉, 〈Δρ〉) ≈ (10.6°, 0.63 [Res/Turn]). The rationale behind such averaging is to give more weight to the more aligned helical structures (higher filtering levels). If we define a single Energy Unit (EU) as the energy required to a shift of Δϑ = 1°, and a conversion ratio K = 〈Δϑ〉/〈Δρ〉 ≈ 16.8 [°Turn/Res], we may use the following harmonic energy expression to approximate the energy of some given conformation:

where ϑ₀ and ρ₀ represent the minimum energy conformation for some given transition. Since both 〈Δϑ〉 and 〈Δρ〉 are the average confinement values for half of the measured transitions we deduce that the border energy of the confined transitions is E_0.5 = (〈Δϑ〉/2)² = K²(〈Δρ〉/2)² ≈ 28 [EU], which is approximately the energy of the green contour in Fig. 2b.

Table 1 presents the energy difference between the measured conformations to the predicted conformations for the heterogeneous transitions for filtering Level 1 (tables for other levels may be found in Supporting Information). The conformations were predicted by calculating the arithmetic mean of the homogeneous conformations. The energy differences were calculated using Equation 2, by defining the measured conformation as the minimum energy conformation. The results suggest that the mean energy difference of all cases is ~3 [EU], that the highest energy differences are observed for transitions from PRO (AA_PRO−>AA_X) and to PRO (AA_X−>AA_PRO), and that in most cases the transitions carry an asymmetric nature, i.e. ΔE_AA1−>AA2 ≠ ΔE_AA2−>AA1. The asymmetry issue is enough to conclude that in most cases predicted conformations will differ from the real conformations. Nevertheless, if energy differences are tolerable, the prediction of heterogeneous conformations may sometimes be useful especially when excluding PRO. In case the tolerance is set to a very small value of 1 [EU] we find that 45% (171 transitions out of 380) of the heterogeneous transitions are predictable by homogeneous averaging, and in case the tolerance is set to the half population boundary energy E_0.5 = 28 [EU], 97% (375 out of 380) of the heterogeneous transitions are predictable by homogeneous averaging. If IPH was absolutely correct than we would expect that all the values presented in Table 1 would equal to 0. Table 1 is actually a proof that IPH is incorrect when no tolerance in energy difference is allowed, however when introducing such tolerance, IPH becomes 45% correct for 1 [EU] tolerance and 97% correct for 28 [EU] (which is approximately the energy of the green contour in Fig. 2) tolerance as explained above for Level 1 filtering (the percentages increase with increased filtering). In addition, the asymmetric nature of the transitions justifies the transitional analysis approach that was done in this study and stresses that previous efforts of analyzing α-helical conformations lack important transitional information.

Table 1 Energy difference [EU] between homogeneous average conformations and measured conformation for transition AA_ROW−>AA_COL for Level 1 filtering.

Full size table

A possible explanation to the observed deviation between heterogeneous transitions to their homogeneous average is the interaction between residues – residues that do not interact chemically, sterically, or in any other way are expected to demonstrate stochastic conformational behavior, i.e. the average conformation of two non-interacting AAs are expected to be equal to the observed heterogeneous conformation. Thus, we can conclude that the higher the energy difference between the heterogeneous conformation to the average homogeneous, the higher the interaction between the residues with the exception of PRO that may result in high deviations because of its limited degrees of freedom.

Discussion

By representing helical structures on the (ρ, ϑ) space we attribute a meaning to the 2-dimensional representation of the α-helix in terms of residues per turn (ρ) and the CO angle of backbone carbonyls relative to the helix direction vector (ϑ). It was shown that a simple linear transformation allows for switching between (φ, ψ) and (ρ, ϑ) spaces, giving freedom of choice for the desired representation space. The transformation was validated by comparing our observations with those found in the literature. By using our new (ρ, ϑ) space we were able to deduce that: (1) 50% of α-helix conformations found in PDB are confined in average within (〈Δϑ〉, 〈Δρ〉) ≈ (10.6°, 0.63 [Res/Turn]). (2) The energy required to shift the conformation by Δϑ = 16.8° is the same energy required to shift the conformation by Δρ = 1 [Res/Turn] within the α-helix basin. (3) Residues with bulkiness near to the helix backbone (the case of VAL, ILE, and THR) decrease ϑ stronger than other residues. (4) Residues with bulkiness far from the helix backbone (the case of PHE (F), TYR (Y), and THR (W)) demonstrate a decrease in ρ with increased bulkiness. (5) An environment with charged particles affects primarily ϑ. (6) External stretching/squeezing forces affect ρ. Furthermore, It was shown that representation of helical structure on the (ρ, ϑ) space has the advantage of easily calculating the conformational energy of any given α-helix, which is not applicable on the (φ, ψ) space. The latter allowed to approach Flory’s IPH problem and to draw relevant conclusions. This study presented the α-helix basin from a novel perspective and resolutions that were not previously available.

Methods

This study is performed in three parts: (1) Derivation of the alternative coordinate system (ρ, ϑ) and its relation to the commonly used coordinates system (φ, ψ), (2) Statistical study of α-helical conformations found in PDB, and (3) Conceptual simulations to understand the conformational tendencies of α-helices found in PDB. In the 1^st part we use the base simulation model of the α-helix backbone to calculate the alignment score of HBs. We use the conformational sweep method to analytically determine the alignment score for all the possible α-helical conformations. One of the purposes of HBs alignment score is to analytically determine whether any given conformation is α-helical or not. Finally, we use the HBs alignment score to calculate the conformational backbone energy of any given α-helix. In the 2^nd part we explain how we organized the PDB data from less ordered helical structures to more ordered helical structures and how the PDB redundancy issue was treated. In the 3^rd part we adjust the base simulation model to conceptually demonstrate how the shape of the residue affects the helical conformation, and how the environment affects the helical shape by applying external electric field, and external mechanical forces. In-house software was developed under C++ and Matlab^TM to perform the simulations and the analysis of the PDB data. Visual Molecular Dynamics (VMD)³⁴ was used for the 3D visualization of molecular structures.

Base simulation model of the α-helix backbone

The base model consists of 30 AA-long polypeptide backbone (without residues) and was used as the basis for the calculations carried out in this study. All the geometrical values for the base model were sampled directly from PDB. The resulting values used for the base model are: constant distances N-Cα = 1.46 Å, Cα-C = 1.52 Å, C-N = 1.33 Å, C-O = 1.23 Å, constant bend angles Cα-C-N = 117°, C-N-Cα = 121°, N-Cα-C = 111°, and a constant dihedral Cα-C-N-Cα = 180°. The values confirmed the proper function of the custom software designed for the sampling and analysis used in this study. The positions of the carbonyl oxygens assumed the Cα, C, O, N backbone atoms on the same plane with equal bend angles Cα-C-O = N-C-O. No hydrogen atoms were included in the base model.

Conformational sweep

Conformational sweep is an important method used in this study to evaluate properties of the α-helical conformation. If P is some property of interest that is a function of the α-helical conformation, then by using conformational sweep over a binned space (φ, ψ space for instance) we perform the calculation of P on every binned point within the space of interest and get a resulting map P (space of interest).

Scoring the α-helical hydrogen bond alignment

HBs play a key role in the formation of the α-helical shape^{1,21,35,36,37,38,39}. Thus, the alignment score of HBs is used to analytically determine the quality of some given α-helical polypeptide segment by using the following scoring function:

where s_HO = max(0, 1 − (dst(HO) − 1.9)²/1.2²) is the harmonic sub-score of the H-O distance with H being the hydrogen of the i + 4^th residue nitrogen and O being the i^th residue oxygen. The optimal distance of 1.9 Å and a fluctuation range of ±0.6 Å were picked for the harmonic function. s_OCNH = max(0, OC·NH) is the angular alignment sub-score of the OC normal of the i^th residue with the NH normal of the i + 4^th residue. s_OCHO = max(0, OC·HO) is the angular alignment sub-score of the OC normal of the i^th residue with HO normal, where H is the hydrogen of the i + 4^th nitrogen and O is the i^th oxygen. s_HAND equals to 1 for right handed α-helices and 0 otherwise. All the sub-scores s_HO, s_OCNH, s_OCHO, and s_HAND range from 0 to 1 thus the total score s range from 0 to 1, with s = 0 meaning no HB alignment detected and s = 1 meaning the best alignment detected. The total score 0 ≤ S ≤ 1 for a given polypeptide chain measures the mean alignment of all possible i^th to i + 4^th HBs, such that S = 〈s〉.

Calculation of (ρ, ϑ) given (φ, ψ)

Given (φ, ψ) we first calculate the Cartesian coordinates of every atom of the base model. To calculate the number of residues per turn for the base model, we evaluate the rotation angle r_i,i+1 between every two consecutive Cα’s and calculate the cumulative rotation angle as for N residues. Next we calculate the amount of residues per turn as ρ = 2πN/R. To calculate ϑ, we first calculate the helix direction vector as , where Cα_i is the position of i^th Cα on the Cartesian space. Next, we find the helix normal n_H = H/||H||. Letting C_i and O_i be the Cartesian positions of the i^th residue carbonyl carbon and oxygen atoms, respectively, we find the vector Q_i which is pointing from the nearest point on the helix vector toward C_i, and accordingly its normal q_i = Q_i/||Q_i||. Finally, we calculate , where ϑ_i = a sin (−q_i·n_COi), and n_COi = CO_i/||CO_i|| is the normal of the CO vector of i^th residue given by CO_i = C_i − O_i.

Conformational energy of the α-helix backbone

The optimal conformation according to the scoring function illustrated in Fig. 1e is evaluated at ρ = 3.62 [Res/Turn] and ϑ = 1.1°. The non-zero value of ϑ is due to the binning of the (ρ, ϑ) space and the theoretical expected value of ϑ for the optimal conformation should be ϑ = 0°. To find the optimal α-helix conformation with the inclusion of steric energy we introduced the vdW interactions in the form of Lennard-Jones (LJ) 12–6 potential to the HB energy. Since the value of HB is usually in the range of 2 to 6 [kCal/mole]^30,38,39,40, we used a value of 4 [kCal/mole] in our numerical calculations and expressed the HB energy as E_HB = −4S [kCal/mole]. In addition, we used OPLS force field^41,42 parameters with geometric mixing rules for the calculation of the vdW backbone energy: σ(N, CA, C, O) = (3.25, 3.5, 3.75, 2.96) [Å], and ε(N, CA, C, O) = (0.17, 0.066, 0.105, 0.21) [kCal/mole].

PDB data sampling at four filtering levels

PDB is a worldwide rapidly growing database of biological structures that is open to public access^43,44. It includes proteins acquired by different techniques with X-RAY and NMR protein structure acquisition techniques used in 99% of the cases. To date, more than 100 K empirically determined structures are available on PDB. Transitional Ramachandran (φ, ψ) pairs of the common 400 AA transitions found in PDB were sampled and filtered according to four levels: Level 0 filter checks whether the (φ, ψ) pair is found within a predefined window where −100° < φ < −20° and −80° < ψ < 0°. Level 1 filter checks whether the given residue pair is found within α-helical regions as determined within the PDB file usually with PROMOTIF based on DSSP³⁷. Level 2 filter is the custom filter based on Equation 3 that checks the satisfactory of HBs alignment with a weak threshold of S ≥ 0.01. Level 3 filter is the same custom filter with a stronger threshold of S ≥ 0.5. The four filtering levels are of increasing levels of order, where Level 0 filtering is the less ordered and might even include non-helical conformations. Level 1 filtering includes only α-helical regions but with possible kinks and other types of deformations. Level 2 filtering includes a subset of conformations that must satisfy HB alignment criterion with a weak threshold. Level 3 filtering is the most ordered filtering criterion with a strong HB alignment threshold. The transitional conformation (φ, ψ) pairs were sampled into 400 2D histograms.

PDB dataset resolution and redundancy treatment

In this study we sampled all of the available conformational data found in PDB for α-helices. We did not introduce any resolution limit because of the stochastic nature of the sampled data in PDB, and since we believe that important conformational data might be found even in low resolution measurements⁴⁵. To reduce the effects of PDB redundancy we applied a logarithmic function on the resulting distributions. In addition, the distribution of every transition was normalized by its area, such that the sum of all the possible conformations for a given transition equals to 1.

Conceptual simulation 1: residues with near bulkiness

The goal of this simulation is to demonstrate how bulky groups that are close to the α-helix backbone affect the conformation of the α-helical structure. We introduced virtual residues to every Cα on the backbone of the base model. The virtual residue is similar to ALA residue, but is attached perpendicular to the α-helix backbone while ignoring the hydrogen at the Cα position as is found in real residues. This was done to simulate the average space covered by the many possible conformations of any given residue and to allow a conceptual analysis of what happens to the α-helix when the residue is inflated. The virtual residue was maintained at a constant distance of 1.54 Å from the Cα in one case while the vdW radius (σ) of the virtual residue was changed from 3.5 Å to 9.5 Å with steps of 0.5 Å. The rationale behind keeping constant distance between the virtual residue and the Cα is to test how bulkiness at positions close to the α-helix backbone affects the conformation of the α-helix, as in the case of ILE, VAL, and THR.

Conceptual simulation 2: residues with far bulkiness

Here we repeat the same simulation described in Conceptual Simulation 1 but with an increased bond length of the virtual residue. We increased the distance of the virtual residue from the Cα and maintained it at σ − 1.96 Å. The rationale behind increasing the distance between the virtual residue and the Cα is to test how bulky residues like PHE (F), TYR (Y), and TRP (W) affect the conformation of the α-helix.

Conceptual simulation 3: external electric field

The goal of this simulation is to demonstrate the dependency of the α-helical conformation on external electric field with cylindrical symmetry. To achieve the goal we applied an electric field with its zero set at the center of the α-helix and with an increasing linear potential extending outwards. The contribution of the electric potential energy was calculated as: E_Electric = E_CO + E_NH, where E_CO = −0.5·D_Helix-O and E_NH = 0.33 · D_Helix-H. D_Helix-O is the distance of the oxygen atom of the backbone carbonyl group from the α-helix center, and D_Helix-H is the distance of the estimated hydrogen atom of the backbone amide group from the α-helix center. The estimation of the hydrogen position assumed N-H distance of 0.98 Å (sampled from PDB), that the backbone atom positions of C, N, H, Cα are on the same plane, and equal bend angles C-N-H = Cα-N-H. The coefficients −0.5 and 0.33, are the partial electric charges of the backbone carbonyl oxygen and of the backbone amide hydrogen, respectively, according to OPLS.

Conceptual simulation 4: external mechanical force

The goal of this simulation is to demonstrate the dependency of the α-helical conformation on homogeneous external stretching/squeezing forces that act on the α-helix residues from the α-helix center outwards for the stretching case, and towards the α-helix center in the case of squeezing. Other mechanical force that can act on the α-helix residues can be translated to an effective stretch/squeeze force. The contribution to the total energy of the applied mechanical force was calculated as: dE_Stretch = −D_Helix-CA, where D_Helix-CAis the distance of the backbone Cα from the α-helix center.

Additional Information

How to cite this article: Haimov, B. and Srebnik, S. A Closer Look into the α-Helix Basin. Sci. Rep. 6, 38341; doi: 10.1038/srep38341 (2016).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Pauling, L., Corey, R. B. & Branson, H. R. The structure of proteins: two hydrogen-bonded helical configurations of the polypeptide chain. Proc. Natl. Acad. Sci. 37, 205–211 (1951).
Article ADS CAS PubMed PubMed Central Google Scholar
Pauling, L. & Corey, R. B. Atomic coordinates and structure factors for two helical configurations of polypeptide chains. Proc. Natl. Acad. Sci. 37, 235–240 (1951).
Article ADS CAS PubMed PubMed Central Google Scholar
Pauling, L. & Corey, R. B. Configurations of polypeptide chains with favored orientations around single bonds two new pleated sheets. Proc. Natl. Acad. Sci. 37, 729–740 (1951).
Article ADS CAS PubMed PubMed Central Google Scholar
Murzin, A. G., Brenner, S. E., Hubbard, T. & Chothia, C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247, 536–540 (1995).
CAS PubMed Google Scholar
Rosenbaum, D. M., Rasmussen, S. G. & Kobilka, B. K. The structure and function of G-protein-coupled receptors. Nature 459, 356–363 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Dill, K. A. & MacCallum, J. L. The protein-folding problem, 50 years on. Science 338, 1042–1046 (2012).
Article ADS CAS PubMed Google Scholar
Khoury, G. A., Smadbeck, J., Kieslich, C. A. & Floudas, C. A. Protein folding and de novo protein design for biotechnological applications. Trends Biotechnol. 32, 99–109 (2014).
Article CAS PubMed Google Scholar
Ramachandran, G. N., Ramakrishnan, C. & Sasisekharan, V. Stereochemistry of polypeptide chain configurations. J. Mol. Biol. 7, 95–99 (1963).
Article CAS PubMed Google Scholar
Subramanian, E. GN Ramachandran. Nat. Struct. Mol. Biol. 8, 489–491 (2001).
Article CAS Google Scholar
Popov, E. Quantitative approach to conformations of proteins. Int. J. Quantum Chem. 16, 707–737 (1979).
Article CAS Google Scholar
Jha, A. K. et al. Helix, sheet, and polyproline II frequencies and strong nearest neighbor effects in a restricted coil library. Biochemistry (Mosc.) 44, 9691–9702 (2005).
Article CAS Google Scholar
Ting, D. et al. Neighbor-dependent Ramachandran probability distributions of amino acids developed from a hierarchical Dirichlet process model. PLoS Comput Biol 6, e1000763 (2010).
Article MathSciNet PubMed PubMed Central CAS Google Scholar
Lovell, S. C. et al. Structure validation by Cα geometry: φ, ψ and Cβ deviation. Proteins Struct. Funct. Bioinforma. 50, 437–450 (2003).
Article CAS Google Scholar
Ho, B. K., Thomas, A. & Brasseur, R. Revisiting the Ramachandran plot: Hard-sphere repulsion, electrostatics, and H-bonding in the α-helix. Protein Sci. 12, 2508–2522 (2003).
Article CAS PubMed PubMed Central Google Scholar
Scott, R. A. & Scheraga, H. A. Conformational Analysis of Macromolecules. III. Helical Structures of Polyglycine and Poly-L-Alanine. J. Chem. Phys. 45, 2091–2101 (1966).
Article ADS CAS Google Scholar
Leach, S., Némethy, G. & Scheraga, H. A. Computation of the sterically allowed conformations of peptides. Biopolymers 4, 369–407 (1966).
Article CAS PubMed Google Scholar
Ooi, T., Scott, R. A., Vanderkooi, G. & Scheraga, H. A. Conformational Analysis of Macromolecules. IV. Helical Structures of Poly-L-Alanine, Poly-L-Valine, Poly-β-Methyl-L-Aspartate, Poly-γ-Methyl-L-Glutamate, and Poly-L-Tyrosine. J. Chem. Phys. 46, 4410–4426 (1967).
Article ADS CAS PubMed Google Scholar
Scheraga, H. A. Calculations of conformations of polypeptides. Adv. Phys. Org. Chem. 6, 103–184 (1968).
CAS Google Scholar
Myers, J. K., Pace, C. N. & Scholtz, J. M. Helix propensities are identical in proteins and peptides. Biochemistry (Mosc.) 36, 10923–10929 (1997).
Article CAS Google Scholar
Pace, C. N. & Scholtz, J. M. A helix propensity scale based on experimental studies of peptides and proteins. Biophys. J. 75, 422–427 (1998).
Article CAS PubMed PubMed Central Google Scholar
Porter, L. L. & Rose, G. D. Redrawing the Ramachandran plot after inclusion of hydrogen-bonding constraints. Proc. Natl. Acad. Sci. 108, 109–113 (2011).
Article ADS CAS PubMed Google Scholar
Walsh, S. T. et al. The hydration of amides in helices; a comprehensive picture from molecular dynamics, IR, and NMR. Protein Sci. 12, 520–531 (2003).
Article CAS PubMed PubMed Central Google Scholar
Garcia, A. E. & Sanbonmatsu, K. Y. α-Helical stabilization by side chain shielding of backbone hydrogen bonds. Proc. Natl. Acad. Sci. 99, 2782–2787 (2002).
Article ADS CAS PubMed PubMed Central Google Scholar
Scheraga, H. A., Vila, J. A. & Ripoll, D. R. Helix–coil transitions re-visited. Biophys. Chem. 101, 255–265 (2002).
Article PubMed Google Scholar
Vila, J. A., Ripoll, D. R. & Scheraga, H. Physical reasons for the unusual α-helix stabilization afforded by charged or neutral polar residues in alanine-rich peptides. Proc. Natl. Acad. Sci. 97, 13075–13079 (2000).
Article ADS CAS PubMed PubMed Central Google Scholar
Ripoll, D. R., Vila, J. A. & Scheraga, H. A. Folding of the villin headpiece subdomain from random structures. Analysis of the charge distribution as a function of pH. J. Mol. Biol. 339, 915–925 (2004).
Article CAS PubMed Google Scholar
Shoemaker, K. R., Kim, P. S., York, E. J., Stewart, J. M. & Baldwin, R. L. Tests of the helix dipole model for stabilization of α-helices. Nature 326, 563–567 (1987).
Article ADS CAS PubMed Google Scholar
Baker, E. G. et al. Local and macroscopic electrostatic interactions in single α-helices. Nat. Chem. Biol. 11, 221–228 (2015).
Article CAS PubMed PubMed Central Google Scholar
Flory, P., Volkenstein, M. & others. Statistical mechanics of chain molecules. (Wiley Online Library, 1969).
Pappu, R. V., Srinivasan, R. & Rose, G. D. The Flory isolated-pair hypothesis is not valid for polypeptide chains: implications for protein folding. Proc. Natl. Acad. Sci. 97, 12565–12570 (2000).
Article ADS PubMed PubMed Central Google Scholar
Keskin, O., Yuret, D., Gursoy, A., Turkay, M. & Erman, B. Relationships between amino acid sequence and backbone torsion angle preferences. Proteins Struct. Funct. Bioinforma. 55, 992–998 (2004).
Article CAS Google Scholar
Zaman, M. H., Shen, M.-Y., Berry, R. S., Freed, K. F. & Sosnick, T. R. Investigations into sequence and conformational dependence of backbone entropy, inter-basin dynamics and the Flory isolated-pair hypothesis for peptides. J. Mol. Biol. 331, 693–711 (2003).
Article CAS PubMed Google Scholar
Baldwin, R. L. & Zimm, B. H. Are denatured proteins ever random coils? Proc. Natl. Acad. Sci. 97, 12391–12392 (2000).
Article ADS CAS PubMed PubMed Central Google Scholar
Humphrey, W., Dalke, A. & Schulten, K. VMD: visual molecular dynamics www.ks.uiuc.edu/Research/vmd. J. Mol. Graph. 14, 33–38 (1996).
Article CAS PubMed Google Scholar
Hagler, A., Huler, E. & Lifson, S. Energy functions for peptides and proteins. I. Derivation of a consistent force field including the hydrogen bond from amide crystals. J. Am. Chem. Soc. 96, 5319–5327 (1974).
Article CAS PubMed Google Scholar
Hagler, A. & Lifson, S. Energy functions for peptides and proteins. II. Amide hydrogen bond and calculation of amide crystal properties. J. Am. Chem. Soc. 96, 5327–5335 (1974).
Article CAS PubMed Google Scholar
Kabsch, W. & Sander, C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983).
Article CAS PubMed Google Scholar
Ben-Naim, A. The role of hydrogen bonds in protein folding and protein association. J. Phys. Chem. 95, 1437–1444 (1991).
Article CAS Google Scholar
Fabiola, F., Bertram, R., Korostelev, A. & Chapman, M. S. An improved hydrogen bond potential: impact on medium resolution protein structures. Protein Sci. 11, 1415–1423 (2002).
Article CAS PubMed PubMed Central Google Scholar
Voegler Smith, A. & Hall, C. K. α-Helix formation: Discontinuous molecular dynamics on an intermediate-resolution protein model. Proteins Struct. Funct. Bioinforma. 44, 344–360 (2001).
Article CAS Google Scholar
Jorgensen, W. L. & Tirado-Rives, J. The OPLS [optimized potentials for liquid simulations] potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin. J. Am. Chem. Soc. 110, 1657–1666 (1988).
Article CAS PubMed Google Scholar
Jorgensen, W. L., Maxwell, D. S. & Tirado-Rives, J. Development and testing of the OPLS all-atom force field on conformational energetics and properties of organic liquids. J. Am. Chem. Soc. 118, 11225–11236 (1996).
Article CAS Google Scholar
Berman, H. M. et al. www.rcsb.org The protein data bank. Nucleic Acids Res. 28, 235–242 (2000).
Article ADS CAS PubMed PubMed Central Google Scholar
Berman, H., Henrick, K., Nakamura, H. & Markley, J. L. The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res. 35, D301–D303 (2007).
Article CAS PubMed Google Scholar
Kosloff, M. & Kolodny, R. Sequence-similar, structure-dissimilar protein pairs in the PDB. Proteins Struct. Funct. Bioinforma. 71, 891–902 (2008).
Article CAS Google Scholar

Download references

Acknowledgements

This work was funded in part by the Israel Science Foundation Grants No. 318/11 and 265/16. B.H. dedicates this contribution to the memory of the grandfather and the mentor Kano Yakubov (1936–2016).

Author information

Authors and Affiliations

Russell Berrie Nanotechnology Institute, Technion - Israel Institute of Technology, Haifa, 32000, Israel
Boris Haimov & Simcha Srebnik
Department of Chemical Engineering, Technion - Israel Institute of Technology, Haifa, 32000, Israel
Simcha Srebnik

Authors

Boris Haimov
View author publications
You can also search for this author in PubMed Google Scholar
Simcha Srebnik
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.H. and S.S. wrote the main manuscript text. B.H. prepared all figures. Both authors reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplementary Dataset 1

Supplementary Dataset 2

Supplementary Movie 1

Supplementary Movie 2

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Haimov, B., Srebnik, S. A closer look into the α-helix basin. Sci Rep 6, 38341 (2016). https://doi.org/10.1038/srep38341

Download citation

Received: 05 October 2016
Accepted: 08 November 2016
Published: 05 December 2016
DOI: https://doi.org/10.1038/srep38341

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.