Extension of the classical classification of β-turns

de Brevern, Alexandre G.

doi:10.1038/srep33191

Article
Open access
Published: 15 September 2016

Alexandre G. de Brevern^1,2,3,4

Scientific Reports volume 6, Article number: 33191 (2016) Cite this article

Abstract

The functional properties of a protein primarily depend on its three-dimensional (3D) structure. These properties have classically been assigned, visualized and analysed on the basis of protein secondary structures. The β-turn is the third most important secondary structure after helices and β-strands. β-turns have been classified according to the values of the dihedral angles φ and ψ of the central residue. Conventionally, eight different types of β-turns have been defined, whereas those that cannot be defined are classified as type IV β-turns. This classification remains the most widely used. Nonetheless, the miscellaneous type IV β-turns represent 1/3^rd of β-turn residues. An unsupervised specific clustering approach was designed to search for recurrent new turns in the type IV category. The classical rules of β-turn type assignment were central to the approach. The four most frequently occurring clusters defined the new β-turn types. Unexpectedly, these types, designated IV₁, IV₂, IV₃ and IV₄, represent half of the type IV β-turns and occur more frequently than many of the previously established types. These types show convincing particularities, in terms of both structures and sequences that allow for the classical β-turn classification to be extended for the first time in 25 years.

Introduction

The functional properties of a protein primarily depend on its three-dimensional (3D) structure. These properties have classically been assigned, visualized and analysed on the basis of protein secondary structures, which are composed of repetitive parts (α-helices¹ represent 1/3^rd of residues, and β-strands² represent 1/5^th of residues) connected by coils³. This simplification of 3D structure into a unidimensional representation of secondary structure is often regarded as a resolved question. In fact, this simplification conceals the difficulty of precisely defining and assigning repetitive structures⁴, thus explaining the large number of alternative assignment approaches^{5,6,7,8,9,10,11}. For instance, comparison of different approaches emphasizes their major discrepancies^12,13. Another limitation of this type of simplification is that the coil state is neglected, although it represents almost 50% of all residues and a large set of distinct local protein structures. Loop analyses cannot provide a complete representation of the coil state because their classification is usually limited to 8 residues^{4,14,15,16,17}. More precise descriptions are needed to comprehensively describe their diversity.

Helical and extended regions are the most frequently occurring repetitive structures. However, two other local protein conformations have also been characterized: the polyproline II helix and turns. The former is a left-handed helical structure with an overall shape resembling a triangular prism. It represents 5% of all protein residues¹⁸, contributes to coiled coil super secondary structure formation and is present in fibrous proteins^19,20. Because polyproline II helices do not have strong hydrogen bond patterns, they have not been studied in as much detail as the other local conformations^{21,22,23,24,25,26}.

Turns comprise n consecutive residues (denoted i to i+n−1), in which the distance between the Cα atoms of residues i and i+n−1 must be smaller than 7 Å (or 7.5 Å, according to some authors^27,28). The turns are composed of γ-turns (n = 3)^29,30, β-turns (n = 4), α-turns (n = 5)^31,32 and π-turns (n = 6)^33,34. The restrictive distance between Cαs imposes a particular geometry on the backbone, thereby causing it to turn back on itself.
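The distance criterion can be illustrated with a minimal sketch for the β-turn case, where the two end residues are i and i+3. The function names and the coordinate layout (one (x, y, z) tuple per residue, in Å) are assumptions made for this illustration and are not taken from the original work.

```python
import math

def ca_distance(a, b):
    """Euclidean distance (in Å) between two Cα positions given as (x, y, z)."""
    return math.dist(a, b)

def is_beta_turn_candidate(ca_coords, i, cutoff=7.0):
    """Return True if Cα(i) and Cα(i+3) are closer than `cutoff` Å.

    Only the distance rule is checked here; the additional rules on
    helical and strand residues are not part of this sketch.
    """
    if i < 0 or i + 3 >= len(ca_coords):
        return False
    return ca_distance(ca_coords[i], ca_coords[i + 3]) < cutoff

# Hypothetical coordinates: a chain that folds back on itself, and an extended one.
folded = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (5.0, 3.0, 0.0), (3.0, 4.0, 0.0)]
extended = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
print(is_beta_turn_candidate(folded, 0), is_beta_turn_candidate(extended, 0))
```

Passing cutoff=7.5 gives the looser threshold used by some authors.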

β-turns have been the most analysed among the turn conformations. Apart from the distance between Cαs, a second rule applies to the characterization of their secondary structure; because helices can easily be confused with a succession of turns, the central residues of β-turns, i.e., i+1 and i+2, should not be helical. Similarly, β-turn residues must not consist solely of β-strand residues. β-turns have been classified according to the values of their central residue dihedral angles, φ and ψ. A deviation of ± 30° from these canonical values is allowed on 3 of these angles, whereas the fourth can deviate by ± 45°³⁵.

The β-turns, as defined by C.M. Venkatachalam, are characterized by a hydrogen bond between the N-H and C = O of residues i and i+3³⁶. Venkatachalam has also defined types I, II, and III, and their corresponding mirror image types, I’, II’ and III’³⁶. Crawford and collaborators have proposed a more strict definition in terms of distance³⁷. Lewis and co-workers have added types V and V’. β-turn type VI is characterized by the presence of a proline; type VII is associated with a kink; and type IV corresponds to all other non-classified β-turns³⁸. Different turns have been excluded for various reasons: β-turns III and III’ are too close to the 3₁₀-helix and types I and I’, whereas turns V, V’ and VII are rare, and their definitions are inaccurate³⁵. Type VI is divided into 2 sub-types, VI_a and VI_b. Hutchinson and Thornton³⁹ have divided type VI_a into the 2 sub-types VI_a1 and VI_a2. Wilmot and Thornton have precisely defined type VIII⁴⁰, which is based on Richardson’s type I_b and was proposed after the removal of type VII³⁵. The definitions used by Thornton’s group^39,41 are currently considered to be the standard (see Supplementary Information 1)⁴². The β-turn assignment program PROMOTIF assigns β-turns on the basis of these standards⁴³. Studies have shown that repetitive structure assignment approaches have a direct effect on decreasing or increasing the number of residues associated with β-turns^27,28.

The difficulty with using such an approach is the ‘strict’ rule(s) used to define the β-turn types. Efimov has used a Ramachandran plot simplified to 6 and 8 regions: β (β_E and β_P), γ, δ, α, ε and α_L (α_L and γ_L). This rough clustering allows various classes to be defined, with some being associated with amino acid specific behaviours. The turns are also divided into full turns (with a polypeptide chain reversal of 180°) and half turns (with a polypeptide chain angle of 90°). The first category represents 7 major clusters, and the second one represents 8 major clusters^44,45. This system has widely been used to define super-secondary elements^46,47 and structural trees of protein superfamilies^48,49,50. In a similar way, Wilmot and Thornton have also used a simplification of the Ramachandran plot for the following 6 major regions: β_E, β_P, α_R, ε, α_L and γ_L⁵¹. They observed 12 combinations in their dataset. The most frequent turns were easily detected, whereas the two most interesting non-classical turns were β_E → γ_L (8%) and γ_L → α_R (4%). The 6 other clusters represented only 1% each⁵¹.

More recently, Koch and Klebe have proposed a combination of turns of different lengths ranging from 3 to 6 residues; the turns sometimes overlap, thus leading to complex categorizations⁵². Koch and Klebe trained a very large modified Self-Organizing Map^53,54 and extracted new types from the map. The assignment is provided as part of Secbase, an extension module of Relibase⁵⁵. Koch and Klebe have used the identified new types in a second step to perform a prediction from the sequence⁵⁶. This approach is innovative, but it has not been implemented as a web tool and is therefore less used. George Rose’s group has conducted research with a focus on the rationalization of two-, three-, and four-residue turn conformations found in their coil library⁵⁷. Rose’s group has defined 12 categories and has used them in Monte-Carlo simulations. These categories cover at least 90% of coil library fragments ranging from 5- to 20-residues, thus indicating that longer fragments are composites of shorter ones⁵⁸. Rose’s group has extended this approach to redraw the Ramachandran plot⁵⁹.

However, none of these approaches has succeeded in superseding the classical definition of β-turns^35,36,41,43. A major shortcoming of past β-turn classification concerns the classification of type IV β-turns, i.e., the miscellaneous category, because it represents 1/3^rd of β-turn residues and is the second most common type of β-turn. To locate potentially new recurrent conformations in this miscellaneous type, an automatic clustering approach based on the rules of β-turn assignment was designed. It is related to Self-Organizing Maps^53,54 and takes into account the specificity of β-turn assignment rules. All type IV β-turns were clustered. The four most occurring clusters were chosen as new types and analysed. Unexpectedly, these sub-types, denoted IV₁, IV₂, IV₃ and IV₄, represent half of the type IV β-turns and occur more frequently than many of the classical types.

Methods

Data sets

To remove representative bias regarding protein resolution or sequence identity, non-redundant datasets were used. These datasets were generated using the PISCES database⁶⁰. As previously performed in^12,61, 10 sets of proteins were defined. Each contained no more than x% pairwise sequence identity (with x ranging from 20 to 90%). The selected chains had X-ray crystallographic resolutions less than 1.6 Å or 2.5 Å and R-factors less than 0.25 or 1.0. They comprised between 2,542 and 23,943 protein chains. Each chain was automatically examined with geometric criteria to avoid bias from zones with missing density. The main purpose of such diversity was to examine (i) the poorly populated turns and (ii) the stability of the clustering approach (see below).

Secondary structure assignment

Secondary structure assignment was performed with DSSP⁵ (CMBI version 2000) using the default parameters. DSSP yields more than three states, so we reduced them to the following: the α-helix, containing α, 3₁₀ and π-helices; the β-strand, containing only the β-sheet; and the coil, comprising everything else (β-bridge, hydrogen bond turn, bend, and coil). Turn assignment was performed as described previously^27,28,36 using the following classical rules: the distance between residues i and i+3 should be less than 7 Å; the central residues of the turns must be non-helical; and in the case of strands, at least one residue must be associated with a coil. The types of turns (I, I’, II, II’, VI_a1, VI_a2, VI_b and VIII) were assigned according to the classical definition by using the φ and ψ dihedral angles of the central residues (see Supplementary Information 1). The turns were required to be less than 30° from the canonical values (at most one angle was allowed to deviate by +/− 45°)⁴³. Types VI_a1, VI_a2 and VI_b were characterized by a cis-proline at position i+2. Turns that did not fit any of the above criteria were classified as type IV^39,43. The turns were also classified into two classes according to their function as described by Efimov^44,45: full turns resulting in a chain reversal of 180° and half turns that change the polypeptide chain direction by approximately 90°. This methodology was used to enable comparisons with previous studies.
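The type assignment rule can be sketched as follows. The canonical (φ_i+1, ψ_i+1, φ_i+2, ψ_i+2) values in the table are the ones commonly quoted for the Hutchinson and Thornton definitions; the values actually used in this study are given in Supplementary Information 1, which is not reproduced here, so they should be read as an assumption. The function names are hypothetical.

```python
# Canonical dihedral angles (phi_i+1, psi_i+1, phi_i+2, psi_i+2), in degrees.
# ASSUMPTION: commonly quoted values, not copied from Supplementary Information 1.
CANONICAL = {
    "VIa1": (-60, 120, -90, 0),
    "VIa2": (-120, 120, -60, 0),
    "VIb": (-135, 135, -75, 160),
    "I": (-60, -30, -90, 0),
    "I'": (60, 30, 90, 0),
    "II": (-60, 120, 80, 0),
    "II'": (60, -120, -80, 0),
    "VIII": (-60, -30, -120, 120),
}
CIS_PRO_TYPES = {"VIa1", "VIa2", "VIb"}

def angle_diff(a, b):
    """Smallest absolute difference between two angles, in degrees."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)

def fits(angles, canonical, tol=30.0, loose_tol=45.0):
    """All angles within `tol`, except at most one that may reach `loose_tol`."""
    devs = [angle_diff(a, c) for a, c in zip(angles, canonical)]
    return max(devs) <= loose_tol and sum(d > tol for d in devs) <= 1

def assign_turn_type(angles, cis_pro_at_i2=False):
    """Return the first matching type, or 'IV' when no definition fits."""
    for name, canonical in CANONICAL.items():
        if name in CIS_PRO_TYPES and not cis_pro_at_i2:
            continue  # type VI sub-types require a cis-proline at i+2
        if fits(angles, canonical):
            return name
    return "IV"

print(assign_turn_type((-65, -25, -130, 10)))   # one angle beyond 30 degrees
print(assign_turn_type((-100, -30, -130, 0)))   # two angles beyond 30 degrees
```

The sketch checks the definitions in a fixed order and returns the first hit; with the tolerances above, the listed canonical values do not overlap, so the order only matters for the cis-proline test.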

Protein Blocks

Protein Blocks (PBs^62,63) corresponded to a set of 16 local prototypes, labelled from a to p, of 5 residue length that were described on the basis of dihedral angles (φ, ψ). The PBs were obtained with an unsupervised classifier similar to Kohonen Maps⁵⁴ and hidden Markov models⁶⁴. The PBs m and d are prototypes for the central regions of α-helix and β-strands respectively. PBs a through c primarily represent the N-cap of a β-strand, whereas e and f correspond to the C-caps; PBs g through j are specific to coils, PBs k and l correspond to the N cap of an α-helix, and PBs n through p correspond to C-caps. PBs were assigned by using in-house Python software, although similar assignment can be performed through the PBE web server⁶⁵ or PBxplore (https://github.com/pierrepo/PBxplore⁶⁶).

Specific clustering approach

A specific clustering approach was designed to cluster type IV β-turns by using the classical rule, which allows a deviation of +/− 30° from the defined values for all angles except one, for which +/− 45° is allowed. The clustering was derived from Self-Organizing Maps (SOM, without diffusion between the clusters^53,54). The training was carried out in 2 successive parts; the first one limited the potential bias of initialization, and the second refined the clustering by using the specific rules for β-turn types. The type IV β-turns were selected from a dataset D. Thus, each dataset was associated with T type IV β-turns.

Step one:

1. k clusters were created and were vectors v of length 2M = 4, representing the dihedral angles (φ_i+1, ψ_i+1, φ_i+2, and ψ_i+2). k type IV β-turns were taken randomly to initialize the clusters.

2. One of the T type IV β-turns was randomly selected from the dataset D (denoted V₂) and compared with each of the k clusters.

The dissimilarity measure between two vectors V₁ (representing the clusters) and V₂ of dihedral angles was defined as the Euclidean distance among the M links, the RMSDA (root mean square deviations on angular values⁶⁷):

$$\mathrm{RMSDA}(V_1, V_2) = \sqrt{\frac{\sum_{i=1}^{M}\left\{\left[\Phi_i(V_1)-\Phi_i(V_2)\right]^2+\left[\Psi_i(V_1)-\Psi_i(V_2)\right]^2\right\}}{2M}}$$

where {Φ_i(V₁), Ψ_i(V₁)} (resp. {Φ_i(V₂), Ψ_i(V₂)}) denotes the series of the (2M) dihedral angles for V₁ (resp. V₂). The angle differences were computed modulo 360°. Thus, in the training, this distance was used for assessing the dissimilarity of any fragment in the database with the different clusters.

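3. The minimal RMSDA value was used to define the winning cluster W, i.e., the closest to the observation. W values were modified according to the learning coefficient α:

$$\Phi_j(V_w)(t+1) = \Phi_j(V_w)(t) + \alpha(t)\left[\Phi_j(V_2) - \Phi_j(V_w)(t)\right]$$

$$\Psi_j(V_w)(t+1) = \Psi_j(V_w)(t) + \alpha(t)\left[\Psi_j(V_2) - \Psi_j(V_w)(t)\right]$$

where Φ_j(V_w) and Ψ_j(V_w) are the values of the winner at time t, with j ranging from 1 to 2, and Φ_j(V₂) and Ψ_j(V₂) are the corresponding values of the real data (i.e., the dihedral angles of residues i+1 and i+2, modulo 360°).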

The decrease of α was performed similarly to that for SOM^53,54. T represents the total amount of data to learn (here the number of type IV β-turns), and t represents the number of β-turns already used. The process then goes back to step 2. One cycle of training corresponds to the learning of the whole dataset; α is then equal to α₀/2; after 5 cycles, it is equal to α₀/5, etc. Initially, α₀ = 0.35, as in^68,69.

The process was iterated for 20 cycles, i.e., 20 times T; these steps were important to diminish the potential effect of the initialization.
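Step one can be summarised in a short sketch: RMSDA as the dissimilarity, a winner-takes-all update without diffusion, and a decreasing learning coefficient. The decay α = α₀ / (1 + t/T) is an assumption chosen because it gives α₀/2 after one cycle; the exact schedule is not shown in the text. All function names are hypothetical and this is not the author's code.

```python
import math
import random

def signed_diff(a, b):
    """Signed difference a - b in degrees, wrapped into (-180, 180]."""
    d = (a - b) % 360.0
    return d - 360.0 if d > 180.0 else d

def wrap(angle):
    """Wrap an angle into the [-180, 180) range."""
    return (angle + 180.0) % 360.0 - 180.0

def rmsda(v1, v2):
    """Root mean square deviation on angular values between two angle vectors."""
    return math.sqrt(sum(signed_diff(a, b) ** 2 for a, b in zip(v1, v2)) / len(v1))

def train_step_one(turns, k, cycles=20, alpha0=0.35, seed=0):
    """Unsupervised first pass over a list of (phi1, psi1, phi2, psi2) tuples."""
    rng = random.Random(seed)
    # Initialise the k clusters with k randomly chosen type IV turns.
    clusters = [list(v) for v in rng.sample(turns, k)]
    total = len(turns)
    for t in range(cycles * total):
        alpha = alpha0 / (1.0 + t / total)  # ASSUMED decay schedule
        obs = rng.choice(turns)
        # Winner: the cluster with the minimal RMSDA to the observation.
        w = min(range(k), key=lambda c: rmsda(clusters[c], obs))
        # Move only the winner towards the observation (no diffusion).
        clusters[w] = [wrap(c + alpha * signed_diff(o, c))
                       for c, o in zip(clusters[w], obs)]
    return clusters
```

A call such as train_step_one(type_iv_turns, k=50) would correspond to the first setting described below, where type_iv_turns is a hypothetical list of angle tuples.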

Step two:

1. The final values of the k clusters were used as initial values. α₀ was still equal to 0.35.

2. One of the T type IV β-turns was randomly selected from the dataset D (denoted V₂) and compared with each of the k clusters. Instead of using only RMSDA, the β-turn rule was used: 3 angles can be at +/− 30° and 1 angle at +/− 45°.

The winner had to satisfy this rule; otherwise, no training was performed.

3. Modification of the winner weights was performed as in item 3 of step one.

4. The process was iterated for 20 cycles.
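One reading of step two is sketched below: the winner is still the cluster with the minimal RMSDA, but it is updated only if the observation satisfies the β-turn tolerance rule with respect to it. Whether the winner is chosen before or after applying the rule is not stated explicitly, so this ordering is an assumption, as are the function names.

```python
import math

def signed_diff(a, b):
    """Signed difference a - b in degrees, wrapped into (-180, 180]."""
    d = (a - b) % 360.0
    return d - 360.0 if d > 180.0 else d

def wrap(angle):
    """Wrap an angle into the [-180, 180) range."""
    return (angle + 180.0) % 360.0 - 180.0

def rmsda(v1, v2):
    """Root mean square deviation on angular values between two angle vectors."""
    return math.sqrt(sum(signed_diff(a, b) ** 2 for a, b in zip(v1, v2)) / len(v1))

def satisfies_turn_rule(obs, cluster, tol=30.0, loose_tol=45.0):
    """Three angles within `tol`; at most one angle may deviate up to `loose_tol`."""
    devs = [abs(signed_diff(o, c)) for o, c in zip(obs, cluster)]
    return max(devs) <= loose_tol and sum(d > tol for d in devs) <= 1

def refine_once(obs, clusters, alpha):
    """Update the winning cluster in place; return its index, or None if skipped."""
    w = min(range(len(clusters)), key=lambda c: rmsda(clusters[c], obs))
    if not satisfies_turn_rule(obs, clusters[w]):
        return None  # the rule is not met: no training for this observation
    clusters[w] = [wrap(c + alpha * signed_diff(o, c))
                   for c, o in zip(clusters[w], obs)]
    return w
```

Observations that match no cluster under the rule leave all clusters untouched, which keeps each cluster centre compatible with the classical tolerance windows.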

An important point is the choice of k. k was first set at 50 and then reduced. The obtained clusters were compared in the order of largest to smallest k values.

Z-score

The amino acid occurrences for each local structure conformation were normalized into a Z-score:

$$Z(n_{i,j}^{obs}) = \frac{n_{i,j}^{obs} - n_{i,j}^{th}}{\sqrt{n_{i,j}^{th}}}$$

where n_{i,j}^{obs} is the observed number of occurrences of amino acid i in position j for a given secondary structure, and n_{i,j}^{th} is the expected number. The product of the occurrences in position j with the frequency of amino acid i in the entire databank equals n_{i,j}^{th}. Positive Z-scores (respectively negative) corresponded to overrepresented amino acids (respectively underrepresented); threshold values of 4.42 and 1.96 were chosen (probability less than 10⁻⁵ and 5 × 10⁻², respectively). The same computation was also performed for the protein blocks.
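As an illustration, the Z-score and its two thresholds can be computed as below. The normalisation by the square root of the expected count is an assumption (it is the usual form for this kind of occurrence statistic), and the function names and label symbols are hypothetical.

```python
import math

def z_score(n_obs, n_position_total, background_freq):
    """Z-score of an observed count against its expected count.

    The expected count is the number of occurrences at the position
    multiplied by the background frequency of the amino acid.
    ASSUMPTION: the deviation is scaled by sqrt(expected).
    """
    n_exp = n_position_total * background_freq
    return (n_obs - n_exp) / math.sqrt(n_exp)

def label(z, strong=4.42, weak=1.96):
    """Map a Z-score to an over/under-representation label."""
    if z >= strong:
        return "++"
    if z >= weak:
        return "+"
    if z <= -strong:
        return "--"
    if z <= -weak:
        return "-"
    return ""

# Hypothetical counts: 150 glycines observed at a position seen 1000 times,
# with a background glycine frequency of 0.1 (expected count: 100).
z = z_score(150, 1000, 0.1)
print(round(z, 2), label(z))
```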

Analysis

Most of the quantitative analysis was performed using in-house Python scripts, and statistics and visualization were performed with R software (version 3.2.2)⁷⁰.

Results and Discussion

Protein structure dataset

The different amino acid datasets showed the expected amino acid and protein block occurrences, with no peculiarities in the rate of redundancy and the resolution quality (see Supplementary Information 2). As noted previously^27,28, the occurrence of β-turns is highly dependent on the way in which the assignment is performed. Following the work of Fuchs and Alix²⁷, we assigned secondary structures to the different protein datasets by using DSSP⁵. The DSSP provided 8 classes that were reduced to 3 classes (helix, strand and coil) or 4 classes (helix, strand, turn and coil, see Supplementary Information 3) for practicality. Helical structures represented more than 37.3% of the residues and the β-sheets represented 22.5%, whereas the remaining coil class covered 42.7% of the residues and included 20.4% of the β-turns (11.9% were turns and 8.5% were bends). Our β-turn assignment in the coil regions provided a slightly different number, with 21.9% being β-turns (difference: 1.5%). In total, 71.8% were similar to the DSSP assignment (45.6% were turns, and 23.0% were bends), whereas 28.1% and 1.9% were associated with coils and bridges, respectively. These proportions were comparable to the results of previous studies^27,28. The β-turn types were then assigned by using classical definitions (described in the methods section, see Supplementary Information 1). Type I β-turns were the most frequent (38.2%), followed by the miscellaneous type IV (31.7%), and types II (11.8%), VIII (9.8%), I’ (4.1%), II’ (2.5%) and the different sub-types of the type VI β-turns (ranging from 0.9 to 0.2%, see Table 1). Henceforth, the type IV β-turns will be denoted type IV^ori to differentiate them from the new types in the current analyses. Figures 1 and 2 show the different types of β-turns in 3D and the distribution of their dihedral angles in the Ramachandran plot^36,71,72.
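The reduction of the DSSP states to three classes, as described in the Methods, can be sketched as a simple mapping. The one-letter codes are the standard DSSP ones (H, G, I for helices; E for strand; B, T, S and blank for the rest); the function name and the output letters H, E and C are choices made for this example.

```python
HELIX_STATES = set("HGI")   # alpha-helix, 3-10 helix and pi-helix
STRAND_STATES = {"E"}       # beta-sheet (extended strand)

def reduce_dssp(states):
    """Reduce a DSSP state string to helix (H), strand (E) and coil (C)."""
    reduced = []
    for s in states:
        if s in HELIX_STATES:
            reduced.append("H")
        elif s in STRAND_STATES:
            reduced.append("E")
        else:
            # beta-bridge (B), hydrogen bond turn (T), bend (S) and coil
            reduced.append("C")
    return "".join(reduced)

print(reduce_dssp("HHGGIIEEBTS -"))
```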

Table 1 β-turn frequencies.

Analyses of discarded types

As a first step, before searching for new types, the previously discarded types were analysed.

Notably, type III and III’ β-turns had been included by Venkatachalam³⁶, but have been discarded because they are considered to be too close to the 3₁₀ helices and to type I (and I’) β-turns. The type V β-turn has been considered to be a rather unusual departure from the type II β-turn (see Figures 35 and 36 of ref. 35). If the type III β-turn were still recognized, it would represent 9.6% of the residues; i.e., it would be the third most frequently occurring type. The obsolete type III’ β-turn represented approximately 1.5% of the turns, whereas the type V and V’ β-turns represented only 0.03 and 0.02%, respectively (see Supplementary Information 4), and were associated with type IV β-turns (see Supplementary Information 5), but they were negligible.

For the type III and III’ β-turns, the overlap with type I and I’ β-turns remained as expected, with 88.7% of the type III β-turns assigned as type I β-turns, 87.6% of the type III’ β-turns assigned as type I’ β-turns (see Supplementary Information 4 and 6), and the remaining 11–12% associated with type IV β-turns. Interestingly, 60% of type I β-turns were also assignable to type III, and 83.9% of type I’ were assignable to type III’ (see Supplementary Information 7). Therefore, the decision to remove this particular definition was clearly reasonable.

Searching for new types

From the above section, it is apparent that nearly 1/3^rd of residues are not associated with a defined type. As presented in the methods section, learning was performed on the type IV β-turns; the clustering was conducted on the basis of dihedral angles with an unsupervised approach similar to the approaches used for protein blocks^62,67. The first step of learning was entirely unsupervised and was performed to properly define the initial values of the clusters, whereas the second step imposed the specific rules of the β-turns (i.e., +/− 30°, with one dihedral angle at +/− 45°).

A major difficulty in every classification approach is the choice of the clusters. Here, it was slightly different; the idea was not to have an optimal number of clusters but to assess the most frequently occurring and recurrent clusters to define the new pertinent types. In related research, Micheletti and collaborators have decided to take the largest cluster each time and iteratively repeat the clustering, each time removing the largest cluster⁷³. This clustering is slightly unstable because each repetition removes a large amount of data. Thus, it did not seem pertinent to use it here. Moreover, with a large initial number of clusters, determining the clusterability of the data was manageable.

The training was performed with different datasets beginning with a large number of clusters (50 at first), which was progressively reduced (to 10). A notable feature of the learning was that four clusters appeared at the beginning and remained the most frequently occurring clusters for each of the different datasets. The deviation in the dihedral angle values between the different simulations (and different datasets) was never higher than 0.3°, thus indicating that the clustering was reasonably stable (a more detailed description is provided in Supplementary Information 8).

The four new type IV β-turn sub-types were named IV₁, IV₂, IV₃ and IV₄. They represent half of the type IV β-turns (see Table 2), composing 16.1, 12.4, 11.2 and 8.5% of the IV^ori type, respectively. With regard to all of the defined types, they were the 4^th, 6^th, 7^th and 8^th most frequent turns (5.1%, 3.9%, 3.5% and 2.7%, respectively). These numbers are reasonable because they were highly consistent across all of the datasets. Figure 3 shows these four new categories. The remaining clusters were not selected because (i) their occurrences were very low (largely less than those of type VI β-turns) and (ii) they were often dependent on the number of clusters (see Supplementary Information 9). They were not useful for either protein structure or sequence–structure relationship analyses. The rest of the type IV β-turns were classified as IV_misc.
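The overall frequencies follow from the shares within IV^ori and the 31.7% share of IV^ori among all β-turns reported above. The short check below only re-does this arithmetic with the rounded percentages quoted in the text; the variable names are arbitrary.

```python
# Share of type IV^ori among all beta-turns, in percent (from the text).
iv_ori_share = 31.7

# Share of each new sub-type within IV^ori, in percent (from the text).
within_iv_ori = {"IV1": 16.1, "IV2": 12.4, "IV3": 11.2, "IV4": 8.5}

# Fraction of IV^ori covered by the four new sub-types.
covered = sum(within_iv_ori.values())

# Share of each new sub-type among all beta-turns.
overall = {name: pct * iv_ori_share / 100.0 for name, pct in within_iv_ori.items()}

print(f"covered: {covered:.1f}% of IV^ori")
for name, pct in overall.items():
    print(f"{name}: {pct:.2f}% of all beta-turns")
```

The four sub-types cover about 48% of IV^ori, and the recomputed overall shares agree with the reported 5.1%, 3.9%, 3.5% and 2.7% to within 0.1 percentage point; the small differences come from working with rounded inputs.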

Table 2 β-turn frequencies.

Table 3 provides the observed angles. Because the clustering approach was based on the specific clustering of type IV, no overlap could be found with the existing types. Figures 4a,b show the relative position of each turn. A relationship was observed between type IV₁ and type II β-turns (see Fig. 4c) and between type IV₂ and VIII β-turns (see Fig. 4d, see Supplementary Information 10). In terms of dihedral angle values, the type IV₁ β-turn resembled a slightly displaced conformation of the type II β-turn, whereas the type IV₂ β-turn appeared to be a less extended type VIII β-turn. Type IV₃ and IV₄ were much more specific, with very particular dihedral angles in the helical regions (see Supplementary Information 11).

Table 3 New β-turns.

New turns in regards to DSSP

To describe the type IV β-turns more precisely, we examined their former DSSP assignments (hydrogen bond estimation) as turns or bends. Interestingly, more than 2/3 of the residues of IV^ori were identified by DSSP as turns, with 35% being bends and 37% being hydrogen bond turns, and the rest were mainly associated with coils and β-sheets. The type IV_misc was more associated with non-hydrogen-bond-stabilized local structures, with a 41% enrichment in bends and 31% fewer hydrogen bond turns. This evolution is mainly associated with the newer and less frequent type IV β-turns (e.g., type IV₃ and IV₄), which comprise 70% and 49% hydrogen bond turns. The evolution was strikingly lower for the type IV₁ β-turn, with less than 30% of residues associated with hydrogen bond turns. Although all the new type IV β-turns were linked to neither α-helices nor β-sheets, type IV₁ β-turns were often observed at the ends of β-sheets (in nearly 2/3 of the cases).

Comparison with previous analyses

As mentioned in the introduction, two major efforts were made in the 1980s and 1990s to define β-turns. Both were based on a Ramachandran plot divided into 6 to 8 large regions. The size and shape of these regions were largely different from the strict rule of +/− 30° (and 45°). Notably, these previous classifications were performed with all turns, whereas in the current analyses the classification was performed on only a subset of type IV β-turns.

Table 4 shows the new turns classified using a Ramachandran plot division scheme similar to that described above. Efimov has proposed a very precise definition of turns and half-turns with 7 and 8 types of turn^44,45. Interestingly, type IV₁ might seem as if it could be characterized as β_Eα_L because it looks like the proposed βαL-half-turn; however, the type IV₁ β-turn is not a half-turn but a complete turn. The type IV₃ β-turn is the only local conformation that can be described as a half-turn, but instead of being a αγ-half-turn, it is mainly α/γ → α. Type IV₄ β-turns can be described as γγ; a similar type has been described in⁴⁵, but here it is mainly a turn, whereas the previously described types were half-turns. In fact, the type IV₂ β-turns were the only ones that seemed to be directly related to Efimov’s analyses, because they could be characterized by a γδ connection between α-helices, as described in⁴⁵. The percentage of turns and half-turns observed correctly correlated with the distance threshold proposed by Crawford and co-workers³⁷.

Table 4 Torsion angle regions taken from Wilmot and Thornton, and Efimov, with turns and half-turn proportions as defined by Efimov and distance in regards to Crawford.

Wilmot and Thornton have also used a simplification of the Ramachandran plot in 6 major regions, with 12 combinations⁵¹. Because the size of the different regions is higher than Efimov’s, the number of types is relatively limited. The region α_R represents the γ, δ and α regions; very diverse conformations were found in type IV₃ and IV₄ β-turns as well as type I β-turns (i.e., α_R → α_R). Type IV₂ β-turns had the same description as type VIII (i.e., α_R → β_E). Interestingly, only two non-classical turns, β_E → γ_L (8%) and γ_L → α_R (4%)⁵¹, were defined by Wilmot and Thornton. One could expect that one of these two types might be associated with the most frequent new turn. However, this was not the case, because the type IV₁ β-turn is not β_E → γ_L, but β_E → α_L.

Hence, these comparisons illustrate that the specific clustering performed in the current analyses highlighted one new main cluster that was not observed previously: the type IV₁ β-turn. Additionally, it showed the specificity of the type IV₃ and IV₄ β-turns in regards to their fine description. The type IV₂ β-turn was the only one to have been clearly characterized previously by both studies^45,51.

Koch and Klebe (KK) used a sophisticated approach to unify the assignment of turns of different lengths⁵². This approach is not easily comparable to others because: (i) it is not based on the classical assignment rules and (ii) all the turns have been re-assigned. Hence, for β-turns, other features were used in the training in addition to the values of the dihedral angles (φ, ψ) of the central residue. Classical and new β-turns were compared to the final definition of the 24 open KK β-turns (7 were considered to be non-turn-like structures) and 18 reverse KK β-turns presented in Supplemental Data S14 and S16 of ref. 52. Owing to the particular learning method, type I’, II and II’ β-turns had no direct equivalent in the KK β-turns, whereas type I, IV₃ and IV₄ β-turns were associated with the KK type I β-turn (18% of the true turns). Type VIII β-turns were associated with the KK type VIII3 β-turn (6.5% of the true turns). Interestingly, type IV₂ β-turns were not associated with any KK β-turn types.

Hence, this comparison between studies indicated some similarities because the major turn (type I β-turn) could not distinguish between the two new less frequent turns (types IV₃ and IV₄ β-turn), whereas type VIII β-turns were easily found by using this approach. Similarly to previous results, the type IV₂ β-turn remained specific to our clustering. However, differences between the studies should be taken into account: the learning method used by Koch and Klebe was different and considered more angles than ours, and their training was conducted on the complete set of turns and not just the type IV β-turns.

Comparison with protein blocks

Table 5 shows the over- and under-representation of protein blocks for all the β-turn types. Type IV^ori β-turns were characterized by a PB motif of [efghijko] [bhijklno] [abghijlnop] [acgiop]. As expected, this signature was more ambiguous in regards to the well-defined types, which showed a range of only one to four PBs at each position. The IV_misc represented only half of the previous β-turn IV^ori types. The only exception was the newly over-represented PBs n and p at positions i and i+1 as well as the reduced over-representation of PBs n and p at positions i+1 and i+3, whereas 28/32 over-representations remained the same.

Table 5 Protein blocks’ Z-scores of β-turn types.

The newly defined type IV β-turns had stronger PB motifs. They could be analysed not only in regards to β-turn IV^ori but also in regards to II and VIII for types IV₁ and IV₂.

For type IV₁, the PB motif is [aegp] [aegho] [hikp] [ail] and has no direct contradiction with the classical behaviours of β-turn IV^ori. However, this motif had some interesting specificities in regards to type IV₂. However, the PB motifs of type II β-turns were less ambiguous, with only two main PBs at each position [eg] [ho] [ik] [al]. Type IV₁ β-turns were clearly different, with 8 over-represented PBs that were under-represented in type II β-turns (PBs a and p at position i, PBs a, e and g at position i+1, PBs h and p at position i+2 and PBs i at position i+3). Similarly, in type IV₂ β-turns, the PB motif was [fjkl] [bklno] [bglp] [cg] and was comparable to the type IV^ori β-turns but also had some differences compared with the type VIII β-turns. Hence, only half of the over-represented PBs in type VIII β-turn were found in type IV₂ β-turns and 5 under-represented PBs were over-represented (PBs k, n and p at position i+1, and PBs b and p at position i+2).

PB motifs of type IV₃ and IV₄ β-turns were mainly associated with the most frequent β-turn, the type I β-turn, because their dihedral angles were in the same restricted area.

Amino Acid Specificities of the new types

β-turns have been widely analysed in terms of sequence – structure relationships, which have been incorporated in various prediction approaches^27,74,75. Table 6 shows the under- and over-represented amino acids in each type of turn. Some associations were expected because all of the different type VI β-turns were characterized by the proline at position i+2.

Table 6 Amino acid’s Z-score of β-turn types.

Concerning the new turns defined in the current analyses, the four important points are as follows:

1. Type IV^ori and IV_misc β-turns remained strongly linked, because removing half of the occurrences did not change the general trend of the unassigned turns.

2. IV₃ and IV₄ were clearly distinct in terms of dihedral angle distributions but had very similar amino acid compositions. Indeed, they shared the same over- or underrepresented amino acid trends in 80% of the cases; only one inversion of amino acid preference was observed for the type IV₃ β-turns at position i+2 (alanine).

3. The type VIII and IV₂ β-turns were structurally close, with high sequence similarity. We found only one inversion between these types at position i+2 for the valine residue.

4. Interestingly, the type IV₁ and II β-turns were close structurally but had strongly divergent sequences. At position i, no common amino acid over- or under-representation was observed. In the Ramachandran plot’s α_L region, glycine represented 88% of the residues, whereas in γ_L, it was only 38% (with N 17%, D 9%, K 5%, E and R 4%, respectively). Interestingly, the type IV₁ encompassed mainly the non-glycine residues at i+2 (see Table 4). Moreover, proline and glycine residues were under-represented at position i+3 of type II, although they were over-represented in type VIII β-turns. Additionally, the i+2 positions of both types had more divergent residues. Figure 5 shows a Sammon map projection⁷⁶ of all the β-turns. It emphasizes these relationships and highlights the strong differences between types IV₁ and II, with the distance being quite substantial. The type IV₁ β-turn amino acid composition was similar to that of the two other new β-turn types, IV₃ and IV₄ (see Supplementary Information 12 and 13).
Figure 5 Sammon map of amino acid behaviours of the different β-turns. Classical turns are in green, and new turns are in red.
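
A Sammon map places high-dimensional profiles in a low-dimensional layout so that the pairwise distances are preserved as well as possible; the quality of a layout is measured by Sammon's stress, E = (1 / Σ d*ᵢⱼ) Σ (d*ᵢⱼ − dᵢⱼ)² / d*ᵢⱼ, where d* and d are the original and projected distances. The sketch below only evaluates this stress for a given layout. It is not the code used for Figure 5, the toy points are invented, and the optimisation step that searches for the best layout is omitted.

```python
import math
from itertools import combinations


def sammon_stress(high, low):
    """Sammon's stress between original points `high` and their projection `low`.

    Assumes all original points are distinct (no zero distance in `high`).
    """
    pairs = list(combinations(range(len(high)), 2))
    d_high = [math.dist(high[i], high[j]) for i, j in pairs]
    d_low = [math.dist(low[i], low[j]) for i, j in pairs]
    scale = sum(d_high)
    return sum((dh - dl) ** 2 / dh for dh, dl in zip(d_high, d_low)) / scale


# Toy data (invented): three 3-D profiles and two candidate 2-D layouts.
profiles = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0), (0.0, 0.0, 5.0)]
faithful = [(0.0, 0.0), (5.0, 0.0), (0.0, 5.0)]    # preserves all three distances
distorted = [(0.0, 0.0), (5.0, 0.0), (10.0, 0.0)]  # stretches one distance

print(sammon_stress(profiles, faithful))
print(sammon_stress(profiles, distorted))
```

The first layout reproduces every pairwise distance and therefore has a stress of zero (up to rounding), whereas the second has a strictly positive stress.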

Conclusions

β-turns are the third most important secondary structure, after the α-helix and the β-sheet. β-turns correspond to approximately 25 to 30% of all protein residues⁷⁷. The current classification of the different β-turns has remained unchanged for the past 25 years. In the 1980s and 1990s, different studies proposed extending the definition of turns, mainly on the basis of the division of the Ramachandran plot into 6 to 8 regions^46,51,78. These analyses of β-turns showed strong similarities with classical analyses and provided new definitions for the least frequently occurring turns. Two recent studies have sought to redefine turns: (i) Koch and Klebe⁵² used a very large modified Self-Organizing Map^53,54 and (ii) George Rose’s group defined 12 categories comprising different lengths^57,58. Nonetheless, the situation is comparable to that of secondary structure assignment, which is still dominated by DSSP⁵: although different turn classifications have subsequently been proposed⁹, none of them has been successfully adopted. The main idea in this study was not to draw up a novel classification but to extend the classical one.

From an unsupervised classification, based exclusively on dihedral angles, four new types were defined. The two most frequently occurring, type IV₁ and IV₂ β-turns, were similar to the existing type II and VIII β-turns, respectively, but had very distinct features. On the one hand, type IV₂ and VIII β-turns shared striking amino acid compositional features, with minor differences. However, type IV₂ β-turns were associated with stabilizing hydrogen bonds, unlike type VIII β-turns. On the other hand, type IV₁ and II β-turns were very close in terms of dihedral angles but were distinct in terms of their amino acid content. Figure 5 clearly shows that type II β-turns were highly specific, whereas type IV₁ β-turns had more classical characteristics, being closer to type I’ β-turns than to type II β-turns.

The two remaining β-turn types, IV₃ and IV₄, were within bin 6 of the Ramachandran plot, close to type I β-turns⁷⁹. Although their amino acid profiles were highly similar, their local protein structure conformations were distinct.

A classical question raised by any clustering methodology is the relevance of the results. Here, the results can be considered reliable, owing to their reproducibility and stability. The use of 10 different datasets, varying in quality and sequence identity, highlighted the high stability of the four main clusters (i.e., the new turns). For each simulation, these clusters were always found at similar frequencies and with similar dihedral values, whereas the other clusters were substantially more variable. A simple analysis was also performed to evaluate the possible presence of sub-clusters inside the different clusters by reducing the dihedral angle deviation allowed during the training. Again, the centres of the four main clusters always appeared, thus supporting their stability.
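
One way to check that the same clusters reappear across datasets is to pair the cluster centres of two runs by their dihedral angles, taking the ±180° periodicity into account. The sketch below is a hypothetical illustration rather than the procedure used in this study: the function names, the 30° tolerance and the centre values are all assumptions made for the example.

```python
def angle_diff(a, b):
    """Smallest absolute difference between two angles in degrees (handles the 180/-180 wrap)."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)


def max_deviation(centre_a, centre_b):
    """Largest per-angle deviation between two (phi, psi, phi, psi) centres for residues i+1 and i+2."""
    return max(angle_diff(a, b) for a, b in zip(centre_a, centre_b))


def match_centres(run_a, run_b, tolerance=30.0):
    """Pair each centre of run_a with its closest centre in run_b, if within the tolerance."""
    matches = {}
    for name, centre in run_a.items():
        best = min(run_b, key=lambda other: max_deviation(centre, run_b[other]))
        if max_deviation(centre, run_b[best]) <= tolerance:
            matches[name] = best
    return matches


# Placeholder centres, invented for illustration; they are not the values found in this study.
run_1 = {"c1": (-120.0, 130.0, 55.0, 40.0), "c2": (-85.0, -15.0, -125.0, 175.0)}
run_2 = {"k1": (-80.0, -20.0, -130.0, -178.0), "k2": (-115.0, 125.0, 60.0, 35.0)}
print(match_centres(run_1, run_2))
```

Note that 175° and −178° are only 7° apart once the wrap-around is handled, which is why a plain subtraction of dihedral angles would be misleading here.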

Comparisons with the alternative classifications previously proposed by Efimov^45,78 and Thornton’s group⁵¹ emphasized the uniqueness of the approach. Notably, the most frequent new turn (type IV₁ β-turn) had not been highlighted, although it is the fifth most frequent turn (including type IV_misc β-turns). Only the type IV₂ β-turns had previously been included.

This extended classification is relevant because it does not modify the currently accepted β-turn types, is highly stable (with regard to amino acid redundancy and the quality of protein resolution), and proposes new ways to analyse the architecture and dynamics of β-turns in protein or peptide structures. Hence, we envision two potential applications of this classification system. The first concerns molecular dynamics simulations in which researchers follow the dynamic evolution of type VIII β-turns⁸⁰. A change from type VIII to type IV (i.e., IV^ori) during the simulations has a very different meaning depending on whether the turn is in fact a type IV₂ or a type IV_misc. The former case (type IV₂ β-turn) is a simple extension of this conformation, whereas the latter (type IV_misc β-turn) is really a different, independent conformation⁸⁰. The second concerns an analysis of the conformational characteristics of asparaginyl residues in proteins⁸¹. Interestingly, many are associated with turn conformations. With this new classification, only 16.5% (see Supplementary Information 14) were associated with miscellaneous turns (i.e., IV_misc); thus, this classification provides a better description of local protein conformations and resolves the spectrum of IV_misc turns to a greater extent.
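
The first application can be illustrated by counting turn-type changes along a trajectory, once with all type IV turns merged (the classical view) and once with the extended types kept apart. This is a minimal sketch: the per-frame labels are invented, and the turn assignment itself, which would come from a separate tool, is assumed to be already available.

```python
from collections import Counter


def count_transitions(labels):
    """Count changes of turn type between consecutive frames of one trajectory."""
    return Counter((a, b) for a, b in zip(labels, labels[1:]) if a != b)


# Hypothetical per-frame assignments for a single four-residue window.
frames = ["VIII", "VIII", "IV2", "VIII", "IV_misc", "IV_misc", "VIII"]

# Classical view: every type IV variant is reported as a single type IV.
merged = ["IV" if label.startswith("IV") else label for label in frames]

print(count_transitions(merged))
print(count_transitions(frames))
```

With merged labels, the two excursions away from type VIII are indistinguishable; with the extended labels, the visit to type IV₂ can be separated from the visit to IV_misc.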

An interesting point is that turns are often observed as tandem repeats, sometimes leading to long series of γβ, βγ, ββ or γγ turns⁸². It is also notable that γ and β turns are associated with the same residues^83,84. In future work, we plan to investigate the succession of turns, particularly the ones mentioned in this study.

Additional Information

How to cite this article: de Brevern, A.G. Extension of the classical classification of β-turns. Sci. Rep. 6, 33191; doi: 10.1038/srep33191 (2016).

References

Pauling, L., Corey, R. B. & Branson, H. R. The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. Proc Natl Acad Sci USA 37, 205–211 (1951).
Pauling, L. & Corey, R. B. The pleated sheet, a new layer configuration of polypeptide chains. Proc Natl Acad Sci USA 37, 251–256 (1951).
Eisenberg, D. The discovery of the alpha-helix and beta-sheet, the principal structural features of proteins. Proc Natl Acad Sci USA 100, 11207–11210 (2003).
Fourrier, L., Benros, C. & de Brevern, A. G. Use of a structural alphabet for analysis of short loops connecting repetitive structures. BMC Bioinformatics 5, 58 (2004).
Kabsch, W. & Sander, C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983).
Fodje, M. N. & Al-Karadaghi, S. Occurrence, conformational features and amino acid propensities for the pi-helix. Protein Eng 15, 353–358 (2002).
Martin, J. et al. Protein secondary structure assignment revisited: a detailed analysis of different assignment methods. BMC structural biology 5, 17 (2005).
Heinig, M. & Frishman, D. STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res 32, W500–502, 10.1093/nar/gkh429 (2004).
Offmann, B., Tyagi, M. & de Brevern, A. G. Local Protein Structures. Current Bioinformatics 3, 165–202 (2007).
Klose, D. P., Wallace, B. A. & Janes, R. W. 2Struc: the secondary structure server. Bioinformatics 26, 2624–2625, 10.1093/bioinformatics/btq480 (2010).
Calligari, P. A. & Kneller, G. R. ScrewFit: combining localization and description of protein secondary structure. Acta Crystallogr D Biol Crystallogr 68, 1690–1693, 10.1107/S0907444912039029 (2012).
Tyagi, M., Bornot, A., Offmann, B. & de Brevern, A. G. Analysis of loop boundaries using different local structure assignment methods. Protein Sci 18, 1869–1881, 10.1002/pro.198 (2009).
Kruus, E., Thumfort, P., Tang, C. & Wingreen, N. S. Gibbs sampling and helix-cap motifs. Nucleic Acids Res 33, 5343–5353, 33/16/534366 (2005).
Wintjens, R., Wodak, S. J. & Rooman, M. Typical interaction patterns in alphabeta and betaalpha turn motifs. Protein Eng 11, 505–522 (1998).
Wojcik, J., Mornon, J. P. & Chomilier, J. New efficient statistical sequence-dependent structure prediction of short to medium-sized protein loops based on an exhaustive loop classification. J Mol Biol 289, 1469–1490 (1999).
Boutonnet, N. S., Kajava, A. V. & Rooman, M. J. Structural classification of alphabetabeta and betabetaalpha supersecondary structure units in proteins. Proteins 30, 193–212 (1998).
Bonet, J. et al. ArchDB 2014: structural classification of loops in proteins. Nucleic Acids Res 42, D315–319, gkt1189 (2014).
Mansiaux, Y., Joseph, A. P., Gelly, J. C. & de Brevern, A. G. Assignment of PolyProline II conformation and analysis of sequence--structure relationship. PLoS One 6, e18401, 10.1371/journal.pone.0018401 (2011).
Pauling, L. & Corey, R. B. The structure of fibrous proteins of the collagen-gelatin group. Proc Natl Acad Sci USA 37, 272–281 (1951).
Cowan, P. M., McGavin, S. & North, A. C. The polypeptide chain configuration of collagen. Nature 176, 1062–1064 (1955).
Adzhubei, A. A. & Sternberg, M. J. Left-handed polyproline II helices commonly occur in globular proteins. J Mol Biol 229, 472–493 (1993).
Creamer, T. P. Left-handed polyproline II helix formation is (very) locally driven. Proteins 33, 218–226 (1998).
Stapley, B. J. & Creamer, T. P. A survey of left-handed polyproline II helices. Protein Sci 8, 587–595 (1999).
Creamer, T. P. & Campbell, M. N. Determinants of the polyproline II helix from modeling studies. Adv Protein Chem 62, 263–282 (2002).
Chellgren, B. W. & Creamer, T. P. Short sequences of non-proline residues can adopt the polyproline II helical conformation. Biochemistry 43, 5864–5869 (2004).
Adzhubei, A. A., Sternberg, M. J. & Makarov, A. A. Polyproline-II helix in proteins: structure and function. J Mol Biol 425, 2100–2132, S0022-2836(13)00166-6 (2013).
Fuchs, P. F. & Alix, A. J. High accuracy prediction of beta-turns and their types using propensities and multiple alignments. Proteins 59, 828–839 (2005).
Bornot, A. & de Brevern, A. G. Protein beta-turn assignments. Bioinformation 1, 153–155 (2006).
Matthews, B. W. The gamma-turn. Evidence for a new folded conformation in proteins. Macromolecules 5, 818–819 (1972).
Milner-White, E. J. Situations of gamma-turns in proteins. Their relation to alpha-helices, beta-sheets and ligand binding sites. J Mol Biol 216, 386–397 (1990).
Nataraj, D., Srinivasan, N., Sowdhamini, R. & Ramakrishnan, C. Alpha-turns in protein structures. Curr. Sci. 69, 434–447 (1995).
Pavone, V. et al. Discovering protein secondary structures: classification and description of isolated alpha-turns. Biopolymers 38, 705–721 (1996).
Dasgupta, B. & Chakrabarti, P. pi-Turns: types, systematics and the context of their occurrence in protein structures. BMC Struct Biol 8, 39, 1472-6807-8-39 (2008).
Rajashankar, K. R. & Ramakumar, S. Pi-turns in proteins and peptides: Classification, conformation, occurrence, hydration and sequence. Protein Sci 5, 932–946 (1996).
Richardson, J. S. The anatomy and taxonomy of protein structure. Adv Protein Chem 34, 167–339 (1981).
Venkatachalam, C. M. Stereochemical criteria for polypeptides and proteins. V. Conformation of a system of three linked peptide units. Biopolymers 6, 1425–1436 (1968).
Crawford, J. L., Lipscomb, W. N. & Schellman, C. G. The reverse turn as a polypeptide conformation in globular proteins. Proc Natl Acad Sci USA 70, 538–542 (1973).
Lewis, P. N., Momany, F. A. & Scheraga, H. A. Chain reversals in proteins. Biochim Biophys Acta 303, 211–229 (1973).
Hutchinson, E. G. & Thornton, J. M. A revised set of potentials for beta-turn formation in proteins. Protein Sci 3, 2207–2216 (1994).
Wilmot, C. M. & Thornton, J. M. Analysis and prediction of the different types of beta-turn in proteins. J Mol Biol 203, 221–232 (1988).
Chan, A. W., Hutchinson, E. G., Harris, D. & Thornton, J. M. Identification, classification, and analysis of beta-bulges in proteins. Protein Sci 2, 1574–1590 (1993).
Nataraj, D. V., Srinivasan, N., Sowdhamini, R. & Ramakrishnan, C. β-turns in protein structures. Curr. Sci. 69, 434–447 (1995).
Hutchinson, E. G. & Thornton, J. M. PROMOTIF–a program to identify and analyze structural motifs in proteins. Protein Sci 5, 212–220 (1996).
Efimov, A. V. [Standard conformations of a polypeptide chain in irregular protein regions]. Mol Biol (Mosk) 20, 250–260 (1986).
Efimov, A. V. Standard structures in proteins. Prog Biophys Mol Biol 60, 201–239 (1993).
Efimov, A. V. Super-secondary structures involving triple-strand beta-sheets. FEBS Lett 334, 253–256 (1993).
Efimov, A. V. Super-secondary structures and modeling of protein folds. Methods Mol Biol 932, 177–189, 10.1007/978-1-62703-065-6_11 (2013).
Efimov, A. V. Structural trees for protein superfamilies. Proteins 28, 241–260 (1997).
Efimov, A. V. A structural tree for proteins containing 3beta-corners. FEBS Lett 407, 37–41 (1997).
Gordeev, A. B., Kargatov, A. M. & Efimov, A. V. PCBOST: Protein classification based on structural trees. Biochem Biophys Res Commun 397, 470–471, 10.1016/j.bbrc.2010.05.136 (2010).
Wilmot, C. M. & Thornton, J. M. Beta-turns and their distortions: a proposed new nomenclature. Protein Eng 3, 479–493 (1990).
Koch, O. & Klebe, G. Turns revisited: a uniform and comprehensive classification of normal, open, and reverse turn families minimizing unassigned random chain portions. Proteins 74, 353–367, 10.1002/prot.22185 (2009).
Kohonen, T. Self-organized formation of topologically correct feature maps. Biol. Cybern 43, 59–69 (1982).
Kohonen, T. Self-Organizing Maps (3rd edition). (Springer, 2001).
Koch, O., Cole, J., Block, P. & Klebe, G. Secbase: database module to retrieve secondary structure elements with ligand binding motifs. J Chem Inf Model 49, 2388–2402, 10.1021/ci900202d (2009).
Meissner, M., Koch, O., Klebe, G. & Schneider, G. Prediction of turn types in protein structure by machine-learning classifiers. Proteins 74, 344–352, 10.1002/prot.22164 (2009).
Fitzkee, N. C., Fleming, P. J. & Rose, G. D. The Protein Coil Library: a structural database of nonhelix, nonstrand fragments derived from the PDB. Proteins 58, 852–854 (2005).
Perskie, L. L. & Rose, G. D. Physical-chemical determinants of coil conformations in globular proteins. Protein Sci 19, 1127–1136, 10.1002/pro.399 (2010).
Porter, L. L. & Rose, G. D. Redrawing the Ramachandran plot after inclusion of hydrogen-bonding constraints. Proc Natl Acad Sci USA 108, 109–113, 1014674107 (2011).
Wang, G. & Dunbrack, R. L. Jr. PISCES: a protein sequence culling server. Bioinformatics 19, 1589–1591 (2003).
Tyagi, M., Bornot, A., Offmann, B. & de Brevern, A. G. Protein short loop prediction in terms of a structural alphabet. Comput Biol Chem 33, 329–333, S1476-9271(09)00051-6 (2009).
de Brevern, A. G., Etchebest, C. & Hazout, S. Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks. Proteins 41, 271–287 (2000).
Joseph, A. P. et al. A short survey on protein blocks. Biophys Rev 2, 137–145 (2010).
Rabiner, L. R. A tutorial on hidden Markov models and selected application in speech recognition. Proceedings of the IEEE 77, 257–286 (1989).
Tyagi, M. et al. Protein Block Expert (PBE): a web-based protein structure analysis server using a structural alphabet. Nucleic Acids Res 34, W119–123 (2006).
Poulain, P. PBxplore: A program to explore protein structures with Protein Blocks. Technical report. (2016) Available at: https://github.com/pierrepo/PBxplore. (Accessed: 21st June 2016).
Schuchhardt, J., Schneider, G., Reichelt, J., Schomburg, D. & Wrede, P. Local structural motifs of protein backbones are classified by self-organizing neural networks. Protein Eng 9, 833–842 (1996).
de Brevern, A. G. & Hazout, S. ‘Hybrid protein model’ for optimally defining 3D protein structure fragments. Bioinformatics 19, 345–353 (2003).
Esque, J., Urbain, A., Etchebest, C. & de Brevern, A. G. Sequence-structure relationship study in all-alpha transmembrane proteins using an unsupervised learning approach. Amino Acids 47, 2303–2322, 10.1007/s00726-015-2010-5 (2015).
Ihaka, R. & Gentleman, R. R: A Language for Data Analysis and Graphics. Journal of Computational and Graphical Statistics 5, 299–314 (1996).
Ramachandran, G. N., Ramakrishnan, C. & Sasisekharan, V. Stereochemistry of polypeptide chain configurations. J Mol Biol 7, 95–99 (1963).
Ramakrishnan, C. & Ramachandran, G. N. Stereochemical criteria for polypeptide and protein chain conformations. II. Allowed conformations for a pair of peptide units. Biophys J 5, 909–933, S0006-3495(65)86759-5 (1965).
Micheletti, C., Seno, F. & Maritan, A. Recurrent oligomers in proteins: an optimal scheme reconciling accurate and concise backbone representations in automated folding and design studies. Proteins 40, 662–674 (2000).
Chou, P. Y. & Fasman, G. D. Prediction of beta-turns. Biophys J 26, 367–383, S0006-3495(79)85259-5 (1979).
Singh, H., Singh, S. & Raghava, G. P. In silico platform for predicting and initiating beta-turns in a protein at desired locations. Proteins 83, 910–921, 10.1002/prot.24783 (2015).
Sammon, J. A nonlinear mapping for data structure analysis. IEEE Transactions on Computers 18, 401–409 (1969).
Guruprasad, K. & Rajkumar, S. Beta-and gamma-turns in proteins revisited: a new set of amino acid turn-type dependent positional preferences and potentials. J Biosci 25, 143–156 (2000).
Efimov, A. V. [Standard structures in protein molecules. II. Beta-alpha hairpins]. Mol Biol (Mosk) 20, 340–345 (1986).
Kalmankar, N. V., Ramakrishnan, C. & Balaram, P. Sparsely populated residue conformations in protein structures: revisiting “experimental” Ramachandran maps. Proteins 82, 1101–1112, 10.1002/prot.24384 (2014).
Fuchs, P. F. et al. Kinetics and thermodynamics of type VIII beta-turn formation: a CD, NMR, and microsecond explicit molecular dynamics study of the GDNP tetrapeptide. Biophys J 90, 2745–2759, S0006-3495(06)72457-2 (2006).
Srinivasan, N., Anuradha, V. S., Ramakrishnan, C., Sowdhamini, R. & Balaram, P. Conformational characteristics of asparaginyl residues in proteins. Int J Pept Protein Res 44, 112–122 (1994).
Guruprasad, K., Prasad, M. S. & Kumar, G. R. Analysis of gammabeta, betagamma, gammagamma, betabeta continuous turns in proteins. J Pept Res 57, 292–300 (2001).
Guruprasad, K., Prasad, M. S. & Kumar, G. R. Analysis of gammabeta, betagamma, gammagamma, betabeta multiple turns in proteins. J Pept Res 56, 250–263 (2000).
Guruprasad, K., Rao, M. J., Adindla, S. & Guruprasad, L. Combinations of turns in proteins. J Pept Res 62, 167–174 (2003).
de Sanctis, D. et al. Bishistidyl heme hexacoordination, a key structural property in Drosophila melanogaster hemoglobin. J Biol Chem 280, 27222–27229, 10.1074/jbc.M503814200 (2005).
Becker, A. & Kabsch, W. X-ray structure of pyruvate formate-lyase in complex with pyruvate and CoA. How the enzyme uses the Cys-418 thiyl radical for pyruvate cleavage. J Biol Chem 277, 40036–40042, 10.1074/jbc.M205821200 (2002).
Dobbek, H., Svetlitchnyi, V., Liss, J. & Meyer, O. Carbon monoxide induced decomposition of the active site [Ni-4Fe-5S] cluster of CO dehydrogenase. J Am Chem Soc 126, 5382–5387, 10.1021/ja037776v (2004).
Levy, C. W. et al. Insights into enzyme evolution revealed by the structure of methylaspartate ammonia lyase. Structure 10, 105–113 (2002).
Burmeister, W. P., Guilligay, D., Cusack, S., Wadell, G. & Arnberg, N. Crystal structure of species D adenovirus fiber knobs and their sialic acid binding sites. J Virol 78, 7727–7736, 10.1128/JVI.78.14.7727-7736.2004 (2004).
Grabarse, W. et al. On the mechanism of biological methane formation: structural evidence for conformational changes in methyl-coenzyme M reductase upon substrate binding. J Mol Biol 309, 315–330, 10.1006/jmbi.2001.4647 (2001).
Hisano, T. et al. Crystal structure of the (R)-specific enoyl-CoA hydratase from Aeromonas caviae involved in polyhydroxyalkanoate biosynthesis. J Biol Chem 278, 617–624, 10.1074/jbc.M205484200 (2003).
Zuo, Y., Wang, Y. & Malhotra, A. Crystal structure of Escherichia coli RNase D, an exoribonuclease involved in structured RNA processing. Structure 13, 973–984, 10.1016/j.str.2005.04.015 (2005).
Kwak, B. Y. et al. Structure and mechanism of CTP:phosphocholine cytidylyltransferase (LicC) from Streptococcus pneumoniae. J Biol Chem 277, 4343–4350, 10.1074/jbc.M109163200 (2002).
Schafer, K. et al. X-ray structures of the maltose-maltodextrin-binding protein of the thermoacidophilic bacterium Alicyclobacillus acidocaldarius provide insight into acid stability of proteins. J Mol Biol 335, 261–274 (2004).
Hayashi, I. & Ikura, M. Crystal structure of the amino-terminal microtubule-binding domain of end-binding protein 1 (EB1). J Biol Chem 278, 36430–36434, 10.1074/jbc.M305773200 (2003).
Wise, E. L., Graham, D. E., White, R. H. & Rayment, I. The structural determination of phosphosulfolactate synthase from Methanococcus jannaschii at 1.7-A resolution: an enolase that is not an enolase. J Biol Chem 278, 45858–45863, 10.1074/jbc.M307486200 (2003).

Acknowledgements

I thank the editor and anonymous reviewers for their constructive comments, which helped me improve the manuscript. This work came from various trips and discussions I had during recent years in Bangalore, India, and I would like to dedicate this research to Indian protein pioneers G.N. Ramachandran, C. Ramakrishnan, C.M. Venkatachalam, P. Balaram, N. Srinivasan and R. Sowdhamini and also to my colleagues C. Etchebest, P.F.J. Fuchs, J.-C. Gelly, and especially T.J. Narwani. This work was supported by grants from the French Ministry of Research, University of Paris Diderot – Paris 7, French National Institute for Blood Transfusion (INTS), French Institute for Health and Medical Research (INSERM). AdB also acknowledges the Indo-French Centre for the Promotion of Advanced Research/CEFIPRA for collaborative grants (numbers 3903-E and 5302-2). This study was supported by grants from the Laboratory of Excellence GR-Ex, reference ANR-11-LABX-0051. The labex GR-Ex is funded by the programme “Investissements d’avenir” of the French National Research Agency, reference ANR-11-IDEX-0005-02. Calculations were performed on an SGI cluster granted by Conseil Régional Ile de France and INTS (SESAME Grant).

Author information

Authors and Affiliations

INSERM, U 1134, DSIMB, Paris, F-75739, France
Alexandre G. de Brevern
Univ Paris Diderot, Sorbonne Paris Cité, UMR_S 1134, Paris, F-75739, France
Alexandre G. de Brevern
Institut National de la Transfusion Sanguine (INTS), Paris, F-75739, France
Alexandre G. de Brevern
Laboratoire d’Excellence GR-Ex, Paris, F-75739, France
Alexandre G. de Brevern

Contributions

A.G.d.B. designed and performed experiments, analysed data and wrote the paper.

Ethics declarations

Competing interests

The author declares no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Received: 13 April 2016
Accepted: 22 August 2016
Published: 15 September 2016
DOI: https://doi.org/10.1038/srep33191
