Abstract
Integrin αIIbβ3 mediates platelet aggregation and thrombus formation. In a rare hereditary bleeding disorder, Glanzmann thrombasthenia (GT), αIIbβ3 expression / function are impaired. The impact of deleterious missense mutations on the complex structure remains unclear. Long independent molecular dynamics (MD) simulations were performed for 7 GT variants and reference structure of the Calf-1 domain of αIIb. Simulations were analysed using a structural alphabet to describe local protein conformations. Common and flexible regions as well as deformable zones were observed in all the structures. The most flexible region of Calf-1 (with highest B-factor) is rather a rigid region encompassed into two deformable zones. Each mutated structure barely showed any modifications at the mutation sites while distant conformational changes were observed. These unexpected results question the relationship between molecular dynamics and allostery; and the role of these long-range effects in the impaired αIIbβ3 expression. This method is aimed at studying all αIIbβ3 sub-domains and impact of missense mutations at local and global structural level.
Similar content being viewed by others
Introduction
In humans, the Integrins protein superfamily consists of 24 heterodimeric receptors resulting from different combination of 18 α and 8 β subunits. Integrins are highly dynamic glycoprotein (GP) involved in cell-cell or cell-matrix interactions1. Upon activation the integrin αIIbβ3 binds plasmatic fibrinogen leading to platelet aggregation and thrombus formation (Primary haemostasis). Some well-known human integrin structures are: αxβ2, αvβ3 (cell-matrix adhesion) and αIIbβ3 (cell-cell adhesion). αv can associate to β1,3,5,6,8 subunits while αIIb, that is specific to platelets and megakaryocytes cells, only associates to β3 1. The αIIbβ3 structure is organized into 3 distinct regions; an N-terminus extracellular ectodomain, a single spanning transmembrane (TM) region and a C-terminus cytoplasmic region. The cytoplasmic region in α subunit is very swift (~20 residues) while in β subunit it extended up to 46 residues in length and constitutes an important node for signalling. Both α and β TM regions are single spanning, and consist of 22 residues each. The ectodomain is relatively huge with 959 and 693 residues in α and β subunits respectively. Figure 1 depicts transition steps from the inactive conformations of αIIbβ3 (crystallized closed structure) to its theoretical open liganded active form. A complete structure of the open forms of the ectodomain with or without ligand remains to be crystallized. The ectodomain is further divided into four regions: headpiece, knee, legs and tails. The headpiece that carries the ligand-binding site consists of the β-propeller domain of αIIb and the β-I domain of β3. The αIIb β-propeller domain consists of a 7 bladed fold with four Ca2+ ions coordinated with β-hairpin loops connecting the antiparallel β-strands (see Fig. 1A). The β-I domain of β3 mainly consists of α helices and loops with coordinated metal ions Ca2+ and Mg2+ constituting a MIDAS (Metal Ion Dependent Adhesion Sites) with an ADMIDAS (Adjacent to MIDAS) and SyMBS (Synergistic Metal Binding Site). These sites play critical role in opening the αIIbβ3 binding site and helps in ligand binding2. Following β-propeller, the αIIb leg is composed of the Thigh domain, the Genu (knee), the rigid Calf-1 and Calf-2 domains. The short loop of αIIb Genu coordinates with a divalent calcium3. This metal ion might help in stabilizing Calf-1 and Thigh domains interface during the opening of the structure following the activation process (angular shift at Genu). The αIIb leg is rigid and provides a framework to the entire ectodomain. The β3 β-I domain of the headpiece is followed by the Hybrid, PSI, and 4 IEGFs (Integrin Epidermal Growth Factor) domains. A short knee joined the IEGF-1 with IEGF-2 domains. The β3 leg consists of IEGF-2 to IEGF-4 whose C-terminus ends in the ankle domain (tail). αIIb and β3 transmembrane and cytoplasmic domains are not shown. The activation state of αIIbβ3 is tightly controlled by inside - out signalling resulting from platelet activation by multiple exogenous factors (physiological plasmatic agonists, exposed sub-endothelial matrix …). The αIIb headpiece opens up creating an angular shift between Thigh and Calf-1 domains (Fig. 1C) meanwhile, the β3 leg and tail remains parallel to the αIIb leg. Thereafter, opening of the β3 headpiece pulls the β3 legs outward resulting into an extended open conformation (Fig. 1D,E) that can bind plasmatic fibrinogen at the MIDAS, which is constituted by elements of both the headpieces4, 5. Finally, fibrinogen binding leads to outside-in signalling.
The human integrin αIIbβ3, a platelet surface fibrinogen receptor, is responsible for platelets aggregation, a key process in primary haemostasis and thrombus formation6. Defective primary haemostasis leads to two severe life-threatening bleeding disorders: (1) Glanzmann thrombasthenia (GT), a rare autosomal recessive genetic disease associated with impaired αIIbβ3 expression and/or function7; and (2) Fetal/neonatal alloimmune thrombocytopenia (FNAIT) results from fetal/neonatal platelet destruction by maternal antibodies in mothers lacking an alloantigen inherited from the father. FNAIT clinical consequences range from no symptoms to intracranial haemorrhages with a risk of neurological sequel and/or fetal/neonatal death8.
Both diseases result from αIIb and β3 gene polymorphisms. In GT, more than 300 mutations have been identified in αIIb or β3 genes. Most of them are reported in https://sinaicentral.mssm.edu/intranet/research/glanzmann. In FNAIT, neither the expression nor function of αIIbβ3 is affected but missense mutations results in amino acid (aa) substitutions that define Human Platelet Antigens (HPA). All human platelet alloantigens are described in the HPA database (http://www.ebi.ac.uk/ipd/hpa/). The effects of these amino acid substitutions on αIIbβ3 structure remain largely unknown. We are interested in understanding how amino acid substitutions in GT can impact the Calf-1 domain of αIIbβ3 structure and its structural dynamics.
Very few structures of αIIbβ3 have been crystallized and only one contains the whole αIIbβ3 ectodomain (PDB id 3FCS9) in closed conformation (Fig. 1B). Using this structure, we showed that the β3 Lys253Met GT mutation impaired key ionic interactions between the αIIb β-propeller and the β3 β-I like domain10. However static models cannot depict all mutation-induced effects on a highly dynamic structure.
In our previous studies, molecular dynamics (MD) simulations were used to compare the structures of the β3 L33 and P33 forms11. The L33P substitution located in the PSI domain is responsible for the HPA-1 system, clinically the most important one in Caucasian populations. A third form with a Valine in position 33 of β3 was also studied12. Although the 3 variants mostly shared common conformations, the P33-β3 variant presented a higher mobility and specific conformations of IEGF-1, IEGF-2, and PSI domains. The L33V substitution mainly displaced a dynamic equilibrium between common structures that could explain a variable reactivity of different anti-HPA-1a sera with the two β3 forms.
Using new strategies in MD, we studied the effect of 7 variants of the αIIb Calf-1 domain known to impair αIIbβ3 integrin expression in GT. Unexpected long-range structural effects of mutations were discovered. Furthermore, our results raised the question about a possible role of these allosteric effects in the impaired αIIbβ3 expression.
Methods
Structural data
The αIIb Calf-1 domain was extracted from a 2.55 Å resolution crystal structure of the αIIbβ3 integrin (PDB code 3FCS9). Calf-1 is a domain of 141 residues [positions 603–743] (see supplementary videos 1 and 2). It is a mainly-beta sandwich protein with an immunoglobulin-like topology as described in CATH database (CATH number: 2.60.40.1510, http://www.cathdb.info/version/latest/domain/3fcsA03)13. Some missing atoms in side chains of residues 667 and 668 were completed using Modeller software v.9.1414, 15. The seven GT aa substitutions were introduced in Calf-1 structure by in-silico mutagenesis using PyMOL software16 and the SCWRL method17. The effects of all mutations were studied exclusively.
Molecular Dynamics
MD simulations were done using GROMACS 4.5.7 software18 with the OPLS-AA force-field19. WT and variant forms of Calf-1 were soaked in a rhombic dodecahedral simulation box with TIP3P water molecules and neutralized with Cl- ions. The MD protocol had been used in our previous works11, 12. After 1 nsec of equilibration (with position restraints on the protein), each system was simulated through 11 independent dynamics for a total of 850 nanoseconds (5 × 50 nsec + 6 × 100 nsec). Molecular conformations were saved every 100 psec for downstream analysis. The first 5 nsec of each MD simulation were discarded as the residues at the extremities. Trajectory analyses were done with the GROMACS software, in-house Python and R scripts. Root mean square deviations (RMSD) and root mean square fluctuations (RMSF) were calculated on Cα atoms only. Residues interactions were analysed using the online tool PIC (Protein Interactions Calculator)20.
Protein Blocks analysis
Protein Blocks (PBs) are a structural alphabet composed of 16 local prototypes21. Each specific PB is characterized by the φ, ψ dihedral angles of five consecutive residues with each PB assignment focused on the central residue. Obtained through an unsupervised training approach and performed on a representative non-redundant databank, PBs give a reasonable approximation of all local protein 3D structures22. PBs are very efficient in tasks such as protein superimpositions23 and MD analyses24. They are labelled from a to p: PBs m and d can be roughly described as prototypes for α-helix and central β-strand, respectively. PBs a to c primarily represent β-strand N-caps and PBs e and f representing β-strand C-caps; PBs a to j are specific to coils; PBs k and l to α-helix N-caps while PBs n to p to α-helix C-caps. PB25 assignment was carried out using our PBxplore tool at GitHub26.
PB assignments are done for each residue of the Calf-1 domain and over every snapshot extracted from MD simulations. The equivalent number of PBs (N eq ) is a statistical measurement similar to entropy that represents the average number of PBs for a residue at a given position. N eq is calculated as follows22:
where, f x is the probability of PB x. A N eq value of 1 indicates that only one type of PB is observed, while a value of 16 is equivalent to a random distribution. To underline the main differences between the wild-type (WT) and a variant for each position, ΔN eq value is computed. ΔN eq is the absolute difference between corresponding N eqs .
However, a same ΔN eq value can be obtained with different types of blocks in similar proportions. To detect a change in PBs profile, a ΔPB value was calculated. It corresponds to the absolute sum of the differences for each PB between the probabilities of a PB x to be present in the WT and the variant forms (x goes from PB a to PB p). ΔPB is calculated as follows:
where, f WT x and f var x are the percentages of occurrence of a PB x in respectively the WT and the variant forms of Calf-1 structures. A value of 0 indicates perfect PBs identity between WT and variant, while a score of 2 indicates a total difference.
Calf-1 variants
The seven Calf-1 domain variants studied herein were involved in GT and are reported in the GT database (https://sinaicentral.mssm.edu/intranet/research/glanzmann). Variants L653R27, L721R (only reported in the GT database), L721V28, R724P28, 29, R724Q30, and P741R29 severely impaired αIIbβ3 expression (less than 5%) while variant C674R28, 30,31,32 allowed a 10% residual expression (type I and II GT33).
Results
Structural analysis of the Calf-1 domain
Calf-1 domain extends from residues 603 to 743 of the αIIb integrin subunit. This domain is an all beta structure adapting an Immunoglobulin-like Beta-sandwich fold13 with 9 consecutive β-strands connected by 8 loops (Fig. 2, the loops position is provided in Fig. 3D). Loops 1 and 10, located at the N- and C-terminals of Calf-1 connects it with N-ter Thigh and C-ter Calf-2 domains, respectively. RMSD from all MD simulations reach a steady state at 50 nsec (Figure S1) that is maintained in longer runs of 100 nsec indicating stable and reproducible independent dynamics.
According to the high B-factor values obtained from crystallographic data, loops 2, 3, 4 and 5 are the most flexible regions of Calf-1 (Fig. 3A). Residues 622, 643, 710 and residues 667/668 (of loop 5 that contains missing atoms) presented the highest B-factor values in their respective loops. On average β-strands are more rigid than loops34, 35, although some of their residues represent relatively high B-factor values. In our study, the B-factors seemingly escape the influence of crystal packing contacts. However, it is known that B-factors are strongly influenced by the crystal packing of the structure36. Some protein moieties very flexible in solution might seem to be rigid only because they are involved in the solid-state packing.
RMSF values computed from MD simulations measure the mobility of each residue around its median position in the structure and allow assessing protein flexibility (Fig. 3B). High RMSF values are often associated with loops and sometimes with C-ter of β-strands. As defined by high RMSF values, loop 2 (residues 619–620 and 625–626), loop 5 (residues 665–671) and loop 8 (residues 711–713) are flexible regions, with loop 5 being the most flexible. The rest of the structure is more rigid. RMSF and B-factors values are correlated for loops 2, 5 and 8 (Fig. 3D). Some points are noteworthy: (a) the limits of flexible positions can show some little differences between RMSF and B-factor and (b) loop 3 is associated to high B-factor but low RMSF values although it binds a Ca2+ (not included in MD simulations) in the crystal structure. Similar correlation between B-factor and RMSF values had been previously reported37.
Figure 3B–D indicate a good correlation between RMSF and N eq values. Indeed, highest N eq values are associated to flexible regions (as defined by B-factor and RMSF) with residues K678-T682 (loop 6) and N709-E712 (loop 8), but also with T619 (loop 2). Expectedly, some regions can show higher N eq for some residues; G641-G643 (loop 3) and S728-N730 (loop 9). On the other hand, highly flexible region can also represent high local rigidity in terms of PBs, for instance, residues V666-F669 and E670 in loop 5 (Figs 3C and 4). Direct comparison of RMSF and N eq values (Fig. 4A) clearly shows that E667 represents a high RMSF but a low N eq . This can be explained by its PB distribution (Fig. 4B): E667, G668 and F669 representing the highest RMSF values (and also B-factors), mainly adopted the PB sequence “hia” with respective occurrences of 86.2, 82.9 and 61.6%. A series of PB “hia” is a classical loop conformation but this region (in blue rectangle on Fig. 3C) maintains a single conformation and is not really flexible. This apparent discrepancy can be explained by the insertion of the rigid stretch E667-F669 in a larger flexible (or more precisely deformable) loop N665-L672. Interestingly our results revealed that a locally rigid aa stretch (few possible conformations/low N eq ) can be a part of a large mobile loop involved in the global structural motions of the protein (high RMSF).
Overall, our results showed a good correlation between experimental data (B-factor), RMSF and N eq obtained from MD simulations. Although some discrepancies did exist, they are explained by local structure singularities. As expected in an all-Beta domain, rigid β-strands are linked by flexible loops.
The selection of OPLS-AA force field was made in regards to our previous works on integrins11, 12. To assess the effect of the force field, simulations were also performed on WT with Gromos 46a7 and Gromos 54a7. For the latter, 11 independent simulations were performed with the protocol used with OPLS-AA. No significant global or local differences were observed.
Structure comparisons between GT variants and WT Calf-1
The αIIbβ3 integrin was cut into compact structural domains through Protein Peeling38 that correlate the delineations found in literature9. As shown in Fig. 2, the variant residues studied are mostly located at β-strands presenting low flexibility with the exception of residue 653 localized near the β-strand 3 C-ter.
As for the WT system, the 7 variants were studied with 11 independent MD simulations performed to a complete timing of 850 nsec and with parameters similar to Jallu et al.12. Each system reached a plateau after 5 nsec with an average RMSD of 2 Å (beginning of loop 1 and end of loop 10 excluded). All energetic and geometric parameters showed a good evaluation for the 77 different simulations used in this study, e.g. no clashes found. The Calf-1 domain stayed consistent during the whole dynamics.
Average RMSF from each variant and the WT were comparable (Fig. 5). The most important variations observed in loop 2 (V625), loop 5 (E670), loop 8 (A713) and loop 9 (N732) did not lead to disordered patterns. Some variants showed specific higher or lower RMSF for some restricted positions like for C674R and L721V variants (Fig. 5). As seen with the WT system, RMSF can be confused with deformability. To resolve this aspect, we have used PBs analyses of the MD trajectories.
PBs analyses revealed striking local structure alterations, but distant from the variant sites
For clarity, results are detailed for only 3 variants R724Q, L653R and C674R that are representative of all behaviours observed for the 7 variants.
R724Q
This aa variation is located at β-strand number 8. In regards to the WT structure (Figs 6A and S2A), the highest N eq differences are at S621 (beginning of loop 2), A644 (loop 3) and L710 (loop 8). These loops that are naturally flexible are even more so in the variant. Conversely, residues L624 to D628 have a lower N eq value thus indicating that loop 2 represents a dual behaviour, with increased deformability at its beginning and enhanced stability in its C-ter part. Surprisingly, the mutant residue Q724 (β-strand 8) conserved the same N eq (Fig. 6A) with a low ΔPB of 0.09 (Fig. 6B) indicating that local β-strand conformation is conserved, i.e. PB d.
Regarding the structure, the polar amino acid arginine contains a longer aliphatic side-chain than glutamine, an uncharged hydrophilic polar amino acid. Q724 conserves the backbone - backbone interaction with E648 as observed with R724 (β-strand 3, see Fig. 6E). Beside, Q724 lost the ionic bond and the side chain - side chain interactions with E648 but made new hydrogen bonds through side chains interactions with E722. This showcases a classic example of structural compensation that maintained the local conformation of the residue through different interactions.
The highest ΔN eq (2.71) that is also associated with the highest ΔPB (0.57), is observed for S621 (Fig. 6A,B). S621 is located at the opposite side of the domain in reference to residue 724 (Fig. 6E). In the variant structure, S621 mostly remained in a PB d (i.e., β-strand) conformation with however, a decreased frequency of occurrences. Besides, downstream P622 and L623 presented some lost conformers with increased frequencies of PBs e and h respectively. Very few typical backbone - backbone interactions of S621 with L623 and backbone - side chain interactions with N629 are replaced by a single bond between side chains with N629. Adding to this high mobility, S621 did not do consistent and sustainable interactions. This behaviour is amplified in the Q724 variant and the most stable residue S621 in a naturally flexible region (loop 2), became one of the most deformable position.
L653R
This GT variant results from a L653R substitution in loop 4. The highest N eq variations (Figs 7A and S2B) affected residues G620-P622 (loop 2), V630-L631 (β-strand 2), E646 (loop 3), R671 (loop 5) and L710 (loop 8). As observed with the R724Q substitution, residues G620-L623 gained slightly more flexibility. Conversely, residues L624-D628 showed increased flexibility but with a limited impact (ΔPB 0.23 on average) on the most frequent PBs (PB e for L624, h for V625 and i for G626 in Fig. 7B).
The mutated residue in position 653 (loop 4) was not subjected to any N eq modification. It conserved a strong local structural stability (Fig. 7C) similar to its direct environment. The PB series at this position “dddeh” is even slightly more common in the variant than in the WT (64% and 59%, respectively). In the R653 variant, the 8 hydrophobic bonds of L653 disappeared in favour of new interactions between the R653 backbone and A657 and E676 side-chains (Fig. 7E). The backbone – backbone interaction with R683 was conserved. The mutation zone showed no conformational change as the loss of important specific interactions were partly compensated by new ones. Over 9 original interactions only 1 is conserved while 3 are created.
Q679 (loop 6) is a very interesting case where ΔN eq was negligible while the ΔPB was the highest (0.78). The most frequent PB b (N-cap of β-strand) was replaced by a PB h (loop structure) in regards to their frequency of occurrences (Fig. 7D). Hydrogen interactions with T682 and K677 are retained but the backbone - backbone interaction with E681 was lost and replaced by side chain and ionic side chain interactions with R724 in loop 9. In the variant structure, this region has high fluctuations in PBs, mainly associated to loops that even affected the C-ter of the β-strand 5 located the above loop 9.
C674R
This variant is associated with a C674R substitution in β-strand 5. We observed a variation of N eq profile (Figs 8A and S2C) similar to that occurring in the R724Q substitution (see previous section). Loop 2 presented the same increased deformation at its beginning (S621), followed by a stiffening in its centre (residues L624-D628). The same PB series “ehiac” (L624-D628) is found in greater proportion than in the Q724 and R674 variants, reinforcing the local stiffening of the loop in this region.
The main destabilization was far upstream of residue 674 (Fig. 8C). With the C674R substitution, the residue 674 not just lost its covalent disulphide bond with C687 located at the end of β-strand 6, but also its aromatic interaction with Y659 in β-strand 4 (Fig. 8D). However, the mutated R674 made an ionic bond with E688 located at end of β-strand 8 that strengthened a backbone – backbone interaction. The 80% frequency of PB d (the highest) in WT decreased to 49% in the variant. Surprisingly, N675 and Q676 located downstream the substitution remained structurally stable with similar PB occurrences.
The highest N eq variation affected R671 as shown by the strongest ΔN eq (5.02) and ΔPB (0.91). The side chain of R671 is mainly exposed at the domain surface and forms a single ionic interaction with the neighbouring E670, like in WT. But in the variant conformation, the R671 side-chain can occasionally turned toward loop 8 to make ionic side chain interactions with E688 (Fig. 8E). The frequency of PB d (the highest) drastically decreased in the variant leading to an increased disorganization of the neighbourhood.
Experimentally, the C674R mutation severely impaired the αIIbβ3 complex expression with only 10% expressed at the surface of the patient’s platelets and transiently transfected CHO cells31. The C674R mutation did not impair pro-αIIb synthesis but affect the stability of the complex that is not correctly matured and/or expressed at the cell membrane.
Other variants
These MD simulations on Calf-1 domain allowed demonstrating more or less pronounced structural changes depending on the variants under study. The C674R and P741R variants presented conformational changes at the mutated site. In the case of the C674R substitution, the resulting loss of the disulphide linkage relaxes the structure and introduces significant structural alterations (Figs 7 and 8). The proline is the aa known to cause the most drastic change in conformations39. Indeed, the P741R substitution (Fig. 9) inverse the PB profile going from 55% of PB d (β-strand) and 29% of PB f (C-cap of β-strand) to 24% of PB d and 59% of PB f (Figure S3). This case was associated with a low ΔN eq (0.15, Figure S4A) while the ΔPB was high (0.70). In P741R substitution two hydrophobic interactions were lost and R741 formed ionic and side chain - side chain interactions shortening the β-strand. Note that residue 741 is located at two residues from the C-term of Calf-1 and is normally in contact with the Calf-2 domain. Thus, the absence of the neighbouring domain in our MD simulations can impact our observations. To solve this problem, MD simulations of combined domains are currently under study.
In the remaining 5 variants studied, compensation mechanisms were also observed. Most interactions formed by WT residues are replaced by new ones, allowing conservation of the local structure. Surprisingly, regions displaying significant changes (high ΔPB) are distant from substitution sites without any contact/interaction with the substituted aa. These regions contribute towards increasing the deformability and are usually located at interfaces adjacent to neighbourhood β-propeller, Calf-2 or Thigh domains. These results depict changes resulting from substitutions in distant regions suggesting long-range mechanism to be at play.
Different variants with common mutation sites
L721R and L721V showed quite different results (Fig. 9). Compared to L721R, the L721V substitution had very little impact on RMSF, apart for the end of loop 8, a highly flexible region. This is particularly true for E712 (loop 8), whose ΔN eq were respectively of 3.33 and 0.39 in R721 and V721 variant forms (Figures S4B and S5). Residue 721 and E712 are remote (separated by β-strand 8 and a part of loop 9), and E712 can interact with the β-propeller domain. Loop 8 and especially E712 are already naturally mobile. Locally, V721 also presented a lower impact than R721 that has a higher overall ΔN eq . These observations sound logical as Leucine and Valine share similar hydrophobic aliphatic side-chains (a single methyl group of difference), mitigating the impact of the L721V substitution with respect to arginine, which is positively charged.
R724P and R724Q
The previously described R724Q substitution (see above) presented a very pronounced impact compared to others and induced the dual deformability/rigidity alterations of loop 2. This particular phenomenon is also observed in the variant R724P although contributing more stiffness to the site, owing to the backbone structure of proline (Fig. 9). In both cases, the mutated residue did not affect the local conformation (ΔN eq and ΔPB are identical). The P724 variant mainly differed from the R724 WT in residues G690 (C-cap of β-strand 6) and N691 (start of loop 7, in contact with the Thigh domain). G690 and N691 have maximal ΔPBs of 0.79 and 0.75, and ΔN eq of 2.17 and 0.85 respectively. For G690, these values are consistent with a drastic PB j (loop) decreased from 70% in the WT to 23% in the variant, while the occurrences increased for very rare PBs i and p (loop and N-cap of α-helix) (Figure S6). For N691, PB c occurrences decreased from 80% to 45%, while PB a reached 44%; both PBs types being associated with N-cap β-strand.
Considering the 7 variants, some Calf-1 regions are more affected than others (see Figure S6). For instance, loop 5 had certain rigidity in terms of PBs but this area is subject to strong N eq increment under influence of substitutions involved in Glanzmann thrombasthenia (e.g., case of variant C674R). Loop 8 deformability is also observed to be increased, especially in the L721V substitution. The region from the middle of loop 2 to the middle of β-strand 2 (L624-E632) is also affected, from deformability (L653R, L721R) to rigidity (C674R, R724Q).
Loop 2 (especially residues G620-P622) is particularly a remarkable region with a local variability common to all variants (Figs 8 and S6). The end of β-strand 1 has high ΔN eq values ranging from 2.06 to 2.71, while ΔPB ranges from 0.54 to 0.71. A direct comparison reveals that PBs a, b and d frequency of occurrences in the WT decreased from 66% to 37% in the variant.
Discussion
This study aimed at investigating the αIIbβ3 integrin structure in a context of GT disorder. Previously, two MD studies of GT mutations have been performed, both with very short simulation times. The first study addresses the mutation in a calcium binding site with a single 20 ns simulation for reference and variant sequences40. A second study carried on the β3 S189 variant was done with a single simulation of 60 ns41. Hence, both these studies were very short for sampling the conformational space.
Our team showed the impact of three distinct mutations by a static approach10. More recently, we have focused on the HPA-1 system characterization with MD simulations 10.8 times longer than the longest study cited herein11, 12. In the present work simulation times were 14.1 times longer and 11 independent simulations were done for each variant to correctly sample the conformational space. To illustrate this, Sammon maps42 of the 88 independent MD were done for the 8 systems (WT and 7 variants, Fig. 10). In brief, a Sammon map (SM) is a projection method, which differs from Principle Component Analysis (PCA). SM is an iterative approach that tends to conserve the distances between data within a high-dimensional space into a smaller dimensional space, often a 2D space. A major interest of SM is that it does not overweight the outliers; therefore the projection is really pertinent. SM of MD encoded in terms of PBs allowed comparing distances between MD obtained in a single or multiple systems. Considering first a single system, the limited numbers of simulations in a close vicinity to its barycentre underlined the differences between simulations as observed for the WT (Fig. 10A). Similar dispersion is obtained for each system (Fig. 10B). Two systems with variants at the same position like L721R (green dots)/L721V (yellow dots), and R724P (brown dots)/R724Q (cyan dots) can also lead to somewhat superimposed positions (Fig. 10B). Nonetheless comparing the whole set of positions (or their barycentre) clearly indicate differences in behaviour of MD of each system. It confirmed the necessity to proceed with several independent simulations otherwise the conclusions might be biased.
Sammon maps (Fig. 10C) showed that it is also possible with PB encoding to find some relationship between different systems. C674R and P741R were the most distant systems compared with the WT, while the other variants studied are closer to the reference. Interestingly the two variants of residue 721 are not close (yellow /green dots) while the variants of residue 724 (brown/blue dots) are. These examples showed that 2 variations at a same position could lead to similar or different behaviours.
Calf-1 domain is an all-β structural domain9. Analysis of both B-factors obtained from crystallographic data and RMSF measured from simulations showed that the more flexible regions are connector loops, as expected. The usage of three different metrics, B-factor, RMSF and PBs related values (N eq and ΔPBs), provided a more complex yet comprehensive way to analyse protein flexibility locally. Comparison of these metrics revealed some interesting cases and some of these results are worth to be underlined. Loop 5 had the highest B-factor (where some atoms are missing43) and RMSF values. However, low N eq values indicate that a highly rigid zone forms the core of the loop. This underlines interesting contexts of mobility: rigid stretch encompassed in larger deformable region leads to high mobility (high B-factor and RMSF values) while locally the loop structure did not change (low N eq values). In the case of loop 2, its central residue P622 is associated to a high B-factor value, but neither high RMSF nor high N eq are. It is its neighbourhood regions that would have the highest mobility. Other interesting cases are positions V625-G626 (loop 2) and E642-G643 (loop 3) that had antinomic behaviours. Both had low B-factors, even if E642 is interacting in the crystal structure with Ca2+ ion. While residues V625-G626 had high RMSF and low N eq , mimicking the loop 5 behaviour in a lesser extent, residues 642 to 643 had medium RMSF but high N eq . This last case is more complex: structural variations though associated to several distinct conformations (high N eq ) were highly local, limiting the RMSF values.
These results underline the interest of N eq computation in regards to classical measures and reveal the high complexity of MD thus enabling to highlight a single or only few critical residues.
The most surprising result concerns the minimal or absent structural effect observed at the aa variation site. Apart from the two very specific cases of C674R (leading to disulfide bridge disruption) or R724P (replacement by a proline) mutants, in all other studied GT variants, local conformations are maintained through new molecular interactions. Nonetheless such compensatory mechanisms cannot account for impaired αIIβ3 expression associated with the 7 GT variants tested. Another surprising result is that most important alterations are located at long distances from the mutation sites. These alterations increase mobility or deformability of the same adaptable regions (e.g. loops 2, 3 and 8) in all variants, a new concept to biomolecular structural biology as defined by R. Nussinov44, 45. Furthermore although RMSF profiles of the WT and the 7 variants were very similar (Fig. 5), it is particularly surprising that N eq values of most loops are affected, sometimes drastically (Fig. 9). Only loops 4 and 7 presenting very low N eq values in the WT form, were least affected.
The 7 variants tested under the present work severely impacted the αIIbβ3 integrin expression. Using NN-Splice46 and GENSCAN47 on-line softwares, the genomic mutations involved in the aa variations studied were not predicted to induce splicing defects (Results not shown). Thence these aa variations should alter the biosynthesis of the αIIbβ3 complex by structural alterations. Hence, these variant structures represent common and specific structural effects regarding the WT Calf-1 domain (ΔN eq and ΔPBs). Observed modifications rather concerns structure dynamics than large structural alterations (loss of interaction10) as previously seen in other studies. These results raise the possibility that current explanations of GT type I and II phenotypes regarding structure alterations might be reconsidered. In fact our results point out that structural mechanisms leading to an impaired expression of αIIbβ3 integrin can be far more complex than expected. It must be noted that our study did not allow identifying which of the observed effects are responsible for the pathogenicity of the aa variations tested. Interestingly, two mutations at the same site (L721R/L721V and R724P/R724Q) can lead to similar or different effects, underlining that long-range effect depends upon the localization of the variations and also on its nature. Regardless of the effect, all variations severely impaired the expression of αIIbβ3 integrin.
The expression of a functional αIIbβ3 integrin at the platelet surface requires: (1) the synthesis of each subunit; (2) their association and (3) their consecutive post-translational maturation. Hence the situation can be far more complex when considering the different aspects of the biosynthesis processing of proteins. Allosteric alterations identified (or suggested) here may affect conformations of the integrin complex not described by our reference structure - the initial structure obtained from a crystallized, bent, inactivated form of αIIbβ3. Interestingly, a study by Mitchell et al.48 suggested that the αIIb adopts a bent form early in its synthesis process. Finally functions of other proteins like chaperones or translocon potentially involved in the synthesis process could also be affected by αIIbβ3 structural alterations.
Although difficult to interpret in terms of pathogenicity our results showcased that aa substitutions identified in GT induced long-range alterations in the dynamics of the Calf-1 domain. To explore new potential pathogenic mechanisms, we are currently addressing the role of other variants in allosteric alterations of αIIbβ3 and particularly the role of neutral variants regarding the expression of the integrin αIIbβ3.
References
Hynes, R. O. Integrins: bidirectional, allosteric signaling machines. Cell 110, 673–687 (2002).
Zhu, J., Zhu, J. & Springer, T. A. Complete integrin headpiece opening in eight steps. J. Cell Biol. 201, 1053–1068 (2013).
Zhang, K. & Chen, J. The regulation of integrin function by divalent cations. Cell Adh. Migr. 6, 20–29 (2012).
Xiao, T., Takagi, J., Coller, B. S., Wang, J. H. & Springer, T. A. Structural basis for allostery in integrins and binding to fibrinogen-mimetic therapeutics. Nature 432, 59–67 (2004).
Xiong, J. P., Stehle, T. & Diefenbach, B. et al. Crystal structure of the extracellular segment of integrin alpha Vbeta3. Science 294, 339–345 (2001).
Coller, B. S. & Shattil, S. J. The GPIIb/IIIa (integrin alphaIIbbeta3) odyssey: a technology-driven saga of a receptor with twists, turns, and even a bend. Blood 112, 3011–3025 (2008).
Nurden, A. T., Pillois, X. & Wilcox, D. A. Glanzmann thrombasthenia: state of the art and future directions. Semin. Thromb. Hemost. 39, 642–655 (2013).
Curtis, B. R. Recent progress in understanding the pathogenesis of fetal and neonatal alloimmune thrombocytopenia. Br. J. Haematol. 171, 671–682 (2015).
Zhu, J., Luo, B. H. & Xiao, T. et al. Structure of a complete integrin ectodomain in a physiologic resting state and activation and deactivation by applied forces. Mol. Cell 32, 849–861 (2008).
Jallu, V., Dusseaux, M. & Panzer, S. et al. AlphaIIbbeta3 integrin: new allelic variants in Glanzmann thrombasthenia, effects on ITGA2B and ITGB3 mRNA splicing, expression, and structure-function. Hum. Mutat. 31, 237–246 (2010).
Jallu, V., Poulain, P., Fuchs, P. F. J., plan, C. & evern, A. G. Modeling and molecular dynamics of HPA-1a and -1b polymorphisms: Effects on the structure of the β3 subunit of the αIIbβ3 integrin. PLoS. One. 7, 1–10 (2012).
Jallu, V., Poulain, P., Fuchs, P. F., Kaplan, C. & de Brevern, A. G. Modeling and molecular dynamics simulations of the V33 variant of the integrin subunit beta3: Structural comparison with the L33 (HPA-1a) and P33 (HPA-1b) variants. Biochimie 105, 84–90 (2014).
Sillitoe, I., Lewis, T. E. & Cuff, A. et al. CATH: comprehensive structural and functional annotations for genome sequences. Nucleic Acids Res. 43, D376–D381 (2015).
Sali, A. & Blundell, T. L. Comparative protein modelling by satisfaction of spatial restraints. J. Mol. Biol. 234, 779–815 (1993).
Webb, B. & Sali, A. Comparative Protein Structure Modeling Using MODELLER. Curr. Protoc. Bioinformatics. 47, 5–32 (2014).
DeLano, W. L. The PyMOL Molecular Graphics System on World Wide Web. http://www.pymol.org. (2013).
Krivov, G. G., Shapovalov, M. V. & Dunbrack, R. L. Jr. Improved prediction of protein side-chain conformations with SCWRL4. Proteins 77, 778–795 (2009).
Hess, B., Kutzner, C., van der Spoel, D. & Lindahl, E. GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. J.Chem. Theory Comput. 4, 435–447 (2008).
Jorgensen, W. L., Maxwell, D. S. & Tirado-Rives, J. Development and testing of the OPLS All-atom force field on conformational energetics and properties of organic liquids. J Am Chem Soc 118, 11225–11236 (1996).
Tina, K. G., Bhadra, R. & Srinivasan, N. PIC: Protein Interactions Calculator. Nucleic Acids Res. 35, W473–W476 (2007).
Joseph, A. P., Agarwal, G. & Mahajan, S. et al. A short survey on protein blocks. Biophys. Rev. 2, 137–147 (2010).
de Brevern, A. G., Etchebest, C. & Hazout, S. Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks. Proteins 41, 271–287 (2000).
Joseph, A. P., Srinivasan, N. & de Brevern, A. G. Progressive structure-based alignment of homologous proteins: Adopting sequence comparison strategies. Biochimie 94, 2025–2034 (2012).
de Brevern, A. G., Wong, H. & Tournamille, C. et al. A structural model of a seven-transmembrane helix receptor: the Duffy antigen/receptor for chemokine (DARC). Biochim. Biophys. Acta 1724, 288–306 (2005).
de Brevern, A. G. New assessment of a structural alphabet. In Silico. Biol. 5, 283–289 (2005).
Poulain, P. PBxplore: A program to explore protein structures with Protein Blocks. Technical report. https://github.com/pierrepo/PBxplore (2016).
Pillitteri, D., Pilgrimm, A. K. & Kirchmaier, C. M. Novel Mutations in the GPIIb and GPIIIa Genes in Glanzmann Thrombasthenia. Transfus. Med. Hemother. 37, 268–277 (2010).
D’Andrea, G., Colaizzo, D. & Vecchione, G. et al. Glanzmann’s thrombasthenia: Identification of 19 new mutations in 30 patients. Thromb Haemost 87, 1034–1042 (2002).
Vijapurkar, M., Ghosh, K. & Shetty, S. Novel mutations in GP IIb gene in Glanzmann’s thrombasthenia from India. Platelets. 20, 35–40 (2009).
Nurden, A. T., Pillois, X. & Fiore, M. et al. Expanding the Mutation Spectrum Affecting alphaIIbbeta3 Integrin in Glanzmann Thrombasthenia: Screening of the ITGA2B and ITGB3 Genes in a Large International Cohort. Hum. Mutat. 36, 548–561 (2015).
Gonzalez-Manchon, C., Fernandez-Pinel, M. & Arias-Salgado, E. G. et al. Molecular genetic analysis of a compound heterozygote for the glycoprotein (GP) IIb gene associated with Glanzmann’s thrombasthenia: disruption of the 674-687 disulfide bridge in GPIIb prevents surface exposure of GPIIb-IIIa complexes. Blood 93, 866–875 (1999).
Franchini, M., Favaloro, E. J. & Lippi, G. Glanzmann thrombasthenia: an update. Clin. Chim. Acta 411, 1–6 (2010).
George, J. N., Caen, J. P. & Nurden, A. T. Glanzmann’s thrombasthenia: the spectrum of clinical disease. Blood 75, 1383–1395 (1990).
Schlessinger, A. & Rost, B. Protein flexibility and rigidity predicted from sequence. Proteins 61, 115–126 (2005).
Craveur, P., Joseph, A. P. & Esque, J. et al. Protein flexibility in the light of structural alphabets. Front Mol. Biosci. 2, 20 (2015).
Carugo, O. & Argos, P. Protein-protein crystal-packing contacts. Protein Sci. 6, 2261–2263 (1997).
Bornot, A., Etchebest, C. & de Brevern, A. G. A new prediction strategy for long local protein structures using an original description. Proteins 76, 570–587 (2009).
Gelly, J. C. & de Brevern, A. G. Protein Peeling 3D: New tools for analyzing protein structures. Bioinformatics. 27, 132–133 (2011).
Macarthur, M. W. & Thornton, J. M. Influence of proline residues on protein conformation. J Mol. Biol. 218, 397–412 (1991).
Mansour, W., Einav, Y. & Hauschner, H. et al. An alphaIIb mutation in patients with Glanzmann thrombasthenia located in the N-terminus of blade 1 of the beta-propeller (Asn2Asp) disrupts a calcium binding site in blade 6. J. Thromb. Haemost. 9, 192–200 (2011).
Pillois, X., Fiore, M., Heilig, R., Pico, M. & Nurden, A. T. A novel amino acid substitution of integrin alphaIIb in Glanzmann thrombasthenia confirms that the N-terminal region of the receptor plays a role in maintaining beta-propeller structure. Platelets. 24, 77–80 (2013).
Sammon, J. W. A non-linear mapping for data structure analysis. IEEE Transactions on Computers C-18, 401–409 (1969).
Djinovic-Carugo, K. & Carugo, O. Missing strings of residues in protein crystal structures. Intrinsically. Disord. Proteins 3, e1095697 (2015).
Liu, J. & Nussinov, R. Allostery: An Overview of Its History, Concepts, Methods, and Applications. PLoS. Comput. Biol. 12, e1004966 (2016).
Nussinov, R. & Tsai, C. J. Allostery in disease and in drug discovery. Cell 153, 293–305 (2013).
Reese, M. G., Eeckman, F. H., Kulp, D. & Haussler, D. Improved splice site detection in Genie. J. Comput. Biol. 4, 311–323 (1997).
Burge, C. B. & Karlin, S. Finding the genes in genomic DNA. Curr. Opin. Struct. Biol. 8, 346–354 (1998).
Mitchell, W. B., Li, J. & Murcia, M. et al. Mapping early conformational changes in alphaIIb and beta3 during biogenesis reveals a potential mechanism for alphaIIbbeta3 adopting its bent conformation. Blood 109, 3725–3732 (2007).
Wheeler, T. J., Clements, J. & Finn, R. D. Skylign: a tool for creating informative, interactive logos representing sequence alignments and profile hidden Markov models. BMC. Bioinformatics. 15, 7 (2014).
Acknowledgements
We would like to thanks Nicolas Shinada, Catherine Etchebest and Jean-Christophe Gelly for fruitful discussions. We also would like to thank the anonymous reviewers who provided us with an excellent critique of our work with highly pertinent questions that has helped us to greatly improve this manuscript. This work was supported by grants from the Ministry of Research (France), University Paris Diderot, Sorbonne, Paris Cité (France), National Institute for Blood Transfusion (INTS, France), National Institute for Health and Medical Research (INSERM, France) and labex GR-Ex. The labex GR-Ex, reference ANR-11-LABX-0051 is funded by the program “Investissements d’avenir” of the French National Research Agency, reference ANR-11-IDEX-0005–02. Calculations were done on SGI cluster granted by Conseil Régional Ile de France and INTS (SESAME Grant). TjrN and AdB acknowledge to Indo-French Centre for the Promotion of Advanced Research/CEFIPRA for collaborative grant (number 5302–2).
Author information
Authors and Affiliations
Contributions
A.G.de.B. and V.J. designed the experiments; M.G. and T.N. conducted the experiments; M.G., T.N., A.G.de.B. and V.J. analysed the data; T.N., A.G.de.B., R.P. and V.J. wrote the manuscript; A.G.de.B. and V.J. supervised the research. All authors approved the final manuscript. The authors reported no potential conflicts of interest.
Corresponding author
Ethics declarations
Competing Interests
The authors declare that they have no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Goguet, M., Narwani, T.J., Petermann, R. et al. In silico analysis of Glanzmann variants of Calf-1 domain of αIIbβ3 integrin revealed dynamic allosteric effect. Sci Rep 7, 8001 (2017). https://doi.org/10.1038/s41598-017-08408-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-017-08408-w
This article is cited by
-
Molecular mechanisms of catch bonds and their implications for platelet hemostasis
Biophysical Reviews (2023)
-
iPBAvizu: a PyMOL plugin for an efficient 3D protein structure superimposition approach
Source Code for Biology and Medicine (2019)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.