Multibody cofactor and substrate molecular recognition in the myo-inositol monophosphatase enzyme

Ferruz, Noelia; Tresadern, Gary; Pineda-Lucena, Antonio; De Fabritiis, Gianni

doi:10.1038/srep30275

Download PDF

Article
Open access
Published: 21 July 2016

Multibody cofactor and substrate molecular recognition in the myo-inositol monophosphatase enzyme

Noelia Ferruz^1,2,
Gary Tresadern³,
Antonio Pineda-Lucena⁴ &
…
Gianni De Fabritiis^1,5

Scientific Reports volume 6, Article number: 30275 (2016) Cite this article

3102 Accesses
13 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Molecular recognition is rarely a two-body protein-ligand problem, as it often involves the dynamic interplay of multiple molecules that together control the binding process. Myo-inositol monophosphatase (IMPase), a drug target for bipolar disorder, depends on 3 Mg²⁺ ions as cofactor for its catalytic activity. Although the crystallographic pose of the pre-catalytic complex is well characterized, the binding process by which substrate, cofactor and protein cooperate is essentially unknown. Here, we have characterized cofactor and substrate cooperative binding by means of large-scale molecular dynamics. Our study showed the first and second Mg²⁺ ions identify the binding pocket with fast kinetics whereas the third ion presents a much higher energy barrier. Substrate binding can occur in cooperation with cofactor, or alone to a binary or ternary cofactor-IMPase complex, although the last scenario occurs several orders of magnitude faster. Our atomic description of the three-body mechanism offers a particularly challenging example of pathway reconstruction, and may prove particularly useful in realistic contexts where water, ions, cofactors or other entities cooperate and modulate the binding process.

Protein–ligand binding with the coarse-grained Martini model

Article Open access 24 July 2020

Substrate promiscuity of inositol 1,4,5-trisphosphate kinase driven by structurally-modified ligands and active site plasticity

Article Open access 19 February 2024

Competition between electrostatic interactions and halogen bonding in the protein–ligand system: structural and thermodynamic studies of 5,6-dibromobenzotriazole-hCK2α complexes

Article Open access 08 November 2022

Introduction

Bipolar disorder is a serious medical illness where episodes of mania alternate with depression. It currently affects more than 254 million people worldwide and is one of the major causes of loss of health and suicide in the middle-aged population¹. Since the anti-manic properties of lithium were first reported more than 60 years ago², it has been the most widely used treatment for bipolar disorder. Unfortunately, the ion’s therapeutic window is very narrow and it is accompanied by severe toxicity issues and side-effects such as tremors, frequent urination, thyroid problems, weight gain and kidney failure³. Therefore, it is desirable to replace it with a more harmless treatment.

The discovery that lithium intake diminishes brain inositol levels⁴ led to the formulation of the ‘Inositol depletion hypothesis⁴’ where the ion is proposed to mitigate neurotransmitters in the phosphatidyl inositol (PI) pathway (Fig. 1), overactive in bipolar patients⁵. Myo-inositol monophosphatase (IMPase) plays a key role in the PI pathway, by hydrolyzing myo-inositol monophosphate (IP). IMPase’s activity in patients suffering from bipolar disorder is assumed to be higher than normal, thus increasing myo-inositol levels. IMPase is also specifically inhibited by therapeutic (0.5–1.5 mM) concentrations of lithium⁶, and that is why it has been traditionally proposed as the putative target of the inositol depletion theory^7,8. Accordingly, IMPase has been the subject of major industrial and academic research for bipolar treatment, and despite the myriad of inhibitors that have been tested over the last years, all have shown poor bioavailability or difficulties to reach the site of action in vivo^8,9,10,11,12.

**Figure 1: phosphatidyl inositol (PI) pathway and the role of IMPase.**

There are two main reasons for the failure in finding a bioavailable drug inhibiting IMPase. Firstly, the structure of IMPase reveals a difficult binding pocket for drug-like compounds. More concretely, mammalian IMPase have been crystalized from murine¹³, bovine¹⁴ and human¹⁵ brain and show a homodimer of 60 kDa, with each subunit consisting of a penta-layered αβαβα sandwich formed by alternating 9 α-helices and 13 β-strands (Fig. 2a). The active site of IMPase is a highly hydrophilic cavity lying beneath a β-hairpin region which is thought to play a critical role in the enzyme function^16,17,18. To recognize the IP substrate the catalytic cavity is a highly polar pocket which favors polar charged compounds, typically unable to cross the blood-brain barrier (BBB)¹⁹. Secondly, although the structural conformation upon substrate and cofactor binding is well defined, its kinetic mechanism is still not clear. A recently solved human crystal IMPase structure in complex with Mg²⁺ and phosphate showed a catalytic pocket with 3 Mg²⁺ and superimposable with previous structures¹³. Mg²⁺ in site I, to which we will refer as Mg-I throughout this work, binds Glu70, Asp90 the carboxyl group of Ile92, three water molecules and the phosphate group. Mg²⁺ in site II (Mg-II) is coordinated with Asp90, Asp93, Asp220 the phosphate group and three water molecules, one being shared with Mg-I. The more external Mg²⁺ site III (Mg-III) is only coordinated by Glu70, the phosphate group and 5 water molecules (Fig. 2b). Different experiments have suggested that the three Mg²⁺ must occupy the catalytic pocket for the accomplishment of the reaction^14,20,21. Attempts to quantify Mg²⁺ binding showed that the three ions bind with decreasing affinity: Mg-I with a K_D of 300 μM²², Mg-II, K_D = 3.9 mM²³ and low affinity Mg-III. Mg²⁺ concentration in neurons range from 0.5 to 1 mM and therefore the real occupancy at physiological conditions is unclear²⁴. Whereas some studies proved the enzyme is doubly bound in neurons and the third Mg²⁺ binds after substrate¹⁷, another suggested the presence of three Mg²⁺ in the absence of substrates¹⁴.

**Figure 2: Structural features IMPase. PDB code 4AS4.**

It is therefore important to determine the mechanism of binding and the most populated states of the protein under physiological conditions in order to provide the basis for the rational design of new inhibitors. Here, we have performed an unprecedented 0.8 milliseconds of all-atom high-throughput molecular dynamics simulations in order to ascertain the concrete mechanism of binding of Mg²⁺ and the pathway of binding of the natural substrate.

Results

In all in-silico binding analyses, full kinetic and thermodynamic data were obtained by performing free-ligand binding²⁴, all-atom molecular dynamics simulations with the ACEMD²⁵ molecular dynamics software on the distributed computing project GPUGRID²⁶. The data were analysed using the HTMD software²⁷ (available at http://www.htmd.org) and a Markov state modelling (MSM)²⁸ method able to produce quantitative estimations of k_on, k_off and ∆G⁰. MSM analyses have been successfully used in a wide range of problems from ligand binding^24,29 to the characterisation of protein folding²⁸ and intrinsically disordered protein dynamics³⁰. In this work, we provide a comprehensive study on IMPase enzyme. By means of HTMD, we explain the full picture of cofactor and substrate binding. A total of 6 simulation batches have been performed (Table 1). Four of them focused on understanding the IMPase mode of action in the absence of substrates or inorganic molecules, and the other two, on the natural substrate (IP)’s kinetic mechanism. More details on the specific simulation parameters are provided in the Methods section.

Table 1 Summary of performed simulations, grouped by batches.

Full size table

IMPase mode of action in the absence of substrates

Crystallographic studies have shown that there are three Mg²⁺ ions per subunit in the presence of substrates or inorganic phosphate^13,14,15,31. However, some studies have proved IMPase is doubly bound in neurons and the third Mg²⁺ binds after the substrate¹⁷. Given the mild affinities of Mg²⁺ in site-I and II, and the low Mg²⁺ concentration in neurons (0.5–1 mM) is unclear what is the occupancy of IMPase prior to substrate binding. Here, we have examined the binding of the three Mg²⁺ ions by looking at the recognition process one ion at a time (Fig. 3). For the sake of simplicity, we will call binding event of Mg-I as event I, and subsequently event II and event III throughout the study.

**Figure 3: Summary of results for Mg²⁺ binding.**

Binding of Mg-I: event I

For the analysis of event I, one single Mg²⁺ ion was placed free in solution around the apo-IMPase dimer giving a concentration of 3 mM, at which IMPase is maximally active³². The ion was placed at least 15 Å distance away from the protein in the initial systems’ coordinates, such as it could spontaneously identify its binding pocket without any bias. Using an adaptive sampling scheme³³, more than 4500 trajectories of 40 ns were performed in order to compute the binding affinity and rate constants against each of the monomers. We analyzed the event I by performing one independent MSM analysis per subunit (see Methods), which provided remarkably consistent kinetic and affinity estimations between the monomers (Table 2). In quantitative terms, the standard free energy of binding is computed to be −3.8 ± 0.1 kcal/mol, slightly higher than previous dialysis (−4.8 kcal/mol)²² and fluorescence (−4.6 kcal/mol)³⁴ experiments. There are no available crystal structures containing only one Mg²⁺ bound, however, in binary or ternary complexes Mg-I appears coordinated to three protein residues (Glu70, Asp90, Ile92) and three water molecules. Here, in the absence of Mg-II we observed that it tends to coordinate both characteristic site I and II residues and one single water molecule (Fig. 3). The transition to this pose occurs with very fast kinetics, in the order of 10⁸ M⁻¹ s⁻¹.

Table 2 Kinetic and thermodynamic characterization of Mg²⁺ binding obtained by MSM analysis.

Full size table

Binding of Mg-II: event II

Reconstruction of event II was performed in a similar fashion. Mg-I was placed on each monomer coordinating both site I and II residues as obtained in event I’s analysis. Then a single Mg-II ion was placed free in solution, at least 15 Å distance from the protein. Accounting for three total Mg²⁺ ions, the concentration was 8 mM. We then performed another MSM analysis per monomer, as previously done for event I’s analysis. The analysis provided two poses with similar affinity in the first monomer, whereas in the other we lacked statistics to get a converged model. Looking at the first monomer’s poses, it was observed Mg-II tended to approach the catalytic pocket by interacting in two different binding sites other than the crystallographic one, as was site-II in coordination with Mg-I. Both poses are in the vicinity of Mg-I, the first interacting with Glu213, and the other with Glu71 (Fig. 3b, event II). We then analyzed Mg-I’s stability, in order to see transitions towards the crystallographic site. Taking into account the 277 μs produced in this and the previous set, we performed a root mean squared deviation (RMSD) of Mg-I aligning all Cα atoms in the protein against the crystal coordinates in 4AS4¹³. Only in 4 trajectories out of more than 5000, Mg-I evolved to the crystallographic position with an RMSD lower than 1 Å. Thorne et al. performed stopped-flow fluorescence spectroscopy studies to determine association and dissociation constants for Mg-I and II³⁴. The study showed a slow increase in fluorescence after a rapid binding of Mg-I, suggesting that Mg-I binding is followed by a subtle structural rearrangement in the microenvironment of site I. The obtained on-rate was 4.4 ± 0.18·10⁵ M⁻¹ s⁻¹, whereas our estimates for Mg-I’s kinetics revealed a much faster process. Hence, event I showed very fast kinetics in our experiment, and our analysis did not recover the crystal pose, contrary to the equilibrium-based fluorescence experiments. Although our pose and the crystallographic one are only 3.3 Å RMSD away, subtle differences in neighboring atoms confer completely different octahedral coordination for Mg-I. We argue that the rearrangement of the negatively charged residues around these ions needed to reach its exact coordination found in X-ray is plausibly a much slower process in line with the experimental observations. Note that given the experimental kinetics, in order to sample such slow rearrangements, it would be needed to produce multi-millisecond simulations length that is beyond current capabilities.

Binding of Mg-III: event III

For the analysis of event III, 236 μs of simulation time were produced. Mg-III was placed around IMPase, whereas Mg-I and II were located in their X-ray sites in both subunits this time, as the rearrangement to crystallographic positions was shown to be a very slow process. Interestingly, performing an RMSD analysis of the 3900 trajectories against the coordinates in 4AS4¹³ did not provide any spontaneous event III under these low concentration conditions. By performing an MSM analysis, we observed that at this concentration and timescale Mg-III binds at the interface between monomers (Fig. S1). Anticipating that event III is governed by very slow kinetics, we increased the concentration to 54 mM, with a total of 20 Mg²⁺ ions in the box. In 100 μs of simulation time, only 1 complete binding event was recovered per monomer, providing an RMSD of 2 Å to the crystal position of Mg-III¹³. The computed binding frequency is of the order of 0.01 μs⁻¹, which, taking into account the concentration yields an on-rate of 1.85·10⁵ M⁻¹ s⁻¹.

Mg²⁺ is known to exert a bimodal activation on IMPase depending on its concentration. At low concentrations, as in neurons, Mg²⁺ acts as an activator being maximally active at 1 mM. At higher concentrations (>20 mM), it acts as non-competitive inhibitor. We performed our simulations at concentrations at which IMPase would only have residual activity hydrolyzing IP. However, inhibition by high concentrations is thought to be due to product trapping^31,32,35. In a recent crystallographic study, it was concluded that both Li⁺ and Mg²⁺ do not interfere with the catalytic reaction, but stabilize the post-catalytic complex instead³¹. These data taken together, suggest that Mg-III can bind to IMPase to form a ternary complex even in the absence of inorganic phosphate, natural substrates or inhibitors, but we cannot estimate the affinity due to the millisecond binding timescale.

Substrate pathway reconstruction

IMPase’s catalytic mechanism on IP’s hydrolysis has been the subject of several studies. For many years, the enzyme was thought to operate via two Mg²⁺ ions. Pollack et al.¹⁶ proposed a mechanism in which Mg-I acted as the water nucleophile activator while Mg-II as a stabilizer for the leaving inositol. More recent observations have favored a hydrolysis operating via three Mg²⁺ ions instead²⁰. Despite the pre-catalytic complex being well characterized, the steps leading to its formation are not yet clear. Whereas some studies supported a random mechanism²², others inclined towards an ordered mechanism, with the substrate binding IMPase, and only Mg-III binding after¹⁷. Some other studies favored the presence of three Mg²⁺ in the absence of substrates¹⁴.

Assuming that once each of the Mg²⁺ ions binds in their corresponding positions and remain stable for timescales much longer than the substrate binding, we can consider bound ions as virtually covalent. For simplicity, we will refer to the protein states in which there are two and three Mg²⁺ bound per subunit as IMPase-II and IMPase-III. The exchange between IMPase-II and IMPase-III occurs in very long timescales, up to several milliseconds. This leads to a partition of IMPase’s conformational space into two different kinetic pathways. The binding of the natural substrate could present different relative affinities for IMPase-II and IMPase-III, leading to conformational selection³⁶, and could be able to shift the equilibrium towards any of the conformers by induced-fit³⁷.

In order to understand the sequence of events prior to the formation of the pre-catalytic complex, the binding was performed against these two protein states. The first system contained IMPase-II, and the two remaining Mg²⁺ corresponding to site III were randomly placed in solution. The second system contained IMPase-III, no Mg²⁺ ions present in bulk. A total of five IP molecules were placed around the enzyme in both cases. The final substrate and ion concentration were set to 12 and 5 mM, respectively. The two systems were therefore thermodynamically identical.

Taking into account the two systems, 155.8 μs of total sampling time was produced. The IP molecules carry two negative charges in the phosphate group, and therefore the interaction with other ligand molecules is avoided by electrostatic repulsion. Still, the ligands performed short-lived interactions among themselves or through Mg²⁺ bridges, the same way it could be expected in an experiment at this concentration. For our analysis, we treated each ligand interaction against IMPase as an independent trajectory from other ligands. The results show many IP binding events to IMPase-III and to IMPase-II, in coordination with Mg²⁺ or alone. In order to understand the main pathways of binding and provide kinetic estimates, an MSM was produced gathering the two simulation sets. In this analysis, the contacts between substrate and protein were mapped and geometrically clustered. After, each cluster was further split taking into account whether the Mg²⁺ ion was coordinated or not with IP’s phosphate group at a shorter distance than 4 Å (see Methods).

Five final states showing IP free in solution, bound, or in metastable states were obtained. Figure 4b summarizes the transitions among them and their specific binding modes. State 1, corresponds to bulk or the initial state in the reaction pathway, that is to say, when IP is free in solution. State 2, located at the interface between subunits, does not directly convert to the other states without reverting to bulk. This binding pose is independent of IMPase’s coordination and occurs both in IMPase-II and III. IP does not interact to any Mg²⁺ ion in this pose. States 3 and 4, correspond to short-lived states in the pathway of IP binding. Lastly, state 5, corresponds to the bound pose in the catalytic pocket of IMPase-III. The pose obtained through our analysis, overlaps well with the crystal structure (Fig. 5). Under the case of a conventional non-covalent reversible binding, IP would bind with a Gibbs free energy of −7.1 ± 0.3 kcal/mol, as computed as the ratio between its off and on rates. Once bound, state 5 presents a residence time of several milliseconds. IMPase’s k_cat is 22 + 3 s⁻¹ at 0.5 mM IP concentration, and therefore hydrolyzes an IP molecule in 45 ms. With this turnover number, IMPase-III plausibly hydrolyzes IP molecules once they have reached state 5, shifting metastable states 3 and 4 towards the bound pose. Full kinetic and quantitative data is presented in Table S1.

**Figure 4: Overview of substrate mechanism of binding.**

**Figure 5: Pose from the most probable state in the analysis of IP binding superimposing with the available crystal structures (PDB code 1AWB)**

IP can reach the bound state through three different pathways of binding. The fastest binding route corresponds to the direct binding to IMPase-III from bulk. The average binding occurs in a time between 2.6 and 5.8 μs. A second, slower binding pathway consists on a two-step binding mechanism. The first rate-limiting step comprises the binding of the IP-Mg-III complex to IMPase-II’s catalytic pocket, occurring in 0.8–3.4 ms (state 4). Visual inspection of the trajectories leading to this state showed that although IP-Mg-III reaches the pocket as a complex, once inside the protein residues are able to dissociate the complex more than 4 Å apart. Thus, the MSM analysis detected a state represented by Mg-III and IP in disordered positions inside IMPase-II’s pocket. The formation of this metastable state is followed by a quick reordering of the complex towards the bound pose in 1–2 μs. The third, slowest mechanism consists on the binding IP alone to the vicinities of the catalytic pocket (state 3), with a time about 1–10 ms, followed by a much faster step occurring in a few microseconds. The longer rearrangement times for this third pathway regarding the second could be due to the formation of di-IP-Mg²⁺ complexes or the longer distance to the active site.

Looking at the different binding pathways in relative terms, we see that the first, single-step binding pathway occurs three orders of magnitude faster than the two others, and could in practical terms be the only pathway of binding. Assuming equal populations of the two IMPase forms, the substrate could reach the pre-hydrolysis pose over one thousand times through the direct pathway in the time it would need through the second or third pathways. Note, however, that the real populations of IMPase-II and IMPase-III remain unknown. We never observed the Mg-III’s unbinding event and neither the off-rate nor the equilibrium constant can be estimated or are present in literature.

From the binding event, we have estimated that the shift from IMPase-II to IMPase-III takes several milliseconds in the absence of IP, as deduced from the process’ on-rate of 1.85 · 10⁵ M⁻¹ s⁻¹ (Fig. 4a). However, in the presence of IP, the equilibrium is shifted to the right: once the substrate is bound to IMPase-II, the cofactor reaches site III in a few microseconds. These facts can easily be explained in terms of electrostatic repulsion. apo-IMPase presents a highly polar pocket composed by four acidic residues (Glu70, Asp90, Asp93, and Asp220), totaling up to four negative charges in close vicinity. The binding of Mg-I and Mg-II, each carrying two positive charges, neutralizes the pocket. The binding of Mg-III under these conditions would in principle not be very favorable, thus explaining its very slow kinetics. However, the natural substrate presents two negative charges on its phosphate group. The great differences we observe for substrate binding to IMPase-II and IMPase-III are also possibly a consequence of the pocket’s total charge differences: whereas IP’s binding to the doubly positively charged IMPase-III’s pocket is diffusion-controlled, the binding to neutral IMPase-II’s pocket takes a few milliseconds regardless its pathway.

Discussion

We have fully characterized substrate and cofactor binding prior to the catalytic event. The first study concluded that the protein is able to form a ternary complex with Mg²⁺ ions, even in the absence of substrate, inhibitors or inorganic phosphate, but its population could not be reliably measured. Our study shows that Mg-I and II’s pocket identification is diffusion limited, whereas subsequent rearrangement of coordinating residues takes several milliseconds. The binding of Mg-III, although difficult to observe at physiological concentrations, could be recorded at 20 mM, giving a mean first passage time (mfpt) estimation of around 50 ms at 1 mM.

We have also provided an atomic-level description of substrate and cofactor cooperation and binding. IP is able to bind both IMPase-II and III forms to different extents. The substrate shows a very fast binding, occurring in a mean time of 4 μs, to IMPase-III. Additionally, the molecule is also able to bind to IMPase-II although in a slower fashion. Both either accompanied by Mg-III or alone it identifies IMPase-II in a millisecond two-step reaction. Interestingly, although event III is very difficult to observe even in high-throughput simulations as done here, the process speeds up by 3 orders of magnitude in the presence of IP. These facts are easily supported by the drastic net charge changes at IMPase’s pocket.

The mechanism presented complements previous studies on deciphering the order of substrate and cofactor binding. The most recent proposed studies agreed with the Leech et al.’s ordered mechanism, with substrate binding first and modifying or creating the binding site for one or two metals binding after³⁸. However, this mechanism did not account with the 3-metal structures that later emerged³⁹; and subsequent analysis proposed cofactor binding before and after substrate binding^17,32. Our mechanism shows that substrate binding after cofactor is kinetically favored, however we have shown that cofactor binding before and along with substrate is also possible, and could be the only pathway in the scenario where IMPase-III’s population was marginal. Actually, the versatility here presented could explain the diversity in previous studies, and those which observed a random-ordered mechanism^35,40. Biochemical experimental studies on IMPase’s gained a lot of attention two decades ago, however, difficulties to design bioavailable inhibitors prompted research on IMPase’s to come to a halt. We hope that future experiments can further progress our understanding of IMPase’s function and in particular help characterize the IMPase conformational space. We have used arguably the most advanced computational methods and infrastructure for exploring enzyme dynamics and whilst we are limited by the use of empirical force fields and possible difficulties of parameterization, we have been able to shed light onto important biochemical questions arising from previous experimental work. We note, however, that the picture of binding presented here, although capturing the different substrate binding routes, is only a portion of the complete conformational space. In particular, IMPase is known to present two segments in the entrance of its catalytic pocket (the β-hairpin region comprised by residues 30–40 and the short helix comprised by residues 70–75) which appear to be disordered in the absence of ions, and could undergo several rearrangements upon substrate and metal binding^17,39. We have, of course, observed protein plasticity along all the simulation sets performed in these regions. Nevertheless, given all our structures started from active conformation such movements are not representative of the global protein conformation space. Helix and β-sheets formation occur at timescales much slower⁴¹ than our ensembles and we do not have enough data at this stage to provide a solid study on the role such segments.

In summary, we have used large-scale HTMD and been able to recapitulate the binding events of Mg ions and natural substrate at IMPase, and we identified structures close to the X-ray solutions. In addition, our methodology also provides important information about the competition, cooperativity and kinetics of the binding pathways in this complex three-body process. We propose that the pathway diversity seen here might not be a particular case of IMPase, but a general principle for ligand binding. The ligand-binding paradigm is rarely a two-body problem (drug and receptor) because, water, or in particular ions, can play a critical role. We note that the quantitative study of IMPase’s mode of action is a particularly challenging case. The highly polar nature of the enzyme’s pocket, the metal parameterization, long timescales of the processes and the three-body mechanism of binding might not be the case of other targets. Still, with the advances in computational infrastructure, forcefield and analysis methods we believe that this approach can provide insight to understanding binding pathways for difficult targets like this. We expect that in the near future approaches similar to the one presented will become common in the early stage of the drug discovery pipeline. Such a study can provide a deeper understanding of the binding processes and the endogenous population of the active site, essential aspects of lead finding and optimization.

Material and Methods

Simulation system setup and simulation parameters

Input coordinates for human IMPase protein were based on the PDB code 4AS4¹³. The AMBER FF12SB⁴² forcefield was used to describe all the protein parameters. Mg²⁺ parameters were taken from a previous study⁴³ where the parameters were fitted against experimental data in order to provide a better description of their kinetic properties in water, also improving the phosphate binding description. All chemical entities were protonated with the OpenBabel software at pH 7.4⁴⁴ and parameterized by the Antechamber 12 tool⁴⁵. All the complexes were explicitly solvated by the LEAP module of the AMBER 12 software package in a TIP3P⁴⁶ cubic water box with at least 12 Å distance around the complex and then electrically neutralized using K⁺ and Cl⁻ ions. The final size of the systems was about 90000 atoms. The different cofactor and substrate concentrations are specified in Table 1.

Each system was minimized and relaxed under NPT conditions for 1 ns at 1atm and 298 K using a time-step of 4 fs, rigid bonds, a cut-off of 9 Å and PME for long-range electrostatics. Heavy protein and ligand atoms were constrained by a 1 kcal/mol/Å² spring constant during the equilibrations and gradually reduced. Production simulations were run using ACEMD over GPUGRID⁴⁷ in the NVT ensemble using a Langevin thermostat with damping of 0.1 ps⁻¹ and hydrogen mass repartitioning scheme to achieve timesteps of 4 fs⁴⁸. The total simulation times are summarized in Table 1.

Markov State Models

A Markov state model (MSM) for each of the systems was built from the molecular simulation trajectories. MSMs have been successfully used to reconstruct the equilibrium and kinetic properties in a large number of molecular systems^24,49,50. By determining the frequency of transitions between conformational states we were able to construct a master equation which describes the dynamics between a set of conformational states. Relevant states are determined geometrically by clustering the simulation data onto a metric space (e.g. contact maps). In this case, a discrete description of the process was obtained by means of protein-ligand contact maps. The carbon alpha atoms in the protein and the Mg²⁺ atom or the heavy atoms for the substrate molecule were selected for the construction of contact maps along all the trajectories. Two atoms are in contact if their distance is less than 8 Å.

One of the most important requirements for constructing Markov models is to be able to finely discretize the slowest order parameters. TICA⁵¹ (time-lagged independent component analysis) is a method that projects the data on the slow order parameters, thus producing a very good discretization. After projecting the high-dimensional protein-ligand contact maps onto the three slowest processes found by TICA with a 2 ns lag-time, the n-dimensional projected data was clustered using the k-centers algorithm. 3 and 5-dimensional projections were used for the analysis of Mg²⁺ binding and pathway reconstruction. The master equation is then built as

where P_i(t) is the probability of state i at time t, and k_ij are the transition rates from j to i, and K = (K_ij) is the rate matrix with elements K_ij = k_ij for i ≠ j and . The master equation dP/dt = K P has solution with initial condition P(0) given by P(t) = T(t) P(0), where we defined the transition probability matrix T_ij(t) = (exp[Kt])_ij = p(i,t|j,0), i.e. the probability of being in state i at time t, given that the system was in state j at time 0. In practical terms, p_ij(Δt) is estimated from the simulation trajectories for a given lag time Δt using a maximum likelihood estimator compatible with detailed balance⁵². The eigenvector π with eigenvalue 1 of the matrix T(Δt) corresponds to the stationary, equilibrium probability. Higher eigenvectors correspond to exponentially decaying relaxation modes for which the relaxation timescale is computed by the eigenvalue as , where λ_s is to the largest eigenvalue above 1. For long enough lag times the model will be Markovian, however every process faster than Δt is lost. Therefore, the shortest lag is chosen for which the relaxation timescales do not show a dependence on the lag time Δt anymore. In our case, we chose different lag times depending on the system, provided a good compromise between convergence in the implied timescale while being short enough to allow for sufficient statistical variance. Implied timescales and chosen lag times are shown in Table S2. Furthermore, although this fine discretization provides very good Markov models, it is necessary to reduce the amount of states to obtain a humanly interpretable model of the system in question. Therefore, the initial microstates can be lumped together into macrostates using kinetic information from the MSM eigenvector structure. Mean first passage times and commitor probabilities can also be calculated to obtain the relevant kinetics of the system⁵³. Hence, the produced clusters were then lumped together into macrostates using the PCCA algorithm, each consisting of a set of kinetically similar clusters. For the specific case of substrate pathway binding, the analysis was performed as follows. The 3870 trajectories were split into five independent IP trajectories, as each simulation comprised five molecules. This set contained 7801920 frames, each of which was transformed into a ligand-protein alpha carbon contact map where two atoms were considered in contact when closer than 8 Å. After performing a TICA projection onto the 5 slowest order parameters, the data was geometrically clustered using the k-centers algorithm into 925 clusters. Each of these clusters was further split taking into account if the substrate’s phosphate group was in contact with any of the two Mg-III ions or not. 26 new clusters were created giving a total of 951 clusters, which were subsequently used in the MSM model. The microstates were finally combined into 5 macrostates by PCCA, and their transitions and binding modes are represented in Fig. 4b and Table S1. Only bulk, state 4 and 5 contained clusters in which Mg²⁺ and IP were in contact.

Errors were estimated for all properties using a bootstrapping technique. We performed 7 independent runs in which 20% of the trajectories were randomly eliminated and a new MSM was built after re-clustering. On each of these runs, the same parameters as described above were applied.

Additional Information

How to cite this article: Ferruz, N. et al. Multibody cofactor and substrate molecular recognition in the myo-inositol monophosphatase enzyme. Sci. Rep. 6, 30275; doi: 10.1038/srep30275 (2016).

References

Equilibrium-The bipolar foundation.
CADE, J. F. J. Lithium salts in the treatment of psychotic excitement. Med. J. Aust. 2, 349–352 (1949).
Article CAS PubMed Google Scholar
Atack, J. R., Broughton, H. B. & Pollack, S. J. Inositol monophosphatase–a putative target for Li+ in the treatment of bipolar disorder. Trends Neurosci. 18, 343–349 (1995).
Article CAS PubMed Google Scholar
Berridge, M. J., Downes, C. P. & Hanley, M. R. Neural and developmental actions of lithium: a unifying hypothesis. Cell 59, 411–419 (1989).
Article CAS PubMed Google Scholar
Harwood, A. J. Lithium and bipolar mood disorder: the inositol-depletion hypothesis revisited. Mol. Psychiatry 10, 117–126 (2004).
Article CAS Google Scholar
Hallcher, L. M. & Sherman, W. R. The effects of lithium ion and other agents on the activity of myo-inositol-1-phosphatase from bovine brain. J. Biol. Chem. 255, 10896–10901 (1980).
CAS PubMed Google Scholar
Pollack, S. J. et al. Mechanism of inositol monophosphatase, the putative target of lithium therapy. Proc. Natl. Acad. Sci. USA 91, 5766–5770 (1994).
Article CAS ADS PubMed PubMed Central Google Scholar
Fauroux, C. M. J. & Freeman, S. Inhibitors of Inositol Monophosphatase. J. Enzyme Inhib. Med. Chem. 14, 97–108 (1999).
CAS Google Scholar
Atack, J. R. Inositol monophosphatase inhibitors—lithium mimetics? Med. Res. Rev. 17, 215–224 (1997).
Article CAS PubMed Google Scholar
Piettre, S. R., Ganzhorn, A., Hoflack, J., Islam, K. & Hornsperger, J.-M. α-Hydroxytropolones: A New Class of Potent Inhibitors of Inositol Monophosphatase and Other Bimetallic Enzymes. J. Am. Chem. Soc. 119, 3201–3204 (1997).
Article CAS Google Scholar
Bashir-Uddin Surfraz, M., Miller, D. J., Gani, D. & Allemann, R. K. Product-like inhibitors of inositol monophosphatase. Tetrahedron Lett. 44, 7677–7679 (2003).
Article CAS Google Scholar
Miller, D. J., Bashir-Uddin Surfraz, M., Akhtar, M., Gani, D. & Allemann, R. K. Removal of the phosphate group in mechanism-based inhibitors of inositol monophosphatase leads to unusual inhibitory activity. Org. Biomol. Chem. 2, 671–688 (2004).
Article CAS PubMed Google Scholar
Singh, N. et al. Cloning, expression, purification, crystallization and X-ray analysis of inositol monophosphatase from Mus musculus and Homo sapiens. Acta Crystallograph. Sect. F Struct. Biol. Cryst. Commun. 68, 1149–1152 (2012).
Article CAS Google Scholar
Gill, R. et al. High-resolution structure of myo-inositol monophosphatase, the putative target of lithium therapy. Acta Crystallogr. D Biol. Crystallogr 61, 545–555 (2005).
Article CAS PubMed Google Scholar
Bone, R., Springer, J. P. & Atack, J. R. Structure of inositol monophosphatase, the putative target of lithium therapy. Proc. Natl. Acad. Sci. USA 89, 10031–10035 (1992).
Article CAS ADS PubMed PubMed Central Google Scholar
Atack, J. R., Broughton, H. B. & Pollack, S. J. Structure and mechanism of inositol monophosphatase. FEBS Lett. 361, 1–7 (1995).
Article CAS PubMed Google Scholar
Ganzhorn, A. J. et al. The contribution of lysine-36 to catalysis by human myo-inositol monophosphatase. Biochemistry (Mosc.) 35, 10957–10966 (1996).
Article CAS Google Scholar
Whiting, P., Gee, N. S., Potter, J., Howell, S. & Ragan, C. I. Limited proteolysis and ‘in vitro’ mutagenesis of bovine brain inositol monophosphatase identifies an N-terminal region important for activity. Biochem. J. 272, 465–468 (1990).
Article CAS PubMed PubMed Central Google Scholar
Miller, D. J. & Allemann, R. K. myo-Inositol monophosphatase: a challenging target for mood stabilising drugs. Mini Rev. Med. Chem. 7, 107–113 (2007).
Article CAS PubMed Google Scholar
Ganzhorn, A. J. & Rondeau, J.-M. Structure of an Enzyme-Substrate Complex and the Catalytic Mechanism of Human Brain Myo-Inositol Monophosphatase. Protein Eng 10, 61–null (1997).
CAS Google Scholar
Lu, S. et al. Insights into the role of magnesium triad in myo-inositol monophosphatase: metal mechanism, substrate binding, and lithium therapy. J. Chem. Inf. Model. 52, 2398–2409 (2012).
Article CAS PubMed Google Scholar
Greasley, P. J., Hunt, L. G. & Gore, M. G. Bovine inositol monophosphatase. Ligand binding to pyrene-maleimide-labelled enzyme. Eur. J. Biochem. FEBS 222, 453–460 (1994).
Article CAS Google Scholar
Rees-Milton, K., Thorne, M., Greasley, P., Churchich, J. & Gore, M. G. Detection of metal binding to bovine inositol monophosphatase by changes in the near and far ultraviolet regions of the CD spectrum. Eur. J. Biochem. FEBS 246, 211–217 (1997).
Article CAS Google Scholar
Buch, I., Giorgino, T. & De Fabritiis, G. Complete reconstruction of an enzyme-inhibitor binding process by molecular dynamics simulations. Proc. Natl. Acad. Sci. 108, 10184–10189 (2011).
Article CAS ADS PubMed PubMed Central Google Scholar
Harvey, M. J., Giupponi, G. & Fabritiis, G. D. ACEMD: Accelerating Biomolecular Dynamics in the Microsecond Time Scale. J. Chem. Theory Comput. 5, 1632–1639 (2009).
Article CAS PubMed Google Scholar
Buch, I., Harvey, M. J., Giorgino, T., Anderson, D. P. & De Fabritiis, G. High-Throughput All-Atom Molecular Dynamics Simulations Using Distributed Computing. J. Chem. Inf. Model. 50, 397–403 (2010).
Article CAS PubMed Google Scholar
Doerr, S., Harvey, M. J., Noé, F., De Fabritiis, G. HTMD: High-throughput molecular dynamics for molecular discovery, J. Chem. Theory Comput, 12(4), 1845–1852 (2016).
Article CAS PubMed Google Scholar
Voelz, V. A., Bowman, G. R., Beauchamp, K. & Pande, V. S. Molecular simulation of ab initio protein folding for a millisecond folder NTL9(1–39). J. Am. Chem. Soc. 132, 1526–1528 (2010).
Article CAS PubMed PubMed Central Google Scholar
Held, M. & Noé, F. Calculating kinetics and pathways of protein–ligand association. Eur. J. Cell Biol. 91, 357–364 (2012).
Article CAS PubMed Google Scholar
Stanley, N., Esteban-Martín, S. & De Fabritiis, G. Kinetic modulation of a disordered protein domain by phosphorylation. Nat. Commun. 5, (2014).
Dutta, A., Bhattacharyya, S., Dutta, D. & Das, A. K. Structural elucidation of the binding site and mode of inhibition of Li(+) and Mg(2+) in inositol monophosphatase. FEBS J. 281, 5309–5324 (2014).
Article CAS PubMed Google Scholar
Strasser, F., Pelton, P. D. & Ganzhorn, A. J. Kinetic characterization of enzyme forms involved in metal ion activation and inhibition of myo-inositol monophosphatase. Biochem. J. 307 (Pt 2), 585–593 (1995).
Article CAS PubMed PubMed Central Google Scholar
Doerr, S. & De Fabritiis, G. On-the-Fly Learning and Sampling of Ligand Binding by High-Throughput Molecular Simulations. J. Chem. Theory Comput. doi: 10.1021/ct400919u (2014).
Thorne, M. R., Greasley, P. J. & Gore, M. G. Bovine inositol monophosphatase: enzyme-metal-ion interactions studied by pre-equilibrium fluorescence spectroscopy. Biochem. J. 315 (Pt 3), 989–994 (1996).
Article CAS PubMed PubMed Central Google Scholar
Ganzhorn, A. J. & Chanal, M. C. Kinetic studies with myo-inositol monophosphatase from bovine brain. Biochemistry (Mosc.) 29, 6065–6071 (1990).
Article CAS Google Scholar
Weikl, T. R. & Paul, F. Conformational selection in protein binding and function. Protein Sci. Publ. Protein Soc. 23, 1508–1518 (2014).
Article CAS Google Scholar
Boehr, D. D., Nussinov, R. & Wright, P. E. The role of dynamic conformational ensembles in biomolecular recognition. Nat Chem Biol 5, 789–796 (2009).
Article CAS PubMed PubMed Central Google Scholar
Leech, A. P., Baker, G. R., Shute, J. K., Cohen, M. A. & Gani, D. Chemical and kinetic mechanism of the inositol monophosphatase reaction and its inhibition by Li+ Eur. J. Biochem. FEBS 212, 693–704 (1993).
Article CAS Google Scholar
Bone, R. et al. Structural analysis of inositol monophosphatase complexes with substrates. Biochemistry (Mosc.) 33, 9460–9467 (1994).
Article CAS Google Scholar
Gee, N. S. et al. The purification and properties of myo-inositol monophosphatase from bovine brain. Biochem. J. 249, 883–889 (1988).
Article CAS PubMed PubMed Central Google Scholar
Kalyaanamoorthy, S. & Chen, Y.-P. P. Modelling and enhanced molecular dynamics to steer structure-based drug discovery. Prog. Biophys. Mol. Biol. 114, 123–136 (2014).
Article CAS PubMed Google Scholar
Hornak, V. et al. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins 65, 712–725 (2006).
Article CAS PubMed PubMed Central Google Scholar
Allnér, O., Nilsson, L. & Villa, A. Magnesium Ion–Water Coordination and Exchange in Biomolecular Simulations. J. Chem. Theory Comput. 8, 1493–1502 (2012).
Article CAS PubMed Google Scholar
O’Boyle, N. M. et al. Open Babel: An open chemical toolbox. J. Cheminformatics 3, 33 (2011).
Article CAS Google Scholar
Case, D. A. et al. The Amber biomolecular simulation programs. J. Comput. Chem. 26, 1668–1688 (2005).
Article CAS PubMed PubMed Central Google Scholar
Mark, P. & Nilsson, L. Structure and Dynamics of the TIP3P, SPC, and SPC/E Water Models at 298 K. J Phys Chem A 105, 9954–9960 (2001).
Article CAS Google Scholar
Fabritiis, G. D. The GPUGRID.org website. http://www.gpugrid.org/.
Feenstra, K. A., Hess, B. & Berendsen, H. J. C. Improving efficiency of large time-scale molecular dynamics simulations of hydrogen-rich systems. J. Comput. Chem. 20, 786–798 (1999).
Article CAS PubMed Google Scholar
Sadiq, S. K., Noé, F. & Fabritiis, G. D. Kinetic characterization of the critical step in HIV-1 protease maturation. Proc. Natl. Acad. Sci., doi: 10.1073/pnas.1210983109 (2012).
Pan, A. C. & Roux, B. Building Markov state models along pathways to determine free energies and rates of transitions. J. Chem. Phys. 129, 064107 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Pérez-Hernández, G., Paul, F., Giorgino, T., De Fabritiis, G. & Noé, F. Identification of slow molecular order parameters for Markov model construction. J. Chem. Phys. 139, 015102 (2013).
Article ADS CAS PubMed Google Scholar
Prinz, J.-H. et al. Markov models of molecular kinetics: Generation and validation. J. Chem. Phys. 134, 174105–174105–23 (2011).
Article ADS CAS PubMed Google Scholar
Singhal, N., Snow, C. D. & Pande, V. S. Using path sampling to build better Markovian state models: Predicting the folding rate and mechanism of a tryptophan zipper beta hairpin. J. Chem. Phys. 121, 415 (2004).
Article CAS ADS PubMed Google Scholar

Download references

Acknowledgements

NF acknowledges support from Generalitat de Catalunya (FI-Agaur). GDF acknowledges support from MINECO (BIO2014-53095-P) and FEDER.We also thank all the volunteers of GPUGRID who donated GPU computing time to the project.

Author information

Authors and Affiliations

Computational Biophysics Laboratory (GRIB-IMIM), Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Doctor Aiguader 88, Barcelona, 08003, Spain
Noelia Ferruz & Gianni De Fabritiis
Acellera, Barcelona Biomedical Research Park, C Dr Aiguader 88, 08003, Barcelona, Spain ,
Noelia Ferruz
Research Informatics, Janssen Research and Development, Janssen Cilag S A, Calle Jarama 75, Poligono Industrial, Toledo, 45007, Spain
Gary Tresadern
Centro de Investigación Príncipe Felipe, Valencia, 46012, Spain
Antonio Pineda-Lucena
Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, Barcelona, 08010, Spain
Gianni De Fabritiis

Authors

Noelia Ferruz
View author publications
You can also search for this author in PubMed Google Scholar
Gary Tresadern
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Pineda-Lucena
View author publications
You can also search for this author in PubMed Google Scholar
Gianni De Fabritiis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.F. built the systems, analysed the data and wrote the manuscript, G.T. designed the project, A.P.L. revised the manuscript and provided experimental guidance and G.D.F. wrote part of the software, supervised the project and revised the manuscript.

Corresponding author

Correspondence to Gianni De Fabritiis.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (PDF 435 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Ferruz, N., Tresadern, G., Pineda-Lucena, A. et al. Multibody cofactor and substrate molecular recognition in the myo-inositol monophosphatase enzyme. Sci Rep 6, 30275 (2016). https://doi.org/10.1038/srep30275

Download citation

Received: 24 February 2016
Accepted: 29 June 2016
Published: 21 July 2016
DOI: https://doi.org/10.1038/srep30275

This article is cited by

Metabolic profiling reveals glucose and fructose accumulation in gcr1 knock-out mutant of Arabidopsis
- Seung-A Baek
- Soon Kil Ahn
- Jae Kwang Kim
Applied Biological Chemistry (2019)
Dopamine D3 receptor antagonist reveals a cryptic pocket in aminergic GPCRs
- Noelia Ferruz
- Stefan Doerr
- Gianni De Fabritiis
Scientific Reports (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.