# NmrLineGuru: Standalone and User-Friendly GUIs for Fast 1D NMR Lineshape Simulation and Analysis of Multi-State Equilibrium Binding Models

## Abstract

The ability of high-resolution NMR spectroscopy to readout the response of molecular interactions at multiple atomic sites presents a unique capability to define thermodynamic equilibrium constants and kinetic rate constants for complex, multiple-step biological interactions. Nonetheless, the extraction of the relevant equilibrium binding and rate constants requires the appropriate analysis of not only a readout that follows the equilibrium concentrations of typical binding titration curves, but also the lineshapes of NMR spectra. To best take advantage of NMR data for characterizing molecular interactions, we developed NmrLineGuru, a software tool with a user-friendly graphical user interface (GUI) to model two-state, three-state, and four-state binding processes. Application of NmrLineGuru is through stand-alone GUIs, with no dependency on other software and no scripted input. NMR spectra can be fitted or simulated starting with user-specified input parameters and a chosen kinetic model. The ability to both simulate and fit NMR spectra provides the user the opportunity to not only determine the binding parameters that best reproduce the measured NMR spectra for the selected kinetic model, but to also query the possibility that alternative models agree with the data. NmrLineGuru is shown to provide an accurate, quantitative analysis of complex molecular interactions.

## Introduction

NMR is a powerful technique to study molecular structure, dynamics and kinetics without the need for chemical modification. Due to its atom-level resolution, it is particularly useful to study molecular interactions involving multi-state equilibrium and can be used to simultaneously extract thermodynamics and kinetics parameters of each binding step1,2,3,4,5. In contrast, the readout of other label-free techniques, such as isothermal titration calorimetry, contains averaged information of multiple binding steps and/or provides estimates of only thermodynamics parameters6,7.

To study binding interactions by NMR, a titration experiment is usually performed. At each titration point, an NMR-observable molecule (P) is mixed with its binding partner (L) at a certain ratio and 1D or 2D NMR spectra are recorded. The changes of chemical shifts, peak widths, and peak intensities observed during this process are then used to extract thermodynamics and kinetics parameters.

NMR titration data exhibit different behavior depending on the binding process and exchange regimes set by the relative values of the on/off kinetic rates and chemical shift frequencies of the various molecular species. For the fast-exchange regime, the observed chemical shifts are a weighted average of the two exchanging states and thus thermodynamic parameters can be extracted directly from the normalized chemical shifts as for any titration binding curve1,8. For the slow-exchange regime, the peak intensities of exchanging states are proportional to their population and thus can be used to extract thermodynamic parameters1,6. However, if the exchange falls in the intermediate regime or involves multiple binding steps or both thermodynamics and kinetics parameters are wanted, a lineshape analysis using the whole peak envelope is necessary.

Although lineshape analysis can extract both thermodynamics and kinetics parameters for multi-state equilibrium binding models and all types of exchange regimes, the calculation steps are more complicated by involving both the equilibrium binding constants and kinetics rate constants. This complication limits the wide application of NMR lineshape analysis method.

Many published works involving NMR lineshape analysis utilized home-written scripts which cannot be easily adopted2,3. There are also several published software packages for NMR lineshape analysis which are summarized in Table 1. All the listed NMR lineshape analysis software packages require MATLAB installation, which is a commercial computing environment with expensive licensing fees. In addition, the MATLAB language syntax changes in different versions and older versions (required for older software) are harder to obtain and use as the compatible operating systems become obsolete. In addition, none of the listed software packages is fully based on a graphical user interface (GUI) for both lineshape simulation and fitting; running the script-based simulation or fitting requires a certain level of programming knowledge for MATLAB.

Based on our previous work in NMR lineshape analysis utilizing multi-state equilibrium models3, we developed NmrLineGuru, a standalone and user-friendly NMR lineshape software containing six GUIs for simulating and fitting NMR lineshape with two-, three-, and four-state binding models. These GUIs have no dependency on other software; NmrLineGuru is developed in the MATLAB environment, but does not require separate installation of MATLAB. NmrLineGuru aims to be extremely user-friendly for non-experts and time-saving with the hope of promoting the application of NMR lineshape analysis in research.

## Results and Discussion

### Models and interface

NmrLineGuru aims to be extremely friendly for non-experienced users. It currently supports the most commonly used two-, three-, and four-state binding models. For each model, there are two single-window GUI applications, one for simulating and the other for fitting 1D NMR lineshape data. Figure 1 shows screenshots of the 2- and 3-state GUIs as examples (see Supporting Information Fig. S1 for the 4-state GUIs). The input fields on each GUI are grouped and arranged by logical order to facilitate use. Default parameters are filled in by example, and to enable one-click simulation or fitting.

### Overview of the workflow

The general workflow of NmrLineGuru is shown in Fig. 2. Users are expected to provide only the minimal amount of information in the single-window GUI and the GUI will handle the subsequent steps automatically.

The simulation GUIs take the default or user-provided thermodynamic, kinetic, and resonance parameters to generate 1D NMR lineshape data for the user-specified concentrations of each species in a given titration series according to the selected model. The generated data can contain arbitrary noise and any number of points as requested by the user. The results will be automatically plotted, displayed, and saved in various formats (png, eps, fig, and txt) according to user preference.

The fitting GUIs read in the user-provided lineshape data to search for the best-match thermodynamic and kinetic parameters. The input can be data from the simulation GUIs, experimental data from 1D NMR spectra, or 1D slices from 2D HSQC experiments. The input format (detailed in online tutorials) is simple two-column text files that can be generated or converted from most NMR-spectrum software packages. For the users’ convenience, a plugin for data export is provided and integrated in the NMRFAM9 distribution of Sparky (Goddard TD & Kneller DG, University of California, San Francisco), the most popular NMR spectra visualization tool9. After reading the lineshape data and initial parameters, the fitting GUIs automatically normalize the input data, perform a Lorentzian fit to estimate resonance- parameters, iteratively search the best-fit dynamic and kinetic parameters within the user-specified range of values, perform Monte Carlo error estimation, and output the results as plots and tables. Automatic global fitting will be performed if multiple datasets are found in the input data directory.

### Example 2-state simulations and fittings

The 2-state simulation GUI simulates the single-step binding process (P + L ↔ PL) with arbitrary parameters. Figure 3 shows example 2-state simulations using the prefilled default parameters (300 μM [Ptotal]; 0–900 μM [Ltotal]; 10 μM KD; NMR frequencies ω0 = 0, 500 s−1 for P and PL, respectively; all line widths dω = 100 s−1) with different koff values ranging from 0.01 to 100 × Δω (5–50000 s−1) to highlight the effects of exchange kinetics on NMR lineshape. For a 2-state binding system (free P and bound PL), the exchange rate is kex = kon [L] + koff, which is dominated by koff at low ligand concentration. As expected, the generated lineshapes show canonical slow-exchange behavior (Fig. 3b) when koff Δω and fast-exchange behavior (Fig. 3f) when koff Δω. For moderate koff values (0.1–10 ×  Δω), the resonances broaden to various extent and shows intermediate-slow or intermediate-fast behavior (Fig. 3c–e).

These simulated 2-state lineshape data (50 data points sampled on each line with random noise, and signal/noise ≈50; a typical signal/noise level of 50–100 was seen in our NMR titration studies, such as the 29 kD Syk tSH2 constructs3,7) are fitted with the 2-state fitting GUI (Fig. 3 and Table 2). In all cases, the GUI correctly determines the KD value. koff values are determined for the range of 0.1–10 ×  Δω, and cannot be determined for values ≥ 100 × Δω, which is above the fast-exchange limit (i.e. spectrum appearance becomes insensitive to the koff values above the limit).

### Example 3-state simulations and fittings

The 3-state simulation GUI can simulate any [L]-dependent binding followed by a [L]-independent step such as ligand isomerization or ligand-induced protein conformational change (Fig. 4a). Figure 4 displays example 3-state simulations (300 μM [Ptotal]; 0–900 μM [Ltotal]; 10 μM KD; 1 Keq; NMR frequencies ω0 = 0, 500, 1000 s−1 for P, PL and P′L, respectively; all line widths dω = 100 s−1) for fast or slow binding coupled with fast or slow isomerization (see Table 3 for koff and krev values). For simplification, intermediate exchange is not illustrated for either step. For this 3-state binding system (free P, bound PL and isomerized P′L), the isomerization step lacks dependency on ligand concentration so species PL and P′L form at a constant ratio defined by the equilibrium constant Keq = [PL]/[P′L]. Coupled with the [L]-dependent exchange between P and PL, this system has interesting behavior. When the isomerization step is slow exchange (Fig. 3c,e), PL and P′L resonances are separate peaks; the system can appear like 2-state exchange between P and PL or P′L depending on Keq. When the isomerization step is fast exchange (Fig. 3d,f), PL and P′L resonances show up as one peak; the system looks like 2-state exchange between P and the PL/P′L-degenerate peak. The behavior therefore can be deceptive and careful inspection of the spectra is needed when investigating the detailed binding mechanism1.

The combined fitting and simulation capability of NmrLineGuru enables facile consideration of different binding models. The simulated 3-state lineshape data (50 data points on each line with noise, signal/noise ≈50) were fitted with the 2- and 3-state fitting GUIs. The results are summarized in Table 3. As required, the 3-state model fits well with all the lineshape data (Supporting Information Fig. S2), and gives correct thermodynamic and kinetic parameters within 95% confidence intervals for most values. The difference in KD estimated from resonance in Fig. 4c and the actual value is less than a factor of two, which is typically considered to be within experimental error in practice. Interestingly, the 2-state model agrees well with the lineshape data when the isomerization is in fast exchange (Fig. 4d,f and Supporting Information Fig. S3). When both steps are in slow exchange, and the resonances for P and PL are selected for the 2-state fitting, ignoring the P′L resonance, the lineshapes are well fitted (Fig. 4c and Supporting Information Fig. S3). But for the case where ligand binding is in fast exchange and isomerization is slow (Fig. 4e), fitting is poor even though the P′L resonance is ignored and deviations in fitting the NMR lineshapes are sufficient to suspect an incorrect model. Nevertheless, even for the cases where the lineshapes are well fitted, the KD and koff values are inaccurate, especially when both the binding and isomerization steps are in slow exchange (Table 3). In part, the apparent good fit of the lineshapes is due to the normalization step imposed in the fitting algorithm.

### Example 4-state simulations and fittings

The 4-state simulation GUI can simulate any system with two independent or coupled binding sites and arbitrary binding affinities (Fig. 5a). Figure 5 shows example 4-state simulations for two independent binding sites with similar binding affinities (300 μM [Ptotal]; 0–900 μM [Ltotal]; 10 μM KD for all binding steps; NMR frequencies ω0 = 0, 400, 600, 1000 s−1 for P, PL, LP and LPL, respectively; all line widths dω = 100 s−1), and disparate kinetic rate constants (see Table 4). Lineshape simulations are only shown for the limiting conditions of fast or slow exchange. When binding to both sites is in slow exchange, the NMR lineshape shows four resonances for the free, two singly ligated (LP and PL) and doubly ligated (LPL) forms of the protein (Fig. 5c), with the peak intensity for P gradually decreasing, that for LP and PL increasing and then decreasing, and that for LPL gradually increasing, which directly reflects the population change shown in Fig. 5b. When one of the two sites is in fast exchange, the NMR lineshape is dominated by the rapid kinetics for binding either free protein or protein ligated at the alternate site, and the spectrum looks like two fast-exchange binding processes in that the line frequency appears to follow the equilibrium concentration; however, the peak intensity for free P decreases and that for the ligated form increases (Fig. 5d,e). When both sites are in fast exchange, the NMR lineshape is collapsed into a single resonance at all titration points and looks like a 2-state binding system (Fig. 5f).

These simulated 4-state lineshape data (50 data points on each line with noise, signal/noise ≈50) were fitted with the 2- or 4-state fitting GUI. The 2-state model cannot fit any of the lineshape data, not even those from Fig. 5f which look like 2-state exchange (Fig. 5e,f; Supporting Information Fig. S4). In contrast, the 4-state model fits well with all lineshape data (Supporting Information Fig. S5) and gives correct values for all KD parameters and most of the koff parameters (Table 4). There are two koff parameters which cannot be determined from the 4-state fitting due to reaching the fast-exchange limit (“UB” in Table 4).

### Experimental lineshape data for Syk tSH2 binding with N-IHP

The GUIs were used to examine data reported for the interaction between Syk tandem SH2 domain and tyrosyl phosphorylated peptides3. In previous work, the necessity and power of using NMR lineshape analysis to effectively extract dynamic and kinetic information was recognized3. NmrLineGuru is demonstrated here to analyze data on this interaction.

The Syk tandem SH2 domain fragment (tSH2) contains two SH2 domains and either domain can bind to a phosphotyrosine peptide, N-IHP [Ac-PD(pY)EPIRKG-NH2], with a sequence derived from the CD3ε chain of the T cell receptor. tSH2 was titrated with N-IHP and the binding process was monitored by 15N-1H HSQC experiments (Fig. 6a,b). Although the lineshapes look like 2-state exchange, it is demonstrated that the 2-state model cannot fit the data (Fig. 6c,d). The lineshapes of resonances from both domains (proton-dimension slices for G32, L37, H61, A74, F106, D175, G184, and G210; nitrogen-dimension slices for L52, Y73, L192, C205, and S244) are then globally fitted with the 4-state model assuming two independent binding sites and the 4-state model fits the data well (Fig. 6e,f). The fitted equilibrium dissociation constants are 730 ± 20 and 69 ± 4 µM and the fitted off rates are 1800 ± 300 and 380 ± 20 s−1 (mean ± SD of fitted values from two independent experiments), for the N- and C-terminal SH2 domains binding N-IHP, respectively.

### Comparison of the fitted parameters by IDAP, TITAN, and NmrLineGuru

The experimental lineshape data10 for E22A CsnN174 chitosanase binding with the chitosan hexamer substrate (GlcN)6 were used to compare the fitting quality of NmrLineGuru to two other packages, IDAP1,2 and TITAN12. Data from multiple peaks were globally fitted in all three cases. Example lineshapes and fittings by NmrLineGuru are shown in Fig. 7 and the fitted parameters are summarized in Table 5 (fitted values with IDAP and TITAN are from reference10).

The lineshape behavior shown in Fig. 7, especially that of W28, resembles the simulation in Fig. 4e whereby one peak shifts and decreases intensity, and another peak gradually appears. This behavior indicates that the underlying mechanism is fast ligand binding coupled with slow isomerization, consistent with the reported analysis10. Global fitting of the selected 1D spectral slices by NmrLineGuru gives similar thermodynamic and kinetic parameters to those from IDAP and TITAN; the differences shown in Table 5 are considered negligible. The off rate of the ligand binding step is too fast to be accurately determined, which for NmrLineGuru is reported as beyond fast-exchange limit.

## Conclusion

We describe six standalone and user-friendly GUIs for both simulating and fitting 1D NMR lineshape data using the common 2-, 3-, or 4-state binding models. Accuracy in fitting the NMR data was demonstrated with simulated and experimental data, including global fitting of multiple peak lineshapes. These GUIs require the minimal amount of user input and handle most of the workflow in an automatic way. Aiming for non-experienced users, these GUIs can help to promote the wide use of the NMR lineshape analysis method, which is powerful and unique in studying dynamics and kinetics for multi-state binding systems.

## Methods

### Code development

The GUIs are developed in MATLAB R2014a (The MathWorks, Inc.) and compiled into standalone applications for both Windows and Linux. The low-level 1D NMR lineshape data I/O APIs (including the Sparky plugin) are from IDAP1,2 (Integrative Data Analysis Platform, http://lineshapekin.net).

### NMR lineshape for an exchange system

The NMR transverse magnetization for an equilibrium system of N spins in chemical exchange is generally described by the matrix form of the Bloch-McConnell equations2,11. The spectrum intensity at angular frequency ω is given by the sum of real components of the following complex vector S:

$${\bf{S}}={{\boldsymbol{\Omega }}}^{-1}{\bf{P}}$$
(1)

where P is a complex vector of length N and Ω is a N × N complex matrix. The elements of P are the relative spin populations, pi, defined by the kinetic rate constants $${k}_{ij}^{\ast }$$

$${\rm{spin}}\,i\,\mathop{\leftarrow }\limits_{{k}_{ji}^{\ast }}\,\mathop{\to }\limits^{{k}_{ij}^{\ast }}{\rm{spin}}\,j$$
(2)

Ω is given by

$${\boldsymbol{\Omega }}={\bf{M}}-{\bf{K}}$$
(3)

M is a diagonal matrix with the following element Mii for spin i

$${M}_{ii}={R}_{2}^{i}+i2\pi (\omega -{\omega }_{0}^{i})$$
(4)

where $${R}_{2}^{i}$$ and $${\omega }_{0}^{i}$$ are the transverse relaxation rate and intrinsic resonance frequency of spin i. $${R}_{2}^{i}$$ and $${\omega }_{0}^{i}$$ are provided by user input for simulations, or obtained by fitting the given resonance peaks to a Lorentzian function (assuming the peak centers at ω0 with a line width 2R2), or searched as fitting parameters during fitting procedures. K is the N × N exchange matrix with the following elements from kinetic rate constants:

$${K}_{ij}={k}_{ji}^{\ast }(i\ne j)$$
(5)
$${K}_{ii}=-\mathop{\sum }\limits_{j\ne i}^{N}\,{k}_{ji}^{\ast }$$
(6)

### Theory for 2-state binding and lineshape analysis

The 2-state binding model contains only one binding step between P and L to form PL, assuming P and PL are the NMR-detectable species. The system contains one equilibrium dissociation constant, KD:

$${K}_{{\rm{D}}}=\frac{[{\rm{P}}][{\rm{L}}]}{[{\rm{PL}}]}$$
(7)

The total concentration of protein and ligand of each titration point is:

$$[{{\rm{P}}}_{{\rm{total}}}]=[{\rm{P}}]+[{\rm{PL}}]$$
(8)
$$[{{\rm{L}}}_{{\rm{total}}}]=[{\rm{L}}]+[{\rm{PL}}]$$
(9)

From Eqs (79), the relationship between [L], [Ptotal], [Ltotal], and KD can be solved as:

$${[{\rm{L}}]}^{2}+([{{\rm{P}}}_{{\rm{total}}}]-[{{\rm{L}}}_{{\rm{total}}}]+{K}_{{\rm{D}}})[{\rm{L}}]-{K}_{{\rm{D}}}[{{\rm{L}}}_{{\rm{total}}}]=0$$
(10)

With given values of [Ptotal], [Ltotal], and varying values for KD, the value of [L] is solved analytically. Once [L] is known, the concentrations of other species in the system are determined based on Eqs (79):

$$[{\rm{P}}]=\frac{{K}_{{\rm{D}}}[{{\rm{P}}}_{{\rm{total}}}]}{[{\rm{L}}]+{K}_{{\rm{D}}}}$$
(11)
$$[{\rm{PL}}]=\frac{[{\rm{L}}][{\rm{P}}]}{{K}_{{\rm{D}}}}$$
(12)

The NMR lineshape calculation requires an additional parameter to describe the 2-state system, the off rate koff. The corresponding on rate is:

$${k}_{{\rm{on}}}=\frac{{k}_{{\rm{off}}}}{{K}_{{\rm{D}}}}$$
(13)

The predicted lineshape for a nucleus at a given titration point is described in Eqs (16). The following matrices are now

$${\bf{P}}=[\begin{array}{c}{[{\rm{P}}]}\\{[{\rm{PL}}]}\end{array}]$$
(14)
$${\bf{K}}=[\begin{array}{cc}-{k}_{{\rm{on}}}[{\rm{L}}] & {k}_{{\rm{off}}}\\ {k}_{{\rm{on}}}[{\rm{L}}] & -{k}_{{\rm{off}}}\end{array}]$$
(15)

### Theory for the 3- state binding and lineshape

The 3-state binding model (Fig. 4a) contains an inter-molecular binding step between the free P and L to form PL, and an intra-molecular (concentration independent) isomerization step for PL to form P′L. These two steps are described by the equilibrium dissociation constant KD and equilibrium isomerization constant Keq:

$${K}_{{\rm{D}}}=\frac{[{\rm{P}}][{\rm{L}}]}{[{\rm{PL}}]}$$
(16)
$${K}_{{\rm{eq}}}=\frac{[{\rm{PL}}]}{[{\rm{P}}^{\prime} {\rm{L}}]}$$
(17)

The total concentration of protein and ligand of each titration point is expressed as

$$[{{\rm{P}}}_{{\rm{total}}}]=[{\rm{P}}]+[{\rm{PL}}]+[{\rm{P}}^{\prime} {\rm{L}}]$$
(18)
$$[{{\rm{L}}}_{{\rm{total}}}]=[{\rm{L}}]+[{\rm{PL}}]+[{\rm{P}}^{\prime} {\rm{L}}]$$
(19)

From Eqs (1619), the relationship between [L], [Ptotal], [Ltotal], KD, and Keq is

$$({K}_{{\rm{eq}}}+1){[{\rm{L}}]}^{2}+(({K}_{{\rm{eq}}}+1)[{{\rm{P}}}_{{\rm{total}}}]-({K}_{{\rm{eq}}}+1)[{{\rm{L}}}_{{\rm{total}}}]+{K}_{{\rm{D}}}{K}_{{\rm{eq}}})[{\rm{L}}]-{K}_{{\rm{D}}}{K}_{{\rm{eq}}}[{{\rm{L}}}_{{\rm{total}}}]=0$$
(20)

With given values of [Ptotal], [Ltotal], KD, Keq, the value of [L] can be solved analytically. Once [L] is known, the concentrations of other species in the system could be determined as follows based on Eqs (1619):

$$[{\rm{P}}]=\frac{{K}_{{\rm{D}}}\,{K}_{{\rm{eq}}}\,[{{\rm{P}}}_{\mathrm{total}}]}{({K}_{{\rm{eq}}}+1)[{\rm{L}}]+{K}_{{\rm{D}}}\,{K}_{{\rm{eq}}}}$$
(21)
$$[{\rm{PL}}]=\frac{[{\rm{P}}][{\rm{L}}]}{{K}_{{\rm{D}}}}$$
(22)
$$[{\rm{P}}^{\prime} {\rm{L}}]=\frac{[{\rm{PL}}]}{{K}_{{\rm{eq}}}}$$
(23)

The lineshape analysis requires two more parameters to describe the 3-state system: the off rate for the binding process (koff) and the reverse rate for the isomerization process (krev). The corresponding on rate and forward rate are:

$${k}_{{\rm{on}}}=\frac{{k}_{{\rm{off}}}}{{K}_{{\rm{D}}}}$$
(24)
$${k}_{{\rm{fwd}}}=\frac{{k}_{{\rm{rev}}}}{{K}_{{\rm{eq}}}}$$
(25)

The predicted lineshape for a nucleus at a given titration point is described in Eqs (16). The following matrices are now

$${\bf{P}}=[\begin{array}{c}{[{\rm{P}}]}\\{[{\rm{PL}}]}\\{[{\rm{P}}^{\prime} {\rm{L}}]}\end{array}]$$
(26)
$${\bf{K}}=[\begin{array}{ccc}-{K}_{{\rm{on}}}[{\rm{L}}] & {k}_{{\rm{off}}} & 0\\ {K}_{{\rm{on}}}[{\rm{L}}] & -{k}_{{\rm{off}}}-{k}_{{\rm{fwd}}} & {k}_{{\rm{rev}}}\\ 0 & {k}_{{\rm{fwd}}} & -{k}_{{\rm{rev}}}\end{array}]$$
(27)

### Theory for the 4- state binding and lineshape

The 4-state binding model (Fig. 5a) contains four binding steps described by four equilibrium dissociation constants:

$${K}_{{\rm{D}}}^{{\rm{A}}1}=\frac{[{\rm{P}}][{\rm{L}}]}{[{\rm{PL}}]}$$
(28)
$${K}_{{\rm{D}}}^{{\rm{B}}1}=\frac{[{\rm{P}}][{\rm{L}}]}{[{\rm{LP}}]}$$
(29)
$${K}_{{\rm{D}}}^{{\rm{A}}2}=\frac{[{\rm{LP}}][{\rm{L}}]}{[{\rm{LPL}}]}$$
(30)
$${K}_{{\rm{D}}}^{{\rm{B}}2}=\frac{{K}_{{\rm{D}}}^{{\rm{B}}1}{K}_{{\rm{D}}}^{{\rm{A}}2}}{{K}_{{\rm{D}}}^{{\rm{A}}1}}$$
(31)

Note that only the first three equilibrium dissociation constants are independent parameters while the fourth becomes a function of the first three. The total concentration of protein and ligand of each titration point is expressed as

$$[{{\rm{P}}}_{{\rm{total}}}]=[{\rm{P}}]+[{\rm{PL}}]+[{\rm{LP}}]+[{\rm{LPL}}]$$
(32)
$$[{{\rm{L}}}_{{\rm{total}}}]=[{\rm{L}}]+[{\rm{PL}}]+[{\rm{LP}}]+2[{\rm{LPL}}]$$
(33)

From Eqs (2833), the relationship between [L], [Ptotal], [Ltotal], $${K}_{{\rm{D}}}^{{\rm{A}}1}$$, $${K}_{{\rm{D}}}^{{\rm{B}}1}$$ and $${K}_{{\rm{D}}}^{{\rm{A}}2}$$ is

$$\begin{array}{c}{K}_{{\rm{D}}}^{{\rm{A}}1}\,{[{\rm{L}}]}^{3}+(2{K}_{{\rm{D}}}^{{\rm{A}}1}[{{\rm{P}}}_{{\rm{total}}}]-{K}_{{\rm{D}}}^{{\rm{A}}1}[{{\rm{L}}}_{{\rm{total}}}]+{K}_{{\rm{D}}}^{{\rm{A}}2}{K}_{{\rm{D}}}^{{\rm{B}}1}+{K}_{{\rm{D}}}^{{\rm{A}}1}{K}_{{\rm{D}}}^{{\rm{A}}2}){[{\rm{L}}]}^{2}+(({K}_{{\rm{D}}}^{{\rm{A}}2}{K}_{{\rm{D}}}^{{\rm{B}}1}+{K}_{{\rm{D}}}^{{\rm{A}}1}{K}_{{\rm{D}}}^{{\rm{A}}2})[{{\rm{P}}}_{{\rm{total}}}]\\ -({K}_{{\rm{D}}}^{{\rm{A}}2}{K}_{{\rm{D}}}^{{\rm{B}}1}+{K}_{{\rm{D}}}^{{\rm{A}}1}{K}_{{\rm{D}}}^{{\rm{A}}2})[{{\rm{L}}}_{{\rm{total}}}]+{K}_{{\rm{D}}}^{{\rm{A}}1}{K}_{{\rm{D}}}^{{\rm{A}}2}{K}_{{\rm{D}}}^{{\rm{B}}1})[{\rm{L}}]-{K}_{{\rm{D}}}^{{\rm{A}}1}{K}_{{\rm{D}}}^{{\rm{A}}2}{K}_{{\rm{D}}}^{{\rm{B}}1}[{{\rm{L}}}_{{\rm{total}}}]=0\end{array}$$
(34)

With given values of [Ptotal], [Ltotal], $${K}_{{\rm{D}}}^{{\rm{A}}1}$$, $${K}_{{\rm{D}}}^{{\rm{B}}1}$$ and $${K}_{{\rm{D}}}^{{\rm{A}}2}$$, the value of [L] can be solved analytically with symbolic linear algebra software, such as Maxima (a Computer Algebra System, version 5.42.2, http://maxima.sourceforge.net). Once [L] is known, the concentrations of other species in the system are known from Eqs (2833):

$$[{\rm{P}}]=\frac{\,{K}_{{\rm{D}}}^{{\rm{A}}1}\,{K}_{{\rm{D}}}^{{\rm{A}}2}\,{K}_{{\rm{D}}}^{{\rm{B}}1}[{{\rm{P}}}_{{\rm{total}}}]}{{K}_{{\rm{D}}}^{{\rm{A}}1}{[{\rm{L}}]}^{2}+({K}_{{\rm{D}}}^{{\rm{A}}2}\,{K}_{{\rm{D}}}^{{\rm{B}}1}+\,{K}_{{\rm{D}}}^{{\rm{A}}1}\,{K}_{{\rm{D}}}^{{\rm{A}}2})[{\rm{L}}]+{K}_{{\rm{D}}}^{{\rm{A}}1}\,{K}_{{\rm{D}}}^{{\rm{A}}2}\,{K}_{{\rm{D}}}^{{\rm{B}}1}}$$
(35)
$$[{\rm{PL}}]=\frac{[{\rm{P}}][{\rm{L}}]}{{K}_{{\rm{D}}}^{{\rm{A}}1}}$$
(36)
$$[{\rm{LP}}]=\frac{[{\rm{P}}][{\rm{L}}]}{{K}_{{\rm{D}}}^{{\rm{B}}1}}$$
(37)
$$[{\rm{LPL}}]=\frac{[{\rm{P}}]{[{\rm{L}}]}^{2}}{{K}_{{\rm{D}}}^{{\rm{B}}1}{K}_{{\rm{D}}}^{{\rm{A}}2}}$$
(38)

The lineshape analysis requires four more parameters to describe the 4-state system: the off rates for the four binding process: $${k}_{{\rm{off}}}^{{\rm{A1}}}$$, $${k}_{{\rm{off}}}^{B1}$$, $${k}_{{\rm{off}}}^{A2}$$, and $${k}_{{\rm{off}}}^{{\rm{B2}}}$$. The corresponding on rate is

$${k}_{{\rm{on}}}^{i}=\frac{{k}_{{\rm{off}}}^{i}}{{K}_{D}^{i}}$$
(39)

where i denotes any of the four binding steps. The predicted lineshape for a nucleus at a given titration point is described in Eqs (16). The following matrices are now

$${\bf{P}}=[\begin{array}{c}{[{\rm{P}}]}\\{[{\rm{PL}}]}\\ {[{\rm{LP}}]}\\{[{\rm{LPL}}]}\end{array}]$$
(40)
$${\bf{K}}=[\begin{array}{cccc}-({k}_{{\rm{o}}{\rm{n}}}^{{\rm{A}}1}+{k}_{{\rm{o}}{\rm{n}}}^{{\rm{B}}1})[{\rm{L}}] & {k}_{{\rm{o}}{\rm{f}}{\rm{f}}}^{{\rm{A}}1} & {k}_{{\rm{o}}{\rm{f}}{\rm{f}}}^{{\rm{B}}1} & 0\\ {k}_{{\rm{o}}{\rm{n}}}^{{\rm{A}}1}[{\rm{L}}] & -{k}_{{\rm{o}}{\rm{f}}{\rm{f}}}^{A1}-{k}_{{\rm{o}}{\rm{n}}}^{{\rm{B}}2}[{\rm{L}}] & 0 & {k}_{{\rm{o}}{\rm{f}}{\rm{f}}}^{{\rm{B}}2}\\ {k}_{{\rm{o}}{\rm{n}}}^{B1}[{\rm{L}}] & 0 & -{k}_{{\rm{o}}{\rm{f}}{\rm{f}}}^{{\rm{B}}1}-{k}_{{\rm{o}}{\rm{n}}}^{{\rm{A}}2}[{\rm{L}}] & {k}_{{\rm{o}}{\rm{f}}{\rm{f}}}^{A2}\\ 0 & {k}_{{\rm{o}}{\rm{n}}}^{{\rm{B}}2}[{\rm{L}}] & {k}_{{\rm{o}}{\rm{n}}}^{{\rm{A}}2}[{\rm{L}}] & -{k}_{{\rm{o}}{\rm{f}}{\rm{f}}}^{B2}-{k}_{{\rm{o}}{\rm{f}}{\rm{f}}}^{{\rm{A}}2}\end{array}]$$
(41)

### Fitting-specific algorithms

The fitting GUIs search the best-fit values of unknown thermodynamic and kinetic parameters based on the input lineshape data. The unknown parameters (e.g. KD, koff) are assigned to the initial values from user selection and then iteratively refined by an optimization algorithm to match the experimental data, such that the following target function (sum of squared errors between the experimentally determined and the predicted lineshape data) is minimized:

$$\sum _{i,j,k}{\{[{S}_{i,j}^{pred}({\omega }_{k}){a}_{i}+{b}_{i}]-{S}_{i,j}^{expt}({\omega }_{k})\}}^{2}$$
(42)

where $${S}_{i,j}^{pred}({\omega }_{k})$$ and $${S}_{i,j}^{expt}({\omega }_{k})\,$$are the predicted and experimental spectrum intensity of resonance i at titration point j at frequency $${\omega }_{k}$$, respectively; ai and bi are the intensity and baseline correction factors for resonance i to compensate the normalization errors, if any, introduced during the normalization process of the experimental lineshape data.

If selected by the user in the interface, fitting errors of the best-fit values are determined by Monte Carlo resampling, in which a random noise is generated based on the spectrum noise level input with the experimental data and added to each of the input lineshape data points. The resampling and fitting procedure is repeated a minimum of 50 times and until convergence to determine the 95% confidence intervals of the fitted parameters. To determine experimental errors, multiple experiments should be performed and processed independently. The variance from experimental errors is generally larger than that from fitting errors3. If multiple experimental datasets are available, results from their independent fitting should be used to determine uncertainty for the fitted parameters.

## Data availability

The datasets generated during and/or analyzed during the current study are available in the online tutorials from NmrLineGuru GitHub wiki (https://github.com/stonefonly/NmrLineGuru/wiki) or from the corresponding author.

## Code availability

The compiled GUIs and related documents are available from NmrLineGuru GitHub repo (https://github.com/stonefonly/NmrLineGuru). The software has been pre-installed in NMRbox (https://www.nmrbox.org). For academic use, this work should be cited. For commercial use or source code request, please contact us directly.

## References

1. 1.

Kovrigin, E. L. NMR line shapes and multi-state binding equilibria. J. Biomol. NMR 53, 257–270 (2012).

2. 2.

Greenwood, A. I. et al. Complete determination of the Pin1 catalytic domain thermodynamic cycle by NMR lineshape analysis. J. Biomol. NMR 51, 21–34 (2011).

3. 3.

Feng, C. & Post, C. B. Insights into the allosteric regulation of Syk association with receptor ITAM, a multi-state equilibrium. Phys. Chem. Chem. Phys. PCCP 18, 5807–5818 (2016).

4. 4.

Furukawa, A., Konuma, T., Yanaka, S. & Sugase, K. Quantitative analysis of protein–ligand interactions by NMR. Prog. Nucl. Magn. Reson. Spectrosc. 96, 47–57 (2016).

5. 5.

Fielding, L. NMR methods for the determination of protein-ligand dissociation constants. Curr. Top. Med. Chem. 3, 39–53 (2003).

6. 6.

Wiseman, T., Williston, S., Brandts, J. F. & Lin, L. N. Rapid measurement of binding constants and heats of binding using a new titration calorimeter. Anal. Biochem. 179, 131–137 (1989).

7. 7.

Feng, C., Roy, A. & Post, C. B. Entropic allostery dominates the phosphorylation-dependent regulation of Syk tyrosine kinase release from immunoreceptor tyrosine-based activation motifs: Entropic Allostery Dominates Syk Release. Protein Sci. 27, 1780–1796 (2018).

8. 8.

Williamson, M. P. Using chemical shift perturbation to characterise ligand binding. Prog. Nucl. Magn. Reson. Spectrosc. 73, 1–16 (2013).

9. 9.

Lee, W., Tonelli, M. & Markley, J. L. NMRFAM-SPARKY: enhanced software for biomolecular NMR spectroscopy. Bioinforma. Oxf. Engl. 31, 1325–1327 (2015).

10. 10.

Shinya, S. et al. NMR line shape analysis of a multi-state ligand binding mechanism in chitosanase. J. Biomol. NMR 67, 309–319 (2017).

11. 11.

Post, C. B. Characterization of enzyme-complex formation by analysis of nuclear magnetic resonance line shapes. Methods Enzymol. 240, 438–446 (1994).

12. 12.

Waudby, C. A., Ramos, A., Cabrita, L. D. & Christodoulou, J. Two-Dimensional NMR Lineshape Analysis. Sci. Rep. 6 (2016).

13. 13.

Kovrigin, E. L. & Loria, J. P. Enzyme dynamics along the reaction coordinate: critical role of a conserved residue. Biochemistry 45, 2636–2647 (2006).

14. 14.

Günther, U. L. & Schaffhausen, B. NMRKIN: simulating line shapes from two-dimensional spectra of proteins upon ligand binding. J. Biomol. NMR 22, 201–209 (2002).

## Acknowledgements

This work was supported by NIH R01GM039478.

## Author information

Authors

### Contributions

C.F. developed the GUIs and fitting/simulation code, contributed to the research design and wrote the manuscript. E.K. developed 1D NMR lineshape data APIs (including the Sparky plugin) and contributed to the written manuscript. C.B.P. contributed to the research design and the written manuscript.

### Corresponding author

Correspondence to Carol Beth Post.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Feng, C., Kovrigin, E.L. & Post, C.B. NmrLineGuru: Standalone and User-Friendly GUIs for Fast 1D NMR Lineshape Simulation and Analysis of Multi-State Equilibrium Binding Models. Sci Rep 9, 16023 (2019). https://doi.org/10.1038/s41598-019-52451-8

• Accepted:

• Published: