Introduction

One protein binds a few molecules out of the thousands of different types it encounters in a cell and the properties of those physical interactions are key determinants in biological processes. There are well-established in vitro biophysical methods to address protein–ligand interactions and every technique has advantages and disadvantages for the study of a particular biological system1. Some of the established technologies to determine binding affinities are isothermal titration calorimetry (ITC), surface plasmon resonance (SPR), microscale thermophoresis (MST)2,3 and fluorescence-based assays among others4,5.

Particularly, differential scanning fluorimetry (DSF), also termed fluorescent thermal shift assay (FTSA), a methodology commonly applied to investigate the stability of proteins in different environments6,7,8,9, is often used to probe for the interaction between compounds and proteins in drug discovery and fragment screening10,11,12,13,14,15,16. Typically, the ligand binding affinity is presented by showing the \({T}_{m}\)-shift as a function of ligand concentration17. However, quantifying the corresponding binding constant (\({K}_{d}\)) from the unfolding curves at varying ligand concentrations and temperatures retrieves only an approximated value known as “apparent \({K}_{d}\)” that does not take into account the temperature dependence of the binding constant.

The isothermal approach instead uses the fraction of protein that is folded at a temperature range near the unfolding transition assuming a constant temperature, where protein folding/unfolding and the ligand binding equilibria are coupled16. However, often proteins do not unfold in a reversible two-state manner, and therefore the equilibrium thermodynamics models do not apply.

Here, we evaluate the isothermal strategy and improve the analysis of DSF data. We assess the isothermal approach with different biological model systems by comparing the binding affinities to the ones determined by either isothermal titration calorimetry (ITC) or microscale thermophoresis (MST). We also compare binding affinities determined by isothermal analysis to “apparent” binding affinities obtained by fitting a single-site model to the nDSF melting temperatures17. In this manuscript, we describe a method to optionally fit the heat capacity change upon unfolding (\(\Delta {C}_{p}\)) during the isothermal analysis and show that this approach provides more accurate results to assuming a value of zero for a set of simulated cases. The whole methodology is made available as a web server including the latter experimental feature.

Results

“Apparent” binding affinities from melting temperatures

A common approach to compare binding affinities of different ligands is the determination of a so-called “apparent \({K}_{D}\)” by fitting a simple 1:1 binding model to the melting temperatures versus ligand concentrations17:

$${T}_{m}([L{]}_{0})={T}_{m, unbound}+{(T}_{m, upper}-{T}_{m, unbound})*(1-a([L{]}_{0}))$$
(1)
$$\mathrm{a}({[L]}_{0})=\left({\left[P\right]}_{0}-{K}_{d}-{\left[L\right]}_{0} +\sqrt{({\left[P\right]}_{0}+{\left[L\right]}_{0}+ {K}_{d})2-4{\left[P\right]}_{0}{\left[L\right]}_{0} )}\right)/ (2 * {[P]}_{0})$$
(2)

In this 0th order approximation, \({T}_{m}\) values are fitted directly versus a complexation ratio. It is important to stress that this approach has no physico-chemical foundation. In the following, this will be referred to as model 1. Since a \({K}_{d}\) is defined for a certain temperature, this approach is thermodynamically not correct. Furthermore, this model assumes the Tm to converge to a well-defined melting temperature Tm,upper, which is incorrect11,12,18. Despite its shortcomings, its simplicity made this approach popular.

A thermodynamic-based approach is to use the following expression18,19:

$$\frac{{\Delta T}_{m}({\left[L\right]}_{0})}{{T}_{m}({\left[L\right]}_{0})}=\frac{{\Delta T}_{m}\left({\left[L\right]}_{0}\right)- {T}_{m0}}{{T}_{m}({\left[L\right]}_{0})}= \frac{R*{T}_{m0}}{{\Delta H}_{{T}_{m0}}}*\mathrm{ln}\left(1+\frac{{\left[L\right]}_{0}}{{K}_{d}}\right)$$
(3)

Here Tm([L]0) is the melting temperature of the protein in presence of the initial ligand concentration [L]0. Tm0 is the protein melting temperature in absence of the ligand. ΔHTm0 is the unfolding enthalpy of the protein in absence of the ligand at Tm0 and Kd the apparent dissociation constant. Rearranging this equation yields an expression for Tm([L]0) that we have used to fit binding affinities directly from the observed melting temperatures:

$${T}_{m}\left({\left[L\right]}_{0}\right)= \frac{{T}_{m0}}{1- \frac{R*{T}_{m0}}{{\Delta H}_{{T}_{m0}}}*\mathrm{ln}\left(1+\frac{{\left[L\right]}_{0}}{{K}_{d}}\right)}$$
(4)

In the following this approach will be referred to as model 2. It is important to note that this model approximates the free ligand concentration [L] to be equal to the initial ligand concentration [L]0 which can lead to deviations for low ligand concentrations and/or high binding affinities (low Kd).

Isothermal approach and fitting of \(\Delta {C}_{p}\) values

Similar to model 2 [Eq. (4)], isothermal analysis of differential scanning fluorimetry (DSF) data is based on a thermodynamic model16. However, the Kd is determined at a chosen fixed temperature by fitting the fraction unfolded versus the initial ligand concentration [L]0, instead of fitting the melting temperature Tm (i.e. at a constant fraction unfolded fu = 0.5) versus the initial ligand concentration [L]0 as assumed in model 2. The isothermal method involves several steps that are shown in Fig. 1. In all the examples shown here, we use a reversible 2-state unfolding model between folded (\(F\)) and unfolded protein (\(U\)) and a 1:1 binding model for the complex (\(FL\)) formation. Ligand binding of the unfolded species is not taken into account. Combining the two equilibria results in the following equation:

$$ {\text{FL}}\overset {K_{{\text{d}}} } \rightleftharpoons {\text{F}} + {\text{L}}\overset {K_{{\text{U}}} } \rightleftharpoons {\text{U}} + {\text{L}} $$
(5)

where \({K}_{D}\) is the dissociation constant and \({K}_{U}\) the unfolding equilibrium constant:

$${K}_{d}=\frac{[F][L]}{[FL]}$$
(6)
$${K}_{U}=\frac{[U]}{[F]}$$
(7)

where squared brackets denote equilibrium concentrations. During the analysis, it is assumed that DSF can only distinguish between folded (sum of \(F\) and \(FL\)) and unfolded species (\(U\)). The fraction unfolded is a central quantity for the determination of a binding constant or for the simulation of fluorescence melting curves specified in the following section (cf. Equation 14). It is the concentration ratio of unfolded protein and the total protein [P]0.

$${f}_{u}=\frac{[U]}{[P{]}_{0}}=\frac{ [U]}{[U]+[F]+[FL]}$$
(8)
Figure 1
figure 1

Flow chart of isothermal analysis of nDSF binding study. After selecting the signal type (F330, F350 or Ratio) and the spectral window, a local fit of the thermal curves yields starting values for the subsequent global fit. This can be either done with a fixed ∆Cp or with ∆Cp as fitted variable. The latter option is still an experimental feature. The next step is the calculation of the fraction unfolded fu for selected temperatures. Fitting these fu with a suitable binding model then yields a binding affinity for each selected temperature. A detailed description of the isothermal fitting routine can be found in Supplementary Information S10.

One obstacle in the application of isothermal analysis so far is the necessity to know the change of heat capacity upon unfolding \(\Delta {C}_{p}\) prior to the analysis. This quantity is needed for fitting the DSF melting curves and influences the Gibbs energy of unfolding \(\Delta {G}_{U}\). At temperatures \(T\) close to the melting temperature \({T}_{m}\), \(\Delta {G}_{U}\) (in absence of ligand) can be expressed as:

$$\Delta {G}_{U}(T)=\Delta {H}_{{T}_{m}}+\Delta {C}_{P}(T-{T}_{m})-T(\Delta {S}_{{T}_{m}}+\Delta {C}_{P} ln(\frac{T}{{T}_{m}}) )$$
(9)

\(\Delta {C}_{p}\) can experimentally be determined by differential scanning calorimetry (DSC). This method can be sample consuming and challenging as far as data analysis is concerned. Thus it is common practice to assume a \(\Delta {C}_{p}\) of zero when it comes to fitting differential scanning fluorimetry curves. It would be desirable to obtain the \(\Delta {C}_{p}\) “on-the-fly” during the isothermal analysis approach without the need for a priori knowledge or additional experiments. Therefore, we have complemented the common approach to specify a \(\Delta {C}_{p}\) with a global fit that optimizes this quantity as a variable during the fitting process of the thermal unfolding curves (cf. Fig. 1). It is important to note, that this approach so far is an experimental feature and still needs to be carefully assessed case-to-case. To further scrutinize isothermal analysis and in particular the fitting of \(\Delta {C}_{p}\) presented here, we will present extensive simulations in the next section.

Simulations

To assess the robustness of the presented method including the fitting of \(\Delta {C}_{p}\) values, data were simulated with different combinations of binding affinities \({K}_{D}\) ranging from \({10}^{-10}\) M to \({10}^{-1}\) M, protein concentrations ranging from \({10}^{-10}\) M to \({10}^{-1}\) M and heat capacity changes upon unfolding ranging from 0 to 12 kcal/mol K (0, 4, 8 and 12 kcal/mol K) using the following equations. The equations shown here are based on previous studies by Matulis et al.11 and Bai et al.16. The steps for preparing the data are:

  1. 1.

    Generate the temperature dependent Gibbs energy of unfolding \({G}_{U}\left(T\right)\) with a melting temperature in the absence of ligand \({T}_{m}\), the unfolding enthalpy (at \({T}_{m}\)) \(\Delta {H}_{U,{T}_{m}}\) and the change in heat capacity \(\Delta {C}_{p,U}\) using the Eq. (9).

  2. 2.

    The total Gibbs energy \(\Delta G\) is a sum of a term for unfolding \(\Delta {G}_{U}\left(T\right)\) and a term for dissociation of the ligand \(\Delta {G}_{d}\)11. The dissociation constant \({K}_{d}\) for the simulated data is assumed to be temperature independent for simplicity. This needs to be done for each initial ligand concentration [L]0.

    $$\Delta {G}_{U}(T,[L{]}_{0})=\Delta {G}_{U}(T)+RT*ln (1+\frac{[L{]}_{Free}}{{K}_{d}})$$
    (10)

    Here, R is the gas constant in kcal/mol and [L]free is the unbound ligand concentration that can be calculated with the following equation for each.

    $$[L]_{free}=\frac{ [L{]}_{0}-[P{]}_{0}/({K}_{u}+1) -{K}_{d} }{2} +\sqrt{\frac{{([L{]}_{0}+[P{]}_{0}/({K}_{u}+1)+{K}_{d})}^{2}}{4}-\frac{[L{]}_{0}[P{]}_{0}}{{K}_{U}+1}}$$
    (11)
  3. 3.

    Calculate the apparent unfolding equilibrium constant \({K}_{U}{^{\prime}}\left(T\right)\) using Eqs. (10) and (11) for each [L]0.

    $$K{^{\prime}}_{U}(T)={e}^{-\Delta G(T) / RT}$$
    (12)
  4. 4.

    Calculate the fraction unfolded \({f}_{u}\left(T\right)\) (cf. Equation 8)

    $${f}_{u}\left(T\right)= \frac{K{^{\prime}}_{U}(T) }{K{^{\prime}}_{U}\left(T\right)+1}$$
    (13)
  5. 5.

    fu(T) can then be used to model the fluorescence signal \(Y\left(T\right)\)

    $$Y(T)={f}_{u}(T)*({m}_{u}T+{b}_{u})+(1- {f}_{u}(T))*({m}_{f}T+{b}_{f})$$
    (14)

    Here \({m}_{u}\), \({m}_{f}\) are the slopes of and \({b}_{u}\), \({b}_{f}\) the intercepts of the unfolded/folded state. For simplicity, the signal of the folded bound state is assumed to be equal to the signal of the folded unbound state.

  6. 6.

    Add noise ranging from 0 to 10% of the dynamic range of the data (0, 1, 2, 5 and 10%).

This yields in total 1500 virtual binding studies that were analyzed by the same approach as the real experimental data shown in this manuscript (cf. Fig. 1). To investigate the effect of \(\Delta {C}_{p}\) fitting, we have done data analysis with and without this feature. The data was grouped according to Kd, \(\Delta {C}_{p}\), noise level and whether or not \(\Delta {C}_{p}\) was used (each combination contains 100 virtual binding studies). For each of these groups the Kd error was determined by dividing the fitted Kd by the true Kd and color coded as shown in Fig. 2 for \(\Delta {C}_{p}=8\) kcal/molK and a noise level of 2%, which is a realistic noise level observed for the experimental spectra shown in this work (cf. Fig. S1). All other results are shown in the supplementary information (Figs. S1–S23).

Figure 2
figure 2

Kd fits for simulated data with ∆Cp = 8 kcal/mol K and 2% noise. Kd deviations \(\left( {\frac{{K_{{\text{d,fitted}}} }}{{K_{{\text{d,true}}} }}} \right)\) are color coded with green corresponding to the initial value. Datesets outside the range are shown as white circles. Fitting the ∆Cp during the thermal curve fitting process results in better agreement with the initial Kd values (B) compared to fits with a fixed ∆Cp = 0 kcal/mol K (A). The fitted ∆Cp values are shown in Supplementary Information S22S23.

In the investigated cases, the \(\Delta {C}_{p}\) could be fitted accurately (Figs. S22/S23) as long as the noise level did not exceed an unusual threshold (ca. 5%). This held true even for high \(\Delta {C}_{p}\) values, with the start value for the optimization being always 0. Even for a \(\Delta {C}_{p}=12\) kcal/molK with a noise level of 2% the fitting could recover the \(\Delta {C}_{p}=12\) kcal/molK accurately (Fig. S23). The simulations underline the recommendation for choosing a concentration smaller than the Kd20, if experimentally possible, for an accurate Kd determination since the Kd deviations are in general higher above the diagonal (dashed line in Fig. 2). The lower limit of protein concentration is set by the fluorescence signal or the signal-to-noise ratio, which depends on the number of chromophores and their sensitivity to protein unfolding. If a \(\Delta {C}_{p}\) of 0 is assumed with a true \(\Delta {C}_{p}\ne 0,\) the lowest Kd deviations are obtained when the temperature chosen for isothermal analysis Tiso is close to the melting temperature Tm.

Proof of principle studies

In the following section, we will present nDSF binding studies with different protein systems and small molecule binding partners. For three systems, the binding affinities determined by isothermal analysis of nDSF data were compared to ITC or MST determined Kd values. In addition to that, we determined apparent binding affinities \({K}_{d,app}\) using model 1 [Eq. (1)] and model 2 [Eq. (4)], to compare these values to the Kd values determined by isothermal analysis. The results of these proof-of-principle studies are summarized in Table 2.

One open question is which \(\Delta {C}_{p}\) value to use for isothermal analysis. Fitting the \(\Delta {C}_{p}\) has proven to work in our simulations but is still an experimental feature until assessed with experimental values. In the supporting information we show the DSC data for Pcs60, one protein that was used to test isothermal analysis of nDSF data (Fig. S24). In this case, we were not able to determine the \(\Delta {C}_{p}\) from a simple DSC experiment due to overlapping transitions. The first of these transitions corresponds to the unfolding observed in nDSF experiments. In the provided example, to obtain an experimental \(\Delta {C}_{p}\) would require a measurement series with different pH or denaturant concentrations as the system does not show an ideal two-state unfolding equilibrium. Here we suggest that when \(\Delta {C}_{p}\) cannot be determined experimentally, it can be approximated computationally from the surface accessible area changes upon unfolding21,22 or from the number of residues \({N}_{res}\) of the protein21:

$$\Delta {C}_{P}=13.88*{N}_{res }\frac{cal}{mol \, K}$$
(15)

In Table 1, we compared the estimated \(\Delta {C}_{p}\) values for the different systems with the fitted values. The fitted value were in each case closer to the estimated value than 0 kcal/mol K. Due to the lack of experimental \(\Delta {C}_{p}\) values and the fact that the \(\Delta {C}_{p}\) fitting is still an experimental feature that still needs to be scrutinized with experimental values, we have decided to present all isothermal analysis shown in the main text with a \(\Delta {C}_{p}\) set to zero. The results are summarized in Table 2. As a comparison, we show all the analysis results with the fitted \(\Delta {C}_{p}\) values listed in Table 1 in the supplementary information (Figs. S25–S28). Our results indicate that fitting \(\Delta {C}_{p}\) provides more accurate Kds than assuming a value of zero.

Table 1 Estimated and fitted ∆Cp values for the systems presented in this work. The ∆Cp was estimated using Eq. (15).
Table 2 Proof-of-principle studies of isothermal analysis of nDSF data.

ADPR-binding proteins

As first model systems we have investigated two ADPR-binding proteins. EG1 (human) and SS1 (zebrafish) are soluble proteins that bind the physiological ligand adenosine diphosphate ribose (ADPR) enabling a variety of downstream signalling events. According to ITC experiments, EG1/ADPR shows a high nM affinity (16 nM at 20 °C, Fig. 3A) whereas SS1/ADPR shows a low micromolar affinity (3.5 μM at 25 °C, Fig. 4A). Only comparing Tm changes for different systems can lead to wrong conclusions (Figs. 3B and 4B). Even though SS1 shows a much larger melting temperature change than EG1 (5 °C vs. 10 °C, binding is tighter in case of EG1. These wrong conclusions are supported by the apparent binding affinity \({K}_{d,app}\) which is higher for EG1 than for SS1 (Figs. 3C and 4C). For EG1, model 1 fails completely, the respective \({K}_{d,app}\) is 4 orders of magnitude higher than the value determined by ITC. Model 2 performs better but the \({K}_{d,app}\) is still 3 orders of magnitude off. In contrast to that, isothermal analysis of nDSF binding studies not only reflects the trend in binding affinities but also the same order of magnitude compared to ITC. The temperature to extract Kd values is, however, limited to the region around the melting temperature. For EG1, the determined Kd values in the temperature range between 44 and 48 °C range from 442 nM at 44 °C to 2.9 μM at 48 °C (Fig. 3D). This strong apparent temperature dependence is a possible explanation for the significantly lower Kd value of 16 nM determined by ITC at 20 °C. Contrary to that behavior, isothermal analysis of SS1/ADPR yields stable binding affinities between 6.8 and 5.9 μM in the temperature range between 48 and 52 °C (Fig. 4D) which are in good agreement with the Kd value of 3.5 μM determined by ITC at 25 °C.

Figure 3
figure 3

(A) The Buffer subtracted ITC of EG1/ADPR yields Kd = 16 ± 4 nM (3 measurements) (B) nDSF signal (Ratio) for EG1/ADPR binding study with a protein concentration of 8 µM and ligand concentrations between 2 mM and 24 nM (14 dilutions). The region in the colored box was used for isothermal analysis (shown in D). (C) Melting temperature analysis with two different models yields apparent Kd,app values. (D) Isothermal analysis of nDSF data for ∆Cp = 0 at three selected temperatures. The same analysis for a fitted ∆Cp (cf. Table 1) is shown in Supplementary Information S25.

Figure 4
figure 4

(A) The buffer subtracted ITC experiment for SS1/ADPR yields a Kd = 3.5 µM (B) nDSF signal (fluorescence ratio F350/330) for SS1/ADPR binding study. The region in the colored box was used for isothermal analysis (shown in D). The ligand concentrations are color code from blue (low conc.) to red (high conc). The apo protein spectrum in absence of ligand is shown in black. (C) Melting temperature analysis with two different models yields apparent Kd,app values. (D) Isothermal analysis of nDSF data for ∆Cp = 0 at three selected temperatures. The same analysis for a fitted ∆Cp (cf. Table 1) is shown in Supplementary Information S26.

Major histocompatibility complex class I

Major histocompatibility complex (MHC) class I molecules are an important component in the immune response23,24,25. These molecules bind peptides and present them at the cell surface to mediate T cell immune surveillance. nDSF has already been successfully applied to assessing the thermal and kinetic stability of MHC/peptide complexes26. We have performed a nDSF binding study of empty MHC molecules of the HLA-A*02:01 allotype with the suboptimal peptide NT8 (sequence, NLVPMVAT; Fig. 5). Isothermal analysis at 32 °C results in a binding affinity of 6.3 μM, close to the value of 17.0 µM predicted by the NetMHC neural network27. Similarly to the case shown before, the apparent Kd,app for model 1 is considerably higher than the value determined by isothermal analysis, while model 2 yields a similar Kd,app.

Figure 5
figure 5

(A) nDSF binding study between MHC and the peptide NT8 (fluorescence ratio F350/330). The region in the colored box was used for isothermal analysis (shown in C). The ligand concentrations are color code from blue (low conc.) to red (high conc). (B) Melting temperature analysis with two different models yields apparent Kd,app values. (C) Isothermal analysis with a ∆Cp = 0. The same analysis for a fitted ∆Cp is show in supplementary information S27.

Peroxisomal-coenzyme A synthetase

We have also applied isothermal analysis to the Peroxisomal-coenzyme A synthetase (Pcs60), which is involved in the peroxisomal fatty-acid metabolism28. It catalyzes the formation of oxalyl-CoA from oxalate, ATP and CoA29. Similarly, to DSC measurements (cf. S24), nDSF melting curves for Pcs60 in presence of different concentrations of the nucleotide γS-ATP show two transitions around 40 and at 62 °C (Fig. 6A). The second transition does not shift upon addition of the ligand, whereas the first transition is affected by the presence of the ligand. Hence, the first transition was chosen for isothermal and melting temperature analysis, which yields an apparent Kd,app of 152 μM for model 1 and 10.2 μM for model 2, respectively (Fig. 6B). Isothermal analysis of the nDSF data at 40 °C yields a Kd of 18 μM (Fig. 6C). To evaluate isothermal analysis we have done microscale thermophoresis with the same system at 21 °C (MST, Fig. 6D + E). This method yields a Kd of 16 μM, which agrees very well with isothermal analysis and model 2.

Figure 6
figure 6

(A) nDSF signal (fluorescence ratio) for Pcs60/γS-ATP titration. The region in the colored box was used for isothermal analysis (shown in C). (B) Melting temperature analysis with two different models yields apparent Kd,app values. (C) Isothermal analysis of nDSF data for ∆Cp = 0 at three selected temperatures. The same analysis for a fitted ∆Cp (cf. Table 1) is shown in supplementary information S28. (D) MST experiment with Pcs60/γS-ATP. The “cold” and “hot” regions used for calculating Fnorm are marked as blue and red shadows, respectively. (E) Fitting Fnorm to a 1:1 model yields a Kd of 16 µM.

To test whether our method is sensitive to defects in ligand binding, we generated a Pcs60 K523A mutant. Lys523 is conserved among different Pcs60 sequences (cf. Fig. S30) and was shown to be crucial for nucleotide binding and catalytic activity in a related Pcs60 enzyme30. An isothermal analysis of the nDSF data at 40 °C yields a Kd of 213 μM for nucleotide binding of the K523A mutant (Supplementary Information S31) while the melting temperature analysis yields a higher Kd of 741 μM when using model 1 and a lower Kd of 117 μM when using model 2.

Discussion

For all cases shown here, binding affinities from isothermal analysis are in good agreement with the ones from isothermal titration calorimetry (ITC), microscale thermophoresis (MST) or a neural network prediction (MHC-I). Binding affinities for EG1/ADPR obtained by isothermal analysis show a strong temperature dependence. Sub μM affinities were retrieved, but only at low temperatures (\({T}_{iso}=44\)°C) far away from the melting temperature (\({T}_{m}=51\)°C). At these temperatures, the dynamic range of the fraction of unfolded protein is relatively low, which results in a larger relative error. The lowest fitting error can usually be obtained for a \({T}_{iso}\) around the \({T}_{m}\) of the protein in the absence of ligand. Apparent binding affinities from melting temperatures are considerably higher when using model 1. Together with the lack of physico-chemical foundations, this clearly shows that this model should not be used to estimate binding affinities. Affinities determined by model 2 are in good agreement with isothermal analysis as long as the affinities are above 1 μM. For sub-μM affinities, isothermal analysis performs better for our example system. This is supported by an additional binding study with a nM binder pknG and its inhibitor AX2001731 shown in the supplementary information (Supplementary Information, Figs. S34S35). For SS1 and EG1, the apparent Kds do not show the same trend compared to the affinities determined by ITC. The shape of the fitted curves can deviate from the experimental melting temperatures for both model 1 and 2 (cf. Figs. 4B and 6B), due to the thermodynamical incorrectness of model 1 or the approximation [L] = [L]0 for model 2. Apparent \({K}_{d}\) values with thermodynamically incorrect model 1 in Eq. (1) needs to be treated with caution. Model 2 yields in general good results but one could expect deviations when determining nM binding affinities. For Pcs60, isothermal analysis yielded binding affinities in agreement with microscale thermophoresis data confirming the crucial role of a particular active site lysine residue for nucleotide binding.

Our results show that isothermal analysis can be used to determine binding affinities in the nM to high μM range. Compared to a simple melting temperature analysis, the binding affinities are in better agreement with isothermal titration calorimetry, especially for nM binders. This label-free method requires a relatively low amount of protein sample when compared to other biophysical methods like ITC. Assuming that most proteins would contain at least one tryptophan residue, the concentration has to be adjusted to the detection range of the instrument, with an average working concentration around 0.1–0.2 mg/ml. When compared to the isothermal analysis performed on Thermofluor data, it is important to remember that most standard dyes are not compatible with membrane proteins solubilised in detergents. In addition, the Thermofluor assay reports unfolding/aggregation when hydrophobic patches are accessible while the nDSF assay reports the exposure of buried tryptophan residues upon protein unfolding. For Pcs60, nDSF experiment provided separated transitions that could be easily analyzed in contrast to the respective Thermofluor experiment (Supplementary Information S37). In our experience, the nDSF isothermal analysis is a suitable probe for intermediate to low binders (high nM to mM) where other biophysical techniques have limitations. The fact that autofluorescence of ligands does not seem to be of major concern when nucleotides have been analysed shows the robustness of the technique. Also, this method can be used to determine binding constants at higher temperatures (e.g., 37 °C), where a fraction of protein is already partly unfolded. Finally, we have developed a web server that is fast and easy to use. Our code reduces the calculation time for the computational bottleneck, which is global fitting of the thermal curves down to 5–10 s (45 curves with 649 points each). This server is now publicly available at https://spc.embl-hamburg.de and removes one obstacle in the application of isothermal analysis, namely easy access without programming skills. In the following, we will discuss common questions and criticism when it comes to the application of this method.

Choice of fluorescence signal and temperature range

Isothermal analysis can be applied to both intrinsic and extrinsic (external dye) fluorescence data. In the case of nDSF there are two quantities obtained during the measurements. Thermal unfolding can be simultaneously followed by the fluorescence at 330 nm or 350 nm. In many cases the unfolding can be visually most easily followed by the ratio between the fluorescence intensities at 350 nm and 330 nm, since they mostly show a different trend upon unfolding. However, there can be also exceptions where this ratio is not a good measure of unfolding32. In all the cases shown in this work, the fluorescence ratio was a good measure of protein unfolding and obtaining binding affinities. One exception is shown in the supplementary information (Fig. S34S36) where different ligand quenching effects on the intrinsic protein fluorescence are observed for 330 and 350 nm. This distorts the fluorescence ratio and prevents a proper analysis. It is therefore important to keep in mind to try the individual signals, in case the fluorescence ratio does not give a satisfying result. As far as the choice of temperature for isothermal analysis is concerned, we recommend to scan a wide region around the melting temperature of interest and to use the fitting error as a measure of quality. This will be exemplified later (cf. Fig. 8) by the heating rate experiments. Fitting a binding model to the fraction unfolded is fast and therefore not a computational bottleneck during isothermal analysis.

Reversibility of unfolding and effect on isothermal analysis

One prominent point of criticism when it comes to analysing nDSF data is the degree of irreversibility of the unfolding process during heating. In the ideal case, the protein unfolding is a two-state process that is completely reversible [cf. Eq. (5)]. The data analysis scheme presented here uses this model. However, irreversible unfolding can occur, which can affect the results of data analysis33. One example is the dependence of the apparent \({T}_{m}\) on the heating rate used during the experiment9. In the following two subsections, we present experiments to assess (I) the effect of the heating rate on melting temperatures and binding affinities and (II) the reversibility of unfolding for two systems presented earlier: SS1/ADPR and Pcs60/\(\gamma \)S-ATP.

To investigate the irreversibility of thermal unfolding we performed refolding experiments. For these experiments, the protein is heated up to a defined temperature and then cooled down at the same rate until the starting temperature is reached. Both during heating and cooling, the fluorescence signal is acquired. The maximum temperature chosen here is related to the temperature at which the error of the isothermal analysis is minimal. These temperatures are 40 °C for Pcs60/\(\gamma \)S-ATP and 50 °C for SS1/ADPR for a heating rate of 1 °C/min (Table 3). While Pcs60 unfolding is mostly reversible, SS1 unfolds irreversibly to a large extent (Fig. 7). This is even the case when decreasing maximum temperature for SS1 to 45 °C and increasing it for Pcs60 to 50 °C (Fig. S32).

Table 3 Summary of isothermal analysis for different heating rates for Pcs60/gS-ATP and SS1/ADPR for different heating rates shown in Fig. 8.
Figure 7
figure 7

Refolding experiment with Pcs60 (A) and SS1 (B) with a maximum temperature set to the temperature at which Kd values were extracted by isothermal analysis in Figs. 4 and 6 (40 °C for Pcs60 and 50 °C for SS1). Refolding experiment for Pcs60 up to 50 °C and SS1 up to 45 °C are shown in S32.

Influence of heating rate and choice of temperature for isothermal analysis

The heating rate during the nDSF experiment can have a strong influence on the apparent melting temperature \({T}_{m}\). This is caused by an irreversible unfolding step that is kinetically controlled. There are models that take into account this third state9 using a kinetic parameter. The relevant question here is how much the determination of binding affinities are affected by this shift in apparent melting temperatures.

A prior question is which temperature(s) to choose for the isothermal analysis. As mentioned above, we have calculated \({K}_{d}\) values for a broad range of temperatures around the melting temperature, e.g. the whole temperature window of the experimental data. All of them yield a \({K}_{d}\) and the errors of the respective fit, similarly to the examples shown in Fig. 8. As shown in this figure, the \({K}_{d}\) fitting error usually undergoes a minimum for temperatures around the melting temperature of the apo protein in absence of the ligand. For the analysis shown here we have used the numerical error of the fit as a criterion for choosing the temperature at which to extract a \({K}_{D}\) (shown as vertical lines in Fig. 8).

Figure 8
figure 8

Isothermal analysis at different heating rates for Pcs60/γS-ATP (A) and SS1/ADPR (B). For all datasets, the ∆Cp = 0 was assumed. The binding affinities were extracted at the temperature with minimum Kd fitting error marked by vertical lines (lines are horizontally shifted for visibility). All temperature and values are summarized in Table 3. The analyses for a fitted ∆Cp for a heating rate of 1 °C/min (cf. Table 1) are shown in Supplementary Information S33.

To test the influence of the heating rate on the determination of binding affinity, we have done nDSF binding studies with different heating rates: 7, 5, 3 and 1 °C/min for Pcs60/γS-ATP and 3, 1 and 0.1 °C/min for SS1/ADPR and analyzed the data by isothermal analysis. The analysis for \(\Delta {C}_{p}=0\) kcal/mol are shown in Fig. 8. The respective analysis for fitted \(\Delta {C}_{p}\) values (cf. Table 1) are shown in supplementary information S33. All results are summarized in Table 3. As far as apparent melting temperatures are concerned, Pcs60 and SS1 exhibit a very different behaviour upon change of the heating rate (Fig. 8A and B, top panels). Pcs60 only shows a slight increase of apparent melting temperatures with increased heating rates, even for very fast heating. In contrast to that, the apparent melting temperatures for SS1 increase drastically by 7 °C when changing heating rates from 1 to 3 °C/min. This might be an additional indication that Pcs60 is less prone to irreversible unfolding processes than SS1 and corroborates the results from the refolding experiments. As seen in the second panel of Fig. 8B, the determined affinities differ depending on which temperature is chosen for isothermal analysis and depending on the heating rate. However, choosing the temperature of minimal fitting error, provides similar binding affinities \({K}_{d}\) (Table 3). We still recommend using the same data acquisition parameters when comparing binding affinities in a systematic study. Using a \(\Delta {C}_{p}=0\) or the fitted \(\Delta {C}_{p}\) (cf. Table 1) does not result in differences in Kd determination for these systems.

Even though unfolding for the systems described here might not be simple two-state unfolding processes (e.g. Pcs60: DSC in fig. S24; full nDSF in Fig. 4) and not reversible (e.g. SS1), isothermal analysis with a simple two-state unfolding model still could reproduce the binding affinities determined by isothermal titration calorimetry or microscale thermophoresis for the systems presented here.

Outlook: application to more complex binding stoichiometries

One feature of the data analysis approach shown here is the simplicity of the thermodynamic model used. Fitting the thermal unfolding curves (both local and global) does not require any information of the binding stoichiometry (cf. Fig. 1). The parameters retrieved from these fittings are only two parameters \(\Delta H\) and \({T}_{m}\) of protein unfolding. These parameters are used to obtain the fraction unfolded \({f}_{u}\) at a chosen temperature vs. ligand concentration, which are subsequently fitted to a binding model. All the examples shown in this work are 1:1 binders and thus a 1:1 binding model was chosen for fitting the \({f}_{u}\). However, the analysis scheme can also be applied to more complex binding models. The only requirement is a mathematical model to describe the fraction unfolded \({f}_{u}\) as a function of the ligand concentration with the binding affinities as parameters. This includes analytical or numerical expressions for the equilibrium concentrations of the different species. The model has to take into account which species can be distinguished by the fluorescence signal. Normally, DSF will distinguish only between the folded and unfolded species. However, quenching effects (e.g. at different binding sites) can require a more complicated model.

Methods

Protein expression and purification

Human EG1 and zebrafish SS1 proteins

Human (hs) EG1 was produced with a TEV-cleavable C-terminal His10-tag using the pET15b(+) vector. The sequence coding for the zebrafish (dr) SS1 was cloned into the pnEK-vH vector to generate a construct with a TEV-cleavable N-terminal His6-tag. Protein production was carried out in BL21 Gold (DE3) E. coli bacteria in Terrific Broth (TB) media. Cells were grown at 37 °C to a cell density of 0.6–0.9 (measured at 600 nm) and after induction with 0.1 mM isopropyl-\(\beta \)-d-1-thio-galactopyranoside (IPTG), the target sequences were expressed overnight at 20 °C. The cells were harvested by centrifugation (5000×g for 25 min) and lysed by sonication or homogenization. Lysates were centrifuged (39,000×g for 45 min) and the tagged proteins were purified from the supernatant using immobilized metal affinity chromatography (IMAC). After His-tag removal by TEV protease and a second round of IMAC, the target proteins were further purified by gel filtration (Superdex 200 Increase 10/300 GL) using buffer N (for EG1; 20 mM HEPES pH 7.5, 150 mM NaCl, 5 mM CaCl2) or buffer M (for SS1; 25 mM HEPES pH 7.5, 150 mM NaCl, 5 mM CaCl2). Peak fractions were pooled and protein identity confirmed by SDS-PAGE and mass spectrometry.

MHC-I

HLA-A heavy chain disulfide mutant (dsA2) and human β2m light chain (hβ2m) were expressed and purified as described by Anjanappa et al.23. Briefly, expression was performed in E. coli Rosetta BL21 (DE3) pLysS using pET plasmid under the control of a T7 promoter. Cells were pelleted and lysed in cell lysis buffer (25% sucrose, 1 mM EDTA, 1 mM PMSF, 10 mM DTT in 50 mM Tris–Cl pH 8.0). Inclusion bodies were harvested by centrifugation, washed three times with detergent buffer (50 mM Tris pH 8.0, 150 mM NaCl, 0.5% Triton X 100, 1 mM DTT), followed by final wash with TBS (50 mM Tris pH 8.0, 150 mM NaCl, 1 mM DTT). The pellet was dissolved in solubilization buffer (6 M Guanidine HCl, 50 mM HEPES pH 6.5, 0.1 mM DTT, 0.1 mM EDTA) for 24 h at 4 °C. The solubilized protein was centrifuged at 30,000 × g for 30 min and the supernatant fraction was stored at − 80 °C until use. The refolding reaction was performed by diluting 1 μM of dsA2 heavy chain and 2 μM of hβ2m in a refolding buffer (100 mM Tris·Cl pH 8, 0.5 M arginine, 2 mM EDTA, 0.5 mM oxidized glutathione, 5 mM reduced glutathione, 10 mM dipeptide GM (Bachem) and incubated for 4–5 days at 4 °C with constant stirring, followed by protein concentration using 30 kDa cutoff membrane filters (Vivaflow-200; Sartorius). The concentrated protein was purified by SEC in 20 mM Tris–HCl pH 8.0, 150 mM NaCl on an ÄKTA system (GE healthcare) using a HiLoad Superdex 200 16/600 column (GE healthcare).

Pcs60

Full length S. cerevisiae Pcs60 was cloned in a pETM14 vector. Pcs60 was expressed in BL21 E. coli strain grown in auto-induction medium34, with 5 h at 37 °C and 16 h at 18 °C. Cells were harvested, resuspended in lysis buffer (50 mM HEPES pH 7.5, 150 mM NaCl, 20 mM imidazole, protease inhibitor (Roche), DNAse (Sigma) and lysozyme (Sigma)), homogenized 1 h at 4 °C and lysed by sonication. Lysates were then cleared by centrifugation and the supernatant loaded onto Ni–NTA resin (Qiagen). Bound proteins were washed with 50 mM HEPES pH 7.5, 750 mM NaCl, 20 mM Imidazole and the protein eluted with 50 mM HEPES pH 7.5, 150 mM NaCl, 250 mM Imidazole. The eluate was then dialysed against HEPES pH 7.5, 150 mM NaCl, 0.5 mM TCEP and simultaneously digested with 1 mg of TEV-protease per 100 mg of protein. Undigested protein and TEV protease were removed by a second Ni–NTA step and flow through containing Pcs60 was concentrated to 5 ml for gel filtration (HiLoad 16/60 Superdex 200 pg, GE healthcare) using an ÄKTA pure purification system. Relevant fractions were pooled together and the protein was concentrated, flash frozen in liquid nitrogen and stored at − 80 °C.

For the binding studies we have selected a Pcs60 variant with optimized oligomerization behavior. This construct shows very similar binding affinities compared to the wild-type PCS60 (cf. Fig. S29) and a larger melting temperature range with lower melting temperatures (cf. Fig. 6A and Fig. S29A).

Isothermal titration calorimetry (ITC)

ITC measurements for Human EG1 and zebrafish SS1 proteins were carried out on a MicroCal ITC-200 isothermal titration calorimeter (Malvern Panalytcal, Malvern, UK) and thermodynamic parameters were analysed using the MicroCal Origin program. Measurements were performed at 20 °C in buffer N (for EG1) or at 25 °C in buffer M (for SS1). The ligand ADPR was dissolved in the respective buffer. After an initial injection of 0.5 μl, 18 regular injections of 2 μl of ADPR were added to 10–20  μM EG1 or SS1 in the sample cell. The individual injections were interspaced by 150 s and stirring speed was set to 750 rpm. Baseline corrections were obtained by titrating ADPR into the corresponding buffer.

Microscale thermophoresis (MST)

MST curves for Pcs60/\(\gamma \)S-ATP were acquired with a Nanotemper Monolith NT.LabelFree (Nanotemper) using the MO.Control v1.6 acquisition software and Monolith NT.LabelFree capillaries. The same protein concentration and buffer conditions as for the nDSF experiments were used (9.9 μM). 16 different ligand concentrations (serial dilution with a factor of 2) in the range between 5 mM and 0.15 μM were used for each measurement. For the MST experiment the IR laser power was set to 20% and the UV laser power to 1%. In total, two independent measurements were done plus an additional repeat. The data was exported as xlsx and analyzed and plotted with a self-written Python script. The regions for Fnorm determination were − 1–0 s for “cold” and 19–20 s for “hot”. For each curve Fnorm was calculated as the ratio between the mean values in the “hot” and “cold” region. In the final step, Fnorm was fitted to a 1:1 binding model. The self-written Python scripts for spectral analysis including MST analysis are available online: https://github.com/steniegit/libspec

Sequence alignment

Sequence alignment of Saccharomyces cerevisiae Pcs60 (UniProt P38137) and Arabidopsis thaliana AAE3 (UniProt Q9SMT7) were performed using MUSCLE35 and visualized with Jalview36.

nDSF experiments: general information

All nDSF temperature curves were acquired with a Nanotemper Prometheus NT.48 fluorimeter (Nanotemper) controlled by PR.ThermControl (version 2.1.2). Excitation power was pre-adjusted to obtain fluorescence readings above 2000 RFU for fluorescence emission at 330 nm (F330) and 350 nm (F350), and samples were heated from 20 to 95 °C with a rate of 1 °C/min unless otherwise stated. For all measurements, Prometheus NT.48 series nDSF grade standard capillaries (Nanotemper, Munich, Germany) were used. For each binding study a serial dilution series of the ligand (dilution ratio 2) at constant protein concentration was prepared. In addition to the ligand dilutions, one pure protein sample was used. If not stated otherwise, all binding studies were performed in duplicates.

nDSF binding studies

Human EG1 and zebrafish SS1 proteins

Binding studies of EG1 and SS1 were performed in the same buffers as mentioned in the protein expression and purification section at protein concentrations of 8 μM and 5 μM, respectively. The ligand ADPR was dissolved in the same buffer and a dilution series of a solution containing the protein at a constant concentration (as shown above) and ADPR concentrations between 2 mM and 25 nM (14 dilutions, triplicates) were used for the nDSF study (LED power 10%).

MHC-I

11 dilutions of NT8 peptide ligand (NLVPMVAT) starting at 200 μM in the presence of 2.2 μM of MHC-I in SEC buffer (20 mM Tris pH 8, 150 mM NaCl) were prepared. Peptide was purchased from Genecust (Boynes, France) with 90% purity. The dilutions span a ligand concentration range between 200 μM and 195 nM with a dilution factor of 2 between each point. After 30 min incubation at room temperature, the samples were transferred to nDSF capillaries (duplicates) and nDSF curves were acquired with a LED power of 70%.

Pcs60

The following protocol was used for all nDSF experiments with Pcs60 and the nucleotide ligand γS-ATP. 14 dilutions of the nucleotide ligand (γS-ATP) starting at 5 mM at a constant Pcs60 concentration of 9.9 μM were prepared in buffer (50 mM HEPES pH 7.5, 150 mM NaCl, 0.5 mM TCEP). The dilutions span a ligand concentration range between 5 mM and 610 nM with a dilution factor of 2 between each point. In addition to the ligand dilutions, a pure Pcs60 sample in the same buffer was used. If not stated otherwise, the measurements were run as triplicates (in total 45 temperature curves). The LED laser power for the experiments was between 70 and 80%.

Isothermal analysis of nDSF data in a nutshell

The isothermal analysis follows the approach by Bai et al.16. Briefly, obtaining binding affinities from the experimental data is a four-step process. In the first step, each fluorescence curve is fitted individually using six parameters: the melting temperature \({T}_{m}\), the enthalpy of unfolding \(\Delta \)H, the initial and final intercepts, and the initial and final slopes (combining Eqs. 14, 12 and 9). These values can in principle already be used to continue with the isothermal analysis to obtain a binding affinity. However, an additional global fit of the fluorescence curves using the parameters from the previous fit as input gives more consistent results. In the global fit, the initial and final slope are fitted globally while the other parameters are still fitted individually for each curve. In the third step, the fraction unfolded (\({f}_{u}\)) is calculated for each ligand concentration at one or several temperatures. These temperatures can be selected in the temperature range, where unfolding of the relevant domain takes place. In the last step, the fraction unfolding vs. the ligand concentrations are fitted to a binding model. Here, we used a 1:1 binding model but the method can be extended to more complex models. The whole process is depicted as flowchart in Fig. 1. More details about data analysis are given in the experimental section and in the supplementary information S10 and figure S38.

Isothermal fitting routines

Both the local and global fitting routines for the isothermal analysis contain boundaries for the fitted melting temperatures \({T}_{m}\) and the unfolding enthalpy \(\Delta H\). The melting temperature is constrained to the experimental temperature region that is selected for the analysis. \(\Delta H\) is constrained to be positive. Our program contains an experimental feature that allows to fit the change of heat capacity \(\Delta {C}_{p}\) as a global variable in addition to the others. This feature remains to be experimentally validated. All errors specified in the plots are numerical fitting errors derived from the square root of the diagonal elements of the covariance matrix (output of Scipy curve_fit function).