Abstract
The strength and specificity of protein complex formation is crucial for most life processes and is determined by interactions between residues in the binding partners. Double-mutant cycle analysis provides a strategy for studying the energetic coupling between amino acids at the interfaces of such complexes. Here we show that these pairwise interaction energies can be determined from a single high-resolution native mass spectrum by measuring the intensities of the complexes formed by the two wild-type proteins, the complex of each wild-type protein with a mutant protein, and the complex of the two mutant proteins. This native mass spectrometry approach, which obviates the need for error-prone measurements of binding constants, can provide information regarding multiple interactions in a single spectrum much like nuclear Overhauser effects (NOEs) in nuclear magnetic resonance. Importantly, our results show that specific inter-protein contacts in solution are maintained in the gas phase.
Similar content being viewed by others
Introduction
Methods such as X-ray crystallography, electron microscopy, and nuclear magnetic resonance can be used to determine the structures of protein complexes at high resolution. Nevertheless, complementary methods are needed for determining the energetics of binding reactions and for obtaining structural information in cases that are not amenable to analysis by these methods owing, e.g., to low solubility, large size, and/or high flexibility. One such complementary method that is widely used is double-mutant cycle (DMC) analysis1. In this method, two residues of interest are mutated individually and in combination and the energetic effects of the mutations (e.g., on binding or folding) are measured. If the effect of the double mutation differs from the sum of effects of the corresponding single mutations then an interaction, either direct or indirect, between the two residues is inferred. Under favorable circumstances, the deviation from additivity, termed the coupling energy, can be used to estimate the strength of intramolecular2, 3 and intermolecular4 pairwise interactions in proteins or protein–ligand complexes. Coupling energies can also be used as constraints (much like nuclear Overhauser effects (NOEs) in nuclear magnetic resonance) to determine protein structures, e.g., when docking two proteins to each other5. Recent applications of the DMC method in molecular recognition studies include combining it with cryo-electron microscopy to identify the binding site of capsaicin in the TRPV1 ion channel6 and demonstrating an affinity-specificity tradeoff in PDZ–peptide interactions7. Here we describe a native mass spectrometry-based approach that simplifies the application and enhances the power of DMC analysis for studying protein interactions. Our results also show that specific inter-residue interactions are maintained in the gas phase.
Results
Native mass spectrometry can simplify DMC analysis
The application of DMCs for detecting and quantifying inter-protein pairwise interactions between residue i in protein X and residue j in protein Y involves creating a cycle that comprises the four possible complexes that can be formed by the two wild-type proteins, Xi and Yj, and the two corresponding single mutants, Xm and Ym, (Fig. 1). The change in the free energy of binding upon the mutation i→m, when leaving residue j unchanged may be expressed relative to the wild-type protein complex as ΔG(ij→mj). Likewise, the change in the binding free energy upon the mutation i→m, when residue j has already been mutated to m, may be expressed as ΔG(im→mm). The difference between ΔG(ij→mj) and ΔG(im→mm), which is equal to the difference between ΔG(ij→im) and ΔG(mj→mm), provides a measure of the coupling energy, ΔΔG int(ij), between residue i in protein X and residue j in protein Y. This coupling energy can be expressed in terms of binding constants as follows:
where K ij is the binding constant for the two wild-type proteins, Xi and Yj, K mm is the binding constant for the two corresponding single mutants Xm and Ym and K im and K mj are the respective binding constants for Xi with Ym and Xm with Yj.
Identification and quantification of an inter-molecular pairwise interaction using the DMC approach is labor intensive since it requires generating binding isotherms for the four complexes in each cycle using methods such as isothermal calorimetry or surface plasmon resonance. It is also error-prone because unavoidable errors in protein concentrations lead to errors in the estimates of the binding constants and, thus, to a cumulative error in the estimate of the value of the coupling energy, ΔΔG int(ij). The native mass spectrometry (MS) approach for DMC analysis that is described here circumvents the need for the determination of the individual binding constants and provides a way to measure ΔΔG int(ij) directly from a single spectrum. It is based on the realization that substitution of each of the binding constants in Eq. 1 by the appropriate ratio of free protein and complex concentrations (e.g., K ij = [XiYj]/[Xi][Yj]) leads to the following expression:
where the concentrations of all the free species cancel out.
Native MS is a method based on the ability to transfer protein complexes to the gas phase without disrupting them8,9,10,11,12. The high resolution afforded by the Orbitrap platform enabled us to determine the intensities of all four complexes simultaneously from a single mass spectrum. Hence, by mixing the two wild-type proteins, Xi and Yj, and the two corresponding single mutants Xm and Ym, and then measuring the intensities of all four possible co-existing complexes using native MS one can obtain a direct estimate of ΔΔG int(ij) that does not depend on knowing the concentrations of the free species ([Xi], [Yj], [Xm], and [Ym]).
Targeting inter-protein interactions in the E9:Im2 complex
The applicability of this native MS-based approach for measuring pairwise coupling energies was tested for the well-characterized interaction of colicin E9 endonuclease with the bacterial immunity protein Im2 with which it forms a 1:1 complex13. Two DMCs were constructed: one for the interaction of Asn34 in Im2 with Ser84 in E9 and a second for the interaction of Asp33 in Im2 with Phe86 in E9. Phe86 is a main hot spot residue in the interface of E9 with Im913. In both cycles, all the mutations were to alanine as before13. The coupling energies of the two pairwise interactions were determined by the native MS-based approach for DMC analysis described here and, in comparison, by measuring all the required individual binding constants using isothermal titration calorimetry.
Assignment of the peaks in the native MS spectra
A prerequisite for DMC analysis by native MS is that the changes in mass upon mutation are sufficiently large so that the masses of the different complexes forming the cycle can be well resolved. In the case of the cycle for the interaction of Asn34 in Im2 with Ser84 in E9, this condition was initially not met. We, therefore, combined the mutation Ser84→Ala in E9 with two additional mutations, His131→Ala and Arg132→Ala, in order to increase the mass difference. His131 and Arg132 are in the flexible tail of E9, thereby suggesting that they are not involved in any inter- or intra-protein interactions14. This was confirmed by showing that the affinity of the E9 Ser84→Ala mutant for Im2 is unchanged by the tail mutations (Table 1 and Supplementary Fig. 1). Introducing such mutations, therefore, provides a general strategy for increasing mass differences when needed. Native MS analysis confirmed that the differences in mass between wild type and mutant E9 or wild type and mutant Im2 can indeed be resolved (Fig. 2). Importantly, the results also show that the wild type and mutant variants of each protein (Fig. 2 and Table 2) and their complexes (Supplementary Tables 1 and 2) have similar ionization efficiencies so that the ratios of their intensities correspond to their concentration ratios in solution.
Given the results in Fig. 2, we proceeded to analyze mixtures containing different ratios of the four wild type and mutant E9 and Im2 proteins that combine to form the different complexes in each of the respective cycles (Figs. 3 and 4). The results show that, although the main peaks corresponding to the respective complexes in each cycle are well resolved from each other, there is some overlap in the minor peaks. The existence of several peaks is most likely due to bound ethylenediammonium ions and not to any variations in the protein sequences or post-translational modifications since only a single peak with a mass corresponding to the expected calculated mass was observed upon MS analysis of each of the denatured proteins (Supplementary Fig. 2). The concentration of each complex, therefore, had to be determined by summing the areas corresponding to both its major peak and all its minor peaks. Consequently, correct assigning of the peaks in the overlapping regions was necessary in order to obtain reliable estimates of the concentrations. These assignments were achieved by determining how the intensities of the peaks in question change as a function of the mixing ratio. Importantly, inspection of Eq. 1 shows that the values of the coupling energies do not depend on the mixing ratio.
Coupling energies from native MS and isothermal calorimetry
Given the relative concentrations of the four complexes that form each of the cycles, it was possible to calculate the coupling energies corresponding to them using Eq. 2 (Tables 3 and 4). The values obtained for the two different charge states and the various concentration ratios were found to be similar as expected. The average values for the interactions of Asn34 in Im2 with Ser84 in E9 (Table 3) and Asp33 in Im2 with Phe86 in E9 (Table 4) are −0.04 ± 0.03 and 1.21 ± 0.10 kcal mol−1, respectively. These values are in very good agreement with values of 0.06 ± 0.21 and 1.43 ± 0.25 kcal mol−1 that we obtained for these respective interactions using isothermal calorimetry (Fig. 5 and Table 1). The value of about zero measured for the coupling energy of Asn34 in Im2 with Ser84 in E9 is not surprising given the relatively large distance separation between these residues in the complex structure (the closest distance is 8 Å between the OD1 atom of Asn34 and the O atom of Ser8414). The positive coupling energy of more than 1 kcal mol−1 found for Asp33 in Im2 and Phe86 in E9 is consistent with a stabilizing and possibly cooperative anion–aromatic interaction between the O and OD2 atoms of Asp33 and the respective Cz and CE1 atoms of Phe86, which are separated in the complex structure by about 4.3 and 5.5 Å, respectively14. Cation-pi pairs in proteins have been documented widely but much less attention has been paid to anion–aromatic interactions between negatively charged side chains and the positive electrostatic potential associated with the ring edge of aromatic groups15, 16. The negative coupling energy of about −1.7 kcal mol−1 reported for the interaction of Asp33 in Im2 and Phe86 in E9 in previous work13 suggests that the different buffer conditions employed in the two studies may have caused conformational switching between a favorable (negatively charged side chain with ring edge) and unfavorable (negatively charged side chain with ring face) anion–aromatic interaction.
Concluding remarks
In summary, many previous studies have shown that protein–ligand and protein–protein complexes can be maintained in the gas phase8,9,10,11,12. Here we have shown using DMC analysis that the interaction energies of specific residue pairs can also remain unchanged upon transfer to the gas phase. The native MS version of DMC analysis described here provides values of coupling energies without the need for multiple measurements of affinity constants and accurate determination of protein concentrations, thereby simplifying the measurements and reducing the experimental error. The method can be extended to work in a high-throughput fashion by mixing wild type and n different single mutants of one protein with wild type and m different single mutants of a second interacting protein. These n + 1 and m + 1 variants of the two proteins form m × n different mutant cycles that can yield m × n coupling energies that can serve as constraints for docking. Hence, it might be possible in the future to determine the structure of a complex from a single native mass spectrum if the structures of the free proteins are known.
Methods
Molecular biology
The plasmid pET21d containing the wild-type genes of E9 DNase and Im9 with a His tag or E2 DNase and Im2 with a His tag was used as a template for mutagenesis by restriction-free cloning17. The following mutagenic oligonucleotides were used together with the T7 universal primer as forward and backward primers, respectively.
Phe86→Ala in E9:
5′-GTGTTTCAAAAGGTTATTCTCCGGCTACTCCAAAGAATCAACAGGTCG-3′;
Asn34→Ala in Im2:
5′-GGTGCTACTGAAGAGGATGACGCTAAATTAGTGAGAGAGTTTGAGCG-3′;
Asp33→Ala in Im2:
5′-GGTGCTACTGAAGAGGATGCCAATAAATTAGTGAGAGAGTTTGAGC-3′. The mutation Ser84→Ala in E9 was introduced together with two other mutations, His131→Ala and Arg132→Ala, designed to increase the difference between the mass of this mutant and those of the other variants without affecting its binding to Im2. This triple mutant was created using the following respective mutagenic oligonucleotides as forward and backward primers.
Ser84→Ala:
5′-GGTGCTACTGAAGAGGATGCCAATAAATTAGTGAGAGAGTTTGAGC-3′;
His131→Ala and Arg132→Ala:
5′-CCTAAGCGACATATCGATATTGCCGCAGGTAAGTAAAATGGAACTGAAGC-3′. The mutations were confirmed by DNA sequencing of the entire genes.
Expression and purification of the E9 and Im2 proteins
E. coli BL21 (DE3) cells harboring the pET21d plasmid that contains the appropriate genes coding for E9 and Im9 with a His6 tag or E2 and Im2 with a His6 tag were grown in LB medium containing 100 µg ml−1 ampicillin at 37 °C until an O.D600 nm of 0.6 was reached and protein expression was then induced by adding 1 mM isopropylthio-β-galactoside. The cells were then grown for another 4 h at 37 °C and harvested. Cell pellets were stored at −80 °C until further use. Purification was carried out by resuspending the cells in 20 mM Tris buffer (pH 7.5) containing 100 mM NaCl, 5 mM imidazole, and 1 mM DTT (buffer A) to which was added the cOmplete, EDTA-free, protease inhibitor cocktail (Roche Applied Science, Penzberg, Germany). The cells were then disrupted by sonication and the lysate was clarified by centrifugation. The supernatant was loaded on a 5 ml HisTrap HP column (Amersham Pharmacia, Uppsala, Sweden). The E9 DNase was dissociated from the column-bound Im9 and eluted by washing the column with buffer A containing 6 M guanidine hydrochloride. Im2 was eluted using a 10–200 mM imidazole gradient in buffer A containing 6 M guanidine hydrochloride. Fractions that contain the E9 and Im2 proteins were dialyzed overnight at 4 °C against water and lyophilized. The purified E9 was stored at −80 °C until further use. The Im2 protein was dissolved in 50 mM Tris buffer (pH 7.5) containing 50 mM KCl and 1 mM DTT (buffer B) and then further purified using a Superdex 75 column equilibrated with this buffer. The fractions containing Im2 were then loaded on a 5 ml HiTrap Q FF column (Amersham Pharmacia, Uppsala, Sweden). The Im2 protein was eluted from the Q FF column using a 0.05–0.5 M KCl gradient in buffer B. Fractions containing Im2 were combined, dialyzed overnight at 4 °C against water, and then lyophilized. The purified Im2 protein was stored at −80 °C until further use. The concentrations of E9 and Im2 proteins were determined using molar absorbance coefficients of 15,470 and 10,095 M−1 cm−1, respectively.
Isothermal titration calorimetry
Experiments were carried out using an ITC200 Microcal calorimeter. Lyophilized protein samples were dissolved in 250 mM ethylenediammonium diacetate (EDDA) buffer (pH 7.0) and then dialyzed overnight at 4 °C against this buffer. The protein samples were centrifuged and their concentrations determined before starting the ITC experiments. The titrations were carried out using a concentration of 0.04–1 mM E9 in the cell and a ten-fold higher concentration of Im2 in the syringe. Each ITC experiment comprised 26 injections with 3 min intervals of 1.5 µl Im2 at 25 °C. The heats of dilution were subtracted from the raw data and values of the binding constants were then determined by fitting the resulting binding isotherm to a 1:1 binding model using the ORIGIN program supplied by the manufacturer.
Mass spectrometry analysis
All the native MS experiments were performed using a Q Exactive Plus Orbitrap mass spectrometer (Thermo Fisher Scientific, Bremen, Germany) modified for optimal transmission and detection of high molecular weight ions. Before MS analysis, proteins were dialyzed against 0.25 M EDDA buffer (pH 7). Proteins forming the DMC of interest were then mixed at the desired ratios and incubated for 5 min at room temperature. Typically, an aliquot of 2 μl protein solution was loaded into a gold-coated nano-ESI capillary prepared in-house as previously described18 and then sprayed into the instrument. Conditions within the mass spectrometer were adjusted to preserve non-covalent interactions, with the source operating in positive mode. The following experimental parameters were used: capillary voltage, 1.7 kV; HCD direct voltage, 30–70 eV; and inlet capillary temperature in the range of 160–245 °C. MS spectra were recorded at a resolution of 17,500 (at m/z 200).
Assignment of peaks and area calculations
For each mass spectrum, the areas of the peaks corresponding to the four possible complexes, and taking into consideration the adherence of ethylenediammonium, i.e., (M+H)+ and (M+C2H10N2)+, were calculated using the PeakFit v4.12 software (Jandel Scientific, San Rafael, CA). In this program, peaks are fitted automatically to a series of Gaussians using a deconvolution approach. A numerical fitting procedure was repeated to minimize the deviation from experimental data as monitored by the coefficient of determination, R 2. Typical R 2 values were higher than 0.99. This analysis was repeated for the two detected charge states, i.e., 8+ and 9+. The reliability of the data deconvolution results was validated by comparison of the expected peak position calculated from the mass to charge ratio with the generated peakfit value. In addition, in cases of uncertainty due to the presence of bound ethylenediammonium adducts, the assignment was further confirmed by comparing the relative abundance of a specific peak in measurements performed at different ratios. For example, the peak at 2889 m/z can correspond to the complex of E9S84A with either wild-type Im2 or Im2N34A. This peak increases when the concentration of wild-type Im2 is doubled relative to the other three proteins (see Supplementary Fig. 3). Hence, it was possible to assign the peak at 2889 m/z to the complex of E9S84A with wild-type Im2. At least four independent experiments were performed for each ratio.
Data availability
Data supporting the findings of this study are available within the article (and its Supplementary Information file) and from the corresponding authors on reasonable request.
References
Horovitz, A. Double mutant cycles: a powerful method for analyzing protein structure and function. Fold. Des. 1, R121–R126 (1996).
Serrano, L., Horovitz, A., Avron, B., Bycroft, M. & Fersht, A. R. Estimating the contribution of engineered surface electrostatic interactions to protein stability by using double-mutant cycles. Biochemistry 29, 9343–9352 (1990).
Horovitz, A., Serrano, L., Avron, B., Bycroft, M. & Fersht, A. R. Strength and co-operativity of contributions of surface salt bridges to protein stability. J. Mol. Biol. 216, 1031–1044 (1990).
Schreiber, G. & Fersht, A. R. Energetics of protein-protein interactions: analysis of the barnase-barstar interface by single mutations and double mutant cycles. J. Mol. Biol. 248, 478–486 (1995).
Hidalgo, P. & MacKinnon, R. Revealing the architecture of a K+ channel pore through mutant cycles with a peptide inhibitor. Science 268, 307–310 (1995).
Yang, F. et al. Structural mechanism underlying capsaicin binding and activation of the TRPV1 ion channel. Nat. Chem. Biol. 11, 518–524 (2015).
Karlsson, O. A., Sundell, G. N., Andersson, E., Ivarsson, Y. & Jemth, P. Improved affinity at the cost of decreased specificity: a recurring theme in PDZ-peptide interactions. Sci. Rep. 6, 34269 (2016).
Sharon, M. Structural MS pulls its weight. Science 340, 1059–1060 (2013).
Sharon, M. & Horovitz, A. Probing allosteric mechanisms using native mass spectrometry. Curr. Opin. Struct. Biol. 34, 7–16 (2015).
Chen, F. et al. Applying mass spectrometry to study non-covalent biomolecule complexes. Mass Spectrom. Rev. 35, 48–70 (2016).
Liko, I., Allison, T. M., Hopper, J. T. & Robinson, C. V. Mass spectrometry guided structural biology. Curr. Opin. Struct. Biol. 40, 136–144 (2016).
Lössl, P., van de Waterbeemd, M. & Heck, A. J. The diverse and expanding role of mass spectrometry in structural and molecular biology. EMBO J. 35, 2634–2657 (2016).
Keeble, A. H. et al. Experimental and computational analyses of the energetic basis for dual recognition of immunity proteins by colicin endonucleases. J. Mol. Biol. 379, 745–759 (2008).
Meenan, N. A. et al. The structural and energetic basis for high selectivity in a high-affinity protein-protein interaction. Proc. Natl Acad. Sci. USA 107, 10080–10085 (2010).
Thomas, K. A., Smith, G. M., Thomas, T. B. & Feldmann, R. J. Electronic distributions within protein phenylalanine aromatic rings are reflected by the three-dimensional oxygen atom environments. Proc. Natl Acad. Sci. USA 79, 4843–4847 (1982).
Schwans, J. P. et al. Use of anion-aromatic interactions to position the general base in the ketosteroid isomerase active site. Proc. Natl Acad. Sci. USA 110, 11308–11313 (2013).
van den Ent, F. & Löwe, J. RF cloning: a restriction-free method for inserting target genes into plasmids. J. Biochem. Biophys. Methods 67, 67–74 (2006).
Kirshenbaum, N., Michaelevski, I. & Sharon, M. Analyzing large protein complexes by structural mass spectrometry. J. Vis. Exp. doi: 10.3791/1954 (2010).
Acknowledgements
A.H. is an incumbent of the Carl and Dorothy Bennett Professorial Chair in Biochemistry. M.S. is grateful for the support of a Starting Grant from the European Research Council (ERC) (Horizon 2020)/ERC Grant Agreement no. 636752. M.S. is an incumbent of the Aharon and Ephraim Katzir Memorial Professorial Chair.
Author information
Authors and Affiliations
Contributions
M.Sh. and A.H. designed the experiments and wrote the paper. M.So. and I.K. purified the proteins and performed the ITC measurements. J.C. and D.H. carried out the mass spectrometry experiments.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Sokolovski, M., Cveticanin, J., Hayoun, D. et al. Measuring inter-protein pairwise interaction energies from a single native mass spectrum by double-mutant cycle analysis. Nat Commun 8, 212 (2017). https://doi.org/10.1038/s41467-017-00285-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-017-00285-1
This article is cited by
-
Direct characterization of overproduced proteins by native mass spectrometry
Nature Protocols (2020)