Introduction
Owing to the conserved structural elements of protein kinases and the dynamic nature of kinase cascades, the study of cellular signaling remains a challenge. A variety of powerful methods (for example, gene knockouts, RNA interference and pharmacological agents) are available for determining the role of kinases within signaling cascades. Pharmacological agents such as small organic molecules are particularly powerful because they offer rapid and dose-dependent control over kinase activity. However, two key issues arise when interpreting the cellular effects of kinase inhibition by small molecules. First, few (if any) ATP-competitive kinase inhibitors are truly specific for a single kinase because key structural elements of the active site across all 518 human kinases are conserved; off-target effects are common1, 2. Second, the intracellular concentration and biological activity of kinase inhibitors are both greatly influenced by multiple factors, including barriers to cell permeability, drug efflux, cellular metabolism, competition with ATP, and the presence of cellular phosphatases, the endogenous antagonists of kinase activity2, 3. Genetic techniques offer a potential solution to these issues because knockouts result in complete loss of the protein kinase (100% inhibition), yet cellular compensation often confounds the analysis of highly adaptive pathways2, 4. Thus, although typical small-molecule kinase inhibitors allow rapid and dose-dependent inhibition of kinases inside a cell, current technology does not provide a quantitative understanding of how the degree of kinase activity relates to cellular function. By addressing this problem, we seek to identify the consequences of partial kinase inhibition, with the hope of identifying the most pharmacologically sensitive nodes within a signaling cascade. To achieve this, it is critical to inhibit a single kinase and to quantitate the degree of inhibition achieved in vivo.
To develop an active site affinity probe capable of measuring inhibitor occupancy, we took design cues from the field of activity-based probes (ABPs). The widespread utility of ABPs demonstrates that irreversible inhibitors provide powerful platforms for measuring the fraction of an enzyme in its active versus its inactive conformation within cells and in cell lysates; ABPs for several protein and lipid kinases have been developed5, 6, 7, 8, 9. In particular, wortmannin (Compound 1), an irreversible inhibitor targeting the catalytic lysine in the phosphatidylinositol-3-OH kinase (PI(3)K) and PI(3)K-related families, has been derivatized with a biotin label; this wortmannin analog demonstrates kinase activity–dependent labeling of DNA-dependent protein kinase (DNA-PK)7. Following the design paradigm of ABPs that involves linking reporter groups to reactive substituents, we sought to design an irreversible, inhibitor-based probe incorporating a functional handle for signal readout. Whereas ABPs classically rely on the polyspecificity of a probe to target an enzyme family, we instead designed our affinity probe to label a single targeted kinase.
To this end, we developed a general chemical genetic strategy using both inhibitor design and protein kinase engineering. By introducing two rationally designed mutations into the kinase active site, we sensitized two cytoplasmic kinases and a receptor tyrosine kinase (Fyn, c-Src and EGFR) to irreversible chemical inhibitors and a fluorescent affinity probe. First, we mutated the active site 'gatekeeper' residue, which is often conserved as a large hydrophobic residue across all protein kinases, to a smaller glycine residue10, 11. The resulting expanded binding pocket acts as a selectivity element: it permits binding of kinase inhibitor analogs derivatized with bulky aryl substituents that are sterically occluded from wild-type active sites12. The second mutation, a cysteine residue that we introduced at the lip of the ATP binding site, serves as an irreversible anchor point for the kinase inhibitor. This facilitates quantitative measurement of active site occupancy by the inhibitor after cell lysis. Using the quinazoline scaffold of the irreversible EGFR inhibitor PD 168393 (Compound 2), which reacts with the targeted Cys797 (Cys773 in an alternative numbering scheme) in EGFR13, we designed analogs of PD 168393 that bind only kinases containing both a glycine gatekeeper residue and a cysteine residue at the position corresponding to Cys797 in EGFR. To reveal the relative contribution of the dual specificity-determining mutations to the inhibitor binding mode, we solved several cocrystal structures of irreversible inhibitors bound to both EGFR kinase and a second kinase—c-Src kinase containing the electrophile-targeted cysteine mutation. These cocrystal structures of inhibitors irreversibly bound to protein kinases afford us the opportunity to contrast the mode of irreversible inhibitor binding with the wealth of data available from reversible kinase inhibitors. These structures reveal a surprising plasticity in the quinazoline core's binding to the two different kinases. Based on these structures, we synthesized an irreversible fluorescent affinity probe designed to covalently label an engineered EGFR mutant; we used the probe to measure the fraction of EGFR active sites available for signaling after inhibitor treatment. Subsequent measurement of the phosphorylation of EGFR signaling partners allowed for the correlation of cellular receptor activity with downstream outputs. This chemical genetic strategy enables measurement of the contribution of a kinase to a downstream signaling event.
Results
Design of irreversible inhibitors
Highly specific irreversible kinase inhibitors have been developed against both the EGFR kinase and ribosomal S6 kinase (RSK) families through both optimization of reversible binding and targeting of rare cysteine residues in these two families13, 14, 15. These examples served as templates for our chemical genetic strategy. Specifically, we chose as the starting point for inhibitor design the 6-acrylamide–substituted 4-anilinoquinazoline PD 168393, a potent and specific irreversible inhibitor of EGFR and human EGF receptor 2 (HER-2) that reacts with a critical cysteine (Cys797) in the ATP binding site of EGFR (Fig. 1a)13. Despite the high sequence homology of kinase active sites, a cysteine in a position analogous to Cys797 in EGFR is rare among protein kinases (Fig. 1b)14. A crystal structure of EGFR in complex with erlotinib (Compound 3; Fig. 2a), a reversible 4-anilinoquinazoline similar to PD 168393, shows that the C4 aryl substituent of erlotinib points into a hydrophobic pocket adjacent to Thr790, the gatekeeper residue in EGFR16, 17. This binding mode suggests that the 4-anilinoquinazoline might afford an allele-specific kinase inhibitor scaffold. We sought to determine whether, by introducing a cysteine residue at the position corresponding to Cys797 of EGFR into a kinase of choice, we could engineer a second, covalently reactive specificity-determining element.
Figure 1: A chemical genetic strategy sensitizes kinases to irreversible inhibitors that do not inhibit wild-type kinases.
(a) Features of EGFR-selective irreversible inhibitor PD 168393 (Compound 2) are shown13. H
, hydrophobic. Analysis of kinase cocrystal structures in complex with quinazoline inhibitors suggests that inhibitor derivatives of PD 168393 enlarged at C4 cannot bind to wild-type (wt) kinases because of their increased steric bulk. A double-mutant kinase is sensitized to these inhibitor analogs because of two structural selectivity elements: a smaller gatekeeper and an engineered cysteine analogous to the one targeted in EGFR. (b) A partial structure-based sequence alignment of several protein kinases with EGFR11. Residue Thr790 of EGFR is the gatekeeper residue, and Cys797 in EGFR13 is targeted for alkylation by PD 168393. A cysteine in this position is conserved among three of the four EGFR-family kinases (conserved cysteines colored in red).
Figure 2: The screening of a panel of C4-derivatized PD 168393 analogs reveals potent, selective inhibitors for an engineered double mutant of c-Src kinase.
(a) Structures of irreversible inhibitors (Compound 2, Compound 4, Compound 5, Compound 6, Compound 7, Compound 8, Compound 9, Compound 10, Compound 11, Compound 12) and a reversible inhibitor (Compound 13) are shown. Erlotinib (Compound 3), a reversible EGFR inhibitor instrumental in the structure-guided design reported here, is shown for reference. Detailed synthetic methods are described and illustrated in the Supplementary Methods. (b) IC50 values (in
M) for a panel of inhibitors against c-Src, c-Src-as1, c-Src-cys and c-Src-dm. Inhibitor Compound 5 (highlighted in red text) demonstrates the best balance of potency and selectivity for c-Src-dm.
Synthesis of a panel of inhibitors
We synthesized a small panel of inhibitors using a method previously described for the synthesis of PD 168393 (Fig. 2a and Supplementary Methods online)18. The panel included 11 analogs (Compound 2, Compound 4, Compound 5, Compound 6, Compound 7, Compound 8, Compound 9, Compound 10, Compound 11, Compound 12, Compound 13) with C4 substituents of varying steric demand intended to orient into the space occupied by the gatekeeper residue. Detailed synthetic procedures are described in Supplementary Methods.
Mutant c-Src generation and irreversible inhibitor screening
Cytoplasmic tyrosine kinases seemed like a tractable target for evaluating the feasibility and extensibility of our chemical genetic design, given their extensive kinetic characterization, their recently reported high-level heterologous expression in bacteria and their proven amenability to analog-sensitive chemical genetics. Therefore, we introduced cysteine mutations into the cytoplasmic tyrosine kinase c-Src. We previously identified c-Src-as1 (T338G) as an analog-sensitive kinase (an "as-allele")12, and alignment of chicken c-Src with EGFR (Fig. 1b) identified Ser345 as the candidate for cysteine mutation11. Given that cysteine is isosteric with serine, we hypothesized that this conservative serine-to-cysteine mutation would be functionally silent. We generated a c-Src double mutant (c-Src-dm) that contained T338G and S345C. To investigate the contribution of the cysteine residue alone, we also generated the single cysteine mutant c-Src S345C (c-Src-cys). We expressed the c-Src kinases using a recently reported high-yielding bacterial expression system for tyrosine kinases19.
Profiling the panel of inhibitors (Compound 2, Compound 4, Compound 5, Compound 6, Compound 7, Compound 8, Compound 9, Compound 10, Compound 11, Compound 12, Compound 13) against the four c-Src variants revealed substantial structure-activity relationship (SAR) trends (Fig. 2b). Though the quinazoline scaffold has been elaborated to give potent, highly selective reversible c-Src inhibitors20, most of the inhibitors described here showed no inhibitory activity against wild-type c-Src at a concentration of 27
M (Fig. 2b), which suggests that the quinazoline scaffold alone does not bind c-Src preferentially and that larger C4 modifications occlude the inhibitors from the wild-type kinase, as anticipated. Additionally, each inhibitor is substantially more potent against the analog-sensitive mutants c-Src-as1 and c-Src-dm than against wild-type c-Src, which suggests that the gatekeeper residue is a specificity-determining element in the context of C4-derivatized 6-acrylamido-4-anilinoquinazolines. Inhibitors bearing a 6-acrylamide substituent show increased potency for kinases with a targeted cysteine relative to kinases without one, which suggests that irreversible binding occurs, thereby boosting inhibitor potency. Notably, the significant increase in potency of the irreversible inhibitors for c-Src-cys versus c-Src suggests that covalent binding is possible even when the reversible binding mode is non-optimal. Also of note, the reversible inhibitor Compound 13 does not distinguish between cysteine-containing and non-cysteine-containing c-Src variants. Collectively, these results affirm our rational structure-based design of this inhibitor series.
Within this series, inhibitor Compound 5 stands out as the optimal rationally designed inhibitor. A handful of endogenous kinases have a targeted cysteine (for example, EGFR-family kinases), and an important element in the design of covalent inhibitors is to limit potency against these potential native kinases; compound Compound 5 has the least potency against c-Src-cys (half-maximal inhibitory concentration (IC50) = 5.29
M). Furthermore, Compound 5 is a potent inhibitor of the engineered double mutant, with an IC50 value of 5 nM. These data suggest that Compound 5, owing to its C4 p-t-butylaniline moiety, is especially sensitive to the gatekeeper mutation in the context of a properly positioned cysteine; this inhibitor demonstrates the best balance of selectivity and potency. Screening these inhibitors against Fyn variants, we found similar SAR data for the tyrosine kinase Fyn-dm, which suggests that this chemical genetic strategy may be extensible to other tyrosine kinases (Supplementary Table 1 online).
Mutant c-Src adducts suggest Cys345-targeted inhibition
Mass spectrometry (ESI-oa-TOF) measurements of irreversible inhibitor-treated c-Src mutants were consistent with irreversible inhibition. The single-cysteine mutant, c-Src-cys, treated with Compound 4 had a mass 291 Da higher than that of the control-treated kinase, a result consistent with a 1:1 drug–protein complex (Fig. 3). Similarly, the respective masses of c-Src-dm treated with Compound 4 and Compound 5 both correspond to 1:1 adducts. We observed
30% conversion of c-Src-cys to its inhibitor-kinase adduct when treated with Compound 5, which is consistent with the lower potency of Compound 5 against kinases with a large gatekeeper. Our attempts to confirm specific alkylation of Cys345 in c-Src-cys and c-Src-dm using proteolysis and tandem mass spectrometry failed, presumably because of poor ionization of the peptide adduct (data not shown). We did not observe alkylation of wild-type c-Src treated with either Compound 4 or Compound 5 (Fig. 3), which suggests that irreversible inhibition requires the presence of Cys345.
Figure 3: Deconvoluted mass spectra of c-Src variants treated with irreversible inhibitors suggest covalent inhibitor binding to kinase active sites bearing a properly positioned cysteine.
ESI-oa-TOF of control and drug-treated c-Src kinases are shown. Note that c-Src-cys treated with Compound 5 led to only
30% conversion to the inhibitor-kinase adduct, whereas c-Src-cys and c-Src-dm treated with Compound 4 and c-Src-dm treated with Compound 5 resulted in stoichiometric adduct formation.
Complex structures of EGFR
To examine the binding mode of 6-acrylamido-4-anilinoquinazoline inhibitors, we solved the cocrystal structures of EGFR with inhibitors Compound 2 and Compound 4 bound within the EGFR active site. We expressed, purified and crystallized the kinase domain of human EGFR (amino acids 696–1,022) as previously described21; soaking these crystals overnight in a solution of either Compound 2 or Compound 4 afforded the cocrystal complexes. These crystals belong to space group I23 and show packing that is nearly identical to that of the crystals we recently described for reversible inhibitors bound to the EGFR kinase domain (Supplementary Table 2 online)21. We solved both complex structures (EGFR–Compound 2 and EGFR–Compound 4) by molecular replacement and refined them to 2.95 Å and 3.10 Å, respectively (Supplementary Table 2). The overall fold of these EGFR complexes is similar to those reported for the EGFR kinase domain in an active conformation either in complex with reversible inhibitors or without ligand. The conserved salt bridge between Glu762 and the catalytic Lys745, which aligns the helix C in an active conformation, remains intact, which confirms the active conformation of these complexes16, 21. Both structures show nearly continuous electron density along the entire length of the protein, though some density is disordered around the C-terminal tail; therefore, we did not model in the refinement residues 986 and 990–1,005 in the EGFR–Compound 4 complex, or 992–1,001 in EGFR–Compound 2.
The binding mode of EGFR–Compound 2 and EGFR–Compound 4
The structures of EGFR in complex with Compound 2 and Compound 4 demonstrate that the inhibitors are bound covalently to Cys797 in the active site and adopt an orientation similar to that found for several kinases in complex with reversible quinazoline inhibitors (Fig. 4 and Supplementary Fig. 1 online). Clear electron density of the inhibitors and Cys797 shows formation of a new bond between the
-carbon atom of the acrylamide Michael acceptor on each inhibitor and the
-sulfur atom of Cys797. These orientations within the EGFR active site are similar to those previously reported for reversible 4-anilinoquinazolines in complex with the kinases EGFR, CDK2, p38 MAPK, Aurora A and c-Src16, 20, 21, 22, 23, 24: both Compound 2 and Compound 4 form a hydrogen bond from the quinazoline N1 to the main chain amide of Met793 along the hinge region, and the C4 aniline points into the hydrophobic pocket defined by the gatekeeper, Thr790 (Fig. 4a and Supplementary Fig. 1). This key hydrogen bonding interaction provides substantial binding energy for reversible 4-anilinoquinazoline inhibitors; a carbon substitution of N1 results in
3,700-fold loss of inhibitor potency in EGFR25. Many kinase inhibitors form two hydrogen bonding interactions to the hinge region, and weak electron density may place a water molecule near the quinazoline N3 that represents a second, water-mediated hydrogen bond to either Thr845 or Thr790. If present, this is likely a minor interaction, as loss of this interaction was reported for the cocrystal complex of EGFR with gefitinib (Compound 14), a potent reversible quinazoline inhibitor21, 25. We found an interplanar angle between the C4 aniline and the quinazoline core of
45° for both inhibitors, and both aniline substituents were found to lie within 4 Å of the residue side chains comprising the hydrophobic selectivity pocket: Thr790, Leu788, Met766, Lys745, Glu762 and Thr854 (the m-bromine of Compound 2 lies 3.4 Å away from Thr790, and the phenyl ring of Compound 4 is 3.6 Å from Thr790).
Figure 4: Stereodiagrams for irreversible 6-acrylamido-4-anilinoquinazoline inhibitor 2 covalently bound to the ATP site of both EGFR and c-Src-cys show different binding modes for each kinase.
The experimental electron densities of EGFR and c-Src-cys at 2.95 Å and 2.48 Å resolution, respectively, are shown (2Fo – Fc map contoured at 1
). (a) Optimized EGFR inhibitor Compound 2 (PD 168393)13 (green ball and sticks) in complex with the EGFR kinase domain shows clear electron density between the targeted Cys797 and the acrylamide Michael acceptor on Compound 2. The inhibitor makes a direct hydrogen bond between its quinazoline N1 and the main chain amide of Met793, which is a common recognition motif among reversible anilinoquinazolines and several protein kinase domains16, 20, 21, 22, 23, 24. We find the cocrystal complex in an active conformation of EGFR: a conserved salt bridge found only in active EGFR conformations between the catalytic Lys745 and helix C Glu762 remains intact. The m-bromine group sits adjacent (within 3.4 Å) to the gatekeeper residue, Thr790. (b) Notably, structures of Compound 2 in complex with c-Src-cys show an inhibitor binding mode distinct from any reported for anilinoquinazolines with a protein kinase. In molecule B of the c-Src-cys–Compound 2 complex, Compound 2 (green ball and sticks) forms a water-mediated (W1) hydrogen bond via its N1 of the quinazoline core to the backbone amide of Met341.
Despite a similar overall binding mode, there are several differences between these structures and the previously published structure of the reversible inhibitor erlotinib in complex with EGFR16. In our EGFR cocrystal structures, inhibitors Compound 2 and Compound 4 bind further away from the hinge region, and the quinazoline core of Compound 2 and Compound 4 tilts
30° 'down' toward the C-terminal domain relative to erlotinib, pivoting about the hydrogen bond between N1 and the Met793 amide (Fig. 5a). These movements presumably allow covalent attachment between the inhibitor's acrylamide moiety and Cys797.
Figure 5: Comparison of the binding modes of 2 and 4 in EGFR and c-Src-cys with a canonical kinase-quinazoline binding mode reveals inhibitor movements necessary to accommodate covalent attachment.
The crystal structures reported here aligned along the hinge-region residues (red loop) of the EGFR-erlotinib cocrystal (PDB entry 1M17; ref. 16). Erlotinib is shown as thin white sticks, Compound 2 is shown as green sticks and Compound 4 is shown as orange sticks. The covalently modified cysteine atoms are drawn as yellow spheres. (a) EGFR–Compound 2 (green loop) and EGFR–Compound 4 (orange loop) complexes compared with erlotinib. In the top panel, the irreversibly bound inhibitors rotate roughly 30° downward about the N1–backbone amide (blue sphere, Met793) hydrogen bond to accommodate covalent attachment to Cys797. The bottom panel illustrates the significant overlap of the irreversibly bound quinazoline core to the reversible inhibitor and reveals the close proximity of the C4 aniline substituents to the gatekeeper residue. (b) c-Src-cys–Compound 2 (green loop) and c-Src-cys–Compound 4 (orange loop) complexes compared with erlotinib. The irreversibly bound inhibitors bind farther away from the hinge region than erlotinib; no direct hydrogen bonding contacts are made.
Full size image (27 KB)Complex structures of a target cysteine–containing c-Src mutant
To understand how covalent 6-acrylamido-4-anilinoquinazoline inhibitors exploit the cysteine specificity element in a kinase that does not naturally contain this cysteine, we solved cocrystal structures of a cysteine-containing chicken c-Src mutant with inhibitors Compound 2 and Compound 4. We carried out cocrystallization experiments by incubating the kinase domain of c-Src-cys (amino acids 251–533) with an excess of either Compound 2 or Compound 4 and removing insoluble material by centrifugation. The complex crystals (c-Src-cys–Compound 2 and c-Src-cys–Compound 4) formed by hanging-drop vapor diffusion and belong to space group P1, which is the same space group reported previously for the human c-Src kinase domain26. We solved both the complexes by molecular replacement with two monomers (A and B) of c-Src-cys in the crystallographic asymmetric unit and refined them to 2.48 Å and 2.50 Å, respectively (Supplementary Table 2). The overall fold of the chicken c-Src kinase domain mutant is similar to that of the wild-type human c-Src recently observed in an active conformation26, 27. The two molecules in the asymmetric unit differ slightly in areas of known flexibility and show weak or discontinuous main chain electron density for the N terminus (residues 251–254), the glycine-rich loop (residues 274–279), the helix C and connecting loop to
-strand 3 (residues 300–317) and the activation loop (residues 405–424) past the DFG motif. A detailed discussion of differences in the kinase conformation and inhibitor occupancy of the two molecules in the asymmetric unit is found in the Supplementary Note online.
The binding mode of c-Src-cys–Compound 2 and c-Src-cys–Compound 4
The cocrystals of c-Src-cys in complex with inhibitors Compound 2 and Compound 4 show covalent inhibitor binding to the kinase active site, and, as found in the EGFR–Compound 2 and EGFR–Compound 4 complexes, they reveal the inhibitors displaced from the predicted position based on reversible quinazoline binding18. As in the EGFR complexes described above, clear electron density defines covalent inhibitor binding to the engineered Cys345 (Fig. 4b and Supplementary Fig. 1). In contrast to the EGFR structures, however, the inhibitors in the c-Src-cys structures do not simply rotate with respect to the plane of the quinazoline but also make a substantial movement away from the hinge region with no direct hydrogen bonding contacts, and the C4 aniline substituents are coplanar with the quinazoline core (Figs. 4b and 5b; Supplementary Fig. 1). Furthermore, the individual conformations of the covalent c-Src-cys–inhibitor adducts (Cys345–Compound 2 and Cys345–Compound 4) differ with respect to rotation around the C1-C2 bond of the former Michael acceptor system (dihedral 20°), consequently placing the inhibitor core of Compound 2 even farther away from the hinge region, with the m-bromoaniline moiety stacked above Asp404 of the DFG motif and the side chain of the catalytic Lys295. In molecule B of c-Src-cys–Compound 2, we observed one key hydrogen bond between the ligand and the backbone of the adenine binding pocket. The contact occurs through a water molecule (W1) that mediates a hydrogen bonding interaction between N1 (2.5 Å) of the quinazoline core and the backbone amide of Met341 (2.9 Å) of the hinge region (Fig. 4b). The same water molecule is found as W1027 coordinated to the backbone amide of Met341 in c-Src when neither ligand nor ATP is present26, 28. A similar water-mediated contact of a ligand to a structurally analogous backbone amide has recently been described for a new class of triaminotriazine-based p38 MAP kinase inhibitors29.
Attempts to crystallize the double mutant c-Src-dm with the as-allele–selective inhibitor Compound 5 produced ambiguous results (data not shown). For detailed discussion see the Supplementary Note. A model of the potential selectivity mechanism afforded by a gatekeeper mutation is illustrated in Supplementary Figure 2 online.
Compound 5 inhibits analog-sensitive, but not wild-type, EGFR
To determine whether our rationally designed PD 168393 analogs were truly selective for kinases containing both specificity elements, we investigated whether Compound 2 or Compound 5 could inhibit EGFR—presumably the most stringent test, as wild-type EGFR contains the targeted cysteine residue and a threonine gatekeeper, and several reversible 4-anilinoquinazolines are highly potent EGFR inhibitors24, 30, 31, 32. Our in vitro screening data of irreversible inhibitors Compound 2, Compound 4 and Compound 5 tested against recombinant EGFR kinase affirm the conclusions from our c-Src-cys and Fyn-cys inhibition data: Compound 5 is only weakly potent (IC50 = 0.689
M) in the context of the targeted cysteine and a threonine gatekeeper (Supplementary Table 3 online). Likewise, the cellular potency of these inhibitors matches the in vitro inhibitor selectivity profiles. We transfected NIH-3T3 cells, which have undetectable concentrations of endogenous EGFR, with either EGFR or EGFR-as3, an analog-sensitive allele of EGFR33. In 3T3:EGFR cells treated with Compound 2 then stimulated with EGF (100 ng ml-
1), we observed, as expected, complete inhibition of EGF-stimulated tyrosine phosphorylation (Fig. 6). Under the same conditions, as-allele–selective inhibitor Compound 5 did not inhibit the phosphotyrosine signal. In 3T3:EGFR-as3 cells (the analog-sensitive allele background), both Compound 2 and Compound 5 completely inhibited EGF-stimulated tyrosine phosphorylation. These results are consistent with the in vitro inhibition data of Compound 5 against c-Src-dm (Fig. 2b), thus corroborating this inhibitor's selectivity for analog-sensitive kinase alleles with a targeted cysteine residue. Importantly, these data suggest that we can selectively target a designed double-mutant kinase with molecules such as Compound 5 in the presence of wild-type EGFR.
Figure 6: Reversal of tyrosine phosphorylation in cells demonstrates both the cell permeability and allele selectivity of two rationally designed 6-acrylamido-4-anilinoquinazoline inhibitors.
NIH-3T3 cells transfected with EGFR or EGFR-as3 (3T3:EGFR and 3T3:EGFR-as3, respectively) were incubated with inhibitors Compound 2 and Compound 5 overnight before receptor stimulation with EGF. Immunoblots of the SDS-PAGE–separated cell lysates are shown. As expected, Compound 2, an EGFR-selective inhibitor13, reversed all EGF-dependent phosphotyrosine (pTyr) signals near 170 kDa, the approximate mass of EGFR. This phosphotyrosine signal was insensitive to cell treatment with as-allele–selective inhibitor Compound 5 (left panel). Conversely, in the background of the analog-sensitive EGFR mutant 3T3:EGFR-as3, Compound 5 was equipotent with Compound 2, reversing the EGF-stimulated phosphotyrosine to background levels (right panel).
Full size image (18 KB)Design and synthesis of a fluorescent affinity probe
We used structural analysis of the complex structures of EGFR and c-Src-cys with irreversible inhibitors Compound 2 and Compound 4 to identify a suitable site for attachment of a fluorophore to the 4-anilinoquinazoline scaffold without disrupting inhibitor potency. The C7 position of the quinazoline points outside the ATP binding pocket toward solvent (Fig. 4) and has been successfully modified to include solubilizing elements for otherwise poorly soluble quinazoline inhibitors34. In addition, a recent report described a photoaffinity probe for EGFR (AX7593, Compound 15) that bears a tetramethylrhodamine fluorophore linked to a reversible 4-anilinoquinazoline inhibitor scaffold at the C7 position6. Based on these precedents, we functionalized the 6-acrylamido-4-anilinoquinazoline scaffold at C7 by tethering a fluorophore via a PEG linker, thereby forming the irreversible fluorescent probe Compound 16 (Fig. 7a and Supplementary Methods). To synthesize affinity probe Compound 16, we reacted an amine precursor with a fluorogenic benzofurazan reagent (NBD-Cl, Compound 17). Amine-linked NBD derivatives show environmentally sensitive fluorescence with both a hypsochromic shift of the emission spectra and an increase in fluorescence intensity upon binding to protein surfaces35, 36. Detailed synthetic procedures are described in Supplementary Methods. Affinity probe Compound 16 has good water solubility, and we anticipated it to be selective for EGFR-family kinases because of its m-bromoaniline substituent13.
Figure 7: EGFR-as3 activity in intact cells, as reported by an affinity probe for protein kinases, correlates with the downstream signals of EGFR-as3.
(a) Chemical structure of affinity probe Compound 16 based on the EGFR-selective inhibitor PD 168393 (ref. 13). An NBD fluorophore is linked via a PEG linker at C7 of the quinazoline scaffold. Detailed synthetic methods are described and illustrated in the Supplementary Methods. (b) The fluorescence gel imaging of a pulse-chase experiment in the 3T3:EGFR-as3 cell line using probe Compound 16 shows minimal background labeling of other proteins in the cell lysate. The fluorescent signal was dose dependent for only the band near 170 kDa, which suggests that Compound 16 retains specificity for EGFR-as3. (c) The EGF-dependent phosphorylation of Akt, Erk1 and Erk2, the downstream targets of EGFR, shows a proportional relationship with the fluorescence intensity for EGFR-as3. After fluorescence scanning, the gel was analyzed by immunoblotting for EGFR stimulation and downstream signaling with the appropriate antibodies. (d) Integrated intensities of the fluorescence signal of Compound 16 from the pulse-chase experiment described above. The pulse-chase experiments were conducted in duplicate, and the error bars represent the s.d. above the mean.
Full size image (39 KB)EGFR-as3 activity correlates with downstream signaling
To determine the requirement for EGFR-as3 activity in NIH-3T3 cells, we used affinity probe Compound 16 in a pulse-chase experiment involving receptor stimulation with EGF. After treating serum-starved 3T3:EGFR-as3 cells with Compound 5 for 90 min, we stimulated the cells with EGF (100 ng ml-
1) and then washed and treated the cells with 30
M Compound 16 for 25 min. Fluorescence imaging of the SDS-PAGE–separated lysates revealed minimal background labeling with a band near 170 kDa, which is consistent with the molecular weight of full-length EGFR-as3 (Fig. 7b). The signal was dose dependent with respect to the concentration of Compound 5 added to the cells, and we observed strong maximal signals for the DMSO-treated controls. Furthermore, this fluorescence tracks with the amount of phospho-EGFR at each inhibitor concentration, which suggests that in the case of EGFR, activation of this pathway correlates linearly with active site occupancy.
Next, we sought to determine the relationship between activation of EGFR and activation of its downstream effectors. Stimulation of EGFR results in activating phosphorylation events on both the PI(3)K and the Ras/MAPK signaling pathways within cells37. Immunoblotting for phospho-Akt (Ser473), phospho-Erk1 (Thr202) and phospho-Erk2 (Tyr204) (Fig. 7c) revealed activation proportional to both the phospho-EGFR signal and the fluorescence signal (the activity readout) (Fig. 7d). Thus, we find that the strength of a downstream signal is directly proportional to the percentage of available kinase active sites in the case of EGFR-as3.
Discussion
The success of our chemical genetic design rested on the significant structural information available for 4-anilinoquinazoline inhibitors in complex with several kinases16, 20, 21, 22, 23, 24. These cocrystal structures revealed that C4 anilines extend into the hydrophobic pocket surrounding the gatekeeper residue. To exploit this feature, we explored the chemical space of the 6-acrylamido-4-anilinoquinazoline scaffold and defined the SAR necessary to achieve the best selectivity for the rationally designed double mutants c-Src-dm and Fyn-dm. As predicted based on reversible quinazoline inhibitors, a bulky C4 aniline was necessary to target kinases c-Src-as1, c-Src-dm, Fyn-as1 and Fyn-dm. Additionally, inhibitors bearing a 6-acrylamide substituent proved significantly more potent; the cysteine residue acts as a selectivity filter in a manner similar to that of an endogenous cysteine used to achieve selectivity for halomethylketone-containing pyrrolopyrimidines in RSK-family kinases15. Our in vitro and cellular experiments validate the structure-based chemical genetic design, which suggests that we can rationally target a kinase of interest for irreversible inhibition.
The close proximity of the C4 aniline groups of Compound 2 and Compound 4 to the gatekeeper in EGFR, as seen in the complex structures we report here, provides a structural explanation for the as-allele selectivity of bulky inhibitors such as Compound 5. The EGFR–Compound 2 and EGFR–Compound 4 complexes show C4 aniline moieties oriented adjacent to the gatekeeper residue and several residues comprising the hydrophobic pocket. In this orientation, the pocket does not have enough volume to allow binding of significantly larger aniline substituents—an observation that underscores the importance of the gatekeeper's size in controlling inhibitor access to this pocket, which is essential for inhibitor potency. Upon mutation of the threonine gatekeeper to glycine, additional space permits larger aniline substituents to bind. Previously, we confirmed the basis of this selectivity mechanism in a cocrystal structure of inactive, full-length c-Src-as1 (T338G) in complex with N6-benzyl-ADP (Compound 18), an as-allele–selective ATP analog, and we found the benzyl ring to be within 3.5 Å of the glycine
-carbon38. Thus, for compounds such as Compound 5 that show greater selectivity for as-alleles, the scaffold binds deeper into the hydrophobic cleft made accessible by the glycine gatekeeper, thereby facilitating the interaction of the C4 aniline with the hydrophobic residues. The size and geometry of the C4 aniline is critical for the selectivity of these compounds, and the 6-acrylamide affords both increased potency and a chemical handle to the target cysteine–containing kinases.
Unexpectedly, the drug-sensitized c-Src structures in complex with PD 168393 analogs have an inhibitor binding mode that is different from predicted binding modes based on known reversible quinazolines. Crystal structures of reversible 4-anilinoquinazoline inhibitors bound to CDK2, p38 MAPK, Aurora A and EGFR kinases show similar binding modes, which suggests a common binding orientation of the scaffold among all kinases16, 21, 22, 23, 24; we find this inhibitor orientation in our EGFR–Compound 2 and EGFR–Compound 4 complexes. Here, the key hydrogen bond between the quinazoline N1 and the backbone amide is maintained with only minor scaffold movements to accommodate the covalent attachment. Modeling of both reversible and irreversible 4-anilinoquinazolines into EGFR places the quinazoline scaffold in a similar orientation18, 39, which our EGFR complex structures confirm. In addition, a recently reported structure of c-Src in complex with AZD0530 (Compound 19), a potent, highly selective 5,7-disubsituted 4-anilinoquinazoline reversible inhibitor of c-Src and Abl, showed the key hydrogen bond between N1 and the Met341 main chain amide20. Notably, our c-Src-cys structures reveal binding modes that do not make this conserved hydrogen bond but instead place the inhibitor away from the hinge region, toward the modified cysteine.
Based on these data, we believe that this class of irreversible inhibitor adopts two binding orientations within the kinase: a reversible binding mode necessary for inhibitor recognition and a shifted binding orientation necessary for covalent attachment. The SAR data from our panel of inhibitors suggest that inhibitor selectivity depends on essential contacts to the hinge region (for example, contacting the gatekeeper residue). In addition, the covalently modified EGFR and c-Src-cys structures show varying degrees of inhibitor mobility that enable covalent bond formation. Considering these results, we believe that perfect optimization of the reversible binding mode of the quinazoline scaffold is not an absolute requirement with irreversible inhibitors—the reversible interaction only needs to be strong enough to orient the acrylamide substituent in the vicinity of the target cysteine, where thermal motion allows covalent attachment between the inhibitor and the cysteine residue. The unexpected mobility of the inhibitor within the confines of the ATP binding pocket that we have observed is likely the basis for successful extension of this chemical genetic approach to irreversible kinase inhibition of three tyrosine kinases, despite the need for contacts with two disparate elements of the ATP binding site, which may vary from kinase to kinase.
The mobility of quinazoline-based irreversible inhibitors within kinase active sites, as suggested by our c-Src-cys structures, may explain the potency of irreversible inhibitors against emerging EGFR-driven cancers that are resistant to reversible anilinoquinazoline inhibitor therapy. Two reversible quinazoline-based drugs, erlotinib and gefitinib, show clinical activity in non-small-cell lung cancers driven by EGFR, yet patients develop drug resistance through selection for mutations in the drug binding site of EGFR40, 41, 42, 43. Particularly troublesome for treatment of these cancers is a methionine mutation at the EGFR gatekeeper (T790M) that confers resistance to inhibition by erlotinib and gefitinib; this often results in relapse in people who once responded to drug treatment42, 43. Roughly half of all relapse patients harbor T790M42, 43, and emerging evidence using highly sensitive sequencing techniques suggests that T790M EGFR mutants exist in minor, barely detectable populations in pretreated cancers, which implies that reversible inhibitor treatment selects for these resistant cells over time until the treatment cannot overcome a critical population of cancer cells44, 45. Notably, several irreversible anilinoquinazoline-based inhibitors of EGFR show potency in cell culture against cells harboring the T790M mutation42, 44, 46, 47, 48, which suggests that one mechanism for combating these resistant mutants is through irreversible inhibition, which we believe allows effective inhibition despite occlusion of the analogous reversible agents. In light of the recent discovery of minor populations of T790M EGFR conferring resistance, irreversible inhibitors may represent the best first-line treatments for non-small-cell lung cancers, or they may be effective in treating people who become resistant to the current first-line therapy. Future structural studies in the T790M background will likely identify the relevant inhibitor binding mode and may lead to optimized inhibitors that are able to combat drug resistance.
The irreversible inhibitors and fluorescent affinity probe Compound 16 presented here represent a new set of chemical genetic tools that enable the quantitative analysis of the functional consequences of irreversibly inhibiting protein kinase activity. Using the fluorescent reporting functionality of Compound 16, we demonstrate that there is a linear correspondence between inhibition of EGFR kinase activity and inhibition of its downstream effectors. Although our current data are not of sufficient resolution to determine the exact quantitative relationship between EGFR and downstream effector activation, this experiment demonstrates that it may now be possible to directly test hypotheses regarding the thresholds of kinase activity necessary for signal propagation in other pathways.
Methods
Generation and expression of c-Src mutants.
We used the following reagents: the chicken c-Src (residues 251-533) gene cloned into pSKB-3 (a pET-28a vector (Novagen) modified to yield an N-terminal, tobacco etch virus protease-cleavable (His)6 tag19), a plasmid containing tyrosine phosphatase YopH cloned into pCDFDuet-1 (Novagen)19, and a plasmid containing the chaperones GroEL and trigger factor cloned into pACYADuet-1 (Novagen). We used QuikChange mutagenesis (Stratagene) on the c-Src–containing plasmid to generate c-Src-as1 (T338G), c-Src-cys (S345C) and c-Src-dm (T338G S345C). We expressed and purified the c-Src mutants as previously described19, except we triply transformed the BL21 (DE3) Escherichia coli cells with c-Src, YopH, GroEL and trigger factor plasmids and additionally used chloramphenicol (34
g ml-
1) during expression for the selection of GroEL and trigger factor. The expression yielded 2–10 mg of purified kinase per liter of bacterial culture. The concentration of the c-Src mutants was determined by absorbance spectroscopy at 280 nm using the calculated extinction coefficient of 52,370 M-
1 cm-
1. We concentrated the purified c-Src mutants to 15–18 mg ml-
1 and used them directly for structural studies. We froze the remaining c-Src kinase in liquid nitrogen, storing the aliquots at -
80 °C for later use in biochemical assays.
In vitro kinase assays for c-Src variants.
We performed radioactive phosphotransfer assays in triplicate at various inhibitor concentrations as reported previously10. Details of the assay are described in Supplementary Methods.
Mass spectrometry of c-Src variants.
We used the purified kinase domains (residues 251-533) of three c-Src variants for MS experiments. To 5
g of c-Src, c-Src-cys or c-Src-dm in 10
l of buffer (50 mM Tris (pH 8.0), 100 mM NaCl, 5% (v/v) glycerol, 1 mM DTT) we added 3
l of DMSO, Compound 4 or Compound 5 (100
M in DMSO). We incubated this mixture (23
M:15
M inhibitor/kinase; 1.5:1 molar equivalents) on ice for 15 min and then diluted it into 100
l water. We analyzed aliquots (20
l) by mass spectrometry using an HTS PAL 12-plate autosampler (CTC Analytics) and an LCT Premier XE LC-ESI-oa-TOF mass spectrometer (Waters).
Generation and expression of EGFR for structural studies.
The expression and purification of EGFR was similar to methods we recently described21. We describe the detailed methods in Supplementary Methods.
Crystallization and data collection of EGFR–Compound 2 and EGFR–Compound 4.
We obtained crystals by the hanging-drop vapor diffusion method (precipitant: 1.2 M potassium sodium tartrate, 0.1 M HEPES (pH 7.5), 5 mM Tris(2-carboxyethyl)phosphine hydrochloride) and made complex crystals by soaking crystals overnight in a solution containing 400
M of either inhibitor Compound 2 or Compound 4. We collected diffraction data at Argonne National Laboratory Advanced Photon Source, beamline ID24, under a nitrogen gas stream at 100 K, using a wavelength of 0.9795 nm. We processed the data with HKL2000 and DENZO/SCALEPACK49.
Structure determination and refinement of EGFR–Compound 2 and EGFR–Compound 4.
We solved the EGFR by molecular replacement with starting coordinates from an isomorphous EGFR structure (Protein Data Bank (PDB) entry 1M17; ref. 16). We describe the details of the structure determination and refinement in Supplementary Methods. Detailed data and refinement statistics are given in Supplementary Table 2. We produced the figures using PyMOL50. Ramachandran statistics are shown in Supplementary Table 4 online.
Crystallization and data collection of c-Src-cys–Compound 2 and c-Src-cys–Compound 4.
We expressed and purified the chicken c-Src-cys variant as described above, and we describe the formation of the cocrystals in Supplementary Methods. We used a cryoprotectant of 20% glycerol before flash freezing the crystals in liquid nitrogen. We collected the diffraction data at the Berkeley Lab Advanced Light Source, beamlines 8.2.1 and 8.2.2, under a nitrogen gas stream at 100 K, using a wavelength of 1.0000 nm. We processed the data with DENZO and SCALEPACK49 using HKL2000.
Structure determination and refinement of c-Src-cys–Compound 2 and c-Src-cys–Compound 4.
We solved the chicken c-Src-cys (S345C) structures by molecular replacement with starting coordinates from the catalytic domain of human c-Src (PDB entry 1YOJ; ref. 26). Further determination and refinement details are available online (Supplementary Methods). We present detailed data and refinement statistics in Supplementary Table 2. We used PyMOL50 to produce the figures. Ramachandran statistics are shown in Supplementary Table 4.
Cell culture, retrovirus production and infection.
We previously described establishing 3T3:EGFR (ref. 33). Similarly, we transduced NIH-3T3 cells with EGFR-as3, an analog-sensitive allele of EGFR kinase; the construction of EGFR-as3 is described in the Supplementary Methods. We cultured the NIH-3T3 cells in DMEM supplemented with 10% FBS (FBS) and a solution of penicillin 'G' (100 units ml-
1) and streptomycin sulfate (100
g ml-
1) (PenStrep, University of California, San Francisco Cell Culture Facility). To produce ecotropic virus, we transfected EGFR-as3 (cloned into pWZLhygro vector) into Bosc23 cells. We selected pools of transduced rodent fibroblasts by adding hygromycin (Roche) at 48 h; this selection lasted 2 weeks using hygromycin 500
g ml-
1.
Reversal of tyrosine phosphorylation in 3T3:EGFR and 3T3:EGFR-as3 cells.
We incubated either EGFR-transduced cells or EGFR-as3–transduced cells with control (1.2% DMSO) or with drug (0.1
M, 1.0
M and 2.0
M inhibitors Compound 2 or Compound 5, 1.2% DMSO) and treated the cells with or without EGF (100 ng ml-
1, Roche) for 30 min before harvest. After 3 h of incubation, we lysed the cells using Upstate lysis buffer (Upstate), separated equal amounts of total protein using 4–12% SDS-PAGE and transferred the gel to nitrocellulose membranes. We immunoblotted the membranes with antibodies against phosphotyrosine (4G10, Upstate Biotechnology; 1:500) or EGFR (Ab5, NeoMarkers; 1:2,000) and detected the antibodies with horseradish peroxidase–conjugated sheep antibody to mouse (Amersham Pharmacia Biotech), followed by enhanced chemiluminescence (Amersham Pharmacia Biotech).
Measurement of EGFR-as3 activity and its downstream signals in vivo.
We serum-starved 3T3:EGFR-as3 cells overnight in DMEM supplemented with 0.5% FBS and PenStrep. The next day, we treated the cells with medium (DMEM plus 0.5% FBS and PenStrep) containing either 1% DMSO or 1% DMSO with three-fold dilutions of Compound 5 from 1
M to 37 nM. After a 1.5-h inhibitor treatment, we stimulated the cells for 10 min with EGF (100 ng ml-
1, Sigma), placed the dishes on ice, washed the cells with cold (0–4 °C, ice-bath-chilled) phosphate-buffered saline and incubated them with 30
M fluorescent affinity probe Compound 16 in cold phosphate-buffered saline for 25 min. We then lysed the cells, normalized the lysates for protein content and separated the lysates using SDS-PAGE. We scanned the gels on a flat-bed fluorescence imager (Typhoon, Molecular Dynamics) and then transferred them to nitrocellulose for immunoblot analysis. We measured the fluorescence intensity using ImageQuant software (Molecular Dynamics). We performed the cell treatments in duplicate. See Supplementary Methods for additional details.
Accession codes.
Protein Data Bank: coordinates and structure factors from this study have been deposited under accession codes 2J5E, 2J5F, 2HWO and 2HWP. Structures cited from previous studies include 1M17 and 1YOJ.
Note: Supplementary information and chemical compound information is available on the Nature Chemical Biology website.
Author contributions
J.A.B. and C.K. synthesized the panel of inhibitors, expressed the Fyn variants and measured the Fyn in vitro IC50 values. J.A.B. and D.R. expressed the c-Src variants, crystallized and measured the c-Src-cys cocrystals and measured the EGFR-as3 cellular activity. J.A.B. conducted the protein mass spectrometry and measured the c-Src and EGFR in vitro IC50 values. D.R. and H.R. synthesized and characterized probe Compound 16. D.R. solved the c-Src-cys complex structures. Q.W.F. established the 3T3:EGFR cell lines and conducted the cellular inhibition experiments with Compound 2 and Compound 5. C.H.Y. expressed, crystallized and solved the EGFR complex structures. J.A.B. prepared the manuscript, with help from and editing by all the co-authors. C.Z. conceptualized the initial chemical genetic design. W.A.W., M.J.E. and K.M.S. helped conceive of experiments.

