Epidermal growth factor receptor (EGFR) mutations typically occur in exons 18–21 and are established driver mutations in non-small cell lung cancer (NSCLC)1,2,3. Targeted therapies are approved for patients with ‘classical’ mutations and a small number of other mutations4,5,6. However, effective therapies have not been identified for additional EGFR mutations. Furthermore, the frequency and effects of atypical EGFR mutations on drug sensitivity are unknown1,3,7,8,9,10. Here we characterize the mutational landscape in 16,715 patients with EGFR-mutant NSCLC, and establish the structure–function relationship of EGFR mutations on drug sensitivity. We found that EGFR mutations can be separated into four distinct subgroups on the basis of sensitivity and structural changes that retrospectively predict patient outcomes following treatment with EGFR inhibitors better than traditional exon-based groups. Together, these data delineate a structure-based approach for defining functional groups of EGFR mutations that can effectively guide treatment and clinical trial choices for patients with EGFR-mutant NSCLC and suggest that a structure–function-based approach may improve the prediction of drug sensitivity to targeted therapies in oncogenes with diverse mutations.
Patients with classical EGFR mutations (L858R or exon 19 deletions (Ex19del)) show marked improvements in clinical outcomes when treated with first-, second- or third-generation tyrosine kinase inhibitors (TKIs)4,5,6,11. Other EGFR mutations in the kinase domain (exons 18−21) have also been established as oncogenic drivers of NSCLC8. Patients with atypical EGFR mutations show heterogeneous and reduced responses to EGFR inhibitors1,3,7,8,9,10, including osimertinib12. Atypical EGFR mutations with US Food and Drug Administration (FDA)-approved treatments are EGFR S768I, L861Q and G719X, for which afatinib was deemed effective on the basis of retrospective studies13,14,15, and the EGFR/MET bispecific antibody amivantamab for exon 20 insertions (Ex20ins)16. There are no clear established guidelines for EGFR TKI treatment for patients with atypical EGFR mutations without an FDA-approved TKI, often resulting in patients receiving chemotherapy. Clinical trial design and treatment of patients with atypical EGFR mutations often rely on mutated-exon location to predict treatment, although heterogeneity in drug sensitivity across a single exon has been observed1,8,17. Therefore, there is an unmet clinical need to establish a system for classifying EGFR mutations that is predictive of drug sensitivity and for more robust clinical trial design.
Clinical outcomes for atypical mutations
To characterize the molecular landscape of EGFR-mutant NSCLC, we used 5 patient databases with genomic profiling (Methods), representing 16,715 patients with EGFR-mutant NSCLC. There were 11,619 patients in whom primary and/or co-occurring mutations were recorded on a per-patient basis (Extended Data Fig. 1a). Among those patients, 67.1% had classical EGFR mutations (L858R and/or Ex19del with or without T790M); 30.8% had atypical EGFR mutations, including Ex20ins (9.1%), atypical mutations (12.6%), or a complex mutation including an atypical mutation (9.1%); and 2.2% had a classical mutation with T790M and an atypical mutation (Fig. 1a, Extended Data Fig. 1b). Atypical EGFR mutations (n = 7,199) occurred primarily in exons 18 (23.7%) and 20 (20.9% insertions and 19.2% point mutations; Fig. 1b). Prevalent hotspots for atypical mutations were the P-loop (L718–V726, 13.6%) and the C-terminal loop of the αC-helix (A767–G779, 29.4%, Fig. 1c).
To assess the effect of atypical EGFR mutations on patient outcomes, we determined the time to treatment failure18 (TTF) of patients with NSCLC containing classical or atypical EGFR mutations. When treated with an EGFR TKI, patients with atypical EGFR mutations had a shorter TTF compared with patients with classical EGFR mutations (Fig. 1d, hazard ratio (HR) = 1.8, P < 0.0001), even when patients with Ex20ins were excluded from the analysis (Fig. 1e, HR = 1.6, P < 0.0001) or when patients were stratified by mutation exon location (Fig. 1e, Extended Data Fig. 1c). When patients were stratified by TKI treatment, those with classical EGFR mutations had a longer TTF than those with atypical EGFR mutations when treated with first-generation (HR = 1.9, P = 0.0005) or third-generation TKIs (HR = 3.0, P < 0.0001) (Extended Data Fig. 1d, e). A similar trend was observed for second-generation TKIs; however, the difference was not statistically significant (HR = 1.7, P = 0.10) (Extended Data Fig. 1f). Validating these findings in the cBioPortal database, patients with atypical EGFR mutations had a shorter progression free interval19 and overall survival, irrespective of treatment (Extended Data Fig. 1g, h).
Structural groups predict drug response
We generated a panel of 76 cell lines expressing EGFR mutations spanning exons 18–21 and screened these cell lines against 18 EGFR inhibitors representing first- (non-covalent), second (covalent) and third- (covalent, T790M targeting) generation and Ex20ins-active TKIs (Supplementary Table 1). Using hierarchical clustering of in vitro selectivity over WT EGFR and mutational mapping of EGFR mutations, we observed four distinct subgroups of EGFR mutations: classical-like mutations that were distant from the ATP-binding pocket (Extended Data Fig. 2a), T790M-like mutations in the hydrophobic core (Extended Data Fig. 2b), insertions in the loop at the C-terminal end of the αC-helix in exon 20 (Ex20ins-L; Extended Data Fig. 2c), and mutations on the interior surface of the ATP-binding pocket or C-terminal end of the αC-helix, which were predicted to be P-loop and αC-helix compressing (PACC) (Fig. 2a, Extended Data Fig. 2d). Supervised heat maps of mutant/wild-type ratios by exon location (Extended Data Fig. 3a) and structure–function groups (Extended Data Fig. 3b) showed distinct differences, suggesting that structure–function-based groups better defined groups of mutations by drug sensitivity than exon-based classification. To test this hypothesis, we calculated the correlations of drug sensitivity and selectivity for each mutation to the predicted drug sensitivity by exon or structure–function groups (Extended Data Fig. 4a) and then compared the median rho value of each correlation for both groups. We found that structure–function-based groups were more predictive of mutation sensitivity than exon-based groups (P < 0.0001) (Fig. 2b). We used a secondary approach employing machine learning to analyse data by classification and regression trees (CART) and determine variable importance20,21 (Extended Data Fig. 4b). Structure–function-based groups had a higher variable importance than exon-based groups, suggesting that structure–function-based groups were more predictive of which mutational groups would be most sensitive to a particular drug compared with exon-based groups (P < 0.0001) (Fig. 2c). We validated these findings without T790M mutations (Extended Data Fig. 4c, d), and structure–function-based groups remained more predictive of mutation and drug sensitivity than exon-based groups (P = 0.0034 and P < 0.0001, respectively) (Extended Data Fig. 4e, f).
Classical-like, atypical EGFR mutations were predicted to have little effect on the overall structure of EGFR compared with wild-type EGFR (Extended Data Fig. 5a–d) and were sensitive and selective for all classes of EGFR TKIs, particularly third-generation TKIs, in vitro and in vivo (Extended Data Fig. 5e–g). Mutations and assigned groups are in Supplementary Table 4.
Exon 20 mutations are heterogenous
Studies showed that exon 20 mutations are heterogenous in their response to TKIs10,22. Insertions in the αC-helix (for example, an FQEA insertion at A763 (A763insFQEA)) were pan-sensitive to EGFR TKIs23,24, whereas those in the loop following the αC-helix (A767–C775) were not25,26, and the T790M mutation was sensitive to third- but not first- or second-generation TKIs. We found that most exon 20 point mutations were PACC mutations; that exon 20 insertions in the αC-helix were classical-like mutations; and, that the remainder of exon 20 insertions occurring in the C-terminal loop of the αC-helix were a distinct subgroup: exon 20 loop insertions (Ex20ins-L) (Fig. 2a). Ex20ins-L were sensitive only to select second-generation TKIs (that is, poziotinib and tarlox-TKI) and Ex20ins-active TKIs in vitro and in vivo (Fig. 2a, Extended Data Fig. 6a–c). However, even within Ex20ins-L mutations, some degree of heterogeneity in drug sensitivity was observed (Fig. 2a, Extended Data Fig. 6a). Using a panel of an additional 15 Ba/F3 cell lines expressing Ex20ins-L mutations spanning A767–V774, we found that Ex20ins-L mutations could be subdivided into two subgroups: near- and far-loop Ex20ins (Extended Data Fig. 6d). Exon 20 near-loop insertions (Ex20ins-NL) were more sensitive to second-generation and Ex20ins-active TKIs compared with exon 20 far-loop insertions (Ex20ins-FL) (P = 0.0025 and P = 0.027, respectively) (Extended Data Fig. 6e). These data exemplify that mutations within an exon are heterogenous and that an exon-based classification is unlikely to be optimal for guiding treatment decisions.
Drug repurposing for resistant mutations
Although all T790M-like mutants had at least one mutation in the hydrophobic core, there were two distinct subgroups of T790M-like mutants—third-generation TKI sensitive (T790M-like-3S) and third-generation TKI resistant (T790M-like-3R) (Extended Data Fig. 7a). Previous reports have shown that protein kinase C27 (PKC) and anaplastic lymphoma kinase28,29 (ALK) inhibitors exhibit off-target activity for EGFR mutations including T790M, and the non-covalent nature of these compounds predict that they retain activity in mutations that interrupt covalent binding. T790M-like-3S mutants had high selectivity for third-generation TKIs and some Ex20ins-active inhibitors and moderate selectivity for ALK and PKC inhibitors (Extended Data Fig. 7b). T790M-like-3R mutants, complex mutations comprising T790M and a known drug-resistance mutation (that is, C797S30, L718X31 or L792H18,31), were resistant to classical EGFR TKIs but retained selectivity for select ALK and PKC inhibitors such as brigatinib or midostaurin (Extended Data Fig. 7c). These data support expanding testing of ALK and/or PKC inhibitors or development of novel non-covalent inhibitors for the broader group of T790M-like-3R mutations.
Second-generation TKIs inhibit PACC mutations
PACC mutations comprise mutations spanning exons 18–21 including G719X, L747X, S768I, L792X and T854I and were predicted to alter the orientation of the P-loop or αC-helix (Extended Data Fig. 8a, b). In silico analysis of the interaction of osimertinib with PACC mutations G719S and L718Q predicted that changes in the orientation of the P-loop alter the position of TKI stabilization points tilting the indole ring of osimertinib away from the P-loop, destabilizing drug binding (Extended Data Fig. 8c, d). By contrast, second-generation TKIs do not interact with the P-loop of EGFR and maintain interaction points in the hydrophobic cleft (Extended Data Fig. 8d, e). When we compared the selectivity of EGFR TKIs for PACC mutations, we found that second-generation TKIs were significantly more selective for PACC mutations than any other TKI class (Fig. 3a). In vivo, we also observed that NSCLC patient-derived xenografts (PDXs) containing G719A mutations were resistant to the third-generation TKI osimertinib, but most sensitive to the second-generation TKI poziotinib (Fig. 3b, Extended Data Fig. 8f). Notably, a patient with a complex PACC mutation, E709K/G719S, saw clinical benefit and tumour shrinkage with afatinib treatment after progressing on osimertinib (Extended Data Fig. 8g).
Similarly, acquired PACC mutations co-occurring with primary classical EGFR mutations retained sensitivity to second-generation TKIs while acquiring resistance to third-generation TKIs in an allele-specific manner (Fig. 3c, d). In silico analysis of acquired PACC mutation, G796S, co-occurring with Ex19del was predicted to confer resistance to third-generation TKIs such as osimertinib by shifting the hinge region of the receptor, preventing stabilization of osimertinib at M793 and displacing the acrylamide group of osimertinib away from C797 (Extended Data Fig. 8h). However, second-generation TKIs were less affected by shifts in the hinge region of the receptor and were predicted to maintain the orientation of the acrylamide group (Extended Data Fig. 8f). Previous studies have reported that C797S mutations confer resistance to third-generation TKIs even without the presence of T790M30 (Fig. 3e). Similarly, C797S mutations without T790M conferred resistance to Ex20ins-active inhibitors, but not first- or second-generation TKIs unless T790M was present (Fig. 3e). Retrospectively, we identified three patients with NSCLC containing EGFR L858R mutations that received first-line osimertinib and subsequently developed an EGFR-dependent mechanism of resistance. In all patients, a PACC mutation was identified upon biopsy at progression (Extended Data Fig. 9a–c). Two patients acquired a L718V mutation, and one acquired two PACC mutations (V765L and C797S). All patients were treated with a second-generation TKI and experienced clinical benefits of stable disease and tumour shrinkage (Extended Data Fig. 9a–c). These data demonstrate that both primary and acquired PACC mutations are sensitive to second-generation TKIs, and structure–function-based groupings could identify a novel class of mutations, PACC mutations, for which second-generation TKIs had higher selectivity and efficacy than third-generation drugs.
Structure-based groups predict outcomes
To determine whether structure–function-based groups could identify patients who are most likely to benefit from a treatment better than exon-based groups, we used a publicly available database of outcomes for patients harbouring atypical EGFR mutations treated with afatinib32 and determined overall response rate (ORR) and duration of treatment (DOT). Structure–function-based grouping showed clear differences between sensitive (classical-like and PACC) and resistant (T790M-like and Ex20ins-L) subgroups (ORR of 63% versus 20%), whereas exon-based groups had less variation between groups (Extended Data Fig. 10a, b). Structure–function-based groups identified that patients with PACC mutations (n = 156) had a significantly longer DOT for afatinib than other structure-based groups (DOT: 17.1 months, P < 0.0001) (Fig. 4a, b). Using exon-based groups, we also identified that patients with exon 18 mutations (n = 87) had a longer DOT than patients with mutations in exons 19–21 (DOT: 17.4 months, P < 0.0001) (Fig 4b, Extended Data Fig. 10c); however, the structure-based approach identified nearly twice as many individuals who benefited from afatinib treatment.
To determine whether structure-based groups could identify which class of inhibitors would provide the most benefit to patients with atypical EGFR mutations, we performed retrospective analyses of TTF of patients with atypical EGFR mutations treated with EGFR TKIs in MD Anderson Cancer Center GEMINI and Moffitt Cancer Center databases, and TTF was determined for the first EGFR TKI for which patients were treated. Most patients (80%) were stage IV at diagnosis, and there were no statistical differences in patient characteristics (Supplementary Tables 5, 6). When stratified by structure–function-based groups, patients with PACC mutations treated with second-generation TKIs had a significantly longer TTF than patients treated with either first- or third-generation TKIs (21.7 months versus 10.0 months and 4.1 months, respectively; P < 0.0001, HR = 0.23) (Fig. 4c, d). By contrast, TTF was not significantly different between classes of EGFR TKIs in patients with non-PACC mutations (Fig. 4d, Extended Data Fig. 10d). Further, patients with PACC mutations had a longer TTF than patients with non-PACC mutations when treated with second-generation TKIs (21.7 months versus 10.0 months, respectively; HR = 2.6, P = 0.0068) (Fig. 4e). When patients were stratified by exon and TTF was calculated for first-, second- and third-generation TKIs, significant differences were observed only in patients with exon 18 mutations treated with second-generation TKIs compared with third-generation TKIs (20.9 months versus 5.5 months; P = 0.001, HR = 0.29) (Fig 4d, Extended Data Fig. 10e–h). Therefore, structure–function classification identified not only a larger subgroup of patients, but also a subgroup with greater benefit from second-generation TKIs than the exon-based classification.
The diversity and higher than previously appreciated prevalence of atypical EGFR mutations shown here highlights the necessity of comprehensive next-generation sequencing (NGS) for patients with NSCLC. We show that EGFR mutations, including atypical mutations, can be divided into four distinct subgroups based on structure and function (Fig. 5), and that structure–function-based groups can predict drug sensitivity and patient outcomes better than exon-based groups. While previous studies have shown activity of second-generation TKIs in patients with specific exon 18 mutations33,34, structure–function-based grouping identified a larger subgroup of EGFR mutations, PACC mutants, for which second-generation TKIs were more selective than third-generation TKIs. Clinically, second-generation TKIs have been associated with inhibition of wild-type EGFR and related adverse events15,35,36; however, most second-generation TKIs are dosed at the maximum tolerated doses, resulting in plasma concentrations 10–100 fold greater than concentrations necessary for inhibiting PACC mutations. Unlike osimertinib, second-generation TKIs have limited activity in the central nervous system, demonstrating the need for novel TKIs with reduced inhibition of wild-type EGFR inhibition and CNS activity that can inhibit PACC mutants.
These findings demonstrate that structure–function-based groups can identify classes of drugs that may be effective for whole groups of mutations, reflecting the observation that mutations in different regions of the gene may induce similar changes in protein structure. For example, L718Q, S768I, and T854I correspond to exons 18, 20, and 21, respectively, but are all PACC mutations with similar structural effects on drug binding. Conversely, mutations within the same exon may induce quite disparate changes. L747_K754del-insATSPE, L747P and E746-A750del mutations occur in exon 19 but are T790M-like, PACC, and classical mutations, respectively, with distinct drug sensitivity and structural effects. A clinical challenge for physicians treating patients with EGFR-mutant cancers is to appropriately identify and match patient mutations with the best EGFR TKI. While a separate classification for each individual mutation could provide more precision than the groups described here, validating the clinical activities of different drugs for each mutation is not feasible. The classification presented here provides a framework through which clinicians, informed by internet-based tools or companies providing NGS reports, could more effectively personalize EGFR TKI therapy. Finally, these findings support the notion that for cancers containing oncogenes with diverse mutations, adopting a structure–function-based approach may improve clinical trial design and drug development.
No sample size calculations were done to predetermine group sizes, and investigators were not blinded during randomization and outcome assessments.
Analysis of EGFR variants in MD Anderson Cancer Center GEMINI, Foundation Medicine, Guardant Health and cBioPortal databases
To analyse the numbers and frequencies of different EGFR mutations among patients with NSCLC in the MD Anderson Cancer Center GEMINI database, the database was queried for patients with EGFR mutations (n = 1,054) and manually curated as classical or atypical EGFR mutations. The MD Anderson Cancer Center GEMINI database is prospectively collected from patients consented and enrolled on protocol number PA13-0589 in accordance with the MD Anderson Institutional Review Board.
EGFR mutations were determined from formalin-fixed paraffin-embedded tumours or digital-droplet PCR for blood samples by CLIA-certified methods as previously described18,37. In brief, samples from MD Anderson Cancer Center were collected through molecular pathology and mutations were determined by next-generation sequencing panels of tumour tissue DNA (MD Anderson Cancer Center Molecular Diagnostics Laboratory). MD Anderson Molecular Diagnostics Laboratory is a tissue molecular profiling method for NGS-based analysis to detect mutations in hotspot regions of 50 genes, and in April 2016, it was expanded to analyse 134 unique genes for the detection of somatic mutations in coding sequences of 128 genes and selected copy number variations (amplifications) in 49 genes. Moffitt Cancer Center used diagnostic methods such as Clarinet (bi-directional sequencing of exons 18–21 of EGFR), Pyrosequencing of EGFR gene (exons 18–21), and Moffitt Illumina TruSight Tumor 26 (TST26). Moffitt Trusight is a NGS Illumina sequencing platform with a panel of 170 genes. Commercial NGS platforms including FoundationOne and Guardant360 were used by both MD Anderson and Moffitt Cancer Center as described below.
To identify patients with EGFR mutations in the Foundation Medicine database, patient samples taken between November 2011 and May 2020 previously subjected to hybrid-capture based comprehensive genomic profiling using formalin-fixed paraffin-embedded tissue or plasma using previously validated assays38,39, were analysed for EGFR mutations (n = 10,221). Patients were stratified by EGFR mutation, and EGFR mutations were manually curated as atypical or classical EGFR mutations. Classical EGFR mutations were defined as L858R point mutations, T790M mutations, and various exon 19 deletions including any deletion in exon 19 beginning at amino acid E746 or L747 and ending at amino acid A755. Deletions also including insertions were allowed and still considered classical exon 19 deletions. Atypical EGFR mutations were defined as non-synonymous mutations that were not defined as classical mutations. Patients with EGFR mutations where the sequence of the mutation was unknown were excluded from the analysis.
To determine the frequency of individual EGFR variants reported across the MD Anderson GEMINI database, cBioPortal, Foundation Medicine and the Guardant Health database, each database was analysed separately, and the average of all databases was determined. To determine the frequency of atypical mutations in the MD Anderson GEMINI and Foundation Medicine databases, atypical mutations were identified as described above and total number of known EGFR mutations across all patients was tabulated. For the analysis of cBioPortal, all non-overlapping studies were selected and exported. For overlapping studies, only the largest dataset was used, and all known EGFR mutations were tabulated. To determine the frequencies of EGFR variants from Guardant Health, a database of sequenced circulating free DNA (cfDNA), the Guardant360 clinical database was searched for NSCLC samples tested between November 2016 and November 2019 harbouring EGFR mutations (n = 5,026 patients). Guardant360 is a CLIA-certified, CAP/NYSDOH accredited comprehensive cfDNA NGS test that reports on SNVs, indels, fusions and SNVs in up to 73 genes. The Guardant360 clinical database, and the four datasets reported here, are enriched in North American patients with NSCLC; the frequency of atypical EGFR mutations may differ in Asia or other regions.
Analysis of TTF in MD Anderson Cancer Center GEMINI and Moffitt Cancer Center
To determine TTF after EGFR TKI treatment, patients with NSCLC harbouring an EGFR mutation in the tyrosine kinase domain (exons 18–22) were identified in the MD Anderson GEMINI and Moffitt Cancer Center databases. Data collection for Moffitt Cancer Center (MCC) patients was performed under the protocol (MCC 19161), which was formally reviewed and granted approval by MCC in accordance with the Declaration of Helsinki and the 21st Century Cures Act. Outcomes were recorded for patients for only first EGFR TKI. Patients were stratified by classical (L858R or Ex19del, as defined above) or atypical (non-classical). There were 333 patients with NSCLC identified in the MD Anderson GEMINI database who had tumours expressing atypical mutations. Of these patients, 88 patients received at least one line of EGFR TKI treatment. In addition, at Moffitt Cancer Center, there were 21 patients with NSCLC with tumours harbouring atypical EGFR mutations. Clinical parameters were extracted from the respective databases. Patients previously receiving chemotherapy were included, and TTF was calculated for the first EGFR TKI received. TTF was determined as previously described18 and defined as time from commencement of EGFR TKI to radiologic progression, TKI discontinuation, or death, and was not based on RECIST criteria. For patients treated beyond progression, radiologic progression was recorded as the end point, and data cut-off was May 2021. Median TTF was calculated using the Kaplan–Meier method. HR and P values were determined using GraphPad Prism software and two-sided Mantel–Cox log-rank tests.
Analysis of OS and PFI from cBioPortal Database
For overall survival (OS) and progression-free interval (PFI), analysis of patients in cBioportal was determined as previously described19 for patients receiving any treatment with survival information and qualifying EGFR mutation. This information was curated from cBioportal by selecting all non-overlapping studies of NSCLC. For overlapping studies, the largest database was selected. PFI and OS analysis were restricted to the tyrosine kinase domain. Median OS and median PFI were calculated using the Kaplan–Meier method. HR and P values were determined using GraphPad Prism software and two-sided Mantel–Cox log-rank tests.
Ba/F3 cell generation, drug screening and IC50 approximations
Ba/F3 cells were obtained as a gift from G. Mills (MD Anderson Cancer Center) and maintained in RPMI (Sigma) containing 10% FBS, 1% penicillin-streptomycin and 10 ng ml−1 recombinant mIL-3 (R&D Biosystems). To establish stable Ba/F3 cell lines, Ba/F3 cells were transduced with retroviruses containing mutant EGFR plasmids for 12–24 h. Retroviruses were generated using Lipofectamine 2000 (Invitrogen) transfections of Phoenix 293T-ampho cells (Orbigen) with pBabe-Puro based vectors listed in Supplementary Table 7. Vectors were generated by GeneScript or Bioinnovatise using parental vectors from Addgene listed in Supplementary Table 7. After 48–72 h of transduction, 2 µg ml−1 puromycin (Invitrogen) was added to Ba/F3 cell lines in complete RPMI. To select for EGFR-positive cell lines, cells were stained with PE-EGFR (Biolegend) and sorted by fluorescence-activated cell sorting. After sorting, EGFR-positive cells were maintained in RPMI containing 10% FBS, 1% penicillin-streptomycin, and 1 ng ml−1 EGF to support cell viability. Drug screening was performed as previously described22,36. Shortly, cells were plated in 384-well plates (Greiner Bio-One) at 2,000–3,000 cells per well in technical triplicate. Seven different concentrations of TKIs or DMSO vehicle were added to reach a final volume of 40 µl per well. After 72 h, 11 µl of Cell Titer Glo (Promega) was added to each well. Plates were incubated for a minimum of 10 min, and bioluminescence was determined using a FLUOstar OPTIMA plate reader (BMG LABTECH). Raw bioluminescence values were normalized to DMSO control-treated cells, and values were plotted in GraphPad Prism. Non-linear regressions were used to fit the normalized data with a variable slope, and IC50 values were determined by GraphPad prism by interpolation of concentrations at 50% inhibition. Drug screens were performed in technical triplicate on each plate and either duplicate or triplicate biological replicates. Mutant to WT ratios for each drug were calculated by dividing the IC50 values of mutant cell lines by the average IC50 value of Ba/F3 cells expressing WT EGFR supplemented with 10 ng ml−1 EGF for each drug. Statistical differences between groups were determined by one-way ANOVA as described in the figure legends.
In silico mutational mapping and docking experiments
X-ray structures of wild type EGFR in complex with AMP-PNP (2ITX) and osimertinib (4ZAU), and EGFR L858R mutant in complex with AMP-PNP (Protein Data Bank (PDB) ID: 2ITV) were retrieved from the Protein Data Bank. Molecular Operating Environment (2019.01; Chemical Computing Group CCCG) was used to generate mutant homology models, construct protein–ligand models and for visualization. Pymol was used for visualization of mutation location on WT EGFR (PDB ID: 2ITX) and structural alignment with EGFR D770insNPG (PDB ID: 4LRM) or EGFR G719S (PDB ID: 2ITN).
Heat map generation
Heat maps and hierarchical clustering were generated by plotting the median log (Mut/WT) value for each cell line and each drug using R and the ComplexHeatmap package40 2.6.2 (R Foundation for Statistical Computing). Hierarchical clustering was determined by Euclidean distance between Mut/WT ratios. For co-occurring mutations, mutation order was assigned arbitrarily, and for acquired mutations, mutations were assigned in the order mutations are observed clinically. Structure–function groups were assigned based on predicted impact of mutation on receptor conformation.
Determination of EGFR groups and subgroups
Mutational mapping was used to separate EGFR mutations into distinct groups based on predicted drug sensitivity. Structural features of EGFR mutations with known drug sensitivity (that is, classical EGFR mutations41,42, T790M43,44,45 and exon 20 insertions22,25) were used as the basis for predicting the impact of mutations on drug sensitivity. Using mutational mapping there were four distinct groups: (1) no obvious effect on the drug binding pocket (similar to L858R); (2) a mutation in the hydrophobic core (similar to T790M); (3) a large inward shift of both the αC-helix and P-loop (similar to exon 20 insertions); and (4) a slight inward shift of the αC-helix and/or P-loop due to direct changes to the either the αC-helix and/or P-loop or indirectly through alterations of the ß-pleated sheets that are predicted to effect the position either the αC-helix and/or P-loop. Groups were validated by hierarchical clustering of in vitro sensitivity of Ba/F3 cells expressing the various EGFR mutations. Subgroups such as T790M-like-3S/T790M-like-3R and Ex20ins-NL/Ex20ins-FL were defined based on cell line sensitivity data.
Statistical analyses of structure-function groups
Correlations for mutations were determined using Spearman’s rho by correlating the median log (Mut/WT) value for each mutation and drug versus the average of the median log (Mut/WT) value for the structure–function-based group or exon-based group for which the mutation belongs. For each correlation, the mutation tested was removed from the average structure function and exon-based groups. Average rho values were compared by two-sided Student’s t-test. To determine whether structure function groups or exon groups were better predictor of drug sensitivity, we performed recursive-partitioning analyses to construct a decision tree for each drug using structure function group and mutation data on exons 18, 19, 20, and 21 as predictors. The decision tree classified samples by posing a series of decision rules based on predictors. Each decision rule was constrained in an internal node, and every internal node points to yes-or-no questions that result in a ‘yes’ or ‘no’ branch. We applied the CART algorithm20,21 using the rpart R package. We calculated variable importance as the sum of the goodness of split measures for each split. These were scaled to sum to 100 for a tree. Median SAS version 9.4 and R version 3.5.6 were used to carry out the computations for all analyses. The structure function group variable was involved in the first and second splits in all of the 18 regression trees of drug sensitivity. The variable importance of this variable was in a range of 66–94%. Both the order of the split and variable importance indicate that the structure function group variable was more predictive than the exon-based variables in evaluation of drug sensitivity. Code for this analysis can be found at https://github.com/MD-Anderson-Bioinformatics/EGFR-Structure-Function-Nature-Manuscript.
PDX generation and in vivo experiments
As part of the MD Anderson Cancer Center Lung Cancer Moon Shots program, PDXs harbouring EGFR G719A and EGFR L858R/E709K were generated and maintained in accordance with Good Animal Practices and with approval from MD Anderson Cancer Center Institutional Animal Care and Use Committee on protocol number PA140276 as previously described46. Surgical samples were rinsed with serum-free RPMI supplemented with 1% penicillin-streptomycin then implanted into the right flank of 5- to 6-week-old NSG female mice within 2 h of resection. Tumours were validated for EGFR mutations by DNA fingerprinting and quantitative PCR as described46. PDXs harbouring EGFR S768dupSVD were purchased from Jackson Laboratories (J100672). To propagate tumours, 5- to 6-week-old female NSG mice (NOD.Cg-Prkdcscid IL2rgtmWjl/Szj) were purchased from Jackson Laboratories (005557). Fragments of NSCLC tumours expressing EGFR S768dupSVD, G719A or L858R/E709K were implanted into 6- to 8-week-old female NSG mice. Once tumours reached 2,000 mm3, they were collected and re-implanted into the right flank of 6- to 8-week-old female NSG mice. Tumours were measured 3 times per week and were randomized into treatment groups when tumors reached a volume of 275–325 mm3 for the EGFR G719A and S768dupSVD models, and 150–175 mm3 for the L858R/E709K model. Treatment groups included vehicle control (0.5% methylcellulose, 0.05% Tween-80 in dH2O), 100 mg kg−1 erlotinib, 20 mg kg−1 afatinib, 2.5 mg kg−1 poziotinib, 5 mg kg−1 osimertinib, and 25 mg kg−1 osimertinib. During treatment, body weight and tumour volumes were measured three times per week, and mice received treatment five days per week (Monday to Friday). Dosing holidays were given if mouse body weight decreased by more than 10% or overall body weight dropped below 20 g. Maximum allowed tumour burden by approved IACUC protocol was a volume of 2,000 mm3. Mice were humanely euthanized if tumour sizes exceeded the maximum size.
Case studies of patients treated with second-generation TKIs
Patients were consented under the GEMINI protocol (PA13-0589) which was approved in accordance with the MD Anderson Institutional Review Board, or protocol MCC 19161, which was formally reviewed and granted approval by Moffitt Cancer Center in accordance with the Declaration of Helsinki and the 21st Century Cures Act for retrospective analysis of patient outcomes and treatment course for case studies of patients presented. Both protocols include informed consent for publication of deidentified data.
Retrospective analysis of ORR and duration of treatment with afatinib
Response to afatinib and duration of afatinib treatment was tabulated from 803 patients in the Uncommon EGFR Database (www.uncommonegfrmutations.com). Objective response rate was reported in 529 patients. Patients were stratified by either structure–function-based groups or exon-based groups and ORR was determined by counting the number of patients reported to have complete response or partial response. Fisher’s exact test was used to determined statistical differences between subgroups (structure based or exon-based). Duration of treatment was provided in the Uncommon EGFR Database for 746 patients. Patients were stratified by structure–function-based groups and exon-based groups and median DOT was calculated using the Kaplan–Meier method. Statistical differences in Kaplan–Meier plots, HR and P values were generated using GraphPad Prism software and the Mantel–Cox log-rank method. When mutations were not explicitly stated (that is, exon 19 mutation) those patients were excluded from the structure–function-based analysis but included in the exon-based analysis.
Further information on research design is available in the Nature Research Reporting Summary linked to this paper.
Source data for all figures can be found at https://github.com/MD-Anderson-Bioinformatics/EGFR-Structure-Function-Nature-Manuscript. Public datasets used in this study include non-overlapping studies including NSCLC in cBioportal (www.cbioportal.org) and the Uncommon EGFR Database (www.uncommonegfrmutations.com). Details of specific studies accessed can be found in the Reporting Summary. Data from Foundation Medicine and Guardant Health were provided under data use agreements; however, summarized data used in Fig. 1 and Extended Data Fig. 1 are provided at https://github.com/MD-Anderson-Bioinformatics/EGFR-Structure-Function-Nature-Manuscript. Source data are provided with this paper.
All code used in this study can be found at https://github.com/MD-Anderson-Bioinformatics/EGFR-Structure-Function-Nature-Manuscript.
Russo, A. et al. Heterogeneous responses to epidermal growth factor receptor (EGFR) tyrosine kinase inhibitors (TKIs) in patients with uncommon EGFR mutations: new insights and future perspectives in this complex clinical scenario. Int. J. Mol. Sci. 20, 1431 (2019).
Kobayashi, Y. et al. EGFR exon 18 mutations in lung cancer: molecular predictors of augmented sensitivity to afatinib or neratinib as compared with first- or third-generation TKIs. Clin. Cancer Res. 21, 5305–5313 (2015).
Klughammer, B. et al. Examining treatment outcomes with erlotinib in patients with advanced non-small cell lung cancer whose tumors harbor uncommon EGFR mutations. J. Thorac. Oncol. 11, 545–555 (2016).
Rosell, R. et al. Erlotinib versus standard chemotherapy as first-line treatment for European patients with advanced EGFR mutation-positive non-small-cell lung cancer (EURTAC): a multicentre, open-label, randomised phase 3 trial. Lancet Oncol. 13, 239–246 (2012).
Sequist, L. V. et al. Phase III study of afatinib or cisplatin plus pemetrexed in patients with metastatic lung adenocarcinoma with EGFR mutations. J. Clin. Oncol. 31, 3327–3334, https://doi.org/10.1200/JCO.2012.44.2806 (2013).
Soria, J. C. et al. Osimertinib in untreated EGFR-mutated advanced non-small-cell lung cancer. N. Engl. J. Med. 378, 113–125 (2018).
Yoshikawa, S. et al. Structural basis for the altered drug sensitivities of non-small cell lung cancer-associated mutants of human epidermal growth factor receptor. Oncogene 32, 27–38 (2013).
Massarelli, E., Johnson, F. M., Erickson, H. S., Wistuba, I. I. & Papadimitrakopoulou, V. Uncommon epidermal growth factor receptor mutations in non-small cell lung cancer and their mechanisms of EGFR tyrosine kinase inhibitors sensitivity and resistance. Lung Cancer 80, 235–241 (2013).
Yasuda, H., Kobayashi, S. & Costa, D. B. EGFR exon 20 insertion mutations in non-small-cell lung cancer: preclinical data and clinical implications. Lancet Oncol. 13, e23–e31 (2012).
Kosaka, T. et al. Response heterogeneity of EGFR and HER2 exon 20 insertions to covalent EGFR and HER2 inhibitors. Cancer Res. 77, 2712–2721 (2017).
Ramalingam, S. S. et al. Overall survival with osimertinib in untreated, EGFR-mutated advanced NSCLC. N. Engl. J. Med. 382, 41–50 (2020).
Cho, J. H. et al. Osimertinib for patients with non-small-cell lung cancer harboring uncommon EGFR mutations: a multicenter, open-label, phase II trial (KCSG-LU15-09). J. Clin. Oncol. 38, 488–495 (2020).
Wu, Y. L. et al. Afatinib versus cisplatin plus gemcitabine for first-line treatment of Asian patients with advanced non-small-cell lung cancer harbouring EGFR mutations (LUX-Lung 6): an open-label, randomised phase 3 trial. Lancet Oncol. 15, 213–222 (2014).
Yang, J. C. et al. Clinical activity of afatinib in patients with advanced non-small-cell lung cancer harbouring uncommon EGFR mutations: a combined post-hoc analysis of LUX-Lung 2, LUX-Lung 3, and LUX-Lung 6. Lancet Oncol. 16, 830–838 (2015).
Yang, J. C. et al. Afatinib versus cisplatin-based chemotherapy for EGFR mutation-positive lung adenocarcinoma (LUX-Lung 3 and LUX-Lung 6): analysis of overall survival data from two randomised, phase 3 trials. Lancet Oncol. 16, 141–151 (2015).
Administration, U. S. F. a. D. (ed FDA) (2021).
Shi, J. et al. Uncommon EGFR mutations in a cohort of Chinese NSCLC patients and outcomes of first-line EGFR-TKIs and platinum-based chemotherapy. Chin. J. Cancer Res. 29, 543–552 (2017).
Le, X. et al. Landscape of EGFR -dependent and -independent resistance mechanisms to osimertinib and continuation therapy post-progression in EGFR-mutant NSCLC. Clin. Cancer Res. 24, 6195–6203 (2018).
Liu, J. et al. An integrated TCGA pan-cancer clinical data resource to drive high-quality survival outcome analytics. Cell 173, 400–416.e411 (2018).
Strobl, C., Malley, J. & Tutz, G. An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychol. Methods 14, 323–348 (2009).
Breiman, L. Classification and Regression Trees (Wadsworth International Group, 1984).
Robichaux, J. P. et al. Mechanisms and clinical activity of an EGFR and HER2 exon 20-selective kinase inhibitor in non-small cell lung cancer. Nat. Med. 24, 638–646 (2018).
Vasconcelos, P., Kobayashi, I. S., Kobayashi, S. S. & Costa, D. B. Preclinical characterization of mobocertinib highlights the putative therapeutic window of this novel EGFR inhibitor to EGFR exon 20 insertion mutations. JTO Clin. Res. Rep. 2, 100105 (2021).
Vasconcelos, P. et al. EGFR-A763_Y764insFQEA is a unique exon 20 insertion mutation that displays sensitivity to approved and in-development lung cancer EGFR tyrosine kinase inhibitors. JTO Clin. Res. Rep. 1, 100051 (2020).
Yasuda, H. et al. Structural, biochemical, and clinical characterization of epidermal growth factor receptor (EGFR) exon 20 insertion mutations in lung cancer. Sci. Transl. Med. 5, 216ra177 (2013).
Yang, M. et al. NSCLC harboring EGFR exon-20 insertions after the regulatory C-helix of kinase domain responds poorly to known EGFR inhibitors. Int. J. Cancer 139, 171–176 (2016).
Lee, H. J. et al. Noncovalent wild-type-sparing inhibitors of EGFR T790M. Cancer Discov. 3, 168–181 (2013).
Wang, X. et al. Lung adenocarcinoma harboring EGFR 19del/C797S/T790M triple mutations responds to brigatinib and anti-EGFR antibody combination therapy. J. Thorac. Oncol. 14, e85–e88 (2019).
Uchibori, K. et al. Brigatinib combined with anti-EGFR antibody overcomes osimertinib resistance in EGFR-mutated non-small-cell lung cancer. Nat. Commun. 8, 14768 (2017).
Thress, K. S. et al. Acquired EGFR C797S mutation mediates resistance to AZD9291 in non-small cell lung cancer harboring EGFR T790M. Nat. Med. 21, 560–562 (2015).
Ou, S. I. et al. Emergence of novel and dominant acquired EGFR solvent-front mutations at Gly796 (G796S/R) together with C797S/R and L792F/H mutations in one EGFR (L858R/T790M) NSCLC patient who progressed on osimertinib. Lung Cancer 108, 228–231 (2017).
Yang, J. C. et al. Afatinib for the treatment of NSCLC harboring uncommon EGFR mutations: a database of 693 cases. J. Thorac. Oncol. 15, 803–815 (2020).
Tamiya, M. et al. Successful treatment of an osimertinib-resistant lung adenocarcinoma with an exon 18 EGFR mutation (G719S) with afatinib plus bevacizumab. Invest. New Drugs 39, 232–236 (2021).
Ma, C. et al. Effective treatment with afatinib of lung adenocarcinoma with leptomeningeal metastasis harboring the exon 18 p.G719A mutation in the EGFR gene was detected in cerebrospinal fluid: a case report. Front. Oncol. 10, 1635 (2020).
Mok, T. S. et al. Improvement in overall survival in a randomized study that compared dacomitinib with gefitinib in patients with advanced non-small-cell lung cancer and EGFR-activating mutations. J. Clin. Oncol. 36, 2244–2250 (2018).
Robichaux, J. P. et al. Pan-cancer landscape and analysis of ERBB2 mutations identifies poziotinib as a clinically active inhibitor and enhancer of T-DM1 activity. Cancer Cell 36, 444–457.e447 (2019).
Liu, X. et al. Concurrent use of aspirin with osimertinib is associated with improved survival in advanced EGFR-mutant non-small cell lung cancer. Lung Cancer 149, 33–40 (2020).
Frampton, G. M. et al. Development and validation of a clinical cancer genomic profiling test based on massively parallel DNA sequencing. Nat. Biotechnol. 31, 1023–1031 (2013).
Clark, T. A. et al. Analytical validation of a hybrid capture-based next-generation sequencing clinical assay for genomic profiling of cell-free circulating tumor DNA. J. Mol. Diagn. 20, 686–702 (2018).
Galili, T., O’Callaghan, A., Sidi, J. & Sievert, C. heatmaply: an R package for creating interactive cluster heatmaps for online publishing. Bioinformatics 34, 1600–1602 (2018).
Tamirat, M. Z., Koivu, M., Elenius, K. & Johnson, M. S. Structural characterization of EGFR exon 19 deletion mutation using molecular dynamics simulation. PLoS ONE 14, e0222814 (2019).
Tsigelny, I. F. et al. Molecular determinants of drug-specific sensitivity for epidermal growth factor receptor (EGFR) exon 19 and 20 mutants in non-small cell lung cancer. Oncotarget 6, 6029–6039 (2015).
Yun, C. H. et al. The T790M mutation in EGFR kinase causes drug resistance by increasing the affinity for ATP. Proc. Natl Acad. Sci. USA 105, 2070–2075 (2008).
Finlay, M. R. et al. Discovery of a potent and selective EGFR inhibitor (AZD9291) of both sensitizing and T790M resistance mutations that spares the wild type form of the receptor. J. Med. Chem. 57, 8249–8267 (2014).
Cross, D. A. et al. AZD9291, an irreversible EGFR TKI, overcomes T790M-mediated resistance to EGFR inhibitors in lung cancer. Cancer Discov 4, 1046–1061 (2014).
Chen, Y. et al. Tumor characteristics associated with engraftment of patient-derived non-small cell lung cancer xenografts in immunocompromised mice. Cancer 125, 3738–3748 (2019).
We thank the patients, families and caregivers involved in the GEMINI protocol and the patient advocacy groups who have an essential role in patient care and research, including the Exon 20 Group. We also thank I. G. Munoz for critical review and editorial assistance. This work was supported by the generous philanthropic contributions to The University of Texas MD Anderson Lung Cancer Moon Shots Program and the MD Anderson Cancer Center Support Grant P30 CA016672 including animal housing and care in the Research Animal Support Facility (RASF). Additional support was contributed by NIH-R01CA247975, CPRIT-IIRA RP200150, NIH/NCI R01CA234183, NIH/NCI R01CA190628, Lung SPORE P50 CA070907-20, the David Bruton Jr Endowment, the Rexanna Foundation for Fighting Lung Cancer, 1U54CA224065-01, research support from Spectrum Pharmaceuticals (J.H.), the Hallman Fund, the Stading Fund for EGFR inhibitor resistance, the Gil and Dody Weaver Foundation, the Hanlon Fund, the Richardson Fund for EGFR mutant lung cancer research, and ASCO CDA-57112 (Y.Y.E.).
The research being reported in this publication is research in which The University of Texas MD Anderson Cancer Center has an institutional financial conflict of interest. Because MD Anderson is committed to the protection of human subjects and the effective management of its financial conflicts of interest in relation to its research activities, MD Anderson has implemented an Institutional Conflict of Interest Management and Monitoring Plan to manage and monitor the conflict of interest with respect to MD Anderson’s conduct of this research. MD Anderson, including J.P.R., M.B.N. and J.V.H. have filed a patent for the use of poziotinib for treating EGFR- and HER2-mutant cancers and licensed the technology to Spectrum Pharmaceuticals. J.P.R. and J.V.H. receive research support from Spectrum Pharmaceuticals, Takeda and Enliven Therapeutics. MD Anderson, including M.B.N., J.P.R. and J.V.H. have a pending patent submitted for treatment of EGFR TKI resistant NSCLC, and another, including J.P.R., S.H. and J.V.H., for the classification of EGFR mutations. J.P.R. and M.B.N. have no non-financial competing interests. J.V.H. also receives grant or research support from AstraZeneca and GlaxoSmithKline and has served on advisory committees for AstraZeneca, Boehringer Ingelheim, Bristol-Myers Squibb, Catalyst, EMD Serono, Foundation Medicine (FMI), Hengrui Therapeutics, Genentech, GlaxoSmithKline, Guardant Health, Eli Lilly, Merck, Novartis, Pfizer, Roche, Sanofi, Seattle Genetics, Spectrum Pharmaceuticals, Takeda. As non-financial competing interests, J.V.H. serves as scientific advisor for Rexanna Foundation and the EGFR resisters. X. Le receives consulting/advisory fees from EMD Serono (Merck KGaA), AstraZeneca, Spectrum Pharmaceuticals, Eli Lilly, Boehringer Ingelheim, Bristol-Myers Squibb, Novartis, Hengrui Therapeutics, Daiichi Sankyo, and Celgene, and research funding to the institution from Eli Lilly and Boehringer Ingelheim, and no non-financial competing interests. Y.Y.E. discloses research support from Spectrum Pharmaceuticals, AstraZeneca, Takeda, Eli Lilly, Xcovery, Turning Point Therapeutics; advisory role for AstraZeneca, Eli Lilly and Turning Point Therapeutics; and accommodation expenses from Eli Lilly, and no non-financial competing interests. J.Z. reports research funding and consulting fees from Bristol-Myers Squibb, AstraZeneca, Geneplus, OrigMed, Innovent, Merck, Johnson and Johnson, and no non-financial competing interests. F.S. receives consulting/advisory fees from Amgen, Navire Pharma, Intellisphere LLC; research funding to the institution from Amgen, Mirati Therapeutics, Boehringer Ingelheim, Merck&Co, Novartis, Pfizer; speaker’s fees from Bristol-Myers Squibb and RV Mais Promocao Eventos LTDS; other support from AstraZeneca Pharmaceuticals; travel expenses from Tango Therapeutics and reports past stock ownership of Moderna Inc and BioNTech SE. As non-financial competing interests F.S. serves as scientific advisor for Tango Therapeutics and has previously served as advisor for Calithera Biosciences. J.E.G. reports research support from AstraZeneca, Boehringer Ingelheim, Bristol-Myers Squibb, Genentech, G1 Therapeutics, Merck, Novartis, Pfizer, and the Lurdwig Institute of Cancer Research, and serves as a consultant/advisor for Daiichi Sankyo, Janssen, Novartis, Merck, Inivata, EMD Serono, Bristol-Myers Squibb, Blueprint Medicines, and AstraZeneca. H.T. reports research support from Bayer AS, Guardant Health, Ziopharm, and no non-financial interests. J.S. and V.R. are shareholders/full time employees of Guardant Health and report no non-financial interests. R.M. and A.B.S. are full time employees of Foundation medicine, a wholly-owned subsidiary of Roche, and have Roche stock ownership, with no non-financial competing interests. J.K.H. receives research support from OneOme, is a consultant for Quest Diagnostics and 23andMe, and has served on an advisory committee for Novartis, and has provided educational content for the American College of Clinical Pharmacy, Florida Pharmacy Association and HorizonCME. All other authors report no financial or non-financial competing interests.
Peer review information Nature thanks Daniel Costa, Helena Yu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data figures and tables
Extended Data Fig 1 Patients with atypical EGFR mutations have worse clinical outcomes than those with classical EGFR mutations.
a, Diagram of patient sample sources and types across databases. b, Lollipop plot of frequency of all EGFR mutations observed in patients with NSCLC (N=24,934 mutations). EGFR mutations associated with acquired drug resistance, as described by the literature, are highlighted in red. c–f, Kaplan-Meier plot of TTF of patients with NSCLC tumors harboring classical (N=264 patients) or atypical EGFR mutations stratified by (c) exon after treatment with an EGFR TKI (Exon 18 N= 40, Exon 19 N=19, Exon 20 N=15, Exon 21 N=26), or EGFR TKI class including 1st- (d), 2nd- (e), or 3rd- (f) gen TKIs in MD Anderson GEMINI and Moffitt Cancer center databases. Patients that received prior chemotherapy or immunotherapy were included, but TTF was calculated for first EGFR TKI received. g, h, Kaplan-Meier plot of PFI (g) and OS (h) of patients with NSCLC tumors harboring classical (N=50 for PFI and N=52 for OS) or atypical (N=35 for PFI and N=39 for OS) EGFR mutations from cBioPortal. Atypical EGFR mutations were limited to mutations in the tyrosine kinase domain, and treatment and stage were unknown. c–h, HRs and p-values were calculated using two-sided Mantel-Cox, Log-Rank tests
a–d, Mutational mapping of classical-like (a), T790M-like (b), exon 20 loop insertion (red/blue) and WT (grey/green) (c), and PACC mutants (d) onto WT EGFR crystal (PDB 2ITX).
Extended Data Fig 3 Heat maps generated through supervised clustering by structure-function-based groups cluster drug sensitivity better than exon-based groups.
a, b, Heat maps supervised clustering by exon-based (a) or structure-function-based (b) groups of log (Mutant/WT) ratios from Ba/F3 cells expressing indicated mutations after 72 h of indicated drug treatment. To determine the mutant/WT ratio, IC50 values for each drug and cell line were calculated and then compared to the average IC50 values of Ba/F3 cells expressing WT EGFR (+10ng/ml EGF to maintain viability). Squares are representative of the median of n=3 replicates. For co-occurring mutations, the order of mutations 1, 2, and 3 were assigned arbitrarily. Groups were assigned based on structural predictions
Extended Data Fig. 4 Structure-function-based groupings are more predictive of drug and mutation sensitivity compared to exon-based groupings.
a, Bar plot of Spearman rho values for indicated mutations compared to exon-based groups (yellow) or structure-function-based groups (green). The delta of the two rho values is shown as an overlapped grey bar. When the delta bar shifts to the right, the spearman rho value was higher for structure-function-based groups, and when the grey bar shifts to the left, the spearman rho value was higher for the exon-based groups. b, Representative classification and regression trees for each indicated drug. Colors represent drug sensitivity (green) or resistance (red) as defined by log (mutant IC50/WT EGFR IC50). c, Bar plot of Spearman rho values for indicated mutations (excluding T790M mutations) compared to exon-based groups (yellow) or structure-function-based groups (green). The delta of the two rho values is shown as an overlapped grey bar. d, Representative classification and regression trees for each indicated drug excluding T790M from the analysis. Colors represent drug sensitivity (green) or resistance (red) as defined by log (mutant IC50/WT EGFR IC50). e, Dot plot of rho values from Spearman correlations of mutations vs exon-based group averages or structure-function based averages for each drug excluding T790M mutations. Dots are representative of each mutation; bars are representative of the average rho value ± standard deviation (SD). p-value was determined using a paired two-sided t-test, and n = 59 cell lines/mutations. f, Dot plot of variable importance calculated as sum of the goodness of split for each split in the classification and regression trees (CART). Dots are representative of variable importance for each drug in the exon and structure-function-based groups as indicated and excluding T790M mutations. Bars are representative of the median + 95% confidence interval of variable importance for all drugs (Supplementary Table 3). p-value was determined using a paired two-sided t-test, and n = 18 drugs
Extended Data Fig. 5 Classical-like EGFR mutations are not predicted to alter the drug-binding pocket and are most sensitive to 3rd-gen EGFR TKIs.
a, b, Rendering of crystal structure of WT EGFR (PDB 2ITX) visualized as both a ribbon (a) and space filling (b) models. Residues important in receptor signaling and drug binding are highlighted. c, d, Overlapped rendering of WT crystal (grey) (c) and L861R (blue) and space filing model (d) of L861Q demonstrate the R861 substitution is distal from the drug binding pocket and has minimal impact on the overall structure of EGFR compared to WT. e, Dot plot of mutant/WT IC50 values of Ba/F3 cells expressing classical-like EGFR mutations and treated with indicated classes of EGFR TKIs. Dots are representative of average of n=3 replicate mutant/WT IC50 values of individual cell lines expressing classical-like mutations with individual drugs. Bars are representative of average mutant/WT IC50 values ± SEM for each class of EGFR TKI and all classical-like cell lines. p-values were determined by one-way ANOVA with unequal SD as determined by Brown-Forsythe test to determine differences in SD. Holm-Sidak’s multiple comparisons test was used to determine differences between groups. f, Tumor growth curves for PDXs harboring EGFR L858R E709K complex mutation treated with indicated inhibitors. Tumors were measured three times per week and symbols are average of tumor volumes ± SEM. Mice were randomized into six groups: vehicle (N=6), poziotinib 2.5mg/kg (N=7), erlotinib 100mg/kg (N=6), afatinib 20mg/kg (N=6), osimertinib 5mg/kg (N=6), and osimertinib 25mg/kg (N=6). Mice received drug 5 days per week, and mice were euthanized at day 28 to harvest tumors. g, Dot plot of percent change in tumor volume on day 28 of tumors described in f. Dots are representative of each tumor, and bars are representative of average ± SEM for each group. Statistical differences were determined by ordinary one-way ANOVA with post-hoc Tukey’s multiple comparisons test to determined differences between groups vehicle, N=6 mice, poziotinib 2.5mg/kg, N=7 mice, erlotinib 100mg/kg, N=6 mice, afatinib 20mg/kg, N=6 mice, osimertinib 5mg/kg, N=6, and osimertinib 25mg/kg, N=6 mice.
a, Dot plot of mutant/WT IC50 values of Ba/F3 cells expressing exon 20 loop insertion mutations and treated with indicated classes of EGFR TKIs. Dots are representative of average of n=3 replicate mutant/WT IC50 values of individual cell lines expressing exon 20 insertion mutations with individual drugs. Bars are representative of average mutant/WT IC50 values ± SEM for each class of EGFR TKI and all Ba/F3 cell lines. p-values were determined by one-way ANOVA with unequal SD as determined by Brown-Forsythe test to determine differences in SD. Holm-Sidak’s multiple comparisons test was used to determine differences between groups. b, Tumor growth curves for PDXs harboring EGFR S768dupSVD exon 20 insertion mutation treated with indicated inhibitors. Tumors were measured three times per week and symbols are average of tumor volumes ± SEM. Mice were randomized into four groups: vehicle (N=4), poziotinib 2.5mg/kg (N=5), osimertinib 5mg/kg (N=4), and osimertinib 25mg/kg (N=5) . Mice received drug 5 days per week, and mice were euthanized at day 21 to harvest tumors. c, Dot plot of percent change in tumor volume on day 21 of tumors described in panel c (vehicle, N=4 mice, poziotinib 2.5mg/kg, N=5 mice, osimertinib 5mg/kg, N=4 mice, and osimertinib 25mg/kg, N=5 mice). Dots are representative of each tumor, and bars are representative of average ± SEM for each group. Statistical differences were determined by ordinary one-way ANOVA with post-hoc Tukey’s multiple comparisons test to determined differences between groups. d, Heat map with unsupervised hierarchical clustering of log (Mutant/WT) ratios from Ba/F3 cells expressing indicated exon 20 loop insertions after 72 h of indicated drug treatment. Squares are representative of the median of n=3 replicates. e, Bar plot of average mutant/WT ratio of Ba/F3 cells expressing exon 20 loop insertions separated by mutational groups for indicated drug classes. Bars are representative of average mutant/WT ratio for all mutations and drugs in the indicated groups, dots are representative of average (n=3) mutant/WT ratio of individual mutations and drugs. Error bars are representative of SEM for each bar. Statistical differences between near- and far-loop mutants was determined by two-sided unpaired t-test with unequal variance
a, Heat map with unsupervised hierarchical clustering of log (mutant/WT) ratios from Ba/F3 cells expressing indicated mutations after 72 h of indicated drug treatment. Squares are representative of the median of n=3 replicates. For co-occurring mutations, the order of mutations 1, 2, and 3 were assigned arbitrarily. Groups were assigned based on hierarchal clustering and known resistance mutations. b, c, Dot plot of mutant/WT IC50 values of Ba/F3 cells expressing T790M-like-3S (3rd-gen TKI sensitive) (b) and T790M-like-3R (3rd-gen TKI resistant) (c) treated with indicated classes of EGFR TKIs. Dots are representative of average of n=3 replicate mutant/WT IC50 values of individual cell lines with individual drugs and drugs of each class are grouped together. Bars are representative of average mutant/WT IC50 values ± SEM for each class of EGFR TKI and all cell lines. p-values were determined by one-way ANOVA with unequal variance as determined by Brown-Forsythe test to determine differences in variance. Holm-Sidak’s multiple comparisons test was used to determine differences between groups
Extended Data Fig. 8 PACC mutations alter the orientation of the P-loop and/or αC-helix and are sensitive to 2nd-gen TKIs.
a, Overlap of G719S (PDB 2ITN, green) and WT EGFR (PDB 2ITX, grey) crystal structures demonstrate arobust shift of F723 (red arrow) in the P-loop orienting the benzyl ring in a downward position condensing the P-loop in the drug binding pocket. Further, G719S displays an inward shift of the αC-helix compared to the WT crystal structure. b, Surface representation of G719S (PDB 2ITN) with P-loop (red), αC-helix (blue), hinge region (orange), C797 (yellow), and DFG motif (green) highlighted to demonstrate steric hindrance of drug binding pocket caused by shifted P-loop. c, Comparison of osimertinib bound to wild-type EGFR (PDB 4ZAU, green) or EGFR G719S (PDB 2ITN, purple) demonstrates destabilization of TKI-protein interactions. d, In silico homology model of EGFR L718Q (pink) with predicted binding modes of osimertinib and poziotinib structures demonstrates that Q718 hinders the interaction of osimertinib (green) with M793 and shifts the Michael acceptor (reactive group, green arrow) out of alignment with C797 (yellow arrow). In contrast, poziotinib (blue) is less effected by Q718 and is still positioned to react with C797, even in the context of L718Q mutations. e, In silico modeling of EGFR G719S (purple) with poziotinib (blue) shows no predicted changes in poziotinib binding or TKI-protein interactions. f, Dot plot of percent change in tumor volume on day 28 of tumors described in Fig. 3c. Dots are representative of each tumor, and bars are representative of average ± SEM for each group, N=5 mice per treatment group. Statistical differences were determined by ordinary one-way ANOVA with post-hoc Tukey’s multiple comparisons test to determined differences between groups. g, Representative CT images from a patient harbouring E709K G719S complex mutation before and after four weeks of afatinib treatment. Red arrow indicates resolved pleural effusion on the left lobe and reduced pleural effusion and tumor in the right lobe. h, In silico modeling of EGFR Ex19del G796S (purple) with osimertinib in the reactive (blue) and predicted (orange) conformations demonstrate destabilization of TKI-protein interactions in the hinge region (yellow), displacing the reactive group (arrow). i, In silico modeling of EGFR Ex19del G796S (purple) with the reactive conformation of poziotinib (blue) and the predicted conformation of poziotinib (orange) predicted minimal changes in poziotinib binding and similar TKI-protein interactions.
Extended Data Fig. 9 2nd-gen TKIs confer durable clinical benefit in patients with acquired osimertinib-resistant NSCLC.
a, Representative CT images from a patient after 5.5 months of osimertinib and 6 months after afatinib treatment. Red arrow indicates lesion harboring L858R/L718V. b, CT scan of a patient after 17 months of osimertinib treatment showed new pleural lesion that tested positive for both EGFR L858R and L718V mutations (red arrow), and CT image of patient four weeks after beginning poziotinib treatment shows reduction in size of the pleural lesion (red arrow). Blue arrow indicates resolved pleural effusion. a, b, Schematic below CT images shows timeline of patient treatments and outcomes. c, Schematic representations of a patient treatments and outcomes that acquired two PACC mutations (V765L and C797S) after 18 months of osimertinib treatment. PR = partial response, PD = progressive disease, SD = stable disease, SRS = stereotactic radiosurgery Chemo/IO = carboplatin/ pemetrexed+ pembrolizumab.
Extended Data Fig. 10 Structure-function groups identify patients with greater benefit to 2nd-gen TKIs than exon-based groups.
a, b, Overall response rate to afatinib stratified by structure-function-based groups (N= 507: Classical-like N=91, T790M-like N=103, Ex20ins-L N=120, and PACC N=193) (a) or exon based groups (N= 528: Exon 18 N=133, Exon 19 N=22, Exon 20 N=294, Exon 21 N=79) (b). When mutations were not explicitly stated (N=21), those patients were excluded from the structure-function based analysis. c, Kaplan-Meier plot of duration of afatinib treatment of patients with NSCLC tumors harboring atypical EGFR mutations (N= 364 patients) stratified by exon-based groups. Exon 18 N=87, Exon 19 N=19, Exon 20 N=195, and Exon 21 N=63. d, Kaplan-Meier plot of TTF of patients with NSCLC harboring non-PACC atypical EGFR mutations (N= 56) treated with 1st- (N=25), 2nd- (N=13), or 3rd-gen (N=18) EGFR TKIs. Forrest plot comparing PACC and non-PACC mutants can be found in Fig. 4d. e–h, Kaplan-Meier plots of TTF of patients atypical EGFR mutations stratified by EGFR TKI class for exons 18 (N=40) (e), 19 (N=19) (f), 20 (N=24) (g), and 21 (N=26) (h). Forrest plot comparing HRs and p-values across exons can be found in Fig. 4d.
About this article
Cite this article
Robichaux, J.P., Le, X., Vijayan, R.S.K. et al. Structure-based classification predicts drug response in EGFR-mutant NSCLC. Nature 597, 732–737 (2021). https://doi.org/10.1038/s41586-021-03898-1