Structure–function analysis of oncogenic EGFR Kinase Domain Duplication reveals insights into activation and a potential approach for therapeutic targeting

Du, Zhenfang; Brown, Benjamin P.; Kim, Soyeon; Ferguson, Donna; Pavlick, Dean C.; Jayakumaran, Gowtham; Benayed, Ryma; Gallant, Jean-Nicolas; Zhang, Yun-Kai; Yan, Yingjun; Red-Brewer, Monica; Ali, Siraj M.; Schrock, Alexa B.; Zehir, Ahmet; Ladanyi, Marc; Smith, Adam W.; Meiler, Jens; Lovly, Christine M.

doi:10.1038/s41467-021-21613-6

Download PDF

Article
Open access
Published: 02 March 2021

Structure–function analysis of oncogenic EGFR Kinase Domain Duplication reveals insights into activation and a potential approach for therapeutic targeting

Zhenfang Du ORCID: orcid.org/0000-0001-6364-1321¹^na1,
Benjamin P. Brown^2,3,4^na1,
Soyeon Kim⁵,
Donna Ferguson⁶,
Dean C. Pavlick⁷,
Gowtham Jayakumaran⁶,
Ryma Benayed ORCID: orcid.org/0000-0002-3754-0321⁶,
Jean-Nicolas Gallant ORCID: orcid.org/0000-0003-4696-2013¹,
Yun-Kai Zhang¹,
Yingjun Yan¹,
Monica Red-Brewer¹,
Siraj M. Ali⁷,
Alexa B. Schrock⁷,
Ahmet Zehir ORCID: orcid.org/0000-0001-5406-4104⁶,
Marc Ladanyi⁶,
Adam W. Smith⁵,
Jens Meiler^3,4,8 &
…
Christine M. Lovly ORCID: orcid.org/0000-0002-3641-6361^1,9

Nature Communications volume 12, Article number: 1382 (2021) Cite this article

10k Accesses
29 Citations
26 Altmetric
Metrics details

Subjects

Abstract

Mechanistic understanding of oncogenic variants facilitates the development and optimization of treatment strategies. We recently identified in-frame, tandem duplication of EGFR exons 18 - 25, which causes EGFR Kinase Domain Duplication (EGFR-KDD). Here, we characterize the prevalence of ERBB family KDDs across multiple human cancers and evaluate the functional biochemistry of EGFR-KDD as it relates to pathogenesis and potential therapeutic intervention. We provide computational and experimental evidence that EGFR-KDD functions by forming asymmetric EGF-independent intra-molecular and EGF-dependent inter-molecular dimers. Time-resolved fluorescence microscopy and co-immunoprecipitation reveals EGFR-KDD can form ligand-dependent inter-molecular homo- and hetero-dimers/multimers. Furthermore, we show that inhibition of EGFR-KDD activity is maximally achieved by blocking both intra- and inter-molecular dimerization. Collectively, our findings define a previously unrecognized model of EGFR dimerization, providing important insights for the understanding of EGFR activation mechanisms and informing personalized treatment of patients with tumors harboring EGFR-KDD. Finally, we establish ERBB KDDs as recurrent oncogenic events in multiple cancers.

Truncated FGFR2 is a clinically actionable oncogene in multiple cancers

Article Open access 10 August 2022

Genetic fusions favor tumorigenesis through degron loss in oncogenes

Article Open access 18 November 2021

Qualitative differences in disease-associated MEK mutants reveal molecular signatures and aberrant signaling-crosstalk in cancer

Article Open access 13 July 2022

Introduction

Next generation sequencing (NGS)-based assays have demonstrated high utility as a diagnostic tool for multiple cancer types^1,2,3,4. Interpretation of tumor genomic test results is often complicated by discovery of “variants of unknown significance” (VUS), because insufficient evidence is available to confirm whether the variant is a driver (deleterious) mutation^5,6. Previously, we identified a VUS in EGFR that contains a tandem in-frame duplication of exons 18–25 in an index patient with metastatic lung adenocarcinoma. Since exons 18–25 encode the entire tyrosine kinase domain (TKD), we termed this variant “EGFR Kinase Domain Duplication” (EGFR-KDD)⁷.

The ability to effectively treat patients is rooted in our mechanistic understanding of genomic variants identified via sequencing. The classic example is BRAF mutations, which are detected in numerous tumors⁸. There are three classes of BRAF mutations, stratified by mechanism and therapeutic actionability^8,9. Generally, class I mutations, most notably V600E, are treated with a B-RAF inhibitor such as vemurafenib or dabrafenib, while class II and III mutations are insensitive to vemurafenib/dabrafenib^8,9. Thus, a primary goal in precision medicine is to identify and mechanistically characterize mutations and translate these findings into clinically actionable therapeutic strategies.

Regarding EGFR, mutations in the kinase domain involving small deletions in exon 19 or point mutation in exon 21 (L858R) have been well described¹⁰. These mutations increase receptor activation by stabilizing the active conformation of the kinase domain to promote dimerization¹¹. Numerous studies have now shown that patients with EGFR kinase domain mutations benefit from treatment with EGFR tyrosine kinase inhibitors (TKIs), whereas patients with tumors containing wild-type EGFR do not derive benefit¹⁰. Analogously, mutations in the EGFR extracellular domain (ECD) are detected in patients with glioblastoma but are significantly less sensitive to EGFR TKIs in vitro compared to the EGFR kinase domain mutations found in lung cancer¹², reinforcing the concept that not all mutations within a given gene can be therapeutically targeted in the same manner. In the case of EGFR-KDD, the entire gene contains wild-type sequence with an intragenic duplication of exons 18–25. The addition of a second kinase domain to the intracellular region of EGFR introduces a potentially significant structural perturbation. The functional and therapeutic implications of this variant remain uncertain. Moreover, the unique biology of this variant may make it a valuable tool in the study of ERBB family members and, more generally, suggests a strategy for the study of kinases.

In the present study, we evaluate the prevalence of KDD in ERBB family members (EGFR/EGFR, ERBB2/HER2, ERBB3/HER3, and ERBB4/HER4) across multiple types of human cancers in order to refine our understanding of KDD as an oncogenic driver. In addition, we combine structural modeling, biochemical assays, and experimental and computational biophysical analyses to understand the mechanism whereby EGFR-KDD aberrantly activates EGFR. Collectively, these complementary approaches suggest that EGFR-KDD is activated through the formation of ligand-independent intra-molecular dimers and signaling is amplified through ligand-dependent inter-molecular dimers/multimers. Furthermore, we show that inhibition of EGFR-KDD activity is maximally achieved by blocking both intra-molecular and inter-molecular dimerization. These studies have important implications for the treatment of patients whose tumor harbor EGFR-KDD.

Results

ERBB family KDDs are recurrent in multiple cancer types

To investigate the prevalence of KDD in all ERBB family members, we analyzed clinical NGS data from 237,701 tumor samples within the Foundation Medicine (FMI) database. In total, we identified 799 KDDs in ERBB family members (0.34%, 799/237,701). Of those 799 KDDs, EGFR accounts for 443 (55.4%), ERBB2 217 (27.2%), ERBB3 92 (11.5%), and ERBB4 47 (5.9%). Among the cancers present in the FMI database, ERBB-KDD was found most frequently in glioma (2.4%, 227/9381 total glioma cases), followed by upper gastrointestinal cancer (upper GI; 0.8%, 89/11,822) and non-small cell lung cancer (NSCLC; 0.2%, 109/48,699). For EGFR-KDD, glioma has the highest frequency (2.4%, 222/9381), followed by NSCLC (1.4%, 70/48,699) and GI (0.3%, 40/11,822) (Table 1a). We observed lower incidences of KDD in ERBB2, ERBB3 and ERBB4 than EGFR, with distributions mirroring those of other observed oncogenic mutations in brain tumors and NSCLC^{13,14,15,16,17} (Table 1a).

Table 1 Incidence of ErbB family kinase domain duplications (KDDs) across tumor types.

Full size table

We also analyzed 40,165 tumor samples from the Memorial Sloan Kettering Cancer Center (MSKCC) IMPACT database (MSK-IMPACT)¹⁸. These data confirm that KDD occurs most frequently in EGFR, followed by ERBB2 (Table 1b). EGFR-KDD is most prevalent in glioma and NSCLC, while ERBB2-KDD is most prevalent in breast and gynecological cancers (GYN). These distributions are consistent with the observed distributions of other EGFR oncogenic mutations in glioblastoma^13,15 and NSCLC^14,16 and other ERBB2 mutations in breast cancer¹⁹, supporting the notion that specific genes may be genomically altered through a variety of mechanisms in a given tumor context.

The overall frequency of ERBB-KDDs from the two datasets is between 0.58–2.4% in glioma, 0.07–0.22% in NSCLC, and 0.05–0.40% in breast cancer. Differences in detection between the two datasets are likely the result of the different methodologies employed for each dataset to identify KDDs (see the “Methods” section). Nevertheless, these data suggest that ERBB-KDD is a recurring oncogenic driver in tumor types known to be dependent on ERBB signaling (lung, breast, etc.).

EGFR-KDD is a constitutively active intra-molecular dimer

Even within a single driver gene, the type of mutation that occurs can influence prognosis and drug responsiveness. It is therefore critical to fully characterize the functional consequences of genomic variants in clinically relevant genes. To help us probe the biochemistry of the EGFR-KDD intra-molecular dimer, we leverage core principles of EGFR receptor biology.

ERBB family members are transmembrane tyrosine kinases that possess an extracellular ligand-binding domain, a single-pass transmembrane domain, a juxtamembrane (JM) region, an intracellular TKD, and a carboxy (C-) terminal tail with multiple tyrosine phosphorylation sites²⁰. Biochemical and crystallographic studies have shown that activation of EGFR-wild type (WT) involves ligand-induced asymmetric homo- or hetero- dimerization of two TKDs. In the presence of ligand, the C-lobe of one TKD (activator) contacts the N-lobe of another TKD (receiver) to relieve autoinhibition and activate the receiver TKD²¹. Previous studies of EGFR-WT have identified mutations at the inter-molecular dimer interface that can disrupt dimerization and prevent EGFR-WT enzymatic activity²¹.

EGFR-KDD is composed of two intact kinase domains⁷ (Fig. 1a). We hypothesized that the forced proximity of the two adjoined kinase domains could form a constitutively active intra-molecular asymmetric dimer in the absence of ligand. To test this hypothesis, we engineered EGFR-KDD constructs with putative intra-molecular dimer disruption mutants (for EGFR mutations, we utilized protein numbering of the human immature EGFR sequence that includes the 24-residue signal sequence) (Fig. 1a, b, Supplementary Table 1): V948R (C¹; C-lobe of TKD1) and I706Q (N¹; N-lobe of TKD1) in TKD1, and V1299R (C²; C-lobe of TKD2) and I1057Q (N²; N-lobe of TKD2) in TKD2. We also introduced catalytically inactivating mutations (kinase dead) into each TKD individually (D837N in TKD1 and D1188N in TKD2; Dead¹ and Dead², respectively) (Fig. 1b). We reasoned that these mutants would help us to determine: (1) if EGFR-KDD is catalytically active in the absence of ligand stimulation, (2) the relative orientation of the two intra-molecular kinase domains (i.e. activator vs. receiver), and (3) which of the kinase domains (or both) is catalytically active.

**Fig. 1: Mutations disrupting the potential intra-molecular dimer interface abrogate phosphorylation of EGFR-KDD and anchorage-independent growth.**

EGFR-KDD and the mutants described above were stably expressed in NR6²² (low endogenous EGFR expression) and YAMC (EGFR^−/−)²³ cells. We evaluated EGF ligand-independent phosphorylation at EGFR C-terminal tyrosine sites. Ligand-induced dimerization of EGFR-WT results in auto-phosphorylation of its C-terminal tyrosine residues, including Y992²⁴, Y1068²⁵, and Y1173²⁵ (Y1343, Y1419, and Y1524 for EGFR-KDD, respectively). For EGFR phosphorylation sites, we utilized protein numbering of mature EGFR sequence that does not include the 24-residue signal sequence (Supplementary Table 1). We observed that EGFR-KDD, but not EGFR-WT, displays phosphorylation of all three tyrosine residues in the absence of EGF ligands (Fig. 1c, lanes 2, 3), indicating that EGFR-KDD is catalytically active without ligand stimulation. We also found that the intra-molecular dimer interface mutants, C¹ and N² (Fig. 1c, lanes 6, 7; Supplementary Fig. 1a, lanes 6, 7), abolish phosphorylation at all three sites, while N¹ and C² mutants remain phosphorylated in YAMC and NR6 cells (Fig. 1c, lanes 4, 9; Supplementary Fig. 1a, lanes 4, 9), suggesting that the auto-activation of EGFR-KDD was disrupted by C¹ and N² mutants, rather than N¹ and C² mutants. These data suggest that the N-lobe-mutated TKD1 can activate the C-lobe-mutated TKD2, but not the reverse (Fig. 1a).

Our catalytically inactive EGFR-KDD TKD2 mutant (Dead²) failed to autophosphorylate all three tyrosine sites. In contrast, the Dead¹ mutant retained phosphorylation levels comparable to EGFR-KDD in both YAMC and NR6 cells (Fig. 1c, lanes 5, 8 and Supplementary Fig. 1a, lanes 5, 8). Therefore, in this intra-molecular dimer model, TKD2 functions as the enzymatically active receiver to TKD1, while TKD1 functions as activator to TKD2 (Fig. 1a).

We further sought to evaluate EGFR-KDD in a phenotypic assay. In both YAMC and NR6 cells, we observed robust colony growth in cells stably expressing EGFR-KDD (Fig. 1d, Supplementary Fig. 1b). We observed that there were comparable numbers of colonies in N¹ and C² mutants compared with EGFR-KDD, while significantly fewer colonies were observed in the intra-molecular dimer-disrupted C¹ and N² mutants (Fig. 1d, Supplementary Fig. 1b). We also found that Dead¹, but not Dead², could support anchorage-independent growth of YAMC (Fig. 1d) and NR6 (Supplementary Fig. 1b) cells. Therefore, our phenotypic data provide evidence that reduced phosphorylation in the C¹ and N² intra-molecular dimer-disrupted mutants diminish anchorage-independent growth. Taken together, these data are evidence that EGFR-KDD forms a catalytically active asymmetric intra-molecular dimer in the absence of EGF ligand.

Linker contributions to intra-molecular dimer stability

The juxtamembrane B (JMB) domain is an integral component of HER-family homo- and hetero-dimerization. The receiving kinase JMB domain forms specific stabilizing enthalpic contacts in the activator kinase C-lobe (e.g. the hydrophobic residues L688, V689, and L692, and multiple polar contacts)^26,27. Not surprisingly, the JMB residues are highly conserved in HER-family receptors (Fig. 2a). In EGFR-KDD, the TKD2 JMB is linked directly to the C-terminus of TKD1 (Fig. 2b). Thus, an important question remained as to whether constitutive EGF-independent activation of EGFR-KDD is the result of (A) sequence-specific structural perturbations to the JMB region, or (B) the sterically imposed forced proximity of TKD1 and TKD2. To address this question, we generated all-atom structural models of EGFR-KDD with Rosetta and molecular dynamics (MD) simulations (Supplementary Fig. 2a–c). For comparison, we also modeled the EGFR-WT homodimer.

**Fig. 2: The EGFR-KDD linker has distinct enthalpic and entropic contributions to intra-molecular dimer formation.**

We measured the per-residue root-mean-square-fluctuations (RMSF) of the linker residues in EGFR-KDD. Our modeling suggests that the linker region corresponding to the JMB is less flexible than the activator C-terminus region, particularly near the N-terminal portion of the JMB (Fig. 2c, d). Therefore, we hypothesized that the EGFR-KDD JMB forms enthalpically stabilizing contacts at the intra-molecular dimer interface.

To test this hypothesis, we replaced pieces of the linker with unstructured glycine-glycine-serine (GGS) repeats. We substituted (GGS)₃ for the JMB part of the linker (KDD-(GGS)₃) and (GGS)₆ for the activator C-terminus part of the linker (KDD-(GGS)₆) (Fig. 2b). Substitution with (GGS)_x exchanges sequence-specific contacts with a non-interacting, flexible sequence of matching length²⁸. We transiently transfected the mutants into HEK293 cells and measured EGF-independent receptor phosphorylation via Western blot analysis. KDD-(GGS)₃ displays decreased phosphorylation relative to EGFR-KDD, while KDD-(GGS)₆ retained similar levels of phosphorylation as EGFR-KDD (Fig. 2e, lanes 3–5). Importantly, KDD-(GGS)₃ retains increased activity compared to EGFR-WT (Fig. 2e, lanes 2, 4). Taken together, these data suggest that residues in the JMB portion of the linker contribute to the stability of the EGFR-KDD intra-molecular dimer.

Interestingly, the most stable EGFR-KDD linker model packs two leucine residues (L1038 and L1039) against helices αE and αI, corresponding structurally to residue V689 in EGFR-WT (Fig. 2f, Supplementary Figs. 2b, c, and 3a–d). EGFR-WT V689 has previously been shown to be necessary for EGFR-WT dimer-dependent phosphorylation²⁷. In agreement with these data, our equilibrated EGFR-WT homodimer preserves the V689 contact (Fig. 2f, Supplementary Fig. 3b). Because L1038 and L1039 were among the most stable residues in the model and correspond structurally to an EGFR-WT residue known to stabilize dimerization (V689), we hypothesized that mutation of these residues would impair EGFR-KDD EGF-independent intra-molecular dimer activity.

To test this hypothesis, we performed site-directed mutagenesis at residues L1038 and L1039. In support of this hypothesis, simultaneous introduction of L1038A/R and L1039A/R (KDD-LLAA and KDD-LLRR) resulted in a substantial reduction in phosphorylation (Fig. 2g, lanes 6, 9). Critically, however, KDD-(GGS)₃, KDD-LLAA, and KDD-LLRR all retain increased phosphorylation relative to EGFR-WT (Fig. 2e, lanes 2, 4; Fig. 2g, lanes 2, 6, 9). Individual point mutations L1038A/R and L1039A/R do not appreciably reduce phosphorylation; only the combined mutations reduce phosphorylation. Importantly, the sequential leucine residues in the linker are a unique feature of EGFR-KDD resulting from the domain fusion. Altogether, this suggests that despite sequence-dependent JMB contributions to stability, the forced proximity of TKD1 and TKD2 is sufficient for the formation of EGF-independent active intra-molecular dimers. Nevertheless, the linker sequence can provide additional enthalpic stabilization to increase activation.

Ligand induces inter-molecular multimer activity

EGFR-WT activation is achieved through ligand-induced inter-molecular dimerization²¹. Recent evidence demonstrates that EGFR-WT also forms tetramers and other small oligomers that increase phosphorylation in an EGF concentration-dependent manner^29,30,31,32. We wanted to know if EGFR-KDD activity is similarly augmented by EGF-ligand stimulation.

To differentiate between EGFR-KDD activity caused by EGF-dependent inter-molecular dimerization and EGF-independent intra-molecular dimerization, we utilized cetuximab, an anti-EGFR ECD antibody that blocks EGF-mediated EGFR dimerization³³. EGF binding leads to inter-molecular dimerization of EGF receptors. Cetuximab prevents EGF binding by blocking the EGF-binding site. We stimulated cells expressing various EGFR-KDD constructs with EGF. We found that phosphorylation of EGFR-KDD is dramatically increased in the presence of EGF stimulation (Fig. 3a, lanes 5, 6; Fig. 3b, lanes 5, 7; Supplementary Fig. 4a, lanes 5, 6). Addition of cetuximab effectively mitigates EGF-induced phosphorylation of EGFR-KDD (Fig. 3b, lanes 5–8, Supplementary Fig. 4b, lanes 9–12). These data suggest that EGF stimulation may promote EGFR-KDD activity through the formation of at least inter-molecular dimers; however, cetuximab does not preclude the formation of dimers entirely.

**Fig. 3: EGFR-KDD forms inter-molecular dimers and higher order oligomers after ligand stimulation.**

To further test the hypothesis that EGF stimulation promotes the formation of at least inter-molecular dimers in EGFR-KDD, we administered mAb806 to YAMC EGFR-KDD cells. The mAb806 antibody inhibits EGFR dimerization by binding to ECD II (residues 287–302)³⁴, rather than the EGF ligand-binding site in domain III³³. Thus, inhibition with mAb806 is highly complementary to similar experiments performed with cetuximab. As expected based on our cetuximab results, we found that mAb806 had no impact on phosphorylation level in the absence of EGF ligand (Supplementary Fig. 4c, lanes 1, 2, 5, 6) and decreased the level of phosphorylation with EGF-ligand stimulation (Supplementary Fig. 4d, lanes 3, 4, 7, 8). We also note that phosphorylation was reduced more by cetuximab than mAb806 at approximately equimolar concentrations, consistent with previous reports that the EGFR inhibitory potency of mAb806 is considerably lower than cetuximab³⁵.

We showed above (Fig. 1c, d) that intra-molecular dimer-disrupted mutants C¹ and N² are not active in the absence of ligand. Unexpectedly, we noticed that EGF-stimulation rescued these mutants, leading to a robust increase in phosphorylation (Fig. 3a, lanes 11–14; Supplementary Fig. 4a, lanes 11–14). We speculated that this could result from either (A) compensatory stabilization of the intra-molecular receiver kinase domains or (B) stabilization of the donor kinase domains during inter-molecular dimerization.

To better understand how inter-molecular dimerization increases EGFR-KDD autophosphorylation, we built template-based structural models of the intracellular portion of the EGFR-KDD inter-molecular dimer based on two proposed EGFR-WT tetramer models: (1) an extension of the inter-molecular dimer model in which each kinase domain is successively asymmetrically docked with another (end-to-end model)²⁹ (Fig. 3c), and (2) two asymmetric dimers oriented such that the N-lobe and C-lobe of one dimer are in contact with the N-lobe and C-lobe of the other dimer, respectively (side-by-side model)³¹ (Fig. 3d). Other models are possible (e.g. the receiver kinase of one intra-molecular dimer could act as the donor to the receiver kinase of a second intra-molecular dimer). There are currently no experimental structures (e.g. from X-ray crystallography or cryogenic electron microscopy) elucidating the organization of EGFR-WT tetramer or EGFR-KDD inter-molecular dimer. Thus, we built our template-based models of EGFR-KDD intracellular inter-molecular dimer on two published EGFR-WT tetramer models both of which have experimental and computational support.

Our models each consist of two EGFR-KDDs containing an intra-molecular donor (TKD1 or TKD3) and receiver (TKD2 or TKD4) kinase. Both structural models suggest a mechanism for active-state stabilization of TKD3 during inter-molecular dimerization (Fig. 3c, d). In the end-to-end model, active-state stabilization of TKD3 (inter-molecular receiver, intra-molecular donor) could occur by canonical asymmetric dimerization with TKD2 (inter-molecular donor, intra-molecular receiver)²⁹ (Fig. 3c). In the side-by-side model, active-state stabilization of TKD3 could occur through sterically impaired inactivation by TKD1 (inter-molecular donor, intra-molecular donor) (Fig. 3d), as observed in the 40 µs MD simulation of the EGFR-WT full-length tetramer model in Needham et al. 2016³¹.

We previously observed that Dead² (TKD2 and TKD4 are inactive), but not Dead¹ (TKD1 and TKD3 are inactive), ablates EGFR-KDD activity in the absence of EGF (Fig. 1c, lanes 3, 5, and 8). Here, we see that EGF-ligand stimulation robustly revives phosphorylation in Dead² (Fig. 3a, lanes 15, 16; Fig. 3b, lanes 21, 23; Supplementary Fig. 4a, lanes 15, 16), suggesting active-state stabilization of TKD3 through the formation of at least inter-molecular dimers (Fig. 3c, d). Less dramatic increases in Dead¹ from baseline intra-molecular dimer phosphorylation are consistent with changes due to ligand-induced EGFR recruitment (Fig. 3a, lanes 9, 10; Supplementary Fig. 4a, lanes 9, 10). Consistent with these results, pre-administration with cetuximab prevents EGF-dependent phosphorylation of Dead² and has only a minor impact on Dead¹ phosphorylation. Taken together, these data suggest that in addition to activation of TKD2 and TKD4 by TKD1 and TKD3, respectively, TKD3 becomes catalytically active in the inter-molecular dimer.

To better characterize the effect of EGF on EGFR-KDD and quantify the extent of EGFR-KDD oligomerization in live cells, we performed two-color pulsed interleaved excitation fluorescence cross-correlation spectroscopy (PIE-FCCS)³⁶. PIE-FCCS has been previously applied to evaluate EGFR dimerization and multimerization^29,37. For these experiments, the protein of interest was expressed as a mixture of eGFP and mCherry fusions and single, live-cell measurements were recorded and analyzed as described in the “Methods” section. In the absence of ligand, both EGFR-WT and EGFR-KDD have median cross-correlation (ƒ_c) values of 0.00, indicating that they are predominantly monomeric (Fig. 3e, Supplementary Fig. 6b). Stimulation with EGF ligand leads to a significant level of cross-correlation for EGFR-WT (ƒ_c = 0.19) and EGFR-KDD (ƒ_c = 0.17) (Fig. 3e, Supplementary Fig. 6b), indicating that ligand stimulation induces dimerization and multimerization in both EGFR-WT and EGFR-KDD^36,38. There is no statistically significant difference between EGFR-WT and EGFR-KDD, suggesting that the kinase duplication does not sterically restrict dimerization and multimerization. Taken together, these data demonstrate that EGFR-KDD forms multimers upon ligand binding.

EGFR-KDD directly interacts with ERBB family members

Our biophysical studies demonstrate that EGFR-KDD forms ligand-induced homodimers/multimers. We hypothesized that EGFR-KDD could also heterodimerize with EGFR-WT in the presence of ligand. To test this hypothesis, we performed co-immunoprecipitation in HEK293 cells with transiently co-transfected Myc-epitope tagged EGFR-KDD/EGFR-WT and V5-epitope tagged EGFR-WT/EGFR-KDD. We observed that V5-epitope-tagged EGFR-WT can interact with Myc-epitope-tagged EGFR-KDD, and vice versa (Fig. 4a, Supplementary Fig. 6a). We further evaluated potential interactions between EGFR-WT and EGFR-KDD with PIE-FCCS. With the ƒ_c values, we can distinguish homodimerization and heterodimerization, which cannot be assessed with diffusion coefficients alone. EGFR-WT-eGFP and EGFR-KDD-mCherry were simultaneously expressed in COS7 cells. In the absence of EGF ligand, there was no interaction (ƒ_c = 0.00). Upon addition of EGF-ligand, there was a significant increase in cross-correlation (ƒ_c = 0.22), indicating the formation of heteromeric complexes (Fig. 4b, Supplementary Fig. 6e, Supplementary Table 2). The positive cross-correlation is rigorous evidence for heteromeric complex formation, but alone is not sufficient to define the interaction strength or stoichiometry of the complexes. For simplicity we will refer to these complexes as heterodimers as this is the minimal size consistent with positive cross-correlation. In agreement with changes to the ƒ_c values, the diffusion coefficients of both EGFR-WT and EGFR-KDD decreased after ligand addition, indicating slower diffusion due to homodimerization and hetero-dimerization/multimerization (Supplementary Fig. 6b, e, Supplementary Table 2).

Heterodimerization is especially important for the activation of HER2 and HER3. HER2 has lost the capacity to bind ligands and activates primarily as a receiver kinase domain through heterodimerization with other ERBB family members^39,40. In contrast, the TKD of HER3 has low kinase activity, and HER3 acts as an activator in heterodimers⁴¹. We hypothesized that EGFR-KDD can also interact with wild-type HER2 and HER3. To test this hypothesis, we performed co-immunoprecipitation. We transiently co-transfected Myc-epitope-tagged EGFR-KDD with V5-epitope-tagged HER2-WT and HER3-WT in HEK293 cells. Independent pulldowns with V5 and Myc antibodies demonstrate that EGFR-KDD could interact with HER2 and HER3 (Fig. 4c, Supplementary Fig. 6c, d). Moreover, we observed quantitatively with PIE-FCCS that EGFR-WT and EGFR-KDD heterodimerize with HER2 to a larger extent in the presence of EGF-ligand (ƒ_c = 0.10 and ƒ_c = 0.16, respectively) than in its absence (ƒ_c = 0.00 and ƒ_c = 0.06, respectively) (Fig. 4d, Supplementary Fig. 6f, Supplementary Table 2). Interestingly, our biophysical data suggest that like EGFR-WT, EGFR-KDD also heterodimerizes with HER3 to a greater extent in the presence of NRG1 than in the presence of EGF (Fig. 4e, Supplementary Fig. 6g, Supplementary Table 2). These data demonstrate that EGFR-KDD forms direct interactions with EGFR-WT, HER2, and HER3.

Intra- and inter-molecular dimer activity dual inhibition

The dual nature of EGFR-KDD as an EGF-independent active intra-molecular dimer and as an EGF-dependent active inter-molecular dimer/multimer poses a unique therapeutic challenge. Our computational models and experimental data suggest that the ideal therapy would simultaneously reduce intra-molecular and inter-molecular dimer-mediated activity. One potential treatment strategy is therefore the combination of cetuximab with a TKI (here afatinib). Prior pre-clinical literature has suggested that such a combination may be effective in L858R but not Ex19Del⁴².

The combination of cetuximab with various EGFR TKIs, including gefitinib⁴³ and afatinib^44,45, has been tested in lung cancer patients. In a phase I trial, no responses were observed with the combination of cetuximab plus gefitinib⁴³, and therefore has not been subsequently used in patients. The combination of cetuximab plus afatinib has advanced in the clinic, including a phase I trial (NCT01090011) that included an expansion cohort^44,45. Results from this trial of cetuximab plus afatinib demonstrated that the combination therapy was effective in achieving tumor reduction (as assessed by CT scans using RECIST criteria) in patients with both Ex19Del and L858R EGFR-mutant lung cancer, in contrast to prior pre-clinical data⁴². Importantly, the combination of cetuximab plus TKI is not FDA-approved because there was no benefit (in terms of PFS, intracranial response, and OS) compared to TKI alone, and thus not standardly used in the treatment of patients with Ex19Del or L858R mutations. The current standard of care for these patients is the mutant-selective EGFR TKI, osimertinib, based on a seminal phase 3 clinical trial^46,47.

In contrast, no pre-clinical study or clinical trial has evaluated antibody/TKI combination vs. either alone in EGFR-KDD patients. Indeed, the index patient for EGFR-KDD described in Gallant et al. 2015 unfortunately only had a partial response to afatinib⁷. The anti-tumor response was short-lived (7 cycles of afatinib, or approximately 7 months) before the patient developed acquired resistance to afatinib driven by amplification of the EGFR-KDD allele⁷. Collectively, these observations suggested that more potent EGFR blockade is necessary to overcome the oncogenic activity of EGFR-KDD. Here, we test the hypothesis that combined TKI and cetuximab treatment will reduce EGFR-KDD-mediated phosphorylation in vitro more than either treatment alone.

We treated YAMC cells stably expressing EGFR Ex19Del (E746_A750del), L858R, and EGFR-KDD with afatinib and cetuximab both in the absence and presence of EGF ligand (Fig. 5a, Supplementary Fig. 7a). Importantly, we observed that in both the absence and presence of EGF, afatinib resulted in a near complete ablation of p-EGFR in Ex19Del (Fig. 5a, lanes 1, 2, 5, 6) and L858R (Fig. 5a, lanes 9, 10, 13, 14), but substantial residual phosphorylation existed in EGFR-KDD (Fig. 5a, lanes 17, 18, 21, 22). As expected, cetuximab alone reduced phosphorylation in Ex19Del, L858R, and EGFR-KDD in the presence of EGF ligand (Fig. 5a, lanes 7, 15, 23). Notably, the greatest reduction of phosphorylation for EGFR-KDD occurred with the combination of cetuximab+afatinib in the presence of EGF (Fig. 5a, lanes 21, 22, 23, 24). These data suggest that phosphorylation of EGFR Ex19Del and L858R is abolished by afatinib (TKI) or cetuximab alone, and addition of cetuximab to afatinib does not add substantially more inhibition to the decrease in auto-phosphorylation. Unlike EGFR Ex19Del and L858R, phosphorylation of EGFR-KDD is inhibited by both afatinib and cetuximab as single agent, but the combination treatment yielded more inhibitory effects.

**Fig. 5: Inhibition of EGFR-KDD is maximally achieved by blocking both intra- and inter-molecular dimerization.**

We also performed viability assays with BaF3 cells stably expressing EGFR-KDD, Ex19Del (E746_A750del) or L858R. First, we evaluated Ba/F3 cell growth in serum starved (0.5% fetal bovine serine; FBS) conditions to minimize EGF activation (Supplementary Fig. 7b). At 0.5% FBS, cetuximab maximally exhibited ~40% inhibition of EGFR-KDD, ~80% inhibition of Ex19Del, and almost 100% inhibition of L858R cell viability (Fig. 5b and Supplementary Table 3). These data are consistent with a model in which EGFR-KDD retains an active intra-molecular dimer in the absence of EGF stimulation (Fig. 1) and previously published models of Ex19Del and L858R in which intrinsic αC-helix stabilization transforms them into dimer-dependent “super acceptor” kinases^11,48. Indeed, progressively higher concentrations of FBS and the addition of exogenous EGF resulted in stable or increased viability of all mutants in the presence of cetuximab, though EGFR-KDD proved to be the least inhibited (Fig. 5c, Supplementary Fig. 7c–e, and Supplementary Table 3).

In 0.5% FBS conditions with minimal EGF-ligand present, the potency of afatinib on EGFR-KDD is approximately equivalent in the absence (0 μg/ml) and presence (10 μg/mL) of cetuximab (EC₅₀ = 0.103 ± 0.035 and 0.095 ± 0.040 nM, respectively). Similar results are observed in Ex19Del (EC₅₀ = 0.061 ± 0.027 and 0.060 ± 0.017 nM, respectively). The near complete ablation of Ba/F3 L858R viability at higher concentrations of cetuximab mask any potential similar effects. Generally, we observe that Ex19Del and L858R are more sensitive to afatinib than is EGFR-KDD (Fig. 5b and Supplementary Table 4), consistent with our phosphorylation assays (Fig. 5a and Supplementary Fig. 7a).

As the concentration of EGF-ligand in the medium is increased, we observe not only an increase in viability with cetuximab and increased EC₅₀ of afatinib, but also a greater potentiation of afatinib by cetuximab (Fig. 5b, c and Supplementary Fig. 7c–e). In 10% FBS+ 50 ng/ml exogenous EGF, we observe a 5.8× increase in afatinib potency transitioning from 0 to 10 μg/ml in Ba/F3 EGFR-KDD cells. We also observe potentiation of afatinib in Ex19Del (4.7×) and L858R (3.7×) (Supplementary Fig. 7e and Supplementary Table 4). Compared to Ex19Del and L858R, the larger potentiation of afatinib inhibition of Ba/F3 EGFR-KDD by cetuximab seems to be mediated by the lower inhibition of EGFR-KDD by afatinib. Together, our data suggests that a lower dose of afatinib can be administered to maximally inhibit EGFR-KDD when supplemented with cetuximab.

Discussion

In this study, we combined methods in clinical genomics, computational structural biology, biochemistry, and biophysics to mechanistically characterize a former VUS, EGFR exon 18–25 Kinase Domain Duplication (EGFR-KDD). To investigate the prevalence of KDD in all ERBB family members across various cancers, we analyzed comprehensive genomic profiling data from two large databases. We discovered that ERBB-KDDs are recurrent at a frequency between 0.58% and 2.4% in glioma, 0.07–0.22% in NSCLC, and 0.05–0.40% in breast cancer. We identified fractions of KDDs in multiple other tumor types as well. No previous studies have reported KDD in ERBB2, ERBB3, and ERBB4. These data indicate that ERBB-KDDs account for a small but significant fraction of ERBB family-mediated cancers, and suggest utility of approved targeted therapies for patients based on standard of care clinical genomic testing. Importantly, developing targeted therapies for uncommon variants has precedent. ROS1 variants account for ~1% of lung cancers⁴⁹ and have been detected with lower prevalence in multiple other cancers⁵⁰ and NTRK fusions have been implicated in 0.31% of adult tumors and in 0.34% of pediatric tumors⁵¹. There are TKIs targeting both ROS1 and NTRK^52,53 that are FDA approved and additional agents in clinical development. Further, in the case of KDD, both TKIs and antibody therapies already exist for ERBB receptors, thus new trials and therapeutic strategies for this population does not depend on new therapy development.

We sought to elucidate the mechanisms of EGFR-KDD-driven oncogenicity. We demonstrate that EGFR-KDD forms a catalytically active asymmetric intra-molecular dimer in the absence of EGF-ligand stimulation. Mutations disrupting the intra-molecular dimerization interface abolish the phosphorylation of EGFR-KDD in its monomeric form, and the loss of phosphorylation in these mutants can be recovered by the formation of inter-molecular dimerization and multimerization. These data demonstrate that ligand-independent constitutive activation of EGFR-KDD is driven by asymmetric intra-molecular dimerization.

We next characterized differences in the functionality of the JMB region of EGFR-KDD relative to EGFR-WT. The JMB is a conserved stretch of amino acids critical for inter-molecular dimerization in wild-type ERBB-family receptor kinases. In EGFR-KDD, the JMB region of TKD2 is covalently linked to the C-terminus of TKD1. All-atom computational modeling investigations coupled with in vitro mutagenesis suggests the EGFR-KDD linker region is capable of forming specific stabilizing JMB domain contacts within the intra-molecular dimer; however, the forced proximity of the two kinase domains by the linker is sufficient for elevated EGFR-KDD activity relative to EGFR-WT. In comparison, EGFR-WT depends on stable contacts in the JMB domain for dimer activity^26,27. We focused our analysis on EGFR-KDD with duplication of exons 18–25, but other groups have recently identified EGFR-KDD with longer duplications (e.g. exons 14–26 and exons 17–25)⁵⁴ that may reduce the likelihood of forming stabilizing contacts at the linker JMB interface of the intra-molecular dimer. We speculate that there may be selective pressure for specific linker lengths/sequences in the formation of KDDs. Recent investigations have suggested similar structural constraints in the context of BRAF, HER2, and EGFR β3-αC deletion mutations⁵⁵.

EGFR-KDD further forms EGF-dependent inter-molecular dimers. Inter-molecular dimerization of EGFR-KDD increases activity in part by stabilizing the active conformation of the EGFR-KDD donor kinase domain. This has broad implications for HER-family signaling as well. We speculate that the formation of dual activator/receiver kinases in higher order oligomers of HER-family receptors may contribute to ligand-dependent increases in phosphorylation^29,31. In the present study, we did not identify the configuration of the EGFR-KDD inter-molecular dimer/multimer. Mutations at N¹ and C² only partially disrupted EGF-dependent phosphorylation (Fig. 3a, lanes 7, 8, 17, 18; Supplementary Fig. 4a, lanes 7, 8, 17, 18). Moreover, in the side-by-side structural model, the N-termini of TKD1 and TKD3 are oriented in close proximity (Fig. 3c, yellow), while in the end-to-end model they are separated (Fig. 3b, yellow). Consequently, we considered the end-to-end model less likely to form interactions between the N-terminal juxtamembrane A (JMA) and TM domains of the two interacting proteins, a key feature of inter-molecular dimerization in EGFR-WT^26,27. Nevertheless, it is clear that EGFR-KDD is forming an EGF-dependent inter-molecular dimer. We anticipate that future investigations will identify the most likely inter-molecular configuration(s).

Interestingly, EGF-stimulated EGFR-KDD displays substantially more phosphorylation than EGF-stimulated canonical-activating mutations (Fig. 5a and Supplementary Fig. 4a). We speculate that this may be because of the increased ratio of EGF-ligand to active recruited kinase domains in EGFR-KDD (i.e. EGF-mediated dimerization of two ECDs results in an effective tetramer of intracellular kinase domains with potentially 2–3 active TKDs, versus typical oncogenic activation with 1–2 active TKDs). Alternatively, it may be that the EGFR-KDD inter-molecular dimer forms a more favorable interface than other oncogenic mutants, thus resulting in increased dimerization and activity. A combination of factors likely contributes to the overall increase in phosphorylation that we observe. Additional studies are needed to characterize the EGFR-KDD inter-molecular dimer.

Through a combination of biochemical and biophysical methods, we also determined that EGF-ligand stimulation induces formation of catalytically active homo- and hetero- inter-molecular dimers and multimers. Critically, this demonstrates that EGFR-KDD retains the ability to activate other ERBB family members. This has important implications for the therapeutic management of patients whose tumors harbor EGFR-KDD. Indeed, we found neither cetuximab nor afatinib alone were able to completely ablate EGFR-KDD phosphorylation. We demonstrate, however, that cetuximab can be used to potentiate afatinib inhibitory activity for greater overall inhibition. We suspect that this is because of the synergistic mechanisms of the two drugs: cetuximab disassembles dimers and removes the ability of EGFR-KDD to activate other ERBB kinases, and afatinib inhibits the active intra-molecular dimer EGFR-KDD. It has been well-recognized that cetuximab induces degradation of EGFR mutants in different NSCLC cells^56,57. In this study, no degradation of EGFR-Ex19Del, L858R and EGFR-KDD levels were observed in YAMC (Fig. 5a) and NR6 cells (Supplementary Fig. 4b), probably due to the shorter treatment time than previous studies (4 h versus 24–72 h)^56,57.

Finally, our computational and biochemical insights raise important considerations for the use of EGFR-KDD as a research tool. Whereas the inactive form of EGFR can be readily studied by the introduction of inter-molecular dimer-disrupting interface mutations²¹, controlling the active fraction of EGFR in vitro has typically required introduction of known oncogenic point mutations or stimulation with EGF-ligand. The former causes well-documented perturbations to enzyme kinetics^58,59,60,61, while recent literature has demonstrated that the latter can influence EGFR multimerization and phosphorylation in a concentration-dependent manner³¹. Moreover, dimerization and activation of EGFR oncogenic missense mutants is dependent on protein concentration⁶² and/or EGF-ligand stimulation⁴⁸. EGFR-KDD provides a model of a fully active EGFR dimer in an EGF-independent setting, and may provide a more native-like control than kinase domain missense mutants without the complexity of concentration-dependent signaling effects.

Kinase domain duplications (KDDs) represent a novel form of activation for oncogenic kinases via a mechanism of constitutive dimerization. In this study, we have systematically characterized the fundamental biochemical and biophysical features of a prototypical KDD, EGFR-KDD. Subsequently, we identified potential treatment strategies in pre-clinical models of EGFR-KDD-mediated disease. This represents a comprehensive mechanistic and pre-clinical evaluation of treatment strategies specifically for a KDD-mediated disease. We anticipate that our results will also be used to inform additional studies on kinase duplication domains.

Methods

Cell culture, reagents, and transfection

Ba/F3 cells were purchased from DSMZ. NR6 cells were a kind gift from Dr. William Pao⁶³. YAMC EGFR^−/− cells were a kind gift from Dr. Robert H. Whitehead²³. Plat-GP cells were purchased from CellBioLabs. HEK293 cells were purchased from ATCC. Ba/F3 cells were maintained in RPMI 1640 medium (Mediatech, Inc.) supplemented with 1 ng/mL murine IL3 (Gibco, Life Technologies). NR6 cells were maintained in DMEM (Gibco). The Plat-GP cell line was cultured in full DMEM with selection of 1 μg/mL blasticidin (Gibco). YAMC cells were cultured as previously described^23,64. COS-7 cells were cultured in DMEM (Calsson Lab, Smithfield, UT). All media were supplemented with 10% heat-inactivated FBS (Gibco) and penicillin–streptomycin (Gibco) to final concentrations of 100 U/mL and 100 μg/mL, respectively. All cell lines were maintained in a humidified incubator with 5% CO₂ at 37 °C (33 °C for YAMC cells⁶⁴) and routinely evaluated for mycoplasma contamination.

Cetuximab was purchased from Bristol-Myers Squibb (Princeton, NJ). mAb806 is produced and purified in the Biological Production Facility (Ludwig Institute for Cancer Research, Melbourne)^65,66. Transient transfection for expression in HEK293 cells was carried out using Lipofectamine 2000 (Invitrogen) according to the manufacturer’s instructions. A total of 0.45 μg of each expression plasmid was used per well in six-well plates. To assess ligand-dependent EGFR activation, cells were serum starved overnight and treated with 50 ng/mL EGF for 5 min.

For PIE-FCCS experiments, COS-7 cells were transiently transfected 24 h before the experiment using Lipofectamine 2000 (Invitrogen). A total of 5 µg DNA (1:1 ratio of mCherry-tagged and eGFP-tagged plasmids mixture) was used per 35 mm MatTek plate (MatTek Corporation, Ashland, MA) to express both fluorescent-tagged species evenly and acquire the local density of 100–2000 receptors/µm². The media was changed to Opti-MEM I Reduced Serum Medium without phenol red (Thermo Fisher Scientific) before placing the plate in the on-stage incubator (37 °C) for FCCS measurement. Measurements were taken for both ligand-free and ligand-stimulated state of each construct, with 2 μg/mL recombinant human EGF (Sigma Aldrich, St. Louis, MO) or NRG1 (R&D Systems, Inc., Minneapolis, MI) as the ligand.

Plasmid construction

Generation of EGFR-KDD, EGFR-WT, and EGFR-L858R constructs was described previously⁷. EGFR-KDD mutations were constructed by using multisite-directed mutagenesis (Agilent) on the pMa-EGFR-KDD plasmid per the manufacturer’s recommendations—with the exception of extension time being set at 1.5 min/kb. To specifically introduce mutations into each TKD due to the presence of two identical TKDs at the genomic level, after bi-directional dideoxy sequencing, pMa-EGFR-KDD-mutants were digested with ClaI and recombined with other pMa-EGFR-KDD fragments to create all single mutants: ClaI digests mutated pMa-EGFR-KDD plasmid were recombined with a ClaI–ClaI segments from unmutated pMa-EGFR-KDD and/or ClaI digests of unmutated pMa-EGFR-KDD plasmid were recombined with ClaI–ClaI segments from mutated pMa-EGFR-KDD. pMa-EGFR-KDD mutants were then subcloned to the pMSCV vector by HpaI digest and then subcloned to pcDNA3.1(−) vector by XhoI/HindIII digest. All plasmids were verified in the forward and reverse directions by Sanger sequencing. To obtain V5-epitope-tagged EGFR-KDD, we used PCR to add AgeI to the 3′ end of EGFR-KDD fragment by using pMSCV-EGFR-KDD as template, then EGFR-KDD fragment was inserted into pcDNA6-V5 HisB vector by using SnaBI and XhoI. To obtain Myc-epitope-tagged EGFR-KDD, the EGFR-KDD fragment was subcloned to pEF4Myc-HisB vector by using MfeI and XhoI. pcDNA6-EGFR-WT with Myc-epitope tag was purchased from Addgene (#42665). V5-epitope-tagged HER2 and HER3 were kind gift from Dr. Carlos L. Arteaga⁶⁷. For PIE-FCCS experiments, EGFR-WT, HER2, and HER3 were subcloned to eGFP-N1 and mCherry-N1 vectors by XhoI and AgeI digests. EGFR-KDD was subcloned to eGFP-N2 and mCherry-N2 vectors by using SnaBI and XhoI digests. For V5-tagged epitope EGFR-WT, we replaced the eGFP fragment of pEGFR-N1-EGFR-WT by V5-tagged epitope. In this study, for EGFR mutations, we utilized codon numbering of the human immature EGFR sequence that includes the 24-residue signal sequence (Supplementary Table 1). Primers used in this study were listed in Supplementary Table 5.

Generation of stable cell lines

Constructs of pMSCV, EGFR-WT, EGFR-L858R, EGFR-KDD, and EGFR-KDD-I706Q, D837N, V948R, I1057Q, D1188N, and V1299R mutations were introduced into NR6 and YAMC cells separately by retroviral transduction system as described previously⁷. Construct of EGFR Ex19Del (E746_A750del) was stably introduced into YAMC cells, and constructs of EGFR Ex19Del (E746_A750del), EGFR-L858R, and EGFR-KDD were stably introduced into Ba/F3 cells as described previously⁶⁸.

Immunoblotting and antibodies

For immunoblotting, cells were washed in cold PBS, and lysed in RIPA buffer (150 mmol/L NaCl, 1% Triton-X-100, 0.5% Na-deoxycholate, 0.1% SDS, 50 mmol/L Tris–HCl, pH 8.0) with freshly added 40 mmol/L NaF, 1 mmol/L Na₃VO₄, and protease inhibitor (Thermo Fisher Scientific, Waltham, MA). Lysates were quantified by Bradford assay in SmartSpec Plus Spectrophotometer (Bio-Rad, Hercules, CA) following the manufacturer’s instructions. Lysates were subjected to SDS–PAGE followed by blotting with the indicated antibodies and detection by Western Lightning ECL reagent (Perkin Elmer, Waltham, MA). The densitometry for both phosphotyrosine content at Y1068 and total EGFR expression was quantified by ImageJ Software. The ratio of phosphotyrosine to total EGFR expression for each construct relative to EGFR-KDD was calculated. All immunoblotting experiments were performed three independent times and one representative replicate was shown in the manuscript. Raw, uncropped, and unprocessed scans of all blots as well as quantifications and standard deviations were included in the Source Data file.

For co-immunoprecipitation experiments, cells were washed in cold PBS and lysed in hypotonic buffer (20 mM HEPES pH7.5, 10 mM KCl, 1 mM EDTA, 1 mM EGTA, 1 mM mgCl₂, 0.1% NP-40, EDTA-free Protease Inhibitor Cocktail (Sigma-Aldrich # 04693159001)). The lysates were supplemented with 150 mM NaCl before centrifuging. Protein G Dynabeads (# 10004D, Life Technologies, Carlsbad, CA) were incubated with the primary antibody for 30 min at room temperature. Lysates were then added and incubated for 3 h at 4 °C. Immobilized beads were washed three times with hypotonic buffer supplemented with 0.65 M NaCl. 2×SDS loading buffer was added to the beads and then used for immunoblotting analysis. All co-immunoprecipitation experiments were performed two independent times and one representative replicate was shown in the manuscript.

Antibodies

Antibodies used included: EGFR (1:2000, #4267), phospho-EGFR (Y992) (1:1000, #2235), phospho-EGFR (Y1068) (1:1000, #2234), phospho-EGFR (Y1173) (1:1000, #4407) (For EGFR phosphorylation sites, we utilized codon numbering of mature EGFR sequence that does not include the 24-residue signal sequence, Supplementary Table 1), horseradish peroxidase (HRP)-conjugated anti-mouse (1:5000, #7076), and HRP-conjugated anti-rabbit (1:5000, #7074) (Cell Signaling, Beverly, MA); V5 (1:5000, MCA1360GA, AbD Serotec), Myc (1:2500, Sigma-Aldrich A5963); actin antibody (1:5000, Sigma-Aldrich A2066).

Pulsed interleaved excitation fluorescence cross-correlation spectroscopy

FCCS data were taken on a customized inverted microscope setup coupled with pulsed interleaved excitation and time-correlated single photon detection as described in previous works^29,37. A supercontinuum pulsed laser (9.2 MHz repetition rate, SuperK NKT Photonics, Birkerød, Denmark) was split into two beams of 488 and 561 nm through a series of filters and mirrors for the excitation of eGFP and mCherry, respectively. The beams were directed through two different-length single mode optical fiber to introduce 50 ns time delay for pulsed interleaved excitation to eliminate possible spectral crosstalk⁶⁹. The beams were overlapped before entering the microscope through a dichroic beam splitter (LM01-503-25, Semrock) and a customized filter block (zt488/561rpc, zet488/561m, Chroma Technology). A ×100 TIRF oil objective (Nikon, Tokyo, Japan) was used for the excitation beam focus and fluorescence emission collection. A short fluorescently tagged DNA fragment was used to verify the alignment of the system, including the confocal volume overlap. Negative and/or positive controls (Supplementary Fig. 5a, b) were tested regularly prior to the experimental samples for comparisons of the fit parameters. The excitation beams were focused to the peripheral membrane of the cell to allow the fluorescence measurements of only the membrane-bound receptors. Data were only taken on the flat, peripheral membrane area, where the distance between the basal and apical membranes were within a few hundred nanometers, to avoid inclusion of fluorescence from cytosolic organelles or vesicles. For each cell, one area of the membrane was selected for data collection. Six 10-s acquisitions were taken per area. The fluorescence signal was collected through a home-built confocal detection unit with a 50 µm confocal pinhole and dichroic beam splitter (LM01-503-25, Semrock, Rochester, NY). The two signals were filtered (91032, Chroma Technology Corp., Bellows Falls, VT; zt488/561rpc and zet488/561 m, Chroma Technology Corp., Bellows Falls, VT) and then focused independently on to single-photon avalanche diodes (Micro Photon Devices, Bolzano, Italy). The photon counts were recorded by a time-correlated single photon counting module (Picoharp 300, PicoQuant, Berlin, Germany). For analysis, the time-tagged photon data were gated to isolate photons that arrived within 40 ns after each laser pulse arrival time. Then we calculated auto-correlation and cross-correlation curves corresponding to each species using our custom MATLAB script. Curves of six consecutive acquisitions per area were averaged then fitted to a single component, 2D diffusion model as described in previous works^37,38,69.

The auto-correlation curves contain two types of decay. The first decay is due to the photophysical activity, such as triplet relaxation or blinking. The second decay indicates the average dwell time (τ_D), which is used to calculate the effective diffusion coefficient using D_eff = ω_o²/4τ_D. The amplitude of the correlation curves indicates local concentration of the diffusing receptors. Using the cross-correlation curve (Supplementary Fig. 5), we can calculate cross-correlation values, or fraction correlated (ƒ_c) values that indicate the degree of oligomerization. For an ideal system undergoing on dimerization, the ƒ_c value varies from 0 to 1, with 0 indicating the system is monomeric and 1 indicating complete dimerization. For real systems, effects like photostability, interaction statistics, and relative expression levels drop the expected ƒ_c value for dimerization into the range of 0.10–0.15 for a monomer–dimer equilibrium. For higher order oligomerization the ƒ_c values will increase, allowing us to compare the degree of oligomerization for more complex systems⁶⁹.

Anchorage-independent assays and cell viability assay

Anchorage-independent assays were performed using modified protocols^70,71. For the bottom layer of agar, 1.5 mL of a 1:1 mix of 1.0% agar (prepared in 1× PBS) and media were plated in each well of six-well plate. For the upper layer of agar, 1.5 mL of a 1:1 mix of 0.6% agar (prepared in 1× PBS) and media containing 5000 cells was plated into each well of six-well plate. Colonies were counted using GelCount (Oxford Optronix) with identical acquisition and analysis settings. Cell viability assay was performed on IL3-independent Ba/F3 cells stably expressing EGFR-KDD, Ex19Del, and L858R by using CellTiter-Blue^® Cell Viability Assay (#G8080, Promega, Madison, WI) following manufacturer’s instructions. Three days after incubation, CellTiter-Blue Reagent was added, and the fluorescence was detected at 560_EX/590_EM with a Synergy HTX microplate reader (BioTek Instruments, Winooski, VT, USA). All experiments of anchorage-independent assays and cell viability assay were performed three independent times in triplicate, and one representative replicate was shown in the manuscript.

Molecular modeling

Previously, we performed de novo loop modeling to determine a geometrically plausible model of the EGFR-KDD linker region⁷. Here, an all-atom structural model of the EGFR-KDD intracellular domain was generated with RosettaCM⁷² with the active EGFR WT dimer PDB ID 2GS6 as the base template. Missing density in the β3-αC region was templated with PDB ID 2ITX. The N- and C- termini of the donor and receiver kinases of the EGFR-KDD intra-molecular dimer, respectively, as well as the connecting linker region, are based on three templates: the previously modeled linker region from Gallant et al. 2015⁷; the JMB domain of PDB ID 4RIW; and the JMB domain of PDB ID 3GOP. Missing residues are modeled de novo with RosettaCM fragment insertion. Three rounds of comparative modeling were performed. After rounds two and three, the best scoring models with varying RMSDs from the lowest scoring model in each round were selected as additional starting templates for the next round. After the third round, distance-based clustering of the linker region identified three low-energy clusters. The best scoring model from each cluster was refined with a 1 μs MD simulation in Amber18⁷³. The final EGFR-KDD model and EGFR-WT homodimer subsequently each underwent 1 µs MD simulations.

Models were solvated in a rectangular box of SPC/E explicit solvent neutralized with monovalent anions. Protein was buffered on all sides with 12 Å solvent. Solvent and ions were minimized with 500 steps steepest gradient descent followed by 1000 steps of conjugate gradient descent while protein atoms were restrained with a force constant of 10.0 kcal/mol/A². The protein was then minimized for 200 steps steepest gradient descent followed by 800 steps of conjugate gradient descent in buffer restrained with a force constant of 5.0 kcal/mol/A². Finally, restraints were removed from the system for 100 additional steps of steepest gradient descent followed by 900 steps of conjugate gradient descent minimization.

Post-minimization, SHAKE was implemented to constrain covalent bonds to hydrogen atoms. Systems were slowly heated in NVT ensemble to 100 K over 50 ps with a 1 fs timestep. Subsequently, systems were heated in NPT ensemble at 1 bar with isotropic position scaling from 100 to 300 K over 500 ps and 1 fs timestep. Equilibration/production simulations were run in the NPT ensemble at 300 K with a Monte Carlo barostat. Temperature was controlled using Langevin dynamics with a collision frequency of 1 ps⁻¹ and a unique random seed for each simulation. Periodic boundary conditions were imposed on the system throughout heating and equilibration. Electrostatics were evaluated using the particle mesh Ewald (PME) method and a distance cutoff of 8.0 Å. A 2 fs integration timestep was employed during production simulations. All RMSD and RMSF calculations were performed with CPPTRAJ⁷⁴.

Approximations of the linker interaction energies of the top three EGFR-KDD clusters were performed with the single-trajectory molecular mechanics/generalized Born solvent-accessible surface area (MM-GBSA) method as implemented in MMPBSA.py⁷⁵. GBSA was calculated with the OBCII generalized born solvent model with a surface tension of 0.0072 kcal/mol/Å² and salt concentration of 0.15 M, and nonpolar contributions to the solvation free energy were computed with the LCPO method. Entropic contributions to binding were neglected. The final reported values are averaged over frames collected every 100 ps.

Kinase domain duplication detection from foundation medicine and MSK-IMPACT datasets

For the Foundation Medicine dataset, a minimum of 50 ng of DNA was extracted from formalin-fixed paraffin-embedded sections and comprehensive genomic profiling was performed on hybridization-captured, adaptor ligation-based libraries to a median exon coverage depth of >500× for all coding exons of 315 (FoundationOne^®, n = 152,674), or 324 (FoundationOneCDx^®, n = 86,824) cancer-related genes plus selected introns from genes frequently rearranged in cancer to identify base substitutions, small insertions or deletions, copy number alterations (focal amplifications and homozygous deletions), and rearrangements, as previously described⁷⁶. Testing was performed in a Clinical Laboratory Improvement Amendments-certified, College of American Pathologists-accredited reference laboratory (Foundation Medicine, Cambridge, MA). We interrogated the Foundation Medicine dataset of n = 239,498 consecutive unique solid tumor specimens for KDD in EGFR, ERBB2, ERBB3, and ERBB4. These rearrangement duplications were detected by clustering chimeric and semi-mapped paired-end reads within each gene of interest and mapping breakpoints onto the hg19 reference genome assembly, as previously described⁷⁶. A KDD was therein defined as a large genomic duplication where breakpoints both flanked and did not disrupt the region corresponding to the respective gene’s kinase domain. Statistical enrichment including P-value and odds-ratio (OR) were calculated using Fisher’s exact testing. For a detailed description of the FMI data analysis pipeline, please see the Supplementary Methods section in the Supporting Information file. Approval for this study, including a waiver of informed consent and a Health Insurance Portability and Accountability Act waiver of authorization, was obtained from the Western Institutional Review Board (protocol no. 20152817). This is retrospective research that involves no more than a minimal risk to the privacy of patients and involves no intervention or contact with the patients. FMI provides FMI Tests at the request of treating physicians and therefore has no direct relationship with any of these patients. Moreover, in many cases the patients may no longer be associated with the treating physician who ordered their FMI Test or may be deceased, and therefore it may be impossible to contact these patients.

Identification of KDDs from the MSK-IMPACT dataset is a re-analysis of published data¹⁸. MSK-IMPACT-sequencing data from patients whose tumor and matched normal samples were prospectively sequenced between January 2014 and September 2019 (n = 40,165, NCT01775072) were used in this study. Structural variant detection was performed on the paired-end reads using Delly (version 0.7.5; https://github.com/dellytools/delly). Duplication events that surrounded or overlapped known kinase domains were selected for further manual review. For copy number-based analysis, coverage data from the tumor and an unmatched normal sample were used to generate a fold change value⁷⁷ for each exon in a kinase gene. Using k-mean clustering (k = 2), we identified samples where one of the clusters was overlapping (requiring at least 70% of the kinase domain to be involved) or encompassing the kinase domain with a median cluster fold change difference of at least 0.4. We combined the two datasets for further manual review to identify a subset of confident KDD calls.

Statistical analysis

Statistical significance was analyzed using one-way ANOVA with Bonferroni or uncorrected Fisher’s least significant difference (LSD) post hoc test. Results were displayed as mean values or median values ± standard deviation (SD) or standard error of the mean (SEM). For all tests, the criteria for significance were nonsignificant (ns), P < 0.05 (*), P < 0.01 (**), P < 0.001 (***), and P < 0.0001 (****). Statistical analysis was carried out using Prism 9 (GraphPad Software).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The authors declare that all data supporting the findings of this study are available within the article and its supplementary information files or from the corresponding author upon request. The results underlying Table 1 are based on a combination of genomic sequencing data from Foundation Medicine Inc. and MSK-IMPACT. In accordance with the Health Insurance Portability and Accountability Act, in an effort to minimize the risk of re-identification of individuals, individual-level data are not publicly available. For the Foundation Medicine dataset, raw sequencing data are proprietary and not publicly available. However, requests from accredited researchers for access to de-identified individual-level or aggregate data relevant to this manuscript, such as tumor type and mutational status, can be made available upon request by contacting Dr. Alexa B. Schrock at aschrock@foundationmedicine.com. Accredited researchers should provide contact information, affiliation/organization, and research rationale. The analysis presented here from the MSK-IMPACT dataset is a re-analysis of data originally reported by Zehir and coworkers¹⁸. The MSK-IMPACT dataset is publicly available through the cBioPortal for Cancer Genomics (http://cbioportal.org/msk-impact). MSK-IMPACT KDD data can be made available upon request. Protein Data Bank (PDB) identifiers 2GS6, 2ITX, 3GOP, and 4RIW were accessed to assist with model building for this study. In addition, Dataset 1 from the Supplementary Information of Needham and colleagues³¹ was accessed to assist with model building of the EGFR-KDD inter-molecular dimer. Source data are provided with this paper.

References

Wheler, J. J. et al. Cancer therapy directed by comprehensive genomic profiling: a single center study. Cancer Res. 76, 3690–3701 (2016).
Article CAS PubMed Google Scholar
Ross, J. S. et al. Comprehensive genomic profiling of carcinoma of unknown primary site: new routes to targeted therapies. JAMA Oncol. 1, 40–49 (2015).
Article PubMed Google Scholar
Qin, A. et al. Detection of known and novel FGFR fusions in non-small cell lung cancer by comprehensive genomic profiling. J. Thorac. Oncol. 14, 54–62 (2019).
Article CAS PubMed Google Scholar
Disel, U. et al. The pan-cancer landscape of coamplification of the tyrosine kinases KIT, KDR, and PDGFRA. Oncologist 25, e39–e47 (2020).
Article CAS PubMed Google Scholar
Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med 17, 405–424 (2015).
Article PubMed PubMed Central Google Scholar
Li, M. M. et al. Standards and Guidelines for the Interpretation and Reporting of Sequence Variants in Cancer: A Joint Consensus Recommendation of the Association for Molecular Pathology, American Society of Clinical Oncology, and College of American Pathologists. J. Mol. Diagn. 19, 4–23 (2017).
Article CAS PubMed PubMed Central Google Scholar
Gallant, J. N. et al. EGFR Kinase Domain Duplication (EGFR-KDD) is a novel oncogenic driver in lung cancer that is clinically responsive to afatinib. Cancer Discov. 5, 1155–1163 (2015).
Article CAS PubMed PubMed Central Google Scholar
Dankner, M., Rose, A. A. N., Rajkumar, S., Siegel, P. M. & Watson, I. R. Classifying BRAF alterations in cancer: new rational therapeutic strategies for actionable mutations. Oncogene 37, 3183–3199 (2018).
Article CAS PubMed Google Scholar
Yao, Z. et al. BRAF mutants evade ERK-dependent feedback by different mechanisms that determine their sensitivity to pharmacologic inhibition. Cancer Cell 28, 370–383 (2015).
Article CAS PubMed PubMed Central Google Scholar
Pao, W. & Chmielecki, J. Rational, biologically based treatment of EGFR-mutant non-small-cell lung cancer. Nat. Rev. Cancer 10, 760–774 (2010).
Article CAS PubMed PubMed Central Google Scholar
Shan, Y. et al. Oncogenic mutations counteract intrinsic disorder in the EGFR kinase and promote receptor dimerization. Cell 149, 860–870 (2012).
Article CAS PubMed Google Scholar
Vivanco, I. et al. Differential sensitivity of glioma- versus lung cancer-specific EGFR mutations to EGFR kinase inhibitors. Cancer Discov. 2, 458–471 (2012).
Article CAS PubMed PubMed Central Google Scholar
Brennan, C. W. et al. The somatic genomic landscape of glioblastoma. Cell 155, 462–477 (2013).
Article CAS PubMed PubMed Central Google Scholar
Imielinski, M. et al. Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing. Cell 150, 1107–1120 (2012).
Article CAS PubMed PubMed Central Google Scholar
Frattini, V. et al. The integrated landscape of driver genomic alterations in glioblastoma. Nat. Genet. 45, 1141–1149 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cancer Genome Atlas Research, N. Comprehensive molecular profiling of lung adenocarcinoma. Nature 511, 543–550 (2014).
Article ADS CAS Google Scholar
Mishra, R., Hanker, A. B. & Garrett, J. T. Genomic alterations of ERBB receptors in cancer: clinical implications. Oncotarget 8, 114371–114392 (2017).
Article PubMed PubMed Central Google Scholar
Zehir, A. et al. Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. Nat. Med. 23, 703–713 (2017).
Article CAS PubMed PubMed Central Google Scholar
Nik-Zainal, S. et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature 534, 47–54 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Lemmon, M. A. & Schlessinger, J. Cell signaling by receptor tyrosine kinases. Cell 141, 1117–1134 (2010).
Article CAS PubMed PubMed Central Google Scholar
Zhang, X., Gureasko, J., Shen, K., Cole, P. A. & Kuriyan, J. An allosteric mechanism for activation of the kinase domain of epidermal growth factor receptor. Cell 125, 1137–1149 (2006).
Article CAS PubMed Google Scholar
Pruss, R. M. & Herschman, H. R. Variants of 3T3 cells lacking mitogenic response to epidermal growth factor. Proc. Natl Acad. Sci. USA 74, 3918–3921 (1977).
Article ADS CAS PubMed PubMed Central Google Scholar
Dise, R. S., Frey, M. R., Whitehead, R. H. & Polk, D. B. Epidermal growth factor stimulates Rac activation through Src and phosphatidylinositol 3-kinase to promote colonic epithelial cell migration. Am. J. Physiol. Gastrointest. Liver Physiol. 294, G276–G285 (2008).
Article CAS PubMed Google Scholar
Walton, G. M., Chen, W. S., Rosenfeld, M. G. & Gill, G. N. Analysis of deletions of the carboxyl terminus of the epidermal growth factor receptor reveals self-phosphorylation at tyrosine 992 and enhanced in vivo tyrosine phosphorylation of cell substrates. J. Biol. Chem. 265, 1750–1754 (1990).
Article CAS PubMed Google Scholar
Helin, K. et al. The biological activity of the human epidermal growth factor receptor is positively regulated by its C-terminal tyrosines. Oncogene 6, 825–832 (1991).
CAS PubMed Google Scholar
Jura, N. et al. Mechanism for activation of the EGF receptor catalytic domain by the juxtamembrane segment. Cell 137, 1293–1307 (2009).
Article PubMed PubMed Central Google Scholar
Red Brewer, M. et al. The juxtamembrane region of the EGF receptor functions as an activation domain. Mol. Cell 34, 641–651 (2009).
Article PubMed CAS Google Scholar
He, L. & Hristova, K. Consequences of replacing EGFR juxtamembrane domain with an unstructured sequence. Sci. Rep. 2, 854 (2012).
Article ADS PubMed PubMed Central CAS Google Scholar
Huang, Y. et al. Molecular basis for multimerization in the activation of the epidermal growth factor receptor. Elife 5, e14107 (2016).
Article PubMed PubMed Central CAS Google Scholar
Clayton, A. H. et al. Ligand-induced dimer–tetramer transition during the activation of the cell surface epidermal growth factor receptor—a multidimensional microscopy analysis. J. Biol. Chem. 280, 30392–30399 (2005).
Article CAS PubMed Google Scholar
Needham, S. R. et al. EGFR oligomerization organizes kinase-active dimers into competent signalling platforms. Nat. Commun. 7, 13307 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Chung, I. et al. Spatial control of EGF receptor activation by reversible dimerization on living cells. Nature 464, 783–787 (2010).
Article ADS CAS PubMed Google Scholar
Li, S. et al. Structural basis for inhibition of the epidermal growth factor receptor by cetuximab. Cancer Cell 7, 301–311 (2005).
Article CAS PubMed Google Scholar
Sivasubramanian, A., Chao, G., Pressler, H. M., Wittrup, K. D. & Gray, J. J. Structural model of the mAb 806–EGFR complex using computational docking followed by computational and experimental mutagenesis. Structure 14, 401–414 (2006).
Article CAS PubMed Google Scholar
Reilly, E. B. et al. Characterization of ABT-806, a humanized tumor-specific anti-EGFR monoclonal antibody. Mol. Cancer Ther. 14, 1141–1151 (2015).
Article CAS PubMed Google Scholar
Christie, S., Shi, X. & Smith, A. W. Resolving membrane protein-protein interactions in live cells with pulsed interleaved excitation fluorescence cross-correlation spectroscopy. Acc. Chem. Res. 53, 792–799 (2020).
Article CAS PubMed Google Scholar
Endres, N. F. et al. Conformational coupling across the plasma membrane in activation of the EGF receptor. Cell 152, 543–556 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kaliszewski, M. J. et al. Quantifying membrane protein oligomerization with fluorescence cross-correlation spectroscopy. Methods 140-141, 40–51 (2018).
Article CAS PubMed PubMed Central Google Scholar
Graus-Porta, D., Beerli, R. R., Daly, J. M. & Hynes, N. E. ErbB-2, the preferred heterodimerization partner of all ErbB receptors, is a mediator of lateral signaling. EMBO J. 16, 1647–1655 (1997).
Article CAS PubMed PubMed Central Google Scholar
Klapper, L. N. et al. The ErbB-2/HER2 oncoprotein of human carcinomas may function solely as a shared coreceptor for multiple stroma-derived growth factors. Proc. Natl Acad. Sci. USA 96, 4995–5000 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Jura, N., Shan, Y., Cao, X., Shaw, D. E. & Kuriyan, J. Structural analysis of the catalytically inactive kinase domain of the human EGF receptor 3. Proc. Natl Acad. Sci. USA 106, 21608–21613 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Cho, J. et al. Cetuximab response of lung cancer-derived EGF receptor mutants is associated with asymmetric dimerization. Cancer Res. 73, 6770–6779 (2013).
Article CAS PubMed PubMed Central Google Scholar
Ramalingam, S. et al. Dual inhibition of the epidermal growth factor receptor with cetuximab, an IgG1 monoclonal antibody, and gefitinib, a tyrosine kinase inhibitor, in patients with refractory non-small cell lung cancer (NSCLC): a phase I study. J. Thorac. Oncol. 3, 258–264 (2008).
Article PubMed Google Scholar
Janjigian, Y. Y. et al. Dual inhibition of EGFR with afatinib and cetuximab in kinase inhibitor-resistant EGFR-mutant lung cancer with and without T790M mutations. Cancer Discov. 4, 1036–1045 (2014).
Article CAS PubMed PubMed Central Google Scholar
Horn, L. et al. Continued use of afatinib with the addition of cetuximab after progression on afatinib in patients with EGFR mutation-positive non-small-cell lung cancer and acquired resistance to gefitinib or erlotinib. Lung Cancer 113, 51–58 (2017).
Article PubMed Google Scholar
Soria, J. C. et al. Osimertinib in untreated EGFR-mutated advanced non-small-cell lung cancer. New Engl. J. Med. 378, 113–125 (2018).
Article CAS PubMed Google Scholar
Ramalingam, S. S. et al. Overall survival with osimertinib in untreated, EGFR-mutated advanced NSCLC. New Engl. J. Med. 382, 41–50 (2020).
Article CAS PubMed Google Scholar
Red Brewer, M. et al. Mechanism for activation of mutated epidermal growth factor receptors in lung cancer. Proc. Natl Acad. Sci. USA 110, E3595–E3604 (2013).
Article PubMed PubMed Central Google Scholar
Bergethon, K. et al. ROS1 rearrangements define a unique molecular class of lung cancers. J. Clin. Oncol. 30, 863–870 (2012).
Article CAS PubMed PubMed Central Google Scholar
Davies, K. D. & Doebele, R. C. Molecular pathways: ROS1 fusion proteins in cancer. Clin. Cancer Res. 19, 4040–4045 (2013).
Article CAS PubMed PubMed Central Google Scholar
Okamura, R. et al. Analysis of NTRK alterations in pan-cancer adult and pediatric malignancies: implications for NTRK-targeted therapeutics. JCO Precis. Oncol. 2018, https://doi.org/10.1200/PO.18.00183 (2018).
Shaw, A. T. et al. Lorlatinib in non-small-cell lung cancer with ALK or ROS1 rearrangement: an international, multicentre, open-label, single-arm first-in-man phase 1 trial. Lancet Oncol. 18, 1590–1599 (2017).
Article CAS PubMed PubMed Central Google Scholar
Shaw, A. T. et al. Crizotinib in ROS1-rearranged non-small-cell lung cancer. New Engl. J. Med. 371, 1963–1971 (2014).
Article PubMed CAS Google Scholar
Wang, J. et al. Clinical outcomes of EGFR kinase domain duplication to targeted therapies in NSCLC. Int. J. Cancer 144, 2677–2682 (2019).
Article CAS PubMed Google Scholar
Foster, S. A. et al. Activation mechanism of oncogenic deletion mutations in BRAF, EGFR, and HER2. Cancer Cell 29, 477–493 (2016).
Article CAS PubMed Google Scholar
Doody, J. F. et al. Inhibitory activity of cetuximab on epidermal growth factor receptor mutations in non small cell lung cancers. Mol. Cancer Ther. 6, 2642–2651 (2007).
Article CAS PubMed Google Scholar
Perez-Torres, M., Guix, M., Gonzalez, A. & Arteaga, C. L. Epidermal growth factor receptor (EGFR) antibody down-regulates mutant receptors and inhibits tumors expressing EGFR mutations. J. Biol. Chem. 281, 40183–40192 (2006).
Article CAS PubMed Google Scholar
Yun, C. H. et al. Structures of lung cancer-derived EGFR mutants and inhibitor complexes: mechanism of activation and insights into differential inhibitor sensitivity. Cancer Cell 11, 217–227 (2007).
Article CAS PubMed PubMed Central Google Scholar
Carey, K. D. et al. Kinetic analysis of epidermal growth factor receptor somatic mutant proteins shows increased sensitivity to the epidermal growth factor receptor tyrosine kinase inhibitor, erlotinib. Cancer Res. 66, 8163–8171 (2006).
Article CAS PubMed Google Scholar
Gilmer, T. M. et al. Impact of common epidermal growth factor receptor and HER2 variants on receptor activity and inhibition by lapatinib. Cancer Res. 68, 571–579 (2008).
Article CAS PubMed Google Scholar
Yun, C. H. et al. The T790M mutation in EGFR kinase causes drug resistance by increasing the affinity for ATP. Proc. Natl Acad. Sci. USA 105, 2070–2075 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Sholl, L. M. et al. Lung adenocarcinoma with EGFR amplification has distinct clinicopathologic and molecular features in never-smokers. Cancer Res. 69, 8341–8348 (2009).
Article CAS PubMed PubMed Central Google Scholar
Regales, L. et al. Dual targeting of EGFR can overcome a major drug resistance mutation in mouse models of EGFR mutant lung cancer. J. Clin. Investig. 119, 3000–3010 (2009).
CAS PubMed PubMed Central Google Scholar
Whitehead, R. H., VanEeden, P. E., Noble, M. D., Ataliotis, P. & Jat, P. S. Establishment of conditionally immortalized epithelial cell lines from both colon and small intestine of adult H-2Kb-tsA58 transgenic mice. Proc. Natl Acad. Sci. USA 90, 587–591 (1993).
Article ADS CAS PubMed PubMed Central Google Scholar
Johns, T. G. et al. Antitumor efficacy of cytotoxic drugs and the monoclonal antibody 806 is enhanced by the EGF receptor inhibitor AG1478. Proc. Natl Acad. Sci. USA 100, 15871–AG15876 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Johns, T. G. et al. Novel monoclonal antibody specific for the de2-7 epidermal growth factor receptor (EGFR) that also recognizes the EGFR expressed in cells containing amplification of the EGFR gene. Int. J. Cancer 98, 398–408 (2002).
Article CAS PubMed Google Scholar
Wang, S. E. et al. HER2 kinase domain mutation results in constitutive phosphorylation and activation of HER2 and EGFR and resistance to EGFR tyrosine kinase inhibitors. Cancer Cell 10, 25–38 (2006).
Article PubMed CAS Google Scholar
Brown, B. P. et al. On-target resistance to the mutant-selective EGFR inhibitor osimertinib can develop in an allele-specific manner dependent on the original EGFR-activating mutation. Clin. Cancer Res. 25, 3341–3351 (2019).
Article CAS PubMed PubMed Central Google Scholar
Comar, W. D., Schubert, S. M., Jastrzebska, B., Palczewski, K. & Smith, A. W. Time-resolved fluorescence spectroscopy measures clustering and mobility of a G protein-coupled receptor opsin in live cell membranes. J. Am. Chem. Soc. 136, 8342–8349 (2014).
Article CAS PubMed PubMed Central Google Scholar
Borowicz, S. et al. The soft agar colony formation assay. J. Vis. Exp. e51998 https://doi.org/10.3791/51998 (2014).
Horibata, S., Vo, T. V., Subramanian, V., Thompson, P. R. & Coonrod, S. A. Utilization of the soft agar colony formation assay to identify inhibitors of tumorigenicity in breast cancer cells. J. Vis. Exp. e52727 https://doi.org/10.3791/52727 (2015).
Song, Y. et al. High-resolution comparative modeling with RosettaCM. Structure 21, 1735–1742 (2013).
Article CAS PubMed Google Scholar
Case, D. A. et al. AMBER 2018 (University of California, 2018).
Roe, D. R. & Cheatham, T. E.III. PTRAJ and CPPTRAJ: software for processing and analysis of molecular dynamics trajectory data. J. Chem. Theory Comput. 9, 3084–3095 (2013).
Article CAS PubMed Google Scholar
Miller, B. R. III et al. MMPBSA.py: an efficient program for end-state free energy calculations. J. Chem. Theory Comput. 8, 3314–3321 (2012).
Article CAS PubMed Google Scholar
Frampton, G. M. et al. Development and validation of a clinical cancer genomic profiling test based on massively parallel DNA sequencing. Nat. Biotechnol. 31, 1023–1031 (2013).
Article CAS PubMed PubMed Central Google Scholar
Ross, D. S. et al. Next-generation assessment of human epidermal growth factor receptor 2 (ERBB2) amplification status: clinical validation in the context of a hybrid capture-based, comprehensive solid tumor genomic profiling assay. J. Mol. Diagn. 19, 244–254 (2017).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work is supported through NIH NCI R01CA227833 and NIH NIGMS R01 GM080403. C.M.L. was also supported in part through P30CA086485, UG1CA233259, and 5P01CA129243-12. Z.D. is supported by the 2018 AACR-AstraZeneca Lung Cancer Research Fellowship, Grant Number 18-40-12-DU. Work in the Meiler laboratory is also supported through the NIH (R01GM080403, R01GM099842, and R01GM073151). A.W.S. and S.K. are supported through the National Science Foundation (CHE-1753060). B.P.B. is supported through the NIH by a Ruth L. Kirschstein NRSA fellowship (F30DK118774). Work at MSKCC is supported in part by the NCI Cancer Center Support Grant/Core Grant (P30CA008748). We sincerely thank John Kuriyan, Portia L. Thomas, and Wade T. Iams for their productive conversations, comments, and suggestions for this manuscript.

Author information

These authors contributed equally: Zhenfang Du, Benjamin P. Brown.

Authors and Affiliations

Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Zhenfang Du, Jean-Nicolas Gallant, Yun-Kai Zhang, Yingjun Yan, Monica Red-Brewer & Christine M. Lovly
Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, USA
Benjamin P. Brown
Department of Chemistry, Vanderbilt University, Nashville, TN, USA
Benjamin P. Brown & Jens Meiler
Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
Benjamin P. Brown & Jens Meiler
Department of Chemistry, University of Akron, Akron, OH, USA
Soyeon Kim & Adam W. Smith
Department of Molecular Pathology, Memorial Sloan Kettering Cancer Center, New York City, NY, USA
Donna Ferguson, Gowtham Jayakumaran, Ryma Benayed, Ahmet Zehir & Marc Ladanyi
Foundation Medicine, Inc., Cambridge, MA, USA
Dean C. Pavlick, Siraj M. Ali & Alexa B. Schrock
Institute for Drug Discovery, Leipzig University Medical School, Leipzig, Germany
Jens Meiler
Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, TN, USA
Christine M. Lovly

Authors

Zhenfang Du
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin P. Brown
View author publications
You can also search for this author in PubMed Google Scholar
Soyeon Kim
View author publications
You can also search for this author in PubMed Google Scholar
Donna Ferguson
View author publications
You can also search for this author in PubMed Google Scholar
Dean C. Pavlick
View author publications
You can also search for this author in PubMed Google Scholar
Gowtham Jayakumaran
View author publications
You can also search for this author in PubMed Google Scholar
Ryma Benayed
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Nicolas Gallant
View author publications
You can also search for this author in PubMed Google Scholar
Yun-Kai Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yingjun Yan
View author publications
You can also search for this author in PubMed Google Scholar
Monica Red-Brewer
View author publications
You can also search for this author in PubMed Google Scholar
Siraj M. Ali
View author publications
You can also search for this author in PubMed Google Scholar
Alexa B. Schrock
View author publications
You can also search for this author in PubMed Google Scholar
Ahmet Zehir
View author publications
You can also search for this author in PubMed Google Scholar
Marc Ladanyi
View author publications
You can also search for this author in PubMed Google Scholar
Adam W. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Jens Meiler
View author publications
You can also search for this author in PubMed Google Scholar
Christine M. Lovly
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.D., B.P.B., J.-N.G., J.M., M.R.-B., and C.M.L. conceived and designed the study. Z.D. constructed mutations within linker of EGFR-KDD and performed most of the in vitro cell-based assays. B.P.B. and J.M. designed the computational structural modeling experiments, and B.P.B. performed the structural modeling. S.K. and A.W.S. designed the PIE-FCCS experiments, and S.K. performed PIE-FCCS. D.C.P., S.M.A., and A.B.S. acquired and analyzed the Foundation Medicine dataset. D.F., G.J., R.B., A.Z., and M.L. acquired and analyzed the MSKCC dataset. J.-N.G. constructed the dimer interface mutations and stably expressed them into NR6 and YAMC cells. Z.D. and Y.-K.Z. performed cell viability assay. Y.Y. maintained all the stocks of cells, DNA and reagents and provided technical support. Z.D., B.P.B., J.M., and C.M.L. wrote the manuscript. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Jens Meiler or Christine M. Lovly.

Ethics declarations

Competing interests

D.C.P., S.M.A., and A.B.S. are employees of Foundation Medicine, Inc. C.M.L. is a consultant/advisory board member for Pfizer, Novartis, Astra Zeneca, Genoptix, Genentech, Syros, Cepheid, Roche, Sequenom, Ariad, Takeda, Foundation Medicine, Blueprints Medicine, Achilles, Amgen and reports receiving commercial research grants from Xcovery and Novartis. M.L. has received advisory board compensation from Merck, Lilly Oncology, AstraZeneca, Bristol-Myers Squibb, Takeda, and Bayer, and research support from LOXO Oncology, Helsinn Healthcare, Elevation Oncology and Merus. No potential conflicts of interest were disclosed by the other authors.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Reporting Summary

Source data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Du, Z., Brown, B.P., Kim, S. et al. Structure–function analysis of oncogenic EGFR Kinase Domain Duplication reveals insights into activation and a potential approach for therapeutic targeting. Nat Commun 12, 1382 (2021). https://doi.org/10.1038/s41467-021-21613-6

Download citation

Received: 19 March 2020
Accepted: 28 January 2021
Published: 02 March 2021
DOI: https://doi.org/10.1038/s41467-021-21613-6

This article is cited by

Lung cancer in patients who have never smoked — an emerging disease
- Jaclyn LoPiccolo
- Alexander Gusev
- Pasi A. Jänne
Nature Reviews Clinical Oncology (2024)
The genomic and transcriptomic landscape of advanced renal cell cancer for individualized treatment strategies
- K. de Joode
- W. S. van de Geer
- A. A. M. van der Veldt
Scientific Reports (2023)
Rare molecular subtypes of lung cancer
- Guilherme Harada
- Soo-Ryum Yang
- Alexander Drilon
Nature Reviews Clinical Oncology (2023)
Comprehensive characterization of posttranscriptional impairment-related 3′-UTR mutations in 2413 whole genomes of cancer patients
- Wenqing Wei
- Wenyan Gao
- Zhihua Liu
npj Genomic Medicine (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.