Aromatase inhibition remodels the clonal architecture of estrogen-receptor-positive breast cancers

Miller, Christopher A.; Gindin, Yevgeniy; Lu, Charles; Griffith, Obi L; Griffith, Malachi; Shen, Dong; Hoog, Jeremy; Li, Tiandao; Larson, David E.; Watson, Mark; Davies, Sherri R; Hunt, Kelly; Suman, Vera J.; Snider, Jacqueline; Walsh, Thomas; Colditz, Graham A.; DeSchryver, Katherine; Wilson, Richard K.; Mardis, Elaine R.; Ellis, Matthew J.

doi:10.1038/ncomms12498

Download PDF

Article
Open access
Published: 09 August 2016

Aromatase inhibition remodels the clonal architecture of estrogen-receptor-positive breast cancers

Christopher A. Miller ORCID: orcid.org/0000-0003-4266-6700^1,2,
Yevgeniy Gindin¹^nAff11,
Charles Lu¹^nAff12,
Obi L Griffith ORCID: orcid.org/0000-0002-0843-4271^1,3,4,
Malachi Griffith ORCID: orcid.org/0000-0002-6388-446X^1,4,5,
Dong Shen¹^nAff13,
Jeremy Hoog³,
Tiandao Li ORCID: orcid.org/0000-0003-1650-0555¹,
David E. Larson ORCID: orcid.org/0000-0001-7934-5906^1,5,
Mark Watson⁶,
Sherri R Davies³,
Kelly Hunt⁷,
Vera J. Suman⁸,
Jacqueline Snider³,
Thomas Walsh⁹,
Graham A. Colditz^4,9,
Katherine DeSchryver^3,9,
Richard K. Wilson^1,2,3,4,5,
Elaine R. Mardis^1,3,4,5 &
…
Matthew J. Ellis^1,3,10

Nature Communications volume 7, Article number: 12498 (2016) Cite this article

9840 Accesses
59 Citations
134 Altmetric
Metrics details

Subjects

Abstract

Resistance to oestrogen-deprivation therapy is common in oestrogen-receptor-positive (ER+) breast cancer. To better understand the contributions of tumour heterogeneity and evolution to resistance, here we perform comprehensive genomic characterization of 22 primary tumours sampled before and after 4 months of neoadjuvant aromatase inhibitor (NAI) treatment. Comparing whole-genome sequencing of tumour/normal pairs from the two time points, with coincident tumour RNA sequencing, reveals widespread spatial and temporal heterogeneity, with marked remodelling of the clonal landscape in response to NAI. Two cases have genomic evidence of two independent tumours, most obviously an ER− ‘collision tumour’, which was only detected after NAI treatment of baseline ER+ disease. Many mutations are newly detected or enriched post treatment, including two ligand-binding domain mutations in ESR1. The observed clonal complexity of the ER+ breast cancer genome suggests that precision medicine approaches based on genomic analysis of a single specimen are likely insufficient to capture all clinically significant information.

Patterns of genomic change in residual disease after neoadjuvant chemotherapy for estrogen receptor-positive and HER2-negative breast cancer

Article 03 September 2021

Molecular profiling of aromatase inhibitor sensitive and resistant ER+HER2- postmenopausal breast cancers

Article Open access 07 July 2023

CDK4/6i-treated HR+/HER2- breast cancer tumors show higher ESR1 mutation prevalence and more altered genomic landscape

Article Open access 22 February 2024

Introduction

Despite the success of therapeutic approaches that suppress oestrogen production or inhibit ER function, resistance to endocrine therapy is common and accounts for the majority of breast cancer deaths¹. The identification of endocrine resistance in primary ER+ breast cancer is a critical endeavour that has been widely explored^2,3 but the current transcriptional profiling approaches do not typically identify drivers of resistance, nor do they take into account resistance mechanisms that evolve from a minor subpopulation of cells.

Breast carcinomas are driven by accumulated somatic alterations. Those present in the initiating cell define the founder clone and persist in every cell of the tumour. As tumours grow, they accumulate additional mutations and these may give rise to subclonal populations with distinct characteristics^4,5. These subpopulations compete and evolve^6,7,8,9 and often harbour mutations that eventually confer resistance to specific therapies^{9,10,11,12,13}. When this occurs, the resistant tumour cells will represent a proportionally larger fraction of the tumour mass, as the susceptible population diminishes.

Another source of heterogeneity is synchronous breast cancer, most obviously when the disease is bilateral at diagnosis, or when two tumours are far apart in the same breast. These events are clinically significant as bilateral disease can signify the presence of a germline predisposition as well as a worse prognosis¹⁴. However when several independent breast cancers evolve in very close proximity, it can be impossible, without genomic techniques, to diagnose ‘collision tumours’ with different founder clones. The incidence of collision tumours of the breast and their clinical significance is therefore unknown, with most of the literature focused on tumours of different tissue origins growing together at a metastatic site¹⁵.

While subclonal heterogeneity has been extensively studied in breast cancer^16,17, little is known about the interplay between clonal evolution and therapy response. The effects of short-term (14 day) aromatase inhibitor (AI) treatment have been explored on the transcriptomic level using microarrays^18,19, but it is unclear what population changes these expression differences reflect, that is, whether existing cells are being ‘remodelled’ at the transcriptional level, or whether the changes are due to continued expansion of cells harbouring resistance mutations. The effects of longer-term AI inhibition at the whole-genome level remain unknown.

In this study, we provide a first step towards answering these questions by comprehensively characterizing the effects of neoadjuvant aromatase inhibitor (NAI) therapy on the genomes and transcriptomes of a group of ER-positive breast tumours. From a set of previously whole-genome sequenced (WGS) tumours²⁰, we selected 22 Luminal A or Luminal B subtype tumours classified as either ‘aromatase-inhibitor-sensitive’ (N=12, median surgical Ki67=1.1%, range 0–7.0%) or ‘aromatase-inhibitor-resistant’ (N=10, median surgical Ki67 24.6%, range 10.4–47.1%). For each of these patients, we then performed WGS on matching tumours sampled by core biopsy at surgical resection, as well as on additional core biopsies taken from a subset of the baseline (pre-treatment, N=5) and surgical resection tumours (post-treatment, N=6). RNA sequencing was performed on 20 of the baseline tumours and on 18 of the surgical tumours. To obtain additional information on treatment emergent mutations, a subset of these tumours, along with 38 additional cases were analysed at greater sequencing depth using targeted capture with a gene panel. Together, these data allow us to characterize the genomic landscapes and clonal architectures of breast tumours and show that they are often dramatically altered during NAI therapy.

Results

Mutational landscape

Comprehensive genomic characterization of these 22 patients’ tumours (Table 1, Supplementary Data 1) revealed a total of 42,300 somatic single-nucleotide variants (SNVs) and indels in non-repetitive loci (tiers 1–3; ref. 21) of all tumour samples, with a median of 947 somatic SNVs and 23 indels per tumour (Supplementary Data 2 and 3). A total of 2,061 somatic variants were in the coding regions of the resulting proteins, with a median of 55 per patient. One tumour, BRC26, harboured a low mutation count (55), likely due to lower sequence coverage (Supplementary Data 4). Coding variant counts included 1,166 coding-region SNVs and indels not detected in the original study of the baseline tumours, obtained both from analysis of the surgical samples and via deeper sequencing coverage of baseline samples. Of these, 256 were absent or low frequency (<5% variant allele fraction (VAF) in the baseline samples, making them challenging or impossible to detect from standard-depth WGS without information from postsurgical or second core biopsy samples. The number of detectable low-frequency mutations was also impacted by tumour purity, which ranged from 12 to 100% (Supplementary Data 4). We identified 72 expressed coding mutations specific to the surgical samples, based on our limits of detection (Supplementary Note 1).

Table 1 Samples whole-genome sequenced in this study and a description of key clinical and biomarker parameters.

Full size table

Copy number profiles were largely concordant between baseline and surgical samples, with a few notable exceptions (Supplementary Fig. 1). Eight samples harboured at least one event greater than 100 Mb in size that differed between the baseline and surgical samples, including one tumour (BRC48) that changed from diploid to triploid over the course of treatment. We also detected and validated a total of 1,695 structural variants (SVs) across 55 samples. The mean value was 30 SVs per sample (range 1 to 141) and had no significant correlation with either AI response or EFS (Supplementary Data 5). Overall, 71.8% of validated SVs present in the surgical samples were also detected in the baseline samples. Gene fusion-causing alterations were detected in 21 of 22 patients, with only moderate concordance between multiple samples from the same individual (Supplementary Note 2, Supplementary Data 6). No fusions involving ESR1 were observed. Transcriptome analysis revealed few differentially expressed genes and unsupervised hierarchical clustering did not identify clear expression signatures that distinguished AI-sensitive or -resistant tumours (Supplementary Figs 2–6, Supplementary Note 3, Supplementary Data 7 and 8).

Subclonal inference by clustering of variant allele fractions

Using somatic mutations from all the samples for each patient, we applied the SciClone algorithm²² to cluster the VAFs of copy-number neutral SNVs, thus inferring the clonal architecture of each sample (Supplementary Fig. 7). Two samples, BRC17 and BRC48, had large-scale ploidy changes, which prevented accurate automated clustering of the clonal subpopulations, and a third case, BRC18, had a very high mutation rate and insufficient separation between clusters. The remaining 19 cases included 11 time points with multiple core biopsies. Several patterns of altered subclonal composition and evolution were observed in the context of AI treatment response, and are described below.

Intertwined tumours of independent origin

In one patient (BRC38), the surgical tumour shared no somatic SNVs or indels with the baseline tumour, despite having matching identity from germline SNP concordance (Fig. 1a). The ER+ baseline tumour contained a CDH1 splice-site mutation in the founder clone, and subclonal missense mutations in FOXA1 and FOXQ1. The surgical tumour specifically lacked these mutations or indeed other SNVs or indels in genes strongly implicated in cancer, but had focal amplifications containing MYC, EGFR and CCND1 (Fig. 1b). Although the baseline tumour also had amplified MYC and CCND1, these were distinct events with breakpoints different than those observed in the surgical tumour. Both tumours contained numerous other copy number alterations, but none were shared except for a 1q amplification (with unresolvable breakpoints in the centromere). This event may have occurred in a shared cell of origin, or may be a case of homoplasy. Several gene fusions also were detected with no overlap between these two ‘collision’ tumours.

**Figure 1: Collision tumours of independent origin and ER status in BRC38.**

RNAseq data from this patient’s tumour samples reveal a transition from baseline ER-positive (FPKM 131.65) to post-treatment ER-negative (FPKM of 0.99), and ER immunohistochemistry supported this observation, with ER Allred scores of 8 in the pre-treatment sample and 2 in the sequenced surgical sample (Fig. 1c–e, Supplementary Data 7). A second IHC result from a different portion of the post-treatment surgical sample showed residual ER-positivity, suggesting that the two tumours continued to co-exist, but that the surgical core we sequenced did not capture the ER-positive residual tumour. There was a 3-fold increase in Ki67 between baseline and surgery and this patient was classified as AI-resistant.

A second patient, BRC21, contained genomic evidence for a cryptic second tumour, inferred via a pattern of subclonal evolution that is impossible to explain with only a single tumour (Supplementary Fig. 7). Specifically, the variants in cluster 3 shifted from nearly absent at baseline to a VAF that was greater than any other cluster at surgery. This pattern suggests retention of the baseline tumour, coupled with the emergence of a second tumour containing a PIK3CA mutation (H1047R). These two tumours comprise 19 and 28 percent of the surgical sample, respectively, and this combined percentage is more parsimonious with the 70% estimate from pathology than would be a single-tumour solution with purity of less than 30%. Unlike BRC38, both appear to be ER+ tumours, with an Allred score at surgery of 6. Though both measures are limited somewhat by the low purity of the surgical sample, this patient was classified as AI-sensitive, and switched from an intrinsic subtype of LumA to Normal-like.

Simple and clonally stable tumours

Only a single sample, BRC14, harboured no detectable subclonal cell populations, either at baseline or following AI treatment (Fig. 2). This tumour was categorized as AI-sensitive. In keeping with AI responsiveness, the tumour intrinsic subtype switched from Luminal B to Luminal A.

**Figure 2: Simple and stable clonal structure in BRC14.**

Complex and dynamic tumours

Eighteen patients (81.8%) had tumours containing multiple subclonal cell populations, some of which were substantially altered during the course of treatment. In BRC15, the two baseline samples had similar clonal composition, but the two surgical samples contained significant spatial heterogeneity as well as extensive remodelling of the clonal architecture (Fig. 3, Supplementary Note 4). This tumour had over 2.5-fold higher ESR1 mRNA expression levels at surgery than at baseline and was sensitive to aromatase-inhibitor treatment as determined by the drop in Ki67 level from 24 to 1%.

**Figure 3: Subclonal complexity and response to AI inhibition in BRC15.**

Four-dimensional clustering revealed a founder clone and four subclones. The founder clone contained expressed mutations in PIK3CA (H1047R) and ARID2 (frameshift insertion). Cluster two was the dominant subclone in the pre-treatment sample and post-treatment sample 2, but comprised only about 11% of post-treatment sample 1. Cluster three, which contained expressed mutations in BRAF (K601E) and MSH6 (nonsense), made up 92% of the baseline tumour, but was absent in the post-treatment samples, suggesting that it harboured enhanced susceptibility to oestrogen deprivation. Cluster four was undetectable in the baseline samples (limit of detection ∼1% VAF), but constituted 89% of post-treatment core 2, where clonal expansion may have been driven by a second mutation in PIK3CA (G118D). Similarly, cluster 5 was absent pre-treatment but present only in post-treatment core 1. The copy number profiles between all samples were largely concordant, with notable differences including amplification of chromosome 8 in surgical sample 1 and deletion of one copy of chromosome 9 in surgical sample 2. Three samples from the periphery of this tumour (all classified as Ductal Carcinoma in situ) were obtained from the surgical formalin-fixed block; all carried the founder clone BIRC6 and PIK3CA mutations, and one carried the BRAF mutation found in cluster 3. Additional descriptions of these sample genotypes can be found in the Supplementary Note 5 and Supplementary Data 8.

BRC41 provides another striking example of tumour evolution under oestrogen deprivation, with almost complete loss of a subclone that comprised 86% of the baseline tumour cells (Supplementary Fig. 7). This subclone loss was coincident with the emergence of a rare subclone that was present in about 2% of baseline tumour cells, but expanded to 84% of the surgical sample. The responsive baseline subclone included an expressed KDM8 (R227H) histone demethylase mutation. The emergent resistant subclone contained an expressed mutation in BCAS3 (K533N), a gene with some evidence linking it to tamoxifen resistance²³. Unlike BRC15, the paired core samples taken at baseline and at surgery were highly concordant, suggesting minimal spatial heterogeneity in this tumour.

In addition to these two cases, 13 other tumours were computationally classified as clonally complex and dynamic (Supplementary Fig. 7). To do so, we used the mean absolute difference in clonal fractions before and after treatment to derive a clonal instability index, where 0 indicates no change in clonal architecture and 1 indicates completely distinct tumours (see ‘Methods’ section, Supplementary Data 10). Among the 15 patients in this complex and dynamic group, we observed clonal instability values ranging from 0.25 to 0.90 (median 0.53), reflecting the large degree of temporal change in clonality during NAI treatment. Three additional cases were manually assigned to this category, for a total of 18 complex and dynamic tumours. Of these, 10 were characterized as responsive and 8 were resistant to AI. There was no significant correlation between clonal instability and AI response, as measured by either reduction in Ki67 or Ki67 at surgery. We observed intrinsic subtype switching in nine of these samples, with all changing from Luminal B to Luminal A.

Complex and stable tumours

Only one sample (BRC11) had detectable subclonal populations, but showed little evidence of treatment-induced remodelling, with a clonal instability score of 0.097 (Fig. 4). The likely founder clone carried missense mutations in ARID1A (p.A1239T), PIK3CA (p.H1047R) and a frame-shift deletion in TP53. The tumour was classified as treatment-resistant with Ki67 rising from 0.25 to 0.417 at surgery.

**Figure 4: Complex and stable clonal structure in BRC11.**

Targeted sequencing and treatment-associated heterogeneity

To examine AI treatment-associated changes at the level of a sequencing panel typical of clinical NGS-based diagnostic assays, we developed a hybrid capture panel to sequence 83 breast cancer-related genes in 19 of the WGS cases, plus an additional 38 pre/post pairs (Table 2, Supplementary Note 5, Supplementary Data 11 and 12). Of these 57 sample sets, 38 (from both groups) had paired baseline and surgical samples as described above, the other 19 had samples from baseline and end of treatment (most of which were triaged to chemotherapy). The mean depth of coverage was × 103.4 in this assay, compared with a mean of × 44.1 in the WGS and × 79.4 from the combined WGS and validation sequencing on the original 22 patients

Table 2 Thirty-eight additional cases sequenced with an 83-gene panel and a description of key clinical and biomarker parameters.

Full size table

We identified a total of 255 mutations from this panel test (median 3 per patient), three of which occur in genes that are high-priority drug targets. The ESR1 D538G mutation is a well-recognized ligand-binding domain (LBD) mutation, associated with ligand-independent transcriptional activity^24,25 and was present only in the post-treatment sample (11% VAF). The low post-treatment Ki67 suggests, however, that at 11% the ESR1 D538G allele was not yet sufficiently dominant to produce elevated proliferation despite AI treatment. The second treatment-emergent LBD mutation was K481N, present at 14% VAF at baseline and ∼40% VAF post treatment. The increasing VAF of this K481N mutation was associated with Ki67-defined resistance, since the Ki67 was 11% in the post-treatment sample, sufficiently high to elevate the risk of relapse²⁶. A third LBD mutation, E380Q, was present at a low frequency at baseline without enrichment suggesting that this mutation does not strongly confer AI resistance, as suggested by an earlier study of ER+ patient-derived xenografts¹⁷.

To determine whether somatic mutations in particular genes were driving clonal response, we calculated the difference between pre-treatment and post-treatment VAFs for each gene with at least five non-silent mutations (Supplementary Fig. 3). Though NCOR1 mutations, in particular, trended towards higher VAFs in the post-treatment samples, a larger cohort would be needed to establish significance in the context of multiple testing correction. There also was no significant difference in VAF shifts between mutations in different PIK3CA hotspots (near amino acids 1,047 or 543) or between mutations in different TP53 domains (DNA-binding versus other).

Discussion

In this study, we performed deep genomic characterization of primary ER+ breast tumours at baseline and after 16 to 18 weeks of AI therapy. By combining genomic sequencing data at high depth on mutated sites, copy number and structural variant discovery, comparisons of RNA expression to derive intrinsic subtypes, and clinical pathology markers such as Ki67 and ER Allred scores, we have generated comprehensive information about the range of changes that occur when ER+ breast cancers are subjected to oestrogen deprivation. Overall, four genomic patterns were observed: (1) two intertwined but genomically separate ‘collision tumour’ patterns; (2) ‘clonally simple and treatment stable’ patterns; (3) ‘clonally complex and treatment dynamic’ patterns; and (4) ‘clonally complex and treatment stable’ patterns.

In two cases, we observed collision tumours comprised separate malignancies, and in one case an ER+ tumour with indolent clinical features was replaced by an ER− tumour with aggressive clinical features. Presumably the ER− tumour genotype was missed in the pre-treatment sample, and became more readily detectable over time due to the regression of the ER+ tumour. It is well recognized that a subset of ER+ breast cancers have ‘lost’ ER expression upon relapse²⁷, and earlier studies report that ER loss in the context of neoadjuvant endocrine therapy is associated with a much higher risk of relapse²⁶. Together, these suggest that comparative ER testing should be repeated on residual breast tumours after NAI therapy to identify discernable changes in ER positivity.

Clonally stable tumours were rare in this cohort, with only one simple and one complex tumour. The simple tumour (BRC14) switched from Luminal B to Luminal A with a dramatic decline in Ki67 (44.2 to 1.2%). The complex tumour (BRC11) likely contained innate resistance in the founder clone (with no evidence of sensitivity to the antiproliferative effects of AI), which enabled it to maintain both its clonal structure and its intrinsic subtype despite treatment.

Most tumours analysed (18 of 22) were clonally heterogeneous and contained subclonal populations whose relative proportions changed, often dramatically, during AI treatment. This high clonal instability, over 4 months of AI treatment, was likely due to a selective growth advantage for a subclone in the presence of oestrogen deprivation. While it is hard to disprove the null hypothesis, that is, we are simply observing spatial heterogeneity in a very complex tumour, cases like BRC41 displayed very high levels of temporal change with little difference between spatially separate cores taken concurrently. This evidence supports the argument for tumour evolution under selective pressure as the likely explanation. On the other hand, 63% of the multiple concurrent biopsies that we sequenced exhibited spatial heterogeneity. Thus, the shifts in clonal architecture that we observed are likely due to a combination of therapy-related subclonal selection and sampling of different subclonal populations within the tumour at different time points.

Few resistance mechanisms were revealed by this study, as only two ESR1 mutations were treatment emergent at the 16 to 18 week time point and no ESR1 translocations were detected by WGS or RNA seq¹⁷. This suggests that ESR1 mutation-driven resistant subclones emerge over long periods of time, although the presence of several easily detectable ESR1 mutations at ∼100X coverage suggests that deeper sequencing might be important to detecting these mutations at an early-on treatment time point. The ALTERNATE neoadjuvant endocrine therapy trial is comparing an AI with fulvestrant, an ER-degrading agent (NCT01953588). The sample size for ALTERNATE (over 1,000 patients) and the longer duration of treatment before surgery (24 weeks) will eventually allow hypotheses based on detecting low frequency ESR1 mutant alleles in the post-treatment sample to be explored more thoroughly, and will determine whether the choice of endocrine agent influences the rate at which ESR1 mutations evolve.

Interestingly, two tumours were diagnosed with activating ERBB2 mutations²⁸, however, both were in AI-sensitive tumours, with very low on-treatment KI67 values. This suggests, at least anecdotally, that ERBB2 activating mutations do not cause intrinsic endocrine therapy resistance, with high on-AI-treatment Ki67 values that were noted when studying HER2 amplified ER+ tumours in the neoadjuvant endocrine setting²⁹. One other treatment emergent mutation in ERBB2 was noted, S157F, but the functional significance of the mutation is uncertain. A final point about mutation-driven therapeutic hypotheses concerns the pre- and post-treatment PIK3CA data. It has been previously reported that there is no interaction between PIK3CA mutation status and response to neoadjuvant endocrine therapy^20,30.The lack of positive or negative selection for PIK3CA mutations observed in the present study is compatible with this conclusion. Of the three tumours with PIK3CA mutations detected at surgery, but not at baseline, two were collision tumours, where the second tumour containing mutant PIK3CA was missed by the initial biopsies. Thus, spatial heterogeneity explains this pattern, and also is the likely cause of the apparent gain of a G118D mutation in one of the BRC15 post-treatment samples. This is supported by the lack of trend toward positive or negative selection in the 16 kinase domain mutant cases studied herein (Table 3).

Table 3 Druggable mutations observed in samples assayed with targeted sequencing.

Full size table

In conclusion, caution needs to be exercised when interpreting clinical genomic results from a single core biopsy at a single time point. Higher depth of massively parallel sequencing, coupled with analysis tools to detect low-VAF variants, analysis of multiple tumour samples and tracking of genomic evolution in post-therapy samples all provide a more complete picture. These are feasible techniques, as we illustrate, and the data we report can be considered a proof of principle, supporting the idea that investment is needed to collect and study very large cohorts in such a manner. These studies will ultimately determine the clinical utility of genomic analysis in ER+ breast cancer that is more comprehensive than that afforded by analysis of a single pre-treatment sample.

Methods

Sample acquisition and clinical characterization

The samples were from two neoadjuvant endocrine therapy trials of postmenopausal women with clinical stage II to III ER-positive (Allred score 6–8) breast cancer (ACOSOG-Z1031, Alliance, NCT00265759 and NCT00084396) that have been previously described^31,32. All the samples were studied that had appropriate consents, at least 70% tumour content (by nuclei), and available DNA from both the baseline and surgical time points.

Genomic library preparation and sequencing

Library preparation was performed on DNA extracted from flash-frozen fresh tissue biopsies and matched peripheral blood, then sequenced. Both were done as described previously²⁰.

Copy number and structural variant detection

Copy number aberrations were detected using CopyCat v1.6.9 (https://github.com/chrisamiller/copycat) (Supplementary Software 2), with purity estimates derived from SNV allele fractions. (Table 1) Structural variants were detected as previously described^20,33. For measurement of concordance, SVs were considered to be the same event if the ends of each call were localized within 500 bp. Gene fusions were identified using INTEGRATE v0.1e, with default parameters³⁴.

SNV and indel detection

We detected SNVs using the union of three callers: (1) samtools version r963 [2] (params: -A -B) filtered by SNP-filter version v1 and intersected with Somatic Sniper version 1.0.2 [3] (params: -F vcf -q 1 -Q 15) filtered by false-positive version v1 (params: --bam-readcount-version 0.4 --bam-readcount-min-base-quality 15) then somatic-score-mapping-quality version v1 (params:--min-mapping-quality 40 --min-somatic-score 40); (2) VarScan 2.2.6 [4] filtered by varscan-high-confidence version v1 then false-positive version v1 (params: --bam-readcount-version 0.4 --bam-readcount-min-base-quality 15); (3) Strelka version 0.4.6.2 [5] (params: isSkipDepthFilters=0).

Short insertions and deletions were detected using the union of four callers: (1) gatk-somatic-indel version 5336 [6] filtered by false-indel version v1 (params: --bam-readcount-version 0.4 --bam-readcount-min-base-quality 15); (2) pindel version 0.5 [7] filtered by pindel-somatic-calls version v1 then pindel-vaf-filter version v1 (params: --variant-freq-cutoff=0.08) then pindel-read-support version v1; (3) VarScan 2.2.6 [6] filtered by varscan-high-confidence-indel version v1 then false-indel version v1 (params: --bam-readcount-version 0.4 --bam-readcount-min-base-quality 15); (4) Strelka version 0.4.6.2 [5] (params: isSkipDepthFilters=0).

For SNVs and Indels in tier 2 and 3, we retained all variant calls that were also called in the deep validation data. Additional steps were taken to enhance the number of coding variants recovered, including pooling of discovery (WGS) and validation (capture) data to provide enhanced coverage. A binomial log-likelihood filter (https://github.com/genome/genome/blob/master/lib/perl/Genome/Model/Tools/Validation/IdentifyOutliers.pm) was applied to variant calls with at least 10 × sequence coverage in both tumour and normal samples, and those identified as somatic with LLR>3 were retained. Review of mutational hotspots was used to recover additional variants in breast cancer-related genes.

Variant validation

Mutations in the 22 WGS samples from the original paper were validated using custom capture arrays targeted to the mutations in each tumour, as previously described²⁰. After sequencing the post-AI samples, a second round of custom capture validation was performed on all 77 samples from these 22 patients, by combining the somatic mutation sites predicted in both the baseline and surgical samples. All coding SNVs, indels and SVs were targeted for validation, along with a random subset of non-coding variants, providing additional sequence coverage for validation and clonality analysis. No Sanger sequencing validation was performed. Controlled tests of our somatic mutation calling pipeline (with similar configuration to that used here) have established a positive predictive value of over 90% for mutations down to VAFs of ∼5%, so we expect the baseline false-positive rate to be low in these data. Results from the validation sequencing pipeline were then subjected to extensive manual review to validate the presence of variants and remove additional false-positive somatic calls. This involved manually inspecting sequencing reads using the Integrated Genomic Viewer (IGV), and flagging a number of different types of sequencing artifacts, including: insertions and deletions at the ends of homopolymer runs, calls that were present in the matched normal sample (or other normal samples), and regions with substantial numbers of low-or-zero quality mapping scores, indicating potential reference build problems or mapping artifacts. A similar process was used to inspect copy number calls, removing those resulting from alignment errors in highly repetitive regions of the genome or that span assembly gaps or centromeres.

Baseline and surgical-specific variants

Variant allele fractions of all tier 1 variants were corrected for purity by reducing the number of reference-supporting reads in proportion to the purity of the sample. This effectively scales up the VAFs in such a way that founding clone variants are near 50% VAF. At each variant position, Fisher’s exact test was used to identify significant VAF changes, by comparing each baseline sample with each surgical sample. Those with P<0.05 were retained. Baseline-specific variants were required to be below 10% VAF in the surgical sample and have at least one variant-supporting read in the baseline RNA sample. Surgical-specific variants were required to be below 10% VAF in the baseline sample and have at least one variant-supporting read in the surgical RNA sample.

Targeted sequencing of extension cohort

A panel of 83 breast cancer-related genes was derived from meta-analysis and literature review (Supplementary Data 11). These genes were targeted comprehensively with 3,029 complementary probes for hybridization-based enrichment (IDT Technologies), then sequenced on the Illumina HiSeq platform. These captured data were produced for baseline and surgical samples from 19 of the 22 WGS cases, and for an additional 38 cases. SNVs and Indels were called as described in the Supplementary Methods.

RNA library preparation and sequencing

RNA preparation was performed according to the Illumina TruSeq mRNA protocol with Poly-A selection. Unstranded paired-end sequencing was performed on the Illumina HiSeq 2000 platform, producing 48 × 7 × 48 paired-end reads.

Expression profiling

RNA-seq data were aligned with Tophat v2.0.8 (de novo mode, reference only, reference guided) and expression levels calculated with Cufflinks v2.1.1 (params: --max-bundle-length=10,000,000). RNAseq readcounts for all coding mutations were calculated using bam-readcount 0.7 (https://github.com/genome/bam-readcount). The samples were clustered by applying unsupervised hierarchical clustering on a matrix of sample-by-sample correlations derived from gene expression (FPKM) values (Supplementary Data 7 and 8).

Clonality and clonal instability

The clonal architecture of each tumour was inferred with sciClone (version 1.0.7; ref. 22), using default parameters except for copyNumberMargins set to 0.25. The median VAF of each subclone was used as input for clonevol (https://github.com/hdng/clonevol), which constructed evolutionary trees and estimated the absolute percentage of each clone in each sample. The clonal instability index was calculated as the mean of absolute differences between pre and post clonevol-inferred clonal percentages. A value of 0 represents no change, and 1 represents completely distinct tumours. In this study, 0.1 was used as an arbitrary cutoff between ‘stable’ and ‘dynamic’ tumours. BRC17 and BRC48 had complex patterns of copy number and ploidy changes preventing automated clonal inference, but clearly fall into this classification, with mutations specific to both baseline and surgical tumours. BRC18 did not have clear separation between subclonal populations, but contained a large number of mutations that were present in roughly half of the baseline sample tumour cells and were significantly enriched (and mostly expressed) in the surgical sample tumour cells. Links to all the data and scripts used for clonal inference are found in Supplementary Software 1.

Code availability

Code used for analyses is all open source and detailed versions and parameters are provided above.

Data availability

The baseline whole-genome sequences have already been reported²⁰ and deposited in dbGaP with accession ID phs000472. Both the RNA sequencing data and the genomic sequencing data from the post-treatment samples and extension cohort have been added to the same study. All other data are contained within the Article or Supplementary Information files, or available from the authors upon request.

Additional information

How to cite this article: Miller, C. A. et al. Aromatase inhibition remodels the clonal architecture of estrogen-receptor-positive breast cancers. Nat. Commun. 7:12498 doi: 10.1038/ncomms12498 (2016).

References

Ma, C. X., Reinert, T., Chmielewska, I. & Ellis, M. J. Mechanisms of aromatase inhibitor resistance. Nat. Rev. Cancer 15, 261–275 (2015).
Article CAS Google Scholar
Miller, W. R. et al. Predicting response and resistance to endocrine therapy: profiling patients on aromatase inhibitors. Cancer 112, 689–694 (2008).
Article CAS Google Scholar
Ma, C. X., Sanchez, C. G. & Ellis, M. J. Predicting endocrine therapy responsiveness in breast cancer. Oncology (Williston Park) 23, 133–142 (2009).
Google Scholar
Nowell, P. C. The clonal evolution of tumor cell populations. Science 194, 23–28 (1976).
Article ADS CAS Google Scholar
Welch, J. S. et al. The origin and evolution of mutations in acute myeloid leukemia. Cell 150, 264–278 (2012).
Article CAS Google Scholar
Nik-Zainal, S. et al. The life history of 21 breast cancers. Cell 149, 994–1007 (2012).
Article CAS Google Scholar
Zhang, J. et al. Intratumor heterogeneity in localized lung adenocarcinomas delineated by multiregion sequencing. Science 346, 256–259 (2014).
Article ADS CAS Google Scholar
de Bruin, E. C. et al. Spatial and temporal diversity in genomic instability processes defines lung cancer evolution. Science 346, 251–256 (2014).
Article ADS CAS Google Scholar
Gerlinger, M. & Swanton, C. How Darwinian models inform therapeutic failure initiated by clonal heterogeneity in cancer medicine. Br. J. Cancer 103, 1139–1143 (2010).
Article CAS Google Scholar
Yap, T. A., Gerlinger, M., Futreal, P. A., Pusztai, L. & Swanton, C. Intratumor heterogeneity: seeing the wood for the trees. Sci. Transl. Med. 4, 127ps10 (2012).
Article Google Scholar
Turke, A. B. et al. Preexistence and clonal selection of MET amplification in EGFR mutant NSCLC. Cancer Cell 17, 77–88 (2010).
Article CAS Google Scholar
Su, K.-Y. et al. Pretreatment epidermal growth factor receptor (EGFR) T790M mutation predicts shorter EGFR tyrosine kinase inhibitor response duration in patients with non-small-cell lung cancer. J. Clin. Oncol. 30, 433–440 (2012).
Article CAS Google Scholar
Juric, D. et al. Convergent loss of PTEN leads to clinical resistance to a PI(3)Kα inhibitor. Nature 518, 240–244 (2015).
Article ADS CAS Google Scholar
Hartman, M. et al. Incidence and prognosis of synchronous and metachronous bilateral breast cancer. J. Clin. Oncol. 25, 4210–4216 (2007).
Article Google Scholar
Wahner-Roedler, D. L., Reynolds, C. A. & Boughey, J. C. Collision tumors with synchronous presentation of breast carcinoma and lymphoproliferative disorders in the axillary nodes of patients with newly diagnosed breast cancer: a case series. Clin. Breast Cancer 11, 61–66 (2011).
Article Google Scholar
Shah, S. P. et al. The clonal and mutational evolution spectrum of primary triple-negative breast cancers. Nature 486, 395–399 (2012).
Article ADS CAS Google Scholar
Li, S. et al. Endocrine-therapy-resistant ESR1 variants revealed by genomic characterization of breast-cancer-derived xenografts. Cell Rep. 4, 1116–1130 (2013).
Article ADS CAS Google Scholar
Miller, W. R. et al. Changes in breast cancer transcriptional profiles after treatment with the aromatase inhibitor, letrozole. Pharmacogenet. Genomics 17, 813–826 (2007).
Article CAS Google Scholar
Mackay, A. et al. Molecular response to aromatase inhibitor treatment in primary breast cancer. Breast Cancer Res. 9, R37 (2007).
Article Google Scholar
Ellis, M. J. et al. Whole-genome analysis informs breast cancer response to aromatase inhibition. Nature 486, 353–360 (2012).
Article ADS CAS Google Scholar
Mardis, E. R. et al. Recurring mutations found by sequencing an acute myeloid leukemia genome. N. Engl. J. Med. 361, 1058–1066 (2009).
Article CAS Google Scholar
Miller, C. A. et al. SciClone: inferring clonal architecture and tracking the spatial and temporal patterns of tumor evolution. PLoS Comput. Biol. 10, e1003665 (2014).
Article Google Scholar
Gururaj, A. E., Holm, C., Landberg, G. & Kumar, R. Breast cancer-amplified sequence 3, a target of metastasis-associated protein 1, contributes to tamoxifen resistance in premenopausal patients with breast cancer. Cell Cycle 5, 1407–1410 (2006).
Article CAS Google Scholar
Toy, W. et al. ESR1 ligand-binding domain mutations in hormone-resistant breast cancer. Nat. Genet. 45, 1439–1445 (2013).
Article CAS Google Scholar
Robinson, D. R. et al. Activating ESR1 mutations in hormone-resistant metastatic breast cancer. Nat. Genet. 45, 1446–1451 (2013).
Article CAS Google Scholar
Ellis, M. J. et al. Outcome prediction for estrogen receptor-positive breast cancer based on postneoadjuvant endocrine therapy tumor characteristics. J. Natl Cancer Inst. 100, 1380–1388 (2008).
Article CAS Google Scholar
Foukakis, T., Åström, G., Lindström, L., Hatschek, T. & Bergh, J. When to order a biopsy to characterise a metastatic relapse in breast cancer. Ann. Oncol. 23, (Suppl 10): x349–x353 (2012).
Article Google Scholar
Bose, R. et al. Activating HER2 mutations in HER2 gene amplification negative breast cancer. Cancer Discov. 3, 224–237 (2013).
Article CAS Google Scholar
Ellis, M. J. Functional analysis of the breast cancer genome. J. Clin. Oncol. 24, 1649–1650 (2006).
Article CAS Google Scholar
Ellis, M. J. et al. Phosphatidyl-inositol-3-kinase alpha catalytic subunit mutation and response to neoadjuvant endocrine therapy for estrogen receptor positive breast cancer. Breast Cancer Res. Treat. 119, 379–390 (2010).
Article CAS Google Scholar
Ellis, M. J. et al. Randomized phase II neoadjuvant comparison between letrozole, anastrozole, and exemestane for postmenopausal women with estrogen receptor–rich stage 2 to 3 breast cancer: clinical and biomarker outcomes and predictive value of the baseline PAM50-based intrinsic subtype—ACOSOG Z1031. J. Clin. Oncol. 29, 2342–2349 (2011).
Article CAS Google Scholar
Olson, J. A. et al. Improved surgical outcomes for breast cancer patients receiving neoadjuvant aromatase inhibitor therapy: results from a multicenter phase II trial. J. Am. Coll. Surg. 208, 906–914 (2009),) discussion 915–916.
Article Google Scholar
Griffith, M. et al. Genome modeling system: a knowledge management platform for genomics. PLoS Comput. Biol. 11, e1004274 (2015).
Article Google Scholar
Zhang, J. et al. INTEGRATE: gene fusion discovery using whole genome and transcriptome data. Genome Res. 26, 108–118 (2016).
Article CAS Google Scholar

Download references

Acknowledgements

Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health under Award Numbers U10CA180821 and U10CA180882 (to the Alliance for Clinical Trials in Oncology), and the following grants: 5U10CA180833 and 5U10CA180858. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. Also supported in part by funding provided by grants U54HG003079 from the National Human Genome Research Institute to R.K.W., R01-CA095614 to M.J.E., U24-CA114736, U10-CA076001, and U01-CA114722 from the National Cancer Institute; by the Breast Cancer Research Foundation; Komen Promise Grant PG12220321 to M.J.E., a Komen St Louis Affiliate Clinical Trials Grant; and support for Z1031 from Pfizer and Novartis. M.J.E. is a McNair Medical Foundation Scholar and the recipient of a Cancer Prevention Research Institute of Texas established investigator award RR140033. We thank the participants of the clinical trials for their critically important samples provided for the study.

Author information

Yevgeniy Gindin
Present address: Present address: Clontech Laboratories, 1290 Terra Bella Ave, Mountain View, California 94043, USA,
Charles Lu
Present address: Present address: AbbVie, 1401 Sheridan Rd, North Chicago, Illinois 60064, USA,
Dong Shen
Present address: Present address: AstraZeneca, 1 MedImmune Way, Gaithersburg, Maryland 20878, USA,

Authors and Affiliations

McDonnell Genome Institute, Washington University School of Medicine, St Louis, 63108, Missouri, USA
Christopher A. Miller, Yevgeniy Gindin, Charles Lu, Obi L Griffith, Malachi Griffith, Dong Shen, Tiandao Li, David E. Larson, Richard K. Wilson, Elaine R. Mardis & Matthew J. Ellis
Department of Medicine, Division of Genomics and Bioinformatics, Washington University School of Medicine, St Louis, 63108, Missouri, USA
Christopher A. Miller & Richard K. Wilson
Department of Medicine, Division of Oncology, Washington University School of Medicine, St Louis, 63108, Missouri, USA
Obi L Griffith, Jeremy Hoog, Sherri R Davies, Jacqueline Snider, Katherine DeSchryver, Richard K. Wilson, Elaine R. Mardis & Matthew J. Ellis
Siteman Cancer Center, Washington University School of Medicine, St Louis, 63108, Missouri, USA
Obi L Griffith, Malachi Griffith, Graham A. Colditz, Richard K. Wilson & Elaine R. Mardis
Department of Genetics, Washington University School of Medicine, St Louis, 63108, Missouri, USA
Malachi Griffith, David E. Larson, Richard K. Wilson & Elaine R. Mardis
Department of Pathology and Immunology, Washington University School of Medicine, St Louis, 63108, Missouri, USA
Mark Watson
Department of Breast Surgery, MD Anderson Cancer Center, 77030, Texas, Houston, USA
Kelly Hunt
Alliance Statistics and Data Center, Mayo Clinic, Rochester, Minnesota, 55905, USA
Vera J. Suman
Department of Surgery, Division of Public Health, St Louis Breast Tissue Registry, Washington University School of Medicine, St Louis, 63108, Missouri, USA
Thomas Walsh, Graham A. Colditz & Katherine DeSchryver
Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, 77030, Texas, USA
Matthew J. Ellis

Authors

Christopher A. Miller
View author publications
You can also search for this author in PubMed Google Scholar
Yevgeniy Gindin
View author publications
You can also search for this author in PubMed Google Scholar
Charles Lu
View author publications
You can also search for this author in PubMed Google Scholar
Obi L Griffith
View author publications
You can also search for this author in PubMed Google Scholar
Malachi Griffith
View author publications
You can also search for this author in PubMed Google Scholar
Dong Shen
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy Hoog
View author publications
You can also search for this author in PubMed Google Scholar
Tiandao Li
View author publications
You can also search for this author in PubMed Google Scholar
David E. Larson
View author publications
You can also search for this author in PubMed Google Scholar
Mark Watson
View author publications
You can also search for this author in PubMed Google Scholar
Sherri R Davies
View author publications
You can also search for this author in PubMed Google Scholar
Kelly Hunt
View author publications
You can also search for this author in PubMed Google Scholar
Vera J. Suman
View author publications
You can also search for this author in PubMed Google Scholar
Jacqueline Snider
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Walsh
View author publications
You can also search for this author in PubMed Google Scholar
Graham A. Colditz
View author publications
You can also search for this author in PubMed Google Scholar
Katherine DeSchryver
View author publications
You can also search for this author in PubMed Google Scholar
Richard K. Wilson
View author publications
You can also search for this author in PubMed Google Scholar
Elaine R. Mardis
View author publications
You can also search for this author in PubMed Google Scholar
Matthew J. Ellis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.A.M., R.K.W., M.J.E. and E.R.M. designed the experiments and data analysis. C.A.M., Y.G., C.L., O.L.G., M.G., D.S., T.L. and D.E.L. performed the data analysis. M.J.E. led the clinical investigations and biomarker analysis. C.A.M., Y.G. and C.L. prepared the figures and tables. V.J.S. provided statistical support. M.W. and K.D. provided pathology analysis. K.H., J.S., T.W., G.A.C. and S.R.D. provided study materials, patients or clinical support. C.A.M., E.R.M. and M.J.E. wrote the manuscript. Y.G., S.R.D., R.K.W. and D.E.L. critically read and commented on the manuscript.

Corresponding authors

Correspondence to Elaine R. Mardis or Matthew J. Ellis.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-8, Supplementary Notes 1-5, Supplementary Data 1-12, and Supplementary Software 1-2. (PDF 14436 kb)

Supplementary Data 1

Clinical characteristics of cohort (XLSX 15 kb)

Supplementary Data 2

Protein-coding (tier1) SNVs and Indels (XLS 1260 kb)

Supplementary Data 3

Non-protein-coding (tier2-3) SNVs and Indels (XLS 18271 kb)

Supplementary Data 4

Sequencing Coverage and Purity (XLS 60 kb)

Supplementary Data 5

Validated structural variants (XLS 288 kb)

Supplementary Data 6

Gene fusions detected using INTEGRATE (XLS 80 kb)

Supplementary Data 7

Gene expression values (expressed in FPKM). (XLSX 19543 kb)

Supplementary Data 8

Mutations detected in pre-malignant (DCIS) samples from BRC15 (XLSX 12 kb)

Supplementary Data 9

Subclonal VAFs (XLS 32 kb)

Supplementary Data 10

List of the 83 genes targeted by the breast cancer-related gene panel (XLSX 11 kb)

Supplementary Data 11

Mutations detected in targeted sequencing of extension cohort (XLS 39 kb)

Supplementary Data 12

Differentially expressed genes between responders and non- responders (XLS 123 kb)

Supplementary Software 1

All data and scripts used for clonal clustering (ZIP 6820 kb)

Supplementary Software 2

CopyCat code (ZIP 43 kb)

Peer review file (PDF 258 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Miller, C., Gindin, Y., Lu, C. et al. Aromatase inhibition remodels the clonal architecture of estrogen-receptor-positive breast cancers. Nat Commun 7, 12498 (2016). https://doi.org/10.1038/ncomms12498

Download citation

Received: 06 January 2016
Accepted: 07 July 2016
Published: 09 August 2016
DOI: https://doi.org/10.1038/ncomms12498

This article is cited by

ERα/PR crosstalk is altered in the context of the ERα Y537S mutation and contributes to endocrine therapy-resistant tumor proliferation
- Rosemary J. Huggins
- Geoffrey L. Greene
npj Breast Cancer (2023)
BAP18 facilitates CTCF-mediated chromatin accessible to regulate enhancer activity in breast cancer
- Ge Sun
- Yuntao Wei
- Yue Zhao
Cell Death & Differentiation (2023)
NPY1R exerts inhibitory action on estradiol-stimulated growth and predicts endocrine sensitivity and better survival in ER-positive breast cancer
- Raksha Bhat
- Hariprasad Thangavel
- Meghana V. Trivedi
Scientific Reports (2022)
Acquired mutations and transcriptional remodeling in long-term estrogen-deprived locoregional breast cancer recurrences
- Nolan Priedigkeit
- Kai Ding
- Adrian V. Lee
Breast Cancer Research (2021)
Suppression of breast cancer metastasis and extension of survival by a new antiestrogen in a preclinical model driven by mutant estrogen receptors
- Mary J. Laws
- Yvonne Ziegler
- Benita S. Katzenellenbogen
Breast Cancer Research and Treatment (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Mutational landscape

Subclonal inference by clustering of variant allele fractions

Intertwined tumours of independent origin

Simple and clonally stable tumours

Complex and dynamic tumours

Complex and stable tumours

Targeted sequencing and treatment-associated heterogeneity

Discussion

Methods

Sample acquisition and clinical characterization

Genomic library preparation and sequencing

Copy number and structural variant detection

SNV and indel detection

Variant validation

Baseline and surgical-specific variants

Targeted sequencing of extension cohort

RNA library preparation and sequencing

Expression profiling

Clonality and clonal instability

Code availability

Data availability

Additional information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links