Unique evolutionary trajectories of breast cancers with distinct genomic and spatial heterogeneity

Phung, Tanya N.; Webster, Timothy H.; Lenkiewicz, Elizabeth; Malasi, Smriti; Andreozzi, Mariacarla; McCullough, Ann E.; Anderson, Karen S.; Pockaj, Barbara A.; Wilson, Melissa A.; Barrett, Michael T.

doi:10.1038/s41598-021-90170-1

Download PDF

Article
Open access
Published: 19 May 2021

Unique evolutionary trajectories of breast cancers with distinct genomic and spatial heterogeneity

Tanya N. Phung^1,2,
Timothy H. Webster^1,3,
Elizabeth Lenkiewicz⁴,
Smriti Malasi⁴,
Mariacarla Andreozzi⁴,
Ann E. McCullough⁵,
Karen S. Anderson⁶,
Barbara A. Pockaj⁷,
Melissa A. Wilson^1,2 &
…
Michael T. Barrett⁴

Scientific Reports volume 11, Article number: 10571 (2021) Cite this article

1612 Accesses
1 Citations
Metrics details

Subjects

Abstract

Breast cancers exhibit intratumoral heterogeneity associated with disease progression and therapeutic resistance. To define the sources and the extent of heterogeneity, we performed an in-depth analysis of the genomic architecture of three chemoradiation-naïve breast cancers with well-defined clinical features including variable ER, PR, ERBB2 receptor expression and two distinct pathogenic BRCA2^mut genotypes. The latter included a germ line carrier and a patient with a somatic variant. In each case we combined DNA content-based flow cytometry with whole exome sequencing and genome wide copy number variant (CNV) analysis of distinct populations sorted from multiple (4–18) mapped biopsies within the tumors and involved lymph nodes. Interrogating flow-sorted tumor populations from each biopsy provided an objective method to distinguish fixed and variable genomic lesions in each tumor. Notably we show that tumors exploit CNVs to fix mutations and deletions in distinct populations throughout each tumor. The identification of fixed genomic lesions that are shared or unique within each tumor, has broad implications for the study of tumor heterogeneity including the presence of tumor markers and therapeutic targets, and of candidate neoepitopes in breast and other solid tumors that can advance more effective treatment and clinical management of patients with disease.

The genomic landscape of metastatic breast cancer highlights changes in mutation and signature frequencies

Article 30 September 2019

Interrogating breast cancer heterogeneity using single and pooled circulating tumor cell analysis

Article Open access 05 July 2022

Single-cell transcriptomics reveals multi-step adaptations to endocrine therapy

Article Open access 02 September 2019

Introduction

It is well established that tumors arise as a result of an acquired genomic instability and the evolution of distinct cell lineages during the natural history of disease^1,2,3. Consequently, tumors have unique patterns of mutations and copy number aberrations in their genomes. The nature and the extent of tumor heterogeneity within a tumor are challenges for advancing improved more personalized care for patients with disease. Next generation sequencing (NGS) surveys of one or more regions within tumors have described a complex genomic landscape for breast and other solid tumors^1,4,5. Notably, NGS studies have used variant allele fractions (VAFs) to reconstruct the clonal composition of tumors of interest^6,7. A recurrent theme in these studies is that tumors consist of cell lineages with tree-like structures composed of clonal (trunk) and sub clonal (branches) mutations. A fundamental hypothesis is that the composition and heterogeneity of the tumor cell lineages present in a patient’s tumor will impact its’ clinical history^8,9,10. Thus, it has been proposed that measures of intra tumor heterogeneity based on multiple biopsies are needed to accurately stage a tumor and to identify those genomic lesions that can be exploited for improved patient care^5,9.

Clinical management of breast cancer patients relies on the status of estrogen (ER), progesterone (PR), and ERBB2 receptors in diagnostic biopsies. Results for these markers are typically based on well-established IHC and FISH assays that incorporate staining intensity and the percentage of positive cell numbers for each sample of interest^11,12. Heterogeneous results for one or more of these clinical markers are often observed in different sites within a tumor, suggesting that multiple clonal populations with unique and shared driver mutations and CNVs may be present. In addition to receptor status pathogenic BRCA1/2 mutations associated with homologous recombination deficiency (HRD) define a distinct clinical class of breast tumors^13,14. The latter are characterized by elevated levels of genomic instability, a hallmark of cancer, notably an increase in intra chromosomal CNV aberrations relative to BRCA^wt tumors¹⁵. Thus, BRCA2^mut tumors that express an HRD phenotype represent an additional challenge for assessing tumor heterogeneity and accurate clinical staging.

The loss of normal cell cycle control and the development of tetraploid and aneuploid cell populations arise in premalignant tissues and contribute to the evolution of neoplasias^3,16,17. In addition to VAFs of single nucleotide variants (SNVs), tumors display highly variable patterns of CNVs including both gains and losses. These can range from single copy changes spanning whole chromosomes to focal high-level amplifications and intragenic homozygous deletions. The presence of CNVs alters the ratio of alleles, notably in non-diploid tumors, and may provide highly selectable changes in the dosages of oncogenes and tumor suppressors throughout the clinical history of a tumor. Current models of tumor heterogeneity rely on bulk tumor preparations, pathological review, and the application of multiple computational tools to deconvolute the presence of clonal populations^5,18,19. Tumor lineages are described based on predicted shared clonal events and include genes with lesions described as double hits (e.g., variant plus copy number loss, biallelic variant, homozygous deletion) in histologically admixed samples. However genomic instability including ploidy and chromosome instability, and variable admixtures of non-neoplastic cells in samples of interest limit the precision and potential clinical impact of VAF based methods in the identification of shared genomic lesions in heterogeneous tumors.

Fixed variants, notably homozygous SNVs and deletions, are those that are present in the absence of an alternative variant and thus cannot revert or be replaced by a preexisting allele. These variants can dramatically affect the natural or clinical history of a tumor. Of significant interest are those fixed variants targeting clinically relevant genes that are shared across the populations or that distinguish unique populations within a tumor. The emergence and persistence of fixed variants in aneuploid genomes can be driven by changes in ploidy and both the nature and extent of genomic instability that drives the tumor. Thus, in contrast to variants mapping to autosomes and sex chromosomes in normal diploid cells, the identification of fixed variants in tumor genomes needs to account for the presence of abnormal genotypes, chromosomes, and ploidies present in histologically admixed tumors.

In this study we systematically interrogated the composition of 3 chemoradiation naïve surgically excised ductal carcinomas with well annotated clinical and genomic features. To overcome the limitations of VAF based methods with bulk tumor samples we combined flow cytometry, whole genome CNV, and whole exome SNV analyses to identify fixed lesions within distinct aneuploid populations present in multiple (4–18) biopsies from each of the three heterogeneous breast tumors. The first tumor, PS13-9062, was a grade 3 ER+, PR+, and ERBB2- (pT2, pN0) arising in a patient with a germ line BRCA2 mutation. The second tumor, PS13-1760, was classified as a grade 3 node negative ER+ PR− ERBB2+ (3+) (pT2, pN0) and had a somatic BRCA2 mutation. The third tumor, PS13-585, was a large node positive tumor (pT2, pN3c, pMX) with a diagnosis based on a single core needle biopsy of grade 3 ER+ PR− ERBB2+ (3+). However, further IHC testing of mapped research biopsies scored ERBB2 as equivocal with heterogeneous staining within primary and lymph nodes. In addition, both ER and PR IHC staining were heterogeneous throughout the tumor with abrupt staining boundaries for both clinical markers. These three tumors with distinct clinical and genomic contexts provide a model to systematically study the nature and extent of heterogeneity based on mapped clonal aneuploid populations, shared and unique fixed somatic lesions, established clinical markers used for the management of patients with breast cancer, and the presence of predicted neoepitopes.

Results

Aneuploid populations within resectable ER+ breast tumors

We screened the mapped biopsies from each tumor with DNA content flow cytometry and sorted each diploid, tetraploid, and aneuploid population detected. Patient 1 (PS13-9062) had a single ploidy (3.7 N) in each of the 4 biopsies mapped within the tumor (Supplemental Fig. 1). The second patient (PS13-1750) had a dominant ploidy (3.2 N) in each of the 10 (A1–A10) primary tumor biopsies and a second co-existing ploidy (3.6 N) in one (A2) biopsy (Supplemental Fig. 2). Patient 3 (PS13-585) had 12 mapped biopsies (A1–A12) within the primary tissue, four (B1–B3, D1) from two adjacent nodes, and two (F1, F2) from a distal node. In total we detected six different non-diploid populations across the 18 mapped biopsies (Supplemental Fig. 3). Co-existing ploidies were present in six of the primary biopsies. In contrast the nodes contained either a 5.8 N (2/6 biopsies) or a 5.0 N (4/6 biopsies) population. Strikingly both populations were present as co-existing ploidies within biopsies of 2 regions (A3 and A4) in the primary. The presence of aneuploid populations with distinct ploidies provides an initial measure of heterogeneity in each tumor. In addition, these sorted populations provide templates for high-resolution analysis for the detection of fixed variants, SNVs and CNVs, and neoepitopes that are either shared or unique to each of the populations within each tumor.

PS13-9062

This tumor arose in a patient with a known pathogenic BRCA2^E49X germ line variant. The genomes of each sorted population displayed multiple unique and shared CNVs that reflect the BRCA2 mutated status (Supplemental Fig. 4). Notably the BRCA2^E49X variant was fixed by an interstitial 13q12.3–q14.2 deletion in the 3.7 N tumor population present in each of the four biopsies (Fig. 1A, B). We detected a total of 369 somatic mutations across the 4 biopsies (Fig. 1C, D). Of these 120 (33%) were shared, while mutations unique to a single region varied from 19 in F3 (5.1%) to 68 in F6 (18.4%). Shared mutations that were fixed in each population included 3 variants in non-coding regions and missense mutations in a protein carboxyl methyltransferase, PCMT1, and the ferritin receptor SCARA5. In addition, there was a fixed PIAS4^N156S missense mutation present in the 4 biopsies, however one sample (F5) had coverage that was below our minimum NGS threshold of 10 reads. The latter gene, Protein Inhibitor of Activated STAT 4, is a SUMO E3-ligase that promotes responses to DNA double strand breaks²⁰. In addition, we identified a shared fixed 15 kb deletion targeting exons 5 and 6 of NUMB in both our CNV and exome results (Supplemental Fig. 5). Decreased expression of Numb has been linked to activation of the proto-oncogene Notch1 and reduction in TP53²¹. Notably loss of Numb expression is a marker of tumor aggressiveness, potentially linked to BRCA status and a cancer stem cell phenotype in primary breast cancer. The fixed nature of the PIAS4 and NUMB lesions supports a model suggesting that these likely promoted the BRCA2 phenotype of this tumor.

PS13-1750

The tumor has 3105 somatic mutations including 700 (23%) that were shared across the ten biopsies and the eleven sorted populations (Fig. 2A). Of these 84/700 (12%) were fixed. The majority 59/84 (70%) of these targeted non-coding regions while twenty-five targeted coding sequences. The latter included thirteen synonymous variants and twelve non-synonymous variants. One of the latter was the pathogenic nonsense variant BRCA2^R3128X, while eleven were missense variants of which only two, UMPS^G213A and NMBR^L390M, had predicted pathogenicity (Fig. 2B). The shared variants mapped primarily to regions of deletions such as 13q13.1–q21.32 that included the BRCA2 locus (Fig. 2C, D). The CNV patterns were analogous to, but distinct from, those in the BRCA2 germ line carrier PS13-9062 with variable shared and unique CNVs across the sorted populations in each tumor (Supplemental Fig. 6).

Strikingly, the co-existing ploidies A2-3.6 N and A2-3.2 N contained the most unique variants, 347 and 102 respectively (Fig. 2A). In addition, the unique ploidy A2-3.6 N had the highest number (99) of unique fixed variants (Fig. 3A). Principle component analysis (PCA) with the somatic variants distinguished these two ploidies in A2 from each other and from the other 9 populations in the tumor (Fig. 3B). Many of these fixed mutations were present as heterozygous variants in the other populations and were fixed because of somatic CNVs, the majority of which were losses. However, 19/99 of the mutations that were uniquely fixed in A2-3.6 N were absent in the other nine populations. These 19 de novo variants were present in non-coding regions including those of tumor associated genes ATR, RPS6KA2, IRF3, and CHD3 (Fig. 3C)^{22,23,24,25,26}.

PS13-585

We detected a total of 801 somatic mutations across the 25 populations sorted from the 18 biopsies. Of these only 63 (8%) were shared, while mutations unique to a single region varied extensively across the tumor notably within the primary tumor where the co-existing 5.0 N and 5.8 N ploidies in A3 had 144 and 64 unique variants respectively (Fig. 4A). In contrast the same ploidies in the adjacent A4 region had only 5 or no unique variants. Ten fixed variants were shared within the primary tumor and the nodes notably pathogenic missense variants in TP53^V172D, PIK3CA^H1047R, and NF1^D301N (Fig. 4B). Targeted resequencing of these three genes in single nuclei sorted from 5.0 N populations within the primary and lymph nodes confirmed the fixed nature of the three pathogenic variants in single cells (Supplemental Fig. 7). In contrast, the presence of a non-fixed APC^D953V variant included fixed, heterogeneous, and wild type single nuclei in the primary and node samples. In addition a novel nonsense mutation in XPO4^E140X (Exportin 4) a member of a family of nuclear transporters, known to export Smad3 (a component of TGFβ signaling) and Eif5a1 and Eif5a2 (two closely related translation initiation factors) from the nucleus, was also shared^27,28. This gene has been identified as a tumor suppressor in hepatocellular carcinoma²⁹. Four additional shared fixed variants within non-coding regions mapped to the two highest amplicons across the tumor on chromosome 12p13.32 and 12q14.1 and reflect the stability of these CNVs. There were 3 additional node-specific shared fixed variants including two on chromosome 13, a missense variant CCDC168^D4512N and a variant in an intergenic region, and an intronic variant of MYO15A at 17p11.2.

A striking finding in the BRCA^WT tumor PS13-585 was the presence of aberrant CNV patterns but stable genomes across the primary and nodal tissues and multiple ploidies (Fig. 4C). The genomes of each of the 24 sorted aneuploid populations in this tumor shared a high level 20q11.23–q12 amplicon targeting SRC, a chromosome 12 with multiple amplicons and deletions, including a large homozygous deletion spanning 9.5 Mb at 3p12.1–p12.3 that targeted the axon guidance Roundabout Guidance Receptors 1 and 2 (ROBO1 and ROBO2) loci (Supplemental Figs. 8, 9). In each case the boundaries of the aberrant intervals, including amplicons of variable heights and lengths, partial and homozygous deletions were conserved across the different ploidies and tissues.

Clinical markers

Two of the three tumors, PS13-1750 and PS13-585, were scored as ER+ PR– ERBB2+ (3+) based on clinical evaluation of a single biopsy. However, further IHC testing of mapped research biopsies from P13-585 scored ERBB2 as equivocal with heterogeneous staining within primary and lymph nodes. We did not detect evidence for ERBB2 amplification in any of the sorted samples from both cases (Fig. 5). In PS13-1750 there was a uniform low-level copy number gain spanning 17p11-qtel in each sorted population (Fig. 5A). In comparison chromosome 17 had multiple CNVs on both arms in each of the sorted populations and multiple ploidies present in PS13-585 (Fig. 5B). However, chromosome 17, which contains two of the shared fixed pathogenic somatic variants, TP53^V172D on the p arm and NF1^D301N on the q arm, was stable with a gain of 17q11.2 relative to 17q12–q21.32. Thus, in both tumors there was no evidence for selective ERBB2 amplicons in any of the flow sorted samples.

Neoepitope predictions

We applied our established neoepitope prediction pipeline with the total missense mutations in each of the three tumors as inputs³⁰. Multiple variants were identified in the three tumors with each sorted population having one or more predicted neoepitope (Fig. 6; Table 1). PS13-9062, a pathogenic BRCA2 germ line carrier, had three shared predicted neoepitopes that included Prostaglandin F Receptor (PTGFR). The 10 biopsies and 11 sorted aneuploid populations in PS13-1750, containing a somatic pathogenic BRCA2 variant, had six shared predicted neoepitopes derived from variants in genes with diverse functions including a transcription factor (PHTF2), a regulator of apoptosis (DAP), and a protein tyrosine phosphatase (MTMR11). Strikingly, the most clonally diverse tumor (PS13-585) had four shared predicted neoepitopes derived from three genes. Notably, one of these was the shared fixed pathogenic NF1^D301N variant. In contrast, the majority of neoepitope candidates was present in unique or restricted numbers of sorted populations within each tumor.

Table 1 Predicted shared neoepitopes.

Full size table

Discussion

Our analysis of shared and fixed variants within the three breast cancer tumors with distinct genomic drivers and mutational landscapes highlights the extent and nature of heterogeneity within each tumor. Aneuploidy, as defined by a deviation from diploid DNA content, is a hallmark of cancer³¹. Previous flow sorting based studies have shown that cell populations with distinct DNA contents arise in premalignant stages and evolve through the natural history of neoplasias^32,33. Furthermore, the number and distribution of ploidies within a neoplasia can vary from a single population present throughout both premalignant and invasive regions to multiple populations in a single biopsy^3,34. We have used DNA content flow cytometry to interrogate a variety of solid tumors^{30,35,36,37,38}. The genomic analyses, including NGS and CNV profiles, of sorted tumor populations enables objective identification of mutations and genomic lesions that drive the natural and clinical histories of tumors. In this current study we extended our approach to include measures of fixed and shared variants to identify both known and putative drivers of three clinically distinct breast cancers. In addition, we exploited these data to investigate patterns of predicted neoepitopes and whether genomic heterogeneity affected clinical markers. The availability of multiple mapped biopsies from each tumor provides highly favorable models for this study.

In two of the cases, we show that despite extensive and variable HRD related CNV profiles BRCA2 mutant breast cancers retain shared fixed variants targeting a discrete set of genes throughout the tumor. Notably, the majorities of fixed variants are non-coding and map to regions that are subject to copy number losses (Figs. 1, 2). This includes a tumor arising in a germ line carrier PS13-9062 and another somatic tumor PS13-1750. Of significant interest in breast and other solid tumors is the role of fixed somatic variants in BRCA and related genes in the responses to PARP inhibitors and DNA damaging agents. Notably, tumors with pathogenic germ line BRCA mutations display elevated responses to PARP inhibitors³⁹. However, retention of wild type alleles in affected tumors markedly affects therapeutic responses to these targeted therapies⁴⁰. The variation in DNA content within aneuploid tumors and the evolution of clonal populations with highly aberrant CNV patterns may provide tumor cells with multiple copies of given heterogeneous variants. Discriminating those variants, either germ line or somatic, that become fixed in a tumor can establish genotypes for BRCA and other clinically relevant genes. Notably, our approach confirmed that the pathogenic BRCA2 variants were fixed within the multiple biopsies and ploidies profiled for each BRCA^mut tumor by shared interstitial deletions on chromosome 13 that spanned the BRCA2 locus (Figs. 1, 2). Thus, we propose that our analysis of flow sorted biopsies provides the high-resolution detection of variants and the discrimination of those that are fixed from those that co-occur with wild type alleles necessary for the advancement of precise targeted therapies.

Despite the extent and variations in CNVs, the ploidies of the tumor populations in both BRCA mutant cases were relatively stable by our flow cytometry measures of total DNA content. Strikingly the one example of ploidy variation, the co-existing 3.2 N and 3.6 N populations in PS13-1750 biopsy A3, had the highest numbers (347 and 102) of unique variants (Fig. 2A). This dramatic “big bang”-like effect is further highlighted by the separation of these two populations from the remaining 9 sorted populations and each other in our PCA results (Fig. 3B). Notably 19 of the 99 unique variants in the 3.6 N ploidy were fixed and targeted non-coding regions of genes with well characterized roles in breast cancer and a variety of tumors including ATR, CHD3, RPS6KA2, and IRF3 in PS13-1750 (Fig. 3C). The presence of these shared fixed non-coding variants suggests that regulatory elements within these genes may provide select targets in the evolution of these and possibly other BRCA related tumors.

In contrast to the BRCA2^mut tumors the 18 primary and LN biopsies from the BRCA^WT PS13-585 tumor fell into 6 distinct ploidy groups. The genomes of each population had highly aberrant but homogenous CNV profiles, characterized by SARC amplification and homozygous deletions in ROBO1 and ROBO2 (Fig. 4C, Supplemental Figs. 7, 8). The mutational landscape was defined by three shared fixed pathogenic variants, TP53^V172D, PIK3CA^H1047R, and NF1^D301N, arising in a heterogeneous background of variants including a cluster of 144 that were restricted to a single 5.8 N population (Fig. 4A). Notably one of these, NF1^D301N, is a predicted neoepitope that would provide a highly favorable target for the development of a personalized immune therapy (Table 1). In addition, the identification of a co-existing shared fixed variant, XPO4^E140X, suggests a novel role for this mediator of the transport of proteins and other cargo between the nuclear and cytoplasmic compartments in breast cancer pathogenesis.

Parallel or linear evolution?

The genomes of the two BRCA2 mutant tumors displayed elevated numbers of often heterogeneous CNVs that are typical of the HRD phenotype associated with loss of BRCA function. Strikingly, the ploidy within each tumor was relatively stable suggesting that dissemination of the aneuploid populations preceded many of the unique CNVs. In contrast, the large number and variety of somatic lesions detected in PS13-585 were highly conserved within the 6 different ploidies present within the primary tissue and adjacent nodes. The genomes have a high degree of shared aberrations including deletions, amplicons, breakpoints, and mutations. The co-occurrence of these multiple lesions in each aneuploid genome suggests that they evolved from a single progenitor present within the primary tumor and were independent of nodal dissemination and pathologic staging (pN3c). Consequently, changes in ploidy resulted in parallel evolution of both primary and lymph node clonal populations. This later evolution involved driver aberrations within the primary and selection for fixation of existing and de novo variants. One of the fixed shared variants in PS13-585 was TP53^V172D. Loss of function of TP53 is associated with the appearance of aneuploid populations, as measured by DNA content flow cytometry, in esophageal neoplasias^3,41,42. The presence of multiple aneuploid populations with highly aberrant but stable genomes throughout PS13-585 contrasts with the uniform ploidies and unstable CNV pattern of the two BRCA2^mut cases.

Clonal heterogeneity and clinical markers

Our results highlight that, strikingly, tumors with aberrant genomes characterized by multiple regions of high-level copy number gains, homozygous deletions, and distinct sets of driver mutations can also maintain stable ploidies and fixed variants. Furthermore, our systematic interrogation of these tumors contradicts the heterogeneous staining observed with established IHC and FISH clinical markers for ER, PR and ERBB2 and highlights the variety of fixed and shared genomic lesions in these three distinct breast cancer cases. Notably, we propose that discriminating fixed variants will improve the identification of clinically significant CNVs and SNVs even within a single biopsy of a heterogeneous tumor.

This study was limited to three breast cancers with samples obtained at time of surgery. Future studies combining flow sorting and genomic analyses of additional breast cancers, as well as other solid tissue tumors will identify clinically relevant fixed variants and the sources of clonal evolution (e.g. ploidy, CNVs, SNVs) in diverse genetic backgrounds. Of significant interest will be applying our methods to clinical subtypes that currently lack effective targeted therapies such as triple negative breast cancer (TNBC). In addition, an outstanding question is how efficiently fixed clinically relevant variants can be identified in single biopsies, notably those arising in advanced metastatic cases and during the emergence of resistance to targeted agents.

Our findings describe highly aberrant genomes in each case and show that tumor heterogeneity can arise through variations in ploidies, allele frequencies, and copy number aberrations. Furthermore, despite the variable sources of genomic instability and heterogeneous results for established clinical markers, our rigorous interrogation of flow sorted tumor populations distinguishes fixed variants that are shared and unique in tumor populations within each tumor. We propose that this approach will enable the identification of neoepitopes that can be exploited for the development of effective vaccines to complement therapies for heterogeneous tumors.

Materials and methods

Clinical samples

All samples were fresh frozen at time of surgery and obtained under approval from the Mayo Clinic Institutional Review Board prior to undertaking this study (IRB protocol 08-006579). Informed consent was obtained from all patients. All breast cancers underwent central pathologic review including Nottingham histologic scoring and were evaluated by IHC for estrogen receptor (ER), progesterone receptor (PR), and by IHC +/− FISH for Her2/neu under CLIA/CAP guidelines. All research conformed to the Helsinki Declaration (https://www.wma.net/policies-post/wma-declaration-of-helsinki-ethical-principles-for-medical-research-involving-human-subjects/).

Flow cytometry

Biopsies were minced in the presence of NST buffer and DAPI according to published protocols^3,42,43. Prior to sorting each sample was filtered through a 35 µm mesh and collected into a 5 ml polypropylene round bottom tube. The mesh was rinsed with 750 µl of NST/10% fetal bovine serum and placed on ice while processing remaining samples. The total volume in the tube for each sample was approximately 1.5 ml. An equal volume of 20 µg/ml DAPI was added to each tube to achieve a final concentration of 10 µg/ml DAPI for flow sorting with a BD Influx cytometer with ultraviolet excitation (Becton–Dickinson, San Jose, CA, USA). The optimal settings for sorting samples with the Influx sorter were as follows: Drop formation was achieved with piezzo amplitude of 4–7 V and a drop frequency of 29.4–29.6 kHz. The sort mode was set to “one drop pure” mode with a drop delay of 31.5–32. Sheath fluid pressure was typically 17.2–17.8 psi with a 100 µm nozzle tip. For single parameter DNA content assays DAPI emission was collected at > 450 nm. DNA content and cell cycle were then analyzed using MultiCycle (Phoenix Flow Systems, San Diego, CA, USA).

For single nucleus sorting a 96 well plate sort is done using 100 µl nozzle tip and 20 psi with a frequency 29.4 kHz. Nuclei are sorted in a “one drop single” mode (high purity and high recovery of the sample) at a rate of 1500–2000 events per second to maintain purity of ≥ 98% and intactness of sorted material. Population distribution analysis and phase reproducible measurements for each sorting assay were based on 10,000–20,000 acquired events. To assure sorting single nucleus into each well of a 96 well plate both coincident particles and the position of particles inside droplets of the flow stream were monitored. Coincidence occurs when two or more particles are closer than the spacing of the droplets, so that more than one particle ends up in a single droplet.

96 well sort and plate alignment

The WDU (Well Deposition Unit) of the Influx is aligned to the TempPlate semi-skirt 0.2 ml, 96 well PCR plate. A test sort drop is distributed to align the stream with the center of the 96 well plate. The step is repeated several times until the test drop hits the center of the well to ensure accurate distribution of the sorted material. Then a test drop with 50 events is sorted into all wells of the 96 well plate to insure there are no skipped wells or inaccurate cell number. Sorts are done directly into a pre-filled, aligned 96 well plate with 4 µl of PBS sc. For the negative control (NC) an empty drop is sorted into the first 4 wells of the last column of the 96 well plate. Positive controls (PC) of 1000 nuclei of the population of interests are sorted into the remaining 4 wells of the last column of the 96 well plate. To avoid cross contamination during the subsequent sort the entire last column containing the NC and PC controls is covered with the adhesive sealing foil. The single nucleus is sorted into the remaining wells of the 96 well plate. After the sort is done the plate is spun down for 60 s at 900×g. The samples in each plate are then processed for whole genome amplification using the REPLI-g UltraFast Kit from QIAGEN according to supplier’s instructions.

DNA extraction

DNA from sorted nuclei was extracted using an amended protocol from QIAamp^® DNA Micro Kit from Qiagen (Valencia, CA, USA). Briefly each sorted sample was suspended in 180 µl buffer ATL and 20 µl proteinase K (20 mg/ml) then incubated for 3 h at 56 °C for complete lysis. Samples were bound and washed according to QIAamp^® DNA Micro Kit instructions, eluted into 50 µl of H₂O, then precipitated overnight with 5 µl 3 M sodium acetate and 180 µl 100% EtOH. Each sample was then centrifuged for 30 min at 20,000×g, washed in 1 ml of 70% EtOH for 30 min at 20,000×g. The samples were carefully decanted, and the DNA pellet was dried by speed vacuum then resuspended in a small volume (e.g. 10–50 µl) of H₂0 for final concentrations suitable for accurate quantitation.

aCGH analysis

DNAs were treated with DNAse 1 prior to Klenow-based labeling. High molecular weight reference templates were digested for 30 min while the smaller fragmented FFPE-derived DNA samples were digested for only 1 min. In each case 1 µl of 10× DNase 1 reaction buffer and 2 µl of DNase 1 dilution buffer were added to 7 µl of DNA sample and incubated at room temperature then transferred to 70 °C for 30 min to deactivate DNase 1. Sample and reference templates were then labeled with Cy-5 dUTP and Cy-3 dUTP respectively using a BioPrime labeling kit (Invitrogen, Carlsbad, CA, USA) according to our published protocols³¹. All labeling reactions were assessed using a Nanodrop assay (Nanodrop, Wilmington, DE, USA) prior to mixing and hybridization to CGH arrays (Agilent Technologies, Santa Clara, CA, USA) for 40 h in a rotating 65 °C oven. All microarray slides were scanned using an Agilent 2565C DNA scanner and the images were analyzed with Agilent Feature Extraction version 10.7 using default settings. The aCGH data was assessed with a series of QC metrics then analyzed using an aberration detection algorithm (ADM2)⁴⁴. The latter identifies all aberrant intervals in each sample with consistently high or low log₂ ratios based on the statistical score derived from the average normalized log₂ ratios of all probes in the genomic interval multiplied by the square root of the number of these probes. This score represents the deviation of the average of the normalized log₂ ratios from its expected value of zero and is proportional to the height h (absolute average log₂ ratio) of the genomic interval, and to the square root of the number of probes in the interval. All aCGH data in this paper have been deposited at the National Center for Biotechnology Information Gene Expression Omnibus (GSE172262).

Read processing and mapping

We stripped all BAM format reads from all samples using the STRIP_READS module in XYalign with the parameter “—chromosomes ALL”⁴⁵. This module wraps uses SAMtools⁴⁶ and repair.sh and shuffle.sh in the BBTools suite⁴⁷ to strip, sort, and fix pairing information for reads by read group and output FASTQ files. As all samples were derived from females and the sequence similarity between the human X and Y chromosomes can affect variant calling⁴⁵, we then used the PREPARE_REFERENCE module in XYalign to generate XX reference genomes for the hg38 human genome assemblies⁴⁸, in which the Y chromosome was hardmasked. For all subsequent steps, we independently mapped reads, processed alignments, called variants, and conducted analyses on the modified hg38 (hg38_XX) assembly.

For each sample, we then mapped reads using BWA MEM⁴⁹, marked duplicates using SAMBLASTER⁵⁰, fixed pairing information with SAMtools fixmate, sorted alignment files by coordinate using SAMtools sort, and indexed sorted BAM files with SAMtools index⁴⁶. Then, following the GATK Best Practices workflow^51,52, we used GATK⁵³ to recalibrate base quality scores and realign around indels. For both processes, we used the Mills_1000G Gold Standard Indels from the Broad Institute’s resource bundle⁵¹. For base quality score recalibration, we additionally included dbSNP VCF files⁵⁴—(version 146 for hg38_XX)—also from the Broad Institute’s resource bundle⁵¹.

Somatic mutation analysis

We used VarScan version 2.3.9 to call variants with the following thresholds: minimum coverage of 10, minimum variant allele frequency of 0.08, and somatic p value of 0.05⁵⁵. We applied the command somaticFilter from VarScan to remove clusters of false positives and SNV calls near indels. We also applied the false positive filter from VarScan called fpfilter. Variants were annotated using variant effect predictor (VEP) version 86⁵⁶. We used the SNPRelate package in R to perform the Principal Component Analysis using the snpgdsPCA function⁵⁷. We used the library upsetR in R to generate the upset plots for visualizing the overlap in somatic mutations⁵⁸. For the fixed somatic mutations analysis, we identified somatic mutations that are fixed by subsetting the mutations where the genotype is 1/1.

HLA typing

We used HLA-LA for HLA typing⁵⁹). Because HLA-LA implements a graph alignment model that requires the reads being mapped to the reference genome supported by HLA-LA. At the time of analysis, the 1000 Genome version of GRCh38 was supported. Therefore, for HLA typing purposes, we remapped the stripped reads to the 1000 Genome version of GRCh38. We then used the mapped BAM file to perform HLA typing using HLA-LA.

Neoepitope prediction

We generated peptides consisted of 21 amino acids long using pVACseq⁶⁰. Because Wells et al. (2020)⁶¹ found that neoepitopes can be predicted by using the following thresholds: binding affinity less than 34 nM, binding stability greater than 1.4 h, and tumor abundance greater than 33 TPM, we based our predictions of potential neoepitopes based on these criteria. Because we did not have whole transcriptome data, we only used the binding affinity and binding stability criteria. We computed binding affinity using artificial neural networks in netMHCpan⁶². We defined a peptide to be a potential neoepitopes if its binding affinity is less than 34 nM⁶¹.

Data and code availability

We implemented the complete assembly and analysis pipeline, containing all steps described above, in Snakemake⁶³ and used Bioconda⁶⁴ to manage software. All files required to run this pipeline, including the complete conda environment (with program and package version numbers), are hosted on two Github repositories: (1) for exome processing and assembly (https://github.com/thw17/Mayo_breast_cancer_heterogeneity_assembly) and (2) for somatic variant calling and downstream analyses (https://github.com/SexChrLab/Mayo_breast_regional_heterogeneity).

References

Gerlinger, M. et al. Genomic architecture and evolution of clear cell renal cell carcinomas defined by multiregion sequencing. Nat Genet 46, 225 (2014).
Article CAS PubMed PubMed Central Google Scholar
Gerlinger, M. et al. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N. Engl. J. Med. 366, 883–892 (2012).
Article CAS PubMed PubMed Central Google Scholar
Maley, C. C. et al. Genetic clonal diversity predicts progression to esophageal adenocarcinoma. Nat. Genet. 38, 468–473 (2006).
Article CAS PubMed Google Scholar
Boutros, P. C. et al. Spatial genomic heterogeneity within localized, multifocal prostate cancer. Nat. Genet. 47, 736–745 (2015).
Article CAS PubMed Google Scholar
Yates, L. R. et al. Subclonal diversification of primary breast cancer revealed by multiregion sequencing. Nat. Med. 21, 751–759 (2015).
Article CAS PubMed PubMed Central Google Scholar
Shah, S. P. et al. The clonal and mutational evolution spectrum of primary triple-negative breast cancers. Nature 486, 395–399 (2012).
Article ADS CAS PubMed Google Scholar
de Bruin, E. C. et al. Spatial and temporal diversity in genomic instability processes defines lung cancer evolution. Science 346, 251–256 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Alizadeh, A. A. et al. Toward understanding and exploiting tumor heterogeneity. Nat. Med. 21, 846–853 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bedard, P. L., Hansen, A. R., Ratain, M. J. & Siu, L. L. Tumour heterogeneity in the clinic. Nature 501, 355–364 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
McGranahan, N. et al. Clonal status of actionable driver events and the timing of mutational processes in cancer evolution. Sci. Transl. Med. 7, 283ra54 (2015).
Article PubMed PubMed Central Google Scholar
Woo, J. W. et al. The updated 2018 American Society of Clinical Oncology/College of American Pathologists guideline on human epidermal growth factor receptor 2 interpretation in breast cancer: comparison with previous guidelines and clinical significance of the proposed in situ hybridization groups. Hum. Pathol. 98, 10–21 (2020).
Article CAS PubMed Google Scholar
Allison, K. H. et al. Estrogen and progesterone receptor testing in breast cancer: ASCO/CAP guideline update. J. Clin. Oncol. 38, 1346–1366 (2020).
Article PubMed Google Scholar
Lehmann, B. D., Pietenpol, J. A. & Tan, A. R. Triple-negative breast cancer: molecular subtypes and new targets for therapy. Am. Soc. Clin. Oncol. Educ. Book 35, e31–e39 (2015).
Article Google Scholar
Narod, S. A. BRCA mutations in the management of breast cancer: the state of the art. Nat. Rev. Clin. Oncol. 7, 702–707 (2010).
Article CAS PubMed Google Scholar
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: the next generation. Cell 144, 646–674 (2011).
Article CAS PubMed Google Scholar
Barrett, M. T. et al. Molecular phenotype of spontaneously arising 4N (G2-tetraploid) intermediates of neoplastic progression in Barrett’s esophagus. Cancer Res. 63, 4211–4217 (2003).
CAS PubMed Google Scholar
Tsai, J. H. et al. Association of aneuploidy and flat dysplasia with development of high-grade dysplasia or colorectal cancer in patients with inflammatory bowel disease. Gastroenterology 153, 1492 (2017).
Article PubMed Google Scholar
Nik-Zainal, S. et al. The life history of 21 breast cancers. Cell 149, 994–1007 (2012).
Article CAS PubMed PubMed Central Google Scholar
Notta, F. et al. A renewed model of pancreatic cancer evolution based on genomic rearrangement patterns. Nature 538, 378–382 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Galanty, Y. et al. Mammalian SUMO E3-ligases PIAS1 and PIAS4 promote responses to DNA double-strand breaks. Nature 462, 935–939 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Rennstam, K. et al. Numb protein expression correlates with a basal-like phenotype and cancer stem cell markers in primary breast cancer. Breast Cancer Res. Treat. 122, 315–324 (2010).
Article CAS PubMed Google Scholar
Tu, X. et al. ATR inhibition is a promising radiosensitizing strategy for triple-negative breast cancer. Mol. Cancer Ther. 17, 2462–2472 (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, W. & Mills, A. A. Architects of the genome: CHD dysfunction in cancer, developmental disorders and neurological syndromes. Epigenomics 6, 381–395 (2014).
Article CAS PubMed Google Scholar
Serra, V. et al. RSK3/4 mediate resistance to PI3K pathway inhibitors in breast cancer. J. Clin. Investig. 123, 2551–2563 (2013).
Article CAS PubMed PubMed Central Google Scholar
Bernardo, A. R., Cosgaya, J. M., Aranda, A. & Jimenez-Lara, A. M. Pro-apoptotic signaling induced by Retinoic acid and dsRNA is under the control of Interferon Regulatory Factor-3 in breast cancer cells. Apoptosis An Int. J. Program. Cell Death 22, 920–932 (2017).
Article CAS Google Scholar
Tian, M. et al. IRF3 prevents colorectal tumorigenesis via inhibiting the nuclear translocation of beta-catenin. Nat. Commun. 11, 5762 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Zender, L. et al. An oncogenomics-based in vivo RNAi screen identifies tumor suppressors in liver cancer. Cell 135, 852–864 (2008).
Article CAS PubMed PubMed Central Google Scholar
Lipowsky, G. et al. Exportin 4: a mediator of a novel nuclear export pathway in higher eukaryotes. EMBO J. 19, 4362–4371 (2000).
Article CAS PubMed PubMed Central Google Scholar
Liang, X. T. et al. Decreased expression of XPO4 is associated with poor prognosis in hepatocellular carcinoma. J. Gastroenterol. Hepatol. 26, 544–549 (2011).
Article CAS PubMed Google Scholar
Phung, T. N. et al. Unique genomic and neoepitope landscapes across tumors: a study across time, tissues, and space within a single lynch syndrome patient. Sci. Rep. 10, 12190 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Sansregret, L. & Swanton, C. The role of aneuploidy in cancer evolution. Cold Spring Harb. Perspect. Med. 7, a028373 (2017).
Article PubMed PubMed Central CAS Google Scholar
Barrett, M. T. et al. Evolution of neoplastic cell lineages in Barrett oesophagus. Nat. Genet. 22, 106–109 (1999).
Article CAS PubMed PubMed Central Google Scholar
Lai, L. A. et al. Increasing genomic instability during premalignant neoplastic progression revealed through high resolution array-CGH. Genes Chromosomes Cancer 46, 532–542 (2007).
Article CAS PubMed Google Scholar
Martinez, P. et al. Evolution of Barrett’s esophagus through space and time at single-crypt and whole-biopsy levels. Nat. Commun. 9, 794 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Barrett, M. T. et al. Genomic amplification of 9p24.1 targeting JAK2, PD-L1, and PD-L2 is enriched in high-risk triple negative breast cancer. Oncotarget 6, 26483–93 (2015).
Article PubMed PubMed Central Google Scholar
Barrett, M. T. et al. Clinical study of genomic drivers in pancreatic ductal adenocarcinoma. Br. J. Cancer 117, 572–582 (2017).
Article CAS PubMed PubMed Central Google Scholar
Barrett, M. T. et al. Clonal analyses of refractory testicular germ cell tumors. PLoS One 14, 0213815 (2019).
Article CAS Google Scholar
Ruiz, C. et al. Advancing a clinically relevant perspective of the clonal nature of cancer. Proc. Natl. Acad. Sci. U S A 108, 12054–12059 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Robson, M. et al. Olaparib for metastatic breast cancer in patients with a germline BRCA mutation. N. Engl. J. Med. 377, 523–533 (2017).
Article CAS PubMed Google Scholar
Maxwell, K. N. et al. BRCA locus-specific loss of heterozygosity in germline BRCA1 and BRCA2 carriers. Nat. Commun. 8, 319 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Galipeau, P. C. et al. 17p (p53) allelic losses, 4N (G2/tetraploid) populations, and progression to aneuploidy in Barrett’s esophagus. Proc. Natl. Acad. Sci. U S A 93, 7081–7084 (1996).
Article ADS CAS PubMed PubMed Central Google Scholar
Rabinovitch, P. S., Longton, G., Blount, P. L., Levine, D. S. & Reid, B. J. Predictors of progression in Barrett’s esophagus III: baseline flow cytometric variables. Am. J. Gastroenterol. 96, 3071–3083 (2001).
Article CAS PubMed PubMed Central Google Scholar
Galipeau, P. C. et al. NSAIDs modulate CDKN2A, TP53, and DNA content risk for progression to esophageal adenocarcinoma. PLoS Med. 4, e67 (2007).
Article PubMed PubMed Central CAS Google Scholar
Lipson, D., Aumann, Y., Ben-Dor, A., Linial, N. & Yakhini, Z. Efficient calculation of interval scores for DNA copy number data analysis. J. Comput. Biol. 13, 215–228 (2006).
Article MathSciNet CAS PubMed MATH Google Scholar
Webster, T. H. et al. Identifying, understanding, and correcting technical artifacts on the sex chromosomes in next-generation sequencing data. Gigascience 8, giz074 (2019).
Article PubMed PubMed Central CAS Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
PubMed PubMed Central Google Scholar
Bushnell, B. BBTools: A Suite of Fast, Multithreaded Bioinformatics Tools Designed for Analysis of DNA and RNA Sequence Data. Joint Genome Institute (2018).
Lander, E. S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
Article ADS CAS PubMed Google Scholar
Li, H. Aligning Sequence Reads, Clone Sequences and Assembly Contigs with BWA-MEM. arXiv [q-bioGN] arXiv (2013).
Faust, G. G. & Hall, I. M. SAMBLASTER: fast duplicate marking and structural variant read extraction. Bioinformatics 30, 2503–2505 (2014).
Article CAS PubMed PubMed Central Google Scholar
Van der Auwera, G. A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protocols Bioinform. 43, 11–10 (2013).
Article Google Scholar
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
Article CAS PubMed PubMed Central Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Sherry, S. T. et al. dbSNP: the NCBI database of genetic variation. Nucl. Acids Res 29, 308–311 (2001).
Article CAS PubMed PubMed Central Google Scholar
Koboldt, D. C. et al. VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res 22, 568–576 (2012).
Article CAS PubMed PubMed Central Google Scholar
McLaren, W. et al. The ensemble variant effect predictor. Genome Biol. 17, 122 (2016).
Article PubMed PubMed Central CAS Google Scholar
Zheng, X. et al. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326–3328 (2012).
Article CAS PubMed PubMed Central Google Scholar
Conway, J. R., Lex, A. & Gehlenborg, N. UpSetR: an R package for the visualization of intersecting sets and their properties. Bioinformatics 33, 2938–2940 (2017).
Article CAS PubMed PubMed Central Google Scholar
Dilthey, A. T. et al. HLA*LA-HLA typing from linearly projected graph alignments. Bioinformatics 35, 4394–4396 (2019).
Article CAS PubMed PubMed Central Google Scholar
Hundal, J. et al. pVAC-Seq: A genome-guided in silico approach to identifying tumor neoantigens. Genome Med. 8, 11 (2016).
Article PubMed PubMed Central CAS Google Scholar
Wells, D. K. et al. Key parameters of tumor epitope immunogenicity revealed through a consortium approach improve neoantigen prediction. Cell 183, 818–34 (2020).
Article CAS PubMed PubMed Central Google Scholar
Jurtz, V. et al. NetMHCpan-4.0: improved peptide-MHC class I interaction predictions integrating eluted ligand and peptide binding affinity data. J. Immunol. 199, 3360–8 (2017).
Article CAS PubMed Google Scholar
Koster, J. & Rahmann, S. Snakemake—a scalable bioinformatics workflow engine. Bioinformatics 34, 3600 (2018).
Article PubMed Google Scholar
Gruning, B. et al. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat. Methods 15, 475–476 (2018).
Article PubMed CAS Google Scholar

Download references

Acknowledgements

Funding for this work was provided by the Ziccarelli Foundation and the Breast Cancer Research Foundation (BCRF). Mayo Clinic Cancer Center is supported in part by an NCI Cancer Center Support Grant 5P30 CA15083-36.

Author information

Authors and Affiliations

School of Life Sciences, Arizona State University, Tempe, AZ, USA
Tanya N. Phung, Timothy H. Webster & Melissa A. Wilson
Center for Evolution and Medicine, Arizona State University, Tempe, AZ, USA
Tanya N. Phung & Melissa A. Wilson
Department of Anthropology, University of Utah, Salt Lake City, UT, USA
Timothy H. Webster
Division of Hematology/Oncology, Department of Internal Medicine, Mayo Clinic, Scottsdale, AZ, USA
Elizabeth Lenkiewicz, Smriti Malasi, Mariacarla Andreozzi & Michael T. Barrett
Department of Pathology and Laboratory Medicine, Mayo Clinic in Arizona, Scottsdale, AZ, USA
Ann E. McCullough
Biodesign Institute, Arizona State University, Tempe, AZ, USA
Karen S. Anderson
Division of General Surgery, Section of Surgical Oncology, Mayo Clinic in Arizona, Phoenix, AZ, USA
Barbara A. Pockaj

Authors

Tanya N. Phung
View author publications
You can also search for this author in PubMed Google Scholar
Timothy H. Webster
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth Lenkiewicz
View author publications
You can also search for this author in PubMed Google Scholar
Smriti Malasi
View author publications
You can also search for this author in PubMed Google Scholar
Mariacarla Andreozzi
View author publications
You can also search for this author in PubMed Google Scholar
Ann E. McCullough
View author publications
You can also search for this author in PubMed Google Scholar
Karen S. Anderson
View author publications
You can also search for this author in PubMed Google Scholar
Barbara A. Pockaj
View author publications
You can also search for this author in PubMed Google Scholar
Melissa A. Wilson
View author publications
You can also search for this author in PubMed Google Scholar
Michael T. Barrett
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.N.P., T.H.W., and M.A.W. performed mutation and neoepitope prediction analyses. E.L., S.M., and M.A. processed tissue and DNA samples and did the flow sorting and CNV analyses. B.A.P. and A.E.M. reviewed all clinical data and samples. K.S.A., B.A.P. and M.T.B. designed the study and reviewed all data. T.N.P., M.A.W. and M.T.B. wrote the paper.

Corresponding author

Correspondence to Michael T. Barrett.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Supplementary Legends.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Phung, T.N., Webster, T.H., Lenkiewicz, E. et al. Unique evolutionary trajectories of breast cancers with distinct genomic and spatial heterogeneity. Sci Rep 11, 10571 (2021). https://doi.org/10.1038/s41598-021-90170-1

Download citation

Received: 29 January 2021
Accepted: 30 April 2021
Published: 19 May 2021
DOI: https://doi.org/10.1038/s41598-021-90170-1

This article is cited by

Genomic landscape of diploid and aneuploid microsatellite stable early onset colorectal cancer
- Yumei Zhou
- Xianfeng Chen
- Michael T. Barrett
Scientific Reports (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.