Genomic profiling of multiple breast cancer reveals inter-lesional heterogeneity

Background Multiplicity in breast cancer is common. Studies on multiple breast cancers have revealed high concordance in biomarker status among individual lesions. However, genomic differences among multiple lesions are not well-established. We aimed to investigate the potential genomic heterogeneity of multiple breast cancer. Methods Twenty-one patients with radiologically and histologically evident multiple breast cancer with similar histology were included. Two lesions from each of the 21 patients were selected, and biomarker status was evaluated for each lesion. Capture-based targeted next-generation sequencing was performed using a cancer gene panel consisting of 170 genes. Results We identified discordance in intrinsic subtype in 2 (10%) of the 21 patients. Pathogenic mutations were detected in 13 of the 21 patients, of whom 11 shared oncogenic variants in the two lesions. The remaining two patients yielded different mutation results for TP53, ATM, and PIK3CA. Difference in copy number alteration was observed in 7 (33%) of the 21 patients including ERBB2 (n = 2), FGFR1 (n = 2), and FGFR2 (n = 1) genes. Conclusion Despite similar histologic features of the individual lesions, inter-lesional genomic difference was identified in more than one-third of the patients. Inter-lesional genomic heterogeneity needs to be considered when performing a genomic test in multiple breast cancers.


BACKGROUND
Multiple (multifocal or multicentric) breast cancers are a relatively frequent entity with a reported incidence ranging from 6 to 60% in the literature. [1][2][3][4][5] A multifocal breast cancer usually refers to two or more separate tumours located in the same quadrant while a multicentric breast cancer denotes to two or more separate tumours occupying more than one quadrant of the same breast. 5 However, there exists no international consensus on the definition of multifocality or multicentricity, and the distinction is often difficult. Multiple breast cancers are associated with increased regional lymph node metastasis as well as unfavourable prognosis compared to unifocal breast cancers. 3,4 Considering the more aggressive biologic behaviour of multiple breast cancer, it is recommended to document the multiplicity by placing the (m) modifier for the T category in the current American Joint Committee on Cancer manual. 6 Since multiple breast cancers are known to share similar histologic features and biomarker status, 7,8 current guidelines allow performing oestrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) immunohistochemistry (IHC) in the largest tumour as representative of the multiple tumours. 9,10 Breast cancer is a genetically heterogeneous disease, 11 and recent advances in single-cell sequencing and liquid biopsy have provided insight into genomic heterogeneity among subclonal tumour cell populations during disease progression. 12,13 However, genomic differences among multiple lesions are not well-established. A recent study reported that genomic heterogeneity was common among different lesions of multiple breast cancer. 14 In the era of precision medicine, the number of molecular alterations possessing potential clinical utility is steeply rising; a genomic test using the next-generation sequencing (NGS) technique can be rapidly applied in the clinic. Adequate sampling of a tumour is a prerequisite for genomic testing, and it is usually retrieved from a metastatic tumour to account for genetic evolution during metastatic progression. However, in patients with metastatic breast cancer at inaccessible sites, it is inevitable to perform a genomic test on the primary tumour. In multiple breast cancers, inter-lesional genomic heterogeneity can complicate treatment decisions based on genomic information. 15 In this study, we investigated inter-lesional genomic differences in multiple breast cancers with similar histology to provide a guide on genomic testing in multiple breast cancer.

Patient selection and data collection
Patients with primary multiple breast cancer who received surgery at Seoul National University Bundang Hospital between 2009 and 2012 were included. The inclusion criteria were as follows: (1) no neoadjuvant chemotherapy, (2) grossly and radiologically separate lesions, (3) histologically confirmed separate lesions with at least 5 mm interval in between, (4) size of individual lesions at least 5 mm, and (5) same histology and grade. The distinction between multifocal and multicentric breast cancers was impracticable. Twenty-one patients with multiple breast cancer who met the inclusion criteria were finally chosen for this study. We selected two lesions (tumours #1 and #2) from each patient after reviewing the haematoxylin and eosin (H&E)-stained slides. Tumour #1 represented the largest tumour, and tumour #2 represented the second-largest tumour. We collected the following data: size of each tumour, number of individual tumours, distance between the tumours, lymph node status, histologic grade, presence of ductal carcinoma in situ, extensive intraductal component, and lymphovascular invasion after reviewing the pathology reports and H&E-stained slides. This study was approved by the Institutional Review Board (IRB) at Seoul National University Bundang Hospital (IRB No. B-1902-522-301). Informed consent was waived since this study was a retrospective study using archival paraffin blocks and the samples were anonymised for this study.
Immunohistochemistry and definition of breast cancer subtypes All of the 21 patients had standard biomarker information including ER, PR, and HER2 status, and Ki-67 for the largestindex tumour (tumour #1); however, IHC had not been performed for tumour #2. Thus, we used the following antibodies that had been used for evaluation of tumour #1 and performed IHC for DNA library preparation and next-generation sequencing Deep targeted DNA sequencing was performed using a cancerrelated gene panel consisting of 170 widely known cancerassociated genes (Supplementary Table S1). Tumour-rich areas were marked for manual macro-dissection. The tumour purity ranged from 50 to 90% (Supplementary Table S2). Genomic DNA was extracted from each of the formalin-fixed, paraffin-embedded tissue samples using the QIAamp® DNA FFPE Tissue kit (Qiagen, Hilden, Germany), and 200 ng of DNA was used for library generation. DNA library preparation and target enrichment by the hybrid capture method were performed according to Illumina's standard protocol using Agilent SureSelectXT Target Enrichment Kit (Agilent Technologies, Santa Clara, CA). Target region bases were sequenced for each sample using the HiSeq 2500 system (Illumina, San Diego, CA), achieving an average coverage depth ×715 (Macrogen Inc., Seoul, Republic of Korea).
Sequence alignment and variant calling Adapter sequences were removed from the raw sequencing reads by cutadapt. 17 Trimmed reads were aligned to the reference genome (GRCh37/hg19) using Burrows−Wheeler Aligner-MEM (BWA-MEM). 18 Poorly mapped reads with a mapping quality (MAPQ) below 20 were removed using Samtools version 1.3.1. 19 Somatic mutations including single nucleotide variants (SNV) and small insertions and deletions (INDELs) were detected by the MuTect2 algorithm. 20 All the variants were annotated using SnpEff & SnpSift v4.3i 21 with dbNSFP v2.9.3. 22 Single nucleotide variant and copy number analysis Since matched germline samples were not available, we made efforts to reduce the effect of false-positive variants and germline contamination. The following criteria were used to filter out the less significant variants and narrow down to the clinically relevant variants: (i) variants <3% allele frequency and <100× read depth at the variant were excluded; (ii) variants with an allele frequency greater than 0.1% in the Exome Aggregation Consortium (ExAC) East Asian database were excluded; 23 (iii) all synonymous, intronic, 30-and 50 untranslated region (UTR) variants were excluded, and (iv) variants previously reported to be benign or likely benign in the ClinVar (2017-06 release) archive 24 were excluded. Finally, pathogenic mutations were selected referring to COSMIC. 25 We performed copy number alteration analysis using targeted sequencing data through methods developed in-house that were adapted and modified from CNVkit package. 26 Copy number variation (CNV) data were exported as log 2 ratio values. In case of log 2 ratio > 0.5 of FGFR1, FGFR2, and ERBB2 genes, in situ hybridisation was performed. In other genes, log 2 ratio > 1 (estimated gene copies > 4) was regarded amplification.
Silver in situ hybridisation HER2 silver in situ hybridisation (SISH) assays were performed with INFORM HER2 DNA and Chromosome 17 probes (Ventana Medical Systems) using an ultraView SISH Detection Kit (Ventana Medical Systems) as previously described. 27 After scanning the whole section, at least 50 cells were evaluated in each case, and HER2 status was determined according to the updated 2013 ASCO/CAP guidelines. 28 HER2-equivocal cases were regarded as HER2 non-amplified.
Florescence in situ hybridisation Four-micrometer deparaffinised section was incubated in pretreatment solution (Abbott Molecular, Downers Grove, IL) at 80°C for 40 min and then in protease solution (Abbott Molecular) for 40 min at 37°C. Fibroblast growth factor receptor 1 (FGFR1) FISH was performed with locus-specific bacterial  (Fig. 1). The distance between the tumour pairs showing inter-lesional mutation heterogeneity was 3.5 and 4 cm. Although we did not perform statistical analysis due to the small number of cases, these two tumour pairs tended to have a greater inter-lesional distance compared to the other tumour pairs. The inter-lesional distances for 21 cases are shown in Supplementary Table S2. We performed Sanger sequencing to confirm the discordant point mutations (PIK3CA, ATM, and TP53) in each lesion of patients 3 and 8. However, we failed to confirm these mutations on Sanger sequencing probably due to their low mutation frequencies.

Comparison of point mutations in two lesions
Comparison of copy number alterations in two lesions The copy number alterations in the 21 patients are summarised in Table 3. Copy number alteration was detected in 12 patients.
Genomic heterogeneity of copy number alteration in tumour #1 and tumour #2 was observed in 7 (33%) of 21 patients including ERBB2 in two patients (patients 9 and 19), FGFR1 in two patients (patients 3 and 13), and FGFR2 in one patient (patient 4). We performed ISH to confirm the results of ERBB2, FGFR1, and FGFR2 status in the two lesions, and we confirmed amplification in either tumour #1 or tumour #2. In an amplified tumour, the average copy number of ERBB2 was 5.7 (with HER2/CEP17 ratio of 3.2) in patient 9 (Fig. 1), and 8.7 in patient 19. The average copy number of FGFR1 was 9.1 in patient 3 (Fig. 2), and 16.3 in patient 13. The average copy number of FGFR2 was 25.8 in patient 4 (Fig. 3).
Notably, one FGFR1-amplified case (patient 13) and one FGFR2amplified case (patient 4) showed high-level amplification in the smaller tumour rather than the largest tumour. In patient 9, SMO, NTRK3, IDH2, IGF1R, and CDK12 were also amplified in tumour #1 in addition to ERBB2 amplification (Fig. 1). In patient 1, CCND1 amplification was identified solely in tumour #1. In patient 10, CCND1 and TSC2 amplification were identified only in tumour #1. The distance between the tumour pairs showing inter-lesional copy number heterogeneity was between 0.5 and 3.5 cm (Supplementary Table S2).

DISCUSSION
Multifocal or multicentric breast cancers are relatively common, and they tend to share similar histology and biomarker status. However, genomic inter-lesional heterogeneity in multiple breast cancers is not well-documented. In this study, we compared genetic variations in two representative lesions with similar histologic features from 21 multiple breast cancers.
Our study showed that mutations across individual lesions from the same patient were highly concordant: it was the discordance in copy number alteration that was more frequent. Copy number status was different in 7 of 21 patients. The discordant genes  included clinically relevant genes such as ERBB2, FGFR1, and FGFR2. HER2 overexpression and/or ERBB2 amplification is established as the sole predictive marker for treatment benefits from HER2-targeted therapy. As for FGFR, accumulating evidence supports FGFRs as a new therapeutic target for breast cancers, especially for high-level FGFR1-and FGFR2-amplified cancers. 29,30 However, in this study, two patients revealed high-level FGFR amplification in the smaller lesion. Our results indicate that performing a molecular test in the largest-index tumour could miss important molecular alterations in the smaller lesions in advanced multiple breast cancers. Data on inter-lesional genomic heterogeneity in multiple breast cancer are limited. Yates et al. discovered the existence of complex admixture of subclones across widely separated foci in four cases of multifocal breast cancer. They observed two distinct PTEN driver mutations from different regions in one patient and a CDK6 amplification localised to only one focus in another patient. 11 Such investigation of sub-clonal structure in multifocal breast cancer showed sub-clonal growth and dissemination during tumour progression. Norton et al. investigated copy number of 80 known cancer genes in all tumour foci from 11 patients with multifocal invasive lobular breast cancer and reported that copy number status was genetically homogenous across different foci in an individual patient. 31 Recently, Desmedt et al. conducted targeted sequencing in different lesions from 36 multifocal breast cancers. 14 Interlesional heterogeneity of oncogenic mutations such as PIK3CA, TP53, GATA3, and PTEN was present in 33% of the cases. Upon copy number analysis in eight patients, inter-lesional differences in PTEN loss and MYC amplification were observed. 14 They also investigated whether inter-lesional heterogeneity was associated with clinicopathologic factors and found that the only association that existed was with inter-lesion distance. 14 The distance between the tumours ranged from 0.3 to 7.0 cm (mean 2.4, median 1.9) in their study. 14 In our study, the distance between tumours was not drastically different, ranging from 0.5 to 4.5 cm (mean 1.7, median 1.0). The two tumour pairs that showed inter-lesional mutation heterogeneity tended to be further apart from each other compared to the other tumours. However, our study is limited in that the number of cases was too small to perform statistical analysis. Large-scale studies are warranted to determine clinicopathologic factors associated with inter-lesional heterogeneity.
In terms of the biology of breast cancer, there is substantial evidence that breast cancer is mainly driven by copy number alterations. 32,33 In their study of 2000 breast tumours, Curtis et al. have observed that copy number alterations accounted for the greatest variability in gene expression. 33 A recent investigation using a novel single-cell genome sequencing method revealed that single-cell copy number profiles were fairly similar, suggesting that this structural variation occurs early on followed by stable clonal expansions to form a tumour mass. 13 In contrast, mutations occurring from defects in DNA p.S127F TP53 p.S127F The discordant results between tumour #1 and #2 are marked with bold letters repair or replication accumulate more gradually over cell divisions. 13 Interestingly, inter-lesional difference in copy number was more prominent than point mutation in the present study. However, the number of patients was too small to draw a conclusion. A large-series study investigating copy number alterations in multiple breast cancers is warranted. Besides genomic heterogeneity between lesions, we identified discrepancies in intrinsic subtype in 2 (10%) of 21 patients. In our study, discordance of subtypes did not affect treatment planning in those two patients. However, inter-lesional HER2 heterogeneity may be associated with poor response to HER2-targeted therapy including trastzumab, as intra-tumoural HER2 heterogeneity. 27 Although the clinical significance of subtype discordance in multiple breast cancer has not been determined yet, recognition of inter-lesional heterogeneity of standard biomarkers such as HER2 may provide additional information on treatment response in those patients.
Our study has several limitations. First, not all copy number alteration and mutation results were validated with other methods. In situ hybridisation was performed for select genes. The discordant point mutations between tumours #1 and #2 had low variant allele frequency. We performed Sanger sequencing to confirm discordant point mutations in both tumours #1 and #2 of patients 3 and 8 but failed to confirm these mutations in both patients. It is probably due to the low sensitivity of Sanger sequencing, as the threshold for detection rests at an allele frequency of approximately 15−20%. 34 However, we presume that these were true mutations considering there was no artefact in other genes and those were alleged as hot spot mutations. Second, instead of investigating inter-lesional heterogeneity thoroughly from all lesions in an individual patient, we selected only two largest lesions. Lastly, no alterations were detected in four patients but the tumour purity of those cases was between 70 and 90%. Thus, it was probably due to the limitations pertaining to the targeted gene panel. The number of genes covered in the present study was relatively small. Of the 170 genes in our panel, 134 genes overlapped with those from Desmedt et al.'s panel. 14 Furthermore, genome-wide comparison between lesions was not available with the targeted gene panel. In further studies, genomic investigation of multiple invasive cancers and the associated carcinoma in situ in the whole genome level may provide insight into clonal evolution and timing of divergence in multiple breast cancers. Prospective integration of genomic studies and clinical trials is also warranted. 11,35 In conclusion, inter-lesional genomic heterogeneity, particularly of copy number alteration, was identified in a substantial number of multiple breast cancers. In precision medicine, interlesional genomic heterogeneity should be considered in Genomic profiling of multiple breast cancer reveals inter-lesional. . . S Ahn et al.