Abstract
Encapsulated papillary carcinoma (EPC) of the breast is a rare subtype of tumor. To date, the genetic abnormalities underlying EPC remain elusive. The purpose of this study was to gain further insight into EPC mutation profile. Forty-one EPCs diagnosed from 2015 to 2018 were included. Twenty-six EPCs were submitted to whole-exome sequencing (WES), and a 185 gene-targeted sequencing panel was designed to validate the results of the 26 EPCs that underwent WES and 15 additional cases. Recurrently mutated genes were further confirmed by Sanger sequencing. Our study revealed multiple recurrently mutated genes including PI3K-AKT-mTOR pathway genes (PIK3CA, AKT1, ULK1, MAP3K1, MAP2K4, RHOA, and PTEN) (27/41, 65.8%) and chromatin modification genes (ZFPM1, GATA3, CTCF, and KMT2C) (21/41, 51.2%) in EPC. Importantly, somatic ZFPM1 mutations existed in 9/41 (21.9%) of the EPCs. The frequency of ZFPM1 mutations in the EPCs was significantly higher than that of other tumor types. Of the nine ZFPM1 mutations, seven were frameshift mutations, and the remaining two were nonsense mutations. Moreover, a significant concurrence of ZFPM1 and PI3K-AKT-mTOR mutations were revealed in the EPCs. Of note, no TP53 mutations were detected in our EPCs, whereas it was detected in a considerable proportion of the luminal A invasive ductal carcinomas of no special type (IDC-NSTs) from TCGA. We reveal that recurrent somatic ZFPM1 mutation is characteristic of EPC and concurred with mutations in the PI3K-AKT-mTOR pathway. The distinctive genetic features of EPC might underlie its special histological structures and indolent behavior.
Similar content being viewed by others
Introduction
Encapsulated papillary carcinoma (EPC) is a rare subtype of breast cancer. EPC can be categorized as either a special invasive carcinoma, based on a lack of peripheral myoepithelial cells in the fibrous tumor capsule (as determined by immunohistochemical staining of p63, SMA, etc.), or as a form of ductal carcinoma in situ, due to its indolent biological behavior and excellent prognosis1,2,3. Some pathologists consider EPC as part of a spectrum of progression from in situ to invasive carcinoma4. Although some studies have compared EPC with in situ and invasive carcinomas at the genomic level, the results have been contradictory2,5.
Large-scale genomic technologies, such as whole-exome sequencing (WES), have been successfully applied to identify many genetic alterations in breast invasive ductal carcinomas of no special type (IDC-NST), leading to great progress in the characterization of its mutational landscape6,7. In addition, specific genetic mutations have been identified in specific breast cancers, such as the IDH2 hotspot mutation (R172) in tall cell carcinoma with reverse polarity of the breast and the CDH1 truncating mutations in invasive lobular carcinoma8,9. Transcriptomic analyses have demonstrated that the expression of cell migration genes in EPC is reduced compared with other types of papillary carcinoma5. Furthermore, some genetic alterations have been identified in EPCs from RNA-sequencing data, including mutations in PIK3CA (H1047R) and AKT1 (L52H)5,10. However, the genetic abnormalities underlying EPC have yet to be well clarified. One of the major reasons for this is the small sample size available in previous studies due to its rarity.
In this study, WES and targeted sequencing (TS) were performed in a relatively large cohort of EPCs to explore the mutational profile and reveal characteristic molecular genetic alterations in EPC.
Results
Clinical features of the study population
All 41 patients were female with a median age of 60.5 years (range: 30–83 years). None of the patients received neoadjuvant therapy. All patients in this cohort were diagnosed with low- to intermediate-grade EPCs. We observed that 60.9% (25/41) of cases were pure EPC, and 39.1% (16/41) of cases were EPC with invasive carcinoma (EPC-IC). In EPC-IC, the majority of accompanied invasive carcinoma were IDC-NST (Grade 1), and mucinous carcinoma was found in only one case. In terms of predominant architecture in EPC, papillary and cribriform structure exhibited in 92.7% (38/41) of cases, whereas solid structure was found in 7.3% (3/41) (Fig. 1). 80.5% (33/41) of patients were available for assessment of regional lymph node status. Only one patient had sentinel lymph node micrometastasis (<1 mm), and the remaining patients did not present lymph node metastasis. Our EPCs were all high ER- and PR- positive but Her-2- negative. The average Ki-67 index was 7.3% (range: 1–25%). Most of EPC could be classified as “luminal-A like” according to St. Gallen International Expert Consensus11. Follow-up data were available for 40 patients. The median follow-up was 36.5 months (range: 5–62 months). There were no local recurrences, distant metastases, or cancer-related deaths during the follow-up period (Supplementary Data 1). The detailed clinicopathological features were listed in Supplementary Data 1.
Somatic mutational landscape of EPCs
WES was performed on paired tumor and germline DNA from 26 patients. All tumor DNA was extracted from fixed paraffin-embedded (FFPE) tissues. Of the 26 patients, germline DNA was extracted from whole-blood in 16 patients and FFPE normal tissues in the remaining 10 patients.
The somatic mutations were listed in Supplementary Data 2. An average of 40 Gb data was generated from each sample. The average depths of targeted exome regions in tumors and matched blood/normal tissues were ×400 (range 342–616) and ×100 (range 88–166), respectively. More than 98.79% of the targeted regions were covered sufficiently for confident variant calling (≥10× depth). For nucleotide substitutions, the predominant types of SNVs in EPCs were C > T/G > A and T > C/A > G transitions and C > A/G > T transversions. Three signatures (A, B, and C) were revealed in EPCs by nonnegative matrix factorization clustering12. Signature B showed the largest contribution, which was found to be similar to Signature 5 (α = 0.83). This signature was featured with T > C substitutions and common in all cancer types. Signature C was similar to Signature 13 (α = 0.83), which was associated to old age and common in invasive breast cancer7. In this cohort, 80% of EPC patients were postmenopausal females (Age > 50). This signature could be in part explained by the age of EPC. Lastly, Signature A was associated with Signature 29 (α = 0.66). The underlying molecular mechanisms of signature A are transcriptional strand bias for C > A mutations and common in individuals with a tobacco chewing habit, while the association between signature A and EPC was unknown (Fig. 2).
Recurrent mutated genes including PIK3CA (13/26 50.0%), ZFPM1 (6/26, 23.0%), GATA3 (4/26, 15.3%), AKT1 (2/26, 7.7%) and CTCF (2/26, 7.7%) were identified by WES (Supplementary Data 2). Among these recurrent mutated genes, ZFPM1 gene mutations have not been reported in breast cancer.
Deep TS (185 gene panel) was performed in 41 paired EPCs (26 WES cases and 15 additional cases). Paired tumor and normal FFPE tissues were used for the additional 15 patients. The mean coverage of the targeted genes was 1137× (range 474–1830) in tumor tissues and 517× (range 40–1356) in paired normal tissues. By TS, 179 somatic mutations, including 27 small coding insertions/deletions (indels), 10 nonsense mutations, 134 missense mutations, and 8 splice-site changes were detected in 41 EPC cases. Of 134 missense mutations, 89 were predicted to have a high probability of pathogenicity by polymorphism phenotyping (PolyPhen) and SIFT algorithm (Supplementary Data 3). We identified multiple frequently somatic mutated genes in EPCs and divided them into several subsets including PI3K-AKT-mTOR pathway genes (PIK3CA, AKT1, ULK1, MAP3K1, MAP2K4, RHOA, and PTEN) (27/41, 65.8%), chromatin modification genes (ZFPM1, GATA3, CTCF, and KMT2C) (21/41, 51.2%), and Cancer Gene Census genes (CGC) (SPEN, CBFB, and ATM) (Fig. 3). Recurrent genes mutations were confirmed by Sanger sequencing (Supplementary Fig. 1).
Characteristics of somatic ZFPM1 mutations in EPCs
Somatic ZFPM1 mutations were identified by WES in six cases (6/26, 23.0%), and an additional 3 tumor samples with ZFPM1 mutations were revealed in 15 additional cases by TS. Ultimately, somatic ZFPM1 mutations existed in nine EPCs (21.9%, 9/41). All mutations were confirmed by Sanger sequencing (Fig. 4 and Supplementary Fig. 4).
The human ZFPM1 gene lies on chromosome 16q24 and encodes a nine-zinc-finger protein (1006 aa) that is highly homologous to murine Fog1, which is an essential cofactor that acts via the formation of a heterodimer with transcription factors of the GATA family: GATA1, GATA2, and GATA3. ZFPM1 is also known as the Friend of GATA-1 (FOG1). In our series, most of ZFPM1 mutations located in exon 5 or exon 10. Of the nine ZFPM1 mutations, 77.8% (7/9) were frameshift mutations, five of which were predicted to cause protein truncation, and two of which were predicted to extend the C-terminus. The remaining two cases were nonsense mutations resulting in an early stop codon (Fig. 5a). As a result, all nine ZFPM1 mutations were predicted to fail in encoding natural C-terminal zinc-finger motifs.
To date, ZFPM1 mutations have not been described in breast cancer studies. In COSMIC database (tissue: breast; n = 2534), fifteen (0.59%, 15/2534) invasive breast carcinomas were found with ZFPM1 mutations. Of the fifteen cases with mutations, the majority were breast IDC-NSTs with ER- and PR-positive immunophenotype, with one case exhibiting papillary architecture histologically. The frequency of ZFPM1 mutations in our EPCs was also significantly higher than that in other tumor types according to the TCGA database, including colon adenocarcinoma, uterine corpus endometrial carcinoma, stomach adenocarcinoma, and invasive breast carcinoma (p < 0.05, Chi-square test; Fig. 5b, Supplementary Data 4). All ZFPM1 mutations identified in EPCs were novel, except for S601* in the TCGA database. These results indicate that recurrent somatic ZFPM1 mutation is characteristic of EPC. Interestingly, recurrent ZFPM1 mutations in EPCs co-occurred with mutations affecting canonical PI3K-AKT-mTOR pathway genes (p = 0.017 for co-occurrence, Fisher’s exact test; Supplementary Fig. 3). Six ZFPM-mutated cases accompanied with PIK3CA hotspot mutations (including two H1047R, three E545K, and one N345K). The remaining three cases harbored concurring ALK1 (E17K), PIK3R1 (Y165fs/G35fs) and ULK1 (H72Q) mutation, respectively.
ZFPM proteins are tightly associated with GATA functions13,14,15,16,17. ZFPM1 plays a vital role in binding with GATA proteins and the function of GATA3 could be changed by the ZFPM1 mutation16,17,18. We found that mutations of GATA3 and ZFPM1 seemed to show a mutually exclusive pattern in our EPCs, although there was no statistically significant (p = 0.659 for mutual exclusivity, Fisher’s exact test; Fig. 6).
Mutations affecting the PI3K-AKT-mTOR pathway in EPCs
In total, 65.8% (27/41) of EPCs harbored at least one somatic nonsynonymous mutation in the PI3K-AKT-mTOR signaling pathway. Nineteen cases (46.3%, 19/41) harbored PIK3CA hotspot mutations, including nine H1047R/L, six E545K, one Q546K, one E542K, one N345K, and one G118D. Activating mutations in AKT1 (E17K) were identified in five EPCs. The tumor suppressor gene PTEN, one of the key regulators of the PI3K-AKT-mTOR pathway, had a frameshift mutation (R308 deletion) in one patient resulting in protein inactivation and downstream signal activation. We also detected dual mutations of PIK3R1 (p85α) (Y165 insertion and G35 deletion) in one patient. PIK3R1 mutations, located in the SH3 and Rho-GAP domains of p85α, are known to block binding of the PTEN–p85α heterodimer and destabilize PTEN19. Activation of MAP3K1 and MAP2K4 represents two contiguous steps within the p38/JNK1 pathway downstream of PI3K-AKT signaling6,20. Two EPCs harbored MAP3K1 truncated mutations, of which one had a concurrent PIK3CA (H1047R) mutation and the other had a concurrent PIK3CA (E545K) mutation.
As downstream genes of PI3K-AKT signaling, ULK1 and RHOA mutations are rarely detected in breast cancer. Two ULK1 missense mutations (H72Q and M836I) were predicted as “probably pathogenic” in EPCs. RhoA could repress mTORC1 signaling by reducing the concentration of Rheb·GTP and was a critical downstream target of mTOR involved in actin cytoskeleton remodeling and cell migration [44.45]. One patient harbored an RHOA hotspot mutation (G17R) concomitant with PIK3CA (E454K).
Germline and somatic mutations in DNA damage response and repair (DDR) genes
We classified mutations in DDR genes from WES results according to ACMG/AMP classification criteria in DDR genes from WES results21. The somatic and germline DDR mutations (“Likely pathogenic” or “Pathogenic”) were listed in Table 1. Five EPC cases harbored DDR gene mutations. All mutation genes were related to homologous recombination repair, including SLX4, ATM, RAD54B, PALB2, and ERCC5 (Fig. 7a). Of the five mutations, three were germline variants and two were somatic. The three germline mutations were classified as “likely pathogenic”, according to ACMG classification criteria. The case with PALB2 germline mutation has a family history of breast cancer. The detail information about patients with germline DDR gene mutations was listed in the Supplementary Table 4.
However, BRCA1, BRCA2, and mismatch repair (MMR) genes (MLH1, PMS2, MSH2, and MSH6) showed neither somatic nor germline mutations in all EPCs. The pathogenic mutation in DDR system may induce a hypermutational phenotype22. The average TMB of mutated cases was 2.66 mutations/Mb, which was significantly higher than that of wild type ones (the average TMB: 0.71 mutations/Mb, p = 0.007, Unpaired t test) (Fig. 7b).
Comparison of EPCs with luminal A IDC-NSTs in molecular genetic features
We compared the molecular features of EPCs with those of luminal A IDC-NST (n = 223, data from The Cancer Genome Atlas, TCGA).
No difference was identified between the two subtypes regarding the TMB (the average TMB of 1.09 [range 0.1–8.17] and 1.02 [range 0.13–6.77] mutations/Mb for EPCs and luminal A IDC-NSTs, respectively; p = 0.99, Unpaired t test, Two-Tailed) (Supplementary Fig. 2). The predominant mutation spectra of luminal A IDC-NSTs were C > T/G > A and T > C/A > G transitions and C > A/G > T transversions, which were also consistent with that of EPCs.
Moreover, we compared the recurrently mutated genes between our EPCs and luminal A IDC-NSTs (Table 2). Abnormalities of the PI3K-AKT-mTOR pathway were prominent in both of subtypes, with PIK3CA being the most commonly mutated gene (46.3% vs. 47.1%, p = 1.000, Chi-square test). Some unique features were observed in our EPC. As for chromatin modifications genes, a significantly higher mutation frequency of ZFPM1 (21.9% vs. 0.0%; p < 0.001, Fisher’s exact test) and CTCF (12.2% vs. 4%; p = 0.048, Chi-square test) were found in EPCs compared with that in luminal A IDC-NSTs. Among PI3K-AKT-mTOR pathway genes, AKT1 hotspot mutation was significantly more frequent in EPCs (12.2% vs. 3.1%; p = 0.024, Chi-square test). Importantly, TP53 somatic mutations were found in none of our EPCs, whereas it was detected in a considerable proportion of luminal A IDC-NSTs (0.0% vs. 12.1%; p = 0.011).
Discussion
In this study, a comprehensive genomic landscape of EPCs was generated by WES and TS analysis. We observed that PI3K-AKT-mTOR genes and chromatin modification genes were frequently mutated in EPC. Most importantly, recurrent somatic ZFPM1 mutations were a characteristic of EPC and concurrent with mutations in the PI3K-AKT-mTOR pathway.
Chromatin modification genes are responsible for various epigenetic regulations that engage gene expression programs. In recent studies, more evidence has demonstrated that inappropriate epigenetic remodeling can also drive tumorigenesis and that chromatin remodeling plays a pivotal role in the early stages of breast pathogenesis23,24. Our study found that recurrent mutations in chromatin modification genes (ZFPM1, GATA3, CTCF, and KMT2C) were detected in 51.2% of EPCs. Of the chromatin modification genes, ZFPM1 gene was found to be the most frequently mutated in EPCs, and the mutation of ZFPM1 gene had not been reported in other breast cancer studies. Moreover, all ZFPM1 mutations in EPC were frameshift or stop-gain mutations and were thereby presumed to cause loss of the protein function. We inferred that ZFPM1 might play a role as a potential tumor suppression gene in EPC, which should be further validated by copy number analysis in the region of ZFPM1 and functional assays.
ZFPM1 protein has been demonstrated to be an important cofactor for GATA proteins in multiple tissues. According to previous studies, the zinc-fingers (ZincF-1, 5, 6, and 9) of ZFPM1 protein are able to directly interact with the N-terminal ZincF of GATA315,16,17,18. All ZFPM1 mutations in our EPC cases were predicted to fail in encoding natural zinc-finger motifs, and thus might impair the interaction with GATA3. On the other hand, all mutated ZFPM1 proteins are predicted to be deleterious in the CTBP-binding region (aa794-800, between ZincF-6 and 7). This region of ZFPM1 protein is able to interact with CTBP protein (C-terminal binding protein, such as CTBP1 and CTBP2) that could promote scaffold interactions between GATA transcriptional factors and chromatin-modifying complexes25. GATA3 transcriptional factor relies on the actions of its partner protein to effect changes in the chromatin structure and thereby modulate gene expression16,17,26. The effects and mechanisms of ZFPM1 mutations on GATA transcriptional factors interaction and downstream biological functions in EPCs in EPCs should be further clarified. In addition, ZFPM1 belongs to the positive regulatory domain (PRDM) gene family, which are involved in human cancer through modulation of several processes, such as epigenetic modifications and genetic reprogramming27. Functional studies on epigenetic modifications and genetic reprogramming should be further explored to clarify the potential role of ZFPM1 mutation in EPC.
As the partner of ZFPM1 protein, GATA3 protein is involved in growth control and the maintenance of luminal epithelial differentiation in mammary tissue. GATA3 mutations were detected in 19.5% of EPCs. GATA3 mutations had been reported to contribute to aberrant transactivation activity and tumorigenesis in ER-positive breast cancers28,29,30. Interestingly, our study found that mutations in ZFPM1 and GATA3 seemed to show a mutually exclusive pattern in EPCs and further indicated that mutations in ZFPM1 and GATA3 might exert common or redundant functional effects.
Apart from chromatin modification genes, we observed an enrichment of alterations in the PI3K-AKT-mTOR pathway in EPC. The frequency of mutations of PIK3CA and AKT1 in EPCs is consistent with previous reports in breast papillary neoplasms31,32. Either elevated PI3K activity as a result of PIK3CA mutations or downstream AKT activation could cause an oncogenic transformation in mammary epithelial cells and the formation of heterogeneous mammary tumors in vivo33,34,35. As downstream genes of PI3K-AKT signaling, ULK1 gene could encode an autophagy initiator protein, and its mutation was detected in 4.8% of our EPCs. ULK1 mutations reduce autophagy-dependent apoptosis directly and induce tumor proliferation and survival by regulating the mTOR-ULK1 signaling axis36,37,38,39,40. Given the concurrent ZFPM1 and PI3K-AKT-mTOR pathway mutations, they might synergistically contribute to the oncogenesis of EPC, and further in vitro functional studies are required.
Given that the majority of our EPCs exhibited luminal A-like phenotype (high hormone receptor expression and low Ki-67 index), we made a comprehensive comparison of molecular genetic features between EPC and luminal A IDC-NSTs for the first time. The somatic mutation load and spectra in our EPCs was similar to luminal A IDC-NSTs. Moreover, frequent mutations in PI3K-AKT-mTOR pathway were detected in both luminal A IDC-NSTs and our EPC cases, and approximately half cases harbored PIK3CA mutations. Albeit sharing remarkably similar mutational features with luminal A IDC-NSTs, EPC showed distinctive genetic features. ZFPM1 mutations were exclusively found in about one-fifth of EPCs, and another chromatin modification gene CTCF was also more frequently mutated in EPCs. Among PI3K-AKT-mTOR mutations, AKT1 activation mutations in our EPCs were also significantly more common than those in luminal A IDC-NSTs, with all mutated EPCs harboring the same mutation of E17K. In addition, EPCs harbored a significantly lower TP53 mutations. These distinctive genetic features of EPC might underlie its special histological structures.
In our cohort, there was no local recurrence, distant metastasis, or cancer-related death during the follow-up period, consistent with the excellent prognosis of EPCs as previously reported1,41. The inactivation of tumor suppression gene TP53 and DDR genes has been associated with poor prognosis and aggressive clinicopathological features, such as distance metastasis22,42,43. No TP53 gene mutation and infrequent DDR gene mutations were detected in our EPC cases. Our results indicated that the excellent prognosis and indolent biological behavior of EPC might in part be attributed to the intact TP53 and DDR pathways.
Our study had some limitations. First, although our study constitutes the largest series of EPCs with NGS data to date, our sample size is limited owing to the rarity of EPCs. Therefore, the frequency of ZFPM1 mutation and DDR germline mutation in the EPCs needs more accurate evaluation in large cohorts. Second, only FFPE tumor tissues were available and WES and targeted region sequencing rather than WGS were conducted in our study. The FFPE mutation signature results should be interpreted with caution, and other alteration in EPC, such as structure variation and noncoding region variation, couldn’t been explored in our study. Third, given that the invasive component in our series was small and might be insufficient for DNA sequencing, we did not micro-dissected the invasive component from the EPC component for further comparison in this study. More EPC cases with sufficient invasive component will be collected for comparing the genetic alterations between EPC and its invasive component in our further study.
In conclusion, we revealed for the first time that somatic ZFPM1 mutations were commonly present and characteristic of EPCs. Moreover, ZFPM1 mutations co-occurred with mutations in the PI3K-AKT-mTOR pathway and tended to be mutually exclusive with GATA3 mutations. Our results clarify the genetic mechanism underlying EPC, shed light on EPC tumorigenesis, and may assist diagnosis and treatment for EPC. Our results also suggest that EPC has an excellent prognosis, which might be attributed to the intact TP53 and DDR pathways.
Methods
Clinical information and diagnostic criteria for samples
EPCs diagnosed from 2015 to 2018 were retrospectively retrieved from the surgical pathology files of Peking Union Medical College Hospital. The histological slides were reevaluated by three experienced pathologists (Z.L., H.W., and X.L.) to verify the diagnosis according to the 2019 WHO classification of tumors of the breast44. The exclusion criteria were as follows: (i) patients with multiple tumors; (ii) patients for which the area of accompanied invasive carcinoma was larger than the area of EPC; and (iii) patients who did not have paired blood/normal tissue. A total of 41 patients were selected for this study. All tumor samples were obtained from surgical resection and fixed in formalin and embedded in paraffin. The study protocol was approved by the Ethical Committee of Peking Union Medical University Hospital. Our study is compliant with the “Guidance of the Ministry of Science and Technology (MOST) for the Review and Approval of Human Genetic Resources”. We got the formal approval for the export of human genetic material or data. The approval ID is 2021BAT1149. The clinicopathological information was listed in Supplementary Data 1.
DNA extraction
For FFPE samples, 5 mm sections were cut, placed onto glass slides, and stained with hematoxylin and eosin (H&E). Tumor samples were screened to ensure the percentage of tumor cells was greater than 80%. DNA extraction from FFPE tissues was performed using the QIAmap DNA FFPE Tissue Kit (Qiagen, 56404). The tumor cells of EPCs were enriched using an H&E-stained section as a template, and tumor areas (only the EPC areas) were dissected using a clean scalpel blade. For whole blood samples, genomic DNA was extracted by the DNeasy Blood & Tissue Kit (Qiagen). The quantity of DNA was analyzed by Qubit (Invitrogen, Qubit2.0), and DNA quality was determined using PicoGreen™ fluorometric analysis as well as visual inspection of agarose gel electrophoresis images. The sample tissue information was shown in Supplementary Table 1.
Whole-exome sequencing and mutation calling
Whole-exome sequencing was performed for 26 patients with matched tumor and normal samples. Sequencing libraries were generated using the Agilent Sure Select Human All Exon V6 Kit (Agilent Technologies, CA, USA) following the manufacturer’s recommendations, and index codes were added to each sample. Tumor/normal paired DNA libraries were sequenced on an Illumina HiSeq X Ten platform, and 150 bp paired-end reads were generated. To minimize the inclusion of sequencing artefacts related to formalin fixation, bioinformatics analysis included quality control, read mapping to the reference sequence, variant calling, identification of candidate somatic variants, and functional annotation. The quality control was conducted: (1) remove reads with sequencing adapter; (2) remove reads with a ratio of N base (N represents indeterminable base information) greater than 10%; (3) the sequencing error rate and percentage of reads with Q30 (the percent of bases with Phred-scaled quality scores greater than 30) were calculated and summarized, and to remove low quality reads (the ratio of base with Q phred≤20 greater than 50% in the read). The sequencing reads were aligned to the reference genome (hg19) using bwa and samblaster, and soft-clipped reads were filtered. Somatic mutation analysis was performed with Mutect. To filter SNVs, filtering parameters including coverage (>10X), variant frequency (>0.08), variant read support (>3 reads) were applied. The pileup files created by SAMtools mpileup for tumor and matched normal samples. Then, the C > T or G > A mutation sites were filtered by the strand bias: (1) Discarded mutant sites that occurred only in plus or minus strand; (2) Reserved mutant sites that the ratios of the number of sites occurred in plus or minus strand within 0.2–5. To make the results more reliable, we also adopted a relative deep sequencing depth. All candidate variants were visually inspected in Integrated Genome Viewer (IGV) (Supplementary Data 2). Tumor mutation burden (TMB) was defined as the number of detected mutations over the region of a tumor genome. TMB was calculated for using coding base substitutions and indel alterations. Rearrangements, fusions, and copy number variants were excluded.
Mutation spectrum and mutation signature analysis
Mutation spectrum and signature analysis were performed in each WES tumor sample. Mutation spectrum bar plot was drawn to present mutation spectrum of each sample. By nonnegative matrix factorization (NMF), we conducted cluster analyses on 96 somatic mutation types and acquired three different mutation signatures. Then, mutation spectra were clustered with 30 known signatures on COSMIC to explain mutation process of samples. Signatures A, B, and C were identified in tumor samples and the distribution of them was presented. The cosine similarity (α) has a value between 0 and 1. Two mutational profiles are identical when the cosine similarity is 1, and independent when the cosine similarity is 0.
Targeted sequencing
To validate the WES results, deep targeted sequencing was conducted. The 26 paired WES samples were then subjected to TS analyses. Moreover, to expand the sample size, 15 additional paired EPC samples were submitted for TS. Finally, sequencing libraries were prepared from 41 patients.
Sequencing probes were designed on the Agilent Technologies website for 185 genes. The panel genes included genes frequently mutated in EPCs, genes likely to be pathogenic in EPCs, genes mutated in intraductal papillary neoplasms, and genes associated with invasive breast carcinoma5,6,45. The panel genes are listed in Supplementary Table 2. Target-enriched libraries were then sequenced on the Illumina HiSeq X Ten platform, and 150 bp paired-end reads were generated. For paired tumor and normal samples, the analysis was performed using the same methods as the exome sequencing analysis. When variant allele frequencies were greater than 8%, MuTect and Strelka were used for calling somatic single nucleotide variants (SNVs) and small indels, respectively. All candidate variants were visually inspected in IGV (Supplementary Data 3). A KEGG pathway and GO analysis was performed based on genes affected by nonsynonymous pathogenic somatic mutations in EPCs (n = 41), and pathways found to be significantly enriched (p < 0.01) were selected46,47.
Sanger sequencing
Sanger sequencing was conducted to validate recurrently mutated genes. Primer information for the Sanger sequencing experiments was provided in Supplementary Table 3. PCR amplification was conducted with Phusion Polymerase (Life Technologies). The PCR program included one cycle at 95 °C for 10 min; 35 cycles at 95 °C for 30 s, 62 °C for 30 s and 72 °C for 1 min; and one cycle at 72 °C for 5 min. Bidirectional sequencing of the generated PCR amplicons was performed using the BigDye Terminator v.3.1 kit (Applied Biosystems, Thermo Fisher Scientific, USA). Products were analyzed by the ABI PRISM 3730 Genetic Analyzer (Applied Biosystems, Thermo Fisher Scientific, USA).
Determination of significantly mutated genes and mutation landscape
Significantly mutated genes were defined using MutSigCV48. Cancer Gene Census (CGC513) databases were utilized to characterize the genetic landscape49. The reads with variant allele frequencies (VAFs) lower than 5% were removed from the raw data.
Comparative analysis with luminal A invasive ductal carcinomas of no special type (IDC-NST)
The comparison between EPC and luminal A IDC-NSTs in molecular features were conducted. Molecular data for luminal A IDC-NSTs from The Cancer Genome Atlas (TCGA; n = 223) were obtained from the TCGA Data Portal (https://tcga-data.nci.nih.gov/docs/publications/brca_2012/)6. The mutation spectrum analysis in luminal A IDC-NST was performed as described for EPC.
Statistical analysis
Concurrent mutation and mutual exclusivity were tested using Chi-square test or Fisher’s exact test. Comparisons of continuous and categorical features were performed using Unpaired t test and Fisher exact tests as appropriate. All p values were two-tailed, and 95% confidence intervals were adopted for all analyses. Statistical analyses were performed with R v3.1.2 and SPSS v24.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The data generated and analysed during this study are described in the following data record: https://doi.org/10.6084/m9.figshare.1441081150. The raw sequencing data are openly available in the NCBI Sequence Read Archive, and can be accessed via the following BioProject accession: https://identifiers.org/bioproject:PRJNA68558251. These data have also been deposited in the Genome Sequence Archive (Genomics, Proteomics & Bioinformatics 2017) in the National Genomics Data Center (Nucleic Acids Res 2020), Beijing Institute of Genomics (China National Center for Bioinformation), Chinese Academy of Sciences. The accession code is HRA000480. The TCGA dataset used in the related article is from https://tcga-data.nci.nih.gov/docs/publications/brca_2012/. Additional data underlying the figures of the related article are openly available in the files included with the figshare data record.
References
Rakha, E. A. et al. Encapsulated papillary carcinoma of the breast: an invasive tumor with excellent prognosis. Am. J. Surg. Pathol. 35, 1093–1103 (2011).
Wynveen, C. A. et al. Intracystic papillary carcinoma of the breast: an in situ or invasive tumor? Results of immunohistochemical analysis and clinical follow-up. Am. J. Surg. Pathol. 35, 1–14 (2011).
Mulligan, A. M. & O’Malley, F. P. Metastatic potential of encapsulated (intracystic) papillary carcinoma of the breast: a report of 2 cases with axillary lymph node micrometastases. Int J. Surg. Pathol. 15, 143–147 (2007).
Hill, C. B. & Yeh, I. T. Myoepithelial cell staining patterns of papillary breast lesions: from intraductal papillomas to invasive papillary carcinomas. Am. J. Clin. Pathol. 123, 36–44 (2005).
Piscuoglio, S. et al. Integrative genomic and transcriptomic characterization of papillary carcinomas of the breast. Mol. Oncol. 8, 1588–1602 (2014).
Cancer Genome Atlas, N. Comprehensive molecular portraits of human breast tumours. Nature 490, 61–70 (2012).
Nik-Zainal, S. et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature 534, 47–54 (2016).
Ciriello, G. et al. Comprehensive Molecular Portraits of Invasive Lobular Breast Cancer. Cell 163, 506–519 (2015).
Chiang, S. et al. IDH2 mutations define a unique subtype of breast cancer with altered nuclear polarity. Cancer Res 76, 7118–7129 (2016).
Khoury, T., Hu, Q., Liu, S. & Wang, J. Intracystic papillary carcinoma of breast: interrelationship with in situ and invasive carcinoma and a proposal of pathogenesis: array comparative genomic hybridization study of 14 cases. Mod. Pathol. 27, 194–203 (2014).
Curigliano, G. et al. De-escalating and escalating treatments for early-stage breast cancer: the St. Gallen International Expert Consensus Conference on the Primary Therapy of Early Breast Cancer 2017. Ann. Oncol. 28, 1700–1712 (2017).
Alexandrov, L. B. et al. Signatures of mutational processes in human cancer. Nature 500, 415–421 (2013).
Tsang, A. P. et al. FOG, a multitype zinc finger protein, acts as a cofactor for transcription factor GATA-1 in erythroid and megakaryocytic differentiation. Cell 90, 109–119 (1997).
Svensson, E. C., Tufts, R. L., Polk, C. E. & Leiden, J. M. Molecular cloning of FOG-2: a modulator of transcription factor GATA-4 in cardiomyocytes. Proc. Natl Acad. Sci. USA 96, 956–961 (1999).
Chang, A. N. et al. GATA-factor dependence of the multitype zinc-finger protein FOG-1 for its essential role in megakaryopoiesis. Proc. Natl Acad. Sci. USA 99, 9237–9242 (2002).
Kurata, H. et al. Friend of GATA is expressed in naive Th cells and functions as a repressor of GATA-3-mediated Th2 cell development. J. Immunol. 168, 4538–4545 (2002).
Cantor, A. B. & Orkin, S. H. Coregulation of GATA factors by the Friend of GATA (FOG) family of multitype zinc finger proteins. Semin Cell Dev. Biol. 16, 117–128 (2005).
Liew, C. K. et al. Zinc fingers as protein recognition motifs: structural basis for the GATA-1/friend of GATA interaction. Proc. Natl Acad. Sci. USA 102, 583–588 (2005).
Cheung, L. W. et al. High frequency of PIK3R1 and PIK3R2 mutations in endometrial cancer elucidates a novel mechanism for regulation of PTEN protein stability. Cancer Disco. 1, 170–185 (2011).
Johnson, G. L. & Lapadat, R. Mitogen-activated protein kinase pathways mediated by ERK, JNK, and p38 protein kinases. Science 298, 1911–1912 (2002).
Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 17, 405–424 (2015).
Hoppe, M. M., Sundar, R., Tan, D. S. P. & Jeyasekharan, A. D. Biomarkers for homologous recombination deficiency in cancer. J. Natl. Cancer Inst. 110, 704–713 (2018).
Pasculli, B., Barbano, R. & Parrella, P. Epigenetics of breast cancer: Biology and clinical implication in the era of precision medicine. Semin Cancer Biol. 51, 22–35 (2018).
Holliday, H., Baker, L. A., Junankar, S. R., Clark, S. J. & Swarbrick, A. Epigenomics of mammary gland development. Breast Cancer Res. 20, 100 (2018).
Jack, B. H. & Crossley, M. GATA proteins work together with friend of GATA (FOG) and C-terminal binding protein (CTBP) co-regulators to control adipogenesis. J. Biol. Chem. 285, 32405–32414 (2010).
Kadri, Z. et al. Erythropoietin and IGF-1 signaling synchronize cell proliferation and maturation during erythropoiesis. Genes Dev. 29, 2603–2616 (2015).
Sorrentino, A. et al. PR/SET Domain Family and Cancer: Novel Insights from the Cancer Genome Atlas. Int. J. Mol. Sci. 19, https://doi.org/10.3390/ijms19103250 (2018).
Usary, J. et al. Mutation of GATA3 in human breast tumors. Oncogene 23, 7669–7678 (2004).
Kouros-Mehr, H., Slorach, E. M., Sternlicht, M. D. & Werb, Z. GATA-3 maintains the differentiation of the luminal cell fate in the mammary gland. Cell 127, 1041–1055 (2006).
Gaynor, K. U. et al. GATA3 mutations found in breast cancers may be associated with aberrant nuclear localization, reduced transactivation and cell invasiveness. Horm. Cancer 4, 123–139 (2013).
Troxell, M. L. et al. High prevalence of PIK3CA/AKT pathway mutations in papillary neoplasms of the breast. Mod. Pathol. 23, 27–37 (2010).
Jahn, S. W. et al. Mutation profiling of usual ductal hyperplasia of the breast reveals activating mutations predominantly at different levels of the PI3K/AKT/mTOR pathway. Am. J. Pathol. 186, 15–23 (2016).
Araki, K. & Miyoshi, Y. Mechanism of resistance to endocrine therapy in breast cancer: the important role of PI3K/Akt/mTOR in estrogen receptor-positive, HER2-negative breast cancer. Breast Cancer 25, 392–401 (2018).
Bonin, S. et al. PI3K/AKT signaling in breast cancer molecular subtyping and lymph node involvement. Dis. Markers 2019, 7832376 (2019).
Kanaizumi, H. et al. PI3K/Akt/mTOR signalling pathway activation in patients with ER-positive, metachronous, contralateral breast cancer treated with hormone therapy. Oncol. Lett. 17, 1962–1968 (2019).
Alers, S., Loffler, A. S., Wesselborg, S. & Stork, B. Role of AMPK-mTOR-Ulk1/2 in the regulation of autophagy: cross talk, shortcuts, and feedbacks. Mol. Cell Biol. 32, 2–11 (2012).
Shang, L. & Wang, X. AMPK and mTOR coordinate the regulation of Ulk1 and mammalian autophagy initiation. Autophagy 7, 924–926 (2011).
Suvorova, I. I. & Pospelov, V. A. AMPK/Ulk1-dependent autophagy as a key mTOR regulator in the context of cell pluripotency. Cell Death Dis. 10, 260 (2019).
Cao, C. et al. Narciclasine induces autophagy-dependent apoptosis in triple-negative breast cancer cells by regulating the AMPK-ULK1 axis. Cell Prolif. e12518, https://doi.org/10.1111/cpr.12518 (2018).
Dower, C. M. et al. Targeted inhibition of ULK1 promotes apoptosis and suppresses tumor growth and metastasis in neuroblastoma. Mol. Cancer Ther. 17, 2365–2376 (2018).
Solorzano, C. C. et al. Treatment and outcome of patients with intracystic papillary carcinoma of the breast. Am. J. Surg. 184, 364–368 (2002).
Schon, K. & Tischkowitz, M. Clinical implications of germline mutations in breast cancer: TP53. Breast Cancer Res Treat. 167, 417–423 (2018).
Wang, Z. et al. Comutations in DNA damage response pathways serve as potential biomarkers for immune checkpoint blockade. Cancer Res 78, 6486–6496 (2018).
Hoon Tan, P. et al. The 2019 WHO classification of tumours of the breast. Histopathology. https://doi.org/10.1111/his.14091 (2020).
Mishima, C. et al. Mutational Analysis of AKT1 and PIK3CA in intraductal papillomas of the breast with special reference to cellular components. Am. J. Pathol. 188, 1106–1112 (2018).
Kanehisa, M. & Goto, S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Sanchez-Vega, F. et al. Oncogenic signaling pathways in the cancer genome atlas. Cell 173, 321–337 e310 (2018).
Lawrence, M. S. et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature 499, 214–218 (2013).
Sondka, Z. et al. The COSMIC Cancer Gene Census: describing genetic dysfunction across all human cancers. Nat. Rev. Cancer 18, 696–705 (2018).
Liu, X. et al. Metadata record for the article: Next-generation sequencing revealed recurrent ZFPM1 mutations in encapsulated papillary carcinoma of the breast. https://doi.org/10.6084/m9.figshare.14410811 (2021).
Sequence Read Archive https://identifiers.org/bioproject:PRJNA685582 (2021).
Acknowledgements
This research is supported by the Chinese Academy of Medical Sciences (CAMS) Initiative for Innovative Medicine (CAMS-I2M) (Project No.2016-I2M-1-002), and the CAMS Innovation Fund for Medical Sciences (Project No.2019-I2M-2-002). We would like to thank Editage (www.editage.cn) for English language editing. The bioinformatics analysis and further analysis of the sequencing data were assisted by Yujie Zhao, Yu Zhang, and Wei Tao from Novogene Bioinformatics Technology Co., Ltd, Beijing.
Author information
Authors and Affiliations
Contributions
X.L. and X.H. contributed equally to this work. X.L. and H.W. designed the study. X.L. wrote the manuscript. T.J., H.W., X.H., and Z.L. managed and supervised the project. X.L., Y.B., and X.H. prepared the samples. H.W., and Z.L. confirmed the pathological diagnosis. X.L. performed the validation of mutations. Z.L., H.W., L.Y., and Z.Z. revised the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Liu, X., Huang, X., Bai, Y. et al. Next-generation sequencing revealed recurrent ZFPM1 mutations in encapsulated papillary carcinoma of the breast. npj Precis. Onc. 5, 42 (2021). https://doi.org/10.1038/s41698-021-00180-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41698-021-00180-5
This article is cited by
-
Phase I study of LZM005 in patients with HER2-positive metastatic breast cancer
npj Breast Cancer (2022)