Male breast cancer (MBC) is rare and poorly characterized. Like the female counterpart, most MBCs are hormonally driven, but relapse after hormonal treatment is also noted. The pan-hormonal action of steroid hormonal receptors, including estrogen receptor alpha (ERα), androgen receptor (AR), progesterone receptor (PR), and glucocorticoid receptor (GR) in this understudied tumor type remains wholly unexamined. This study reveals genomic cross-talk of steroid hormone receptor action and interplay in human tumors, here in the context of MBC, in relation to the female disease and patient outcome. Here we report the characterization of human breast tumors of both genders for cistromic make-up of hormonal regulation in human tumors, revealing genome-wide chromatin binding landscapes of ERα, AR, PR, GR, FOXA1, and GATA3 and enhancer-enriched histone mark H3K4me1. We integrate these data with transcriptomics to reveal gender-selective and genomic location-specific hormone receptor actions, which associate with survival in MBC patients.
Breast cancer in men is rare and largely understudied. Male breast cancer (MBC) accounts for around 1% of all 1.67 million breast cancers diagnosed each year, worldwide1. As compared to the female counterpart, MBC is a distinct disease regarding clinicopathological features (e.g., age of onset and frequency of hormone receptor positivity2) and molecular features (e.g., frequency of BRCA2 mutation3 and gene expression subtypes4).
The majority of male (and female) breast cancers are hormonally driven5, where ERα genomic action dictates transcriptional programs that drive tumor cell proliferation6. Analogous to the female counterpart, male breast cancers are treated with endocrine therapies (such as tamoxifen) to block ERα transcriptional activity, yet relapse after hormonal treatment has also been noted2,7. Even though the genomic action of ERα in MBC remains completely elusive, multiple reports have studied ERα genomics in the female disease. ERα-DNA binding profiles in tumors are dynamically affected by endocrine therapeutics8 and can differentiate female patients on outcome9,10. Cell line studies revealed ERα-DNA binding and ERα-driven transcriptional activation and cell proliferation to depend on its pioneer factors, including FOXA111 and GATA312.
Apart from ERα, other steroid hormone receptors are expressed in breast cancer as well, including androgen receptor (AR)13, progesterone receptor (PR)14, and glucocorticoid receptor (GR)14. AR expression is frequently observed in most male (and female) breast cancers13,14,15,16, although its role in breast cancer is poorly understood17. AR activation in breast cancer cells facilitates downstream gene expression that drives tumorigenesis in a similar manner to ERα16. This tumorigenic action of AR is most extensively studied in prostate cancer18,19, where differential AR-DNA binding profiles can classify prostate cancer patients on outcome20,21,22.
AR and PR are favorable prognostic markers in female breast cancer (FBC)23,24. In addition, PR has recently been shown to modulate ERα-DNA binding, directly reprogramming ERα-driven transcriptional programs25. GR expression has been associated with FOXA1 and GATA3 expression in ERα-positive FBC, and is associated with a favorable outcome in this patient population26. Its functional role in breast cancer in relation to other steroid hormone receptors is poorly characterized. Cumulatively, these data illuminate the likely interplay between different steroid hormone receptors in breast cancer. Although ERα cistromics has previously been studied in female breast tumors9,10, and its interplay with transcription factors has been reported in cell lines10,11,15,27,28,29,30,31, all these transcription factors have never been profiled together in a single study in human breast tumors.
We have characterized DNA binding of six different hormone-related transcription factors in an understudied field of human pathophysiology: male breast cancer. Through multidimensional genomic data integration on the level of transcription factor binding, copy number cistrome profiling (using off-target sequencing reads from ChIP-seq data)32, transcriptomics and the enhancer enriched histone mark H3K4me1, we present a first comprehensive overview of male breast cancer, which we compared with the female counterpart. This comprehensive overview reveals gender-selective and genomic location-specific hormone receptor action, which associate with survival in MBC.
Steroid hormone receptor profiling in male breast cancer
We aimed to generate a compendium of (epi)genomic, transcriptomic and clinical data for 49 ERα–positive MBC samples to better characterize the molecular makeup of this disease. To determine the chromatin binding landscape of ERα in relation to steroid hormone receptors AR, PR, and GR and its pioneer factors FOXA1 and GATA3, we performed ChIP-seq analyses in clinical specimens from patients who did not receive any therapy prior to surgery. These results were integrated with gene expression data and compared with female breast cancer and cell line ChIP-seq data (Fig. 1a, Supplementary Fig. 1). Samples were selected randomly for ChIP-seq of different factors. In this pioneering work, each transcription factor was profiled in MBC for the first time (30 ERα ChIP-seq datastreams and ≥7 samples/factor for other factors with the exception of GATA3 and PR, with 3 and 4 samples, respectively). To position these results into epigenetic context, H3K4me1 was included as active enhancer marker33. We generated RNA-seq data for the series, used to classify samples on subtypes related to outcome; M1 (poor) and M2 (good)4 (Supplementary Fig. 1). Finally, we used copy number data (detected using off-target sequencing reads)32 and RNA-seq data to perform integrative clustering (IntClust) classification, which was previously associated with FBC prognostication (Supplementary Fig. 1)34. As expected, IntClust classifications and intrinsic subtypes (based on immunohistochemistry) were enriched for ERα-positive classifications (29/30 and 28/28, respectively). Clinical data, such as number of positive lymph nodes at diagnosis and survival status were included for male (Supplementary Fig. 1 and Supplementary Table 1) and female patients (Supplementary Table 2). Missing clinical data are indicated in Supplementary Table 1. We identified bound regions (peaks) for each factor: ERα (biological replicate in Supplementary Fig. 2), FOXA1, AR, GR, PR, and GATA3 (ChIP-seq validated by ChIP-QPCR in Supplementary Fig. 3) using validated antibodies (Supplementary Fig. 4A, B), shown at two well-known ERα bound regions in FBC: loci RARA and GREB1 (Fig. 1b). In this principal examination of SHRs, FOXA1 and GATA3 in a tumor series, we observed these two ERα-regions were bound by all other factors. This is reminiscent of FBC transcription factors, such as GATA3, FOXA1, RARα, which have been found to be bound in the same regions35. These findings were confirmed on a genome-wide scale, where all ERα sites were considered bound in ≥50% of tumors in which we find co-binding by all other factors tested (Fig. 1c). In accordance with reports in cell lines11,25,29,36, all factors studied mainly bind intronic and distal intergenic regions (Fig. 1d). DNA motif analysis revealed self-preference for all factors (Fig. 1e, Supplementary Fig. 5) except for GATA3 as was reported previously12.
Interplay of ERα with other SHRs and pioneer factors
We have shown ERα binding sites have considerable overlap with other factors studied. Next, overlap of ERα with the other steroid hormone receptors (Fig. 2a, left) or pioneer factors FOXA1 and GATA3 (Fig. 2a, right) was studied. All sites shared for individual factors: ERα (15 out of 30 samples), AR (5 out of 10 samples), FOXA1 (3 out of 7 samples), PR (2 out of 4 samples), GR (3 out of 7 samples) and GATA3 (1 out of 3 samples). AR and GR have virtually no unique binding regions, while selective sites for ERα and PR are identified. When examining ERα, FOXA1 and GATA3, we identified selective sites in each. Strikingly, 71% of FOXA1 sites were FOXA1-selective while ER and GATA3-selective sites were 42% and 33%, respectively. Next, we counted the reads in the union of sites for all factors (Fig. 2a) and computed the correlation score for each sample (Pearson correlation coefficient), which was represented in a correlation matrix (Fig. 2b). Contrary to what is described for ERα/FOXA1 behavior in FBC cell lines11, we observed FOXA1 profiles do not correlate with other profiles, while all SHRs and GATA3 cluster together. Most notably, a strong similarity in genomic profiles of ERα with GR and AR was found. Practically all AR sites were co-occupied with ERα (Fig. 2c), which was also seen in female breast tumor and MCF7 cells (Supplementary Fig. 6A, B).
MBC is ERα-driven37,38. Therefore, a one-to-one comparison of binding sites of ERα with each factor was performed (Fig. 2c, d and Supplementary Fig. 7), where only samples were analyzed in which both ERα and the other factor were profiled (peak selection as stated above and in figure legends). A significant proportion of ERα binding sites overlap with AR and GR sites as seen in cell line data39,40 which we confirmed in MBC (Fig. 2c, d). Interestingly, although PR modulates ERα binding in FBC25, many PR sites in MBC were devoid of ERα (46%). These findings were confirmed in DNA binding correlation matrices (Fig. 2e). ERα and AR show clustering within patient rather than between factors (Fig. 2e), indicating a stronger correlation between factors within the same tumor as compared to the same factor between tumors.
A dominant dogma of ERα biology purports ERα binding is dependent on its pioneering factor FOXA1, with 95% of binding found at enhancer regions11,29,30,41,42. In line with literature11, we find ~50% overlap between ERα and FOXA1 in cell lines, where sites enriched for FOXA1 or ERα (Fig. 3a, b and Supplementary Fig. 8) were largely found in intronic and distal intergenic regions (Fig. 3c). These findings are in stark contrast to observations in both male and female breast cancers, in which ERα sites devoid of FOXA1 were strongly promoter-enriched (Fig. 3a–c), suggesting the model systems currently used do not adequately capture the genomic distributions of ERα found in clinical samples. In contrast to FOXA1-enriched sites, sites selectively occupied by ERα were weaker for active enhancer mark H3K4me133 (Supplementary Fig. 9A, B). Interestingly, GATA3 was found at both the FOXA1-enriched and ERα-enriched sites (Supplementary Fig. 9A). Motifs at ERα selective sites are related to ESR1 and devoid of forkhead motifs (Supplementary Fig. 10), which is in contrast to the total of ERα sites (Supplementary Fig. 5).
MBC subtypes differ in hormone receptor action
Having characterized MBC with respect to SHRs, GATA3 and FOXA1 DNA binding, we next performed gene expression analyses in these samples (n = 46). In order to assess underlying ERα binding patterns between published MBC intrinsic subtypes M1 and M24, we first confirmed M1/M2 subtype clustering using our RNA-seq data set using only subtype genes (Fig. 4a). Supporting our hypothesis that ERα function may deviate between M1 and M2 subtypes, we found 1395 differential ERα DNA binding sites (Fig. 4b). Analogous analyses were not performed for other factors than ERα, since ChIP-seq datastreams were not sufficiently powered to represent both the M1 and M2 subtypes (Supplementary Fig. 1). With available datastreams, we confirmed the occupancy of FOXA1, AR, GR, PR, and GATA3 at these differential ER α DNA binding sites (Fig. 4c). M1- and M2-specific sites were comparable in genomic location and motif usage (Fig. 4d, e and Supplementary Fig. 11). Genes with proximal binding sites (<20 kb or within the gene body) were subsequently examined for molecular and biological associations using pathway analysis (IPA, Qiagen). Both ERα ChIP-seq associated M1/M2 genes and gene expression-based M1/M2 genes strongly associated with ERα pathway indicators as expected, though some additional regulators are specifically found in the previously reported expression-based classification, such as ERBB2 and KRAS (Fig. 4f). Interestingly, among canonical pathways, AR signaling was the only hormonal signaling pathway more associated with the ERα binding based genes compared to the gene expression-based genes (Fig. 4g), in line with the strong overlap of AR/ERα binding in these tumors (Figs. 2, 4c).
Comparing genomics of ERα and FOXA1 between genders
As ERα is the key driver and therapeutic target in both genders, we compared ERα chromatin binding in female (17 from Ross-Innes et al.10, 9 from Jansen et al.9 and 10 generated in-house) and male (30 generated in-house) breast tumors (Fig. 5a), along with its pioneer factor FOXA1 (n = 7 for both genders) (Fig. 5b). Interestingly, no clear differences in ERα and FOXA1 binding was found between genders, on the level of peak overlap ratio (Fig. 5a–d) or relative read counts in peaks (Fig. 5e, f). For ERα and FOXA1 sites found in ≥50% of male tumors, signal was observed in female samples at comparable intensity (Fig. 5e, f, Supplementary Fig. 12A, B, Supplementary Fig. 13), and vice versa. Furthermore, motif enrichment at ERα and FOXA1 sites was highly comparable between genders (Fig. 5g, h). While clear clustering was observed for ERα between (male and female) tumors and cell lines (Fig. 5a), no separation on gender was observed for any of the factors studied in an integrative analysis (Supplementary Fig. 14), as well as separately for all factors studied (Fig. 5a, b, Supplementary Fig. 15). ERα sites that classify FBC on outcome10 were used to predict male outcome (k-nearest neighbor classifier; Methods section), and a weak but similar trend of ERα signal strength was observed in these sites (Fig. 5i). However, overall survival (OS) was not significantly different between the two groups of male patients (Supplementary Fig. 16). These results suggest that although the vast majority of ERα and FOXA1 sites are conserved between breast cancers from both genders, ERα sites indicative for outcome in FBC may not be applicable in the male disease.
Genomic profiles of ERα and FOXA1 stratify MBC patients on outcome
Since prognostic ERα sites in female tumors do not seem to be indicative for male patient outcome in our MBC series, we analyzed binding sites of ERα and pioneer factor FOXA1 in MBC for outcome prediction. Analogous analyses for ERα/GATA3 were not performed due to insufficient power due to small (n = 3) sample size. Based on lymph node status, indicative for overall survival (Supplementary Fig. 17), 365 ERα and 470 FOXA1 sites differed (Fig. 6a). Differential ERα and FOXA1 sites between patient subgroups were coupled to proximal genes (<20 kb or within the gene body). Unsupervised hierarchical clustering revealed clusters dominated by either of M1 or M2 subtypes, both in our cohort (Fig. 6b) and a validation set4 (Fig. 6c; n = 66 patients). We performed logistic regression with elastic net regularization43 to construct a supervised binary classification model by which predictive gene signatures could be identified, which was trained in our cohort and tested in the validation cohort. Both ERα- and FOXA1-based classifiers captured predictive features, which were outperformed by the union of both classifier gene lists (Fig. 6d, e). A bootstrapping analysis confirmed that comparable performance is rarely achieved with random gene sets (p = 0.013, one-tailed test with bootstrapped performance distribution; Methods section; Supplementary Fig. 18). Dividing patients into two groups of equal size based on the signature (high-risk and row-risk group of LN-status) significantly classified patients on distant metastasis free survival (DMFS; p = 0.048, log-rank test; Fig. 6f; Methods section), which was marginally significant in a multivariate Cox analysis including LN-status (p = 0.066, Cox proportional hazards test). The gene expression signature is a linear combination of gene expression levels, in which 14 contributing genes classify MBC on outcome (Fig. 6g). Cumulatively, we show that global ERα and FOXA1 chromatin binding selectivity reveals gender-specific prognostic features that successfully classify MBC patients on survival.
This work has characterized the DNA binding landscape of ERα in male breast cancer, along with its pioneer factors FOXA1, GATA3, and enhancer-enriched histone modification H3K4me1. In addition, we present the first set of DNA binding data in breast tumor specimens for other members of the steroid hormone receptor family: AR, GR, and PR. Our findings have indicated that the majority of ERα binding sites in both male and female breast tumors are FOXA1-independent and are found at active promoter regions, indicating a novel and unexpected mode of ERα function. These results are in stark contrast to cell line-based studies that illustrated the majority of ERα sites shared with FOXA1, mainly found at enhancers11. Our findings highlight the necessity to address transcription factor functioning in the physiological context of human tissue, rather than limiting analyses to cell line models.
Our data reveal that genomic functions of ERα and AR in male breast tumors are largely overlapping, which strongly co-localized with GR and PR at the same regions. Even though many sites for GATA3, FOXA1, and PR were not shared with ERα, both AR and GR show virtually no unique binding sites with respect to ERα binding. AR has been shown to compensate for ERα in ERα-/AR+ female breast cancers40,44, however the biological interaction between ERα and AR is relatively unknown in both FBC and MBC45. The observed genomic overlap of steroid hormone receptor binding profiles is likely due to the close sequence homology of DNA binding domains between all steroid hormone receptors, which warrants potential competition between them in DNA binding. Alternatively, genomic overlap of SHR binding profiles may be the consequence of ‘tethering’, in which factors associate to the DNA indirectly through complex formation with DNA-bound factors35, as was recently described for ERα and PR25. Such genomic convergence of steroid hormone receptor action in tumors may provide a novel starting point for pharmaceutical intervention strategies, yielding direct biological rationale for the use of small molecule therapeutics to target AR (e.g., clinicaltrials.gov NCT01990209), GR or PR in hormone receptor-positive breast cancer. As MBC is a rare cancer with limited numbers of available tumors for genomic studies, ChIP-seq analyses for some factors including PR and GATA3 were performed on relatively low number of tumors. To focus our analyses on the most-robust peaks and thus minimizing potential impact of patient heterogeneity, for all SHRs we only considered peaks that were found in around 50% of patient samples. This could be considered a rather conservative approach compared to other tissue-derived ChIP-seq papers, where the union of all peaks46 or peaks identified in at least 2 out of 21 tumors10 were used as consensus for analyses. Nonetheless, results for PR and GATA3 still warrant validation in larger cohorts.
Although, we found the vast majority of ERα sites to be shared between male and female breast cancers, ERα sites that are associated with patient outcome appeared gender-selective. In line with these results, genomic selectivity of combinatorial steroid hormone receptor action is associated with the gender-specific intrinsic MBC subtypes M1 and M2. While these data suggest ERα function may be driving these subtypes, causality can only be illustrated when cell line models, organoids or patient-derived xenografts are available for mechanistic studies. As the most clinically relevant observation, we have identified distinct genomic signatures of ERα action, which selectively and exclusively classifies MBC patients on outcome. With differential binding of ERα and FOXA1 as a guide, we developed a gene expression signature that is significantly associated with DMFS in MBC patients. The union of genes under differential control of ERα and FOXA1 jointly classify patients on outcome, and it remains to be determined which transcription factor is facilitated by FOXA1 at these sites. The MLL3 histone methyltransferase may represent one candidate to be tested in future studies based on the published FOXA1 and MLL3 interaction in FBC cells47. The 14 genes classifier we identified may be of added value as male breast cancer-specific prognostic classifier, but further validation of these results would be needed. Furthermore, small molecule inhibitors are available for a number of the 14 genes represented in the classifier, such as CAMKK2 (STO-609)48, CAPN949, BACE250, and TNFSF11 (aka RANKL)51, and future studies could further elucidate whether these inhibitors may be applicable in the treatment of male breast cancer.
With this, we present the first comprehensive genomic overview of shared and unique features of four steroid hormone receptors in human cancer, with outcome prediction. By studying MBC, gender-selective features of ERα action were identified with potentially direct clinical implications, revealing the first biology-driven biomarker for outcome prediction in this highly understudied cancer-type.
In this study, primary male and female breast tumors were used, none of whom received neoadjuvant endocrine therapy. Male breast cancer patients received surgery at the Netherlands Cancer Institute-Antoni van Leeuwenhoek (NKI-AVL; Amsterdam, the Netherlands), University Medical Center Utrecht (UMCU; Utrecht, the Netherlands), Vrije Universiteit Medical Center (VUMC; Amsterdam, the Netherlands), Radboud University Medical Center (RadboudUMC; Nijmegen, the Netherlands), University Medical Center Groningen (UMCG; Groningen, The Netherlands), Leiden University Medical Center (LUMC; Leiden, the Netherlands), and Erasmus Medical Center (ErasmusMC; Rotterdam, the Netherlands).
Female breast cancer patients received surgery at the Netherlands Cancer Institute-Antoni van Leeuwenhoek (NKI-AVL; Amsterdam, the Netherlands). Tumor content and immunohistochemical analyses were assessed by pathological examination. For clinicopathological parameters, see Supplementary Tables 1 (male tumors) and 2 (female tumors). Local medical ethical authorities at above-mentioned centers approved of the collection protocols. All samples were from anonymous left-over material, which would be discarded otherwise. Anonymized, coded leftover material which is not traced back to the patient and therefore does not interfere with care and/or prognosis, under strict requirements can be used without written informed consent according to Dutch legislation on Secondary Use52.
ChIP-seq and antibody validations
Tissue was processed as described previously53 with a few adaptations. In short, tissue was defrosted and crosslinked in solution A (50 mM Hepes, 100 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, pH = 7.4) containing 2 mM DSG, incubated for 25 min at room temperature while rotating. After 25 min formaldehyde was added to 1% final and incubated another 20 min at room temperature with rotation. Samples were quenched by adding a surplus of 0.2 M glycine, pelleted by centrifugation (5′@4000 r.c.f. at 4 °C), washed with cold PBS and mechanically disrupted in cold PBS using a pellet pestle (Sigma). The PicoBioruptor (Diagenode) was used for sonication. For ChIP, antibodies were used to detect ERα (sc-543, Santa Cruz), AR (sc-816, Santa Cruz), FOXA1 (sc-6554, Santa Cruz), PR (sc-7208, Santa Cruz), GR (12041 S lot 3, Cell Signaling Technology), GATA3 (sc-268, Santa Cruz), and H3K4me1 (ab8895, AbCam). Immunoprecipitated DNA was prepared for Illumina multiplex-sequencing with 10 samples per lane at 65 bp single end. For steroid hormone receptor ChIPs except GR, 5 µg of antibody and 50 µl dynabeads (Invitrogen) were used, for GR 7.5 µg of antibody and 75 µl dynabeads were used, and for FOXA1 and H3K4me1, 4 µg of antibody and 40 µl magnetic beads were used. ChIP-QPCR was performed to validate ChIP-seq data for ERα, GR, AR, and FOXA1. For QPCR, relative enrichment of the RARA enhancer (chr:1738478661-38478809) (primers: 5′-GCTGGGTCCTCTGGCTGTTC-3′ (FWD) and 5′-CCGGGATAAAGCCACTCCAA-3′ (REV)) over a negative control region (primers: 5′-TGCCACACACCAGTGACTTT-3′ (FWD) and 5′-ACAGCCAGAAGCTCCAAAAA-3′ (REV)) was normalized over input, and plotted against the log(count/million) measurement from the same region from the MQ20 filtered aligned ChIP-seq file. SHR antibody specificity was validated by immunohistochemistry on U2OS cells transiently transfected with any of the SHRs of interest (Supplementary Fig. 3A). Antibody used for FOXA1 ChIP-seq also detects FOXA254 but only FOXA1 is expressed in these tumors (Supplementary Fig. 3B). Specificity for the antibody used for H3K4me1 ChIP-seq (ab8895)55,56 and GATA3 (sc-268) has been validated by others57,58. Publically available ChIP-seq data sets used are listed in Supplementary Table 3.
Immunohistochemistry staining for ER, PR, and AR was performed as previously described8. Immunohistochemistry of other factors was performed on a BenchMark Ultra autostainer (GATA3 and FOXA1) or Discovery Ultra autostainer (GR). Briefly, paraffin sections were cut at 3 µm, heated and deparaffinized in the instrument with EZ prep solution (Ventana Medical Systems). FOXA1 was detected using clone 2F83 (1/100,000 dilution, 16 min at RT, Seven Hills) followed by the UltraView Universal DAB Detection Kit (Ventana Medical Systems). GR was detected using clone D6H2L (1/600 dilution, 1 h at 37 °C, Cell Signaling) and visualized using Anti-Rabbit HQ (Ventana Medical systems) for 12 min at 37 °C followed by Anti-HQ HRP (Ventana Medical systems) for 12 min at 37 °C, followed by the ChromoMap DAB detection kit (Ventana Medical Systems). GATA3 was visualized using the OptiView DAB Detection Kit (Ventana Medical Systems). Slides were counterstained with Hematoxylin II and Bluing Reagent (Ventana Medical Systems). In the non-clinical setting (GR, FOXA1, GATA3), all scoring was performed on whole slides by a single pathologist (PJvD), blinded to patient status. In the clinical setting (ER, PR, and AR, we obtained positive/negative status from the clinical records. In accordance with clinical guidelines in the United States59, all samples were considered positive when at least 1% of nuclei were stained.
Sequenced samples were aligned to the reference human genome (Ensembl 37) using Burrows-Wheeler Aligner (BWA, v0.7.5a) with a mapping quality >20. Peak calling was performed using MACS (v1.4)60 and DFilter (v1.5)61, where only peaks were considered that were shared by the two peak callers. Genome browser snapshots were generated using IGV (v2.3.67), heatmaps were compiled using SeqMiner (v1.3.3)62 and genomic region enrichment analysis was performed with CEAS (v1.0.2)63. To generate consensus peaksets for each factor we used DiffBind (v1.14)10 with a cutoff defined where a peak must be seen in at least 50% of the samples for that factor. In the event that there were only 2 samples sequenced for a factor, only peaks were considered found in both samples. A list of the union of these sites was generated for Fig. 2a, b). For two-way Venn diagrams, we used these cutoffs in each factor in the set of 4 (ERα vs AR), 3 (ERα vs FOXA1), and 2 (ERα vs GR/PR) (Fig. 2c, d). For differential binding analyses (Fig. 2e), DiffBind was used10 with the following parameters, FDR of 0.1 for comparison between two different factors and p-value of 0.01 for comparison between different LN-status. Given a set of binding site, genomic distribution and significantly enriched motifs were obtained using CEAS63 and SeqPos64 in Galaxy Cistrome (v1.0.0)64. All identified motifs were included in the wordcloud figures without removing close homologues, to prevent selection bias (Supplementary Figures 5, 10 and 11).
RNA isolation and RNA-seq
Sections (30 µm thick) were cut from the frozen tumor tissues for RNA isolation. Total RNA was extracted using the mirVana miRNA isolation kit (Ambion, USA) according to the manufacturer’s protocol until the end of F1. Quality and quantity of the total RNA was assessed by the 2100 Bioanalyzer (Agilent, USA). Total RNA samples having RIN >8 were subjected to library generation.
Strand-specific libraries were generated using the TruSeq Stranded mRNA sample preparation kit (Illumina, USA; RS-122-2101/2) according to the manufacturer’s instructions (Illumina, Part # 15031047 Rev. E). 3′ adenylated and adapter ligated cDNA fragments were subject to 12 cycles of PCR. The libraries were analyzed on a 2100 Bioanalyzer using a 7500 chip (Agilent, Santa Clara, CA), diluted and pooled equimolarly into a multiplexed, 10 nM sequencing pools and stored at −20 °C. Strand-specific cDNA libraries were sequenced with 100 base paired-end reads on a HiSeq2500 using V4 chemistry (Illumina).
For all analyses we used the reference file Ensembl GRCh37.75. Adapter filtered reads were subject to STAR alignment (v2.4.2)65 carried out using default parameters. For expression analysis, HTSeq (v0.6.1p1)66 was used to count reads for all genes in our RNA-seq samples using the htseq-count command. The DESeq2 (v1.16.0) R package was then used to generate a gene expression matrix from these data67. Normalization of the data was carried out using the ‘rlog’ method within the package. Only samples with at least 5 reads across each gene were retained for further analyses (N = 46).
Prediction of male patient outcome on profiles of ERα and FOXA1
We constructed a k-nearest neighbor classifier based on ERα binding profile of female patients using scikit-learn module (v0.19)68 in Python. Taking read counts of ERα in the ERα binding sites that classify female outcome, male patient outcome is predicted by the outcome of five closest female data in terms of Minkowski distance.
Logistic regression with elastic net regularization
We used R package glmnet (v2.0)43 for constructing a logistic regression model using RNA-seq data in our cohort. Leave-one-out cross validation was performed for finding robust coefficients. Taking the independent validation cohort, linear combination of gene expression levels using the coefficients (gene expression signature) were obtained to rank the patients. Measuring sensitivity and specificity of LN-status prediction with varying threshold gives a receiver-operating characteristics (ROC) curve from which area under the curve (AUC) can be measured. A threshold dividing high- and low-risk group was chosen as the median. We used potential target genes of ERα and FOXA1, and union of potential targets to construct three classification models. Note that due to the high dimensionality, no contributing gene is found when the model is trained with whole genome data.
Bootstrapping analysis of classification model
We used the R package boot (v1.3)69 for bootstrapping analysis. For each classification model, 1000 random models were constructed in the same manner but with random gene sets with the same size to obtain bootstrapped AUC distribution. Taking the bootstrapped AUC distribution as a null hypothesis distribution, significance of the performance was assessed by one-tailed test.
We used R package Survival (v2.38)70 for survival analysis. Given two patient groups based on data availability, the log-rank test was used for overall survival (OS) comparison in our cohort and distant metastasis free survival (DMFS) comparison in a validation cohort. Cox regression was used to assess association of LN-status and gene expression signature to survival. The available additional prognostic factors used in multivariate Cox regression were LN-status, endocrine therapy, chemotherapy, radiotherapy and age at diagnosis.
All ChIP-seq data generated in the study are available on GEO repository: GSE104399 and RNA-seq data on GEO repository: GSE104730. All public data streams used in the study are listed in Supplementary Table 3.
Ferlay, J. et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int. J. Cancer 136, E359–E386 (2015).
Ruddy, K. J. & Winer, E. P. Male breast cancer: risk factors, biology, diagnosis, treatment, and survivorship. Ann. Oncol. 24, 1434–1443 (2013).
Rizzolo, P. et al. Male breast cancer: genetics, epigenetics, and ethical aspects. Ann. Oncol. 24(Suppl 8), viii75–viii82 (2013).
Johansson, I. et al. Gene expression profiling of primary male breast cancers reveals two unique subgroups and identifies N-acetyltransferase-1 (NAT1) as a novel prognostic biomarker. Breast Cancer Res. 14, R31 (2012).
Ignatiadis, M. & Sotiriou, C. Luminal breast cancer: from biology to treatment. Nat. Rev. Clin. Oncol. 10, 494–506 (2013).
Droog, M., Beelen, K., Linn, S. & Zwart, W. Tamoxifen resistance: from bench to bedside. Eur. J. Pharmacol. 717, 47–57 (2013).
Abreu, M. H. et al. Male breast cancer: looking for better prognostic subgroups. Breast 26, 18–24 (2016).
Severson, T. M. et al. Neoadjuvant tamoxifen synchronizes ERα binding and gene expression profiles related to outcome and proliferation. Oncotarget 7, 33901–33918 (2016).
Jansen, M. P. H. M. et al. Hallmarks of aromatase inhibitor drug resistance revealed by epigenetic profiling in breast cancer. Cancer Res. 73, 6632–6641 (2013).
Ross-Innes, C. S. et al. Differential oestrogen receptor binding is associated with clinical outcome in breast cancer. Nature 481, 389–393 (2012).
Hurtado, A., Holmes, K. A., Ross-Innes, C. S., Schmidt, D. & Carroll, J. S. FOXA1 is a key determinant of estrogen receptor function and endocrine response. Nat. Genet. 43, 27–33 (2011).
Theodorou, V., Stark, R., Menon, S. & Carroll, J. S. GATA3 acts upstream of FOXA1 in mediating ESR1 binding by shaping enhancer accessibility. Genome Res. 23, 12–22 (2012).
Isola, J. J. Immunohistochemical demonstration of androgen receptor in breast cancer and its relationship to other prognostic factors. J. Pathol. 170, 31–35 (1993).
Allegra, J. C. et al. Distribution, frequency, and quantitative analysis of estrogen, progesterone, androgen, and glucocorticoid receptors in human breast cancer. Cancer Res. 39, 1447–1454 (1979).
Chia, K., O’Brien, M., Brown, M. & Lim, E. Targeting the androgen receptor in breast cancer. Curr. Oncol. Rep. 17, 4 (2015).
Wang, X., Yarid, N., McMahon, L., Yang, Q. & Hicks, D. G. Expression of androgen receptor and its association with estrogen receptor and androgen receptor downstream proteins in normal/benign breast luminal epithelium. Appl. Immunohistochem. Mol. Morphol. 22, 498–504 (2014).
Castellano, I. et al. Androgen receptor expression is a significant prognostic factor in estrogen receptor positive breast cancers. Breast Cancer Res. Treat. 124, 607–617 (2010).
Wang, Q. et al. Androgen receptor regulates a distinct transcription program in androgen-independent prostate cancer. Cell 138, 245–256 (2009).
Xu, Y., Chen, S.-Y., Ross, K. N. & Balk, S. P. Androgens induce prostate cancer cell proliferation through mammalian target of rapamycin activation and post-transcriptional increases in cyclin D proteins. Cancer Res. 66, 7783–7792 (2006).
Nevedomskaya, E. et al. Androgen receptor DNA binding and chromatin accessibility profiling in prostate cancer. Genom. Data 7, 124–126 (2016).
Stelloo, S. et al. Androgen receptor profiling predicts prostate cancer outcome. EMBO Mol. Med. 7, 1450–1464 (2015).
Sharma, N. L. et al. The androgen receptor induces a distinct transcriptional program in castration-resistant prostate cancer in man. Cancer Cell 23, 35–47 (2013).
Mohsin, S. K. et al. Progesterone receptor by immunohistochemistry and clinical outcome in breast cancer: a validation study. Mod. Pathol. 17, 1545–1554 (2004).
Shaaban, A. M. et al. A comparative biomarker study of 514 matched cases of male and female breast cancer reveals gender-specific biological differences. Breast Cancer Res. Treat. 133, 949–958 (2012).
Mohammed, H. et al. Progesterone receptor modulates ERα action in breast cancer. Nature 523, 313–317 (2015).
Abduljabbar, R. et al. Clinical and biological significance of glucocorticoid receptor (GR) expression in breast cancer. Breast Cancer Res. Treat. 150, 335–346 (2015).
Eeckhoute, J. et al. Positive cross-regulatory loop ties GATA-3 to estrogen receptor alpha expression in breast cancer. Cancer Res. 67, 6477–6483 (2007).
Carroll, J. S. et al. Chromosome-wide mapping of estrogen receptor binding reveals long-range regulation requiring the forkhead protein FoxA1. Cell 122, 33–43 (2005).
Carroll, J. S. et al. Genome-wide analysis of estrogen receptor binding sites. Nat. Genet. 38, 1289–1297 (2006).
Lupien, M. et al. FoxA1 translates epigenetic signatures into enhancer-driven lineage-specific transcription. Cell 132, 958–970 (2008).
Zwart, W. et al. Oestrogen receptor-co-factor-chromatin specificity in the transcriptional regulation of breast cancer. EMBO J. 30, 4764–4776 (2011).
Kuilman, T. et al. CopywriteR: DNA copy number detection from off-target sequence data. Genome Biol. 16, 49 (2015).
Heintzman, N. D. et al. Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nat. Genet. 39, 311–318 (2007).
Curtis, C. et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature 486, 346–352 (2012).
Liu, Z. et al. Enhancer activation requires trans-recruitment of a mega transcription factor complex. Cell 159, 358–373 (2014).
Polman, J. A. E. et al. A genome-wide signature of glucocorticoid receptor binding in neuronal PC12 cells. BMC Neurosci. 13, 118 (2012).
Aisner, J., Ross, D. D. & Wiernik, P. H. Tamoxifen in advanced male breast cancer. Arch. Intern. Med. 139, 480–481 (1979).
Doyen, J. et al. Aromatase inhibition in male breast cancer patients: biological and clinical implications. Ann. Oncol. 21, 1243–1245 (2010).
Karmakar, S., Jin, Y. & Nagaich, A. K. Interaction of glucocorticoid receptor (GR) with estrogen receptor (ER) α and activator protein 1 (AP1) in dexamethasone-mediated interference of ERα activity. J. Biol. Chem. 288, 24020–24034 (2013).
Robinson, J. L. L. et al. Androgen receptor driven transcription in molecular apocrine breast cancer is mediated by FoxA1. EMBO J. 30, 3019–3027 (2011).
Cirillo, L. A. et al. Opening of compacted chromatin by early developmental transcription factors HNF3 (FoxA) and GATA-4. Mol. Cell 9, 279–289 (2002).
Laganière, J. et al. From the cover: location analysis of estrogen receptor alpha target promoters reveals that FOXA1 defines a domain of the estrogen response. Proc. Natl Acad. Sci. USA 102, 11651–11656 (2005).
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Doane, A. S. et al. An estrogen receptor-negative breast cancer subset characterized by a hormonally regulated transcriptional program and response to androgen. Oncogene 25, 3994–4008 (2006).
Severson, T. M. & Zwart, W. A review of estrogen receptor/androgen receptor genomics in male breast cancer. Endocr. Relat. Cancer 24, R27–R34 (2017).
Kron, K. J. et al. TMPRSS2-ERG fusion co-opts master transcription factors and activates NOTCH signaling in primary prostate cancer. Nat. Genet. 49, 1336–1345 (2017).
Jozwik, K. M., Chernukhin, I., Serandour, A. A., Nagarajan, S. & Carroll, J. S. FOXA1 directs H3K4 monomethylation at enhancers via recruitment of the methyltransferase MLL3. Cell Rep. 17, 2715–2723 (2016).
Hawley, S. A. et al. Calmodulin-dependent protein kinase kinase-beta is an alternative upstream kinase for AMP-activated protein kinase. Cell Metab. 2, 9–19 (2005).
Chen, C.-J., Nguyen, T. & Shively, J. E. Role of calpain-9 and PKC-delta in the apoptotic mechanism of lumen formation in CEACAM1 transfected breast epithelial cells. Exp. Cell Res. 316, 638–648 (2010).
Canalis, E., McCarthy, T. L. & Centrella, M. The role of growth factors in skeletal remodeling. Endocrinol. Metab. Clin. 18, 903–918 (1989).
Pageau, S. C. Denosumab. MAbs 1, 210–215 (2009).
Federation of Dutch Medical Scientific Scocieties (FDMSS). Dutch Medical Treatment Contracts Act and the Code of Conduct for Proper Secondary Use of Human Tissue of the Dutch Federation of Biomedical Scientific Societies. (FDMSS, 2011).
Zwart, W. et al. A carrier-assisted ChIP-seq method for estrogen receptor-chromatin interactions from breast cancer core needle biopsy samples. BMC Genom. 14, 232 (2013).
Droog, M. et al. Comparative cistromics reveals genomic cross-talk between FOXA1 and ERα in Tamoxifen-Associated Endometrial Carcinomas. Cancer Res. 76, 3773–3784 (2016).
Rothbart, S. B. et al. An interactive database for the assessment of histone antibody specificity. Mol. Cell 59, 502–511 (2015).
Egelhofer, T. A. et al. An assessment of histone-modification antibody quality. Nat. Struct. Mol. Biol. 18, 91–93 (2011).
Landt, S. G. et al. ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res. 22, 1813–1831 (2012).
Witt, H. Encode DCC Antibody Validation Document. PDF validation document. Available at: https://www.encodeproject.org/antibody-characterizations/2d3cfa05-cd0a-4a43-9f3e-2ec62260b17d/@@download/attachment/human_GATA3_SC-268_validation_Farnham.pdf (2011).
Hammond, M. E. H. et al. American Society of Clinical Oncology/College Of American Pathologists guideline recommendations for immunohistochemical testing of estrogen and progesterone receptors in breast cancer. J. Clin. Oncol. 28, 2784–2795 (2010).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Kumar, V. et al. Uniform, optimal signal processing of mapped deep-sequencing data. Nat. Biotechnol. 31, 615–622 (2013).
Ye, T. et al. seqMINER: an integrated ChIP-seq data interpretation platform. Nucleic Acids Res. 39, e35 (2011).
Shin, H., Liu, T., Manrai, A. K. & Liu, X. S. CEAS: cis-regulatory element annotation system. Bioinformatics 25, 2605–2606 (2009).
Liu, T. et al. Cistrome: an integrative platform for transcriptional regulation studies. Genome Biol. 12, R83 (2011).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Anders, S., Pyl, P. T. & Huber, W. HTSeq--a Python framework to work with high-throughput sequencing data. Bioinforma. Oxf. Engl. 31, 166–169 (2015).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. 12, 2825–2830 (2011).
Canty, A. & Ripley, B. boot: Bootstrap R (S-Plus) Functions. R package version 1.3-20 (2016).
Therneau, T. A Package for Survival Analysis in S. version 2.38 (2015).
We thank all patients who contributed to this study. The authors would like to acknowledge all the members of hospitals that provided tumor material. Wilbert Zwart is supported by the Bas Mulder Award from the Dutch Cancer Society (KWF)/ALpe d’HuZes and a VIDI grant from the Netherlands Organization for Scientific Research (NWO). JWMM received funding through the Cancer Genomics Netherlands from The Netherlands Organization for Scientific research (NWO). The authors would also like to acknowledge the effort and support of The Netherlands Cancer Institute (NKI) Genomics Core Facility. In addition, we would like to acknowledge the NKI Core Facility Molecular Pathology and Biobanking (CFMPB) for supplying Biobank material and lab support, especially Ingrid Hofland for GR IHC optimization.
The authors declare no competing financial interests.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Severson, T.M., Kim, Y., Joosten, S.E.P. et al. Characterizing steroid hormone receptor chromatin binding landscapes in male and female breast cancer. Nat Commun 9, 482 (2018). https://doi.org/10.1038/s41467-018-02856-2
Molecular Aspects of Medicine (2020)
Trends in Endocrinology & Metabolism (2020)
Stress, Sex, and Sugar: Glucocorticoids and Sex-Steroid Crosstalk in the Sex-Specific Misprogramming of Metabolism
Journal of the Endocrine Society (2020)
NR2F2 Orphan Nuclear Receptor is Involved in Estrogen Receptor Alpha-Mediated Transcriptional Regulation in Luminal A Breast Cancer Cells
International Journal of Molecular Sciences (2020)