Reprogramming of the FOXA1 cistrome in treatment-emergent neuroendocrine prostate cancer

Baca, Sylvan C.; Takeda, David Y.; Seo, Ji-Heui; Hwang, Justin; Ku, Sheng Yu; Arafeh, Rand; Arnoff, Taylor; Agarwal, Supreet; Bell, Connor; O’Connor, Edward; Qiu, Xintao; Alaiwi, Sarah Abou; Corona, Rosario I.; Fonseca, Marcos A. S.; Giambartolomei, Claudia; Cejas, Paloma; Lim, Klothilda; He, Monica; Sheahan, Anjali; Nassar, Amin; Berchuck, Jacob E.; Brown, Lisha; Nguyen, Holly M.; Coleman, Ilsa M.; Kaipainen, Arja; De Sarkar, Navonil; Nelson, Peter S.; Morrissey, Colm; Korthauer, Keegan; Pomerantz, Mark M.; Ellis, Leigh; Pasaniuc, Bogdan; Lawrenson, Kate; Kelly, Kathleen; Zoubeidi, Amina; Hahn, William C.; Beltran, Himisha; Long, Henry W.; Brown, Myles; Corey, Eva; Freedman, Matthew L.

doi:10.1038/s41467-021-22139-7

Download PDF

Article
Open access
Published: 30 March 2021

Reprogramming of the FOXA1 cistrome in treatment-emergent neuroendocrine prostate cancer

Nature Communications volume 12, Article number: 1979 (2021) Cite this article

15k Accesses
54 Citations
28 Altmetric
Metrics details

Subjects

Abstract

Lineage plasticity, the ability of a cell to alter its identity, is an increasingly common mechanism of adaptive resistance to targeted therapy in cancer. An archetypal example is the development of neuroendocrine prostate cancer (NEPC) after treatment of prostate adenocarcinoma (PRAD) with inhibitors of androgen signaling. NEPC is an aggressive variant of prostate cancer that aberrantly expresses genes characteristic of neuroendocrine (NE) tissues and no longer depends on androgens. Here, we investigate the epigenomic basis of this resistance mechanism by profiling histone modifications in NEPC and PRAD patient-derived xenografts (PDXs) using chromatin immunoprecipitation and sequencing (ChIP-seq). We identify a vast network of cis-regulatory elements (N~15,000) that are recurrently activated in NEPC. The FOXA1 transcription factor (TF), which pioneers androgen receptor (AR) chromatin binding in the prostate epithelium, is reprogrammed to NE-specific regulatory elements in NEPC. Despite loss of dependence upon AR, NEPC maintains FOXA1 expression and requires FOXA1 for proliferation and expression of NE lineage-defining genes. Ectopic expression of the NE lineage TFs ASCL1 and NKX2-1 in PRAD cells reprograms FOXA1 to bind to NE regulatory elements and induces enhancer activity as evidenced by histone modifications at these sites. Our data establish the importance of FOXA1 in NEPC and provide a principled approach to identifying cancer dependencies through epigenomic profiling.

Spatial transcriptomics reveals discrete tumour microenvironments and autocrine loops within ovarian cancer subclones

Article Open access 03 April 2024

Starfysh integrates spatial transcriptomic and histologic data to reveal heterogeneous tumor–immune hubs

Article Open access 21 March 2024

Spatial transcriptomics reveals distinct and conserved tumor core and edge architectures that predict survival and targeted therapy response

Article Open access 18 August 2023

Introduction

In recent years, potent AR pathway inhibitors have extended the survival of patients with metastatic prostate cancer^1,2. Prostate tumors inevitably escape AR inhibition through reactivation of AR signaling or, increasingly, via lineage plasticity^3,4. The mechanisms underlying lineage plasticity remain unclear but likely involve transdifferentiation of PRAD to NEPC rather than de novo emergence of NEPC. NEPC and PRAD tumors from an individual patient share many somatic DNA alterations, implying a common ancestral tumor clone⁵. While the genomic profiles of NEPC and PRAD are relatively similar, their gene expression profiles and clinical behavior differ markedly⁶. We therefore set out to characterize epigenomic differences between NEPC and PRAD, hypothesizing that reprogramming of distinct regulatory elements drives their divergent phenotypes.

In this study, we profile histone modifications in NEPC and PRAD patient-derived xenografts (PDXs), identifying ~15,000 regulatory elements that are dormant in PRAD but consistently activated in NEPC. A significant portion of NEPC-enriched regulatory elements are bound in NEPC by FOXA1, a transcription factor associated with prostate development and AR-mediated transcription^7,8. The FOXA1 cistrome, or set of binding sites, is extensively reprogrammed in NEPC, with loss of FOXA1 binding at PRAD-enriched regulatory elements and gain of FOXA1 at NEPC-enriched regulatory elements. Unexpectedly, FOXA1 remains active in NEPC despite the loss of luminal identity and AR expression. FOXA1 is necessary for proliferation and neuroendocrine gene expression in experimental models of NEPC. Ectopic expression of NEPC-associated transcription factors is sufficient to reprogram the FOXA1 cistrome in PRAD to resemble its counterpart in NEPC and activate NEPC transcriptional programs. Our data indicate a dependency of NEPC upon FOXA1, which may have therapeutic implications.

Results

Comparative epigenomic profiles of PRAD and NEPC

We performed ChIP-seq for the histone post-translational modification H3K27ac to identify active regulatory elements in the LuCaP PDX series⁹, a set of xenografts derived from advanced PRAD (N = 22) and treatment-emergent NEPC (N = 5). We identified a median of 55,095 H3K27ac peaks per sample (range 37,599–74,640) (Supplementary Data 1). Notably, the transcriptomes of the LuCaP PDXs reflect differences in gene expression observed between clinical PRAD and NEPC metastases (Supplementary Fig. 1a), indicating their relevance to clinical prostate cancer.

Unsupervised hierarchical clustering and principal component analysis based on genome-wide H3K27 acetylation cleanly partitioned NEPC and PRAD LuCaP PDXs (Fig. 1a and Supplementary Fig. 1b, c). We identified 14,985 sites with eight-fold or greater increases in H3K27 acetylation in NEPC compared to PRAD at an adjusted p-value of 10⁻³. We termed these sites neuroendocrine-enriched candidate regulatory elements (“Ne-CREs”; Fig. 1b, Supplementary Data 2, and Supplementary Fig. 1d). A smaller set of sites (4338) bore greater H3K27ac signal in PRAD (termed “Ad-CREs”). Liver metastases from clinical NEPC and PRAD demonstrated enrichment of H3K27ac at Ne-CREs and Ad-CREs, respectively, confirming that the LuCaP PDX models reflect lineage-specific epigenomic features of clinical prostate tumors (Supplementary Fig. 1e).

**Fig. 1: Epigenomic divergence of PRAD and NEPC.**

Ad-CREs were found near prostate-lineage genes such as KLK3, HOXB13, and NKX3-1, while Ne-CREs resided near genes enriched for neuronal and developmental annotations, including CHGA, ASCL1, and SOX2¹⁰ (Fig. 1c and Supplementary Table 1). Genes with higher expression in NEPC compared to PRAD were enriched for nearby Ne-CREs (Supplementary Fig. 1f) and formed three-dimensional contacts with a greater number of Ne-CREs as assessed by H3K27ac HiChIP (Fig. 1d, Supplementary Fig. 1g, h, and Supplementary Data 3 and 4). For example, ASCL1, which encodes a neural lineage TF that is highly upregulated in NEPC (Supplementary Fig. 1a), interacts with 15 gene-distal Ne-CREs between 280 and 465 kb telomeric to ASCL1, including two NEPC-restricted super-enhancers within intronic regions of C12ORF42 (Fig. 1e). These results suggest that Ne-CREs regulate neuroendocrine transcriptional programs through interaction with NEPC gene promoters.

We nominated candidate TFs that may orchestrate NEPC lineage gene expression by binding to Ne-CREs. Lineage-defining TF genes often reside within densely H3K27-acetylated super-enhancers¹¹ and form core regulatory circuits, or “cliques”, by mutual binding of one another’s cis-regulatory regions^12,13. Several TFs showed clique enrichment specifically in NEPC (Fig. 1f) and/or were encompassed by NEPC-restricted super-enhancers (Supplementary Fig. 2), including known NE lineage TFs (e.g., ASCL1 and INSM1) and candidates such as HOXB2-5.

FOXA1 is an essential transcription factor in NEPC

Notably, a single TF gene, FOXA1, demonstrated clique enrichment in all NEPC and PRAD LuCaP PDXs (Fig. 1f). FOXA1 is a pioneer TF of endodermal tissues⁷ with a critical role in prostate development⁸ but no characterized function in NEPC. The forkhead motif recognized by FOXA1 was the second most significantly enriched nucleotide sequence within Ne-CREs (Fig. 1g). FOXA2, a previously-reported NEPC TF¹⁴, does not wholly account for the forkhead motif enrichment because FOXA2 was not expressed in several NEPC samples (Fig. 2a, b and Supplementary Fig. 3a). In contrast, FOXA1 was expressed in all NEPCs (Fig. 2a, b and Supplementary Table 2), as well as in resident neuroendocrine cells of benign prostate tissue (Supplementary Fig. 3).

**Fig. 2: FOXA1 remains a critical lineage transcription factor in NEPC.**

Multiple lines of investigation supported a pivotal role of FOXA1 in NEPC. A super-enhancer encompassed FOXA1 in all NEPC LuCaP PDXs (Fig. 2c and Supplementary Fig. 2). In NEPC, the FOXA1 promoter shed contacts with its regulatory region identified in PRAD¹⁵ and looped to a distinct NEPC-restricted super-enhancer (Fig. 2d). Both the distal super-enhancer and promoter were co-bound by FOXA1 and ASCL1, suggesting an auto-regulatory circuit that is characteristic of master transcriptional regulators¹⁶. Suppression of FOXA1 in a variety of NEPC cellular models^17,18 demonstrated that FOXA1 is essential for cellular proliferation and expression of NE markers, including NE lineage TFs such as FOXA2 and INSM1 (Fig. 2e–h). Analysis of a published shRNA screen confirmed a dependency on FOXA1 in the NEPC cell line NCI-H660 (Fig. 2i). Thus, FOXA1 exhibits several features of a master transcriptional regulator in NEPC.

We profiled FOXA1-binding sites in NEPC and PRAD using ChIP-seq. FOXA1 relocates to a distinct set of binding sites in NEPC PDXs (Fig. 3a), which overlap with the majority of Ne-CREs (Fig. 3b). In PRAD, Ne-CREs were devoid of FOXA1 binding and heterochromatic as assayed by ATAC-seq, but they acquired FOXA1 binding and chromatin accessibility in NEPC (Fig. 3c). Conversely, Ad-CREs lost FOXA1 binding in NEPC and became less accessible by ATAC-seq. These changes were not due to FOXA1 mutations, which can promote neuroendocrine gene transcription¹⁹, because all NEPC LuCaPs contained wild-type FOXA1 sequences. To contextualize the extent of FOXA1 reprogramming in NEPC, we compared FOXA1-binding profiles in normal prostate epithelium, localized PRAD, and PDXs derived from metastatic PRAD. At the same level of stringency, fewer than 500 sites exhibited differential FOXA1 binding between these categories; by comparison, FOXA1 binding was gained at 20,935 and lost at 29,308 sites in NEPC compared to metastatic PRAD (Fig. 3d).

**Fig. 3: Reprogramming of the FOXA1 cistrome in NEPC.**

NEPC genes contain bivalent promoter marks in PRAD

We sought to understand the mechanism by which FOXA1 binding is reprogrammed in NEPC. In addition to DNA sequence, cooperative binding with partner TFs is an important determinant of pioneer factor localization²⁰. Since the motifs recognized by ASCL1 and NKX2-1 were highly enriched at Ne-CREs (Fig. 1g), we tested whether overexpression of these TFs in the PRAD cell line LNCaP could induce FOXA1 binding at Ne-CREs. Overexpression of ASCL1 and NKX2-1 (A + N) increased FOXA1 binding at NEPC-enriched FOXA1-binding sites (Fig. 4a, b) and induced H3K27 acetylation of Ne-CREs (Fig. 4c–f). ASCL1 co-localized with FOXA1 at NEPC-enriched FOXA1-binding sites and Ne-CREs (Fig. 4g–h). A+N expression recapitulated global transcriptional changes between NEPC and PRAD, including suppression of AR and induction of SYP and CHGA (Fig. 4i–k). Thus, ectopic expression of ASCL1 and NKX2-1 is sufficient to partially reprogram FOXA1 binding in PRAD to Ne-CREs and induce de novo H3K27 acetylation at these regions, with resultant NEPC gene expression.

**Fig. 4: FOXA1 is extensively redistributed at lineage-specific regulatory elements.**

Despite intense interest, it remains unclear why PRAD can adopt a seemingly unrelated lineage to overcome androgen blockade, while most cancers do not dramatically alter their cellular identity throughout treatment. Lineage tracing studies have demonstrated that the epithelial cells that give rise to PRAD share a common developmental progenitor with resident neuroendocrine cells in the prostate^21,22. In this common progenitor cell, Ne-CREs and their FOXA1-binding sites might be physiologically poised for activation upon commitment to a neuroendocrine lineage. In support of this model, genes that are highly expressed in normal neuroendocrine prostate cells are also highly expressed in NEPC (Fig. 5a), and are enriched for nearby Ne-CREs and NEPC-restricted FOXA1-binding sites (Fig. 5b). Additionally, Ne-CREs are relatively hypomethylated in normal prostate tissue and PRAD despite absence of H3K27 acetylation, a feature of decommissioned enhancers that were active in development (Fig. 5c)^23,24.

**Fig. 5: Gene expression of benign prostate cells compared to NEPC transcriptomes and epigenomes.**

We hypothesized that a neuroendocrine epigenomic program is encoded in the developmental history of the prostate, thereby priming NEPC genes for inappropriate activation under the selective pressure of androgen blockade. Consistent with this hypothesis, many genes that become highly expressed in NEPC have “bivalent” (H3K4me3⁺/H3K27me3⁺) promoter histone marks in normal prostate tissue and PRAD (Fig. 6a). Bivalent genes are thought to be poised for lineage-specific activation upon removal of H3K27me3 at the appropriate stage of development^25,26. Our data suggested that a similar principle underlies transcriptional changes in neuroendocrine differentiation of prostate cancer. H3K27me3 levels decreased in NEPC compared to PRAD at 633 gene promoters, which were enriched for binding sites of the REST repressor of neuronal lineage transcription²⁷ (Supplementary Fig. 4). Similar numbers of these promoters were bivalent (H3K4me3⁺/H3K27me3⁺; n = 195) and repressed (H3K4me3⁻/H3K27me3⁺; n = 229) in PRAD (Fig. 6b). Critically, however, genes with bivalent (H3K4me3⁺) promoters in PRAD became more highly expressed in NEPC (Fig. 6c) than H3K4me3⁻ genes. These bivalent genes, which included NEPC TFs ASCL1, INSM1, and SOX2, may have been prepared for activation in the development of a prostate progenitor cell. Their residual H3K4me3 and promoter hypomethylation (Fig. 6d) suggest heightened potential for reactivation²⁴ in NEPC with the disruption of pro-luminal AR-driven transcriptional programs.

**Fig. 6: Encoding of neuroendocrine regulatory programs in the developmental history of prostate cancer.**

Discussion

In summary, our work demonstrates that the cis-regulatory landscape of prostate cancer is extensively reprogrammed in NEPC. Epigenomic profiling of human NEPC xenografts supports a central role of FOXA1 in this reprogramming. The FOXA1 cistrome shifts dramatically between NEPC and PRAD, with gain of FOXA1-binding sites at NEPC regulatory elements and loss of FOXA1 at PRAD elements. FOXA1 exhibits features of a master transcriptional regulator. It is encompassed by a super-enhancer in NEPC and is involved in core regulatory circuits with neuronal lineage TFs such as ASCL1. Future studies will be necessary to determine whether the shift in FOXA1-binding sites is required for neuroendocrine differentiation, or merely correlative. In either case, we establish FOXA1 as a dependency in NEPC. The finding that FOXA1 is essential in NEPC has perhaps been overlooked because candidate drivers of NEPC have been identified mainly based on differential transcript expression or somatic DNA alterations^6,10,28,29. Our study demonstrates that epigenomic profiling can identify cancer dependencies that are difficult to detect based on mutational and transcriptional profiling alone.

A dependency of NEPC on FOXA1 is unexpected based on prior work. FOXA1 has been reported to inhibit neuroendocrine differentiation of prostate adenocarcinoma, based on the observations that FOXA1 is downregulated in NEPC and that FOXA1 knock-down induces neuroendocrine features in PRAD cell lines³⁰. Our data demonstrate that FOXA1 remains crucial in NEPC despite consistent, modest transcript downregulation in NEPC compared to PRAD. Our H3K27ac HiChIP data reveal that in NEPC, FOXA1 contacts distal super-enhancers that are distinct from its PRAD enhancers and contain binding sites for NE-associated TFs such as ASCL1 and INSM1 (Fig. 2d and Supplementary Fig. 5). Thus, an NEPC-specific regulatory program may maintain FOXA1 expression at lower levels that are conducive to NE gene expression, reconciling our findings with the reported pro-neuroendocrine effects of partial FOXA1 suppression in PRAD³⁰. This hypothesis should be tested in future mechanistic studies. While our data show that FOXA1 is essential in NEPC, further studies are required to determine if FOXA1 cistrome reprogramming directly activates Ne-CREs and to assess its role dynamic lineage plasticity.

FOXA1 may have a more general role in controlling neuroendocrine differentiation. For example, in small cell lung cancer (SCLC), a neuroendocrine lung cancer variant that can emerge de novo or from EGFR-mutant lung adenocarcinoma after targeted kinase inhibition, FOXA1 is highly expressed and encompassed by a super-enhancer³¹. We observe extensive H3K27 acetylation in SCLC cell lines specifically at Ne-CREs and NEPC-enriched FOXA1-binding sites, suggesting similar enhancer usage between in SCLC and NEPC (Supplementary Fig. 6), consistent with recent reports^28,32. The large set of Ne-CREs and NEPC-enriched FOXA1-binding sites could aid the pathologic diagnosis of neuroendocrine differentiation, which can be challenging and relies on only a handful of markers. Ultimately, therapeutic targeting of FOXA1 and/or proteins that collaborate with or covalently modify this TF³³ presents an attractive strategy as FOXA1 is a common vulnerability in both PRAD and NEPC.

Methods

Patient-derived xenograft and tissue specimens

LuCaP patient-derived xenografts (PDXs) have been described previously^9,34,35 with the exception of LuCaP 208.1. LuCaP 208.1 was derived from treatment-emergent NEPC and demonstrates typical small cell histology. All LuCaP PDXs, including LuCaP 208.1 were derived from resected prostate cancer with explicit written consent of patient donors as described previously⁹ under a protocol approved by the University of Washington Human Subjects Division IRB (#39053). PDXs were generated in compliance with all relevant ethical regulations for animal testing and research under protocol #39053. The University of Washington Institutional Animal Care and Use Committee approved all animal procedures including generation and processing of PDXs, including LuCaP 208.1. Liver metastasis needle biopsy specimens were obtained from the Dana-Farber Cancer Institute Gelb Center biobank and were collected under a DFCI/Harvard Cancer Center IRB-approved protocol (01-045) with informed consent of patients. Metastases were reviewed by a clinical pathologist. The NEPC metastasis was obtained from a patient with de novo metastatic prostate adenocarcinoma after 17 months of androgen deprivation therapy with leuprolide and bicalutamide. Immunohistochemistry revealed staining for synaptophysin, chromogranin, and NKX3-1 (weak), and absence of RB1, AR, and PSA.

Epigenomic profiling

chromatin immunoprecipitation in LuCaP PDXs

Frozen tissue (20–30 mg for histone mark ChIP and 50–80 mg for transcription factor ChIP) was pulverized using the CryoPREP dry impactor system (Covaris). The tissue was then fixed using 1% formaldehyde (Thermo fisher) in phosphate-buffered saline (PBS) for 18 min either at 37 degrees Celsius (histone mark ChIP) or at room temperature (transcription factor ChIP) and was quenched with 125 mM glycine. Chromatin was lysed in ice-cold lysis buffer (50 mM Tris, 10 mM EDTA, 1% SDS with protease inhibitor for histone mark ChIP; 0.1% SDS, 0.5% sodium deoxycholate and 1% NP-40 with protease inhibitor for transcription factor ChIP) and was sheared to 300–800 bp using the Covaris E220 sonicator (105 watt peak incident power, 5% duty cycle, 200 cycles/burst for 10 min for histone mark ChIP; 140 watt peak incident power, 5% duty cycle, 200 cycles/burst for 20 min for transcription factor ChIP). Five volumes of dilution buffer (1% Triton X-100, 2 mM EDTA, 150 mM NaCl, 20 mM Tris-HCl pH 8.1) were added to chromatin for histone mark ChIP. The sample was then incubated with antibodies (H3K27ac, Diagenode, C15410196; H3K27me3, Cell Signaling 9733S; H3K4me3, Diagenode C15410003 premium; FOXA1, ab23738, Abcam) coupled with protein A and protein G beads (Life Technologies) at 4 degrees Celsius overnight. The chromatin was washed with RIPA wash buffer (100 mM Tris pH 7.5, 500 mM LiCl, 1% NP-40, 1% sodium deoxycholate) for 10 min six times and rinsed with TE buffer (pH 8.0) once. DNA was purified using MinElute column followed by incubation in the de-crosslinking buffer (1% SDS, 0.1 M NaHCO3 with Proteinase K and RNase A) at 65 degrees Celsius.

LNCaP ChIP

ChIP in LNCaP was performed according to published protocols³⁶. Ten million cells were fixed with 1% formaldehyde at room temperature for 10 min and quenched. Cells were collected in lysis buffer (1% NP-40, 0.5% sodium deoxycholate, 0.1% SDS and protease inhibitor (#11873580001, Roche) in PBS)³⁷. Chromatin was sonicated to 300–800 bp using a Covaris E220 sonicator (140 watt peak incident power, 5% duty cycle, 200 cycleburtst). Antibodies (FOXA1, ab23738, Abcam; H3K27ac, C15410196, Diagenode; ASCL1, ab74065) were incubated with 40 μl of Dynabeads protein A/G (Invitrogen) for at least 6 h before immunoprecipitation of the sonicated chromatin overnight. Chromatin was washed with LiCl wash buffer (100 mM Tris pH 7.5, 500 mM LiCl, 1% NP-40, 1% sodium deoxycholate) six times for 10 min sequentially.

ChIP sequencing

Sequencing libraries were generated from purified IP sample DNA using the ThruPLEX-FD Prep Kit (Rubicon Genomics). Libraries were sequenced using 150-base paired-end reads on an Illumina platform (Novogene).

ATAC-seq

LuCaP PDX tissues were resuspended and dounced in 300 ul of RSB buffer (10 mM Tris-HCl pH 7.4, 10 mM NaCl, and 3 mM MgCl₂ in water) containing 0.1% NP-40, 0.1% Tween-20, and 0.01% digitonin. Homogenates were transferred to a 1.5 ml microfuge tube and incubated on ice for 10 min. Nuclei were filtered through a 40 μm cell strainer and nuclei were washed with RSB buffer and counted. Fifty thousand nuclei were resuspended in 50 μl of transposition mix³⁸ (2.5 μl transposase (100 nM final), 16.5 μl PBS, 0.5 μl 1% digitonin, 0.5 μl 10% Tween-20, and 5 μl water) by pipetting up and down six times. Transposition reactions were incubated at 37 °C for 30 min in a thermomixer with shaking at 1000 r.p.m. Reactions were cleaned with Qiagen columns. Libraries were amplified³⁹ and sequenced on an Illumina Nextseq 500 with 35 base paired-end reads.

ChIP-seq data analysis

ChIP-sequencing reads were aligned to the human genome build hg19 using the Burrows-Wheeler Aligner (BWA) version 0.7.17⁴⁰. Non-uniquely mapping and redundant reads were discarded. MACS v2.1.1.20140616⁴¹ was used for ChIP-seq peak calling with a q-value (FDR) threshold of 0.01. ChIP-seq data quality was evaluated by a variety of measures, including total peak number, FrIP (fraction of reads in peak) score, number of high-confidence peaks (enriched >10-fold over background), and percent of peak overlap with DNAse hypersensitivity (DHS) peaks derived form the ENCODE project. ChIP-seq peaks were assessed for overlap with gene features and CpG islands using annotatr⁴². IGV v2.8.2⁴³ was used to visualize normalized ChIP-seq read counts at specific genomic loci. ChIP-seq heatmaps were generated with deepTools v3.3.1⁴⁴ and show normalized read counts at the peak center ±2 kb unless otherwise noted. Overlap of ChIP-seq peaks was assessed using BEDTools v2.26.0. Peaks were considered overlapping if they shared one or more base pairs.

Identification and annotation of PRAD- and NEPC-enriched ChIP-seq peaks

Sample–sample clustering, principal component analysis, and identification of lineage-enriched peaks were performed using Cobra v2.0⁴⁵ (https://bitbucket.org/cfce/cobra/src/master/), a ChIP-seq analysis pipeline implemented with Snakemake⁴⁶. ChIP-seq data from PRAD and NEPC LuCaP PDXs were compared to identify H3K27ac, H3K27me3, and FOXA1 peaks with significant enrichment in the NEPC or PRAD lineage. Only LuCaP PDXs from distinct patients were included, with the exception of the H3K27me3 differential peak analysis, which included both LuCaP 145.1 and 145.2, two LuCaP PDXs derived from distinct NEPC metastases from a single patient. A union set of peaks for each histone modification or TF was created using BEDTools. narrowPeak calls from MACS were used for H3K27ac and FOXA1, while broadPeak calls were used for H3K27me3. The number of unique aligned reads overlapping each peak in each sample was calculated from BAM files using BEDtools. Read counts for each peak were normalized to the total number of mapped reads for each sample. Quantile normalization was applied to this matrix of normalized read counts. Using DEseq2 v1.14.1⁴⁷, lineage-enriched peaks were identified at the indicated FDR-adjusted p-value (p_adj) and log₂ fold-change cutoffs (H3K27ac, p_adj < 0.001, |log₂ fold-change| >3; FOXA1, p_adj < 0.001, |log₂ fold-change| >2; H3K27me3, p_adj < 0.01, |log₂ fold-change| > 1). Unsupervised hierarchical clustering was performed based on Spearman correlation between samples. Principal component analysis was performed using the prcomp R function. Enriched de novo motifs in differential peaks were detected using HOMER version 4.7. The top non-redundant motifs were ranked by adjusted p-value.

The GREAT tool⁴⁸ (V3.0) was used to asses for enrichment of Gene Ontology (GO) and MSigDB perturbation annotations among genes near differential ChIP-seq peaks, assigning each peak to the nearest gene within 500 kb. The cistromedb toolkit (http://dbtoolkit.cistrome.org/) was used to compare ChIP-seq peaks for overlap with peaks from a large database of uniformly analyzed published ChIP-seq data (quantified as a “GIGGLE score”)⁴⁹. Published TFs and histone marks were ranked by similarity to the querry dataset based on the top 1000 peaks in each published dataset. Prior to cistromedb toolkit analysis, ChIP-seq peaks were mapped from hg19 to hg38 using the UCSC liftover tool (https://genome.ucsc.edu/cgi-bin/hgLiftOver).

For analysis of H3K27 acetylation in lung cancer at lineage-enriched candidate regulatory elements, fastq files were generated from sequence read archives (SRA) from published ChIP-seq experiments for SCLC⁵⁰ and LUAD^51,52 (SRA numbers SRR568435, SRR3098556, SRR4449027, SRR4449025, and SRR6124068).

For Fig. 5c, H3K27ac ChIP-seq peaks from primary peripheral blood monocytes (ENCFF540CVX) and epithelial keratinocytes (ENCFF943CBQ)⁵³ were used as a comparator to peaks derived from LuCaP PDXs. For these comparisons, monocyte and keratinocyte peaks within 1 kb of a LuCaP peak were excluded.

RNA-seq and differential expression analysis

RNA-seq data from human adenocarcinoma and NEPC have been reported previously⁵ and were obtained from dbGaP (accession number phs000909.v1.p1). Transcriptomes were sequenced from two replicates from each of five PRAD LuCaP PDXs (23, 77, 78, 81, and 96) and five NEPC LuCaP PDXs (49, 93, 145.1, 145.2, and 173.1). RNA concentration, purity, and integrity were assessed by NanoDrop (Thermo Fisher Scientific Inc.) and Agilent Bioanalyzer. RNA-seq libraries were constructed from 1 μg total RNA using the Illumina TruSeq Stranded mRNA LT Sample Prep Kit according to the manufacturer’s protocol. Barcoded libraries were pooled and sequenced on the Illumina HiSeq 2500 generating 50 bp paired-end reads. FASTQ files were processed using the VIPER workflow⁵⁴. Read alignment to human genome build hg19 was performed with STAR v 2.7.0f⁵⁵. Cufflinks was used to assemble transcript-level expression data from filtered alignments⁵⁶. Differential gene expression analysis (NEPC vs. PRAD) was conducted using DESeq2⁴⁷.

H3K27ac HiChIP

Pulverized frozen tissue from LuCaP 173.1 was fixed with 1% formaldehyde in PBS at room temperature for 10 min³⁶. Sample was incubated in lysis buffer and digested with MboI (NEB) for 4 h. After 1 h of biotin incorporation with biotin dATP, the sample was ligated using T4 DNA ligase for 4 h. Chromatin was sheared using 140 PIP, 5% duty cycle, and 200 cycles/burst for 8 min in shearing buffer composed of 1% NP-40, 0.5% sodium deoxycholate, and 0.1% SDS in PBS (LNCaP) or using 100 PIP, 5% duty cycle, 200 cycles/burst for 3 min in 1% SDS, 50 mM Tris (pH 8.1), and 5 mM EDTA (LuCaP 173.1). ChIP was then performed using H3K27Ac antibody (Diagenode, C1541019)⁵⁷.

Immunoprecipitated sample was pulled down with streptavidin C1 beads (Life Technologies) and treated with Transposase (Illumina). Amplification was performed for the number of cycles required to reach 1/3 of the maximal fluorescence on quantitative PCR (qPCR) plot with SYBR® Green I(Life Technologies). Libraries were sequenced using 150-base paired-end reads on the Illumina platform (Novogene).

Alignment and filtering using HiC-Pro

We processed paired-end fastq files using HiC-Pro⁵⁸ to generate intra- and inter-chromosomal contact maps. The reads were first trimmed to remove adaptor sequences using Trim Galore (https://github.com/FelixKrueger/TrimGalore). Default settings from HiC-Pro were used to align reads to the hg19 human genome, assign reads to MboI restriction fragments, and remove duplicate reads. Only uniquely mapped valid read pairs involving two different restriction fragments were used to build the contact maps.

FitHiChIP

We applied FitHiChIP⁵⁹ for bias-corrected peak calling and DNA loop calling.

We used MACS2 broadPeak peak calls from H3K27ac ChIP-seq in LuCaP 173.1 (NEPC). 44,609 peaks were called at a q-value < 0.01. We used a 5 kb resolution and considered only interactions between 5 kb and 3 Mb. We used peak-to-peak (stringent) interactions for the global background estimation of expected counts (and contact probabilities for each genomic distance), and peak-to-all interactions for the foreground, meaning at least one anchor must overlap a H3K27ac peak. The corresponding FitHiChiP options specified are “IntType = 3” and “UseP2PBackgrnd = 1”.

Assignment of enhancer-promoter interactions using H3K27ac HiChIP data

NCBI RefSeq genes (hg19) were downloaded from the UCSC genome table browser (https://genome.ucsc.edu/cgi-bin/hgTables). Only uniquely mapping genes were considered. The longest transcript was selected for genes with multiple annotated transcripts. We searched for H3K27ac HiChIP loops with one anchor (defined with a 5 kb window) overlapping a region between 0 and 5 kb upstream of a gene transcriptional start site. We selected subset of these loops for which the second anchor (with a 5 kb window) overlapped with H3K27ac peaks identified by ChIP-seq in LuCaP 173.1 (NEPC) or with NEPC-enriched H3K27ac peaks (Ne-CREs). Gene promoters and distal H3K27ac peaks/Ne-CREs were considered looped if each overlapped with an anchor of the same high-confidence H3K27ac HiChIP loop(s). To examine the association of regulatory element looping with gene expression, genes were binned by the number of distinct, looped Ne-CREs or H3K27ac peaks. Differential expression between NEPC and PRAD LuCaP PDXs, as assessed by DESeq2 analysis of LuCaP RNA-seq data, was plotted for genes in each bin. Wilcoxon rank-sum p-values were calculated for differential expression of genes looped to one versus two or more H3K27ac/Ne-CRE peaks. A p-value < 0.01 was considered significant.

Master transcription factor analysis

Super-enhancer ranking analyses

Enhancer and super-enhancer (SE) calls were obtained using the Rank Ordering of Super-enhancer (ROSE2) algorithm¹¹. We selected SEs assigned to transcription factors (TFs)^60,61, and for each sample, we obtained the ranks of all TF SEs. Considering only the top 5% TFs by median ranking in NEPC or PRAD, we applied a one-sided Mann–Whitney U-test to identify lineage-enriched TF SEs (FDR = 10%).

Clique enrichment and clustering analysis

Clique enrichment scores (CESs) for each TF were calculated using clique assignments from Coltron⁶². Coltron assembles transcriptional regulatory networks (cliques) based on H3K27 acetylation and TF-binding motif analysis. The clique enrichment score for a given TF is the number of cliques containing the TF divided by the total number of cliques. We incorporated ATAC-seq data to restrict the motif search to regions of open chromatin. Using the CES, we performed clustering (distance = Canberra, agglomeration method = ward.D2) considering only TFs that appear in cliques in at least 80% of the samples in at least one lineage group (4 out of 5 NEPC and 11 out of 14 PRAD).

Motif enrichment at super-enhancers with loops to the FOXA1 locus

H3K27ac HiChIP data were used to select distal SEs that form three-dimensional contacts with the FOXA1 locus. We used the Coltron algorithm to search for TF motifs in ATAC-seq peaks within these SEs. We considered all TFs that were categorized as expressed by Coltron based on H3K27ac levels at the TF gene locus. Motif enrichment for a TF was calculated as the total number of non-overlapping base pairs (bp) covered by the TF motif, divided by the summed length (in bp) of the SEs. Values in the heatmap legend correspond to percent coverage (i.e., the largest value corresponds to 0.4%).

FOXA1 mutational profiling

FOXA1 mutational status was assessed from exome sequence data (62x–110X depth of coverage). Each LuCaP PDX was sequenced using the Illumina Hi-seq platform with 100 bp paired-end reads. Hybrid capture was performed SeqCapV3. Mouse genome subtraction was performed using the mm10 genome build and reads were aligned to human reference genome hg19. For sequence analysis, bam files processed as per Genome Analysis Toolkit (GATK) best practice guideline⁶³. We Used MuTect2 and HaplotypeCaller for mutation calls. All mutations were manually reviewed and subsequently annotated using Annovar⁶⁴. Copy number was derived using the Sequenza R package⁶⁵. Factera⁶⁶ was used to predict structural events involving FOXA1.

FOXA1 siRNA knock-down

WCM154 organoids were cultured and maintained as described¹⁸. Organoids were dissociated to single cells using TrypLE (ThermoFisher). One million cells were resuspended in 20 μl of electroporation buffer (BTXpress) and mixed with 60 pmole of control or FOXA1 On-target pool siRNA (Dharmacon). Then organoid-siRNA mixtures were transferred to a 16-well NucleocuvetteTM Strip and nucleofection was performed in a 4D-Nucleofector (Lonza). Following nucleofection, 10⁵ organoids cells were grown in a 12-well plate coated with 1% collagen I (ThermoFisher) for 7 days. Both adherent and floating cells were collected and stained with 0.4% trypan blue solution (ThermoFisher). Total cell numbers were measured by a hemocytometer. Cell proliferation with FOXA1 knock-down was normalized to control siRNA cells.

FOXA1 shRNA knock-down

LNCaP, LNCaP 42D, and LNCaP 42F cells were seeded in parallel 6-well plates at 500, 500, or 100 k, respectively. Twenty-four hours later, cells were infected with lentivirus containing shRNAs targeting GFP control or FOXA1. Forty-eight hours following infection, equal cell numbers were seeded, and proliferation was assayed 6 days later using a Vi-Cell. Seventy-two hours following infection, a second plate infected in parallel was harvested for immunoblotting. The target sequence against GFP was CCACATGAAGCAGCACGACTT (shGFP). The target sequences against FOXA1 were GCGTACTACCAAGGTGTGTAT (shFOXA1-1) and TCTAGTTTGTGGAGGGTTAT (shFOXA1-2).

FOXA1 CRISPR-Cas9 knock-out

Blasticidin-resistant Cas9-positive LNCaP, LNCaP 42D, and LNCaP 42F cells were cultured in 20 μg/ml blasticidin (Thermo Fisher Scientific, NC9016621) for 72 h to select for cells with optimal Cas9 activity. LNCaP, LNCaP 42D, and LNCaP 42F cells were seeded in parallel 6-well plates at 300, 300, 300, or 60 k, respectively. Cells were infected after 24 h with lentiviruses expressing sgRNAs targeting GFP control or FOXA1. Cells were subject to puromycin selection and harvested for immunoblot after 3 days. Six days following selection, cell viability was determined using a Vi-Cell. The target sequences against GFP were AGCTGGACGGCGACGTAAA (sgGFP1) and GCCACAAGTTCAGCGTGTCG (sgGFP2). The target sequences against FOXA1 were GTTGGACGGCGCGTACGCCA (sgFOXA1-1), GTAGTAGCTGTTCCAGTCGC (sgFOXA1-2), CAGCTACTACGCAGACACGC (sgFOXA1-3), and ACTGCGCCCCCCATAAGCTC (sgFOXA1-4).

Western blots

For WCM154 western blots, cell pellets were lysed in RIPA buffer (MilliporeSigma, 20–188) supplemented with Protease/Phosphatase Inhibitor Cocktail (Cell Signaling Technology, 5872S). Protein concentrations were assayed with a Pierce BCA Protein Assay Kit (Thermo Fisher Scientific, PI23225), and protein was subsequently denatured in NuPAGE LDS sample buffer (Thermo Fisher Scientific, NP0007) containing 5% β-Mercaptoethanol. Thirteen micrograms of each protein sample was loaded onto NuPAGE 4–12% Bis-Tris Protein gels (Thermo Fisher Scientific), and samples were run in NuPAGE MOPS SDS Running Buffer (Thermo Fisher Scientific, NP0001). Following electrophoresis, proteins were transferred to nitrocellulose membranes via an iBlot apparatus (Thermo Fisher Scientific). After blocking in Odyssey Blocking Buffer (LI-COR Biosciences, 927-70010) for 1 h at room temperature, membranes were cut and incubated in primary antibodies diluted 1:1000 in Odyssey Blocking Buffer overnight at 4 °C. The next morning, membranes were washed three times with phosphate-buffer saline, 0.1% Tween (PBST) and then incubated with fluorescent anti-rabbit secondary antibodies (Thermo Fisher Scientific, NC9401842) for 1 h at room temperature. Membranes underwent five PBST washes and were then imaged using an Odyssey Imaging System (LI-COR Biosciences). Primary antibodies used include FOXA1 (Cell Signaling Technology, 58613S) and β-actin (Cell Signaling Technology, 8457L).

For LNCaP, LNCaP 42D, and LNCaP 42F western Blots, cell lysate was extracted using RIPA lysis buffer (Sigma) containing protease inhibitor (Roche) and phosphatase inhibitor (ThermoFisher). Fifty micrograms of protein was subjected to a 4–15% Mini-PROTEAN Precast electrophoresis gel (Bio-Rad) then transferred to 0.22 µm nitrocellulose membrane (Bio-Rad) and blocked in 5% blotting grade blocker (Bio-Rad). Membranes were incubated with primary antibodies overnight (FOXA1, Abcam, 1:2000, ab23738; Synaptophysin, Cell Marque, 1:5000, MRQ-40; INSM1, Santa Cruz, 1:2000, sc-377428; FOXA2, Abcam; 1:2500, ab108422; Chromogranin A, Abcam, 1:2000, ab15160; Vinculin, Cell signaling, 1:5000, #13901). Membranes were then washed in 1x Tris-buffered saline with 0.5% Tween-20 (Boston BioProducts) and incubated with secondary antibodies (mouse, Bio-Rad, 1:2500; rabbit, Bio-Rad, 1:2500). Western HRP substrate kit was used to detect chemiluminescent signal (Millipore, Classico).

Analysis of FOXA1-binding sites across prostate cancer states

FOXA1 cistromes were compared across different states of prostate cancer progression (normal prostate, prostate-localized adenocarcinoma, PDXs derived from metastatic castration resistant prostate cancer, and PDXs derived from NEPC). FOXA1 ChIP from normal prostate tissue and prostate-localized adenocarcinoma will be reported separately (Pomerantz et al., submitted). For normal prostate tissue FOXA1 ChIP, tissue cores were obtained from regions of prostatectomy specimens with dense epithelium and no evidence of neoplasia on review by a genitourinary pathologist. PDX samples used are listed in Supplementary Data 1. PDXs derived from localized prostate cancer were excluded from this analysis. As the normal prostate and localized adenocarcinoma samples were sequenced with single-end sequencing with an average of ~20 million reads, paired-end sequencing data from LuCaP PDXs were down-sampled to 20 M reads, using a single-end trimmed to 75 base pairs using seqtk (https://github.com/lh3/seqtk).

Pairwise comparisons were made between normal prostate (N = 5) and localized PRAD (N = 5), localized PRAD and metastatic PRAD PDXs (N = 11), and metastatic PRAD PDXs and NEPC PDXs (N = 5) using DESeq2 as described above. Peaks were considered significantly different between groups at a log2 |fold-change| threshold of 2 and FDR-adjusted p-value threshold of 0.001. “Shared” peaks were defined as the intersection of all peaks that were present in each group but not significantly different in any comparison.

Immunohistochemistry

Immunohistochemistry was performed on tissue microarray (TMA) sections. TMA slides were stained for FOXA1 (Abcam ab170933, 1:100 dilution with 10 mM NaCitrate antigen retrieval) and FOXA2 (Abcam ab108422, 1:500 dilution with 10 mM NaCitrate antigen retrieval) using a standard procedure⁶⁷. Rabbit IgG was used as a negative control. Nuclear staining intensity was assigned levels 0, 1+, 2+, or 3+ and H-scores were calculated as: [1 x (% of 1+ cells) + 2 x (% of 2+ cells) + 3 x (% of 3+ cells)]. Evaluations were performed in a blinded fashion.

ASCL1/NKX2-1 overexpression in LNCaP

Transduction of LNCaP cells with ASCL1 and NKX2-1

The open reading frames of ASCL1 and NKX2-1 were cloned into the pLX_TRC302 lentiviral expression vector (Broad Institute) using the gateway recombination system. A construct expressing eGFP (pLX_TRC302_GFP) was used as a negative control. Viruses were generated by transfecting 293T cells with packaging vectors pVsVg and pDelta8.9. Supernatant was collected after 48 h. LNCaP cells were transduced in the presence of 4 μg/ml polybrene and harvested after 3 days for RNA-seq, ATAC-seq, and ChIP-seq.

ChIP seq was performed as described above, using 10–15 million cells fixed with 1% paraformaldehyde for 10 min at room temperature, followed by quenching with glycine. RNA was isolated using QIAGEN RNeasy Plus Kit and cDNA synthesized using Clontech RT Advantage Kit. Quantitative PCR was performed on a Quantstudio 6 using SYBR green. Primers used for quantitative reverse transcription PCR (qRT-PCR) are listed in Supplementary Table 4.

Analysis of promoter H3K4 and H3K27 trimethylation

Refseq gene coordinates (hg19) were compiled, selecting the longest isoform where multiple were annotated. Normalized tag counts from H3K27me3 and H3K4me3 ChIP-seq within 2 kb of each transcriptional start site (TSS) were calculated for each sample, then averaged across multiple samples in each group (five NEPC PDXs, five PRAD PDXs, three normal prostates; Pomerantz et al., submitted). Contours were calculated using the R function geom_density_2d from the ggplot2 package; they represent the 2d kernel density estimation for all included transcriptional start sites. Gene promoters were assigned “active”, “bivalent”, “unmarked”, and “repressed” annotations based on H3K4me3 and H3K27me3 levels. High/low cutoffs for these marks were determined as follows. First, the H3K4me3 normalized tag counts near each TSS were fit to two normal distributions using the normalmixEM R function from the mixtools R package. The cutoff between H3K4me3-high and -low was set at four standard deviations below the mean value of the H3K4me3-high distribution. Next, the normalized H3K27me3 tag counts near H3K4me3-high TSSs were fit to two normal distributions. The cutoff for H3K27me3-high promoters was set at four standard deviations above the mean value of the H3K27me3-low distribution. The Pearson Chi-squared test was used to quantify significance of enrichment of NEPC-upregulated genes in the “bivalent” quadrant compared to “repressed” or “unmarked” quadrants. NEPC-upregulated genes were defined as those with log₂ fold-change >3 and adjusted p-value < 1 x 10⁻⁶ in NEPC vs. PRAD. The results of the analysis were robust to using other p-value and differential expression thresholds.

Methylation analysis of normal prostate

Whole-genome bisulfite sequencing data from histologically normal prostate tissue were reported previously⁶⁸ and processed in our prior report⁶⁹. Briefly, bases with a Phred score below 20 were trimmed and adapter sequences were discarded using Trim Galore! (version 0.4.4_dev; https://github.com/FelixKrueger/TrimGalore). Trimmed reads were mapped to the hg19 reference genome using Bowtie2 and Bismark v0.19.0⁷⁰. Methylated read counts and total coverage for CpGs were extracted using the bismark_methylation_extractor command and portions of methylated CpGs were computed with the bsseq Bioconductor package⁷¹. CpG methylation at indicated sites was visualized using deepTools⁴⁴ v3.3.1.

Statistical tests

All statistical tests were two-sided except where otherwise indicated.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The ChIP-seq data generated in this study (sequencing reads in fastq format and normalized read counts in bigwig format) have been deposited in GEO under accession number GSE161948. The RNA-seq data from clinical NEPC are available in dbGaP under accession code phs000909.v.p1. scRNAseq data used in this study are available at in GEO under accession number GSE120716. Lung cancer H3K27ac ChIP-seq data are available in SRA under accession numbers SRR568435, SRR3098556, SRR4449027, SRR4449025, and SRR6124068. H3K27ac ChIP-seq peaks from primary peripheral blood monocytes and keratinocytes are available from ENCODE (https://www.encodeproject.org/) under accession numbers ENCFF540CVX and ENCFF943CBQ. The WGBS data used in this study were obtained from the authors and are available on request by contacting Dr. Jianhua Luo at the Department of Pathology of the University of Pittsburgh (luoj@msx.upmc.edu)⁶⁸. LuCaP PDXs are available upon request from Dr. Eva Corey (ecorey@uw.edu). Source data are provided with this paper.

Code availability

Code for the Cobra pipeline⁴⁵ used for the analyses in this paper is available at https://bitbucket.org/cfce/cobra/src/master/.

References

Scher, H. et al. Increased survival with enzalutamide in prostate cancer after chemotherapy. N. Engl. J. Med. 367, 1187–1197 (2012).
Article CAS PubMed Google Scholar
de Bono, J. S. et al. Abiraterone and increased survival in metastatic prostate cancer. N. Engl. J. Med. 364, 1995–2005 (2011).
Article PubMed PubMed Central Google Scholar
Aggarwal, R. et al. Clinical and genomic characterization of treatment-emergent small-cell neuroendocrine prostate cancer: a multi-institutional prospective study. J. Clin. Oncol. 36, 22492–22503 (2018).
Google Scholar
Bluemn, E. et al. Androgen Receptor pathway-independent prostate cancer is sustained through FGF signaling. Cancer Cell 32, 474–489.e6 (2017).
Article CAS PubMed PubMed Central Google Scholar
Beltran, H. et al. Divergent clonal evolution of castration resistant neuroendocrine prostate cancer. Nat. Med. 22, 298–305 (2016).
Article CAS PubMed PubMed Central Google Scholar
Davies, A. H., Beltran, H. & Zoubeidi, A. Cellular plasticity and the neuroendocrine phenotype in prostate cancer. Nat. Rev. Urol. 15, 271–286 (2018).
Article CAS PubMed Google Scholar
Zaret, K. S. & Carroll, J. S. Pioneer transcription factors: establishing competence for gene expression. Genes Dev. 25, 22227–22241 (2011).
Article CAS Google Scholar
Friedman, J. & Kaestner, K. The Foxa family of transcription factors in development and metabolism. Cell Mol. Life Sci. 63, 2317–2328 (2006).
Article CAS PubMed Google Scholar
Nguyen, H. et al. LuCaP prostate cancer patient‐derived xenografts reflect the molecular heterogeneity of advanced disease and serve as models for evaluating cancer therapeutics. Prostate 77, 654–671 (2017).
Article CAS PubMed PubMed Central Google Scholar
Mu, P. et al. SOX2 promotes lineage plasticity and antiandrogen resistance in TP53- and RB1-deficient prostate cancer. Science 355, 84–88 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Whyte, W. A. et al. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell 153, 307–319 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lin, C. et al. Active medulloblastoma enhancers reveal subgroup-specific cellular origins. Nature 530, 57 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Ott, C. J. et al. Enhancer architecture and essential core regulatory circuitry of chronic lymphocytic leukemia. Cancer Cell 34, 982–995.e7 (2018).
Article CAS PubMed PubMed Central Google Scholar
Qi, J. et al. Siah2-dependent concerted activity of HIF and FoxA2 regulates formation of neuroendocrine phenotype and neuroendocrine prostate tumors. Cancer Cell 18, 23–38 (2010).
Article CAS PubMed PubMed Central Google Scholar
Parolia, A. et al. Distinct structural classes of activating FOXA1 alterations in advanced prostate cancer. Nature 571, 413–418 (2019).
Article CAS PubMed PubMed Central Google Scholar
Jaenisch, R. & Young, R. Stem cells, the molecular circuitry of pluripotency and nuclear reprogramming. Cell 132, 567–582 (2008).
Article CAS PubMed PubMed Central Google Scholar
Bishop, J. L. et al. The master neural transcription factor BRN2 is an androgen receptor-suppressed driver of neuroendocrine differentiation in prostate cancer. Cancer Discov. 7, 54–71 (2017).
Article CAS PubMed Google Scholar
Puca, L. et al. Patient derived organoids to model rare prostate cancer phenotypes. Nat. Commun. 9, 2404 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Adams, E. J. et al. FOXA1 mutations alter pioneering activity, differentiation and prostate cancer phenotypes. Nature 571, 408–412 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Donaghey, J. et al. Genetic determinants and epigenetic effects of pioneer-factor occupancy. Nat. Genet. 50, 250–258 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ousset, M. et al. Multipotent and unipotent progenitors contribute to prostate postnatal development. Nat. Cell Biol. 14, 1131 (2012).
Article CAS PubMed Google Scholar
Pignon, J.-C. et al. p63-expressing cells are the stem cells of developing prostate, bladder, and colorectal epithelia. Proc. Natl Acad. Sci. 110, 8105–8110 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Hon, G. C. et al. Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. Nat. Genet. 45, 1198–1206 (2013).
Article CAS PubMed PubMed Central Google Scholar
Jadhav, U. et al. Extensive recovery of embryonic enhancer and gene memory stored in hypomethylated enhancer DNA. Mol. Cell 74, 542–554.e5 (2019).
Article CAS PubMed PubMed Central Google Scholar
Bernstein, B. E. et al. A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell 125, 315–326 (2006).
Article CAS PubMed Google Scholar
Mohn, F. et al. Lineage-specific polycomb targets and de novo DNA methylation define restriction and potential of neuronal progenitors. Mol. Cell 30, 755–766 (2008).
Article CAS PubMed Google Scholar
Schoenherr, C. & Anderson, D. The neuron-restrictive silencer factor (NRSF): a coordinate repressor of multiple neuron-specific genes. Science 267, 1360–1363 (1995).
Article ADS CAS PubMed Google Scholar
Park, J. et al. Reprogramming normal human epithelial tissues to a common, lethal neuroendocrine cancer lineage. Science 362, 91–95 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Ku, S. et al. Rb1 and Trp53 cooperate to suppress prostate cancer lineage plasticity, metastasis, and antiandrogen resistance. Science 355, 78–83 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Kim, J. et al. FOXA1 inhibits prostate cancer neuroendocrine differentiation. Oncogene 36, 4072–4080 (2017).
Article CAS PubMed PubMed Central Google Scholar
Borromeo, M. D. et al. ASCL1 and NEUROD1 reveal heterogeneity in pulmonary neuroendocrine tumors and regulate distinct genetic programs. Cell Rep. 16, 1259–1272 (2016).
Article CAS PubMed PubMed Central Google Scholar
Balanis, N. G. et al. Pan-cancer convergence to a small-cell neuroendocrine phenotype that shares susceptibilities with hematological malignancies. Cancer Cell 36, 17–34.e7 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gao, S. et al. Chromatin binding of FOXA1 is promoted by LSD1-mediated demethylation in prostate cancer. Nat. Genet. 52, 1011–1017 (2020).
Article PubMed PubMed Central CAS Google Scholar
Beshiri, M. L. et al. A PDX/Organoid biobank of advanced prostate cancers captures genomic and phenotypic heterogeneity for disease modeling and therapeutic screening. Clin. Cancer Res. 24, 14332–14345 (2018).
Article Google Scholar
Labrecque, M. P. et al. Molecular profiling stratifies diverse phenotypes of treatment-refractory metastatic castration-resistant prostate cancer. J. Clin. Invest. 129, 4492–4505 (2019).
Article PubMed PubMed Central Google Scholar
Pomerantz, M. M. et al. The androgen receptor cistrome is extensively reprogrammed in human prostate tumorigenesis. Nat. Genet. 47, 1346–1351 (2015).
Article CAS PubMed PubMed Central Google Scholar
Johnson, D. S., Mortazavi, A., Myers, R. M. & Wold, B. Genome-wide mapping of in vivo protein-DNA interactions. Science 316, 1497–1502 (2007).
Article ADS CAS PubMed Google Scholar
Corces, M. R. et al. An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues. Nat. Methods 14, 959–962 (2017).
Article CAS PubMed PubMed Central Google Scholar
Buenrostro, J., Wu, B., Chang, H. & Greenleaf, W. ATAC-seq: a method for assaying chromatin accessibility genome-wide. Curr. Protoc. Mol. Biol. 109, 21.29.1–21.29.9 (2015).
Article Google Scholar
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Article PubMed PubMed Central CAS Google Scholar
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Article PubMed PubMed Central CAS Google Scholar
Cavalcante, R. G. & Sartor, M. A. annotatR: genomic regions in context. Bioinformatics 33, 12381–12383 (2017).
Article CAS Google Scholar
Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ramírez, F., Dündar, F., Diehl, S., Grüning, B. A. & Manke, T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 42, WW187–W191 (2014).
Article CAS Google Scholar
Qiu, X. et al. CoBRA: Containerized bioinformatics workflow for reproducible chip/atac-seq analysis-from differential peak calling to pathway analysis. Preprint at bioRxiv https://doi.org/10.1101/2020.11.06.367409 (2020).
Köster, J. & Rahmann, S. Snakemake—a scalable bioinformatics workflow engine. Bioinformatics 28, 2520–2522 (2012).
Article PubMed CAS Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Article PubMed PubMed Central CAS Google Scholar
McLean, C. Y. et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 28, 495–501 (2010).
Article CAS PubMed PubMed Central Google Scholar
Layer, R. M. et al. GIGGLE: a search engine for large-scale integrated genome analysis. Nat. Methods 15, 123–126 (2018).
Article CAS PubMed PubMed Central Google Scholar
Huang, Y.-H. et al. POU2F3 is a master regulator of a tuft cell-like variant of small cell lung cancer. Genes Dev. 32, 915–928 (2018).
Article CAS PubMed PubMed Central Google Scholar
Abraham, B. J. et al. Small genomic insertions form enhancers that misregulate oncogenes. Nat. Commun. 8, 14385 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Handoko, L. et al. JQ1 affects BRD2-dependent and independent transcription regulation without disrupting H4-hyperacetylated chromatin states. Epigenetics 13, 410–431 (2018).
Article PubMed PubMed Central Google Scholar
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Cornwell, M. et al. VIPER: visualization pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis. BMC Bioinforma. 19, 135 (2018).
Article CAS Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Article CAS PubMed Google Scholar
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mumbach, M. R. et al. HiChIP: efficient and sensitive analysis of protein-directed genome architecture. Nat. Methods 13, 1919–1922 (2016).
Article CAS Google Scholar
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
Article PubMed PubMed Central CAS Google Scholar
Bhattacharyya, S., Chandra, V., Vijayanand, P., Ay, F. FitHiChIP: identification of significant chromatin contacts from HiChIP data. Nat. Commun. 10, 4221 (2019).
D’Alessio, A. C. et al. A systematic approach to identify candidate transcription factors that control cell identity. Stem Cell Rep. 5, 763–775 (2015).
Article CAS Google Scholar
Lambert, S. A. et al. The human transcription factors. Cell 172, 650–665 (2018).
Article CAS PubMed Google Scholar
Federation, A. J. et al. Identification of candidate master transcription factors within enhancer-centric transcriptional regulatory networks. Preprint at bioRxiv https://doi.org/10.1101/345413 (2018).
Van der Auwera, G. A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinforma. 43, 11.10.1–11.10.33 (2013).
Article Google Scholar
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, 1e164 (2010).
Google Scholar
Favero, F. et al. Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data. Ann. Oncol. 26, 64–70 (2015).
Article CAS PubMed Google Scholar
Newman, A. M. et al. FACTERA: a practical method for the discovery of genomic rearrangements at breakpoint resolution. Bioinformatics 30, 23390–23393 (2014).
Article CAS Google Scholar
Nguyen, H. M. et al. Cabozantinib inhibits growth of androgen-sensitive and castration-resistant prostate cancer and affects bone remodeling. PLoS ONE 8, e78881 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Yu, Y. P. et al. Whole-genome methylation sequencing reveals distinct impact of differential methylations on gene transcription in prostate cancer. Am. J. Pathol. 183, 1960–1970 (2013).
Article CAS PubMed PubMed Central Google Scholar
Takeda, D. Y. et al. A somatically acquired enhancer of the Androgen Receptor is a noncoding driver in advanced prostate cancer. Cell 174, 422–432.e13 (2018).
Article CAS PubMed PubMed Central Google Scholar
Krueger, F. & Andrews, S. R. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 27, 11571–11572 (2011).
Article CAS Google Scholar
Hansen, K. D., Langmead, B. & Irizarry, R. A. BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions. Genome Biol. 13, 1R83 (2012).
Article Google Scholar
Tsherniak, A. et al. Defining a cancer dependency map. Cell 170, 564–576.e16 (2017).
Article CAS PubMed PubMed Central Google Scholar
Henry, G. H. et al. A cellular anatomy of the normal adult human prostate and prostatic urethra. Cell Rep. 25, 3530–3542.e5 (2018).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the PNW Prostate Cancer SPORE P50 CA097186, DOD W81XWH-19-1-0565, DOD W81XWH-17-1-0415, P01 CA163227, R01 CA193910, R01 CA233863, The Prostate Cancer Foundation, The PhRMA Foundation, The Richard M. Lucas Foundation, the European Union’s Horizon 2020 Research and Innovation program under the Marie Skłodowska-Curie grant agreement No. 754490, the National Cancer Institute (T32CA009172), and by Rebecca and Nathan Milikowsky. We would like to thank the patients who generously donated tissue that made this research possible.

Author information

Authors and Affiliations

Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
Sylvan C. Baca, Ji-Heui Seo, Justin Hwang, Sheng Yu Ku, Rand Arafeh, Taylor Arnoff, Connor Bell, Edward O’Connor, Xintao Qiu, Sarah Abou Alaiwi, Paloma Cejas, Klothilda Lim, Monica He, Amin Nassar, Jacob E. Berchuck, Mark M. Pomerantz, William C. Hahn, Himisha Beltran, Henry W. Long, Myles Brown & Matthew L. Freedman
The Eli and Edythe L. Broad Institute, Cambridge, MA, USA
Sylvan C. Baca, William C. Hahn & Matthew L. Freedman
Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA
Sylvan C. Baca, Ji-Heui Seo, Connor Bell, Edward O’Connor, Xintao Qiu, Sarah Abou Alaiwi, Paloma Cejas, Klothilda Lim, Monica He, Jacob E. Berchuck, Mark M. Pomerantz, Henry W. Long, Myles Brown & Matthew L. Freedman
Laboratory of Genitourinary Cancer Pathogenesis, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD, USA
David Y. Takeda, Supreet Agarwal & Kathleen Kelly
Department of Obstetrics and Gynecology and the Women’s Cancer Program at the Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
Rosario I. Corona, Marcos A. S. Fonseca & Kate Lawrenson
Center for Bioinformatics and Functional Genomics, Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA, USA
Rosario I. Corona & Kate Lawrenson
Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
Claudia Giambartolomei & Bogdan Pasaniuc
Istituto Italiano di Tecnologia, Genova, Italy
Claudia Giambartolomei
Department of Oncologic Pathology, Dana-Farber Cancer Institute, Boston, MA, USA
Anjali Sheahan & Leigh Ellis
Department of Urology, University of Washington, Seattle, WA, USA
Lisha Brown, Holly M. Nguyen, Colm Morrissey & Eva Corey
Divisions of Human Biology and Clinical Research, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Ilsa M. Coleman, Arja Kaipainen, Navonil De Sarkar & Peter S. Nelson
Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
Keegan Korthauer
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Keegan Korthauer
Department of Pathology, Brigham & Women’s Hospital and Harvard Medical School, Boston, MA, USA
Leigh Ellis
Vancouver Prostate Centre, Vancouver, BC, Canada
Amina Zoubeidi
Department of Urologic Sciences, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
Amina Zoubeidi

Authors

Sylvan C. Baca
View author publications
You can also search for this author in PubMed Google Scholar
David Y. Takeda
View author publications
You can also search for this author in PubMed Google Scholar
Ji-Heui Seo
View author publications
You can also search for this author in PubMed Google Scholar
Justin Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Sheng Yu Ku
View author publications
You can also search for this author in PubMed Google Scholar
Rand Arafeh
View author publications
You can also search for this author in PubMed Google Scholar
Taylor Arnoff
View author publications
You can also search for this author in PubMed Google Scholar
Supreet Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Connor Bell
View author publications
You can also search for this author in PubMed Google Scholar
Edward O’Connor
View author publications
You can also search for this author in PubMed Google Scholar
Xintao Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Abou Alaiwi
View author publications
You can also search for this author in PubMed Google Scholar
Rosario I. Corona
View author publications
You can also search for this author in PubMed Google Scholar
Marcos A. S. Fonseca
View author publications
You can also search for this author in PubMed Google Scholar
Claudia Giambartolomei
View author publications
You can also search for this author in PubMed Google Scholar
Paloma Cejas
View author publications
You can also search for this author in PubMed Google Scholar
Klothilda Lim
View author publications
You can also search for this author in PubMed Google Scholar
Monica He
View author publications
You can also search for this author in PubMed Google Scholar
Anjali Sheahan
View author publications
You can also search for this author in PubMed Google Scholar
Amin Nassar
View author publications
You can also search for this author in PubMed Google Scholar
Jacob E. Berchuck
View author publications
You can also search for this author in PubMed Google Scholar
Lisha Brown
View author publications
You can also search for this author in PubMed Google Scholar
Holly M. Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Ilsa M. Coleman
View author publications
You can also search for this author in PubMed Google Scholar
Arja Kaipainen
View author publications
You can also search for this author in PubMed Google Scholar
Navonil De Sarkar
View author publications
You can also search for this author in PubMed Google Scholar
Peter S. Nelson
View author publications
You can also search for this author in PubMed Google Scholar
Colm Morrissey
View author publications
You can also search for this author in PubMed Google Scholar
Keegan Korthauer
View author publications
You can also search for this author in PubMed Google Scholar
Mark M. Pomerantz
View author publications
You can also search for this author in PubMed Google Scholar
Leigh Ellis
View author publications
You can also search for this author in PubMed Google Scholar
Bogdan Pasaniuc
View author publications
You can also search for this author in PubMed Google Scholar
Kate Lawrenson
View author publications
You can also search for this author in PubMed Google Scholar
Kathleen Kelly
View author publications
You can also search for this author in PubMed Google Scholar
Amina Zoubeidi
View author publications
You can also search for this author in PubMed Google Scholar
William C. Hahn
View author publications
You can also search for this author in PubMed Google Scholar
Himisha Beltran
View author publications
You can also search for this author in PubMed Google Scholar
Henry W. Long
View author publications
You can also search for this author in PubMed Google Scholar
Myles Brown
View author publications
You can also search for this author in PubMed Google Scholar
Eva Corey
View author publications
You can also search for this author in PubMed Google Scholar
Matthew L. Freedman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.C.B. analyzed ChIP-seq data and wrote the manuscript. X.Q. assisted with ChIP-seq data analysis. D.Y.T., J.H. and T.A. performed ASCL1/NKX2-1 overexpression experiments. S.Y.K. performed FOXA1 siRNA experiments under supervision of H.B. J.H., T.A., R.A. and S.A. performed FOXA1 shRNA and CRISPR experiments under the supervision of W.C.H. and K. Kelly. E.O., C.B. and S.A.A. performed ChIP-seq experiments. J.-H.S. performed HiChIP experiments. C.G. and B.P analyzed HiChIP data. R.I.C. and M.A.S.F. performed core regulatory analysis under supervision of K.L. P.C. and K.L. performed ATAC-seq under supervision of H.W.L. and M.B. M.H. and A.N. assisted with procurement of clinical samples. J.E.B. and K. Korthauer assisted with analysis of WGBS methylation data. L.B. performed immunohistochemistry experiments. I.M.C. and A.K. performed RNA-seq under the supervision of P.S.N. H.M.N., C.M. and E.C. provided LuCaP PDXs. A.S. assisted with analysis of tumor RNA-seq data under the supervision of L.E. A.Z. provided cell lines. N.D.S performed gene sequencing of LuCaPs. E.C., M.M.P. and M.L.F. supervised the project.

Corresponding author

Correspondence to Matthew L. Freedman.

Ethics declarations

Competing interests

W.C.H. is a consultant for Thermo Fisher, Solasta Ventures, iTeos, Frontier Medicines, Tyra Biosciences, MPM Capital, KSQ Therapeutics, and Paraxel and is a founder of KSQ Therapeutics. All other authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1–4

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Baca, S.C., Takeda, D.Y., Seo, JH. et al. Reprogramming of the FOXA1 cistrome in treatment-emergent neuroendocrine prostate cancer. Nat Commun 12, 1979 (2021). https://doi.org/10.1038/s41467-021-22139-7

Download citation

Received: 30 September 2020
Accepted: 18 February 2021
Published: 30 March 2021
DOI: https://doi.org/10.1038/s41467-021-22139-7

This article is cited by

Tumor microenvironment deconvolution identifies cell-type-independent aberrant DNA methylation and gene expression in prostate cancer
- Samuel R. Reynolds
- Ze Zhang
- Brock C. Christensen
Clinical Epigenetics (2024)
Decoding the basis of histological variation in human cancer
- Masayuki Fujii
- Shigeki Sekine
- Toshiro Sato
Nature Reviews Cancer (2024)
FOXA1 and FOXA2: the regulatory mechanisms and therapeutic implications in cancer
- Na Liu
- Anran Wang
- Minbin Chen
Cell Death Discovery (2024)
The PENGUIN approach to reconstruct protein interactions at enhancer-promoter regions and its application to prostate cancer
- Alexandros Armaos
- François Serra
- Gian Gaetano Tartaglia
Nature Communications (2023)
Transcription factor NKX2–1 drives serine and glycine synthesis addiction in cancer
- Elien Heylen
- Paulien Verstraete
- Kim R. Kampen
British Journal of Cancer (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.