Abstract
Recent advancements in single-cell RNA sequencing (scRNAseq) have facilitated the discovery of previously unrecognized subtypes within prostate cancer (PCa), offering new insights into cancer heterogeneity and progression. In this study, we integrated scRNAseq data from multiple studies, comprising publicly available cohorts and data generated by our research team, and established the Human Prostate Single cell Atlas (HuPSA) and Mouse Prostate Single cell Atlas (MoPSA) datasets. Through comprehensive analysis, we identified two novel double-negative PCa populations: KRT7 cells characterized by elevated KRT7 expression and progenitor-like cells marked by SOX2 and FOXA2 expression, distinct from NEPCa, and displaying stem/progenitor features. Furthermore, HuPSA-based deconvolution re-classified human PCa specimens, validating the presence of these novel subtypes. We then developed a user-friendly web application, “HuPSA–MoPSA” (https://pcatools.shinyapps.io/HuPSA-MoPSA/), for visualizing gene expression across all newly established datasets. Our study provides comprehensive tools for PCa research and uncovers novel cancer subtypes that can inform clinical diagnosis and treatment strategies.
Similar content being viewed by others
Introduction
Prostate cancer (PCa) poses a substantial public health burden in the United States. While new AR signaling inhibitors have significantly improved patient survival, these treatments eventually fail, leading to the development of castration-resistant PCa (CRPCa)1. About 50% of CRPCa cases retain the characteristics of prostate adenocarcinoma (AdPCa), expressing markers associated with the luminal epithelia of the prostate, such as androgen receptor (AR), AR targets, and HOXB132,3. Approximately 30% of CRPCa present an alternative phenotype known as neuroendocrine prostate cancer (NEPCa). NEPCa cells express minimal levels of AR and its downstream genes while expressing high levels of NEPCa markers such as CHGA and NCAM14. In addition to the aforementioned phenotypes, there is a less studied subtype known as double-negative prostate cancer (DNPCa), which is characterized by the absence of AR signaling and NEPCa features5,6.
Lineage plasticity is implicated in the transition from AdPCa to NEPCa and DNPCa. This phenomenon empowers tumor cells to undergo alterations in their morphology and functions to adapt to challenging environmental conditions, allowing them to transit into new cellular lineages7. Key molecular players implicated in this process include the reduced expression of the prostate lineage marker HOXB133 and the aberrant expression of several genes, including SOX2, FOXA2, SRRM4, ONECUT2, BRN2, and PEG106,7,8,9,10,11,12. Additionally, the activation of the JAK/STAT signaling pathway has been identified as a pivotal contributor to lineage plasticity in PCa progression6,7,8,9,10,11,12.
Single-cell RNA sequencing (scRNAseq) has emerged as a powerful tool for dissecting tumor heterogeneity by analyzing the individual transcriptomes of thousands of cells within a single sample. This technology is crucial for identifying rare cell populations within heterogeneous samples. Additionally, scRNAseq aids in tracing lineage and developmental relationships in contexts like embryonic development and cancer progression13. Previous scRNAseq studies have identified various cell populations within human and mouse prostatic tissues, including AdPCa, NEPCa, luminal and basal epithelia, stromal, and immune cells6,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28. However, limitations such as sample size and model-specific populations persist.
To address these limitations, we integrated scRNAseq data from multiple studies and established the Human Prostate Single-cell Atlas (HuPSA) and the Mouse Prostate Single-cell Atlas (MoPSA). Leveraging these comprehensive datasets, we aimed to identify and characterize known and novel cell populations within prostatic tissues, shedding light on PCa heterogeneity and progression.
Results
Establishment of HuPSA and MoPSA
Using scRNAseq data derived from multiple studies, we constructed two combined datasets, the Human Prostate Single cell Atlas (HuPSA) and the Mouse Prostate Single cell Atlas (MoPSA). The HuPSA dataset compiled single-cell RNA sequencing data of 74 distinct samples from 6 studies, consisting of 368,831 high-quality cells. These samples represent eight histology groups, including normal, normal adjacent, benign, AdPCa, CSPCa (castration sensitive PCa), CRPCa, mCRPCa (metastatic CRPCa), and Cribriform PCa. For the establishment of MoPSA, we generated scRNAseq data from a TRAMP tumor, a widely utilized mouse model for NEPCa research, and combined it with publicly available mouse scRNAseq data. The compiled MoPSA dataset consisted of 395,917 high-quality cells originating from 138 samples obtained from 13 studies. These murine prostatic samples were classified into six groups: wild-type (WT), genetically engineered mice (GEM), control (GEM control mice that express tamoxifen-induced YFP in the prostate), castration, regeneration (androgen-induced prostate regeneration), and development (developing prostate).
Analysis and annotation of cell clusters
UMAP dimensional reduction and unsupervised clustering analysis of the scRNAseq data revealed discernible cell clusters, including luminal, basal/basal-like, progenitor-like (in HuPSA only), NEPCa, and non-epithelial clusters (Fig. 1A and B). These clusters were further annotated into distinct populations, primarily based on the expression of marker genes, with considerations of sample histology and inferred copy number variation (iCNV) patterns. This results in 26 and 21 populations identified within the HuPSA and MoPSA datasets, respectively (Fig. 1C and Supplementary Fig. 1). For a detailed description of each cell population, refer to Supplementary Note 1. Sankey diagrams were used to visualize the data flow between these cell populations and the histology of samples (Supplementary Fig. 1A and B), highlighting the widespread distribution of different cell types across diverse histological groups and sample sources.
The luminal clusters were characterized by the expression of prostatic luminal epithelial markers FOXA1, HOXB13, and NKX3-1 (Fig. 1C). It was present in both HuPSA and MoPSA, comprising multiple populations that expressed AR at various levels (Fig. 1A, B and Supplementary Fig. 2).
In HuPSA, the basal/basal-like clusters comprised three populations (Fig. 1A, C, Supplementary Fig. 3): basal (KRT5+, KRT14+, TP63+), club (MMP7+, WFDC2+), and KRT7 (KRT7+, S100A9+). Despite being clustered together with basal epithelia, the club and KRT7 populations showed minimal expression of canonical basal markers (Fig. 1C). The KRT7 population exhibited heightened expression of KRT7 and several S100 family genes. While KRT7 expression was also detected in three other populations, namely club, basal, and NEPCa, its levels were comparatively lower in these populations (Fig. 1C). Additionally, their transcriptional profiles were distinct from that of the KRT7 population, as demonstrated in the UMAP (Fig. 1A). In MoPSA, the basal and Krt7 populations were observed, while the club group was absent (Fig. 1B). The Krt7 population expressed Wfdc2 but not Mmp7, markers of club cells (Supplementary Fig. 3A), suggesting that murine Krt7 population share some molecular features with human club cells.
NEPCa cells from both HuPSA and MoPSA expressed NEPCa markers INSM1 and NCAM1 (Fig. 1C). While mouse NEPCa cells exhibited relative uniformity, human NEPCa cells could be further divided into distinct sub-populations based on the expression of marker genes or enriched pathways, including CHGA, G2M, TOX, STAT3, and NEUROD1 (Supplementary Fig. 4A–C). It is noteworthy that the expression of marker genes was not mutually exclusive among these groups. In contrast, the expression of ASCL1, a driver transcription factor of NEPCa29, was detected in all the NEPCa subpopulations except NEUROD1. By examining the feature plots of NEUROD1 and ASCL1 in HuPSA, we observed a mutually exclusive expression pattern between NEUROD1 and ASCL1 in NEPCa (Supplementary Fig. 4D). This is consistent with the finding of a recent study30.
In addition to epithelial populations, both HuPSA and MoPSA captured stromal, endothelial, and immune cells as well (Fig. 1A and B). The stromal cells can be further separated into three populations, fibroblast, smooth muscle, and pericyte, for both species. Notably, a large population of T cells was detected in HuPSA, predominantly contributed by normal adjacent and AdPCa samples (Fig. 1A and Supplementary Fig. 1A), consistent with the observation that advanced PCa is an immunologically “cold” tumor.
Identification of two novel double-negative PCa populations: KRT7 and progenitor-like
Our analysis unveiled two novel double negative (DN)PCa populations, namely KRT7, and progenitor-like.
The KRT7 population comprised 1937 cells, 1828 of them (94%) originating from mCRPCa samples. Additionally, this population displayed elevated levels of iCNVs compared to normal cells or early-stage AdPCa (Supplementary Fig. 1C). These features strongly suggest that the KRT7 population predominantly comprises cancerous cells rather than nonmalignant basal epithelial cells.
Moreover, the KRT7 population exhibited minimal expression of NEPCa markers and low AR/AR-signaling activity (Fig. 2A), indicating its classification as a DNPCa subtype. Furthermore, it expressed minimal levels of prostate lineage and differentiation markers, such as FOXA1, HOXB13, and NKX3-1 (Figs. 1C and 2A), indicating a poorly differentiated state.
The progenitor-like population, positioned between the AdPCa and NEPCa populations on UMAP, was exclusively identified in HuPSA (Fig. 1A). Primarily originated from mCRPCa samples, cells within this population exhibited high iCNVs, consisting of their late-stage PCa status (Supplementary Fig. 1C). Termed “progenitor-like”, this population displayed high expression of SOX2 and FOXA2 (Fig. 2A), typically expressed in embryonic prostate cells rather than in differentiated prostate luminal epithelial or AdPCa cells31,32. Notably, NEPCa cells also expressed SOX2 and FOXA231,32 (Fig. 2A). But unlike NEPCa, the progenitor-like cells showed minimal expression of NEPCa marker genes CHGA and NCAM1 (Fig. 2A). Similar to KRT7, progenitor-like cells exhibited minimal expression of AR, AR targets and luminal epithelial lineage genes, supporting a stem/progenitor-like phenotype (Figs. 1C and 2A).
KRT7 and progenitor-like PCa exist in human specimens
As the discovery of novel KRT7 and progenitor-like DNPCa subtypes emerged from HuPSA analysis, it became imperative to validate their presence in an external collection. Therefore, we compiled publicly available bulk RNAseq data that were derived from 877 human prostatic specimens33 and established the ProAtlas (Prostate Atlas) dataset (Fig. 3A).
Using the transcriptome profiles of individual populations in HuPSA, we conducted a covariance-based deconvolution analysis on the ProAtlas collection. This analysis assessed the composition of various cell populations within the samples, including luminal, KRT7, basal, club, progenitor-like, NEPCa, stromal (fibroblast, smooth muscle, pericyte), and immune cells. The molecular subtypes were assigned to each sample based on its dominant populational composition.
As shown in Fig. 3B, most specimens can be re-classified into distinct subtypes, including luminal (n = 570), KRT7 (n = 9), progenitor-like (n = 7), and NEPCa (n = 43). The luminal group comprised both nonmalignant prostate and AdPCa tumors. Mixed groups and unclassified samples were also identified. Samples dominated by stromal cells contained minimal cancer cells. The reclassified samples were collected to construct the ProAtlas dataset.
Principal component analysis (PCA) of ProAtlas revealed clustering of samples of the same subtypes, indicating similar transcriptome profiles within each subtype (Fig. 3C). Notably, KRT7, progenitor-like, and NEPCa samples were distinctly separated from nonmalignant and AdPCa samples of various Gleason Scores, highlighting significant transcriptome differences in these aggressive PCa. This underscores the effectiveness of HuPSA deconvolution-based reclassification in accurately classifying PCa samples.
The analysis of ProAtlas data revealed distinctive features of KRT7 and progenitor-like PCa subtypes, characterized by low NE score, diminished AR score, and reduced prostatic HOX code (Fig. 3B). These findings suggest that both KRT7 and progenitor-like PCa subtypes represent de-differentiated double-negative prostate cancer, signifying a departure from the typical cellular characteristics associated with prostate adenocarcinoma.
Additionally, deconvolution analysis led to the identification of rare KRT7 and progenitor-like PCa samples in public study cohorts (Fig. 4A). Through collaboration, corresponding archived tissues from the GSE126078 cohort were obtained. All retrieved samples were initially diagnosed as DNPCa34. IHC staining was conducted to evaluate the expression of KRT7, FOXA2, and SOX2, the corresponding markers of the KRT7 and progenitor-like subtypes.
Remarkably, all seven DNPCa samples examined exhibited positive KRT7 staining in cancer cells (Fig. 4C–I). In contrast, KRT7 expression was absent in the luminal epithelia of benign prostatic tissues (Fig. 4B) and low-grade AdPCa specimens (data not shown, n > 40).
Additionally, LuCaP-173.2 patient-derived DNPCa xenografts and three DNPCa metastases from the GSE126078 cohort were subjected to IHC evaluation of SOX2 and FOXA2 expression. Two of the three DNPCa metastases examined were positive for FOXA2 (Fig. 4J and K), and the expression of SOX2 and FOXA2 was detected in LuCaP 173.2 early-passage and late-passage tumors, respectively (Fig. 4L and M).
Notably, from DNPCa-2 patients, three samples were obtained, including vertebral, liver, and lung metastases. From HuPSA-based deconvolution, the vertebral metastasis was reclassified as the KRT7 subtype, while the liver and lung metastases were reclassified as the progenitor-like subtype. This classification was validated by the positive IHC staining of subtype markers KRT7 and FOXA2 (Fig. 4E, J, and K), providing additional validation for the accuracy of HuPSA deconvolution-based reclassification.
Collectively, the positive expression of KRT7, FOXA2, and SOX2 in DNPCa specimens serves as compelling evidence for the existence of these two novel subtypes in advanced PCa. This validation underscores the accuracy of the clustering approach employed in the construction of the HuPSA dataset, affirming its utility in delineating distinct PCa subtypes based on their molecular profiles.
Establishment of a publicly available web application for the visualization of gene expression in different subtypes of PCa
We previously introduced the PCTA web app, allowing researchers to visualize gene expression in commonly used cancer cell lines (https://pcatools.shinyapps.io/PCTA_app/)35. The positive reception highlighted a demand for such resources. In response, we developed the HuPSA-MoPSA web app, accessible at https://pcatools.shinyapps.io/HuPSA-MoPSA/. This platform enables users to visualize gene expression at a single cell level by inputting official gene symbols. The app provides feature plots, violin plots, and box plots, allowing simultaneous access to data from all three datasets established in this study, HuPSA, MoPSA, and ProAtlas.
As illustrated in Fig. 5, users can input gene symbols to explore gene expression patterns across different datasets. High-resolution feature plots and violin plots are generated for HuPSA and MoPSA, while box plots are generated for ProAtlas, facilitating enhanced data interpretation and quantification.
Discussion
Advanced technologies have revolutionized our ability to understand pathological processes. RNA sequencing, for instance, provides unbiased and high-throughput gene expression profiling data. Single-cell RNA sequencing technology enables researchers to investigate transcriptome at single-cell resolution, addressing old questions in cancer research and tackling new challenges such as phenotypic plasticity and non-mutational epigenetic reprogramming36. The integrated scRNAseq datasets, HuPSA and MoPSA, allow for the detection of various cell populations and visualization of cancer cells of different phenotypes on dimensional reduction maps. By analyzing differentially expressed genes and enriched signaling pathways, these atlases offer insights into the driver mechanisms of phenotypic plasticity and identify potential druggable targets.
In this study, we identified two novel DNPCa cell populations, progenitor-like and the KRT7 populations. Leveraging the compilation of multiple scRNAseq datasets, the establishment of HuPSA enabled the identification of these rare populations.
Progenitor-like cells, reminiscent of cancer stem cells, may contribute to tumor recurrence and therapy resistance. Characterized by the expression of SOX2 and FOXA2, genes also expressed in embryonic prostate and NEPCa cells, these cells represent a distinct PCa subtype. Our analysis revealed signature genes and enriched signaling pathways in this population, providing potential markers for defining this subtype. Previous studies have identified SOX2+ and FOXA2+ non-NEPCa cases through IHC staining, which likely represent the progenitor-like subtype of PCa31,32.
The KRT7 population, although located closely with basal epithelial cells on the UMAPs in both HuPSA and MoPSA, exhibits distinct gene expression patterns. Despite resembling basal cells on dimensional reduction maps, they lack canonical markers such as KRT5, KRT14, or TP63, instead highly expressing KRT7. This underscores the importance of exploring non-traditional markers in PCa classification.
Indeed, the expression of KRT7 (Cytokeratin 7), a type of intermediate filament protein, is typically found in glandular and transitional epithelial cells such as those in cervical epithelia. While KRT7 expression has been detected in various types of cancer, its presence in PCa cells has not been observed until now. Elevated KRT7 expression has been associated with aggressive cancer phenotypes, including increased cell proliferation, migration, invasion, and epithelial-to-mesenchymal transition37,38,39. Moreover, in pancreatic cancer and gastric cancers, heightened KRT7 expression correlates with poor prognoses40,41. The expression of KRT7 in double-negative PCa cells implies a potential role for this protein in PCa progression.
Furthermore, utilizing HuPSA-based deconvolution analysis against bulk RNAseq data allowed for the re-classification of PCa samples. This approach identified rare samples of the newly defined progenitor-like and KRT7 subtypes, providing valuable insights into PCa heterogeneity.
Despite the advantages of integrated scRNAseq datasets, such as HuPSA and MoPSA, they come with limitations. The complexity of scRNAseq data, represented by multi-dimensional gene expression profiles, poses challenges in data visualization and interpretation. Dimensional reduction techniques may not fully capture transcriptome identities, and interpretation can be subjective. As a famous ancient Chinese poet, Su Shi, eloquently put it, “It’s a range viewed in face and peaks viewed from the side, assuming different shapes viewed from far and wide.” Objective data processing is crucial, but researchers should remain aware of potential biases in data interpretation.
Methods
Single-cell RNA sequencing on TRAMP tumors
TRAMP tumor tissues were harvested and dissociated using “Tumor Dissociation Kit, mouse, #130-096-730” from Miltenyi Biotec. The dead cells were removed using “Dead Cell Removal Kit, #130-090-101” from Miltenyi Biotec. The red blood cells were removed using “Red Blood Cell Lysis Solution, # 130-094-183” from Miltenyi Biotec. The cDNA libraries were constructed using Chromium Next GEM Single Cell 3’ Reagent Kit v3.1. The resulting cDNA was fragmented, processed, and size selected. Adaptors were ligated, and dual indexes were attached via PCR. The libraries were cleaned up and assessed for quality with the Agilent TapeStation D5000 High Sensitivity Kit. The libraries were quantitated with qPCR (NEBNext Library Quant Kit for Illumina, Bio-rad CFX96 Touch Real-Time PCR), normalized to 1.5 pM, and pooled. The library pool was denatured and diluted to approximately 300 pM. A library of 1.5 pM PhiX was spiked in as an internal control. Paired-end sequencing was performed on an Illumina NovaSeq 6000.
Public scRNAseq collection and alignment
The human and mouse scRNAseq data were collected from Gene Expression Omnibus (GEO). Only 10X genomic-based single-cell studies were considered. Raw data from six human studies in the SRA database were used to build HuPSA, including GSE2103586, GSE13782914, GSE19333715, GSE206962, GSE18129416, GSE18534417. Data of 12 mouse studies were used to build MoPSA, including GSE14681142, GSE16496918, GSE20195643, GSE21615819, GSE22175520, GSE17133621, GSE16574122, GSE17447123, GSE2103586, GSE16331624, GSE18930725, GSE15846826, GSE15370127 (Supplementary Data 1). All FASTQ files were downloaded using SRA-Toolkit fasterq-dump function and aligned to human (GRCh38 from NCBI) or mouse (GRCm38 from Ensembl) genome using Alevin-fry44. The nf-core/scrnaseq pipeline was used for software environment, paralleling, and quantity control. Linux ubuntu operating system was used for data analysis.
scRNAseq data integration and processing
Single-cell read count data were loaded into R environment using “loadFry” function from Fishpond package45. “Seurat V5”46 was used for quality control, data integration, UMAP dimensional reduction, clustering, and annotation. Quality control was performed based on the total RNA count and feature count for each cell. Cells with extremely low or high RNA/feature counts were excluded to ensure the removal of low-quality cells and doublets. The RPCA model was used for data integration. In the integration process, each study (instead of the sample) was treated as one “batch” to prevent over-integration (depicted in Supplementary Fig. 5). After the UMAP dimensional reduction and clustering, the clusters were then labeled with population names based on the marker genes expression and original histology information. The annotated human lung V2 (HLCA) dataset from “Azimuth”47 was used as a reference for the manual annotation of immune and stromal populations. The R package “SCpubr”48 was used for data visualization. Dim plots were utilized to visualize the location on the UMAP of each cell population in HuPSA and MoPSA. Feature plots were employed to visualize the gene expression on UMAPs. Geyser plots were used to compare gene expressions among the different cell populations. In the Geyser plot, the dots along the y-axis represent the distribution of gene expression levels in single cells. The black balls indicate the median values of each cell population. The ends of thick line and thin line indicate 66% and 95% of the data points within each group, respectively. Sankey plots were used to depict the data “flow” among populations, histology, and study groups. The single-cell copy number alterations were calculated and visualized using the “InferCNV” package49. Pathway activity was calculated using “decoupleR”50. Transcription factor activity was calculated using “dorothea”50. The “CSCDRNA” package33 was used for HuPSA-based bulk RNAseq deconvolution. The “CellChat” package51 was used to predict ligand-receptor pairs in HuPSA and MoPSA. All detailed parameters were described in the R scripts deposited to GitHub.
Bulk RNAseq data collection and processing
Bulk RNAseq samples were selected based on the following criteria: duplicated reads <60%, total reads >20 million, and STAR unique mapping >70%. However, the TCGA samples were not filtered because these specific metrics were not available. Eventually, 877 high-quality samples comprising normal, BPH, tumor, primary, AdPCa, CRPCa, mCRPCa, NEPCa, organoid, PDX (patient-derived xenograft) histology groups were selected and used for HuPSA-based deconvolution analysis. The FASTQ files were collected from multiple cohorts including GSE12079552, GSE12607834, GSE15628930, GSE114740, GSE217260, GSE17932153, GSE11278654, GSE10413155, GSE3152856, GSE5446057, GSE2226058, GSE21649059; dbGaP accessions: phs000909.v1.p160, phs000915.v2.p22. The TCGA_PRAD read counts and TPM data were acquired using “recount3”61 R package. The sequencing reads were aligned to the human GRCh38 genome (from NCBI) using STAR62 and Salmon63. The nf-core/rnaseq pipeline was used for paralleling and quantity control.
Human prostate specimens
De-identified human prostate tissues (archived) were obtained from the LSU Health-Shreveport Biorepository Core, Overton Brooks VA Medical Center, and Genitourinary Cancer Specimen Biorepository at the University of Washington. All the tissues were used in accordance with the protocols approved by the Institutional Review Board of LSU Health-Shreveport.
Immunohistochemical, immunofluorescence staining and Western blot
Primary antibodies used in immunostaining include KRT7 (abclonal, A4357), MMP7 (abclonal, A0695), CK5 (BioLegend, 905904), CK8 (BioLegend, 904801), SOX2 (Cell signaling, 3728), FOXA2 (Abcam, 108422). IHC staining was performed using a Vectastain elite ABC peroxidase kit (Vector Laboratories, Burlingame, CA). The tissue sections were counterstained, mounted, and imaged with a Zeiss microscope (White Plains, NY).
For Western blot, cells were collected in PBS and lysed in a passive lysis buffer (Promega, Madison, WI). Equal amounts of protein were loaded for Western blot analyses. ProSignal Dura ECL Reagent (Genesee Scientific, San Diego, CA) and Chemidoc (Bio-Rad, Hercules, CA) were used to visualize the protein bands.
Data availability
The sequencing data generated by this study can be accessed through GEO (https://www.ncbi.nlm.nih.gov/geo/) with accession number: GSE246155. All public data listed in the “Methods” section can be accessed through SRA (https://www.ncbi.nlm.nih.gov/sra) and dbGAP (https://www.ncbi.nlm.nih.gov/gap/). The accession numbers were listed in Supplementary Data 1.
Code availability
All codes were deposited to https://github.com/schoo7/hupsa.
References
Debes, J. D. & Tindall, D. J. Mechanisms of androgen-refractory prostate cancer. N. Engl. J. Med. 351, 1488–1490 (2004).
Robinson, D. et al. Integrative clinical genomics of advanced prostate cancer. Cell 161, 1215–1228 (2015).
Cheng, S. et al. Neuroendocrine prostate cancer has distinctive, non-prostatic HOX code that is represented by the loss of HOXB13 expression. Sci. Rep. 11, 1–11 (2021).
Beltran, H. et al. Aggressive variants of castration-resistant prostate cancer. Clin. Cancer Res. 20, 2846–2850 (2014).
Bluemn, E. G. et al. Androgen receptor pathway-independent prostate cancer is sustained through FGF signaling. Cancer Cell 32, 474–489 (2017).
Chan, J. M. et al. Lineage plasticity in prostate cancer depends on JAK/STAT inflammatory signaling. Science 377, 1180–1191 (2022).
Ferguson, A. M. & Rubin, M. A. Lineage plasticity in prostate cancer: Looking beyond intrinsic alterations. Cancer Lett. 548, 215901 (2022).
Mu, P. et al. SOX2 promotes lineage plasticity and antiandrogen resistance in TP53- and RB1-deficient prostate cancer. Science 355, 84–88 (2017).
Zhang, X. et al. SRRM4 expression and the loss of REST activity may promote the emergence of the neuroendocrine phenotype in castration-resistant prostate cancer. Clin. Cancer Res. 21, 4698–4708 (2015).
Rotinen, M. et al. ONECUT2 is a targetable master regulator of lethal prostate cancer that suppresses the androgen axis. Nat. Med. 24, 1887–1898 (2018).
Bishop, J. L. et al. The master neural transcription factor BRN2 is an androgen receptor-suppressed driver of neuroendocrine differentiation in prostate cancer. Cancer Discov. 7, 54–71 (2017).
Akamatsu, S. et al. The placental gene PEG10 promotes progression of neuroendocrine prostate cancer. Cell Rep. 12, 922–936 (2015).
Haque, A. et al. A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications. Genome med. 9, 1–12 (2017).
Dong, B. et al. Single-cell analysis supports a luminal-neuroendocrine transdifferentiation in human prostate cancer. Commun. Biol. 3, 778 (2020).
Heidegger, I. et al. Comprehensive characterization of the prostate tumor microenvironment identifies CXCR4/CXCL12 crosstalk as a novel antiangiogenic therapeutic target in prostate cancer. Mol. Cancer 21, 132 (2022).
Hirz, T. et al. Dissecting the immune suppressive human prostate tumor microenvironment via integrated single-cell and spatial transcriptomic analyses. Nat. Commun. 14, 663 (2023).
Wong, H. Y. et al. Single cell analysis of cribriform prostate cancer reveals cell intrinsic and tumor microenvironmental pathways of aggressive disease. Nat. Commun. 13, 6036 (2022).
Kim, W. K. et al. Aberrant androgen action in prostatic progenitor cells induces oncogenesis and tumor development through IGF1 and Wnt axes. Nat. Commun. 13, 1–18 (2022).
Terzic, J. et al. Hypoxia‐inducible factor 1A inhibition overcomes castration resistance of prostate tumors. EMBO Mol. Med. 15, e17209 (2023).
Zhu, Y. et al. Targeting the chromatin effector Pygo2 promotes cytotoxic T cell responses and overcomes immunotherapy resistance in prostate cancer. Sci. Immunol. 8, eade4656 (2023).
Germanos, A. A. et al. Defining cellular population dynamics at single-cell resolution during prostate cancer progression. Elife 11, e79076 (2022).
Graham, M. K. et al. Single‐cell atlas of epithelial and stromal cell heterogeneity by lobe and strain in the mouse prostate. Prostate 83, 286–303 (2023).
Hiroto, A. et al. Stromal androgen signaling acts as tumor niches to drive prostatic basal epithelial progenitor-initiated oncogenesis. Nat. Commun. 13, 6552 (2022).
Qiu, X. et al. MYC drives aggressive prostate cancer by disrupting transcriptional pause release at androgen receptor targets. Nat. Commun. 13, 2559 (2022).
Troiani, M. et al. Single-cell transcriptomics identifies Mcl-1 as a target for senolytic therapy in cancer. Nat. Commun. 13, 2177 (2022).
Brady, N. J. et al. Temporal evolution of cellular heterogeneity during the progression to advanced AR-negative prostate cancer. Nat. Commun. 12, 3372 (2021).
Lee, D.-H. et al. Androgen action in cell fate and communication during prostate development at single-cell resolution. Development 148, dev196048 (2021).
Song, H. et al. Single-cell analysis of human primary prostate cancer reveals the heterogeneity of tumor-associated epithelial cell states. Nat. Commun. 13, 141 (2022).
Nouruzi, S. et al. ASCL1 activates neuronal stem cell-like lineage programming through remodeling of the chromatin landscape in prostate cancer. Nat. Commun. 13, 2282 (2022).
Cejas, P. et al. Subtype heterogeneity and epigenetic convergence in neuroendocrine prostate cancer. Nat. Commun. 12, 5775 (2021).
Yu, X. et al. SOX2 expression in the developing, adult, as well as, diseased prostate. Prostate Cancer Prostatic Dis. 17, 301–309 (2014).
Connelly, Z. M. et al. FOXA2 promotes prostate cancer growth in the bone. Am. J. Transl. Res. 12, 5619 (2020).
Karimnezhad, A. More accurate estimation of cell composition in bulk expression through robust integration of single-cell information. Bioinform. Adv. 2, vbac049 (2022).
Labrecque, M. P. et al. Cabozantinib can block growth of neuroendocrine prostate cancer patient-derived xenografts by disrupting tumor vasculature. PLoS ONE 16, e0245602 (2021).
Cheng, S., Li, L. & Yu, X. PCTA, a pan-cancer cell line transcriptome atlas. Cancer Lett. 588, 216808 (2024).
Hanahan, D. Hallmarks of cancer: new dimensions. Cancer Discov. 12, 31–46 (2022).
An, Q. et al. KRT7 promotes epithelial–mesenchymal transition in ovarian cancer via the TGF‑β/Smad2/3 signaling pathway. Oncol. Rep. 45, 481–492 (2021).
Zhang, Z. et al. FoxM1 promotes the migration of ovarian cancer cell through KRT5 and KRT7. Gene 757, 144947 (2020).
Donato, R. et al. Functions of S100 proteins. Curr. Mol. Med. 13, 24–57 (2013).
Song, J. et al. Construction of a novel model based on cell-in-cell-related genes and validation of KRT7 as a biomarker for predicting survival and immune microenvironment in pancreatic cancer. BMC Cancer 22, 1–24 (2022).
Yang, J. Identification of novel biomarkers, MUC5AC, MUC1, KRT7, GAPDH, CD44 for gastric cancer. Med. Oncol. 37, 34 (2020).
Karthaus, W. R. et al. Regenerative potential of prostate luminal cells revealed by single-cell analysis. Science 368, 497–505 (2020).
Li, D. et al. ETV4 mediates dosage-dependent prostate tumor initiation and cooperates with p53 loss to generate prostate cancer. Sci. Adv. 9, eadc9446 (2023).
He, D. et al. Alevin-fry unlocks rapid, accurate and memory-frugal quantification of single-cell RNA-seq data. Nat. Methods 19, 316–322 (2022).
Zhu, A., Srivastava, A., Ibrahim, J. G., Patro, R. & Love, M. I. Nonparametric expression analysis using inferential replicate counts. Nucleic Acids Res. 47, e105–e105 (2019).
Hao, Y. et al. Dictionary learning for integrative, multimodal and scalable single-cell analysis. Nat. Biotechnol. 42, 293–304 (2023).
Bakken, T. E. et al. Comparative cellular analysis of motor cortex in human, marmoset and mouse. Nature 598, 111–119 (2021).
Blanco-Carmona, E. Generating publication ready visualizations for single cell transcriptomics using SCpubr. Preprint at bioRxiv 2022-02 (2022).
Puram, S. V. et al. Single-cell transcriptomic analysis of primary and metastatic tumor ecosystems in head and neck cancer. Cell 171, 1611–1624 (2017).
Badia-i-Mompel, P. et al. decoupleR: ensemble of computational methods to infer biological activities from omics data. Bioinform. Adv. 2, vbac016 (2022).
Jin, S. et al. Inference and analysis of cell–cell communication using CellChat. Nat. Commun. 12, 1–20 (2021).
Suntsova, M. et al. Atlas of RNA sequencing profiles for normal human tissues. Sci. Data 6, 36 (2019).
Luo, G. et al. circITGA7 Acts as a miR-370-3p sponge to suppress the proliferation of prostate cancer. J. Oncol. 2021, 8060389 (2021).
Puca, L. et al. Patient derived organoids to model rare prostate cancer phenotypes. Nat. Commun. 9, 2404 (2018).
Teslow, E. A. et al. Exogenous IL‐6 induces mRNA splice variant MBD2_v2 to promote stemness in TP53 wild‐type, African American PCa cells. Mol. Oncol. 12, 1138–1152 (2018).
Sowalsky, A. G. et al. Whole transcriptome sequencing reveals extensive unspliced mRNA in metastatic castration-resistant prostate cancer. Mol. Cancer Res. 13, 98–106 (2015).
Long, Q. et al. Global transcriptome analysis of formalin-fixed prostate cancer specimens identifies biomarkers of disease recurrence. Cancer Res. 74, 3228–3237 (2014).
Kannan, K. et al. Recurrent chimeric RNAs enriched in human prostate cancer identified by deep sequencing. Proc. Natl. Acad. Sci. USA 108, 9172–9177 (2011).
Acosta‐Vega, N. L. et al. Metabolic pathways enriched according to ERG status are associated with biochemical recurrence in Hispanic/Latino patients with prostate cancer. Cancer Med. 12, 4306–4320 (2023).
Beltran, H. et al. Molecular characterization of neuroendocrine prostate cancer and identification of new drug targets. Cancer Discov. 1, 487–495 (2011).
Wilks, C. et al. recount3: summaries and queries for large-scale RNA-seq expression and splicing. Genome Biol. 22, 1–40 (2021).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A. & Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 14, 417–419 (2017).
Beltran, H. et al. Divergent clonal evolution of castration-resistant neuroendocrine prostate cancer. Nat. Med. 22, 298 (2016).
Acknowledgements
This research was supported by NIH R01 CA226285, LSU Collaborative Cancer Research Initiative, LSU Health Shreveport Office of Research, and LSU Health Shreveport FWCC Stimulus grants to X. Yu, LSU Health Shreveport FWCC Carroll Feist postdoctoral Fellowship to S. Cheng, and LSU Health Shreveport FWCC Carroll Feist predoctoral Fellowship to L. Li. RNA sequencing data were downloaded from dbGaP through dbGaP accession numbers phs000909 and phs0009152,64. The data were generated under the support of NHGRI grant # U54 HG003067 to E. Lander at Broad Institute and SU2C/PCF Prostate Dream Team Translational Cancer Research Grant.
Author information
Authors and Affiliations
Contributions
Conceptualization, investigation, methodology: S.C. and X.Y.; Data curation: S.C. and L.L.; Software, visualization, writing—original draft: S.C.; Formal analysis: Y.Y. and Y.S.; Resources, funding acquisition, supervision: X.Y.; Writing—review and editing: Q.G., O.F., E.C., and X.Y.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cheng, S., Li, L., Yeh, Y. et al. Unveiling novel double-negative prostate cancer subtypes through single-cell RNA sequencing analysis. npj Precis. Onc. 8, 171 (2024). https://doi.org/10.1038/s41698-024-00667-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41698-024-00667-x