Spatially restricted drivers and transitional cell populations cooperate with the microenvironment in untreated and chemo-resistant pancreatic cancer

Cui Zhou, Daniel; Jayasinghe, Reyka G.; Chen, Siqi; Herndon, John M.; Iglesia, Michael D.; Navale, Pooja; Wendl, Michael C.; Caravan, Wagma; Sato, Kazuhito; Storrs, Erik; Mo, Chia-Kuei; Liu, Jingxian; Southard-Smith, Austin N.; Wu, Yige; Naser Al Deen, Nataly; Baer, John M.; Fulton, Robert S.; Wyczalkowski, Matthew A.; Liu, Ruiyang; Fronick, Catrina C.; Fulton, Lucinda A.; Shinkle, Andrew; Thammavong, Lisa; Zhu, Houxiang; Sun, Hua; Wang, Liang-Bo; Li, Yize; Zuo, Chong; McMichael, Joshua F.; Davies, Sherri R.; Appelbaum, Elizabeth L.; Robbins, Keenan J.; Chasnoff, Sara E.; Yang, Xiaolu; Reeb, Ashley N.; Oh, Clara; Serasanambati, Mamatha; Lal, Preet; Varghese, Rajees; Mashl, Jay R.; Ponce, Jennifer; Terekhanova, Nadezhda V.; Yao, Lijun; Wang, Fang; Chen, Lijun; Schnaubelt, Michael; Lu, Rita Jui-Hsien; Schwarz, Julie K.; Puram, Sidharth V.; Kim, Albert H.; Song, Sheng-Kwei; Shoghi, Kooresh I.; Lau, Ken S.; Ju, Tao; Chen, Ken; Chatterjee, Deyali; Hawkins, William G.; Zhang, Hui; Achilefu, Samuel; Chheda, Milan G.; Oh, Stephen T.; Gillanders, William E.; Chen, Feng; DeNardo, David G.; Fields, Ryan C.; Ding, Li

doi:10.1038/s41588-022-01157-1

Download PDF

Article
Open access
Published: 22 August 2022

Spatially restricted drivers and transitional cell populations cooperate with the microenvironment in untreated and chemo-resistant pancreatic cancer

Nature Genetics volume 54, pages 1390–1405 (2022)Cite this article

42k Accesses
54 Citations
262 Altmetric
Metrics details

Subjects

Abstract

Pancreatic ductal adenocarcinoma is a lethal disease with limited treatment options and poor survival. We studied 83 spatial samples from 31 patients (11 treatment-naïve and 20 treated) using single-cell/nucleus RNA sequencing, bulk-proteogenomics, spatial transcriptomics and cellular imaging. Subpopulations of tumor cells exhibited signatures of proliferation, KRAS signaling, cell stress and epithelial-to-mesenchymal transition. Mapping mutations and copy number events distinguished tumor populations from normal and transitional cells, including acinar-to-ductal metaplasia and pancreatic intraepithelial neoplasia. Pathology-assisted deconvolution of spatial transcriptomic data identified tumor and transitional subpopulations with distinct histological features. We showed coordinated expression of TIGIT in exhausted and regulatory T cells and Nectin in tumor cells. Chemo-resistant samples contain a threefold enrichment of inflammatory cancer-associated fibroblasts that upregulate metallothioneins. Our study reveals a deeper understanding of the intricate substructure of pancreatic ductal adenocarcinoma tumors that could help improve therapy for patients with this disease.

Single-nucleus and spatial transcriptome profiling of pancreatic cancer identifies multicellular dynamics associated with neoadjuvant treatment

Article 28 July 2022

William L. Hwang, Karthik A. Jagadeesh, … Aviv Regev

Identification of spatially-resolved markers of malignant transformation in Intraductal Papillary Mucinous Neoplasms

Article Open access 29 March 2024

Antonio Agostini, Geny Piro, … Carmine Carbone

Single-cell RNA-seq highlights intra-tumoral heterogeneity and malignant progression in pancreatic ductal adenocarcinoma

Article 04 July 2019

Junya Peng, Bao-Fa Sun, … Wenming Wu

Main

Pancreatic ductal adenocarcinoma (PDAC) has an 11% 5-yr survival rate¹ due to late detection, early metastases and therapy resistance^2,3,4,5,6. First-line treatment is surgery followed by radiation and/or chemotherapy^7,8, with immunotherapy options being limited^9,10. Drivers such as KRAS, TP53, CDKN2A and SMAD4 (HUGO Gene Nomenclature Committee at the European Bioinformatics Institute, https://www.genenames.org/) have been identified¹¹ as have transcriptional subtypes of classical and basal-like^12,13.

Single-cell technologies enable analysis regardless of tumor content and facilitate dissection of the tumor microenvironment (TME), whose role in PDAC remains largely unknown. For instance, cancer-associated fibroblast (CAF) subtypes have been identified and cytotoxic natural killer (NK) and CD8⁺ T cells are often numerically and functionally impaired^14,15,16,17. This creates an immunosuppressed, pro-tumorigenic environment, but how this occurs is poorly understood^18,19. There is a growing appreciation surrounding acinar-to-ductal metaplasia (ADM), in which acinar cells start expressing ductal markers. Animal models posit acinar cells as the origin of PDAC when KRAS(G12D) is expressed^20,21,22,23, but this hypothesis is difficult to evaluate in humans due to the paucity of acinar and ADM cells sampled at single-cell resolution^{24,25,26,27,28}. Recent efforts have focused on acinar heterogeneity in chronic pancreatitis²⁹ and healthy human pancreas³⁰, but adequate sampling of ADM cells is still lacking.

As part of the Human Tumor Atlas Network consortium, we used a spatially distinct, multi-sampling approach to analyze 83 PDAC samples across 31 patients³¹. Samples are physically separate from one another, which allowed interrogating both inter- and intra-tumor heterogeneity via extensive omics, including bulk DNA and RNA sequencing (RNA-seq), bulk proteomics and phosphoproteomics, single-cell and single-nucleus RNA-seq (scRNA-seq and snRNA-seq, respectively), cellular imaging and spatial transcriptomics. We identified and validated transitional populations and their associated molecular signatures along the spectrum from normal pancreas to PDAC that were previously proposed in mouse models. We characterized differential impact of chemotherapy on the abundance and transcriptional programs of tumor and stroma populations using multi-omic approaches. We highlight the necessity of spatial sequencing for polyclonal/heterogeneous PDAC tumor characterization.

Results

Study design and overview of the study cohort

We collected 73 PDAC samples from 21 patients undergoing standard treatment, including four normal adjacent tissue samples. Treatment groups included seven treatment-naïve cases, eight neoadjuvant FOLFIRINOX (a treatment regimen comprising folic acid, 5-fluorouracil, irinotecan and oxaliplatin) cases, four neoadjuvant gemcitabine + nab-paclitaxel cases, one mixed (FOLFIRINOX and gemcitabine + nab-paclitaxel) and one chemoradiation (Chemo-RT) case (Supplementary Table 1). Each tumor was spatially sampled 2–4 times, with sample segments subsequently used to generate histologic, imaging and omics data; hematoxylin and eosin (H&E) slides; scRNA-seq; bulk mass spectrometry-based proteomics and phosphoproteomics; bulk whole-exome sequencing (WES); and bulk RNA-seq (Fig. 1a, Supplementary Table 2 and Methods). We generated scRNA-seq data for all 73 samples, WES for 64 samples and bulk RNA-seq for 65 samples. A subset (n = 30) underwent tandem mass tag (TMT) 11 proteomic and phosphoproteomic characterization (Fig. 1b). Following quality control, we clustered 232,764 cells across all samples based on expression profiles and assigned cell types based on marker gene expression (Fig. 1c, Extended Data Fig. 1a–c, Methods and Supplementary Note). Using the fraction of tumor cells as a proxy for tumor purity, estimates ranged from 0.10% to 82.69% across samples, with an average of 16.28%. H&E pathology review revealed that within-patient tumor content differences across samples averaged 24%, with a range of 5% to 64% (Extended Data Fig. 1d, Methods and Supplementary Note), consistent with tumor percentages from scRNA-seq (Pearson R = 0.40, P = 0.001). Principal component analysis (PCA) of bulk proteomic and phosphoproteomic data confirms that, while most within-tumor regions cluster closely, several specimens from the same tumor have substantial intra-tumor heterogeneity (Extended Data Fig. 1e,f).

**Fig. 1: Sampling strategy and cohort overview.**

We further generated snRNA-seq with matching spatial transcriptomics, RNA-seq and WES data for an additional 10 cases, bringing the total cohort to 31 patients, 83 sc/snRNA-seq samples and 15 spatial transcriptomics slides (Fig. 1d). The treatment groups included three treatment-naïve cases, four neoadjuvant FOLFIRINOX cases, one mixed (FOLFIRINOX and gemcitabine + nab-paclitaxel) and one Chemo-RT case. Following quality control, we assigned cell types to 83,860 nuclei based on marker gene expression and used the paired snRNA-seq to label spots in the spatial transcriptomics slides (Fig. 1d,e and Methods).

PDAC tumor subclusters with distinct cellular functions

Pathway enrichment analysis between case-level tumor subpopulations to dissect tumor heterogeneity (Fig. 2a and Methods) identified case-specific subpopulations enriched in pathways including cell proliferation, cell stress response, epithelial-to-mesenchymal transition and immune-related pathways that displayed spatial heterogeneity (Fig. 2a and Extended Data Fig. 1g). Actively proliferating clusters were present in most cases and were characterized by upregulation of genes belonging to the Molecular Signatures Database hallmark gene sets³² for E2F targets, G2M checkpoint, MYC targets and mitotic spindle. These clusters also exhibited increased oxidative phosphorylation, in line with previous reports³³ (Fig. 2a–c). It was common (15 of 21 cases) for tumor subclusters enriched in certain pathways to originate predominantly from only one of the spatially distinct samples from each case, such as S1H3 in HT185P1 and S1H4 in HT200P1 (Fig. 2d,e). Other sets of co-upregulated genes included ‘KRAS signaling up’ and ‘inflammatory response’, which lead to pancreatitis, pancreatic intraepithelial neoplasia (PanIN) and eventually PDAC³⁴. Increased expression of these sets occurred in samples with lower numbers of proliferating tumor cells, as demonstrated by clusters 7 and 0 from sample HT200P1_M1K1 and cluster 10 from HT185P1_S1H2 (Fig. 2b,c). KRAS-associated inflammatory response was expressed in clusters with increased expression of gene sets associated with cell stress (defined by the TP53 pathway, hypoxia and TNFA signaling via NFKB). This could indicate that parts of the tumor with the most actively proliferating cells were least impacted by KRAS-driven inflammation, or that tumor cells modulate their KRAS-driven associated inflammation during proliferation (Fig. 2d,e and Methods).

**Fig. 2: Tumor subclusters with distinct cellular functions.**

Using the spatial transcriptomics cohort, we characterized spatial heterogeneity by integrating histology features. Most slides had dense stroma intermingled with tumor populations (Fig. 1e). For HT264P1, six of eight tumor subclusters were mapped from snRNA-seq data to the spatial transcriptomics spots using cell-type label transfer and Robust Cell Type Decomposition (RCTD) to deconvolve the cell-type composition of each spot (Methods and Fig. 2f,g)³⁵. Subpopulation ‘Tumor_2’ clustered separately from other tumor cells and had a distinct ‘lower grade’ morphology (as annotated by pathology) (Fig. 2g–i). Gene and pathway enrichment analysis revealed that the ‘Tumor_2’ population upregulates fucosylation, hydroxylation and HIF pathways, and genes associated with the basal-like tumor subtype (BTNL8, AGR3 and LYZ) (Fig. 2j,k). These findings suggest that this population is a spatially distinct cluster with different H&E morphology and likely more aggressive, highlighting the heterogeneous composition of tumor cells in PDAC in terms of pathology and transcriptomics. We identified ‘Tumor_4’ as a proliferative population that upregulates cell cycle pathways (Fig. 2j,k). Lastly, we characterized ‘Tumor_3’, which is scattered around the periphery of the large tumor regions in H&E (Fig. 2g,h). This population has high expression of laminin genes and is enriched in integrin and fibril formation pathways, which interact with the extracellular matrix and may be involved in tumor expansion (Fig. 2j,k)³⁶. In summary, we observe spatially and transcriptionally distinct tumor subpopulations from the same patient sample, within the same H&E section.

KRAS signaling and spatial drivers in pancreatic cancer

We observed substantial variation of driver mutation variant allele fractions (VAFs) between samples (Extended Data Fig. 1g and Supplementary Data 1). We detected a KRAS hotspot mutation in at least one sample in all cases, except for HT138P1 (HT204P1 lacks WES data). For low VAF (<0.01) mutations, 10 of 20 cases had samples whose mutation profiles differed from one another. We also identified pathogenic germline variants³⁷ (Methods). HT138P1 carried a pathogenic germline BRCA2 variant. Three cases carried variants in the homology-directed DNA repair pathway (FANCC D23*, BRCA2 I1470* and K607*, and ATM Y1124*), and, expectedly, all spatial samples carried the same variant in each case. Using RNA-seq, we classified samples into established subtypes^12,38,39 and determined immune subtypes and stromal and immune compartment scores using xCell⁴⁰ and ESTIMATE⁴¹, respectively (Extended Data Figs. 1g and 2a,b, Supplementary Note and Methods).

Tumor cells grouped into patient-specific clusters, consistent with the genomic landscapes within each patient (Fig. 3a and Methods). We mapped mutation and copy number alterations to single cells (Fig. 3b and Methods). We tested the impact of KRAS hotspot variants by comparing gene expression profiles of each subset of tumor cells with a given KRAS mutation (Methods). Interestingly, tumor cells harboring KRAS G12V upregulate several genes associated with more aggressive or metastatic tumors, including COL1A1, VIM and MUC5B (Extended Data Fig. 2c)^42,43,44. We identified five cases with multiple KRAS hotspot drivers, which we interpret as synchronous primary tumor clones^11,45,46 (Extended Data Fig. 2d). For case HT061P1, we obtained four subpopulations of clustered tumor cells, three small clusters largely derived from punch A and one large cluster common to all three punches (the remainder, as expected, represented all clusters) (Fig. 3c and Extended Data Fig. 2d,e). KRAS G12V cells faithfully map into one cluster from punch A predominant populations, with G12D cells mapping onto the large mixed cluster. Thus, two distinct clones carrying different KRAS driver mutations in the same patient are spatially separated, with differing gene expression profiles (Fig. 3c).

**Fig. 3: Genomic landscape and oncogenic driver heterogeneity.**

Using inferCNV⁴⁷, we identified copy number variation (CNV) signatures at both focal and arm levels that are unique to the respective KRAS subclones in case HT061P1 (Fig. 3c, Extended Data Fig. 2f and Methods). Copy number alterations were further evaluated using WES (Supplementary Data 2 and Methods). The G12D population shows amplifications of AKT2 and MYC, while both G12D and G12V clusters harbor amplifications in GATA6, among others (Fig. 3c and Extended Data Fig. 2f). We reconstructed a lineage tree using MEDALT⁴⁸ (Fig. 3d and Methods), separating tumor cells into two major groups, each branching from a normal duct origin, consistent with both gene expression-based clustering and spatial origin of the cells (Fig. 3c,d). Together, we propose a model that integrates the gene expression and CNV data (Fig. 3e and Supplementary Note).

To assess proteomic heterogeneity, we conducted a global pairwise correlation analysis for the n = 30 samples from 9 tumor cases (Methods and Supplementary Note). We determined the impact of mutations on downstream targets by analyzing the associated changes at protein and phosphorylation levels in several oncogenic pathways (Methods). We found a large degree of differential regulation, between and within tumors, in several phosphosites within the PI3K/PDk1/Akt and Raf/Mek/Erk pathways (Fig. 3f, Extended Data Fig. 2g–i and Supplementary Note).

Transitional populations between acinar and tumor cells

The hypothesis that PDAC arises from acinar cells that undergo ADM^49,50,51 has been examined in mice, but not humans⁵². A major hurdle has been the small numbers of acinar and ADM cells that have been sampled from patients at single-cell resolution^24,25.

We identified populations of acinar cells expressing acinar markers (PRSS1, CELA3A) from multiple samples (Fig. 4a–d). The Acinar-REG⁺ cluster exhibits high expression of regenerating proteins^25,30,53,54 thought to promote ADM and PanIN in PDAC^55,56. Two mixed populations of ductal cells^30,54 lacked genomic alterations and maintained high expression of ductal markers (CFTR, SLC4A4, ANXA4, SOX9). Duct-like1 expresses SPP1 and CRP, which have been observed in stressed cells and have progenitor-like features from the pancreatic ductal niche²⁵. Duct-like2 expresses normal ductal genes to a lesser extent and shows increased expression of mucus secretion (MUC5B) and trefoil factor genes. Highly expressed markers in Duct-like2 suggest these cells are a major source of malignant PDAC cells²⁴, and transcriptionally resemble cells identified in healthy pancreases³⁰. Finally, this cluster exhibits expression of ONECUT2, a transcription factor exclusively expressed in metaplastic cells derived from acinar origin in a mouse model⁵⁴. Peng et al.²⁴ hypothesized that subclusters of Duct-like2 could be PanIN-like, but we find that our PanIN and ADM populations are distinct from Duct-like2 and are only identified in our cohort (Extended Data Fig. 3a,b and Supplementary Note). PDAC exhibits high expression of FXYD3, S100P and KRT17 in addition to copy number and driver mutations. PanIN-like cells were derived from 19 patients with a large proportion from sample HT168P1 (>1,700 cells) and, consistent with previous studies, we observe PDAC-initiating mutations in KRAS and CDKN2A (from HT168P1) within PanIN populations⁵². PanIN exhibits increased expression of extracellular matrix-related genes (DCN, SPARC, SPON1), a diversity of collagens⁵⁷, genes involved in acinar-to-ductal reprogramming (KLF4, MMP7)^58,59 and other markers of early-stage malignancy (CXCL12, TIMP3, ITGA1, MUC5AC)^60,61,62,63.

**Fig. 4: Acinar, ductal and transitional populations.**

We detected ADM cells in eight samples, including from cases HT122P1 and HT168P1 that represented a large proportion of ADM cells (>700 cells) (Supplementary Data 3). We found PDAC cells harboring several distinct alterations (Fig. 4a,b). Cells expressing acinar and ductal markers clustered separately from acinar cells and tended to be between acinar cells and normal ductal lineages on the Uniform Manifold Approximation and Projection (UMAP) (Duct-like1, Duct-like2), termed ADM_Normal, or between acinar and PanIN, denoted ADM_Tumor (Fig. 4b,c). While tumor and acinar cells express ductal and acinar markers in a mutually exclusive pattern, ADM cells express a combination, suggestive of an intermediate, reversible state (Fig. 4d). While both ADM_Tumor and ADM_Normal have decreased expression of acinar markers, they also have increased expression of PDAC markers and Duct-like1 markers, respectively. Following copy number analysis (CopyKAT, Methods), we found that a majority of predicted aneuploid cells are annotated as PanIN (n = 524), while only a handful (n = 30) in ADM_Tumor are labeled aneuploid (Fig. 4e,f). By mapping both KRAS and CDKN2A mutations, we identified several cells in the ADM_tumor population with either a KRAS mutation (n = 1) or CDKN2A mutation (n = 7), although this was not as widespread as the predicted PanIN populations (KRAS: 23 cells; CDKN2A: 163 cells) (Fig. 4g,h and Supplementary Data 3).

We examined whether acinar cells transition to different expression states (tumor or normal) by way of the two distinct ADM cell populations by performing Monocle analysis (Methods). We found two different transition states starting either with acinar cells transitioning towards the normal ductal cell route with ADM_Normal cells in between or with cells transitioning from acinar cells towards PanIN cells with ADM_Tumor cells in between (Fig. 4i). This suggests ADM_Normal is a transition state more similar to normal ductal cells and largely lacking genomic alterations, while ADM_Tumor is more related to PanIN and has a few alterations (for example, CDKN2A, aneuploidy). Recent studies in mice suggest acinar-derived tumors are preceded by PanINs, while ductal-derived tumors are PanIN independent⁶⁴.

Validation of ADM using snRNA-seq, immunohistochemistry and mouse models

We orthogonally surveyed two samples by snRNA-seq to see if cells expressing acinar and ductal features could be identified from frozen tissue (Fig. 5a–c). ADM cells in HT288P1 and HT412P1 snRNA-seq samples have higher expression of Duct-like1 features than tumor cells, suggesting similarity to the ADM_Normal population in scRNA-seq.

**Fig. 5: Validation of ADM using snRNA and immunofluorescence.**

As validation, we performed immunofluorescence staining on tumor and normal formalin-fixed paraffin-embedded (FFPE) sections with amylase (acinar), cytokeratin-19 (ductal), Hoechst (nuclei) and Ki67 (proliferation) to evaluate co-staining patterns within individual cells (Fig. 5d, Extended Data Fig. 4a and Methods). In HT122P1 and HT288P1, we observe co-staining of acinar and ductal markers within multiple individual cells across several tumor regions. As controls, we provide a normal section with intermixed acinar and ductal cells, but lacking a co-staining expression pattern (HT288P1), and a tumor section that is predominantly stained by cytokeratin-19 (HT190P1). The paucity of ADM using immunofluorescence was recently validated in a tamoxifen-induced PDAC mouse model where acinar transformed ductal cells similarly co-stained for amylase and Cytokeratin-19 (ref. ⁶⁵). Although rare, this co-staining pattern is confirmed at single-cell resolution in the same samples for which we performed immunofluorescence, thanks to our spatial sampling strategy. We identified two additional samples (HT412P1, HT434P1) with high acinar content with the same co-expression patterns (Extended Data Fig. 4b,c).

Finally, we performed scRNA-seq on eight mice from mouse models including induction of pancreatitis, KRAS-driven early- and late-stage transformation (KPC-OG GEMM mice) and normal pancreas tissues (Methods)⁶⁶. We identified two pancreatitis-acinar populations, one tumor-acinar population and one normal-acinar (Extended Data Fig. 5a–c). Markers within the pancreatitis-acinar populations overlap differentially expressed genes (DEGs) identified in another study⁶⁷ (Extended Data Fig. 5d). The tumor-acinar population from the KPC-OG model was the only acinar cluster with GFP expression (Extended Data Fig. 5b). Within this tumor model, GFP is associated with early transformation and metaplasia. While the tumor-acinar population expresses Reg3a, which is overexpressed in ADM regions⁶⁸, it also maintains high expression of Sox9, a ductal lineage marker in normal ductal and cancer cells, suggesting early-stage metaplasia.

These results support the identification of this rare population with acinar and ductal-like features seemingly lacking widespread genomic alterations. Our findings enable us to expand the proposed models of acinar origin to human PDAC development (Fig. 5e).

Transitional populations in histological features by spatial transcriptomics

Spatial transcriptomics enables the identification of histology features (for example, PanIN) to complement findings in sc/snRNA-seq. Each H&E image with associated spatial transcriptomics data was annotated by a pathologist (Extended Data Fig. 6a). Of ten paired spatial transcriptomics/snRNA-seq samples, HT288P1 presented with ADM (88 cells). We compared pathology annotations with spots defined via integration with snRNA-seq and with RCTD (Fig. 6a–c)³⁵. We observe strong concordance between pathology-defined and molecular regions for tumor, normal duct and acinar cells. Myeloid and plasma cells mapped throughout annotated pancreatitis regions, Duct-like2 mapped to annotated normal ductal structures and ADM mapped sparsely throughout the section. It remains difficult to validate ADM in spatial transcriptomics data since it has not yet reached single-cell resolution and ADM cells are rare. Under these circumstances, pancreatitis regions may appear to have both acinar and ductal features in agreement with Tosti et al.³⁰.

**Fig. 6: Tumor and transitional cell heterogeneity in spatial transcriptomics data.**

We annotated spatial transcriptomics spots in select samples with PanIN and a ductal structure within the capture area (Fig. 6a–e). Extracted spots were then subsetted from the full object and DEGs were analyzed. Isolated PanIN regions exhibited distinct DEGs that spanned multiple samples (Fig. 6f and Supplementary Data 4). DEGs identified with spatial transcriptomics data for normal duct, tumor and PanIN were compared against annotated scRNA-seq data, corroborating our initial annotations of PanIN, normal duct and tumor. Interestingly, the two cases having multiple PanIN regions identified by spatial transcriptomics exhibited distinct DEGs that differed by each uniquely annotated region. Our combined analysis strongly supports the presence of PanIN and existence of ADM in human samples.

CAF subpopulations in PDAC TME

We identified three subtypes of CAFs: myofibroblastic CAFs (myCAFs), inflammatory CAFs (iCAFs) and antigen-presenting CAFs (apCAFs)^14,15,69 (Fig. 7a and Extended Data Fig. 7a). We also observed subpopulations of iCAFs, denoted as CXCR4⁺ iCAFs and CD133⁺ iCAFs. Several CAF markers, including ACTA2 and FAP, used to identify CAF subtypes are not definitive, being often expressed in both iCAFs and myCAFs⁷⁰. We identified the top DEGs between each CAF subtype; TAGLN and ACTA2 discern myCAFs, FAP and CXCL12 distinguish iCAFs, and apCAFs express HLA-DRA and CD74 (refs. ^14,71) (Fig. 7b and Extended Data Fig. 7b). CXCR4⁺ iCAFs and CD133⁺ iCAFs are defined by CXCR4 and CD133 (PROM1), respectively, although they also weakly express myCAF and apCAF marker genes. While most CAFs in tumors are iCAFs or myCAFs, the other CAF subtypes are present at low numbers throughout. These CD133⁺ iCAFs carry no genomic alterations, but express cancer stem cell markers, including CD133, MET, EPCAM, CD24 and CD44. We observed high CD44 expression in apCAFs and CXCR4⁺ iCAFs. VIM and NFE2L2 were highly expressed in apCAFs, which were more abundant in treated samples (P < 10⁻⁵) (Fig. 7b). These results suggest that small unique CAF subpopulations that express cancer-driving programs exist within standard CAF subtypes. We examined expression of CAF genes currently targeted by clinical trials registered since January 2020 (ref. ¹⁵) (Fig. 7c). As treated samples have a depletion of myCAFs and enrichment of iCAFs, the effectiveness of additional therapies targeting CAFs may differ across treatment groups. Further, tumor-specific CAF clusters (relative to normal adjacent tissue) were enriched for TME-remodeling pathways (Extended Data Fig. 7c–e and Supplementary Note).

**Fig. 7: CAF subpopulations across treatment groups.**

By assessing cell-type enrichment across treatment groups, we detected modest changes in endothelial and tumor cells and the largest difference within fibroblasts, where both treated groups had higher numbers than the treatment-naïve group (Fig. 7d). This is largely driven by a threefold higher level of iCAFs in FOLFIRINOX and Gemcitabine + nab-paclitaxel samples (P < 10⁻³), with comparable myCAF abundance between treatment groups (Fig. 7e). As iCAFs are considered to be pro-tumorigenic⁷², this large increase of iCAFs after treatment may be associated with treatment resistance. We observed upregulation in heat shock genes, AP-1 pathway genes and metallothionein genes in treated iCAFs (Fig. 7f). As we only observe substantial expression of metallothionein genes in iCAFs, their prognostic value for predicting chemotherapy resistance originates from the stroma, rather than tumor cells⁷³ (Fig. 7g). Heat shock and AP-1 genes were more highly expressed in FOLFIRINOX samples, while metallothioneins were more highly expressed in Gemcitabine + nab-paclitaxel samples, suggesting iCAF heterogeneity based on treatment regimen (Fig. 7f). These observations suggest that treated tumors have much higher levels of iCAFs, which are potential targets for chemo-resistant tumors.

We identified 18 proteins having at least a twofold or greater change between treated and untreated samples (Methods and Supplementary Note). GBP6, PTGDS and ADAM23 were elevated in treated samples, while REG1A, EIF1AY, PRSS3 and HLA-DRB4 were elevated in naïve samples (Fig. 7h). While these proteins display overall differences between treated and naïve samples, we observe modest heterogeneity between spatial samples and between a subset of tumor cases (Methods). To assess whether these proteins are signals originating from tumor cells or the TME, we compared their expression profiles in scRNA data within each cell type among treatment groups. REG1A is upregulated in naïve apCAFs, while PTGDS is upregulated in treated endothelial and iCAF cells; neither were observed in tumor cells. (Fig. 7i). Only AKR7A3 and SDCBP2 were consistently upregulated strictly in tumor cells. These results suggest that several of these differentially abundant proteins may originate from the TME.

Immunosuppressive PDAC TME and treatment

To examine the immunosuppressed TME characteristics of PDAC¹⁸, we identified and reclustered immune cells into lymphocytes or myeloid/dendritic cells. In the latter group, we further distinguished between type I and II classical dendritic cells (cDC1, cDC2), macrophages, monocytes and neutrophils. Myeloid cells and classical dendritic cells strongly express TME-remodeling pathway genes, such as angiogenesis and hypoxia pathways, at higher levels than tumor cells (Extended Data Fig. 8a–c). While tumor cells do not have high expression of NFE2L2 relative to myeloid cells, elevated expression occurs downstream of the Nrf2 pathway (NQO1, GPX2), which regulates oxidative damage repair (Fig. 8a). Such activation may be triggered via paracrine interactions with TME cells and would indicate that myeloid and dendritic cells contribute towards a pro-tumor TME. Within lymphocytes, we observed slight enrichment of CD4/CD8⁺ T cell subsets in treated samples and expression of heat shock genes in FOLFIRINOX samples (Extended Data Fig. 8d,e,h and Supplementary Note).

**Fig. 8: Myeloid and lymphocyte populations in the TME.**

We analyzed the expression of common immune checkpoint receptors and ligands, including PD-1 (PDCD1), CTLA4 and TIGIT (Fig. 8b). We observed strong expression of exhaustion markers in NK, CD4+ and CD8⁺ T, and regulatory T (Treg) cells. We do not detect any significant expression of immune checkpoint genes in tumor cells or transitioning populations, including PD-L1 and PD-L2, consistent with the poor response of PDAC to anti-PD-1/PD-L1 immunotherapy^74,75. Receptor–ligand analyses reveal interactions between the TIGIT receptor in lymphocytes and NECTIN ligands across all samples, which we found highly expressed in tumor cells, but somewhat less in other transitioning ductal populations (Fig. 8c and Methods). This is consistent with reports that NECTIN4 is a potential target for immune checkpoint blockade^76,77. TIGIT interaction with NECTIN inactivates T and NK effector function, which the tumor could exploit for immune evasion^76,77. NECTIN1 and NECTIN4 are most strongly expressed in tumor cells, but NECTIN2 and NECTIN3 are also expressed in some lymphocyte cell types, while TIGIT is largely expressed in Tregs and exhausted CD4⁺ T cells (Fig. 8c and Extended Data Fig. 8i).

NECTIN1/2/3 are expressed in fibroblasts, endothelial cells and lymphocytes, respectively, while NECTIN4 is the most tumor cell-specific NECTIN (Fig. 8c), consistent with previous reports⁷⁶. We analyzed TIGIT and NECTIN expression of individual samples and observed high expression of all NECTIN receptors in tumor cells and TIGIT in Tregs and exhausted T cells, and noted substantial heterogeneity across cases, particularly in TIGIT expression in exhausted T cells and in NECTIN1 and NECTIN3 expression in tumor cells (Fig. 8d). Finally, in the snRNA-seq cohort, we observe the same expression pattern of TIGIT and NECTIN across cell types (Extended Data Fig. 8j). Using spatial transcriptomics data, we focused on two regions in HT259P1 and HT264P1 to show the expression of TIGIT in spots proximal to the infiltrating lymphocyte regions (Fig. 8e). We find colocalization of tumor regions in the H&E with expression of NECTIN4 across most H&E slides, regardless of treatment status (Fig. 8f). These results provide a rationale for targeting the TIGIT–NECTIN axis to improve anti-tumor T cell activity.

Discussion

Using bulk sequencing and proteomics/phosphoproteomics, single-cell sequencing, spatial transcriptomics and high-resolution cellular imaging on 83 PDAC samples, we identified transitional populations, including ADM and PanIN, and populations of nontransformed acinar and duct cells and PDAC cells. ADM populations express both oncogenes and tumor suppressor genes, significantly upregulating epithelial-to-mesenchymal transition and stem cell genes, compared with tumor cells⁷⁸. The unique expression pattern of ADM as an intermediate state suggests a dynamic transition between tumor and acinar fates and progression towards PDAC via acquisition of a driver KRAS event. This is consistent with acinar sensitivity to KRAS mutations as a catalyst of ADM and inclination toward PDAC⁷⁸. Driver mutations mapped to PanIN-like cells and tumor cells, consistent with their role as a precursor lesion. We used pathology-assisted spatial transcriptomics to identify distinct tumor subpopulations, as well as PanIN and PDAC-associated chronic pancreatitis. PanIN profiling by spatial transcriptomics provides direct confirmation of scRNA-seq findings.

CAFs are poorly understood in PDAC. Historically presumed to be cancer drivers, they are now known to have dual behavior as either drivers or suppressors of cancer, depending upon numerous factors^79,80. We identified iCAFs, myCAFs and apCAFs, further classifying two iCAF subsets as CD133⁺ and CXCR4⁺. We noted that several markers and activated pathways found to be differentially expressed between subtypes are being explored in current clinical trials¹⁵. We observed higher iCAF abundance in treated samples. This is important, as IL-1-mediated and JAK-STAT signaling in iCAFs have motivated trials of adding IL-1R blockade to standard-of-care (FOLFIRINOX-based) chemotherapy⁷² (ClinicalTrials.gov: NCT02021422) and treating KPC mouse models with a JAK inhibitor decreases tumor size⁸¹. In patients treated with gemcitabine and nab-paclitaxel (either alone or in succession with other therapies), we observed upregulation of metallothionein genes in iCAFs. Metallothionein proteins are associated with resistance to a variety of chemotherapeutics, and may signal a chemoresistance mechanism⁸².

Immunotherapy has revolutionized treatment of many tumors, but is not yet effective for PDAC^83,84,85. Lack of immune checkpoint blockade activity in PDAC is multifaceted, including the shortage of naturally occurring T cell responses, partially due to the inhibition of effective T cell priming and/or T cell exclusion^86,87. Single-cell analysis revealed that NECTIN family members, especially NECTIN4, are tumor-specific, uncovering a potentially targetable interaction with TIGIT in Tregs and exhausted T cells. We observed high expression of all NECTIN genes in tumor cells and TIGIT in Tregs and exhausted T cells, but noted substantial heterogeneity across cases. Clarifying the key elements of the immunosuppressive PDAC microenvironment may pave the way for effective immunotherapy in PDAC.

Our study provides a comprehensive analysis of PDAC spatial heterogeneity and treatment effects. We found substantial heterogeneity in PDAC, including spatially separated driver clones, subtype heterogeneity within the same patients and multiple transitional cell populations, including duct-like, ADM and PanIN. Our work provides a resource to identify new targets of clinical relevance. We acknowledge the heterogeneous treatments in the patient population included in the study. Future work using clinical trials, specimens with uniform treatment regimens and comprehensive clinical response data will identify treatment-associated resistance signatures.

Methods

Specimens and clinical data

All samples were collected with informed consent in concordance with the Washington University Institutional Review Board (IRB) at the Washington University School of Medicine in St Louis (St Louis, MO). Primary pancreatic adenocarcinoma samples were collected during surgical resection and verified by standard pathology (IRB protocol 201108117). Blood was collected at the time of surgery into vacuum tubes containing heparin or EDTA (BD Bioscience). Cells were isolated by Ficoll-density centrifugation and frozen in FBS with 5% dimethyl sulfoxide. Clinical data were captured in accordance with IRB protocol 20108117, at the time of informed consent, and entered into the REDCap database.

Sample processing

After verification by an attending pathologist, a 1.5 × 1.5 × 0.5-cm³ portion of the tumor was removed, photographed, weighed and measured. Each piece was then subdivided into 6–9 pieces (depending on the original size) and then further subdivided into four transverse cut pieces. Pieces were each then separately placed into formalin, snap-frozen in liquid nitrogen, DMEM and formalin, respectively. The purpose of the switch from punch sampling to the grid processing method was utility-based, as it minimized remainder tissue.

Genomic DNA and RNA extraction

Tumor tissues and corresponding normal mucosae were obtained from surgically resected specimens, and after a piece was removed for fresh single-cell prep the remaining sample was snap-frozen in liquid nitrogen and stored at −80 °C. Before bulk RNA/DNA extraction, samples were cryo-pulverized (Covaris) and aliquoted for bulk extraction methods. Genomic DNA was extracted from tissue samples with either the DNeasy Blood and Tissue Kit (Qiagen, 69504) or the QIAamp DNA Mini Kit (Qiagen, 51304). Total RNA was extracted with TRI reagent (Millipore Sigma, T9424) and treated with DNase I (Qiagen, 79254) using an RNeasy MinElute Cleanup Kit (Qiagen, 74204). RNA integrity was evaluated using either a Bioanalyzer (Agilent Technologies) or TapeStation (Agilent Technologies). Genomic germline DNA was purified from cryopreserved peripheral blood mononuclear cells using the QiaAMP DNA Mini Kit (Qiagen, 51304) according to the manufacturer’s instructions (Qiagen). The DNA quantity was assessed by fluorometry using the Qubit dsDNA HS Assay (Q32854) according to manufacturer’s instructions (Thermo Fisher Scientific).

WES

First, 100–250 ng of genomic DNA was fragmented on the Covaris LE220 instrument targeting 250-base pair (bp) inserts. Automated dual-indexed libraries were constructed with the KAPA Hyper library prep kit (Roche) on the SciClone NGS platform (Perkin Elmer). Up to ten libraries were pooled at an equimolar ratio by mass before the hybrid capture targeting a 5-µg library pool. The library pools were hybridized with the xGen Exome Research Panel v1.0 reagent (IDT Technologies) that spans a 39-megabase (Mb) target region (19,396 genes) of the human genome. The libraries were hybridized for 16–18 h at 65 °C followed by stringent washing to remove spuriously hybridized library fragments. Enriched library fragments were eluted and PCR cycle optimization was performed to prevent over amplification. The enriched libraries were amplified with KAPA HiFi master mix (Roche) before sequencing. The concentration of each captured library pool was accurately determined through quantitative PCR (qPCR) utilizing the KAPA Library Quantification Kit according to the manufacturer’s protocol (Roche) to produce cluster counts appropriate for the Illumina NovaSeq-6000 instrument. Then, 2 × 150 paired-end reads were generated targeting 12 gigabases of sequence to achieve ~100x coverage per library.

RNA-seq

Total RNA integrity was determined using Agilent Bioanalyzer or 4200 Tapestation. Library preparation was performed with 500 ng to 1 μg of total RNA. Ribosomal RNA was blocked using FastSelect reagents (Qiagen) during complementary DNA synthesis. RNA was fragmented in reverse transcriptase buffer with FastSelect reagent and heated to 94 °C for 5 min, 75 °C for 2 min, 70 °C for 2 min, 65 °C for 2 min, 60 °C for 2 min, 55 °C for 2 min, 37 °C for 5 min and 25 °C for 5 min. mRNA was reverse transcribed to yield cDNA using SuperScript III RT enzyme (Life Technologies, per manufacturer’s instructions) and random hexamers. A second strand reaction was performed to yield double-stranded cDNA (ds-cDNA). cDNA was blunt ended, had an A base added to the 3′ ends and then had Illumina sequencing adapters ligated to the ends. Ligated fragments were then amplified for 15 cycles using primers incorporating unique dual index tags. Fragments were sequenced on an Illumina NovaSeq-6000 S4 instrument, generating approximately 30 million paired-end 2 × 150 reads per library.

Single-cell suspension preparation

For each tumor, approximately 15–100 mg of 2–4 sections of each tumor and/or normal piece of tissue were cut into small pieces using a blade and processed separately. Enzymes and reagents from the Human Tumor Dissociation Kit (Miltenyi Biotec, 130-095-929) were added to the tumor tissue along with 1.75 ml of DMEM. The resulting suspension was loaded into a gentleMACS C-tube (Miltenyi Biotec, 130-093-237) and subjected to the gentleMACS Octo Dissociator with Heaters (Miltenyi Biotec, 130-096-427). After 30–60 min on the heated dissociation program (37h_TDK_1), samples were removed from the dissociator and filtered through a 40-μm Mini-Strainer (PluriSelect no. 43-10040-60) or 40-μm Nylon mesh (Fisher Scientific, 22-363-547) into a 15-ml conical tube on ice. The sample was then spun down at 400g for 5 min at 4 °C. After removing the supernatant, when a red pellet was visible, the cell pellet was resuspended using 200 μl to 3 ml of ACK Lysis Solution (Thermo Fisher, A1049201) for 1–5 min. To quench the reaction, 10 ml of PBS (Corning, 21-040-CM) with 0.5% BSA (Miltenyi Biotec, 130-091-376) was added and spun down at 400g for 5 min at 4 °C. After removing supernatant, the cells were resuspended in 1 ml of PBS with 0.5% BSA, and live and dead cells were visualized using Trypan Blue. If over 40% of dead cells were present, the sample was spun down at 400g for 5 min at 4 °C and subjected to the dead cell removal kit (Miltenyi Biotec, 130-090-101). Finally, the sample was spun down at 400g for 5 min at 4 °C and resuspended in 500 μl to 1 ml of PBS with 0.5% BSA to a final concentration of 700 to 1,500 cells per μl.

Single-nuclei suspension preparation

First, 15–25 mg of pulverized tissue was placed in a 5-ml Eppendorf tube on ice. Using a wide-bore pipette tip (Rainin), a lysis buffer prepared from the Nuclei Isolation protocol (10x Genomics) and SuperRNase inhibitor (Invitrogen) was added to the tube. The tissue solution was gently pipetted until the lysis liquid turned a slightly cloudy color. (The number of pipetting iterations depended on the specific tissue.) The tissue homogenate was then filtered through a 40-μm strainer (pluriSelect) and washed with a BSA wash buffer (2% BSA + 1 × PBS + RNase inhibitor). The filtrate was collected, centrifuged at 500g for 6 min at 4 °C and resuspended with a BSA wash buffer. Then, 100 μl of cell lysis solution was set aside for unstained reference, while the rest was stained with 1 μl of 7AAD per 500 μl of the sample. Nuclei underwent FACS and sorting gates were based on size, granularity and dye staining signal. The final suspension was spun down at 500g for 6 min at 4 °C, and resuspended with a BSA wash buffer.

Single-cell/nuclei library prep and sequencing

Utilizing the Chromium Next GEM Single Cell 3′ GEM, Library & Gel Bead Kit v.3.1 and Chromium instrument, approximately 17,500 to 25,000 cells were partitioned into nanoliter droplets to achieve single-cell resolution for a maximum of 10,000 to 15,000 individual cells per sample (10x Genomics, 1000269). The resulting cDNA was tagged with a common 16-nucleotide (nt) cell barcode and 10-nt Unique Molecular Identifier during the reverse transcriptase (RT) reaction. Full-length cDNA from poly-A mRNA transcripts was enzymatically fragmented and size-selected to optimize the cDNA amplicon size (approximately 400 bp) for library construction (10x Genomics). The concentration of the 10x single-cell library was accurately determined through qPCR (Kapa Biosystems) to produce cluster counts appropriate for the HiSeq 4000 or NovaSeq-6000 platform (Illumina). Then, 26 × 98-bp sequence data were generated targeting 50,000 read pairs per cell, which provided digital gene expression profiles for each individual cell.

Spatial transcriptomics prep and sequencing

Optimal cutting temperature (OCT)-embedded tissues were cryosectioned and placed on a Visium Spatial Gene Expression Slide following Visium Spatial Protocols-Tissue Preparation Guide (10x Genomics, CG000240 Rev A). Briefly, fresh tissues were coated carefully and thoroughly with room temperature OCT without any bubbles. OCT-coated tissues were then placed on a metal block chilled in dry ice until the OCT turned solidified and white. After RNA quality check using Tapestation and morphology check using H&E staining for the OCT-embedded tissues, blocks were scored into a proper size that fit the Capture Areas and then sectioned into 10-μm sections. After the tissue placement into the Capture Area, sections were fixed in methanol, stained with H&E and imaged at ×20 magnification using the brightfield imaging setting on a Leica DMi8 microscope. Tissues were then permeabilized for 18 min and Spatial Transcriptomics libraries were constructed following Visium Spatial Gene Expression Reagent Kits User Guide CG000239 Rev A (10x Genomics). Briefly, cDNA was reverse transcribed from the poly-adenylated messenger RNA which was captured by the primers on the slides. Next, the second strand was synthesized and denatured from the first strand. Free cDNA was then transferred from slides to tubes for further amplification and library construction. Libraries were sequenced on the S4 flow cell of the Illumina NovaSeq-6000 system.

KPC-OG GEMM mouse model

Three KPC-OG GEMM mice were killed at 3–5 months old, at a time when pathologically these mice have early metaplasia and PanIN throughout the pancreas, with only microscopic PDAC detectable^66,88. Age-matched KPC-OG negative littermates (CRE and OG negative) were treated with caerulin to induce acute pancreatitis by administering 6 hourly intraperitoneal injections (that is, once per hour for 6 h) at a dose of 100 μg kg⁻¹ given every other day for 1 week. For normal, we extracted tissue from KPC-OG breeders negative for cre that underwent no treatment. Cell types were annotated from previous publications^14,89. All mice were bred and maintained under specific pathogen-free conditions, 12-h light/dark cycle, in accordance with the National Institute of Health and American Association for Accreditation of Laboratory Animal Care (NIH-AALAC) standards and consistent with Washington University School of Medicine Institutional Animal Care and Use Committee (IACUC) regulations (protocol no. 19-0856). Ethical approval for all mouse work was given by Washington University School of Medicine IACUC under protocol no. 19-0856.

Somatic variant calling

Somatic variants were called from whole-exome tumor-normal paired BAMs using somaticwrapper v.1.5, a pipeline designed for detection of somatic variants from tumor and normal WES data. The pipeline merges and filters variant calls from four callers: Strelka v.2.9.2 (ref. ⁹⁰), VarScan v.2.3.8 (ref. ⁹¹), Pindel v.0.2.5 (ref. ⁹²) and MuTect v.1.1.7 (ref. ⁹³). SNV calls were obtained from Strelka, Varscan and MuTect. Indel calls were obtained from Strelka, Varscan and Pindel. The following filters were applied to obtain variant calls of high confidence: normal VAF ≤ 0.02 and tumor VAF ≥ 0.05, read depth in tumor ≥14 and normal ≥8, indel length <100 bp, all variants must be called by 2 or more callers, all variants must be exonic and variants in dbSNP but not in COSMIC excluded.

KRAS hotspot and within-case genotyping

To verify manually and/or determine the KRAS mutation status at KRAS hotspots G12, G13 and Q61, we used bam-readcount. For each case, we first applied bam-readcount to generate readcounts for each of the nine bases in these loci and then calculated VAF values of all the KRAS hotspots based on reference and alternative base read counts at each position. Additionally, we manually verified every variant present in a sample in a pairwise fashion against other samples within the same case.

Germline variant calling and annotation

Germline variant calling was performed using an in-house pipeline, germlinewrapper v.1.1 (https://github.com/ding-lab/germlinewrapper), which implements multiple tools for the detection of germline INDELs and SNVs. Germline SNVs were identified using VarScan v.2.3.8 (with parameters: --min-var-freq 0.10 --p-value 0.10 --min-coverage 3 --strand-filter 1) operating on an mpileup stream produced by samtools v.1.2 (with parameters: -q 1 -Q 13) and GATK v.4.0.0.0 (ref. ⁹⁴) using its haplotype caller in single-sample mode with duplicate and unmapped reads removed and retaining calls with a minimum quality threshold of 10. All resulting variants were limited to the coding regions of the full-length transcripts obtained from Ensembl release 95 plus an additional 2 bp flanking each exon to cover splice donor/acceptor sites. We required variants to have allelic depth ≥ 5 reads and alternative allele frequencies ≥ 20% in both the tumor and normal samples. We used bam-readcount v.0.8 for reference and alternative alleles quantification (with parameters: -q 10 -b 15) in both normal and tumor samples. Additionally, we filtered all variants with ≥0.05% frequency in gnomAD v.2.1 (ref. ⁹⁵) and the 1000 Genomes Project⁹⁶.

Germline variant pathogenic classification

For annotation and prioritization of the filtered germline variants, we used our automatic variant classification tool CharGer v.0.5.4 (ref. ³⁷), which computes a classification score based on American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG-AMP) guidelines. CharGer automatically marks as pathogenic those input variants that are marked as known pathogenic in ClinVar’s curated database and marks as likely pathogenic those variants with a CharGer score > 8. All pathogenic or likely pathogenic variants had both their normal and tumor samples reviewed manually by us using the Integrative Genomics Viewer software.

sc/snRNA-seq data preprocessing

For each sample, we obtained the unfiltered feature-barcode matrix per sample by passing the demultiplexed FASTQs to Cell Ranger v.3.1.0 ‘count’ command using default parameters and the prebuilt GRCh38 genome reference v.3.0.0 (GRCh38 and Ensembl 93) for scRNA or the pre-mRNA version for snRNA. Seurat v.3.1.2 (refs. ^97,98) was used for all subsequent analyses. First, a series of quality filters was applied to the data to remove those barcodes that fell into any one of these categories recommended by Seurat: too few total transcript counts (<300); possible debris with too few genes expressed (<200) and too few unique molecular identifiers (UMIs) (<1,000); possible more than one cell with too many genes expressed (>10,000) and too many unique molecular identifiers (>10,000); possible dead cell or a sign of cellular stress and apoptosis with too high proportion of mitochondrial gene expression over the total transcript counts (>10%). We constructed a Seurat object using the unfiltered feature-barcode matrix for each sample. Each sample was scaled and normalized using Seurat’s ‘SCTransform’ function to correct for batch effects (with parameters: vars.to.regress = c(‘nCount_RNA’, ‘percent.mito’), variable.features n = 2,000). Any merged analysis or subsequent subsetting of cells/samples underwent the same scaling and normalization method. Cells were clustered using the original Louvain algorithm⁹⁹ and top 30 PCA dimensions via ‘FindNeighbors’ and ‘FindClusters’ (with parameters: resolution = 0.5) functions. The resulting merged and normalized matrix was used for the subsequent analysis. Mouse data were aligned to refdata-gex-mm10-2020-A and GFP was added to the reference using the cellranger mkref function (https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/using/tutorial_mr).

sc/snRNA-seq cell-type annotation

Main cell types were assigned to each cluster by manually reviewing the expression of a comprehensive set of marker genes (Supplementary Note). These assignments were all done by one person to maximize consistency.

Spatially distinct tumor cluster assignment

We used sample provenance of tumor cells as well as a requirement of 95% of cells in a tumor cluster to originate from a sample that is physically 6 mm from another sample to conclude that a subcluster is spatially distinct between samples (Supplementary Table 3).

scVarScan mutation mapping

We applied our in-house tool scVarScan that can identify reads supporting the reference and variant alleles covering the variant site in each individual cell by tracing cell and molecular barcode information in an scRNA bam file. For mapping, we used high-confidence somatic mutations from WES data. Additionally, we use cancerhotspots.org¹⁰⁰ to obtain the most common KRAS hotspot mutations at G12, G13 and Q61, and use scVarScan to detect potential minority KRAS mutations in each sample.

scVarScan statistics

To assess the degree of certainty that mutations were preferentially mapped to tumor cells versus nontumor cells (for which mappings can be reasonably assumed to be noise), we devised the following analysis based on the standard binomial difference of proportions test. Let X_T be the read count for mapped tumor mutations and let N_T be the total read count (mutation plus reference) for the tumor. Similarly, let X_N and N_N be the respective counts for the normal sample. The respective proportions of mapped reads for tumor and normal are clearly P_T = X_T/N_T and P_N = X_N/N_N. Also, define the average joint fraction as P_avg = (X_T + X_N)/(N_T + N_N) and its complement as Q_avg = 1 − P_avg. The large counts we are working with suggest the binomial distribution is well-approximated by the normal (Gaussian) distribution, as assessed by traditional heuristics N_T P_avg Q_avg ≥ 5 and N_N P_avg Q_avg ≥ 5. Adding the standard continuity correction (the normal distribution is continuous, whereas the binomial is discrete), we can then construct the following Z score for the difference of proportions:

$$Z_{\mathrm{score}} = \frac{{\left| {P_{\mathrm{T}} - P_{\mathrm{N}}} \right| - \left( {1/N_{\mathrm{T}} + 1/N_{\mathrm{N}}} \right)/2}}{{P_{{\mathrm{avg}}}Q_{{\mathrm{avg}}}\sqrt {1/N_{\mathrm{T}} + 1/N_{\mathrm{N}}} }}\quad \quad \quad \quad \quad \,P = \varPhi \left( {Z_{{\mathrm{score}}}} \right),$$

which is normally distributed with mean 0 and variance 1. The P value for the one-sided test of whether the tumor proportion is statistically greater than the normal proportion is Φ(Z_score), that is, the area under the standard Gaussian curve within the range Z_score ≤ Z < ∞. We restrict performance of the test only to those cases where P_T > P_N is actually observed, skipping cases of P_T ≤ P_N, to avoid over-correcting in the calculation of false discovery rate (FDR). Using this method, we determined that the rate of mutations mapping is significant in the following comparison: tumor cells versus nontumor cells (P ≈ 0), PanIN cells versus nontumor cells (P ≈ 0), tumor cells versus PanIN cells (P = 1.04 × 10⁻¹¹).

Single-cell RNA CNV detection

To detect large-scale chromosomal CNVs using single-cell RNA-seq data, inferCNV (v.0.8.2) was used with default parameters recommended for 10x Genomics data. All cells that are not tumor cells were pooled together for the reference normal set. InferCNV was run at a sample level and only with post-quality controlled filtered data. To calculate arm-level CNV events, we used an in-house script to match the gene-level inferCNV output to chromosome bands and take the mean value for each arm.

Single-cell mutation and CNV plotting

For clarity, we assigned each cell, represented by a single dot in a UMAP plot, with only one genetic alteration, in a hierarchical fashion. For clarity and to not overcomplicate plotting due to too many comparison groups, if a mutation and a copy number event are detected in the same cell, the cell is labeled with the mutation. Additionally, when multiple mutations or copy number events are detected in the same cell, we plot them hierarchically as follows: KRAS > CDKN2A > SMAD4 > TP53.

Differential sc/snRNA expression analyses

For cell-level and cluster-level differential expression, we used the ‘FindMarkers’ or ‘FindAllMarkers’ Seurat function as appropriate, using a minimum percentage of 0.25 (parameter min.pct = 0.25) and looking only in the positive direction, as lack of expression is harder to interpret due to the sparsity of the data. The resulting DEGs were then filtered for adjusted P < 0.05 and sorted by fold change. All differential expression analyses were carried out using the ‘SCT’ assay.

Tumor subcluster pathway analysis

To demonstrate tumor heterogeneity in merged scRNA/snRNA data, we first took subsets of tumor cells from each individual case and renormalized with Seurat function ‘SCTransform’. We then found case-level clusters with Seurat functions ‘FindNeighbors’ and ‘FindClusters’ (top 20 PCA dimensions, resolution = 0.8). Clusters with fewer than 0.1% of total tumor cells across cases were excluded. For each case-level cluster, we found DEGs with function ‘FindAllMarkers’ with a minimum percentage (min.pct) of 0.1, a minimum percentage difference (min.diff.pct) of 0.1, positive log fold change and adjusted P < 0.05. For each DEG list, we ran an enrichment analysis using the function ‘enricher’ against the 50 MSigDB hallmark gene sets^32,101. The universe background for enrichment analysis was composed of genes detected in more than 0.1% of total tumor cells across cases. For each pathway, genes shown as enriched with adjusted P < 0.05 in any cluster were used to calculate the pathway score. Finally, in the merged tumor cell object, we calculated the average expression of genes identified in each pathway, centered and scaled across all clusters as the final score. To present pathways that distinguish tumor clusters the most, we ranked tumor cell-related pathways by their occurrence shown as enriched significantly (adjusted P < 0.05) in the enrichment analysis and plotted the most common pathways. The final heatmap was generated with the pheatmap package using the Optimal Leaf Ordering clustering method from the seriation package.

Receptor–ligand interactions

We used the CellPhoneDB tool¹⁰² to detect significant pairs of receptor–ligand interactions between cell types. This comparison was done at the sample level using default parameters between tumor and lymphocyte cell types.

Monocle trajectory analysis

We used the Monocle3 tool (https://cole-trapnell-lab.github.io/monocle3/) to infer cell-type transition states among acinar, transitional, PanIN and normal ductal populations. Objects and trajectory mapping were obtained by following tutorials outlined by developers (https://cole-trapnell-lab.github.io/monocle3/).

CopyKAT

To predict copy number ploidy without tumor annotation we utilized CopyKAT (https://github.com/navinlabcode/copykat) and followed the standard tutorial to define populations of aneuploid tumor cells.

Spatial transcriptomics data preprocessing

For each sample, we obtained the unfiltered feature-barcode matrix per sample by passing the demultiplexed FASTQs and associated H&E image to Space Ranger v.1.1.0 ‘count’ command using default parameters with reorient-images enabled, and the prebuilt GRCh38 genome reference 2020-A (GRCh38 and Ensembl 98). Seurat v.4.0.3 was used for all subsequent analyses. We constructed a Seurat object using the ‘Load10X_Spatial’ function for every slide. Each slide was then scaled and normalized with the ‘SCTransform’ function to correct for batch effects (with parameters: vars.to.regress = c(‘nCount_Spatial’)). Any merged analysis or subsequent subsetting of cells/samples for a sample with several slides underwent the same scaling and normalization method. Spots were clustered using the original Louvain algorithm⁹⁹ and top 20 PCA dimensions via ‘FindNeighbors’ and ‘FindClusters’ functions.

sc/snRNA-seq cell-type annotation

For spot-level cell-type assignment, we used the Seurat functions ‘FindTransferAnchors’ and ‘TransferData’ to perform a cell-type label transfer from the paired snRNA-seq annotations to the spatial transcriptomics spots. For further resolution, we used RCTD to deconvolve cell types within a given spot³⁵. We used the default parameters in RCTD using the ‘multi’ mode and a minimum of 25 nuclei for each cell-type identity to deconvolve; https://github.com/dmcable/spacexr.

Manual spot selection

In select samples with PanIN identified and a tumor or normal ductal structure within the capture area, we annotated the spatial transcriptomics spots using the Loupe Browser 5.0 and the lasso tool to manually select and annotate groups of spots. Annotated spots were then used to annotate the UMAP object; then, annotated spots were subsetted from the full object and DEGs were calculated using Seurat (FindAllMarkers).

DNA and RNA sample quality control

Bulk sequencing data quality metrics (adaptor content, mapping quality, coverage and swaps/mislabeling) were determined for DNA and RNA bams using our in-house pipeline SeqQEst. The inclusion criteria for paired DNA and RNA bams with sufficient coverage was >50× coding region coverage in WES or >50 Mb mapped depth in RNA-seq data.

RNA quantification

We used our in-house bulk RNA-seq expression analysis pipeline for quantification. Briefly, for each sample, the raw sequence reads were aligned into BAM files using STAR¹⁰³ (v.2.7.4a) two-pass alignment with GRCh38 as the reference. The resulting BAM files were then quantified as a raw count-matrix using read feature counts using Subread¹⁰⁴ (v.2.0.1). For both alignment and quantification, gene annotations were based on Gencode v.34. The raw counts were then converted to FPKM-UQ based on GDC’s formula (https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/#upper-quartile-fpkm) and then log₂ transformed with 1 pseudocount.

Proteomic and phosphoproteomics quantification

Proteomic data processing followed the methods detailed by Clark et al.¹⁰⁵. Briefly, raw mass spectrometry files were converted into open mzML format, then searched using the MSFragger database against a RefSeq protein sequence database appended with an equal number of decoy sequences. The specific parameters and software are detailed in the Clark et al. 2020 study. We then used the ComBat function from the R sva package to correct for TMT batch effects¹⁰⁶.

Pathway analysis

For each comparison, we obtained the top 30 genes ranked by highest fold change that are significantly different between the comparison groups (FDR < 0.05). We used ConsensusPathDB-human for gene set over-representation analysis¹⁰⁷.

Statistics and reproducibility

Relevant statistics are referred to in each of the associated methods sections. We did not use statistical methods to predetermine a sample size and patients were not randomly selected, as they were enrolled as they passed through the clinic. We excluded samples that did not pass sample prep quality control. For all immunofluorescence imaging, at least three regions of each sample were assayed, but immunofluorescence staining was not repeated for the same sample sections.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All human sequencing and imaging data have been deposited via the Human Tumor Atlas Network (HTAN) dbGaP Study Accession: phs002371.v1.p1 (https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs002371.v1.p1). In addition, all data have been deposited to the HTAN Data Coordinating Center Data Portal at the National Cancer Institute: https://data.humantumoratlas.org/ (under the HTAN WUSTL Atlas). References (GRCh38 genome reference v3.0.0 and refdata-gex-mm10-2020-A) used for single-cell analysis of the human and mouse genomes, respectively, are available from public sources, as described in access scripts freely furnished by 10x Genomics: https://support.10xgenomics.com/single-cell-gene-expression/software/release-notes/build. Mouse single-cell RNA-seq data are freely available from the National Library of Medicine BioProject (https://www.ncbi.nlm.nih.gov/bioproject/) under accession: PRJNA835747. Data for single-cell integration from Peng et al.²⁴ were downloaded from the Genome Sequence Archive (PRJCA001063).

Code availability

The code for the mutation mapping from bulk to single cells can be found at: https://github.com/ding-lab/10Xmapping. Code for inferCNV post processing can be found at: https://github.com/ding-lab/infer_cnv_postprocesssing.git. Code for germline wrapper variant calling can be found at: https://github.com/ding-lab/germlinewrapper.

References

Siegel, R. L., Miller, K. D., Fuchs, H. E. & Jemal, A. Cancer statistics, 2022. CA Cancer J. Clin. 72, 7–33 (2022).
Article PubMed Google Scholar
McGuigan, A. et al. Pancreatic cancer: a review of clinical diagnosis, epidemiology, treatment and outcomes. World J. Gastroenterol. 24, 4846–4861 (2018).
Article PubMed PubMed Central Google Scholar
Saad, A. M., Turk, T., Al-Husseini, M. J. & Abdel-Rahman, O. Trends in pancreatic adenocarcinoma incidence and mortality in the United States in the last four decades; a SEER-based study. BMC Cancer 18, 688 (2018).
Article PubMed PubMed Central Google Scholar
Rawla, P., Sunkara, T. & Gaduputi, V. Epidemiology of pancreatic cancer: global trends, etiology and risk factors. World J. Oncol. 10, 10–27 (2019).
Article PubMed PubMed Central Google Scholar
Ilic, M. & Ilic, I. Epidemiology of pancreatic cancer. World J. Gastroenterol. 22, 9694–9705 (2016).
Article PubMed PubMed Central Google Scholar
Viale, P. H. The American Cancer Society’s facts & figures: 2020 edition. J. Adv. Pract. Oncol. 11, 135–136 (2020).
PubMed PubMed Central Google Scholar
Conroy, T. et al. FOLFIRINOX or gemcitabine as adjuvant therapy for pancreatic cancer. N. Engl. J. Med. 379, 2395–2406 (2018).
Article CAS PubMed Google Scholar
Kang, J. et al. Nab-paclitaxel plus gemcitabine versus FOLFIRINOX as the first-line chemotherapy for patients with metastatic pancreatic cancer: retrospective analysis. Invest. New Drugs 36, 732–741 (2018).
Article CAS PubMed Google Scholar
Morrison, A. H., Byrne, K. T. & Vonderheide, R. H. Immunotherapy and prevention of pancreatic cancer. Trends Cancer 4, 418–428 (2018).
Article CAS PubMed PubMed Central Google Scholar
Balachandran, V. P., Beatty, G. L. & Dougan, S. K. Broadening the impact of immunotherapy to pancreatic cancer: challenges and opportunities. Gastroenterology 156, 2056–2072 (2019).
Article CAS PubMed Google Scholar
Raphael, B. J. et al. Integrated genomic characterization of pancreatic ductal adenocarcinoma. Cancer Cell 32, 185–203 (2017).
Article Google Scholar
Moffitt, R. A. et al. Virtual microdissection identifies distinct tumor- and stroma-specific subtypes of pancreatic ductal adenocarcinoma. Nat. Genet. 47, 1168–1178 (2015).
Article CAS PubMed PubMed Central Google Scholar
Maurer, C. et al. Experimental microdissection enables functional harmonisation of pancreatic cancer subtypes. Gut 68, 1034–1043 (2019).
Article CAS PubMed Google Scholar
Elyada, E. et al. Cross-species single-cell analysis of pancreatic ductal adenocarcinoma reveals antigen-presenting cancer-associated fibroblasts. Cancer Discov. 9, 1102–1123 (2019).
Article CAS PubMed PubMed Central Google Scholar
Sahai, E. et al. A framework for advancing our understanding of cancer-associated fibroblasts. Nat. Rev. Cancer 20, 174–186 (2020).
Article CAS PubMed PubMed Central Google Scholar
Schnurr, M. et al. Strategies to relieve immunosuppression in pancreatic cancer. Immunotherapy 7, 363–376 (2015).
Article CAS PubMed Google Scholar
Looi, C.-K. et al. Therapeutic challenges and current immunomodulatory strategies in targeting the immunosuppressive pancreatic tumor microenvironment. J. Exp. Clin. Cancer Res. 38, 162 (2019).
Article PubMed PubMed Central Google Scholar
Uzunparmak, B. & Sahin, I. H. Pancreatic cancer microenvironment: a current dilemma. Clin. Transl. Med. 8, 2 (2019).
Article PubMed PubMed Central Google Scholar
Ren, B. et al. Tumor microenvironment participates in metastasis of pancreatic cancer. Mol. Cancer 17, 108 (2018).
Article PubMed PubMed Central Google Scholar
De La O, J.-P. et al. Notch and Kras reprogram pancreatic acinar cells to ductal intraepithelial neoplasia. Proc. Natl Acad. Sci. USA 105, 18907–18912 (2008).
Article Google Scholar
Gidekel Friedlander, S. Y. et al. Context-dependent transformation of adult pancreatic cells by oncogenic K-Ras. Cancer Cell 16, 379–389 (2009).
Article PubMed PubMed Central Google Scholar
Habbe, N. et al. Spontaneous induction of murine pancreatic intraepithelial neoplasia (mPanIN) by acinar cell targeting of oncogenic Kras in adult mice. Proc. Natl Acad. Sci. USA 105, 18913–18918 (2008).
Article CAS PubMed PubMed Central Google Scholar
Tuveson, D. A. et al. Mist1-KrasG12D knock-in mice develop mixed differentiation metastatic exocrine pancreatic carcinoma and hepatocellular carcinoma. Cancer Res. 66, 242–247 (2006).
Article CAS PubMed Google Scholar
Peng, J. et al. Single-cell RNA-seq highlights intra-tumoral heterogeneity and malignant progression in pancreatic ductal adenocarcinoma. Cell Res. 29, 725–738 (2019).
Article CAS PubMed PubMed Central Google Scholar
Qadir, M. M. F. et al. Single-cell resolution analysis of the human pancreatic ductal progenitor cell niche. Proc. Natl Acad. Sci. USA 117, 10876–10887 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bernard, V. et al. Single-cell transcriptomics of pancreatic cancer precursors demonstrates epithelial and microenvironmental heterogeneity as an early event in neoplastic progression. Clin. Cancer Res. 25, 2194–2205 (2019).
Article CAS PubMed Google Scholar
Lin, W. et al. Single-cell transcriptome analysis of tumor and stromal compartments of pancreatic ductal adenocarcinoma primary tumors and metastatic lesions. Genome Med. 12, 80 (2020).
Article CAS PubMed PubMed Central Google Scholar
Raghavan, S. et al. Microenvironment drives cell state, plasticity, and drug response in pancreatic cancer. Cell 184, 6119–6137.e26 (2021).
Article CAS PubMed PubMed Central Google Scholar
Blobner, B. M. et al. Single-cell analyses of human pancreas: characteristics of two populations of acinar cells in chronic pancreatitis. Am. J. Physiol. Gastrointest. Liver Physiol. 321, G449–G460 (2021).
Article CAS PubMed Google Scholar
Tosti, L. et al. Single-nucleus and in situ RNA-sequencing reveal cell topographies in the human pancreas. Gastroenterology 160, 1330–1344.e11 (2021).
Article CAS PubMed Google Scholar
Rozenblatt-Rosen, O. et al. The Human Tumor Atlas Network: charting tumor transitions across space and time at single-cell resolution. Cell 181, 236–249 (2020).
Article CAS PubMed PubMed Central Google Scholar
Liberzon, A. et al. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst. 1, 417–425 (2015).
Article CAS PubMed PubMed Central Google Scholar
Yao, C.-H. et al. Mitochondrial fusion supports increased oxidative phosphorylation during cell proliferation. eLife 8, e41351 (2019).
Article PubMed PubMed Central Google Scholar
Kitajima, S., Thummalapalli, R. & Barbie, D. A. Inflammation as a driver and vulnerability of KRAS mediated oncogenesis. Semin. Cell Dev. Biol. 58, 127–135 (2016).
Article CAS PubMed PubMed Central Google Scholar
Cable, D. M. et al. Robust decomposition of cell type mixtures in spatial transcriptomics. Nat. Biotechnol. 40, 517–526 (2022).
Article CAS PubMed Google Scholar
Walker, C., Mojares, E. & Del Río Hernández, A. Role of extracellular matrix in development and cancer progression. Int. J. Mol. Sci. 19, 3028 (2018).
Article PubMed Central Google Scholar
Scott, A. D. et al. CharGer: clinical Characterization of Germline variants. Bioinformatics 35, 865–867 (2019).
Article CAS PubMed Google Scholar
Collisson, E. A. et al. Subtypes of pancreatic ductal adenocarcinoma and their differing responses to therapy. Nat. Med. 17, 500–503 (2011).
Article CAS PubMed PubMed Central Google Scholar
Bailey, P. et al. Genomic analyses identify molecular subtypes of pancreatic cancer. Nature 531, 47–52 (2016).
Article CAS PubMed Google Scholar
Aran, D., Hu, Z. & Butte, A. J. xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol. 18, 220 (2017).
Article PubMed PubMed Central Google Scholar
Yoshihara, K. et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 4, 2612 (2013).
Article PubMed Google Scholar
Valque, H., Gouyer, V., Gottrand, F. & Desseyn, J.-L. MUC5B leads to aggressive behavior of breast cancer MCF7 cells. PLoS ONE 7, e46699 (2012).
Article CAS PubMed PubMed Central Google Scholar
Niknami, Z., Eslamifar, A., Emamirazavi, A., Ebrahimi, A. & Shirkoohi, R. The association of vimentin and fibronectin gene expression with epithelial-mesenchymal transition and tumor malignancy in colorectal carcinoma. EXCLI J. 16, 1009–1017 (2017).
PubMed PubMed Central Google Scholar
Zhang, Z., Wang, Y., Zhang, J., Zhong, J. & Yang, R. COL1A1 promotes metastasis in colorectal cancer by regulating the WNT/PCP pathway. Mol. Med. Rep. 17, 5037–5042 (2018).
CAS PubMed PubMed Central Google Scholar
Kulemann, B. et al. Pancreatic cancer: circulating tumor cells and primary tumors show heterogeneous KRAS mutations. Sci. Rep. 7, 4510 (2017).
Article PubMed PubMed Central Google Scholar
Hashimoto, D. et al. Heterogeneity of KRAS mutations in pancreatic ductal adenocarcinoma. Pancreas 45, 1111–1114 (2016).
Article CAS PubMed Google Scholar
infercnv: Inferring CNV from Single-Cell RNA-Seq. v0.8.2 (Trinity CTAT Poject, 2020).
Wang, F. et al. MEDALT: single-cell copy number lineage tracing enabling gene discovery. Genome Biol. 22, 70 (2021).
Article CAS PubMed PubMed Central Google Scholar
Makohon-Moore, A. P. et al. Precancerous neoplastic cells can move through the pancreatic ductal system. Nature 561, 201–205 (2018).
Article CAS PubMed PubMed Central Google Scholar
Murphy, S. J. et al. Genetic alterations associated with progression from pancreatic intraepithelial neoplasia to invasive pancreatic tumor. Gastroenterology 145, 1098–1109.e1 (2013).
Article CAS PubMed Google Scholar
Kopp, J. L. et al. Identification of Sox9-dependent acinar-to-ductal reprogramming as the principal mechanism for initiation of pancreatic ductal adenocarcinoma. Cancer Cell 22, 737–750 (2012).
Article CAS PubMed PubMed Central Google Scholar
Storz, P. Acinar cell plasticity and development of pancreatic ductal adenocarcinoma. Nat. Rev. Gastroenterol. Hepatol. 14, 296–304 (2017).
Article CAS PubMed PubMed Central Google Scholar
Muraro, M. J. et al. A single-cell transcriptome atlas of the human pancreas. Cell Syst. 3, 385–394.e3 (2016).
Article CAS PubMed PubMed Central Google Scholar
Schlesinger, Y. et al. Single-cell transcriptomes of pancreatic preinvasive lesions and cancer reveal acinar metaplastic cells’ heterogeneity. Nat. Commun. 11, 4516 (2020).
Article CAS PubMed PubMed Central Google Scholar
Liu, X. et al. REG3A accelerates pancreatic cancer cell growth under IL-6-associated inflammatory condition: involvement of a REG3A-JAK2/STAT3 positive feedback loop. Cancer Lett. 362, 45–60 (2015).
Article CAS PubMed Google Scholar
Li, Q. et al. Reg proteins promote acinar-to-ductal metaplasia and act as novel diagnostic and prognostic markers in pancreatic ductal adenocarcinoma. Oncotarget 7, 77838–77853 (2016).
Article PubMed PubMed Central Google Scholar
Crnogorac-Jurcevic, T. et al. Molecular analysis of precursor lesions in familial pancreatic cancer. PLoS ONE 8, e54830 (2013).
Article CAS PubMed PubMed Central Google Scholar
Crawford, H. C., Scoggins, C. R., Washington, M. K., Matrisian, L. M. & Leach, S. D. Matrix metalloproteinase-7 is expressed by pancreatic cancer precursors and regulates acinar-to-ductal metaplasia in exocrine pancreas. J. Clin. Invest. 109, 1437–1444 (2002).
Article CAS PubMed PubMed Central Google Scholar
Wei, D. et al. KLF4 is essential for induction of cellular identity change and acinar-to-ductal reprogramming during early pancreatic carcinogenesis. Cancer Cell 29, 324–338 (2016).
Article CAS PubMed PubMed Central Google Scholar
Demir, I. E. et al. Early pancreatic cancer lesions suppress pain through CXCL12-mediated chemoattraction of Schwann cells. Proc. Natl Acad. Sci. USA 114, E85–E94 (2017).
Article CAS PubMed Google Scholar
Gharibi, A. et al. ITGA1 is a pre-malignant biomarker that promotes therapy resistance and metastatic potential in pancreatic cancer. Sci. Rep. 7, 10060 (2017).
Article PubMed PubMed Central Google Scholar
Thomas, R. M. et al. The chemokine receptor CXCR4 is expressed in pancreatic intraepithelial neoplasia. Gut 57, 1555–1560 (2008).
Article CAS PubMed Google Scholar
Tian, C. et al. Proteomic analyses of ECM during pancreatic ductal adenocarcinoma progression reveal different contributions by tumor and stromal cells. Proc. Natl Acad. Sci. USA 116, 19609–19618 (2019).
Article CAS PubMed PubMed Central Google Scholar
Ferreira, R. M. M. et al. Duct- and acinar-derived pancreatic ductal adenocarcinomas show distinct tumor progression and marker expression. Cell Rep. 21, 966–978 (2017).
Article CAS PubMed PubMed Central Google Scholar
Mallya, K. et al. Acinar transformed ductal cells exhibit differential mucin expression in a tamoxifen-induced pancreatic ductal adenocarcinoma mouse model. Biol. Open 9, bio052878 (2020).
Hegde, S. et al. Dendritic cell paucity leads to dysfunctional immune surveillance in pancreatic cancer. Cancer Cell 37, 289–307.e9 (2020).
Article CAS PubMed PubMed Central Google Scholar
Boggs, K. et al. Pancreatic gene expression during recovery after pancreatitis reveals unique transcriptome profiles. Sci. Rep. 8, 1406 (2018).
Article PubMed PubMed Central Google Scholar
Zhang, H. et al. REG3A/REG3B promotes acinar to ductal metaplasia through binding to EXTL3 and activating the RAS-RAF-MEK-ERK signaling pathway. Commun. Biol. 4, 688 (2021).
Article CAS PubMed PubMed Central Google Scholar
Helms, E., Kathrina Onate, M. & Sherman, M. H. Fibroblast heterogeneity in the pancreatic tumor microenvironment. Cancer Discov. 10, 648–656 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kraman, M. et al. Suppression of antitumor immunity by stromal cells expressing fibroblast activation protein-α. Science 330, 827–830 (2010).
Article CAS PubMed Google Scholar
Öhlund, D. et al. Distinct populations of inflammatory fibroblasts and myofibroblasts in pancreatic cancer. J. Exp. Med. 214, 579–596 (2017).
Article PubMed PubMed Central Google Scholar
Hosein, A. N., Brekken, R. A. & Maitra, A. Pancreatic cancer stroma: an update on therapeutic targeting strategies. Nat. Rev. Gastroenterol. Hepatol. 17, 487–505 (2020).
Article PubMed PubMed Central Google Scholar
Doz, F., Roosen, N. & Rosenblum, M. L. Metallothionein and anticancer agents: the role of metallothionein in cancer chemotherapy. J. Neurooncol. 17, 123–129 (1993).
Article CAS PubMed Google Scholar
Feng, M. et al. PD-1/PD-L1 and immunotherapy for pancreatic cancer. Cancer Lett. 407, 57–65 (2017).
Article CAS PubMed Google Scholar
Birnbaum, D. J. et al. Prognostic value of PDL1 expression in pancreatic cancer. Oncotarget 7, 71198–71210 (2016).
Article PubMed PubMed Central Google Scholar
Reches, A. et al. Nectin4 is a novel TIGIT ligand which combines checkpoint inhibition and tumor specificity. J. Immunother. Cancer 8, e000266 (2020).
Article PubMed PubMed Central Google Scholar
Gorvel, L. & Olive, D. Targeting the ‘PVR–TIGIT axis’ with immune checkpoint therapies. F1000Res. 9, 354 (2020).
Article CAS Google Scholar
Xu, Y., Liu, J., Nipper, M. & Wang, P. Ductal vs. acinar? Recent insights into identifying cell lineage of pancreatic ductal adenocarcinoma. Ann. Pancreat. Cancer 2, 11 (2019).
Article PubMed PubMed Central Google Scholar
Gieniec, K. A., Butler, L. M., Worthley, D. L. & Woods, S. L. Cancer-associated fibroblasts—heroes or villains? Br. J. Cancer 121, 293–302 (2019).
Article PubMed PubMed Central Google Scholar
Pereira, B. A. et al. CAF subpopulations: a new reservoir of stromal targets in pancreatic cancer. Trends Cancer Res. 5, 724–741 (2019).
Article Google Scholar
Biffi, G. et al. IL1-induced JAK/STAT signaling is antagonized by TGFβ to shape CAF heterogeneity in pancreatic ductal adenocarcinoma. Cancer Discov. 9, 282–301 (2019).
Article PubMed Google Scholar
Rodrigo, M. A. M. et al. Metallothionein isoforms as double agents – their roles in carcinogenesis, cancer progression and chemoresistance. Drug Resist. Updat. 52, 100691 (2020).
Article Google Scholar
Brahmer, J. R. et al. Safety and activity of anti-PD-L1 antibody in patients with advanced cancer. N. Engl. J. Med. 366, 2455–2465 (2012).
Article CAS PubMed PubMed Central Google Scholar
O’Reilly, E. M. et al. Durvalumab with or without tremelimumab for patients with metastatic pancreatic ductal adenocarcinoma: a phase 2 randomized clinical trial. JAMA Oncol. 5, 1431–1438 (2019).
Article PubMed PubMed Central Google Scholar
Royal, R. E. et al. Phase 2 trial of single agent Ipilimumab (anti-CTLA-4) for locally advanced or metastatic pancreatic adenocarcinoma. J. Immunother. 33, 828–833 (2010).
Article CAS PubMed PubMed Central Google Scholar
Bear, A. S., Vonderheide, R. H. & O’Hara, M. H. Challenges and opportunities for pancreatic cancer immunotherapy. Cancer Cell 38, 788–802 (2020).
Article CAS PubMed PubMed Central Google Scholar
Stromnes, I. M., Hulbert, A., Pierce, R. H., Greenberg, P. D. & Hingorani, S. R. T-cell localization, activation, and clonal expansion in human pancreatic ductal adenocarcinoma. Cancer Immunol. Res. 5, 978–991 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hingorani, S. R. et al. Trp53R172H and KrasG12D cooperate to promote chromosomal instability and widely metastatic pancreatic ductal adenocarcinoma in mice. Cancer Cell 7, 469–483 (2005).
Article CAS PubMed Google Scholar
Hosein, A. N. et al. Cellular heterogeneity during mouse pancreatic ductal adenocarcinoma progression at single-cell resolution. JCI Insight 5, e129212 (2019).
Article Google Scholar
Kim, S. et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat. Methods 15, 591–594 (2018).
Article CAS PubMed Google Scholar
Koboldt, D. C. et al. VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res. 22, 568–576 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ye, K., Schulz, M. H., Long, Q., Apweiler, R. & Ning, Z. Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads. Bioinformatics 25, 2865–2871 (2009).
Article CAS PubMed PubMed Central Google Scholar
Cibulskis, K. et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat. Biotechnol. 31, 213–219 (2013).
Article CAS PubMed PubMed Central Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).
Article CAS PubMed PubMed Central Google Scholar
1000 Genomes Project Consortium et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article Google Scholar
Butler, A., Hoffman, P., Smibert, P., Papalexi, E. & Satija, R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411–420 (2018).
Article CAS PubMed PubMed Central Google Scholar
Hafemeister, C. & Satija, R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 20, 296 (2019).
Article CAS PubMed PubMed Central Google Scholar
Blondel, V. D., Guillaume, J.-L., Lambiotte, R. & Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. 2008, P10008 (2008).
Article Google Scholar
Chang, M. T. et al. Accelerating discovery of functional mutant alleles in cancer. Cancer Discov. 8, 174–183 (2018).
Article CAS PubMed Google Scholar
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
Article CAS PubMed PubMed Central Google Scholar
Efremova, M., Vento-Tormo, M., Teichmann, S. A. & Vento-Tormo, R. CellPhoneDB: inferring cell-cell communication from combined expression of multi-subunit ligand-receptor complexes. Nat. Protoc. 15, 1484–1506 (2020).
Article CAS PubMed Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Article CAS PubMed Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote. Nucleic Acids Res. 41, e108 (2013).
Article PubMed PubMed Central Google Scholar
Clark, D. J. et al. Integrated proteogenomic characterization of clear cell renal cell carcinoma. Cell 180, 207 (2020).
Article CAS PubMed Google Scholar
Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E. & Storey, J. D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28, 882–883 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kamburov, A., Stelzl, U., Lehrach, H. & Herwig, R. The ConsensusPathDB interaction database: 2013 update. Nucleic Acids Res. 41, D793–D800 (2013).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank the patients, staff and scientists who contributed to this study as well as the NCI and the Human Tumor Atlas Network (HTAN) consortium. All HTAN consortium members are named at humantumoratlas.org. We also thank the Siteman Cancer Center and the McDonnell Genome Institute for their support. The following grants supported this work: grant no. U2CCA233303 to L.D., R.C.F., W.E.G. and S.T.O.; grant no. U24CA211006 to L.D.; grant no. U24CA209837 to K.I.S.; grant no. R01HG009711 to L.D. and F.C.

Author information

These authors contributed equally: Daniel Cui Zhou, Reyka G. Jayasinghe.
These authors jointly supervised this work: David G. DeNardo, Ryan C. Fields, Li Ding.

Authors and Affiliations

Department of Medicine, Washington University in St Louis, St Louis, MO, USA
Daniel Cui Zhou, Reyka G. Jayasinghe, Siqi Chen, Michael D. Iglesia, Pooja Navale, Wagma Caravan, Kazuhito Sato, Erik Storrs, Chia-Kuei Mo, Jingxian Liu, Austin N. Southard-Smith, Yige Wu, Nataly Naser Al Deen, John M. Baer, Matthew A. Wyczalkowski, Ruiyang Liu, Andrew Shinkle, Lisa Thammavong, Houxiang Zhu, Hua Sun, Liang-Bo Wang, Yize Li, Chong Zuo, Joshua F. McMichael, Xiaolu Yang, Ashley N. Reeb, Clara Oh, Mamatha Serasanambati, Preet Lal, Rajees Varghese, Jay R. Mashl, Nadezhda V. Terekhanova, Lijun Yao, Rita Jui-Hsien Lu, Sheng-Kwei Song, Kooresh I. Shoghi, Samuel Achilefu, Milan G. Chheda, Stephen T. Oh, Feng Chen, David G. DeNardo & Li Ding
McDonnell Genome Institute, Washington University in St Louis, St Louis, MO, USA
Daniel Cui Zhou, Reyka G. Jayasinghe, Siqi Chen, Michael D. Iglesia, Michael C. Wendl, Wagma Caravan, Kazuhito Sato, Erik Storrs, Chia-Kuei Mo, Jingxian Liu, Austin N. Southard-Smith, Yige Wu, Nataly Naser Al Deen, Robert S. Fulton, Matthew A. Wyczalkowski, Ruiyang Liu, Catrina C. Fronick, Lucinda A. Fulton, Andrew Shinkle, Lisa Thammavong, Houxiang Zhu, Hua Sun, Liang-Bo Wang, Yize Li, Joshua F. McMichael, Elizabeth L. Appelbaum, Clara Oh, Mamatha Serasanambati, Preet Lal, Rajees Varghese, Jay R. Mashl, Jennifer Ponce, Nadezhda V. Terekhanova, Lijun Yao, Rita Jui-Hsien Lu & Li Ding
Department of Surgery, Washington University in St Louis, St Louis, MO, USA
John M. Herndon, Sherri R. Davies, Keenan J. Robbins, Sara E. Chasnoff, William G. Hawkins, William E. Gillanders & Ryan C. Fields
Siteman Cancer Center, Washington University in St Louis, St Louis, MO, USA
John M. Herndon, Keenan J. Robbins, Julie K. Schwarz, Albert H. Kim, William G. Hawkins, Milan G. Chheda, William E. Gillanders, David G. DeNardo, Ryan C. Fields & Li Ding
Department of Pathology and Immunology, Washington University in St Louis, St Louis, MO, USA
Pooja Navale, John M. Baer, Stephen T. Oh & David G. DeNardo
Department of Genetics, Washington University in St Louis, St Louis, MO, USA
Michael C. Wendl & Li Ding
Department of Mathematics, Washington University in St Louis, St Louis, MO, USA
Michael C. Wendl
Department of Otolaryngology–Head & Neck Surgery, Washington University in St Louis, St Louis, MO, USA
Ashley N. Reeb & Sidharth V. Puram
Department of Bioinformatics and Computational Biology, University of Texas MD Anderson Cancer Center, Houston, TX, USA
Fang Wang & Ken Chen
Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Lijun Chen, Michael Schnaubelt & Hui Zhang
Department of Radiation Oncology, Washington University in St Louis, St Louis, MO, USA
Julie K. Schwarz
Department of Cell Biology and Physiology, Washington University in St Louis, St Louis, MO, USA
Julie K. Schwarz
Department of Neurological Surgery, Washington University in St Louis, St Louis, MO, USA
Albert H. Kim
Department of Radiology, Washington University in St Louis, St Louis, MO, USA
Sheng-Kwei Song, Kooresh I. Shoghi & Samuel Achilefu
Department of Cell and Developmental Biology and Epithelial Biology Center, Vanderbilt University School of Medicine, Vanderbilt, TN, USA
Ken S. Lau
Department of Computer Science and Engineering, Washington University in St Louis, St Louis, MO, USA
Tao Ju
Department of Pathology, University of Texas MD Anderson Cancer Center, Houston, TX, USA
Deyali Chatterjee

Authors

Daniel Cui Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Reyka G. Jayasinghe
View author publications
You can also search for this author in PubMed Google Scholar
Siqi Chen
View author publications
You can also search for this author in PubMed Google Scholar
John M. Herndon
View author publications
You can also search for this author in PubMed Google Scholar
Michael D. Iglesia
View author publications
You can also search for this author in PubMed Google Scholar
Pooja Navale
View author publications
You can also search for this author in PubMed Google Scholar
Michael C. Wendl
View author publications
You can also search for this author in PubMed Google Scholar
Wagma Caravan
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhito Sato
View author publications
You can also search for this author in PubMed Google Scholar
Erik Storrs
View author publications
You can also search for this author in PubMed Google Scholar
Chia-Kuei Mo
View author publications
You can also search for this author in PubMed Google Scholar
Jingxian Liu
View author publications
You can also search for this author in PubMed Google Scholar
Austin N. Southard-Smith
View author publications
You can also search for this author in PubMed Google Scholar
Yige Wu
View author publications
You can also search for this author in PubMed Google Scholar
Nataly Naser Al Deen
View author publications
You can also search for this author in PubMed Google Scholar
John M. Baer
View author publications
You can also search for this author in PubMed Google Scholar
Robert S. Fulton
View author publications
You can also search for this author in PubMed Google Scholar
Matthew A. Wyczalkowski
View author publications
You can also search for this author in PubMed Google Scholar
Ruiyang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Catrina C. Fronick
View author publications
You can also search for this author in PubMed Google Scholar
Lucinda A. Fulton
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Shinkle
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Thammavong
View author publications
You can also search for this author in PubMed Google Scholar
Houxiang Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Hua Sun
View author publications
You can also search for this author in PubMed Google Scholar
Liang-Bo Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yize Li
View author publications
You can also search for this author in PubMed Google Scholar
Chong Zuo
View author publications
You can also search for this author in PubMed Google Scholar
Joshua F. McMichael
View author publications
You can also search for this author in PubMed Google Scholar
Sherri R. Davies
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth L. Appelbaum
View author publications
You can also search for this author in PubMed Google Scholar
Keenan J. Robbins
View author publications
You can also search for this author in PubMed Google Scholar
Sara E. Chasnoff
View author publications
You can also search for this author in PubMed Google Scholar
Xiaolu Yang
View author publications
You can also search for this author in PubMed Google Scholar
Ashley N. Reeb
View author publications
You can also search for this author in PubMed Google Scholar
Clara Oh
View author publications
You can also search for this author in PubMed Google Scholar
Mamatha Serasanambati
View author publications
You can also search for this author in PubMed Google Scholar
Preet Lal
View author publications
You can also search for this author in PubMed Google Scholar
Rajees Varghese
View author publications
You can also search for this author in PubMed Google Scholar
Jay R. Mashl
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Ponce
View author publications
You can also search for this author in PubMed Google Scholar
Nadezhda V. Terekhanova
View author publications
You can also search for this author in PubMed Google Scholar
Lijun Yao
View author publications
You can also search for this author in PubMed Google Scholar
Fang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Lijun Chen
View author publications
You can also search for this author in PubMed Google Scholar
Michael Schnaubelt
View author publications
You can also search for this author in PubMed Google Scholar
Rita Jui-Hsien Lu
View author publications
You can also search for this author in PubMed Google Scholar
Julie K. Schwarz
View author publications
You can also search for this author in PubMed Google Scholar
Sidharth V. Puram
View author publications
You can also search for this author in PubMed Google Scholar
Albert H. Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sheng-Kwei Song
View author publications
You can also search for this author in PubMed Google Scholar
Kooresh I. Shoghi
View author publications
You can also search for this author in PubMed Google Scholar
Ken S. Lau
View author publications
You can also search for this author in PubMed Google Scholar
Tao Ju
View author publications
You can also search for this author in PubMed Google Scholar
Ken Chen
View author publications
You can also search for this author in PubMed Google Scholar
Deyali Chatterjee
View author publications
You can also search for this author in PubMed Google Scholar
William G. Hawkins
View author publications
You can also search for this author in PubMed Google Scholar
Hui Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Achilefu
View author publications
You can also search for this author in PubMed Google Scholar
Milan G. Chheda
View author publications
You can also search for this author in PubMed Google Scholar
Stephen T. Oh
View author publications
You can also search for this author in PubMed Google Scholar
William E. Gillanders
View author publications
You can also search for this author in PubMed Google Scholar
Feng Chen
View author publications
You can also search for this author in PubMed Google Scholar
David G. DeNardo
View author publications
You can also search for this author in PubMed Google Scholar
Ryan C. Fields
View author publications
You can also search for this author in PubMed Google Scholar
Li Ding
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.D., R.C.F., D.G.D., W.E.G., F.C. and S.T.O. conceived and designed the study. R.G.J., J.M.H., C.C.F., L.T., E.L.A., S.E.C., X.Y., K.S., A.N.R., J.P., L.C., F.C., R.S.F., C.C.F., R.V., M. Serasanambati, K.J.R., C.O., L.C., S.V.P., D.C., W.G.H., S.C., N.N., A.S., S.C., W.C., J.M.B. and P.L. developed and performed experiments or data collection. D.C.Z., R.G.J., E.S., C.M., Y.W., N.N., M.A.W., L.W., Y.L., C.Z., R.L., N.V.T., H. Zhu, H.S., F.W., M. Schnaubelt, H. Zhang, J.L., A.N.S., M.C.W. and R.J.L. performed computation and statistical analyses. D.C.Z., R.G.J., E.S., J.M.H., C.Z., K.J.R., D.C., M.G.C., D.G.D., W.E.G., R.C.F., L.D., J.F.M., K.C., H. Zhang, L.Y., P.N., J.K.S., M.D.I., J.L., A.N.S. and A.S. performed data interpretation and biological analyses. D.C.Z., R.S.F., L.A.F., J.F.M., M.C.W., R.J.M., S.V.P., A.H.K., S.S., K.I.S., T.J., W.G.H., K.C., H. Zhu, D.C., M.G.C., S.A., D.G.D., S.T.O., F.C., W.E.G., R.C.F., L.D., N.V.T., N.N., M.D.I. and K.S.L. wrote, reviewed and edited the manuscript. L.D., R.C.F., W.E.G., F.C., S.T.O., S.R.D., E.L.A., L.A.F., C.C.F., R.S.F., J.P., A.H.K., K.I.S., T.J. and S.S. were responsible for project administration.

Corresponding authors

Correspondence to David G. DeNardo, Ryan C. Fields or Li Ding.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks Itai Yanai and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Details of Data Cohort Overview.

a) All 232,764 cells labeled by case ID. b) Same as A, but cells are labeled by cell type. c) scRNA cell type proportions across samples in the cohort. The bigger the circle, the higher the proportion. d). Spearman correlations of tumor estimates. The 95% confidence interval is shown. Top: histology vs scRNA, Middle: ABSOLUTE vs scRNA, Bottom: ABSOLUTE vs histology. e) Proteomics and phosphoproteomics PCAs labeled by case ID. f) Proteomics and phosphoproteomics PCAs labeled by TMT plex. g) Top: Genomic landscape of the cohort showing the top significantly mutated genes. The color scale denotes variant allele fraction (VAF) for each gene. The top bar plot indicates mutation burden for each sample. Bottom: Bulk omics overview of the cohort. The first row indicates germline mutation status followed by scRNA tumor fraction, immune subtype, relative scores of immune and stroma, and different PDAC subtypes (Moffitt, Collisson, Bailey) by piece.

Extended Data Fig. 2 Genomic Features in Heterogeneous KRAS Subpopulations.

a) Spearman correlation of scRNA estimates and ESTIMATE stroma score. b) Spearman correlation of scRNA estimates and ESTIMATE immune score. For panels A and B, the 95% confidence interval is shown. c) Top significant DEGs between specific KRAS hotspot mutations. Only cells with a mappable KRAS mutation were included in this analysis. d) KRAS mutations in tumor cells of 5 cases with multiple KRAS variants mapped. e) H&E images of the spatial samples in HT061P1. f) Arm and gene-level CNV events in HT061P1 mapped to different tumor clusters. g) Percent of mappable mutations or deep copy number amplifications and/or deletions in each sample for KRAS, TP53, CDKN2A, and SMAD4. Samples are grouped together by case and cases are separated by white lines. h) Protein-level pairwise spearman correlation between all 30 samples that underwent bulk proteomics. Boxed cases represent cases with high heterogeneity: HT064P1 in green, HT123P1 in red, and HT124P1 in purple. i) Cell type proportion distribution of the three heterogeneous cases from panel H. Cell types in dotted boxes represent substantial differences in cellular composition that likely underlie the observed heterogeneity.

Extended Data Fig. 3 Evaluating Transitional Populations in Published Studies.

a) Integration of downsampled cells from tumor samples from Peng et al.²⁴, and WashU samples. UMAP shows integrated single cells colored by cell type. Circled region indicates cells that are specific to the WashU Cohort predominantly made of up PanIN and ADM identified cells. b) Integration of downsampled cells from tumor samples from Peng et al., and WashU samples. UMAP shows integrated single cells colored by case. Cases indicated with ‘HT#P1’ are WashU samples while samples with ‘T#’ are from Peng et. al.

Extended Data Fig. 4 Combined Channel Immunofluorescence Images of ADM Samples.

a) Combined channel immunofluorescence staining across four samples HT288P1 (Adjacent Normal), HT190P1 (Tumor), HT122P1 (Tumor) and HT288P1 (Tumor). Amylase stains acinar cells (green), cytokeratin 19 stains ductal cells (red), Ki67 stains proliferating cells (white), and Hoechst stains nuclei (blue). For select sections, individual cells expressing both acinar and ductal markers indicating acinar to ductal metaplasia (ADM) are highlighted by the yellow triangle. Acinar cells are denoted with a yellow arrow. b) Combined channel immunofluorescence staining across two samples HT412P1 (Tumor) and c) HT434P1 (Tumor). Amylase stains acinar cells (green), cytokeratin 19 stains ductal cells (red), Ki67 stains proliferating cells (white), and Hoechst stains nuclei (blue). Cells exhibiting co-expression of Amylase and cytokeratin 19 are circled in white. Regions of the section with high acinar content, tumor content and ADM content are shown from top to bottom for each sample.

Extended Data Fig. 5 Single Cell Analysis of Mouse Model Validating Transitional Acinar Populations.

a) UMAP of acinar and ductal single cells from mouse model. Cells are colored by cell type. b) UMAP of GFP expression. Cells are colored by expression value. c) UMAP of acinar and ductal cells separated by mouse model from which cells are derived from. d) Selected gene expression markers of acinar and ductal genes across cell types. Each dot indicates expression of a given gene in an annotated cell cluster. The size indicates the percent of cells expressing that gene and the color is average expression. e) Violin plot showing the distribution of expression levels of Sox9 across each annotated cell type.

Extended Data Fig. 6 Evaluating Transitional Populations with Spatial Transcriptomics and Published Studies.

a) H&E images associated with each piece of tumor that underwent spatial transcriptomics processing. Regions on slides are highlighted based on pathology assisted review. Regions are indicated as tumor (Red), PanIN (Yellow), Normal Duct (Green), Pancreatitis (Blue) and Acinar (Purple).

Extended Data Fig. 7 CAF Subtypes.

a) UMAP of all fibroblast cells labeled by CAF subtype. b) Top DEGs and pathways across iCAFs, myCAFs, and apCAFs. c) CAV1 and CAV2 expression in CAF subtypes, tumor cells, and fibroblasts from NAT samples. d) CXCR4 and CXCL12 expression in CAF subtypes and tumor cells. e) HIF1A and NFE2L2 expression in CAF subtypes, tumor cells, macrophages, and monocytes. FDR < 0.0001 for macrophage and monocyte upregulation of NFE2L2 and HIF1A. Panels D-F include expression from all cells in the study of the given cell type and the boxplots show the median with 1.5x IQR whiskers.

Extended Data Fig. 8 Immune Cells in PDAC.

a) UMAP of myeloid and dendritic cells (DC) labeled by cell type. b) Myeloid and DC cell type marker expression. c) Expression of the Keap1-Nrf2 (NFE2L2) pathways genes in all myeloid, DC, and tumor cells in the study. The boxplots show the median with 1.5x IQR whiskers. d) UMAP of lymphocyte and NK cells labeled by cell type. e) Lymphocyte cell type marker expression. f) Cell type percentages of lymphocytes across treatment groups. g) Expression of heat shock genes across treatment groups in Treg and CD4 + T cells (cells from n = 26 FOLFIRINOX samples, n = 15 Gemcitabine + Nab-paclitaxel samples, n = 25 untreated samples). The boxplots show the median with 1.5x IQR whiskers. h) Pathway enrichment of FOLFIRINOX vs treatment-naïve samples in Treg and CD4 + T cells using gene set overrepresentation analysis. i) Average expression of genes in lymphocytes and tumor cells in the scRNA data. j) Average expression of TIGIT and nectin genes across cell types in the snRNA data.

Supplementary information

Supplementary Information

Supplementary Note and Fig. 1.

Reporting Summary

Supplementary Tables

Supplementary Tables 1–5.

Supplementary Data 1

Bulk omics data including somatic and germline variants and proteogenomics data.

Supplementary Data 2

CNVkit raw copy number calls across the sample set.

Supplementary Data 3

Total cell count of transitional cell populations and mutation mapping across samples.

Supplementary Data 4

Differentially expressed genes (DEGs) identified by annotating spatial transcriptomics spots using the Loupe Browser and Seurat. FindAllMarkers function from Seurat was used to identify DEGs.

Supplementary Data 5

Tumor purity estimates predicted by ABSOLUTE across sample cohort.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cui Zhou, D., Jayasinghe, R.G., Chen, S. et al. Spatially restricted drivers and transitional cell populations cooperate with the microenvironment in untreated and chemo-resistant pancreatic cancer. Nat Genet 54, 1390–1405 (2022). https://doi.org/10.1038/s41588-022-01157-1

Download citation

Received: 24 September 2021
Accepted: 13 July 2022
Published: 22 August 2022
Issue Date: September 2022
DOI: https://doi.org/10.1038/s41588-022-01157-1

This article is cited by

Tumor immune microenvironment-based therapies in pancreatic ductal adenocarcinoma: time to update the concept
- Wenyu Luo
- Ti Wen
- Xiujuan Qu
Journal of Experimental & Clinical Cancer Research (2024)
Reconstitution of human PDAC using primary cells reveals oncogenic transcriptomic features at tumor onset
- Yi Xu
- Michael H. Nipper
- Pei Wang
Nature Communications (2024)
The fibro-adipogenic progenitor APOD+DCN+LUM+ cell population in aggressive carcinomas
- Lingyi Cai
- Mikhail G. Kolonin
- Dimitris Anastassiou
Cancer and Metastasis Reviews (2024)
Integrative analysis of spatial and single-cell transcriptome data from human pancreatic cancer reveals an intermediate cancer cell population associated with poor prognosis
- Seongryong Kim
- Galam Leem
- Jong-Eun Park
Genome Medicine (2024)
Advances in spatial transcriptomics and related data analysis strategies
- Jun Du
- Yu-Chen Yang
- Jian Hou
Journal of Translational Medicine (2023)

Subjects

Abstract

Similar content being viewed by others

Main

Results

Study design and overview of the study cohort

PDAC tumor subclusters with distinct cellular functions

KRAS signaling and spatial drivers in pancreatic cancer

Transitional populations between acinar and tumor cells

Validation of ADM using snRNA-seq, immunohistochemistry and mouse models

Transitional populations in histological features by spatial transcriptomics

CAF subpopulations in PDAC TME

Immunosuppressive PDAC TME and treatment

Discussion

Methods

Specimens and clinical data

Sample processing

Genomic DNA and RNA extraction

WES

RNA-seq

Single-cell suspension preparation

Single-nuclei suspension preparation

Single-cell/nuclei library prep and sequencing

Spatial transcriptomics prep and sequencing

KPC-OG GEMM mouse model

Somatic variant calling

KRAS hotspot and within-case genotyping

Germline variant calling and annotation

Germline variant pathogenic classification

sc/snRNA-seq data preprocessing

sc/snRNA-seq cell-type annotation

Spatially distinct tumor cluster assignment

scVarScan mutation mapping

scVarScan statistics

Single-cell RNA CNV detection

Single-cell mutation and CNV plotting

Differential sc/snRNA expression analyses

Tumor subcluster pathway analysis

Receptor–ligand interactions

Monocle trajectory analysis

CopyKAT

Spatial transcriptomics data preprocessing

sc/snRNA-seq cell-type annotation

Manual spot selection

DNA and RNA sample quality control

RNA quantification

Proteomic and phosphoproteomics quantification

Pathway analysis

Statistics and reproducibility

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links