Abstract
Patients with non-small cell lung cancer (NSCLC) with epidermal growth factor receptor (EGFR) mutations exhibit an unfavorable response to PD-1 inhibitor through unclear mechanisms. Hypothesizing that EGFR mutations alter tumor-immune interactions, we compare tumor-infiltrating lymphocytes between EGFR mutant (EGFR-MT) and wild type (EGFR-WT) tumors through single-cell transcriptomic analysis. We find that B cells, CXCL13-producing follicular helper CD4+ T (TFH)-like cells, and tissue-resident memory CD8+ T (TRM)-like cells decreased in EGFR-MT tumors. The NOTCH-RBPJ regulatory network, which is vital for persistence of TRM state, is perturbed, and the interactions between TFH and B cells through the CXCL13-CXCR5 axis disappear in EGFR-MT tumors. Notably, the proportion of TRM-like cells is predictive for anti-PD-1 response in NSCLC. Our findings suggest that the impairment of TFH-B-TRM cooperation in tertiary lymphoid structure formation, accompanied by the dysregulation of TRM homeostasis and the loss of TFH-B crosstalk, underlies unfavorable anti-PD-1 response in EGFR-MT lung tumors.
Similar content being viewed by others
Introduction
Epidermal growth factor receptor (EGFR) mutations are major drivers of oncogenic alterations that account for an important molecular subtype of lung adenocarcinoma, comprising 40–60% of cases in the South-East Asian population and 10–20% among Caucasian patients1. Patients with lung cancer with sensitizing-EGFR mutations clinically benefit from EGFR-tyrosine kinase inhibitors (TKIs) as first-line therapies1,2,3. However, acquired resistance inevitably develops within 9–18 months, which is the main limitation to the effectiveness of EGFR-TKIs1,2,3.
Programmed death receptor-1 (PD-1) inhibitors improve survival outcomes by offering a durable response in patients with non-small cell lung cancer (NSCLC)4,5,6,7. Accordingly, these inhibitors are used as first- and second-line therapies for NSCLC4,5,6,7. With the increase in the number of checkpoint inhibitor-based clinical trials being conducted, interest in integrating immunotherapy for patients with EGFR-mutant type (EGFR-MT) lung cancer has also increased. However, recent large-scale clinical trials and meta-analysis data have revealed unfavorable treatment outcomes of these EGFR-MT patients compared to those with EGFR wild-type (EGFR-WT), indicating that patients with EGFR-MT do not benefit from PD-1 inhibitors8,9,10. In addition, pembrolizumab for the treatment of PD-L1-positive EGFR-TKI naive patients with sensitizing-EGFR mutation did not show objective response benefit11. A recent clinical trial study reported that a combination of an anti-PD-L1 inhibitor, atezolizumab, with bevacizumab and chemotherapy showed a survival benefit in pretreated EGFR-MT patients who had failed TKI12. Determining the mechanisms underlying the lower efficacy of PD-1 inhibitors in patients with EGFR-MT than EGFR-WT can guide the development of treatment strategies to improve the survival outcomes of these patients.
We predicted that the TME of EGFR-MT tumors differs from that of EGFR-WT tumors in patients with NSCLC. However, previous studies using immunohistochemistry (IHC) approaches did not comprehensively compare the TMEs of the two lung cancer subtypes. Single-cell RNA sequencing (scRNA-seq) enables high-resolution dissection of cellular heterogeneity of solid tissues, including tumors. A systematic comparison of single-cell transcriptome profiles between EGFR-WT and EGFR-MT tumors can reveal immune cell subsets that differentially accumulate between groups of molecular subtypes in NSCLC tumors, some of which may provide important clues regarding their difference in clinical outcomes. Additionally, identifying biomarkers and therapeutic targets can improve anti-PD-1 immunotherapy. Although multiple scRNA-seq studies have depicted immune cell landscapes of lung cancers13,14,15,16, EGFR-WT and EGFR-MT tumors have not been systematically compared.
In this work, we compare the single-cell transcriptome of tumor-infiltrating immune cells between the two molecular subtypes of lung cancer and seek key cell subsets and their molecular features associated with EGFR mutation. We find that B cells, CXCL13-producing follicular helper CD4+ T (TFH)-like cells, and tissue-resident memory CD8+ T (TRM)-like cells are depleted in EGFR-MT compared to EGFR-WT. Homeostasis of TRM cells and CXCL13–CXCR5 axis-mediated crosstalk between TFH and B cells are dysregulated in EGFR-MT tumors. Finally, the proportion of cytotoxic TRM-like cells estimated by the expression signature reveals a highly predictive value for the anti-PD-1 response. Altogether, these results suggest the importance of TFH–B–TRM lymphocyte cooperation in the formation of tertiary lymphoid structure (TLS) for antitumor immunity. Furthermore, its impairment in EGFR-MT tumor results in an unfavorable response to anti-PD-1 immunotherapy.
Results
Comparative scRNA-seq analysis reveals immune cell subsets differentially accumulated between EGFR-mutant and wild-type tumors in NSCLC
To compare the cellular landscape between two different molecular subtypes of lung cancer based on the EGFR mutation status, we prepared tumor specimens surgically resected from 10 patients with NSCLC in stages IA–IIIA: five EGFR-WT and five EGFR-MT (Supplementary Data 1). Focusing on the landscape of tumor-infiltrating immune cells, we sorted CD45+ leukocytes from tumor tissues and conducted scRNA-seq using 10X Genomics technology for individual patients (Fig. 1a and Supplementary Fig. S1). A total of 33,719 cells were sequenced, 31,887 of which passed quality control (Supplementary Data 1). We preprocessed scRNA-seq data using Seurat v.3.0 package17 and visualized them through UMAP dimension reduction18. Systematic bias was observed among data sets across different patients (Fig. 1b). To correct batch effects, scRNA-seq data were integrated using Seurat v.3.0, which is based on canonical correlation analysis and anchoring data sets between samples17. After batch correction, the homogeneous population structures of patients and mutation status were reduced in the latent space by UMAP dimensions (Fig. 1c). Using the Louvain algorithm of network community detection, we defined 18 clusters (C0–17) corresponding to distinct cell subpopulations (Fig. 1d). Based on the expression of canonical marker genes (Supplementary Fig. S2), we annotated clusters for major cell types such as CD8+ T cell (C0, C2, C11, and C14) (Supplementary Figs. S3 and 4), CD4+ T cell (C1, C3, C5, C6, and C9) (Supplementary Figs. S3 and 4), regulatory T cell (C4) (Supplementary Figs. S3 and 4), natural killer (NK) cell (C13) (Supplementary Fig. S5), natural killer T (NKT) cell (C7, C10, and C12) (Supplementary Fig. S5), B cell (C8) (Supplementary Fig. S6), macrophage and monocyte (C15) (Supplementary Fig. S7), dendritic cell (DC) (C16) (Supplementary Fig. S8), and innate lymphoid cell 2 (ILC2) (C17) (Supplementary Fig. S9). Many clusters apparently had unequal cell distributions between EGFR-WT and EGFR-MT (Fig. 1e), indicating differences in the composition of tumor-infiltrating immune cells between the two molecular subtypes of NSCLC. The majority of the clusters were composed of cells derived from most of the 10 patients without severe bias toward a specific patient. However, the three smallest clusters—C15 (macrophage and monocyte), C16 (DC), and C17 (ILC2)—were mostly derived from patient 6 (P6) (Fig. 1f and Supplementary Data 2). Thus, we excluded these clusters from downstream analyses to avoid formulating hypotheses based on patient-biased data.
To investigate proportional changes across the 15 cell subsets (C0–C14) between EFGR-WT and EGFR-MT, we evaluated deviations in the observed cell count from the expected counts for each subset using Pearson residual (r). As the sign of the residual indicates the direction of the difference in the observed count from the expected count, r > 0 and r < 0 represent augmentation and depletion compared to the expected count, respectively. We considered only subsets with r > 3.5 and r < −3.5 as augmentation and depletion with high confidence, respectively, for follow-up functional interpretation (see “Methods” section for a detailed description of r threshold selection). We visualized the results of the goodness of fit test using a mosaic plot, in which subsets with a deviation of observed from expected cell counts are indicated in blue (augmentation), red (depletion), or gray (no significant change) (Fig. 1g). The goodness of fit for all subsets was also evaluated by the chi-square statistic (p-value of Fig. 1g). We observed above-threshold level augmentation or depletion (r > 3.5 or r < −3.5) of the cell count for EGFR-WT and EGFR-MT from eight subsets (Supplementary Data 2). We expected that a successful interpretation of differentially accumulated subsets between EGFR-WT and EGFR-MT would provide clues to understanding the mechanisms underlying unfavorable anti-PD-1 response in patients with EGFR-MT NSCLC. To characterize individual subsets for each major cell type (e.g., CD8+ T, CD4+ T, Treg, NK, NKT, and B cells), we identified differentially expressed genes (DEGs) of a subset compared to all other subsets for the same major cell type (subset-DEG) and DEGs between EGFR-WT and EGFR-MT for each subset (group-DEG) (Supplementary Data 3).
Proportion of cytotoxic NK cells decreases, whereas that of NKT cells with low cytotoxicity increases in EGFR-MT tumors
NK and NKT cells are cytotoxic lymphocytes of the innate immune system that provide the first line of defense in the body before activating the adaptive immune system. Although these cells have a limited range of antigen specificity and lower capacity for memory-generation, they can kill cancer cells directly via cytotoxic molecules19. The subset of cytotoxic NK cells (C13) was depleted in EGFR-MT, although its significance was not above the pre-determined threshold (r = −3.34) (Fig. 2a). Among the three subsets for NKT cells (C7, C10, and C12), the proportion of NKT-C7 and NKT-C10 was significantly augmented in EGFR-MT (r = 6.35 and 10.57, respectively; Fig. 2a). The subset-DEG analysis revealed that NK-C13 and NTK-C12, which were depleted and unchanged in EGFR-MT, respectively, expressed more genes involved in activation (TNFRSF18, IL2RB, CD7, CD44, TYROBP, and TNFRSF1B)20,21,22 (Fig. 2b) and cytotoxicity (PRF1 and GZMB)23 (Fig. 2c) than other subsets. In contrast, NKT-C7 and NKT-C10 proportion increased in EGFR-MT and these subsets expressed genes involved in either activation or cytotoxicity at lower levels than in other subsets. Overall, NKT-C10 is likely to have the lowest cytotoxic activity among all NK and NKT cell subsets, as the expression of genes involved in activation and cytotoxicity was lower than that in all other subsets (Fig. 2b, c), which is consistent with the highest augmentation in EGFR-MT (r = 10.57) (Fig. 2a). In summary, single-cell transcriptome analysis revealed the augmentation of NKT cell subsets with relatively low expression of genes involved in activation and cytotoxicity in EGFR-MT.
Proportion of B cells decreases with reduced antigen presentation in EGFR-MT tumors
B cells secrete antibodies to neutralize antigens or tag tumor cells for attack by other immune cells24. Recently, tumor-infiltrated B cells have been reported to promote the response to immunotherapy and survival in cancer patients through the formation of TLSs, in which B cells can present a tumor-derived neoantigen to T cells to activate them25,26,27,28. We observed a significant depletion of B cells in EGFR-MT compared to EGFR-WT (r = −6.23, Fig. 2d), suggesting that EGFR-MT tumor has a lower proportion of tumor-infiltrating B cells than EGFR-WT. In addition, we observed that B cells from EGFR-MT showed lower expression levels of HLA genes (MHC class I: HLA-C and MHC class II: HLA-DPB1, HLA-DRB5, and HLA-DQA2) and TAP1 (antigen peptide transporter involved in processing antigens that will be presented to MHC class I) (Fig. 2e). These results suggest that B cells in EGFR-MT tend to less effectively activate both CD8+ and CD4+ T cells than EGFR-WT, due to the lower expression of MHC class I and II, respectively.
Proportion of CXCL13-producing TFH-like cells decreases, whereas that of immunosuppressive CD4+ T cells increases in EGFR-MT tumors
Along with CD8+ T cells, CD4+ T cells are major lymphocytes governing adaptive immunity; they differentiate into diverse types of helper T cells that activate other immune cells, including cytotoxic T cells. Accordingly, various conventional CD4+ T cells are involved in antitumor or tumor-promoting processes, whereas CD4+ Treg suppresses antitumor immunity in the TME. We obtained six subsets of CD4+ T cells (Fig. 3a), which were characterized by subset-DEG analysis for various categories of immune signature genes (Fig. 3b). Among the six CD4+ subsets, the two largest ones showed significant augmentation in EGFR-MT (C1 and C3 by r = 8.48 and 7.66, respectively). We found that these subsets showed a high expression of genes involved in CD4+ T cell-mediated immune suppression (CD5229,30, ANXA131,32, ANXA233, and CXCR434). The C5 subset, for which the marker-based cellular identity was relatively unclear, showed augmentation in EGFR-MT with a subthreshold level of significance (r = 2.74). The C6 subset, annotated as naive CD4+ T cells based on the high expression of CCR7 and SELL, showed no significant difference in relative abundance between EGFR-WT and EGFR-MT.
Notably, the C9 subset was CD4+ T cells with the largest depletion in EGFR-MT (r = −7.37). We found several genes related to TFH cells (CXCL1335, ICOS36, MAF37, PDCD135, and SH2D1A35) among the subset-DEGs for C9 (Fig. 3b, c and Supplementary Data 3). TFH cells reportedly help B cells by promoting the maturation of antibody affinity in germinal centers (GC)35. PD-1 regulates GC positioning and helps with the function of TFH cells38. Human TFH cells produce CXCL13, a major chemoattractant in homing B cells to follicles35. CXCL13 has been associated with TLS development39,40. CXCR5, a receptor for CXCL13 and another hallmark of TFH cells, was not detected as a subset-DEG of C9. Moreover, the C9 subset does not express BCL6, a lineage-defining transcription factor of TFH cells35. These CXCR5 and BCL6-negative TFH-like cells that produce CXCL13 are found in local inflammatory sites41, where TLS development frequently occurs42. Given that the major role of the TFH cells is to recruit B cells, depletion of CXCL13-producing TFH-like cells in EGFR-MT may result in the inefficient formation of TLS. The observed depletion of tumor-infiltrating B cells (Fig. 2d) also suggests lower TLS development in EGFR-MT. Recently, B cells and TLS have been shown to be associated with responses to immunotherapy26,27,28. We also observed a substantially lower level of CXCL13 expression in EGFR-MT than in EGFR-WT tumor (Fig. 3d), which further supports the hypothesis that CXCL13-producing TFH-like cells contribute to the anti-PD-1 response through recruiting B cells to functional TLS in tumors. To confirm the reduced abundance of CXCL13+CD4+ T cells in EGFR-MT compared with EGFR-WT tumor, inferred from the single-cell transcriptome analysis, we performed the multiplexed immunofluorescence (IF) staining in archival tissue specimens of the same samples. We could obtain results of multiplexed IF for nine of the prior samples, because the sample quantity for the P3 patient was not sufficient for the downstream analysis. The results of multiplexed IF confirmed the reduced number of CD4+ and CXCL13+CD4+ T cells in EGFR-MT compared with EGFR-WT tumors (Fig. 3e, f and Supplementary Figs. S10, S11, Supplementary Data 4a). Although not statistically significant due to the small sample size, these results are congruous with the observations from the single-cell transcriptional analysis. To validate these findings further, we performed a similar analysis on a validation set of tumor samples from 19 independent NSCLC patients with EGFR-WT (n = 9) or EGFR-MT (n = 10) (Supplementary Data 5a). For this validation set with a larger sample size, we could observe significantly reduced frequencies of CXCL13+CD4+ T cells in EGFR-MT compared with EGFR-WT tumors for the validation set (Fig. 3g and Supplementary Fig. S12, Supplementary Data 4c).
The C4 subset was annotated as Treg cells based on the high expression signature genes of tumor-infiltrating Tregs (FOXP3, IL2RA, IL2RB, IL2RG, IL10RA, MAGEH1, LAYN, and GATA3). They also showed a high expression of genes involved in T cell activation (CD27, CD28, ICOS, and TNFRSF4 encoding OX40, TNFRSF9 encoding 41BB, and TNFRSF18 encoding GITR) and checkpoint molecules (TIGIT and CTLA4). Counterintuitively, we observed a depletion of the Treg subset in EGFR-MT (Fig. 3a). However, we also found that Treg cells of EGFR-MT tumors expressed genes known to promote Treg activity (TSC22D3 encoding GILZ43 and NR4A244) at levels significantly higher than those in EGFR-WT (Fig. 3h and Supplementary Data 3), which may compensate for its disadvantages in abundance.
Proportion of cytotoxic CD8+ TRM-like cells decreases in EGFR-MT tumors
CD8+ T cells were clustered into four different subsets (C0, C2, C11, and C14), among which, only C2 exhibited significant differences in its proportion between EGFR-WT and EGFR-MT, particularly depletion in EGFR-MT (r = −6.44) (Fig. 4a). We characterized each subset of CD8+ T cells based on the expression of various categories of signature genes (Fig. 4b) and found that the C2 subset had the highest cytotoxicity and expressed ITGAE (also known as CD103), a hallmark of TRM cells45. In addition, the C2 subset showed a significantly higher effector signature (ID2 and IFNG) and memory signature (CD7). TRM cells are a subset of effector memory T cells in the peripheral tissue, and their roles in antitumor immunity have recently gained considerable attention46,47. We also found that the C2 subset highly expressed the signature genes for tumor-infiltrating TRM-like CD8+ T cells such as PDCD1, HAVCR2, and LAG3, but rarely expressed TCF7, a lung TRM suppressor48,49,50 (Fig. 4b). Previously, HAVCR2 (also known as TIM-3) expression has been reported as a feature of lung tumor TRM cells that are not involved in exhaustion but rather reflect a functional state49. In this regard, we defined the C2 subset as TRM-like cells. As a fast-reactive memory T cell subset, TRM cells constitutively express perforin (PRF1), granzyme B (GZMB), and IFN-gamma (IFNG). We found that the tumor-infiltrating TRM-like subset expressed a high level of GZMB, whereas other subsets showed much lower expression, indicating the TRM-like subset as a major cytolytic lymphocyte in the tumor (Fig. 4c). High expression of ENTPD1 (also known as CD39) indicates that the C2 subset is composed of tumor-reactive TRM-like cells rather than bystander T cells51 (Fig. 4d). The expression of effector molecules and markers for naive/memory states suggest that C0 and C11 are effector CD8+ T cell subsets with lower cytotoxicity, and C14 is a naive T cell subset (Fig. 4e, f). Notably, we observed a higher expression of GZMB and genes involved in T cell activation in TRM-like cells from EGFR-WT than in those from EGFR-MT (Fig. 4g), indicating that TRM-like cells in EGFR-WT tumors have the highest antitumor activity for the direct killing of cancer cells.
We evaluated the protein expression of selected molecules (CD8, CD103, and GZMB) using multiplexed IF for the whole slides to confirm the presence of TRM-like cells in an archival tissue specimen. The abundance of CD8+, CD103+CD8+, and GZMB+ CD103+CD8+ T cells tended to be lower in EGFR-MT than EGFR-WT (Fig. 4h, i and Supplementary Figs. S10, 11, Supplementary Data 4a), consistent with the results based on the transcriptional level. For the validation set of tumor samples from 19 independent NSCLC patients (Supplementary Data 5a), we could observe a significantly lower abundance of CD8+ and CD103+CD8+ T cells in EGFR-MT compared with EGFR-WT (Fig. 4j and Supplementary Fig. S12, Supplementary Data 4c). For further validation, we performed a flow cytometry analysis using multiple antibodies, including anti-CD8, anti-CD103, and anti-GZMB for another validation set that consist of tumors from independent 35 NSCLC patients with EGFR-WT (n = 16) or EGFR-MT (n = 19) (Supplementary Data 5b). The TRM-like cell subset was highly enriched for cells with a high expression of both CD103 and GZMB (area within the solid circle) among CD8+ cells (area within the dotted circle) in EGFR-WT compared to EGFR-MT (Fig. 4k). The proportion of CD103+ cells among CD8+ cells and that of GZMB+ cells among CD8+CD103+ cells was significantly lower in EGFR-MT than EGFR-WT (Fig. 4l, m). Therefore, the depletion of CD8+ TRM-like cells in EGFR-MT lung tumors was confirmed by protein-based immunophenotyping analysis.
Homeostasis of TRM-like cells is impaired in EGFR-MT tumors
Hitherto our comparative single-cell transcriptome analysis between EGFR-WT and EGFR-MT revealed three lymphocyte subsets significantly depleted in lung tumors with EGFR mutations: B cells, CD4+ TFH-like cells, and CD8+ TRM-like cells. Given that patients with lung cancer with EGFR mutations exhibit unfavorable anti-PD-1 responses, regulation of cooperation among these lymphocyte subsets may be important for antitumor immunity in lung tumors. Based on the constitutively high expression of major cytolytic enzymes, such as perforin and GZMB, in TRM-like cells, the persistence of TRM cell state in a tumor may be a major determinant of the effective killing of cancer cells.
To evaluate the homeostasis of TRM-like cells in EGFR-WT and EGFR-MT tumors, we measured the transition probability between CD8+ subsets (C0, C2, C11, and C14; Fig. 5a) based on RNA velocity analysis using velocyto52, which can be estimated from the ratio of unspliced to spliced mRNA. Based on the probability of future mRNA abundance for all genes, we can predict the future state (subset) of each cell (Fig. 5b). For example, the mean transition probability from C2 to C0 was higher in EGFR-MT than in EGFR-WT, whereas that from C2 to C2 (that is, the persistence of C2 state) was higher in EGFR-WT than in EGFR-MT (Fig. 5c). We compared and calculated the transition probability among four subsets, C0, C2, C11, and C14, between EGFR-WT and EGFR-MT tumors to identify the changes in homeostasis. Notably, C2 (TRM-like cells) derived from EGFR-WT and EGFR-MT showed probabilities of 0.57 and 0.41 for persistence, respectively (Fig. 5d and Supplementary Data 6). In contrast, all other subsets showed similar probabilities for persistence between EGFR-WT and EGFR-MT: 0.77 vs. 0.76 for C0, 0.16 vs. 0.16 for C11, and 0.042 vs. 0.034 for C14. We also performed scVelo53 analysis that enables estimation of RNA velocity without steady-state assumption and observed consistent results (Supplementary Fig. S13). These results suggest that the persistence of the TRM cell state is impaired in EGFR-MT lung tumors.
NOTCH-RBPJ regulatory network for maintaining TRM cells is perturbed in EGFR-MT tumors
Transcriptional regulatory networks tightly control differentiation and functional maintenance of immune cells. We hypothesized that the dysregulation of the transcriptional regulatory network for homeostasis in the TRM cell state occurs in EGFR-MT lung tumors. To reconstruct the key regulatory network, we first identified differentially upregulated transcription factors (TFs) in the C2 subset compared to other CD8+ cell subsets (Fig. 5e). Among the upregulated TFs, ID2, RBPJ, and BHLHE40 are known to be involved in TRM cell development and homeostasis54,55,56. Particularly, RBPJ is associated with NOTCH to form a transcriptional activator required for maintaining TRM cells in humans55. Next, we compared the co-regulatory network of RBPJ between EGFR-WT and EGFR-MT to identify the gain or loss of regulatory targets for RBPJ between the two molecular subtypes of lung tumors. To improve the robustness of gene-gene correlation analysis for highly sparse scRNA-seq data, we employed the MetaCell57 analysis method, which partitions single-cell transcriptome profiles into disjoint and homogeneous groups of profiles. As expected, we observed a high correlation of expression between RBPJ and many known genes for TRM cell differentiation and maintenance45,46,47 in C2 subset cells from EGFR-WT tumors (Fig. 5f). Additionally, NOTCH1 showed a high correlation with the same regulatory target genes, although it was not significantly upregulated in the C2 subset compared with other CD8+ cell subsets. Interestingly, many transcriptional correlations with RBPJ disappeared in the C2 subset cells of EGFR-MT. Particularly, the expression of RGS2, NOTCH1, RGS1, PDCD1, CRTAM, CXCR6, TIA1, HIF1A, NAB1, and ITGAE positively correlated with RBPJ in the C2 subset of EGFR-WT, but the gene-gene correlations were lost or even inversely correlated in the C2 subset of EGFR-MT (Fig. 5g and Supplementary Data 7). The loss of the transcriptional correlation in EGFR-MT was more marked with NOTCH1 (Fig. 5f and Supplementary Data 7). These results indicate that the persistence of the tumor-infiltrating TRM cell state depends on the NOTCH-RBPJ regulatory network, which is dysregulated in EGFR-MT lung tumors.
Proportion of tumor-infiltrating TRM-like cells is highly predictive for anti-PD-1 response in NSCLC
Given that the proportion of tumor-infiltrating TRM-like cells decrease in EGFR-MT lung tumors, which tends to show poor response to anti-PD-1 immunotherapy, we hypothesized that the relative abundance of TRM-like cells in the tumor by itself could predict the anti-PD-1 response. The current gold-standard estimation of the proportion of each immune cell type is flow cytometry analysis using canonical markers specific for each cell type. However, this approach is valid for only major cell types or subtypes with well-characterized marker genes. Single-cell biology enables discovering an increasing number of cell subsets, and identifying specific markers for the subsets would be a daunting task. Therefore, quantifying cell subsets from a heterogeneous cell population using cell-type-specific expression signature has recently been employed. To infer the expression signature of the 18 subsets of immune cells from lung tumor samples, we applied CIBERSORTx58, a deconvolution method developed for inferring gene expression signature for a specific cell type, to the single-cell transcriptome profiles and generated a signature matrix of 18 subsets (Supplementary Data 8a). We then estimated the relative abundance of TRM-like cells in The Cancer Genome Atlas (TCGA) cohort of lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC)59. Consistent with the single-cell transcriptome results, the signature score for TRM-like cells was significantly higher in TCGA tumors with wild-type EGFR than in the mutant type (p = 0.0173, one-tailed Mann–Whitney U test) (Fig. 5h and Supplementary Data 8b). This result confirms that the inferred expression signature is a suitable estimator of the relative abundance of TRM-like cells in the tumor.
Next, we evaluated the expression signature of TRM-like cells to predict the response to anti-PD-1 immunotherapy using bulk RNA-seq data and clinical information available for two independent public cohorts of patients with NSCLC (GSE126045 and GSE135222)60,61. Notably, we observed a positive correlation between the signature score, indicating an abundance of tumor-infiltrating TRM-like cells and anti-PD-1 response in both cohorts (Fig. 5i and Supplementary Data 8c, d). In the two NSCLC cohorts, the signature score of responders was significantly higher than that of non-responders (p = 0.0114 and 0.0404, one-tailed Mann–Whitney U test). Furthermore, we prioritized patients exhibiting the highest signature score. This stratification resulted in a large area under the receiver operating characteristic curve (AUROC) scores, indicating that a high abundance of tumor-infiltrating TRM-like cells is a predictive marker of the anti-PD-1 response (Fig. 5j). Notably, only 2 of the 18 patients in the cohort recruited by Yonsei Cancer Center (YCC)60 were EGFR-MT; one was a responder, whereas the other was not (EGFR mutation status information was not available for the cohort by Jung et al.61). Therefore, the identified expression signature for TRM-like cells by itself can be used as a biomarker to stratify patients with lung cancer for anti-PD-1 immunotherapy regardless of their EGFR mutation status.
CXCL13–CXCR5-mediated TFH–B and TRM–B interactions are dysregulated in EGFR-MT tumors
Previous studies also demonstrated that TFH cells release CXCL13 to recruit B cells into TLS35, which then present tumor antigens to T cells62. As demonstrated in our study, the abundance of CXCL13-producing TFH-like cells was higher in EGFR-WT than in EGFR-MT patients (Fig. 3e, f). These observations suggest that recruiting B cells into the TLS of the tumor become less effective in EGFR-MT, likely due to a loss of CXCL13–CXCR5-mediated interactions, resulting in reduced cytotoxic T cell activation. To test this hypothesis, we evaluated cell-to-cell interactions among TFH-like cells (C9), B cells (C8), and cytotoxic TRM-like cells (C2) using CellPhoneDB63. We found more significant cell–cell interactions in EGFR-WT than EGFR-MT (Fig. 6a). We found that TFH–B cell interaction through the CXCL13–CXCR5 axis was substantially weaker in EGFR-MT (lower mean expression of ligand and receptor) than in EGFR-WT. Notably, we could also detect CXCL13–CXCR5-mediated interactions between TRM–B cells in EGFR-WT but not in EGFR-MT. Indeed, TRM-like cells also express CXCL13, albeit not as much as in TFH-like cells.
Recently, a PD-1+CD8+ T cell subset that secretes CXCL13 has been detected close to the B cell area of the TLS in lung cancer64. B cells of EGFR-WT appeared to present more molecules involved in antigen processing and presentation than EGFR-MT (Fig. 3d). These molecules will promote the activation of CD4+ TFH-like cells (through MHC class II) and CD8+ TRM cells (through MHC class I). Subsequently, the enhanced antigen-presenting ability might boost B cell expansion and stimulate tumor-specific CD8+ TRM cells. In EGFR-WT, the abundance of CD8+ TRM-like cells and their GZMB production were higher than those in EGFR-MT (Fig. 4h, i, k, m). Consequently, the increased proportion of CD8+ TRM-like cells can effectively lyse cancer cells by producing cytolytic materials such as GZMB.
To test our hypothesis, we first defined the area of TLS-like lesion (see “Methods” section) and then evaluated the abundance of TFH-like (CXCL13+CD4+), B (CXCR5+CD20+), and TRM-like (CD103+CD8+) cells within the TLS-like lesions of archival tumor tissue samples using multiplexed IF. Interestingly, we found that the total area of TLS-like lesion was significantly smaller in EGFR-MT than EGFR-WT (Fig. 6b, left and right upper panels and Supplementary Data 4a). For the validation set of 19 independent tumor samples used for multiplexed IF, we also observed a smaller area of TLS-like lesion in EGFR-MT than EGFR-WT (Fig. 6b, right lower panel and Supplementary Data 4c). In overall TLS-like lesion, the much less number of CD4+ and CD8+ T cells were found in EGFR-MT than EGFR-WT (Fig. 6c, d and Supplementary Figs. S10, 11). Notably, the numbers of TFH-like (CXCL13+CD4+) and TRM-like (CD103+CD8+) cells in EGFR-MT were significantly lower than EGFR-WT (Fig. 6c, d and Supplementary Figs. S10, 11, Supplementary Data 4b). We observed consistent results from the validation set of 19 independent tumor samples (Fig. 6e and Supplementary Fig. S12, Supplementary Data 4d). These results altogether suggest that CXCL13–CXCR5-mediated interactions between TFH–B and TRM–B cells scaffold TFH–B–TRM lymphocyte cooperation in TLS for antitumor immunity in lung cancer as summarized in Fig. 6f.
Discussion
In the present study, we performed comparative single-cell transcriptome analysis between EGFR-WT and EGFR-MT lung tumors across subsets of CD45+ immune cells using scRNA-seq to elucidate the underlying mechanisms of the different response rates to anti-PD-1 immunotherapy. Recently, accumulating evidence has indicated that activating the oncogenic pathway in tumor cells could diminish the antitumor immune response by modulating immune cell differentiation and recruitment65. Our study reveals the significant depletion of tumor-infiltrating B cells, CD4+ TFH-like cells, and CD8+ TRM-like cells in EGFR-MT tumors. In addition, TFH-like, B, and TRM-like cells of EGFR-MT showed a lower expression of genes involved in TLS formation, antigen presentation, and cytotoxicity, respectively, than EGFR-WT. EGFR-WT and EGFR-MT patient groups showed bias towards males with smoking history and female non-smokers, respectively, in both discovery and validation cohorts. Previous studies also reported a higher prevalence of EGFR-mutant NSCLC in female non-smokers than male smokers66,67. To test whether gender or smoking status was a confounder of the observed depletion of TFH-like, B, and TRM-like cells in EGFR-MT tumors, we compared CIBERSORTx scores for the three cell types between different smoking status or genders in EGFR-WT and EGFR-MT groups of the NSCLC tumors of TCGA cohort, separately. In neither EGFR-WT nor EGFR-MT group, we could observe a significant decrease of the three cell types in non-smokers and females (Supplementary Fig. S14a), which indicates that neither smoking status nor gender was a confounder of the observed association between EGFR mutation status and TME phenotypes. In addition, we observed only a weak correlation of TRM-like cell abundance or TLS area with PD-L1 expression (Supplementary Fig. S14b), which suggests that PD-L1 was not a confounder of our observed association between EGFR mutation status and TME phenotypes. Altogether, these results suggest that a higher intrinsic resistance of EGFR-mutant tumors to anti-PD-1 immunotherapy is primarily due to the depletion of these three lymphocyte subsets, major functional components of TLS for antitumor immunity, and their dysfunctional modulation in tumors. Furthermore, we found the disappearance of TFH–B and TRM–B interactions mediated by the CXCL13–CXCR5 axis in EGFR-MT tumors, which may negatively affect the formation of TLS in the tumor. Thus, targeting genes involved in these intra- and inter-cellular networks may improve the anti-PD-1 response in EGFR-MT patients.
Previous studies have demonstrated a correlation between the intratumoral abundance of TRM cells and patient prognosis in lung cancer49,68,69. In this study, we demonstrate the feasibility of predicting the response to anti-PD-1 immunotherapy based on the proportion of TRM-like cells. Quantification of a specific intratumoral cell subset would be a useful biomarker for predicting clinical outcomes. Using the recently developed CIBERSORTx for gene expression deconvolution with single-cell transcriptome data, we extracted expression signatures for each cell subset in the lung tumor and found that CD8+ TRM-like cells were highly predictive of the anti-PD-1 response in two independent cohorts of NSCLC. Notably, we tested the expression signature of TRM-like cells constructed from lung tumors to identify anti-PD-1 responders in melanoma cohorts but observed no prediction power. Given that TRM cells permanently reside in the peripheral tissue, these results are attributable to the tissue-specific nature of TRM cells by adaptation to different local microenvironment.
In the present study, we employed Pearson residual analysis with mosaic plot visualization to systematically compare the cell subset distribution between sample groups (EGFR-WT and EGFR-MT tumors). Using the chi-square significance test with Bonferroni adjustment for multiple hypothesis tests, we could retrieve subsets showing proportional changes that were consistent with the observed phenotype differences between sample groups. The same analytical framework can be applied to various comparative single-cell transcriptome analyses between sample groups with distinct genotypes and phenotypes, and the interpretation of results is highly intuitive. The Pearson residual method can be applied in a comparative analysis between more than two groups, if not too many. Therefore, we anticipate expanded the applicability of the Pearson residual framework to single-cell transcriptome analysis in the future.
There are several limitations to this study. First, although we identified three cell subsets associated with antitumor immunity in the TLS area and their functional regulation, whether they are regulated independently or hierarchically coordinated remains unknown. For example, if CXCL13 produced by TFH-like cells promotes the persistence of TRM-like cells by interacting with recruited B cells, the transcriptional regulator of CXCL13 expression in T cells could be the primary target to improve an anti-PD-1 response in EGFR-MT tumors. Second, this study lacks data for myeloid cells (e.g., dendritic cells and myeloid-derived suppressor cells) and stromal cells (e.g., cancer-associated fibroblasts), which also contribute to cancer–immune interactions. Although they are not directly involved in killing cancer cells, their activity enhances or suppresses major cytotoxic lymphocytes through cytokine-mediated cell-to-cell communication and the presentation of tumor antigens. Therefore, a comparison of subsets of myeloid and stromal cells would reveal additional cellular components for determining the response to anti-PD-1 immunotherapy. Third, our study cohort has a bias toward the adenocarcinoma subtype of NSCLC (it includes only a single patient with the squamous carcinoma subtype). Thus, some findings from the present study are possibly relevant to the adenocarcinoma subtype only. Fourth, two patients with G719A (P5) and G719S (P8) were included in the present study. Although G179X are considered sensitizing mutations, these were not included in a phase 3 trial comparing osimertinib with other EGFR-TKIs for EGFR-mutant NSCLC patients as first line (FLAURA)3 and are under-represented in clinical trials compared to exon 19del and L858R. Fifth, although we demonstrated that smoking status and genders are not confounders of the observed decrease of TLS in EGFR mutants, we may not rule out other confounding factors. Much large sample sets of genomically and clinically annotated tumors will provide a more comprehensive assessment of our findings. Finally, although we demonstrated the importance of homeostasis of TRM-like cells for antitumor immunity in TME of NSCLC through single-cell RNA velocity and co-regulatory network analysis, we could not provide functional validations in the present study. Thus, we need to conduct further experimental analyses in vitro and in vivo conditions to confirm our proposed mechanisms in future study.
In conclusion, we performed a comparative single-cell transcriptome analysis between EGFR-mutant and wild-type lung tumors and found several key lymphocyte subsets involved in the mechanisms underlying the anti-PD-1 response. In particular, we elucidated that TFH–B–TRM lymphocyte cooperation is critical to antitumor immunity in lung cancer. Dysregulation of the homeostasis of TRM cell state and TFH–B and TRM–B crosstalk involved in TLS development in EGFR-MT lung tumor may account for unfavorable responses to the antitumor immunotherapy with PD-1 blockades. Our findings provide guidelines for developing strategies to enhance anti-PD-1 efficacy in patients with EGFR mutations.
Methods
Tumor specimen collection and preparation
Patients diagnosed with NSCLC and who underwent surgical resection at Severance Hospital (Seoul, Republic of Korea), between June 2018 and August 2018, were prospectively enrolled. We consecutively enrolled five patients with EGFR-MT and five patients with EGFR-WT lung cancer. As depicted in Supplementary Data 1, six patients were in pathological stage I, two in stage II, and two in stage III; all patients with EGFR-MT were female never smokers, and those with EGFR-WT were males with a smoking history. This bias regarding the contribution of gender and smoking history to EGFR mutation has been previously reported in Asian patients with lung cancer66,67. No patient had undergone chemotherapy or radiation therapy before surgical resection. The study was approved by the Institutional Review Board of Severance Hospital (IRB No. 4-2018-1210, 4-2014-0775, and 4-2020-0945). We have obtained written informed consents from all patients in this study.
We collected fresh tumor tissues on the day of the surgery, which were mechanically and enzymatically dissociated using a gentleMACS dissociator (Miltenyi Biotec, Gladbach Bergisch, Germany, Cat# 130-093-235) and the Human Tumor Dissociation Kit (Miltenyi Biotec, Cat# 130-095-929) following the manufacturer’s instructions. After incubation for 1 h at 37 °C, the resuspended samples were filtered through a 70-µm-MACS SmartStrainer (Miltenyi Biotec, Cat# 130-098-462) into RPMI-1640 medium (Corning, Inc., Corning, NY, USA) supplemented with 10% fetal bovine serum (Biowest, Riverside, MO, USA) and centrifuged at 300 × g for 10 min. The pellet was suspended in PBS containing 2% fetal bovine serum. For fluorescence-activated cell sorting (FACS) analysis, single-cell suspensions were stained for 20 min at 4 °C with the following fluorescent-dye-conjugated antibodies: LIVE/DEAD™ Fixable Red Dead Cell Stain Kit (Invitrogen, Carlsbad, CA, USA, Cat# L23102. 1:100) and CD45 (Biolegend, San Diego, CA, USA, clone 2D1. 1:100). The samples were sorted using a FACS Aria III sorter (BD Biosciences, Franklin Lakes, NJ, USA) into live and single CD45+ cells. The sorted cell population was analyzed to ensure successful cell sorting, as shown in Supplementary Fig. S1. Cell viability was assessed by the Luna FL Fluorescence Cell Counter (Logos Biosystems, Anyang, Korea, Cat#L20001) and confirmed to be 75–98% in all samples.
10× single-cell RNA sequencing
Single-cell samples for constructing libraries and sequencing were prepared using a Chromium controller according to the 10× Single Cell 3′ v2 protocol (10× Genomics, Pleasanton, CA, USA). To acquire 10,000 targeted cells, the cells were diluted in nuclease-free water and mixed with the master mix. The single-cell suspensions were loaded with Single Cell 3′ Gel Beads and Partitioning Oil (GEM) into a Single Cell 3′ Chip. To generate the GEMs, single cells were uniquely barcoded, and cDNAs were used to construct libraries according to the manufacturer’s instructions. The libraries were sequenced using a HiSeq 2500 sequencing system (Illumina, San Diego, CA, USA).
Single-cell RNA sequencing data analysis
From the raw FASTQ files of scRNA-seq data (forward/reverse strands and barcode sequence), we used Cell Ranger analysis pipelines (v2.1.1) to obtain a gene-count matrix. We used the Seurat version3.017,70 pipeline to analyze scRNA-seq data. For quality control, we filtered cells with a higher percentage of mitochondrial genes or those showing outlier behavior in nfeatures_RNA and nCount_RNA from the “FeatureScatter” plot for each sample as recommended. We then normalized the data using the “NormalizeData” function with the recommended setting (log-normalization scale factor of 10,000). The normalized samples were integrated with 7431 highly variable genes by integration functions (“FindIntegrationAnchors” and “IntegrationData”) with default settings. For dimension reduction and visualization of the integrated data, “ScaleData,” “RunPCA,” and “RunUMAP” functions were used with default settings. Unsupervised clustering was performed by “FindNeighbors,” which conducts shared-nearest-neighbor-based clustering with 2000 highly variable genes and “FindClusters,” which conducts Louvain clustering (dimension of 20 and resolution of 0.65). The parameters for UMAP plotting before batch correction were the same as those for the integrated one.
Comparative single-cell transcriptome analysis based on Pearson residual
To investigate proportional changes between sample groups (e.g., EFGR-WT and EGFR-MT) across cell subsets defined by clustering, we evaluated deviations in the observed cell count for each group from the expected count for a subset using Pearson residual:
where i and j represent indices for each group and cell subsets, respectively, and O and E represent the observed and expected cell counts, respectively. The expected cell count (E) for a group i of a subset j was calculated by the following equation:
where Ttot, Ti, Tj represent total cell count for the entire data set, total cell count for a group i, and total cell count for a subset j, respectively. The advantage of the Pearson residual is that the sign of the residual indicates the direction of the difference of the observed count from the expected count (i.e., positive for augmentation and negative for depletion compared to expected count). Pearson residual (r) follows an approximately normal distribution; thus, scores larger than 2 or smaller than −2 are significant by p < 0.05. For a highly conservative statistical test along with Bonferroni correction for multiple hypothesis test adjustment, we used a significance threshold of p-value = 0.00056 (p-value of 0.01 was divided by the number of subsets, 18, for Bonferroni correction). Thus, we considered only cell subsets with r > 3.5 (augmentation) and r < −3.5 (depletion) for follow-up functional interpretation. We visualized the results of the goodness of fit test using a mosaic plot, in which subsets with deviation in observed cell counts from the expected cell counts are indicated by blue (augmentation), red (depletion), or gray (no significant change) colors. The goodness of fit for all subsets was also evaluated by the chi-square statistic (p-value).
Categorizing cell subsets into major immune cell types
We used marker gene expression to categorize cell clusters into immune cell types. The following list of marker genes was adapted from the Reference Gene Expression Profiles (RGEP)71: CD3D, CD3E, CD3G, and CD28 for T cells; CD4 for CD4+ T cells; CD8A and CD8B for CD8+ T cells; FOXP3 and IL2RA for Treg cells; FCGR3A, FCGR3B, NCAM1, KLRB1, KLRC1, KLRD1, KLRF1, and KLRK1 for NK and NKT cells; CD14, CD68, CD163, and C8F1R for macrophages and monocytes; CD19, MS4A1, CD79A, CD79B, and BLNK for B cells; IL3RA, CLEC4C, and NRP1 for dendritic cells; and KIT and IL1RL1 for ILC2 cells. The expression of these marker genes was visualized as violin plots (Supplementary Fig. S2) and validated on UMAP (Supplementary Figs. S3–9).
Differentially expressed gene analysis with single-cell transcriptome data
We performed two types of DEG analysis using the “FindMarkers” function. The “group-DEG” was evaluated by comparing the EGFR-WT group to the EGFR-MT group for each subset. The “subset-DEG” was evaluated by comparing one subset to all other subsets within a major cell type (i.e., NK and NKT cells, CD4+ T cells, CD8+ T cells). DEGs were accepted if their respective q-value was smaller than 0.05 and |avgLogFC| > 0.25 (default threshold by Seurat package). We excluded genes with high expression but a low ratio of non-zero cells.
Multiplex immunofluorescence staining
We cut 4-μm-thick tissue sections from formalin-fixed paraffin-embedded blocks, and then performed multiplexed IF for the nine patients analyzed by scRNA-seq, except P3 that was not available owing to the insufficient quantities. Tissue slides were heated for at least 1 h in a dry oven at 60 °C, dewaxed using xylene, and evaluated by multiplex immunofluorescence staining with a Leica Bond Rx™ Automated Stainer (Leica Biosystems, Wetzlar, Germany). Briefly, the slides were heated for 30 min and dewaxed with Leica Bond Dewax solution (#AR9222), followed by antigen retrieval with Bond Epitope Retrieval 2 (#AR9640, Leica Biosystems) in pH 9.0 solution for 30 min. Each primary antibody was incubated for 30 min, followed by detection using the polymer HRP Ms+Rb secondary antibody (ARH1001EA, Akoya Biosciences, Marlborough, MA, USA) for 10 min. To evaluate the CD103, GZMB in CD8+ T cells, primary antibodies against the following antigens were used: CD8 (MCA1817, Bio-Rad, Hercules, CA, USA, 1:300), CD103 (ab129202, Abcam, Cambridge, UK, 1:500), and GZMB (262A-15, Cell Marque, Rocklin, CA, USA, 1:50). Visualization of each antibody was accomplished by treatment with Opal 520 TSA Plus, Opal 620 TSA Plus, and Opal 780 TSA Plus, respectively, for 10 min, following which the slide was treated with Bond Epitope Retrieval 1 (#AR9961, Leica Biosystems) for 20 min to remove bound antibodies. To evaluate the CXCL13+CD4, CXCR5+CD20, CD103+CD8, we performed another set of multiplexed IF using primary antibodies against the following antigens: for CD4 (ab133616, Abcam, Cambridge, UK, 1:200), CD8 (MCA1817, Bio-Rad, Hercules, CA, USA, 1:300), CXCL13 (PA5-47035, Invitrogen, Massachusetts, USA, 1:40), CXCR5 (72172 S, Cell Signaling Technology, Massachusetts, USA, 1:200), CD20 (ab9475, Abcam, Cambridge, UK, 1:100), and CD103 (ab129202, Abcam, Cambridge, UK, 1:500). Visualization of each antibody was accomplished by treatment with Opal 570 TSA, Opal 520 TSA, Opal 690 TSA, Opal 620 TSA, Opal 480 TSA, Opal 780 TSA. Nuclei were visualized with DAPI, and the section was coverslipped using HIGHDEF® IHC fluoromount (ADI-950-260-0025, Enzo, Farmingdale, NY, USA). To validate the results from the prior tumor samples, we conducted additional multiplex IF for tumor samples from 19 independent NSCLC patients (EGFR-WT, n = 9; EGFR-MT, n = 10) using CD8 (MCA1817, Bio-Rad, Hercules, CA, USA, 1:300), CD103 (ab129202, Abcam, Cambridge, UK, 1:500), CXCL13 (PA5-47035, Invitrogen, Massachusetts, USA, 1:40), CD4 (ab133616, Abcam, Cambridge, UK, 1:200), CD20 (ab9475, Abcam, Cambridge, UK, 1:100) and CK (NBP2-29429, NOVUS, Centennial, USA, 1:200). Visualization of each antibody was accomplished by treatment with Opal 570 TSA, Opal 520 TSA, Opal 690 TSA, Opal 620 TSA, Opal 480 TSA, and Opal 780 TSA.
Image acquisition and quantitative data analysis
Slides were scanned using a Vectra Polaris Automated Quantitative Pathology Imaging System (Akoya Biosciences), and images were analyzed using InForm 2.2 software and TIBCO Spotfire™ (Akoya Biosciences). To acquire reliable unmixed images, representative slides of each emission spectrum and unstained tissue slide were used. Each stained section was used to establish the spectral library of fluorophores required for multispectral analysis as a reference for target quantitation, as the intensity of each fluorescent target was extracted from the multispectral data by linear unmixing. Each cell was identified by detecting nuclear spectral elements (DAPI). The immune cell population from each panel was characterized and quantified using the cell segmentation tool in InForm image analysis software. For co-expression analysis, the data obtained from InForm were sent to Spotfire™ software, and the threshold for positivity of each factor was determined using IF scoring methods. All cells on each slide were designated as positive or negative for each antibody, and the data were categorized and exported to a Microsoft Excel spreadsheet file for analysis. We used the Spotfire™ program, after tissue and cell segmentation, and compared the expression intensity, and assessed our results based on the cutoff values. The number of positive cells was counted on each slide. Immune subsets including GZMB+ CD103+CD8+, CD103+CD8+, CD8+, CXCL13+CD4+, and CD4+ T cells were quantified by cell counts per mm2 in both the tumor nest and stroma based on the whole slide. Pathologists reviewed each slide stained with multiplex IF and H&E of nine patients and manually marked TLS-like lesion based on the colocalization of CD20+ B cells, CD4+, CD8+ T cells. For the total area of the TLS-like lesions in both the tumor nest and stroma of each sample, we also evaluated the counts of CD103+CD8+, CD8+, CXCL13+CD4+, CD4+ T cells, and CD20+ B cells.
Flow cytometry analysis
Flow cytometry was performed using CytoFLEX (Beckman Coulter, Brea, CA, USA). Data were analyzed using FlowJo software (Tree Star, Ashland, OR, USA). For immunolabeling TILs, fluorophore-conjugated monoclonal antibodies against the following proteins were used: CD8 (RPA-T8, Cat# 301048, Biolegend, San Diego, CA, USA, 1:50), CD3 (SK7, Cat# 344808, Biolegend, San Diego, CA, USA, 1:100), PD-1 (EH12.2H7, Cat# 329933, Biolegend, San Diego, CA, USA, 1:20), CD103 (Ber-ACT8, Cat# 350230, Biolegend, San Diego, CA, USA, 1:20), CD39 (A1, Cat# 328210, Biolegend, San Diego, CA, USA, 1:20), GZMB (QA16A02, Cat# 372214, Biolegend, San Diego, CA, USA, 1:50), and CD4 (RPA-T4, Cat# 560837, BD Biosciences, San Diego, CA, USA, 1:50). The LIVE/DEAD Fixable Near-IR Dead Cell Stain Kit was from Invitrogen (L34973, Waltham, Massachusetts, USA, 1:100). The sequential gating strategies used for analysis for data obtained from flow cytometry are described in Supplementary Fig. S15.
For t-distributed stochastic neighbor embedding (tSNE) visualization of flow cytometry data, we used three samples for each group (EGFR-WT and EGFR-MT). Each sample was down-sampled to 7000 randomly selected CD3+ live and singlet-gated cells, yielding 42,000 cells. A tSNE plot for the merged 42,000 cells was constructed using FlowJo (v10.6.2) with default settings (3000 iterations and perplexity = 100).
RNA velocity-based estimation of cellular transition probability
We performed RNA velocity analysis by velocyto52 using the default pipeline. We processed single-cell transcriptome data for EGFR-WT and EGFR-MT tumors separately. We used the pre-built UMAP coordinate of CD8+ T cells from the Seurat package as the embedding coordinates for plotting. The transition probability was obtained from the “transition_prob.T” variable. RNA velocity predicts the future mRNA abundance of each gene from the ratio of spliced and unspliced mRNA levels based on a steady-state model. For ScVelo53 (ver 0.2.3) analysis which enables estimation of RNA velocity without steady-state assumption, we used the default pipeline. Loom files for splicing and un-splicing count information for each cell were obtained from RNA velocity analysis. The transition probability was obtained from the “scv.utials.get_transition_matrix()” class. The future state of each cell is predicted based on all genes. For a given subset of cells, the probability of transition to another subset can be calculated for each cell. The mean values of transition probability for each subset of cells were obtained by averaging the corresponding cell transition to other cell subsets.
MetaCell-based analysis of gene expression correlations
The correlation coefficient calculated from single-cell gene expression data was markedly low, mainly because of the zero-inflation property of scRNA-seq data. The sparsity of single-cell expression profiles can be compensated by imputation approaches, but they generate many false gene-gene correlations72. For more sensitive and robust detection of co-expression between two genes, we employed MetaCell57, which pools k nearest neighbor cells. We collected the raw count matrix from the Seurat object of single-cell data for EGFR-WT and EGFR-MT groups, separately. Each data set was processed using default parameter settings in the MetaCell pipeline without a cell filtering step: T_vm = 0.08 for mcell_gset_filter_varmean; T_tot = 100 and T_top3 = 2 for mcell_gset_filter_cov; K = 100 and dsamp = T for mcell_add_cgraph_from_mat_bknn; min_mc_size = 20, p_resamp = 0.75, and n_resamp = 500 for mcell_coclust_from_graph_resamp; K = 30, min_mc_size = 30, and alpha = 2 for mcell_mc_from_coclust_balanced; and T_lfc = 3 for mcell_mc_split_filt. The pooled cell “metacell” was generated with a corrected expression matrix calculated by the regularized geometric mean of each “metacell.” We calculated Spearman’s rank correlation coefficient between two genes based on the corrected expression matrix.
Determining the abundance of TRM-like cells in tumor samples using expression signature
Expression deconvolution technology was used to estimate the proportion of the TRM-like cells from bulk RNA-seq profiles for tumor samples. To construct a gene expression profile of TRM-like cells, we employed CIBERSORTx58, a recently developed tool for expression deconvolution that can utilize single-cell transcriptome data. We first generated a signature matrix of the 7431 highly variable genes based on the scRNA-seq data. We then deconvoluted bulk RNA-seq data using default parameter settings for each tumor sample obtained from LUAD and lung squamous cell carcinoma (LUSC) of TCGA cohort data set73 to estimate the proportion of TRM-like cells. We also estimated the proportion of TRM-like cells of tumor samples from two other public cohorts of patients with lung cancer treated with anti-PD-1 immunotherapy (GSE126045 and GSE135222)60,61.
Analysis of receptor–ligand-mediated cell–cell interactions
We used CellPhoneDB63 to analyze the receptor–ligand-mediated cell–cell interactions. We used the raw count matrix with the 7431 highly variable genes based on the scRNA-seq data for cells of C0–14 clusters as input data. We ran the EGFR-WT and EGFR-MT data separately with the default pipeline and parameters.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The single-cell RNA sequencing data generated in this study have been deposited in the Gene Expression Omnibus database (GSE144945). TCGA cohort data set were obtained from [https://gdac.broadinstitute.org]59. Bulk RNA-seq data for two public cohorts of patients with lung cancer treated with anti-PD-1 immunotherapy were obtained from GSE12604560 and GSE13522261. The remaining data are available within the Article, Supplementary Information, or Source Data file. Source data are provided with this paper.
References
Hsu, W. H., Yang, J. C., Mok, T. S. & Loong, H. H. Overview of current systemic management of EGFR-mutant NSCLC. Ann. Oncol. 29, i3–i9 (2018).
Mok, T. S. et al. Osimertinib or platinum-pemetrexed in EGFR T790M-positive lung cancer. N. Engl. J. Med. 376, 629–640 (2017).
Soria, J. C. et al. Osimertinib in untreated EGFR-mutated advanced non-small-cell lung cancer. N. Engl. J. Med. 378, 113–125 (2018).
Brahmer, J. et al. Nivolumab versus docetaxel in advanced squamous-cell non-small-cell lung cancer. N. Engl. J. Med. 373, 123–135 (2015).
Borghaei, H. et al. Nivolumab versus docetaxel in advanced nonsquamous non-small-cell lung cancer. N. Engl. J. Med. 373, 1627–1639 (2015).
Herbst, R. S. et al. Pembrolizumab versus docetaxel for previously treated, PD-L1-positive, advanced non-small-cell lung cancer (KEYNOTE-010): a randomised controlled trial. Lancet 387, 1540–1550 (2016).
Reck, M. et al. Updated analysis of KEYNOTE-024: pembrolizumab versus platinum-based chemotherapy for advanced non-small-cell lung cancer with PD-L1 tumor proportion score of 50% or greater. J. Clin. Oncol. 37, 537–546 (2019).
Gainor, J. F. et al. EGFR mutations and ALK rearrangements are associated with low response rates to PD-1 pathway blockade in non-small cell lung cancer: a retrospective analysis. Clin. Cancer Res. 22, 4585–4593 (2016).
Dong, Z. Y. et al. EGFR mutation correlates with uninflamed phenotype and weak immunogenicity, causing impaired response to PD-1 blockade in non-small cell lung cancer. Oncoimmunology 6, e1356145 (2017).
Lee, C. K. et al. Checkpoint inhibitors in metastatic EGFR-mutated non-small cell lung cancer-a meta-analysis. J. Thorac. Oncol. 12, 403–407 (2017).
Lisberg, A. et al. A phase II study of pembrolizumab in EGFR-mutant, PD-L1+, tyrosine kinase inhibitor naïve patients with advanced NSCLC. J. Thorac. Oncol. 13, 1138–1145 (2018).
Reck, M. et al. Atezolizumab plus bevacizumab and chemotherapy in non-small-cell lung cancer (IMpower150): key subgroup analyses of patients with EGFR mutations or baseline liver metastases in a randomised, open-label phase 3 trial. Lancet Respir. Med. 7, 387–401 (2019).
Guo, X. et al. Global characterization of T cells in non-small-cell lung cancer by single-cell sequencing. Nat. Med. 24, 978–985 (2018).
Lambrechts, D. et al. Phenotype molding of stromal cells in the lung tumor microenvironment. Nat. Med. 24, 1277–1289 (2018).
Laughney, A. M. et al. Regenerative lineages and immune-mediated pruning in lung cancer metastasis. Nat. Med. 26, 259–269 (2020).
Kim, N. et al. Single-cell RNA sequencing demonstrates the molecular and cellular reprogramming of metastatic lung adenocarcinoma. Nat. Commun. 11, 2285 (2020).
Butler, A., Hoffman, P., Smibert, P., Papalexi, E. & Satija, R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411–420 (2018).
Becht, E. et al. Dimensionality reduction for visualizing single-cell data using UMAP. Nat. Biotechnol. 37, 38–44 (2018).
Wu, J. & Lanier, L. L. Natural killer cells and cancer. Adv. Cancer Res. 90, 127–156 (2003).
Croft, M. The role of TNF superfamily members in T-cell function and diseases. Nat. Rev. Immunol. 9, 271–285 (2009).
Huet, S. et al. CD44 contributes to T cell activation. J. Immunol. 143, 798–801 (1989).
Cohen, N. R. et al. Shared and distinct transcriptional programs underlie the hybrid nature of iNKT cells. Nat. Immunol. 14, 90–99 (2013).
Voskoboinik, I., Whisstock, J. C. & Trapani, J. A. Perforin and granzymes: function, dysfunction and human pathology. Nat. Rev. Immunol. 15, 388–400 (2015).
Yuen, G. J., Demissie, E. & Pillai, S. B lymphocytes and cancer: a love-hate relationship. Trends Cancer 2, 747–757 (2016).
Wennhold, K., Shimabukuro-Vornhagen, A. & von Bergwelt-Baildon, M. B cell-based cancer immunotherapy. Transfus. Med. Hemother. 46, 36–46 (2019).
Petitprez, F. et al. B cells are associated with survival and immunotherapy response in sarcoma. Nature 577, 556–560 (2020).
Cabrita, R. et al. Tertiary lymphoid structures improve immunotherapy and survival in melanoma. Nature 577, 561–565 (2020).
Helmink, B. A. et al. B cells and tertiary lymphoid structures promote immunotherapy response. Nature 577, 549–555 (2020).
Bandala-Sanchez, E. et al. T cell regulation mediated by interaction of soluble CD52 with the inhibitory receptor Siglec-10. Nat. Immunol. 14, 741–748 (2013).
Samten, B. CD52 as both a marker and an effector molecule of T cells with regulatory action: identification of novel regulatory T cells. Cell Mol. Immunol. 10, 456–458 (2013).
Gavins, F. N. & Hickey, M. J. Annexin A1 and the regulation of innate and adaptive immunity. Front. Immunol. 3, 354 (2012).
Yang, Y. H. et al. Deficiency of annexin A1 in CD4+ T cells exacerbates T cell-dependent inflammation. J. Immunol. 190, 997–1007 (2013).
Chen, C. Y., Lin, Y. S., Chen, C. H. & Chen, Y. J. Annexin A2-mediated cancer progression and therapeutic resistance in nasopharyngeal carcinoma. J. Biomed. Sci. 25, 30 (2018).
Zou, L. et al. Bone marrow is a reservoir for CD4+CD25+ regulatory T cells that traffic through CXCL12/CXCR4 signals. Cancer Res. 64, 8451–8455 (2004).
Crotty, S. T follicular helper cell biology: a decade of discovery and diseases. Immunity 50, 1132–1148 (2019).
Choi, Y. S. et al. ICOS receptor instructs T follicular helper cell versus effector cell differentiation via induction of the transcriptional repressor Bcl6. Immunity 34, 932–946 (2011).
Vacchio, M. S. et al. A Thpok-directed transcriptional circuitry promotes Bcl6 and Maf expression to orchestrate T follicular helper differentiation. Immunity 51, 465–478 (2019). e466.
Shi, J. et al. PD-1 controls follicular T helper cell positioning and function. Immunity 49, 264–274 (2018).
Yamamoto, K. et al. Anti-CXCL13 antibody can inhibit the formation of gastric lymphoid follicles induced by Helicobacter infection. Mucosal Immunol. 7, 1244–1254 (2014).
Ciccia, F. et al. Ectopic expression of CXCL13, BAFF, APRIL and LT-beta is associated with artery tertiary lymphoid organs in giant cell arteritis. Ann. Rheum. Dis. 76, 235–243 (2017).
Ueno, H. T follicular helper cells in human autoimmunity. Curr. Opin. Immunol. 43, 24–31 (2016).
Sautes-Fridman, C., Petitprez, F., Calderaro, J. & Fridman, W. H. Tertiary lymphoid structures in the era of cancer immunotherapy. Nat. Rev. Cancer 19, 307–325 (2019).
Cannarile, L., Delfino, D. V., Adorisio, S., Riccardi, C. & Ayroldi, E. Implicating the role of GILZ in glucocorticoid modulation of T-cell activation. Front. Immunol. 10, 1823 (2019).
Sekiya, T. et al. The nuclear orphan receptor Nr4a2 induces Foxp3 and regulates differentiation of CD4+ T cells. Nat. Commun. 2, 269 (2011).
Behr, F. M., Chuwonpad, A., Stark, R. & van Gisbergen, K. Armed and ready: transcriptional regulation of tissue-resident memory CD8 T cells. Front. Immunol. 9, 1770 (2018).
Amsen, D., van Gisbergen, K., Hombrink, P. & van Lier, R. A. W. Tissue-resident memory T cells at the center of immunity to solid tumors. Nat. Immunol. 19, 538–546 (2018).
Park, S. L., Gebhardt, T. & Mackay, L. K. Tissue-resident memory T cells in cancer immunosurveillance. Trends Immunol. 40, 735–747 (2019).
Wu, J. et al. T cell factor 1 suppresses CD103+ lung tissue-resident memory T cell development. Cell Rep. 31, 107484 (2020).
Clarke, J. et al. Single-cell transcriptomic analysis of tissue-resident memory T cells in human lung cancer. J. Exp. Med. 216, 2128–2149 (2019).
Liikanen, I. et al. Hypoxia-inducible factor activity promotes antitumor effector function and tissue residency by CD8+ T cells. J. Clin. Invest. 131, 143729 (2021).
Duhen, T. et al. Co-expression of CD39 and CD103 identifies tumor-reactive CD8 T cells in human solid tumors. Nat. Commun. 9, 2724 (2018).
La Manno, G. et al. RNA velocity of single cells. Nature 560, 494–498 (2018).
Bergen, V., Lange, M., Peidli, S., Wolf, F. A. & Theis, F. J. Generalizing RNA velocity to transient cell states through dynamical modeling. Nat. Biotechnol. 38, 1408–1414 (2020).
Milner, J. J. et al. Heterogenous populations of tissue-resident CD8(+) T cells are generated in response to infection and malignancy. Immunity 52, 808–824 (2020).
Hombrink, P. et al. Programs for the persistence, vigilance and control of human CD8(+) lung-resident memory T cells. Nat. Immunol. 17, 1467–1478 (2016).
Li, C. et al. The transcription factor Bhlhe40 programs mitochondrial regulation of resident CD8(+) T cell fitness and functionality. Immunity 51, 491–507 (2019).
Baran, Y. et al. MetaCell: analysis of single-cell RNA-seq data using K-nn graph partitions. Genome Biol. 20, 206 (2019).
Newman, A. M. et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat. Biotechnol. 37, 773–782 (2019).
Grossman, R. L. et al. Toward a shared vision for cancer genomic data. N. Engl. J. Med. 375, 1109–1112 (2016).
Kim, K. et al. Single-cell transcriptome analysis reveals TOX as a promoting factor for T cell exhaustion and a predictor for anti-PD-1 responses in human cancer. Genome Med. 12, 22 (2020).
Jung, H. et al. DNA methylation loss promotes immune evasion of tumours with high mutation and copy number load. Nat. Commun. 10, 4278 (2019).
Bruno, T. C. et al. Antigen-presenting intratumoral B cells affect CD4(+) TIL phenotypes in non-small cell lung cancer patients. Cancer Immunol. Res. 5, 898–907 (2017).
Efremova, M., Vento-Tormo, M., Teichmann, S. A. & Vento-Tormo, R. CellPhoneDB: inferring cell-cell communication from combined expression of multi-subunit ligand-receptor complexes. Nat. Protoc. 15, 1484–1506 (2020).
Thommen, D. S. et al. A transcriptionally and functionally distinct PD-1(+) CD8(+) T cell pool with predictive potential in non-small-cell lung cancer treated with PD-1 blockade. Nat. Med. 24, 994–1004 (2018).
Spranger, S. & Gajewski, T. F. Impact of oncogenic pathways on evasion of antitumour immune responses. Nat. Rev. Cancer 18, 139–147 (2018).
Hendriks, L. E., Rouleau, E. & Besse, B. Clinical utility of tumor mutational burden in patients with non-small cell lung cancer treated with immunotherapy. Transl. Lung Cancer Res. 7, 647–660 (2018).
Bell, D. W. et al. Increased prevalence of EGFR-mutant lung cancer in women and in East Asian populations: analysis of estrogen-related polymorphisms. Clin. Cancer Res. 14, 4079–4084 (2008).
Ganesan, A. P. et al. Tissue-resident memory features are linked to the magnitude of cytotoxic T cell responses in human lung cancer. Nat. Immunol. 18, 940–950 (2017).
Djenidi, F. et al. CD8+CD103+ tumor-infiltrating lymphocytes are tumor-specific tissue-resident memory T cells and a prognostic factor for survival in lung cancer patients. J. Immunol. 194, 3475–3486 (2015).
Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902 (2019).
Schelker, M. et al. Estimation of immune cell content in tumour tissue using single-cell RNA-seq data. Nat. Commun. 8, 2032 (2017).
Andrews, T. S. & Hemberg, M. False signals induced by single-cell imputation. F1000Res 7, 1740 (2018).
Collisson, E. A. et al. Comprehensive molecular profiling of lung adenocarcinoma. Nature 511, 543–550 (2014).
Acknowledgements
This research was supported by the Bio & Medical Technology Development Program of the National Research Foundation funded by the Ministry of Science and ICT (NRF-2019M3A9B6065231, 2017M3A9E802971, 2017M3A9E9072669 to H.R.K., 2018R1A5A2025079, 2018M3C9A5064709, 2019M3A9B6065192 to I.L., 2019M3A9B6065221, 2018R1A2A1A05076997, 2017R1A5A1014560 to S.-J.H.). The work was supported in part by Brain Korea 21(BK21) FOUR program.
Author information
Authors and Affiliations
Contributions
S.Y.P., S.-J.H., I.L., and H.R.K. conceived the study. J.C. and H.H. performed single-cell transcriptome data analysis under the supervision of I.L. S.P., G.M.K., and S.H.S. performed the experiments and analyzed the data under the supervision of S.-J.H. and H.R.K. S.Y.P. and H.R.K. organized clinical samples and data collections. Y.-S.B. helped perform multiplexed IF. S.-J.H., I.L., and H.R.K. contributed to the financial and administrative support for this study. H.S.S. contributed to the pathological examination of tumor tissues. J.C., S.P., S.-J.H., I.L., and H.R.K. wrote the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Communications thanks Dwight H. Owen, Manu Setty and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cho, JW., Park, S., Kim, G. et al. Dysregulation of TFH-B-TRM lymphocyte cooperation is associated with unfavorable anti-PD-1 responses in EGFR-mutant lung cancer. Nat Commun 12, 6068 (2021). https://doi.org/10.1038/s41467-021-26362-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-021-26362-0
This article is cited by
-
Joint analysis of mutational and transcriptional landscapes in human cancer reveals key perturbations during cancer evolution
Genome Biology (2024)
-
Single-cell RNA sequencing in cancer research: discovering novel biomarkers and therapeutic targets for immune checkpoint blockade
Cancer Cell International (2023)
-
IL-2Rα-biased agonist enhances antitumor immunity by invigorating tumor-infiltrating CD25+CD8+ T cells
Nature Cancer (2023)
-
Notch signaling pathway: architecture, disease, and therapeutics
Signal Transduction and Targeted Therapy (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.