Deep learning on tertiary lymphoid structures in hematoxylin-eosin predicts cancer prognosis and immunotherapy response

Chen, Ziqiang; Wang, Xiaobing; Jin, Zelin; Li, Bosen; Jiang, Dongxian; Wang, Yanqiu; Jiang, Mengping; Zhang, Dandan; Yuan, Pei; Zhao, Yahui; Feng, Feiyue; Lin, Yicheng; Jiang, Liping; Wang, Chenxi; Meng, Weida; Ye, Wenjing; Wang, Jie; Qiu, Wenqing; Liu, Houbao; Huang, Dan; Hou, Yingyong; Wang, Xuefei; Jiao, Yuchen; Ying, Jianming; Liu, Zhihua; Liu, Yun

doi:10.1038/s41698-024-00579-w

Download PDF

Article
Open access
Published: 22 March 2024

Deep learning on tertiary lymphoid structures in hematoxylin-eosin predicts cancer prognosis and immunotherapy response

Ziqiang Chen ORCID: orcid.org/0000-0002-6726-5084^1,2^na1,
Xiaobing Wang ORCID: orcid.org/0000-0002-0907-364X³^na1,
Zelin Jin^1,2^na1,
Bosen Li⁴^na1,
Dongxian Jiang⁵,
Yanqiu Wang⁶,
Mengping Jiang^1,2,
Dandan Zhang^1,2,
Pei Yuan⁷,
Yahui Zhao³,
Feiyue Feng⁸,
Yicheng Lin^1,2,
Liping Jiang³,
Chenxi Wang³,
Weida Meng^1,2,
Wenjing Ye⁹,
Jie Wang¹⁰,
Wenqing Qiu¹¹,
Houbao Liu^11,12,
Dan Huang¹³,
Yingyong Hou⁵,
Xuefei Wang^4,14,
Yuchen Jiao³,
Jianming Ying ORCID: orcid.org/0000-0002-7301-4118⁷,
Zhihua Liu ORCID: orcid.org/0000-0003-4242-033X³ &
…
Yun Liu ORCID: orcid.org/0000-0001-5753-1265^1,2

npj Precision Oncology volume 8, Article number: 73 (2024) Cite this article

1744 Accesses
Metrics details

Subjects

Abstract

Tertiary lymphoid structures (TLSs) have been associated with favorable immunotherapy responses and prognosis in various cancers. Despite their significance, their quantification using multiplex immunohistochemistry (mIHC) staining of T and B lymphocytes remains labor-intensive, limiting its clinical utility. To address this challenge, we curated a dataset from matched mIHC and H&E whole-slide images (WSIs) and developed a deep learning model for automated segmentation of TLSs. The model achieved Dice coefficients of 0.91 on the internal test set and 0.866 on the external validation set, along with intersection over union (IoU) scores of 0.819 and 0.787, respectively. The TLS ratio, defined as the segmented TLS area over the total tissue area, correlated with B lymphocyte levels and the expression of CXCL13, a chemokine associated with TLS formation, in 6140 patients spanning 16 tumor types from The Cancer Genome Atlas (TCGA). The prognostic models for overall survival indicated that the inclusion of the TLS ratio with TNM staging significantly enhanced the models’ discriminative ability, outperforming the traditional models that solely incorporated TNM staging, in 10 out of 15 TCGA tumor types. Furthermore, when applied to biopsied treatment-naïve tumor samples, higher TLS ratios predicted a positive immunotherapy response across multiple cohorts, including specific therapies for esophageal squamous cell carcinoma, non-small cell lung cancer, and stomach adenocarcinoma. In conclusion, our deep learning-based approach offers an automated and reproducible method for TLS segmentation and quantification, highlighting its potential in predicting immunotherapy response and informing cancer prognosis.

Multi-resolution deep learning characterizes tertiary lymphoid structures and their prognostic relevance in solid tumors

Article Open access 05 January 2024

Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images

Article Open access 26 March 2024

Deep learning model improves tumor-infiltrating lymphocyte evaluation and therapeutic response prediction in breast cancer

Article Open access 30 August 2023

Introduction

Tertiary lymphoid structures (TLSs) are organized aggregation of immune cells resembling secondary lymphoid organs^1,2,3. While the mechanisms governing TLS formation in tumor microenvironment remain unclear, its presence associates with a positive immunotherapy response in multiple cancers^2,4,5,6,7. A recent clinical trial revealed that the presence of TLS in advanced soft-tissue sarcomas predicts a favorable response to pembrolizumab treatment⁸, underscored its potential as a valuable biomarker for predicting clinical efficacy of immunotherapy. Moreover, several meta-analyses demonstrated the associations between the presence of TLS and prolonged overall survival in gastrointestinal cancers⁹ and digestive system cancers¹⁰, further highlighting the clinical value of TLS across multiple cancer types.

Currently, the gold standard to segment and quantify TLS is based on pathological characteristics using multiplex immunohistochemistry (mIHC) staining on T and B lymphocytes^11,12. However, mIHC is resource intensive and not widely available, limiting its clinical utility. While experienced pathologists can potentially identify TLSs on hematoxylin and eosin (H&E)-stained whole-slide images (WSIs)^13,14, but to our knowledge, the sensitivity and accuracy of this approach to segment TLSs based on H&E staining alone against the results established by mIHC are not systematically evaluated.

With the rise of deep learning, automated histopathological feature extraction has become feasible for a range of tasks, including cancer grading^15,16, diagnosis^17,18,19, prognosis^20,21,22, and predicting immunotherapy response^23,24, molecular expression^25,26, and genetic alterations^27,28. Some algorithms can even achieve diagnostic accuracy rivaling pathologists^29,30. In this work, we curated a dataset from matched mIHC and H&E WSIs and developed a deep-learning approach that segments and calculates the TLS ratio (defined as the segmented TLS area divided by the tissue area) from H&E WSIs. Subsequently, we validated the accuracy of our approach in The Cancer Genome Atlas (TCGA), and evaluated the associations between TLS ratios and overall survival across multiple cancer types. Finally, the TLS ratio was assessed for predicting an immunotherapy response in various cohorts.

Results

Data collection and development of the TLS segmentation model

The overall study design is illustrated in Fig. 1. First, we generated a rigorously curated dataset based on matched mIHC and H&E WSIs (part I of Fig. 1), all at a magnification of 20× (0.5 \({\upmu}{\mathrm{m}}{/}\)pixel), from 60 esophageal squamous cell carcinoma (ESCC) patients and 5 non-small cell lung cancer (NSCLC) patients (Supplementary Table 1). TLSs were identified based on CD3 and CD20 staining and subsequently used as ground truth for the segmentation of TLSs on consecutive H&E-stained slides from the same individuals. The H&E WSIs and their TLS segmentations were cropped into 22,497 equally sized tiles (512 × 512 pixels, 256 mm × 256 mm) (Supplementary Fig. 1) and randomly split into internal training, validation and test sets in a ratio of 7:1:2 (Supplementary Table 2).

**Fig. 1: Overview of the study design and methodology.**

These tiles were then used to train a model to segment TLSs on H&E WSIs. By using a modified encoder-decoder model based on EfficientNet-b0³¹, we achieved a strong segmentation performance with a Dice coefficient of 0.91 (95% confidence interval [CI]: 0.902–0.918) and an intersection over union (IoU) of 0.819 (95% CI: 0.811–0.827) on the internal test set. Moreover, the model showed excellent ability to discriminate TLSs, with areas under the curve (AUCs) for the receiver operator characteristic (ROC) curves (Supplementary Fig. 2a–c) reaching 0.981 (95% CI: 0.892–0.999), 0.965 (95% CI: 0.873–0.998), and 0.966 (95% CI: 0.869–0.989) for the internal training, validation, and test sets, respectively. Examples of TLS segmentation on the holdout internal test set were illustrated in Supplementary Fig. 3. Evaluation of the model’s predictive accuracy was extended by assessing the linear correlations between the predicted and observed TLS area for each tile. These analyses revealed strong correlations across all three internal data sets (all rho >0.89), with highly significant P values (all P values < 0.0001) (Supplementary Fig. 4a–c). Additionally, our analysis did not reveal any significant prediction bias across different samples, as the IoUs for individual slides were consistently above 0.7 (Supplementary Fig. 4d).

To further validate the accuracy of our model, we assembled an external validation set comprised of five ESCC and ten NSCLC samples obtained from the TCGA. From these H&E-stained WSIs, we generated a total of 667 tiles for TLS segmentation. The performance of our model on this external validation set remained robust, as evidenced by a Dice coefficient of 0.866 (95% CI: 0.855–0.877), an IoU of 0.787 (95% CI: 0.773–0.802), and an AUC of 0.934 (95% CI: 0.838–0.968) (Supplementary Fig. 2d). A significant linear correlation between the predicted and actual TLS areas per tile was observed (rho = 0.79, P value < 0.0001) (Supplementary Fig. 4e). Moreover, the IoUs for individual slides were consistently above 0.6 (Supplementary Fig. 4f). Collectively, these results underscore the robustness and reliability of our deep learning model in TLS segmentation.

Deep learning pipeline for TLS ratio calculation

After TLS segmentation, we employed a deep-learning pipeline to calculate the TLS ratio for each H&E WSI. As illustrated in the part II of Fig. 1, the pipeline comprised three distinct branches and comprehensive details of these branches were provided in the Methods section. Briefly, these branches were designed to determine the tissue area, segmented TLS area, and lymphocyte count, respectively. The branch to determine the tissue area employed the OTSU method from the OpenCV Python package³², which segment the tissue region from the non-tissue background. The branch to determine lymphocyte count, specifically designed to exclude small-sized TLSs, utilized the publicly available deep learning model HoVer-Net³³. This model is broadly used for segmenting different cell types, particularly lymphocytes, from H&E WSIs³³. Tiles with a lymphocyte count exceeding 80 within the segmented TLSs were retained for the TLS ratio calculation.

Estimated TLS ratios correlate with B lymphocyte levels and CXCL13 expression across various TCGA tumor types

To evaluate the TLS ratios estimated by our approach, we first analyzed 74 ESCC and 936 NSCLC patients from the external TCGA. While mIHC data was unavailable for these patients, they had H&E WSIs along with RNA sequencing and DNA methylation data. We segmented TLSs (Supplementary Fig. 5) estimated TLS ratios from H&E WSIs, and compared them with molecular signatures reported to be correlated with TLSs (part III of Fig. 1). It has been shown that most tumor-infiltrating B lymphocytes aggregate inside TLSs⁶ and the number of B cells correlates with the number and area of TLSs⁴. By analyzing RNA sequencing data for gene expression levels and DNA methylation patterns, we were able to estimate the B cell percentages in the samples based on molecular signatures of B cell-specific genes. As expected, the estimated TLS ratios significantly correlated with the percentage of B lymphocytes in both ESCC (rho = 0.46, P value < 0.0001) (Fig. 2a) and NSCLC (rho = 0.26, P value < 0.0001) (Fig. 2b). TLS ratios also correlated with the expression of CXCL13, a chemokine associated with TLS formation³⁴, in ESCC (rho = 0.39, P value = 0.0062) (Fig. 2c) and NSCLC (rho = 0.31, P value < 0.0001) (Fig. 2d).

**Fig. 2: Correlation evaluations between the TLS ratios molecular signatures across tumor types from the TCGA.**

Since TLS morphology is similar across cancers, we tested our approach in 14 additional TCGA tumor types (Supplementary Fig. 6). Similarly, estimated TLS ratios significantly correlated with B cell levels (Fig. 2e, f and Supplementary Fig. 7) and CXCL13 expression in these cancers (Fig. 2g, h and Supplementary Fig. 8), suggesting broad applicability of our approach.

Higher TLS ratios are associated with extended survival across various TCGA tumor types

TLSs have been identified as a potential prognostic indicator across multiple tumor types¹. Thus, we explored the relationship between TLS ratios estimated from H&E WSIs and overall survival in various tumor types. Univariate survival analyses indicated that elevated TLS ratios correlated with prolonged overall survival in ESCC (hazard ratio [HR]: 0.28; 95% CI: 0.090–0.84; P value = 0.016) (Fig. 3a) and NSCLC (HR: 0.74; 95% CI: 0.57–0.95; P value = 0.019) (Fig. 3b) from TCGA. This was further validated in NSCLC cases from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) (HR: 0.40; 95% CI: 0.17-0.93; P value = 0.028) (Fig. 3c). Subsequent multivariate analysis, adjusting for age, sex, and TNM staging (depth of invasion, lymph node metastasis and distant metastasis), confirmed that the positive association of TLS ratios with increased overall survival remains statistically significant for TCGA-ESCC, and marginally significant for TCGA-NSCLC and CPTAC-NSCLC (Supplementary Table 4).

**Fig. 3: Overall survival outcomes stratified by the TLS ratios.**

In the other fourteen TCGA tumor types, ten of them also exhibited significant associations with univariate analyses (Supplementary Fig. 9, Supplementary Table 4). After adjusting for potential confounders, the associations remain significant for head and neck squamous cell carcinoma, prostate adenocarcinoma, colon and rectal cancer, and are marginally significant for liver hepatocellular carcinoma, skin cutaneous melanoma, pancreatic adenocarcinoma, testicular germ cell tumor (Supplementary Table 4). Moreover, the concordance index (C-index) values and P values obtained from the Cox regression models indicated that the inclusion of the TLS ratio with TNM staging significantly enhanced the models’ discriminative ability, outperforming the models that solely incorporated TNM staging, in 10 of 15 TCGA cancer types (Supplementary Table 5). Together, our findings underscore the potential of the TLS ratio as a prognostic biomarker in a range of solid tumors.

Higher TLS ratios predicted a positive immunotherapy response across multiple cohorts

Finally, we assessed the TLS ratio as a biomarker for predicting clinical response to immunotherapy (part IV in Fig. 1). We estimated TLS ratios from H&E-stained biopsied tumor tissues before immunotherapy treatment. In an ESCC cohort (n = 43) receiving anti-PD-1 monotherapy in trial NCT02742935, TLS ratios were significantly higher in responders (33%, n = 14) versus non-responders (67%, n = 29) (P value = 0.046) (Fig. 4a). In two NSCLC cohorts given anti-PD-1 plus chemotherapy (n = 56) or anti-PD-1 plus apatinib (an antiangiogenic agent) (n = 18), TLS ratios were also significantly higher in responders compared to non-responders (P value = 0.035 and 0.015, respectively) (Fig. 4b, c). In a STAD cohort (n = 23) given anti-PD-1 and chemoradiotherapy, higher TLS ratio also associated with better immunotherapy response (P value = 0.047) (Fig. 4d). Overall, these data indicate the TLS ratio assessed by our deep learning approach on standard H&E histopathology images may provide useful prognostic and predictive insights across multiple tumor types.

**Fig. 4: Assessment of the TLS ratios and immunotherapy outcomes in four independent cohorts.**

Discussion

Recently, several deep learning models have been developed towards automated segmentation of TLSs from H&E images in various tumor types, including lung cancer^35,36 and gastrointestinal cancers¹³. Wang et al. extended its application by quantifying TLS density in lung adenocarcinoma tissues and explored its prognostic value³⁶. Moreover, Rijthoven et al. introduced a multi-resolution strategy to segment and quantify TLSs, and applied these metrics as prognostic indicators in three distinct cancer types³⁷, highlighting the versatility of computational models in different cancer contexts. Unlike these studies that depended solely on pathologists’ manual annotations of TLS without mIHC guidance, our study leveraged mIHC markers—DAPI, CD3, and CD20—to identify TLSs, thus reducing the influence of subjective human judgment. The robustness of the model was assessed through both internal and external validation sets. Additionally, we developed a pipeline to calculate the TLS ratio, enabling the automatic quantification of this metric. By employing our pipeline to thousands of patients from various external data sets, we demonstrated that estimated TLS ratios significantly correlated with established TLS-associated molecular signatures, including B cell abundance and CXCL13 expression, suggesting the reliability of our approach to segment and quantify TLSs across multiple cancer types. More importantly, the derived TLS ratio holds promise as a robust pan-cancer biomarker that predicts prognosis and a positive immunotherapy response.

A major strength of our approach is the high-quality training dataset utilized, with TLS segmentation on H&E images verified through matched mIHC images and manually reviewed by experienced pathologists. This robust training process enabled the development of an automated, consistent model for precise TLS segmentation and quantification applicable to ubiquitous H&E slides, without reliance on specialized assays. Despite an imbalanced training dataset comprising sixty ESCC and five NSCLC tumor tissues, which might lead to better performance in ESCC, independent evaluation using the external validation set demonstrates that the IoU for each individual NSCLC case is still above 0.65 (Supplementary Fig. 4f), indicating satisfactory results. The strong correlation of estimated TLS ratios with established TLS biology across various TCGA tumor types provides confidence in its accuracy when applied to other cancer types. Moreover, this standardized segmentation methodology may also be generalizable for segmenting and quantifying TLSs in contexts beyond cancer, such as in autoimmune and infectious diseases. However, further benchmarking of the model’s TLS segmentation performance against mIHC in diverse diseases beyond ESCC and NSCLC would be valuable to formally validate its broader applicability.

TLSs are specialized lymphoid aggregates that often form in response to chronic inflammation². They structurally and functionally resemble secondary lymphoid organs, supporting germinal center reactions that enable B cell activation and differentiation into plasma cells². The presence of TLS has been linked to productive anti-tumor immunity in multiple cancers³⁸. TLSs indicate an ongoing immune response and appear associated with better prognosis and immunotherapy outcomes across multiple cancers². However, systematically evaluating TLSs currently requires multiplex imaging, which is resource-intensive and not widely available. Our study provides evidence that TLSs can be quantified through computational analysis of standard H&E histopathology images. Thus, it can be immediately applied to extract spatial and quantitative data on TLSs from abundant archival samples. Pairing these computationally derived TLS metrics with multi-omic data from the same samples provides an opportunity to uncover the molecular mechanisms governing TLS biology and its function in orchestrating anti-tumor immunity.

As immunotherapy expands, biomarkers to select patients and understand resistance mechanisms are urgently needed. Our data complements emerging evidence on the TLS ratio as an easily assessable pan-cancer biomarker predicting improved immunotherapy outcomes^2,4,17. However, whether TLS ratios correlate with other established biomarkers, such as microsatellite instability³⁹, PD-1/PD-L1 expression⁴⁰, and tumor mutation burden⁴¹, warrants further study. In addition, separating mature from immature TLSs in our computational analysis could provide further biological insights and potentially better predict immunotherapy response. Mature TLSs with developed germinal centers likely promote stronger anti-tumor immunity compared to immature ones^11,42. Dissecting these TLS subtypes may refine the utility of the TLS ratio as a predictive biomarker. Moreover, incorporating detailed clinical information about the cohorts analyzed and assessing relationships between computationally derived TLS metrics and other immunotherapy biomarkers could reveal if the TLS ratio gives orthogonal or synergistic value for predicting immunotherapy outcomes. This may enable improved response prediction compared to any single biomarker.

Another limitation of our study is that TLSs have complex 3D structures⁴³, whereas we analyzed single 2D histological sections, which may not fully recapitulate the entire TLS immunologic composition, especially in small biopsied samples. While this methodological constraint is common and affects many types of histopathological analyses, 2D histology remains the standard in clinical settings due to its accessibility and feasibility. Studies have indicated that certain 2D image features can serve as surrogates for their 3D counterparts^44,45, thus providing a feasible method for bridging the gap between practicality and accuracy. In fact, TLS areas calculated from 2D histological sections have been validated as biomarkers for prognosis and immunotherapy responses across various tumor types^13,46. To mitigate this issue and better represent the 3D nature of TLSs, we propose the use of multiple non-consecutive sections from the same tumor to quantify and average the TLS ratios. Assessing TLSs across multiple standard histology images with our deep learning-based approach provides a practical way to better approximate real 3D TLS distribution while still relying on routine histopathology protocols. Further study with multi-slide analysis is warranted to validate improved performance over single-slide quantification. Additionally, the direct measurement of the number of TLSs from 2D histological sections may represent another metric that warrants further investigation.

Overall, we present a practical deep learning-based approach to extract clinically useful insights from H&E histopathology images. The TLS ratio provides a potential biomarker to stratify patients and illuminate cancer biology. Quantitative spatial analyses of the immune context using standard-of-care specimens could open avenues to improve immunotherapy.

Methods

Patients and data collection

Data used for the development of the TLS segmentation model was collected from surgically resected tumor tissues acquired from two distinct groups at the Zhongshan Hospital of Fudan University, Shanghai, China. Group one comprised sixty patients with ESCC, who underwent 1–4 cycles (28 days per cycle) of immunotherapy with combined anti-PD-1 blockade and chemotherapy. Group two consisted of five NSCLC patients who received 2 cycles of combined anti-PD-1 blockade and chemotherapy, with each cycle also lasting 28 days. The clinical characteristics of both groups are detailed in Supplementary Table 1.

In evaluating the TLS ratios, we employed the publicly available TCGA dataset. This dataset we used encompasses 6140 patients possessing H&E WSIs, concomitant RNA sequencing, and DNA methylation data. Detailed inclusion and exclusion criteria for each tumor types are described in Supplementary Fig. 10. Sixteen distinct tumor types were examined to evaluate correlations between estimated TLS ratios, molecular signatures, and prognosis. NSCLC cases from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) used for the survival analysis encompass 960 H&E slides from 209 patients, which include both CPTAC-LSCC⁴⁷ and CPTAC-LUAD⁴⁸.

For evaluating the TLS ratio’s potential as an immunotherapy response predictor, we gathered data from four independent cohorts of ESCC, NSCLC, and STAD patients pre-therapy. The ESCC cohort (n = 43) was from a phase I clinical trial (NCT02742935)^49,50. These patients, resistant or intolerant to prior chemotherapy, underwent 4 cycles (28 days per cycle) of treatment with the anti-PD-1 blockade (SHR-1210) at the Cancer Hospital of the Chinese Academy of Medical Science, Beijing, China. Treatment commenced at 60 mg and escalated to 200 mg and 400 mg, continuing until disease progression or the onset of intolerable side effects. Biopsied tumor tissues were procured as formalin-fixed paraffin-embedded (FFPE) samples before immunotherapy treatment. The clinical response for each patient was evaluated after treatment based on Response Evaluation Criteria in Solid Tumors (RECIST) v1.1⁵¹. Responders were defined as patients diagnosed with a complete response, and partial response; and non-responders were defined as patients diagnosed with stable disease, and progressive disease.

Two retrospective observational cohorts of NSCLC patients were gathered from the Cancer Hospital of the Chinese Academy of Medical Science, Beijing, China, from December 2021 to January 2023. The first cohort consisted of 56 patients who underwent combined anti-PD-1 blockade and chemotherapy treatments. The second cohort comprised 18 patients treated with a combination of anti-PD-1 blockade (camrelizumab) and an antiangiogenic agent (apatinib). All NSCLC patients underwent two cycles of immunotherapy treatment, each lasting 28 days, followed by surgical resection of tumor tissues 1-month post-treatment. Prior to initiating immunotherapy, FFPE tumor tissues were biopsied and subjected to H&E staining. The post-treatment clinical response for individuals in both cohorts was determined by expert pathologists who assessed the pathological response on surgically resected tumor specimens. Responders were characterized as patients manifesting over 90% tumor reduction⁵².

Data pertaining to the STAD cohort (n = 23) was retrospectively acquired from the Neo-PLANET phase II trial (NCT03631615)⁵³, conducted at Zhongshan Hospital of Fudan University, Shanghai, China. Detailed inclusion and exclusion criteria are described in Supplementary Fig. 11. This investigation centered on immunotherapy, combining anti-PD-1 with concurrent chemoradiotherapy for patients with locally advanced adenocarcinoma of the stomach or gastroesophageal junction. The treatment protocol entailed the administration of anti-PD-1 blockade (capecitabine) at a dose of 850 mg/m² twice daily, paired with concurrent radiotherapy spanning five weeks. This regimen was sandwiched by a 21-day cycle featuring oxaliplatin at 130 mg/m² on day 1 and capecitabine at 1000 mg/m² twice daily from days 1 to 14. Chemotherapy was concurrently administered over five cycles, each spanning 21 days, followed by surgical intervention after completing the total 15-week treatment period. Before treatment, tumor specimens were acquired through gastroscopy biopsy and subsequently stained with H&E. The post-treatment clinical responses of these patients were determined based on the expert pathologists’ assessment of the surgically resected tumor tissues. Responders in this cohort were identified as patients with a residual tumor cell count under 10%⁵³. A detailed overview of these four cohorts is provided in Supplementary Table 3.

Every participant provided their informed written consent prior to their involvement in the study. All research procedures and protocols adhered to the principles set forth in the Declaration of Helsinki. Ethical approval for the study was granted by the Ethics Committees of the Cancer Hospital, Chinese Academy of Medical Science (Beijing, China), and the Zhongshan Hospital, Fudan University (Shanghai, China).

Collecting of WSIs

Surgically resected tumor tissues from 60 ESCC patients and 5 NSCLC patients, processed as FFPE, were sectioned into 4 µm slides. These were subsequently stained for multiplex immunohistochemistry (mIHC) using rabbit anti-human monoclonal CD3 antibody (ab16669, Abcam) and mouse anti-human monoclonal CD20 antibody (14-0202-82, eBioscience). Post staining, the slides were treated with fluorescence mounting medium and underwent multispectral imaging at 20× magnification (0.5 μm/pixel) on the Vectra Polaris image system (Perkin Elmer). The channels designated for imaging included Opal 520 for CD3, Opal 690 for CD20, and DAPI for nuclei. These captured WSIs were subsequently visualized using Phenocart (Perkin Elmer).

For H&E staining, consecutive FFPE slides were deparaffinized in xylene and rehydrated through graded ethanol solutions. Slides were stained with Mayer’s hematoxylin for 5 minutes, followed by washing in running tap water for 5–10 minutes. Slides were differentiated in 1% acid alcohol briefly and blued in 0.2% ammonia water or Scott’s solution. Eosin counterstain was applied for 2 minutes. Following staining, slides were dehydrated through 95% and absolute alcohol, cleared in xylene, and mounted with resinous mounting medium.

To increase the robustness of the deep learning model across various H&E staining conditions, additional H&E staining was performed on consecutive slides from a random selection of nine ESCC patients. This involved variations in both the duration of hematoxylin staining and the frequency/duration of eosin incubation. The staining conditions were as follows:

2 slides were stained with Mayer’s hematoxylin for 8 minutes, followed by two eosin incubations of 2 minutes each.
2 slides stained with Mayer’s hematoxylin for 3 minutes, followed by a single eosin incubation of 1 minute.
2 slides stained with Mayer’s hematoxylin for 3 minutes, followed by two eosin incubations of 2 minutes each.
3 slides stained with Mayer’s hematoxylin for 8 minutes, followed by a single eosin incubation of 1 minute.

H&E-stained slides were digitized using a Perkin Elmer scanner at a magnification of 20×, resulting in a resolution of 0.5 μm/pixel. Additionally, 22 H&E slides were imaged at a magnification of 20× (0.5 μm/pixel) using two alternative scanner brands (KFBIO and Olympus) (Supplementary Table 2). Typically, it took approximately 5 minutes to digitize a H&E WSI. A total of 96 H&E WSIs, paired with 65 corresponding mIHC WSIs, were generated and utilized for the development of the TLS segmentation model.

Processing of WSIs and TLS annotations in the internal data sets

TLS segmentation on the mIHC WSIs was conducted using the inForm image analysis software (Perkin Elmer). Briefly, all regions of interest (ROIs) spanning 930 μm × 697 μm, marked by aggregated lymphocytes based on CD3 and CD20 staining, were manually selected. The inForm software⁵⁴ was used for cell segmentation, with the positivity thresholds for each marker set and cataloged for subsequent analyses. Selected ROIs underwent manual TLS segmentation based on CD3 and CD20 staining and used to establish a TLS segmentation algorithm to include at least 50 CD3⁺ or CD20⁺ lymphocytes. After the completion of the TLS segmentation algorithm, the remaining ROIs were batch-processed in the inForm, segregating them into TLS and non-TLS areas. The segmented ROIs were then mapped back into the WSIs to generate a comprehensive TLS segmentation of the mIHC WSIs.

Using the mIHC WSIs as ground truth, we manually generated TLS segmentation masks on the H&E WSIs. Post segmentation, two experienced pathologists (YQW with 12 years’ experience and DXJ with 10 years’ experience) performed the validation on the TLSs segmentation masks of the H&E WSIs using their mIHC counterparts. Using the OpenSlide Python package, H&E WSIs at 20× magnification and their corresponding TLS segmentation were cropped into 512 × 512-pixel tiles (256 μm × 256 μm) using a sliding window approach, retaining a 50% overlap. Only tiles with a TLS segmentation area exceeding 40% were curated. A total of 22,497 such tiles and their corresponding TLS segmentation were extracted from the 96 H&E WSIs (Supplementary Fig. 1). These tiles were then randomly divided into internal training, validation, and test sets in a ratio of 7:1:2, as detailed in Supplementary Table 2.

To segment TLS using the deep learning model, we kept the magnification of WSIs consistently at 20×. Each H&E WSI at 20× magnification was cropped into 512 × 512-pixel tiles (without overlap). For 40× magnification WSIs, 1024 × 1024-pixel tiles were cropped first, and then downscaled to the 512 × 512-pixel resolution.

TLS annotations in the external validation set

From TCGA, we randomly selected five H&E-stained WSIs from ESCC and ten WSIs from NSCLC (including five lung adenocarcinoma and five lung squamous cell carcinoma). These WSIs were acquired either at 20× or 40× magnification and manually annotated by delineating the border of TLSs using the QuPath software⁵⁵. The TLS segmentation annotations on these WSIs were validated by two experienced pathologists, YQW and DXJ, each with over a decade of professional experience in their field. Following this validation, H&E WSIs, together with their TLS segmentation annotations, were cropped into a total of 667 non-overlapping tiles to constitute the external validation set. The tile sizes were determined by their original magnification, with tiles from 20× magnification WSIs sized at 512 × 512 pixels, and those from 40× magnification WSIs at 1024 × 1024 pixels.

Model development for TLS segmentation

TLSs are characterized by organized aggregations of T and B lymphocytes. Therefore, an optimal algorithm for TLS segmentation should capture the surrounding context of each cell to delineate a comprehensive TLS area. While numerous deep learning algorithms for medical image segmentation lean on UNet-like architectures, these often miss capturing pixel correlations across different channels due to their fusion of low-level textual and high-level semantic information. To address this, we adopted a previously described encoder-decoder model³¹, which incorporated two specially designed modules to capture contextual pixel correlations across various channels. Briefly, we chosen the EfficientNet-b0⁵⁶ as the backbone of the TLS segmentation model. We used the AdamW optimizer⁵⁷ to update the network parameters. We set the batch size to 64, the number of epochs to 100, and the learning rate and weight decay both to 1e-4, as described previously³¹. An early stopping operation was applied when the loss in the validation set did not decrease after 10 epochs. Both the internal training and validation sets were utilized exclusively for hyperparameter tuning. We adjusted the model parameters to achieve the best performance on the validation set. Once the optimal parameters were determined, the internal test set was then employed solely for the final evaluation of the model.

The performance of the TLS segmentation model was evaluated by the AUCs for the ROC curves in the internal training, validation, test sets, and external validation set (Supplementary Fig. 2). Briefly, we treated each image as a pixel-level binary classification task. Pixels identified as part of TLS were considered positive cases, while those not part of TLS were considered negative. We converted our model’s prediction probabilities into binary outcomes at various thresholds. This allowed us to calculate the True Positive Rate (TPR) and False Positive Rate (FPR), which facilitated the construction of the ROC curve and the computation of the AUC value.

In this study, pixels with a prediction probability for TLS segmentation above 0.5 were classified as part of the TLS area. The intersection over union was calculated by dividing the pixel count in the overlap between the predicted TLS area and the ground truth TLS area by the pixel count in the combined area of both. To compute the Dice Coefficient, we first doubled the pixel count in the intersection, then divided this by the total number of pixels present in both the predicted and the ground truth TLS area. The total TLS area used to compute the TLS ratio was the count of pixels predicted as belonging to the TLS area.

Deep learning pipeline for TLS ratio calculation

The deep learning pipeline comprised three distinct branches, as illustrated in part II of Fig. 1. In addition to the branch to determine segmented TLS area, the pipeline comprised two branches designed to determine the lymphocyte count and the tissue area, respectively.

Segmentation and quantification of lymphocytes were executed using the publicly available deep learning model, HoVer-Net³³. This model is adept at segmenting four distinct cell types from H&E WSIs, namely lymphocytes, macrophages, epithelial cells, neutrophils. We adopted the model pre-trained on the MoNuSAC2020 data sets⁵⁸ to segment and enumerate lymphocyte counts. Tiles of varying resolutions—either \(512\times 512\) pixels (0.5\(\mu m/\) pixel, at \(20\times\) magnification) or \(1024\times 1024\) pixels (0.25\(\mu m/\) pixel, at \(40\times\) magnification) were resized to a dimension of \(512\times 512\). A sliding window approach without overlap, measuring 256\(\times\)256 pixels, was then applied to segment cell instances. We noted a co-localization of segmented lymphocytes and TLSs (Supplementary Figs. 5 and 6), which emphasizes the model’s accuracy in detecting lymphocytes. By aggregating the results across all sliding windows, we enumerated the total lymphocyte count for each tile. In this study, tiles with a lymphocyte count exceeding 80 within the segmented TLSs were retained for the TLS ratio calculation.

Another branch to determine the tissue area employed the OTSU method³² from the OpenCV Python package to segment the tissue region from the non-tissue background. We applied various filters, including ‘filter_blue_pen’, ‘filter_green_pen’, and ‘filter_red_pen’ with default parameters from public codebase (https://github.com/deroneriksson/python-wsi-preprocessing), to eliminate annotations made using differently colored pens. The tissue area used to compute the TLS ratio was the count of pixels predicted as belonging to the tissue region. Tiles, wherein the segmented tissue area constitutes more than 10% of the entire tile area (equivalent to 26,214 pixels), were retained and processed further within the pipeline to compute the TLS ratio. For each WSI, the TLS ratio was derived by dividing the cumulative segmented TLS area by the total segmented tissue area. For subjects with more than one H&E WSIs, the TLS ratio was averaged among multiple WSIs.

Estimate the percentage of B lymphocytes

For each patient in the TCGA, estimated percentage of B lymphocytes was determined by multiplying the overall leukocyte fraction with the estimated B cell proportion. Using CIBERSORT, we estimated proportions for twelve major immune cells from RNA-seq data. These cells included naive and memory B cells, naive, resting, and activated memory CD4 T cells, among others⁵⁹. The estimated B cell proportion was a cumulative measure of naive, memory B cells, and plasma cells. The overall leukocyte fraction, derived from DNA methylation data, was obtained from the publicly released data⁶⁰.

Prognostic implications of TLS ratios in the TCGA and CPTAC

Upon estimating the TLS ratios for each patient in the TCGA, we utilized the surv_cutpoint function in the survminer R package to define the optimal cutoff, categorizing patients into high or low TLS ratio groups, in each cancer type⁶¹. This categorization was based on the highest standardized log-rank statistics. Both univariate and multivariate Cox regression analyses were conducted to evaluate the impact of TLS ratio categories on overall survival across various TCGA tumor types. Only patients who had complete data for adjusted variables, including sex (male versus female), age (above 60 years versus 60 years or below), and TNM staging, were included in the multivariate analysis (TCGA-SARC was excluded due to the lack of TNM staging). For both univariate and multivariate survival analyses in CPTAC-NSCLC, we used the optimal TLS ratio cutoff derived from the TCGA-NSCLC to stratify patients into high or low TLS ratio groups. 95% CIs were derived using the Wald test.

The C-indexes were determined in patients, who had complete data for TNM staging, across various TCGA tumor types. For these patients, C-indexes were calculated for three Cox regression models, each incorporating different sets of variables. The first model was based solely on the TLS ratio. The second model included the TNM staging, and the third model combined both TNM staging and the TLS ratio. A likelihood ratio test was performed to compare the nested Cox regression models, particularly between the second and the third models, to evaluate the incremental prognostic value of adding the TLS ratio to the conventional TNM staging in these tumor types.

Statistical analyses

Hypothesis tests used to calculate P values were specified at corresponding figure legends and tables. The deep learning model’s performance was assessed using metrics such as the intersection over union, Dice coefficient, and CI. Survival curves were generated using the Kaplan–Meier method and compared using the log-rank test. To calculate the AUC in the TLS segmentation model, the 95% CI was calculated using 500 bootstrap replicates. Spearman’s correlation coefficients were employed for the TCGA to correlate TLS ratios with molecular signatures (B lymphocyte levels and CXCL13 expression). A P value below 0.05 in a two-sided analysis was deemed significant. Analytical procedures were executed using Python (version 3.7.12), R (version 4.1.0), and the SciPy package (version 1.7.3)

Hardware and software configuration

Our computational endeavors were predominantly facilitated by the PyTorch package (version 1.10.0). OpenSlide (version 1.2.0) was used to interpret WSIs. To convert images from various scanners into the ‘svs’ format compatible with OpenSlide, we employed the Pathomation software (version 2.0.0). Prognostic analyses were conducted using R (version 4.1.0). The SciPy python package (version 1.7.3) was employed for statistical evaluations. The R package ggsurvfit (version 0.3.0) was used to plot overall survival curves. For graphical illustrations, including dot and box plots, Matplotlib (version 3.5.3) was utilized.

The design, training, and assessment of our deep learning model were executed on a workstation with dual NVIDIA A100 GPUs, an AMD EPYC 7763 CPU (64 cores, 3.5 GHz), and 520 GB of random-access memory (RAM). On average, it took about 20 minutes to calculate the TLS ratio from an H&E WSI in the current settings.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The diagnostic whole-slide data and overall survival information from the TCGA and corresponding labels are available from NIH Genomic Data Commons (https://portal.gdc.cancer.gov/). The CXCL13 expression data was obtained from UCSC XENA (https://xena.ucsc.edu/). The overall leukocyte fraction, derived from DNA methylation data, was obtained from a source (https://portal.gdc.cancer.gov/)⁶⁰. The data sets of four cohorts receiving ICB therapy and TLS segmentation data sets, including H&E and mIHC WSIs, are available from the corresponding author upon reasonable request. The CPTAC-LSCC⁴⁷ and CPTAC-LUAD⁴⁸ were downloaded from The Cancer Imaging Archive (https://www.cancerimagingarchive.net/).

Code availability

All the codes used in this work are available at: https://github.com/zonechen1994/AI_TLS_segmentation.

References

Schumacher, T. N. & Thommen, D. S. Tertiary lymphoid structures in cancer. Science 375, eabf9419 (2022).
Article CAS PubMed Google Scholar
Sautes-Fridman, C., Petitprez, F., Calderaro, J. & Fridman, W. H. Tertiary lymphoid structures in the era of cancer immunotherapy. Nat. Rev. Cancer 19, 307–325 (2019).
Article CAS PubMed Google Scholar
Helmink, B. A. et al. B cells and tertiary lymphoid structures promote immunotherapy response. Nature 577, 549–555 (2020).
Article CAS PubMed PubMed Central Google Scholar
Fridman, W. H. et al. B cells and tertiary lymphoid structures as determinants of tumour immune contexture and clinical outcome. Nat. Rev. Clin. Oncol. 19, 441–457 (2022).
Article CAS PubMed Google Scholar
Rodriguez, A. B. & Engelhard, V. H. Insights into tumor-associated tertiary lymphoid structures: novel targets for antitumor immunity and cancer immunotherapy. Cancer Immunol. Res. 8, 1338–1345 (2020).
Article CAS PubMed PubMed Central Google Scholar
Petitprez, F. et al. B cells are associated with survival and immunotherapy response in sarcoma. Nature 577, 556–560 (2020).
Article CAS PubMed Google Scholar
Cabrita, R. et al. Tertiary lymphoid structures improve immunotherapy and survival in melanoma. Nature 577, 561–565 (2020).
Article CAS PubMed Google Scholar
Italiano, A. et al. Pembrolizumab in soft-tissue sarcomas with tertiary lymphoid structures: a phase 2 PEMBROSARC trial cohort. Nat. Med. 28, 1199–1206 (2022).
Article CAS PubMed Google Scholar
Yu, A. et al. The prognostic value of the tertiary lymphoid structure in gastrointestinal cancers. Front. Immunol. 14, 1256355 (2023).
Article CAS PubMed PubMed Central Google Scholar
Sun, H. et al. Prognostic value of tertiary lymphoid structures (TLS) in digestive system cancers: a systematic review and meta-analysis. BMC Cancer 23, 1248 (2023).
Article PubMed PubMed Central Google Scholar
Vanhersecke, L. et al. Mature tertiary lymphoid structures predict immune checkpoint inhibitor efficacy in solid tumors independently of PD-L1 expression. Nat. Cancer 2, 794–802 (2021).
Article CAS PubMed PubMed Central Google Scholar
Goff, P. H. et al. Neoadjuvant therapy induces a potent immune response to sarcoma, dominated by myeloid and B cells. Clin. Cancer Res. 28, 1701–1711 (2022).
Article CAS PubMed PubMed Central Google Scholar
Li, Z. et al. Development and validation of a machine learning model for detection and classification of tertiary lymphoid structures in gastrointestinal cancers. JAMA Netw. Open 6, e2252553 (2023).
Article PubMed PubMed Central Google Scholar
Ling, Y. et al. The prognostic value and molecular properties of tertiary lymphoid structures in oesophageal squamous cell carcinoma. Clin. Transl. Med. 12, e1074 (2022).
Article CAS PubMed PubMed Central Google Scholar
Bulten, W. et al. Automated deep-learning system for Gleason grading of prostate cancer using biopsies: a diagnostic study. Lancet Oncol. 21, 233–241 (2020).
Article PubMed Google Scholar
Wang, Y. et al. Improved breast cancer histological grading using deep learning. Ann. Oncol. 33, 89–98 (2022).
Article CAS PubMed Google Scholar
Crombe, A., Roulleau-Dugage, M. & Italiano, A. The diagnosis, classification, and treatment of sarcoma in this era of artificial intelligence and immunotherapy. Cancer Commun. (Lond.) 42, 1288–1313 (2022).
Article PubMed Google Scholar
Bera, K., Schalper, K. A., Rimm, D. L., Velcheti, V. & Madabhushi, A. Artificial intelligence in digital pathology - new tools for diagnosis and precision oncology. Nat. Rev. Clin. Oncol. 16, 703–715 (2019).
Article PubMed PubMed Central Google Scholar
Lee, Y. et al. Derivation of prognostic contextual histopathological features from whole-slide images of tumours via graph deep learning. Nat. Biomed. Eng. https://doi.org/10.1038/s41551-022-00923-0 (2022).
Skrede, O. J. et al. Deep learning for prediction of colorectal cancer outcome: a discovery and validation study. Lancet 395, 350–360 (2020).
Article CAS PubMed Google Scholar
Wang, S. et al. Computational staining of pathology images to study the tumor microenvironment in lung cancer. Cancer Res. 80, 2056–2066 (2020).
Article CAS PubMed PubMed Central Google Scholar
Rakaee, M. et al. Association of machine learning-based assessment of tumor-infiltrating lymphocytes on standard histologic images with outcomes of immunotherapy in patients with NSCLC. JAMA Oncol. 9, 51–60 (2023).
Article PubMed Google Scholar
Johannet, P. et al. Using machine learning algorithms to predict immunotherapy response in patients with advanced melanoma. Clin. Cancer Res. 27, 131–140 (2021).
Article CAS PubMed Google Scholar
Jiang, Y. et al. Biology-guided deep learning predicts prognosis and cancer immunotherapy response. Nat. Commun. 14, 5135 (2023).
Article CAS PubMed PubMed Central Google Scholar
Shamai, G. et al. Deep learning-based image analysis predicts PD-L1 status from H&E-stained histopathology images in breast cancer. Nat. Commun. 13, 6753 (2022).
Article CAS PubMed PubMed Central Google Scholar
Shamai, G. et al. Artificial intelligence algorithms to assess hormonal status from tissue microarrays in patients with breast cancer. JAMA Netw. Open 2, e197700 (2019).
Article PubMed PubMed Central Google Scholar
Coudray, N. et al. Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning. Nat. Med. 24, 1559–1567 (2018).
Article CAS PubMed PubMed Central Google Scholar
Liao, H. et al. Deep learning-based classification and mutation prediction from histopathological images of hepatocellular carcinoma. Clin. Transl. Med. 10, e102 (2020).
Article PubMed PubMed Central Google Scholar
Li, D. et al. A deep learning diagnostic platform for diffuse large B-cell lymphoma with high accuracy across multiple hospitals. Nat. Commun. 11, 6004 (2020).
Article CAS PubMed PubMed Central Google Scholar
Xu, Y. et al. Large scale tissue histopathology image classification, segmentation, and visualization via deep convolutional activation features. BMC Bioinformatics 18, 281 (2017).
Article PubMed PubMed Central Google Scholar
Chen, Z., Wang, K. & Liu, Y. Efficient polyp segmentation via integrity learning. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 1826–1830 (2024).
Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9, 62–66 (1979).
Article Google Scholar
Graham, S. et al. Hover-Net: simultaneous segmentation and classification of nuclei in multi-tissue histology images. Med Image Anal. 58, 101563 (2019).
Article PubMed Google Scholar
Rouanne, M., Arpaia, N. & Marabelle, A. CXCL13 shapes tertiary lymphoid structures and promotes response to immunotherapy in bladder cancer. Eur. J. Cancer 151, 245–248 (2021).
Article PubMed Google Scholar
Barmpoutis, P. et al. Tertiary lymphoid structures (TLS) identification and density assessment on H&E-stained digital slides of lung cancer. PLoS One 16, e0256907 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. Computerized tertiary lymphoid structures density on H&E-images is a prognostic biomarker in resectable lung adenocarcinoma. iScience 26, 107635 (2023).
Article CAS PubMed PubMed Central Google Scholar
van Rijthoven, M. et al. Multi-resolution deep learning characterizes tertiary lymphoid structures and their prognostic relevance in solid tumors. Commun. Med. (Lond.) 4, 5 (2024).
Article PubMed Google Scholar
Meylan, M. et al. Tertiary lymphoid structures generate and propagate anti-tumor antibody-producing plasma cells in renal cell cancer. Immunity 55, 527–541.e5 (2022).
Article CAS PubMed Google Scholar
Florou, V. et al. Real-world pan-cancer landscape of frameshift mutations and their role in predicting responses to immune checkpoint inhibitors in cancers with low tumor mutational burden. J. Immunother. Cancer 11, e007440 (2023).
Lu, S. et al. Comparison of biomarker modalities for predicting response to PD-1/PD-L1 checkpoint blockade: a systematic review and meta-analysis. JAMA Oncol. 5, 1195–1204 (2019).
Article PubMed PubMed Central Google Scholar
Chan, T. A. et al. Development of tumor mutation burden as an immunotherapy biomarker: utility for the oncology clinic. Ann. Oncol. 30, 44–56 (2019).
Article CAS PubMed Google Scholar
Brunet, M. et al. Mature tertiary lymphoid structure is a specific biomarker of cancer immunotherapy and does not predict outcome to chemotherapy in non-small-cell lung cancer. Ann. Oncol. 33, 1084–1085 (2022).
Article CAS PubMed Google Scholar
Mai, H. et al. Whole-body cellular mapping in mouse using standard IgG antibodies. Nat. Biotechnol. https://doi.org/10.1038/s41587-023-01846-0 (2023).
Lee, K. H. et al. Correlation between the size of the solid component on thin-section CT and the invasive component on pathology in small lung adenocarcinomas manifesting as ground-glass nodules. J. Thorac. Oncol. 9, 74–82 (2014).
Article PubMed Google Scholar
Yang, W. T., Tse, G. M., Lam, P. K., Metreweli, C. & Chang, J. Correlation between color power Doppler sonographic measurement of breast tumor vasculature and immunohistochemical analysis of microvessel density for the quantitation of angiogenesis. J. Ultrasound Med. 21, 1227–1235 (2002).
Article CAS PubMed Google Scholar
van Dijk, N. et al. Preoperative ipilimumab plus nivolumab in locoregionally advanced urothelial cancer: the NABUCCO trial. Nat. Med. 26, 1839–1844 (2020).
Article PubMed Google Scholar
Consortium, N.C.I.C.P.T.A. The clinical proteomic tumor analysis consortium lung squamous cell carcinoma collection (CPTAC-LSCC), (The Cancer Imaging Archive, 2018).
Consortium, N.C.I.C.P.T.A. The clinical proteomic tumor analysis consortium lung adenocarcinoma collection (CPTAC-LUAD), (2018).
Liu, Z. et al. Integrated multi-omics profiling yields a clinically relevant molecular classification for esophageal squamous cell carcinoma. Cancer Cell 41, 181–195.e9 (2023).
Article CAS PubMed Google Scholar
Huang, J. et al. Safety, activity, and biomarkers of SHR-1210, an anti-PD-1 antibody, for patients with advanced esophageal carcinoma. Clin. Cancer Res. 24, 1296–1304 (2018).
Article CAS PubMed Google Scholar
Eisenhauer, E. A. et al. New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur. J. Cancer 45, 228–247 (2009).
Article CAS PubMed Google Scholar
Vos, J. L. et al. Neoadjuvant immunotherapy with nivolumab and ipilimumab induces major pathological responses in patients with head and neck squamous cell carcinoma. Nat. Commun. 12, 7348 (2021).
Article CAS PubMed PubMed Central Google Scholar
Tang, Z. et al. The Neo-PLANET phase II trial of neoadjuvant camrelizumab plus concurrent chemoradiotherapy in locally advanced adenocarcinoma of stomach or gastroesophageal junction. Nat. Commun. 13, 6807 (2022).
Article CAS PubMed PubMed Central Google Scholar
Kramer, A. S. et al. InForm software: a semi-automated research tool to identify presumptive human hepatic progenitor cells, and other histological features of pathological significance. Sci. Rep. 8, 3418 (2018).
Article PubMed PubMed Central Google Scholar
Bankhead, P. et al. QuPath: open source software for digital pathology image analysis. Sci. Rep. 7, 16878 (2017).
Article PubMed PubMed Central Google Scholar
Tan, M. & Le, Q. V. EfficientNet: rethinking model scaling for convolutional neural networks. In: Proceedings of the 36th international conference on machine learning, https://arxiv.org/abs/1905.11946 (2019).
Loshchilov, I. & Hutter, F. Decoupled Weight Decay Regularization. In: 7th international conference on learning representations, https://arxiv.org/abs/1711.05101 (2019).
Verma, R. et al. MoNuSAC2020: a multi-organ nuclei segmentation and classification challenge. IEEE Trans. Med. Imaging 40, 3413–3423 (2021).
Article PubMed Google Scholar
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453–457 (2015).
Article CAS PubMed PubMed Central Google Scholar
Thorsson, V. et al. The immune landscape of cancer. Immunity 51, 411–412 (2019).
Article CAS PubMed Google Scholar
Chen, D. et al. Prognostic and predictive value of a pathomics signature in gastric cancer. Nat. Commun. 13, 6903 (2022).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This study was supported by funding from National Key R&D Program of China (2021YFC2500900, 2021YFC2501004 and 2021YFC2701001); the National Natural Science Foundation of China (82171837); the Chinese Academy of Medical Sciences (CAMS) Innovation Fund for Medical Sciences (CIFMS) (2021-I2M-1-067; 2021-1-I2M-018); Non-profit central research institute fund of Chinese Academy of Medical Sciences (2022-RC310-08); the Science and Technology Commission of Shanghai Municipality (23JS1400400); and Shanghai Municipal Science and Technology Major Project (2017SHZDZX01 and 2018SHZDZX01) and ZJLab.

Author information

These authors contributed equally: Ziqiang Chen, Xiaobing Wang, Zelin Jin, Bosen Li.

Authors and Affiliations

MOE Key Laboratory of Metabolism and Molecular Medicine, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences and Shanghai Xuhui Central Hospital, Fudan University, Shanghai, China
Ziqiang Chen, Zelin Jin, Mengping Jiang, Dandan Zhang, Yicheng Lin, Weida Meng & Yun Liu
State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institutes of Brain Science, Fudan University, Shanghai, China
Ziqiang Chen, Zelin Jin, Mengping Jiang, Dandan Zhang, Yicheng Lin, Weida Meng & Yun Liu
State Key Laboratory of Molecular Oncology, National Cancer Center, National Clinical Research Center for Cancer, Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Xiaobing Wang, Yahui Zhao, Liping Jiang, Chenxi Wang, Yuchen Jiao & Zhihua Liu
Department of General Surgery/Gastric Cancer Center, Zhongshan Hospital, Fudan University, Shanghai, China
Bosen Li & Xuefei Wang
Department of Pathology, Zhongshan Hospital, Fudan University, Shanghai, China
Dongxian Jiang & Yingyong Hou
Departments of Pathology, International Peace Maternity and Child Health Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
Yanqiu Wang
Department of Pathology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Pei Yuan & Jianming Ying
Thoracic Surgery Department, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Feiyue Feng
Division of Rheumatology and Immunology, Huashan Hospital, Fudan University, Shanghai, China
Wenjing Ye
Departments of Thoracic Surgery, Fudan University Shanghai Cancer Center, Shanghai, China
Jie Wang
Shanghai Xuhui Central Hospital, Shanghai, China
Wenqing Qiu & Houbao Liu
Department of General Surgery/Biliary Tract Disease Center, Zhongshan Hospital, Fudan University, Shanghai, China
Houbao Liu
Department of Pathology, Fudan University Shanghai Cancer Center, Shanghai, China; Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, China; Institute of Pathology, Fudan University, Shanghai, China
Dan Huang
Department of General Surgery, Zhongshan Hospital (Xiamen), Fudan University, Shanghai, China
Xuefei Wang

Authors

Ziqiang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaobing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zelin Jin
View author publications
You can also search for this author in PubMed Google Scholar
Bosen Li
View author publications
You can also search for this author in PubMed Google Scholar
Dongxian Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Yanqiu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Mengping Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Dandan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Pei Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Yahui Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Feiyue Feng
View author publications
You can also search for this author in PubMed Google Scholar
Yicheng Lin
View author publications
You can also search for this author in PubMed Google Scholar
Liping Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Chenxi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Weida Meng
View author publications
You can also search for this author in PubMed Google Scholar
Wenjing Ye
View author publications
You can also search for this author in PubMed Google Scholar
Jie Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wenqing Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Houbao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Dan Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yingyong Hou
View author publications
You can also search for this author in PubMed Google Scholar
Xuefei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yuchen Jiao
View author publications
You can also search for this author in PubMed Google Scholar
Jianming Ying
View author publications
You can also search for this author in PubMed Google Scholar
Zhihua Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yun Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: Yun Liu, Z.L., J.Y., and Y.J.; validation: Xiaobing Wang, Z.J., B.L., D.J., Y.W., M.J., and J.W.; methodology, Z.C., Yicheng Lin, W.M. and Yun Liu; formal analysis, Z.C.; investigation, Z.C.; resources, Y.Z., F.F., L.J., C.W., W.Y., W.Q., and H.L.; data curation, Z.C., Z.J., D.J., Y.W., D.Z., P.Y., Y.H., and Xuefei Wang; writing—original draft, Z.C. and Yun Liu; writing—review & editing, D.H., Z.C., and Yun Liu; funding acquisition, Y.J. and Yun Liu; supervision, Yun Liu, Z.L., J.Y. and Y.J. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Yuchen Jiao, Jianming Ying, Zhihua Liu or Yun Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

REPORTING SUMMARY

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chen, Z., Wang, X., Jin, Z. et al. Deep learning on tertiary lymphoid structures in hematoxylin-eosin predicts cancer prognosis and immunotherapy response. npj Precis. Onc. 8, 73 (2024). https://doi.org/10.1038/s41698-024-00579-w

Download citation

Received: 04 December 2023
Accepted: 14 March 2024
Published: 22 March 2024
DOI: https://doi.org/10.1038/s41698-024-00579-w

Subjects

Abstract

Similar content being viewed by others

Multi-resolution deep learning characterizes tertiary lymphoid structures and their prognostic relevance in solid tumors

Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images

Deep learning model improves tumor-infiltrating lymphocyte evaluation and therapeutic response prediction in breast cancer

Introduction

Results

Data collection and development of the TLS segmentation model

Deep learning pipeline for TLS ratio calculation

Estimated TLS ratios correlate with B lymphocyte levels and CXCL13 expression across various TCGA tumor types

Higher TLS ratios are associated with extended survival across various TCGA tumor types

Higher TLS ratios predicted a positive immunotherapy response across multiple cohorts

Discussion

Methods

Patients and data collection

Collecting of WSIs

Processing of WSIs and TLS annotations in the internal data sets

TLS annotations in the external validation set

Model development for TLS segmentation

Deep learning pipeline for TLS ratio calculation

Estimate the percentage of B lymphocytes

Prognostic implications of TLS ratios in the TCGA and CPTAC

Statistical analyses

Hardware and software configuration

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

REPORTING SUMMARY

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links