Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

A transformer-based weakly supervised computational pathology method for clinical-grade diagnosis and molecular marker discovery of gliomas

Abstract

The complex diagnostic criteria for gliomas pose great challenges for making accurate diagnoses with computational pathology methods. There are no in-depth analyses of the accuracy, reliability and auxiliary capability of present approaches from a clinical perspective. Previous studies have overlooked the exploration of molecular and morphological correlations. To overcome these limitations, we propose ROAM, a multiple-instance learning model based on large regions of interest and a pyramid transformer. ROAM enlarges regions of interest to facilitate the consideration of tissue contexts. It utilizes the pyramid transformer to model both intrascale and interscale correlations of morphological features and leverages class-specific multiple-instance learning based on attention to extract slide-level visual representations that can be used to diagnose gliomas. Through comprehensive experiments on both in-house and external glioma datasets, we demonstrate that ROAM can automatically capture key morphological features consistent with the experience of pathologists and thus provide accurate, reliable and adaptable clinical-grade diagnoses of gliomas. Moreover, ROAM has clinical value for auxiliary diagnoses and could pave the way for the study of molecular and morphological correlations.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Overview of the basic framework and architecture of ROAM.
Fig. 2: Performance of ROAM and baseline methods in glioma diagnosis on in-house data.
Fig. 3: Performance of ROAM and baseline methods when generalized to independent external data.
Fig. 4: Visualization for glioma detection and subtyping.
Fig. 5: Performance of the cascade diagnostic system and auxiliary diagnosis.
Fig. 6: Molecular and morphological biomarker discovery through human–AI collaboration.

Similar content being viewed by others

Data availability

The TCGA public glioma WSI dataset is available from the National Institutes of Health’s genomic data commons (https://portal.gdc.cancer.gov). Detailed information about the dataset, including sample IDs and labels, can be obtained from Zenodo via https://doi.org/10.5281/zenodo.11469546 (ref. 43). The Xiangya glioma WSI dataset is not publicly available in accordance with institutional requirements governing the protection of the privacy of human subjects. The in-house Xiangya dataset is not publicly available due to privacy concerns regarding patient information.

Code availability

The ROAM project, including detailed documents and instructions, is available on GitHub (https://github.com/whiteyunjie/ROAM)44. The source code is also available on Zenodo via https://doi.org/10.5281/zenodo.11469423 (ref. 45).

References

  1. Ostrom, Q. T. et al. The epidemiology of glioma in adults: a ‘state of the science’ review. Neuro-oncology 16, 896–913 (2014).

    Article  Google Scholar 

  2. Weller, M. et al. Glioma. Nat. Rev. Dis. Primers 1, 15017 (2015).

  3. Chen, R., Smith-Cohn, M., Cohen, A. L. & Colman, H. Glioma subclassifications and their clinical significance. Neurotherapeutics 14, 284–297 (2017).

    Article  Google Scholar 

  4. Louis, D. N. et al. The 2021 WHO classification of tumors of the central nervous system: a summary. Neuro-oncology 23, 1231–1251 (2021).

    Article  Google Scholar 

  5. Horbinski, C. et al. NCCN guidelines® insights: central nervous system cancers, version 2.2022: featured updates to the NCCN guidelines. J. Natl Compr. Cancer Netw. 21, 12–20 (2023).

    Article  Google Scholar 

  6. Van Den Bent, M. J. Interobserver variation of the histopathological diagnosis in clinical trials on glioma: a clinician’s perspective. Acta Neuropathol. 120, 297–304 (2010).

    Article  Google Scholar 

  7. Ker, J., Bai, Y. Q., Lee, H. Y., Rao, J. & Wang, L. P. Automated brain histology classification using machine learning. J. Clin. Neurosci. 66, 239–245 (2019).

    Article  Google Scholar 

  8. Bera, K., Schalper, K. A., Rimm, D. L., Velcheti, V. & Madabhushi, A. Artificial intelligence in digital pathology – new tools for diagnosis and precision oncology. Nat. Rev. Clin. Oncol. 16, 703–715 (2019).

    Article  Google Scholar 

  9. Niazi, M. K. K., Parwani, A. V. & Gurcan, M. N. Digital pathology and artificial intelligence. Lancet Oncol. 20, e253–e261 (2019).

    Article  Google Scholar 

  10. Echle, A. et al. Deep learning in cancer pathology: a new generation of clinical biomarkers. Br. J. Cancer 124, 686–696 (2021).

    Article  Google Scholar 

  11. Mobadersany, P. et al. Predicting cancer outcomes from histology and genomics using convolutional networks. Proc. Natl Acad. Sci. USA 115, E2970–E2979 (2018).

    Article  Google Scholar 

  12. Ertosun, M. G. & Rubin, D. L. Automated grading of gliomas using deep learning in digital pathology images: a modular approach with ensemble of convolutional neural networks. AMIA Annu. Symp. Proc. Arch. 2015, 1899–1908 (2015).

    Google Scholar 

  13. Jin, L. et al. Artificial intelligence neuropathologist for glioma classification using deep learning on hematoxylin and eosin stained slide images and molecular markers. Neuro-oncology 23, 44–52 (2021).

    Article  Google Scholar 

  14. Im, S. et al. Classification of diffuse glioma subtype from clinical-grade pathological images using deep transfer learning. Sensors 21, 3500 (2021).

    Article  Google Scholar 

  15. Ocampo, P. et al. Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning. J. Thorac. Oncol. 13, S562 (2018).

    Article  Google Scholar 

  16. Hsu, W. W. et al. A weakly supervised deep learninÿg-based method for glioma subtype classification using WSI and mpMRIs. Sci. Rep. 12, 6111 (2022).

    Article  Google Scholar 

  17. Campanella, G. et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat. Med. 25, 1301–1309 (2019).

    Article  Google Scholar 

  18. Lu, M. Y. et al. Data-efficient and weakly supervised computational pathology on whole-slide images. Nat. Biomed. Eng. 5, 555–570 (2021).

    Article  Google Scholar 

  19. Shao, Z. C. et al. TransMIL: transformer based correlated multiple instance learning for whole slide image classification. In Proc. 35th Conference on Neural Information Processing Systems (eds Ranzato, M. et al.) 2136–2147 (Curran Associates, 2021).

  20. Li, B., Li, Y. & Eliceiri, K. W. Dual-stream multiple instance learning network for whole slide image classification with self-supervised contrastive learning. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 14318–14323 (IEEE, 2021).

  21. Zhang, H. R. et al. DTFD-MIL: double-tier feature distillation multiple instance learning for histopathology whole slide image classification. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 18802–18812 (IEEE, 2022).

  22. Zhang, J. et al. Attention multiple instance learning with transformer aggregation for breast cancer whole slide image classification. In Proc. 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 1804–1809 (IEEE, 2022).

  23. Chen, R. J. et al. Scaling vision transformers to gigapixel images via hierarchical self-supervised learning. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition 16144–16155 (IEEE, 2022).

  24. Hou, W. et al. H2-MIL: exploring hierarchical representation with heterogeneous multiple instance learning for whole slide image analysis. In Proc. AAAI Conference on Artificial Intelligence 933–941 (AAAI Press, 2022).

  25. Zheng, Y. et al. A graph-transformer for whole slide image classification. IEEE Trans. Med. Imaging 41, 3003–3015 (2022).

    Article  Google Scholar 

  26. Lee, Y. et al. Derivation of prognostic contextual histopathological features from whole-slide images of tumours via graph deep learning. Nat. Biomed. Eng. https://doi.org/10.1038/s41551-022-00923-0 (2022).

  27. Chefer, H., Gur, S. & Wolf, L. Transformer interpretability beyond attention visualization. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 782–791 (IEEE, 2021).

  28. Dosovitskiy, A. et al. An image is worth 16×16 words: transformers for image recognition at scale. Preprint at https://arxiv.org/abs/2010.11929 (2020).

  29. Vaswani, A. et al. Attention is all you need. In Proc. 30th Conference on Neural Information Processing Systems 5998–6008 (eds Guyon, I. et al.) (Curran Associates, 2017).

  30. Liu, Z. et al. Swin transformer: hierarchical vision transformer using shifted windows. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 10012–10022 (IEEE, 2021).

  31. Ilse, M., Tomczak, J. & Welling, M. Attention-based deep multiple instance learning. In Proc. 35th International Conference on Machine Learning (eds Dy, J. & Krause, A.) 2127–2136 (PMLR, 2018).

  32. He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 770–778 (IEEE, 2016).

  33. Russakovsky, O. et al. Imagenet large scale visual recognition challenge. Int. J. Comput. Vis. 115, 211–252 (2015).

    Article  MathSciNet  Google Scholar 

  34. Louis, D. N. et al. The 2016 World Health Organization classification of tumors of the central nervous system: a summary. Acta Neuropathol. 131, 803–820 (2016).

    Article  Google Scholar 

  35. Wang, X. et al. RetCCL: clustering-guided contrastive learning for whole-slide image retrieval. Med. Image Anal. 83, 102645 (2023).

    Article  Google Scholar 

  36. Yang, P. et al. CS-CO: a hybrid self-supervised visual representation learning method for H&E-stained histopathological images. Med. Image Anal. 81, 102539 (2022).

    Article  Google Scholar 

  37. Wang, X. et al. Transformer-based unsupervised contrastive learning for histopathological image classification. Med. Image Anal. 81, 102559 (2022).

    Article  Google Scholar 

  38. Ciga, O., Xu, T. & Martel, A. L. Self supervised contrastive learning for digital histopathology. Mach. Learn. Appl. 7, 100198 (2022).

    Google Scholar 

  39. Bach, S. et al. On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE 10, e0130140 (2015).

    Article  Google Scholar 

  40. Selvaraju, R. R. et al. Grad-cam: visual explanations from deep networks via gradient-based localization. In Proc. IEEE Conference on Computer Vision 618–626 (IEEE, 2017).

  41. Vahadane, A. et al. Structure-preserving color normalization and sparse stain separation for histological images. IEEE Trans. Med. Imaging 35, 1962–1971 (2016).

    Article  Google Scholar 

  42. Carpenter, J. & Bithell, J. Bootstrap confidence intervals: when, which, what? A practical guide for medical statisticians. Stat. Med. 19, 1141–1164 (2000).

    Article  Google Scholar 

  43. Yin, X. TCGA glioma WSI dataset used for evaluating ROAM. Zenodo https://doi.org/10.5281/zenodo.11469546 (2024).

  44. Yin, X. ROAM. GitHub https://github.com/whiteyunjie/ROAM (2024).

  45. Yin, X. A transformer-based weakly supervised computational pathology method for clinical-grade diagnosis and molecular state revelation of gliomas (v1.0.0). Zenodo https://doi.org/10.5281/zenodo.11469423 (2024).

Download references

Acknowledgements

This work was supported by the National Key Research and Development Program of China (Grant Nos. 2023YFF1204802 and 2021YFF1200902 to R.J. and 2022YFF1202403 to H.L.) and the National Natural Science Foundation of China (Grant Nos. 62273194 to R.J. and 62250005 to X.Z.).

Author information

Authors and Affiliations

Authors

Contributions

R.J., X.Z., Z.H. and H.L. conceived the study and supervised the research. R.J., X.Y. and P.Y. designed, implemented and validated the ROAM project. Y.W., J.H., J.Y., Z.H. and L.C. collected, curated and annotated the data and helped with analysing the results. F.C. provided technical support for the online platform. X.F., L.S., L.L., W.L., Z.H., L.C., Y.W. and H.Z. participated in the clinical-grade assessment of ROAM and the auxiliary diagnosis experiments. Z.H. and L.C. participated in the summarization and discovery of molecular and morphological biomarkers. R.J., X.Y., P.Y. and H.L. wrote the paper. L.C., R.J., X.Z., Z.H. and H.L. provided valuable comments on the paper.

Corresponding authors

Correspondence to Xuegong Zhang, Zhongliang Hu or Hairong Lv.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Machine Intelligence thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Workflow of the Pyramid Transformer and ROI-level supervision.

a-b, the implementation process of the multi-scale self-attention module that incorporates visual features of ROI at 10× with features of ROI at 5×. Patch tokens with the same numerical ID represent that their corresponding tissue regions locate at the identity location within ROI, differing only in their magnification scales. c, the process of instance-level supervision. The top k ROIs with the highest attention scores are assigned the same label as their corresponding slide. Classification is performed using an ROI-level classifier, and the instance-level loss is combined with the slice-level loss to train the entire model.

Extended Data Fig. 2 Overview of the glioma data and the cascade diagnostic system.

a, Detailed categories and crucial molecular features of glioma. Glioma can be broadly classified into normal, gliosis and tumor. The subtypes of glioma include astrocytoma, oligodendroglioma, and ependymoma, each having various grades. IDH mutation, and MGMT promoter methylation are two of the most critical molecular features associated with glioma diagnosis. b, Information of the in-house Xiangya glioma dataset. The dataset consists of a total of 1109 slides and supports 7 distinct classification tasks based on the provided labels. c, Details of the external TCGA glioma dataset. The dataset consists of 618 slides and supports 4 external classification tasks. d, The cascade diagnostic system of glioma based on ROAM. For instance, in the case of astrocytoma, the final diagnosis involves a sequential process utilizing 3 diagnostic models: model 1, model 2 and model 3.

Extended Data Fig. 3 Cross-validation of ROAM and baseline methods in glioma diagnosis without the use of the ensemble strategy.

Tasks of glioma diagnosis include glioma detection (a,f,k, n = 222), glioma subtyping (b,g,l, n = 193), astrocytoma grading (c,h,m, n = 106), oligodendroglioma grading (d,i,n, n = 45), and ependymoma grading (e,j,o, n = 43). a-o, Results are derived from 5-fold cross-validation experiments on the Xiangya dataset. Models are trained without the use of the ensemble strategy. a-e, Receiver operating characteristic (ROC) curves and the corresponding area under the curves (AUC±s.d). The confidence bond shows ± 1 s.d for a curve. Inserts: zoomed-in view of the curves. f-j, Averaged Accuracy, macro recall, macro precision, and macro F1-score. The metrics are plotted using box plot and each box ranges from the upper to lower quartiles with the median as the horizontal line, whiskers extend to 1.5 times the interquartile range, and diamond points represent outliers. All the legends are consistent with the legend in f. k-o, Mean normalized confusion matrices of ROAM.

Extended Data Fig. 4 Performance of ROAM and baseline methods in the prediction of molecular features.

Tasks include the prediction of IDH mutation (a, c, e, n = 216) and that of MGMT promoter methylation (b, d, f, n = 218). a-f, Results are derived from five ensemble models that are trained using the in-house training dataset and tested on the in-house test dataset, both split from the Xiangya dataset at the ratio of 2:1. a-b, Receiver operating characteristic (ROC) curves and the corresponding area under the curves (AUC±s.d). The confidence bond shows ± 1 s.d for a curve. Inserts: zoomed-in view of the curves. c-d, Accuracy, macro recall, macro precision, and macro F1-score. The metrics are plotted using box plot and each box ranges from the upper to lower quartiles with the median as the horizontal line, whiskers extend to 1.5 times the interquartile range, and diamond points represent outliers. e-f, Mean normalized confusion matrices of ROAM.

Extended Data Fig. 5 Clinical-grade performance in the overall cascade diagnostic task.

a, Performance of the cascade diagnostic system and the five pathologists for the overall glioma diagnosis task. Macro-averaged one-versus-rest operating points are plotted. There is no ROC curve because the prediction for the task is integrated based on the outcomes of multiple sub-tasks. b, Macro-averaged F1-score and accuracy of ROAM and pathologists (before and after AI assistance) for the overall glioma diagnosis task. The F1-score of mid-career 2 not being displayed implies that this pathologist made completely incorrect predictions for at least one category during the diagnosis.

Supplementary information

Supplementary Information

Supplementary Figs. 1–22, Tables 1–5 and Notes 1–6.

Reporting Summary

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jiang, R., Yin, X., Yang, P. et al. A transformer-based weakly supervised computational pathology method for clinical-grade diagnosis and molecular marker discovery of gliomas. Nat Mach Intell 6, 876–891 (2024). https://doi.org/10.1038/s42256-024-00868-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s42256-024-00868-w

Search

Quick links

Nature Briefing: Cancer

Sign up for the Nature Briefing: Cancer newsletter — what matters in cancer research, free to your inbox weekly.

Get what matters in cancer research, free to your inbox weekly. Sign up for Nature Briefing: Cancer