Pathologist-level interpretable whole-slide cancer diagnosis with deep learning

Zhang, Zizhao; Chen, Pingjun; McGough, Mason; Xing, Fuyong; Wang, Chunbao; Bui, Marilyn; Xie, Yuanpu; Sapkota, Manish; Cui, Lei; Dhillon, Jasreman; Ahmad, Nazeel; Khalil, Farah K.; Dickinson, Shohreh I.; Shi, Xiaoshuang; Liu, Fujun; Su, Hai; Cai, Jinzheng; Yang, Lin

doi:10.1038/s42256-019-0052-1

Article
Published: 13 May 2019

Pathologist-level interpretable whole-slide cancer diagnosis with deep learning

Zizhao Zhang¹,
Pingjun Chen ORCID: orcid.org/0000-0003-0528-1713²,
Mason McGough²,
Fuyong Xing³,
Chunbao Wang⁴,
Marilyn Bui⁵,
Yuanpu Xie²,
Manish Sapkota⁶,
Lei Cui²,
Jasreman Dhillon⁵,
Nazeel Ahmad⁷,
Farah K. Khalil⁵,
Shohreh I. Dickinson⁵,
Xiaoshuang Shi²,
Fujun Liu⁶,
Hai Su²,
Jinzheng Cai² &
…
Lin Yang²

Nature Machine Intelligence volume 1, pages 236–245 (2019)Cite this article

17k Accesses
170 Citations
65 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 17 July 2019

A Publisher Correction to this article was published on 17 May 2019

This article has been updated

Abstract

Diagnostic pathology is the foundation and gold standard for identifying carcinomas. However, high inter-observer variability substantially affects productivity in routine pathology and is especially ubiquitous in diagnostician-deficient medical centres. Despite rapid growth in computer-aided diagnosis (CAD), the application of whole-slide pathology diagnosis remains impractical. Here, we present a novel pathology whole-slide diagnosis method, powered by artificial intelligence, to address the lack of interpretable diagnosis. The proposed method masters the ability to automate the human-like diagnostic reasoning process and translate gigapixels directly to a series of interpretable predictions, providing second opinions and thereby encouraging consensus in clinics. Moreover, using 913 collected examples of whole-slide data representing patients with bladder cancer, we show that our method matches the performance of 17 pathologists in the diagnosis of urothelial carcinoma. We believe that our method provides an innovative and reliable means for making diagnostic suggestions and can be deployed at low cost as next-generation, artificial intelligence-enhanced CAD technology for use in diagnostic pathology.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 2: Data preparation, organized in four data sets.**

**Fig. 3: Results for the whole-slide diagnosis.**

**Fig. 4: Visualization of interpretable predictions of the method.**

**Fig. 5: Visualization of more interpretable predictions of the method.**

**Fig. 6: Evaluation of the network components.**

**Fig. 7: Text-to-image retrieval results.**

Prediction of tumor origin in cancers of unknown primary origin with cytology-based deep learning

Article Open access 16 April 2024

Fei Tian, Dong Liu, … Xiangchun Li

Segment anything in medical images

Article Open access 22 January 2024

Jun Ma, Yuting He, … Bo Wang

Towards a general-purpose foundation model for computational pathology

Article 19 March 2024

Richard J. Chen, Tong Ding, … Faisal Mahmood

Data availability

The data that support the findings of this study are available from Figshare: https://figshare.com/projects/nmi-wsi-diagnosis/61973.

Code availability

Source code are available from the Github repository: https://github.com/zizhaozhang/nmi-wsi-diagnosis.

Change history

17 July 2019
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
17 May 2019
An amendment to this paper has been published and can be accessed via a link at the top of the paper

References

Brimo, F., Schultz, L. & Epstein, J. I. The value of mandatory second opinion pathology review of prostate needle biopsy interpretation before radical prostatectomy. J. Urol. 184, 126–130 (2010).
Article Google Scholar
Elmore, J. G. et al. Diagnostic concordance among pathologists interpreting breast biopsy specimens. JAMA 313, 1122–1132 (2015).
Article Google Scholar
Djuric, U., Zadeh, G., Aldape, K. & Diamandis, P. Precision histology: how deep learning is poised to revitalize histomorphology for personalized cancer care. npj Precis. Oncol. 1, 22 (2017).
Article Google Scholar
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article Google Scholar
Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118 (2017).
Article Google Scholar
Poplin, R. et al. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nat. Biomed. Eng. 2, 158–164 (2018).
Article Google Scholar
Bejnordi, B. E. et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA 318, 2199–2210 (2017).
Article Google Scholar
Litjens, G. et al. Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis. Sci. Rep. 6, 26286 (2016).
Article Google Scholar
Araújo, T. et al. Classification of breast cancer histology images using convolutional neural networks. PloS ONE 12, e0177544 (2017).
Article Google Scholar
Xu, Y. et al. Large scale tissue histopathology image classification, segmentation, and visualization via deep convolutional activation features. BMC Bioinformatics 18, 281 (2017).
Article Google Scholar
Yoshida, H. et al. Automated histological classification of whole slide images of colorectal biopsy specimens. Oncotarget 8, 90719 (2017).
Google Scholar
Han, Z. et al. Breast cancer multi-classification from histopathological images with structured deep learning model. Sci. Rep. 7, 4172 (2017).
Article Google Scholar
Hou, L. et al. Patch-based convolutional neural network for whole slide tissue image classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2424–2433 (IEEE, 2016).
Holzinger, A., Biemann, C., Pattichis, C. S. & Kell, D. B. What do we need to build explainable AI systems for the medical domain? Preprint at https://arxiv.org/abs/1712.09923 (2017).
Lipton, Z. C. The mythos of model interpretability. Queue. 16, 30 (2018).
Google Scholar
Pasin, E., Josephson, D. Y., Mitra, A. P., Cote, R. J. & Stein, J. P. Superficial bladder cancer: an update on etiology, molecular development, classification, and natural history. Rev. Urol. 10, 31–43 (2008).
Google Scholar
Zhou, M. & Magi-Galluzzi, C. Genitourinary Pathology (Foundations in Diagnostic Pathology, Saunders, 2015).
Humphrey, P. A., Moch, H., Cubilla, A. L., Ulbright, T. M. & Reuter, V. E. The 2016 WHO classification of tumours of the urinary system and male genital organs—Part B: Prostate and bladder tumours. Eur. Urol. 70, 106–119 (2016).
Article Google Scholar
Papineni, K., Roukos, S., Ward, T. & Zhu, W.-J. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics 311–318 (Association for Computational Linguistics, 2002).
Vedantam, R., Lawrence Zitnick, C. & Parikh, D. CIDEr: Consensus-based Image Description Evaluation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 4566–4575 (IEEE, 2015).
Karpathy, A. & Fei-Fei, L. Deep visual–semantic alignments for generating image descriptions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 3128–3137 (IEEE, 2015).
Maaten, Lvd & Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
MATH Google Scholar
Miyamoto, H. et al. Non-invasive papillary urothelial neoplasms: the 2004 WHO/ISUP classification system. Pathol. Int. 60, 1–8 (2010).
Article Google Scholar
Ronneberger, O., Fischer, P. & Brox, T. U-net: convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-assisted Intervention 234–241 (Springer, 2015).
Xu, K. et al. Show, attend and tell: neural image caption generation with visual attention. In International Conference on Machine Learning, 2048–2057 (JMLR, 2015).
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J. & Wojna, Z. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2818–2826 (IEEE, 2016).
Deng, J. et al. Imagenet: a large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 248–255 (IEEE, 2009).
Krause, J., Johnson, J., Krishna, R. & Fei-Fei, L. A hierarchical approach for generating descriptive image paragraphs. Preprint at https://arxiv.org/abs/1611.06607 (2016).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Article Google Scholar
Bahdanau, D., Cho, K. & Bengio, Y. Neural machine translation by jointly learning to align and translate. Preprint at https://arxiv.org/abs/1409.0473 (2016).
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A. & Torralba, A. Learning deep features for discriminative localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2921–2929 (IEEE, 2016).
Abadi, M. et al. Tensorflow: a system for large-scale machine learning. In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation Vol. 16 265–283 (USENIX Association, 2016).

Download references

Acknowledgements

The authors thank the Department of Pathology, University of Florida (UF), and UF Health Shands Hospital for support with data collection. The authors also thank members of the Moffitt Cancer Center and the Department of Pathology, the First Affiliated Hospital of Xi’an Jiaotong University, for their participation in this research, and thank all participating pathologists for their valuable suggestions and active involvement. Thanks also go to Y. Cai for assistance with figure production. The research reported in this publication was supported by the National Institute of Arthritis and Musculoskeletal and Skin Diseases of the National Institutes of Health under award no. 5R01AR065479-05. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

Authors and Affiliations

Department of Computer Information Science Engineering, University of Florida, Gainesville, FL, USA
Zizhao Zhang
J. Crayton Pruitt Family Department of Biomedical Engineering, University of Florida, Gainesville, FL, USA
Pingjun Chen, Mason McGough, Yuanpu Xie, Lei Cui, Xiaoshuang Shi, Hai Su, Jinzheng Cai & Lin Yang
Department of Biostatistics and Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
Fuyong Xing
Department of Pathology, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, China
Chunbao Wang
H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
Marilyn Bui, Jasreman Dhillon, Farah K. Khalil & Shohreh I. Dickinson
Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL, USA
Manish Sapkota & Fujun Liu
James A. Haley Veterans’ Hospital, Tampa, FL, USA
Nazeel Ahmad

Authors

Zizhao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Pingjun Chen
View author publications
You can also search for this author in PubMed Google Scholar
Mason McGough
View author publications
You can also search for this author in PubMed Google Scholar
Fuyong Xing
View author publications
You can also search for this author in PubMed Google Scholar
Chunbao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Marilyn Bui
View author publications
You can also search for this author in PubMed Google Scholar
Yuanpu Xie
View author publications
You can also search for this author in PubMed Google Scholar
Manish Sapkota
View author publications
You can also search for this author in PubMed Google Scholar
Lei Cui
View author publications
You can also search for this author in PubMed Google Scholar
Jasreman Dhillon
View author publications
You can also search for this author in PubMed Google Scholar
Nazeel Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Farah K. Khalil
View author publications
You can also search for this author in PubMed Google Scholar
Shohreh I. Dickinson
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoshuang Shi
View author publications
You can also search for this author in PubMed Google Scholar
Fujun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Hai Su
View author publications
You can also search for this author in PubMed Google Scholar
Jinzheng Cai
View author publications
You can also search for this author in PubMed Google Scholar
Lin Yang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.Z. led the development and evaluation. Z.Z., C.W. and L.Y. designed the research. Z.Z. implemented the algorithm. Z.Z., P.C., M.M. and M.S. collected and cleaned the data and developed the annotation software. L.Y. and M.B. recruited pathologists for annotation and machine–human comparison. L.C. and P.C. managed the machine–human competition. J.D., N.A., F.K.K. and S.I.D. participated in the competition. Z.Z. wrote the manuscript. M.M., F.X., Y.X., X.S., F.L., H.S. and J.C. provided valuable comments on the algorithm design and the manuscript.

Corresponding author

Correspondence to Lin Yang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting Summary

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, Z., Chen, P., McGough, M. et al. Pathologist-level interpretable whole-slide cancer diagnosis with deep learning. Nat Mach Intell 1, 236–245 (2019). https://doi.org/10.1038/s42256-019-0052-1

Download citation

Received: 02 January 2019
Accepted: 05 April 2019
Published: 13 May 2019
Issue Date: May 2019
DOI: https://doi.org/10.1038/s42256-019-0052-1

This article is cited by

Artificial intelligence applications in histopathology
- Cagla Deniz Bahadir
- Mohamed Omar
- Mert R. Sabuncu
Nature Reviews Electrical Engineering (2024)
Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images
- Yusra A. Ameen
- Dalia M. Badary
- Adel A. Sewisy
BMC Bioinformatics (2023)
Hierarchical AI enables global interpretation of culture plates in the era of digital microbiology
- Alberto Signoroni
- Alessandro Ferrari
- Karissa Culbreath
Nature Communications (2023)
Colorectal cancer lymph node metastasis prediction with weakly supervised transformer-based multi-instance learning
- Luxin Tan
- Huan Li
- Zhongwu Li
Medical & Biological Engineering & Computing (2023)
Survey on Explainable AI: From Approaches, Limitations and Applications Aspects
- Wenli Yang
- Yuchen Wei
- Byeong Kang
Human-Centric Intelligent Systems (2023)