An explainable deep-learning algorithm for the detection of acute intracranial haemorrhage from small datasets

Lee, Hyunkwang; Yune, Sehyo; Mansouri, Mohammad; Kim, Myeongchan; Tajmir, Shahein H.; Guerrier, Claude E.; Ebert, Sarah A.; Pomerantz, Stuart R.; Romero, Javier M.; Kamalian, Shahmir; Gonzalez, Ramon G.; Lev, Michael H.; Do, Synho

doi:10.1038/s41551-018-0324-9

Article
Published: 17 December 2018

An explainable deep-learning algorithm for the detection of acute intracranial haemorrhage from small datasets

Hyunkwang Lee^1,2^na1,
Sehyo Yune ORCID: orcid.org/0000-0002-9223-3586¹^na1,
Mohammad Mansouri¹,
Myeongchan Kim¹,
Shahein H. Tajmir¹,
Claude E. Guerrier¹,
Sarah A. Ebert¹,
Stuart R. Pomerantz¹,
Javier M. Romero¹,
Shahmir Kamalian¹,
Ramon G. Gonzalez¹,
Michael H. Lev¹ &
…
Synho Do ORCID: orcid.org/0000-0001-6211-7050¹

Nature Biomedical Engineering volume 3, pages 173–182 (2019)Cite this article

9308 Accesses
286 Citations
176 Altmetric
Metrics details

Subjects

Abstract

Owing to improvements in image recognition via deep learning, machine-learning algorithms could eventually be applied to automated medical diagnoses that can guide clinical decision-making. However, these algorithms remain a ‘black box’ in terms of how they generate the predictions from the input data. Also, high-performance deep learning requires large, high-quality training datasets. Here, we report the development of an understandable deep-learning system that detects acute intracranial haemorrhage (ICH) and classifies five ICH subtypes from unenhanced head computed-tomography scans. By using a dataset of only 904 cases for algorithm training, the system achieved a performance similar to that of expert radiologists in two independent test datasets containing 200 cases (sensitivity of 98% and specificity of 95%) and 196 cases (sensitivity of 92% and specificity of 95%). The system includes an attention map and a prediction basis retrieved from training data to enhance explainability, and an iterative process that mimics the workflow of radiologists. Our approach to algorithm development can facilitate the development of deep-learning systems for a variety of clinical applications and accelerate their adoption into clinical practice.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 2: Summary of the system outputs.**

**Fig. 3: Iterative performance improvement by network optimization and preprocessing.**

**Fig. 4: Test performance for ICH detection.**

**Fig. 5: Examples of ICH atlas and prediction basis.**

Deep learning based automatic detection algorithm for acute intracranial haemorrhage: a pivotal randomized clinical trial

Article Open access 07 April 2023

Detection and classification of intracranial haemorrhage on CT images using a novel deep-learning algorithm

Article Open access 25 November 2020

A joint convolutional-recurrent neural network with an attention mechanism for detecting intracranial hemorrhage on noncontrast head CT

Article Open access 08 February 2022

Data availability

The training, validation and test datasets generated for this study are protected patient information. Some data may be available for research purposes from the corresponding author upon reasonable request.

References

Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118 (2017).
Article CAS Google Scholar
Gulshan, V. et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316, 2402–2410 (2016).
Article Google Scholar
Rajpurkar, P. et al. Chexnet: Radiologist-level pneumonia detection on chest X-rays with deep learning. Preprint at https://arxiv.org/abs/1711.05225 (2017).
Castelvecchi, D. Can we open the black box of AI? Nature 538, 20–23 (2016).
Article CAS Google Scholar
Clinical and Patient Decision Support Software. Draft Guidance for Industry and Food and Drug Administration Staff (US FDA, 2017).
Deng, J. et al. Imagenet: a large-scale hierarchical image database. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 248–255 (IEEE, 2009).
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. Preprint at https://arxiv.org/abs/1409.1556 (2014).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 770–778 (IEEE, 2016).
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J. & Wojna, Z. Rethinking the inception architecture for computer vision. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 2818–2826 (IEEE, 2016).
Szegedy, C., Ioffe, S., Vanhoucke, V. & Alemi, A. A. Inception-v4, inception-ResNet and the impact of residual connections on learning. In Proc. 31st AAAI Conference on Artificial Intelligence 4278–4284 (AAAI, 2017).
Wang, X. et al. ChestX-ray8: hospital-scale chest X-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 3462–3471 (IEEE, 2017).
Sozykin, K., Khan, A. M., Protasov, S. & Hussain, R. Multi-label class-imbalanced action recognition in hockey videos via 3D convolutional neural networks. Preprint at https://arxiv.org/abs/1709.01421 (2017).
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A. & Torralba, A. Learning deep features for discriminative localization. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 2921–2929 (IEEE, 2016).
Selvaraju, R. R. et al. Grad-cam: visual explanations from deep networks via gradient-based localization. Preprint at https://arxiv.org/abs/1610.02391v3 (2016).
Lazer, D., Kennedy, R., King, G. & Vespignani, A. The parable of Google Flu: traps in big data analysis. Science 343, 1203–1205 (2014).
Article CAS Google Scholar
Chilamkurthy, S. et al. Deep learning algorithms for detection of critical findings in head CT scans: a retrospective study. Lancet https://doi.org/10.1016/S0140-6736(18)31645-3 (2018).
Grewal, M., Srivastava, M. M., Kumar, P. & Varadarajan, S. RADnet: Radiologist level accuracy using deep learning for hemorrhage detection in CT scans. In IEEE International Symposium on Biomedical Imaging 281–284 (IEEE, 2018).
Desai, V., Flanders, A. E. & Lakhani, P. Application of deep learning in neuroradiology: automated detection of basal ganglia hemorrhage using 2D-convolutional neural networks. Preprint at https://arxiv.org/abs/1710.03823 (2017).
Phong, T. D. et al. Brain hemorrhage diagnosis by using deep learning. In Proc. 2017 International Conference on Machine Learning and Soft Computing 34–39 (ACM, 2017).
Prevedello, L. M. et al. Automated critical test Findings identification and online notification system using artificial ntelligence in imaging. Radiology 285, 923–931 (2017).
Article Google Scholar
Arbabshirani, M. R. et al. Advanced machine learning in action: identification of intracranial hemorrhage on computed tomography scans of the head with clinical workflow integration. npj Digit. Med. 1, 9 (2018).
Article Google Scholar
Rubin, J. et al. Large scale automated reading of frontal and lateral chest X-rays using dual convolutional neural networks. Preprint at https://arxiv.org/abs/1804.07839 (2018).
Lakhani, P. & Sundaram, B. Deep learning at chest radiography: automated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology 284, 574–582 (2017).
Article Google Scholar
Domingos, P. A few useful things to know about machine learning. Commun. ACM 55, 78–87 (2012).
Article Google Scholar
Poplin, R. et al. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nat. Biomed. Eng. 2, 158 (2018).
Article Google Scholar
Brinjikji, W. et al. Inter- and intraobserver agreement in CT characterization of nonaneurysmal perimesencephalic subarachnoid hemorrhage. AJNR Am. J. Neuroradiol. 31, 1103–1105 (2010).
Article CAS Google Scholar
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A. & Torralba, A. Object detectors emerge in deep scene cnns. Preprint at https://arxiv.org/abs/1412.6856 (2014).
Yosinski, J., Clune, J., Nguyen, A., Fuchs, T. & Lipson, H. Understanding neural networks through deep visualization. Preprint at https://arxiv.org/abs/1506.06579 (2015).
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436 (2015).
Article CAS Google Scholar
Tsoumakas, G. & Katakis, I. Multi-label classification: an overview. Int. J. Data Warehousing Mining 3, 1–13 (2007).
Article Google Scholar
Russakovsky, O. et al. Imagenet large scale visual recognition challenge. Int. J. Comp. Vision 115, 211–252 (2015).
Article Google Scholar
Chollet, F. et al. Keras (2015); http://keras.io
Lin, M., Chen, Q. & Yan, S. Network in network. Preprint at https://arxiv.org/abs/1312.4400 (2013).
Nesterov, Y. A method of solving the convex programming problem with convergence rate O(1/k ²⁾. Dokl. Akad. Nauk USSR 269, 543–547 (1983).
Google Scholar
Abadi, M. et al. Tensorflow: a system for large-scale machine learning. Proc. 12th USENIX Symposium on Operating Systems Design and Implementation 16, 265–283 (2016).
Google Scholar
King, G. & Zeng, L. Logistic regression in rare events data. Political Anal. 9, 137–163 (2001).
Article Google Scholar
Kimpe, T. & Tuytschaever, T. Increasing the number of gray shades in medical display systems—how much is enough? J. Digit. Imaging 20, 422–432 (2007).
Article Google Scholar
Xue, Z., Antani, S., Long, L. R., Demner-Fushman, D. & Thoma, G. R. Window classification of brain CT images in biomedical articles. In AMIA Annual Symposium Proceedings 1023 (American Medical Informatics Association, 2012).
Turner, P. & Holdsworth, G. CT stroke window settings: an unfortunate misleading misnomer? Br. J. Radiol. 84, 1061–1066 (2011).
Article CAS Google Scholar
Ju, C., Bibaut, A. & van der Laan, A. The relative performance of ensemble methods with deep convolutional neural networks for image classification. J. Appl. Stat. 45, 2800–2818 (2018).
Article Google Scholar
Nair, V. & Hinton, G. E. Rectified linear units improve restricted Boltzmann machines. In Proc. 27th International Conference on Machine Learning 807–814 (ICML, 2010).
Davis, J. & Goadrich, M. The relationship between precision-recall and ROC curves. Proc. 23rd International Conference on Machine Learning 233–240 (ACM, 2006).

Download references

Acknowledgements

The authors would like to acknowledge NVIDIA for the use of a DevBox and providing feedback and support, which made this work possible. R.G.G. is funded in part by an NIH U01 grant under the grant number 5U01EB025153.

Author information

These authors contributed equally: Hyunkwang Lee, Sehyo Yune.

Authors and Affiliations

Department of Radiology, Massachusetts General Hospital, Boston, MA, USA
Hyunkwang Lee, Sehyo Yune, Mohammad Mansouri, Myeongchan Kim, Shahein H. Tajmir, Claude E. Guerrier, Sarah A. Ebert, Stuart R. Pomerantz, Javier M. Romero, Shahmir Kamalian, Ramon G. Gonzalez, Michael H. Lev & Synho Do
John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, USA
Hyunkwang Lee

Authors

Hyunkwang Lee
View author publications
You can also search for this author in PubMed Google Scholar
Sehyo Yune
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Mansouri
View author publications
You can also search for this author in PubMed Google Scholar
Myeongchan Kim
View author publications
You can also search for this author in PubMed Google Scholar
Shahein H. Tajmir
View author publications
You can also search for this author in PubMed Google Scholar
Claude E. Guerrier
View author publications
You can also search for this author in PubMed Google Scholar
Sarah A. Ebert
View author publications
You can also search for this author in PubMed Google Scholar
Stuart R. Pomerantz
View author publications
You can also search for this author in PubMed Google Scholar
Javier M. Romero
View author publications
You can also search for this author in PubMed Google Scholar
Shahmir Kamalian
View author publications
You can also search for this author in PubMed Google Scholar
Ramon G. Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Michael H. Lev
View author publications
You can also search for this author in PubMed Google Scholar
Synho Do
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.L., S.Y., M.M., R.G.G., M.H.L. and S.D. initiated and designed the research. H.L., S.Y. and M.K. executed the research. M.M., S.H.T., C.E.G., S.A.E., S.R.P., J.M.R., S.K., R.G.G. and M.H.L. acquired and/or interpreted the data. R.G.G. and M.H.L. supervised the data collection. H.L., S.Y., M.H.L. and S.D. analysed and interpreted the data. H.L. and M.K. developed the algorithms and software tools necessary for the experiments. H.L., S.Y., S.H.T. and M.H.L. wrote the manuscript.

Corresponding author

Correspondence to Synho Do.

Ethics declarations

Competing interests

M.H.L. is a consultant of GE Healthcare and Takeda Pharmaceutical Company and receives an institutional research support from Siemens Healthcare. S.R.P. is a consultant of GE Healthcare. S.D. is a consultant of Nulogix and Doai and receives research supports from ZCAI, Tplus and MediBloc. The remaining authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Tables 1–2 and Supplementary Figures 1–7

Reporting Summary

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lee, H., Yune, S., Mansouri, M. et al. An explainable deep-learning algorithm for the detection of acute intracranial haemorrhage from small datasets. Nat Biomed Eng 3, 173–182 (2019). https://doi.org/10.1038/s41551-018-0324-9

Download citation

Received: 14 March 2018
Accepted: 12 November 2018
Published: 17 December 2018
Issue Date: March 2019
DOI: https://doi.org/10.1038/s41551-018-0324-9

This article is cited by

Uncertainty-aware deep-learning model for prediction of supratentorial hematoma expansion from admission non-contrast head computed tomography scan
- Anh T. Tran
- Tal Zeevi
- Seyedmehdi Payabvash
npj Digital Medicine (2024)
Effects of Explanation Strategy and Autonomy of Explainable AI on Human–AI Collaborative Decision-making
- Bingcheng Wang
- Tianyi Yuan
- Pei-Luen Patrick Rau
International Journal of Social Robotics (2024)
Evaluation of techniques to improve a deep learning algorithm for the automatic detection of intracranial haemorrhage on CT head imaging
- Melissa Yeo
- Bahman Tahayori
- Hamed Asadi
European Radiology Experimental (2023)
Diagnostic test accuracy of machine learning algorithms for the detection intracranial hemorrhage: a systematic review and meta-analysis study
- Masoud Maghami
- Shahab Aldin Sattari
- Kiarash Shirbandi
BioMedical Engineering OnLine (2023)
Deep learning based automatic detection algorithm for acute intracranial haemorrhage: a pivotal randomized clinical trial
- Tae Jin Yun
- Jin Wook Choi
- In Pyeong Hwang
npj Digital Medicine (2023)