Deep learning in biomedicine

Wainberg, Michael; Merico, Daniele; Delong, Andrew; Frey, Brendan J

doi:10.1038/nbt.4233

Perspective
Published: 01 October 2018

Deep learning in biomedicine

Michael Wainberg^1,2,
Daniele Merico¹,
Andrew Delong¹ &
…
Brendan J Frey¹

Nature Biotechnology volume 36, pages 829–838 (2018)Cite this article

35k Accesses
352 Citations
140 Altmetric
Metrics details

Subjects

Abstract

Deep learning is beginning to impact biological research and biomedical applications as a result of its ability to integrate vast datasets, learn arbitrarily complex relationships and incorporate existing knowledge. Already, deep learning models can predict, with varying degrees of success, how genetic variation alters cellular processes involved in pathogenesis, which small molecules will modulate the activity of therapeutically relevant proteins, and whether radiographic images are indicative of disease. However, the flexibility of deep learning creates new challenges in guaranteeing the performance of deployed systems and in establishing trust with stakeholders, clinicians and regulators, who require a rationale for decision making. We argue that these challenges will be overcome using the same flexibility that created them; for example, by training deep models so that they can output a rationale for their predictions. Significant research in this direction will be needed to realize the full potential of deep learning in biomedicine.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: The rise of molecular biology, deep learning and data-driven biomedicine.**

**Figure 2: Supervised machine learning.**

**Figure 3: Key concepts in deep learning.**

Accurate structure prediction of biomolecular interactions with AlphaFold 3

Article 08 May 2024

Highly accurate protein structure prediction with AlphaFold

Article Open access 15 July 2021

Genome-wide association studies

Article 26 August 2021

References

Waldrop, M.M. Autonomous vehicles: no drivers required. Nature 518, 20–23 (2015).
Article CAS PubMed Google Scholar
Mnih, V. et al. Human-level control through deep reinforcement learning. Nature 518, 529–533 (2015).
Article CAS PubMed Google Scholar
Silver, D. et al. Mastering the game of Go without human knowledge. Nature 550, 354–359 (2017).
Article CAS PubMed Google Scholar
Gatys, L.A., Ecker, A.S. & Bethge, M. Image style transfer using convolutional neural networks. in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) https://doi.org/10.1109/CVPR.2016.265 (2016).
Graves, A., Mohamed, A.-R. & Hinton, G. Speech recognition with deep recurrent neural networks. in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing https://doi.org/10.1109/icassp.2013.6638947 (2013).
Sutskever, I., Vinyals, O. & Le, Q.V. Sequence to sequence learning with neural networks. in. Neural Information Processing Systems 2014, 3104–3112 (2014).
Google Scholar
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article CAS PubMed Google Scholar
Angermueller, C., Pärnamaa, T., Parts, L. & Stegle, O. Deep learning for computational biology. Mol. Syst. Biol. 12, 878 (2016).
Article PubMed PubMed Central Google Scholar
Leung, M.K.K., Andrew, D., Babak, A. & Frey, B.J. Machine learning in genomic medicine: a review of computational problems and data sets. Proc. IEEE 104, 176–197 (2016).
Article Google Scholar
Mamoshina, P., Vieira, A., Putin, E. & Zhavoronkov, A. Applications of deep learning in biomedicine. Mol. Pharm. 13, 1445–1454 (2016).
Article CAS PubMed Google Scholar
Min, S., Lee, B. & Yoon, S. Deep learning in bioinformatics. Brief. Bioinform. 18, 851–869 (2017).
PubMed Google Scholar
Gawehn, E., Hiss, J.A. & Schneider, G. Deep learning in drug discovery. Mol. Inform. 35, 3–14 (2016).
Article CAS PubMed Google Scholar
Jurtz, V.I. et al. An introduction to deep learning on biological sequence data: examples and solutions. Bioinformatics 33, 3685–3690 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ching, T. et al. Opportunities and obstacles for deep learning in biology and medicine. J. R. Soc. Interface 15, 20170387 (2018).
Article PubMed PubMed Central Google Scholar
Baldi, P. Deep learning in biomedical data science. Annu. Rev. Biomed. Data Sci. 1, 181–205 (2018).
Article Google Scholar
Ruder, S. An overview of multi-task learning in deep neural networks. Preprint at https://doi.org/arxiv.org/abs/1706.05098 (2017).
Liu, H., Simonyan, K., Vinyals, O., Fernando, C. & Kavukcuoglu, K. Hierarchical representations for efficient architecture search. Preprint at https://doi.org/arxiv.org/abs/1711.00436 (2017).
Weiss, K., Khoshgoftaar, T.M. & Wang, D. A survey of transfer learning. J. Big Data 3, 9 (2016).
Article Google Scholar
Krizhevsky, A., Sutskever, I. & Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 60, 84–90 (2017).
Article Google Scholar
Schuster, M. & Paliwal, K.K. Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45, 2673–2681 (1997).
Article Google Scholar
Hinton, G.E., Dayan, P., Frey, B.J. & Neal, R.M. The wake-sleep algorithm for unsupervised neural networks. Science 268, 7761831 (1995).
Article Google Scholar
Goodfellow, I.J. et al. Generative adversarial networks. Preprint at https://doi.org/arxiv.org/abs/1406.2661 (2014).
Tan, J., Ung, M., Cheng, C. & Greene, C.S. Unsupervised feature construction and knowledge extraction from genome-wide assays of breast cancer with denoising autoencoders. Pac. Symp. Biocomput. 2015, 132–143 (2015).
Google Scholar
Gómez-Bombarelli, R. et al. Automatic chemical design using a data-driven continuous representation of molecules. ACS Cent. Sci. 4, 268–276 (2018).
Article PubMed PubMed Central CAS Google Scholar
Kingma, D.P., Rezende, D.J., Mohamed, S. & Welling, M. Semi-supervised learning with deep generative models. Preprint at https://doi.org/arxiv.org/abs/1406.5298 (2014).
Pham, H., Guan, M.Y., Zoph, B., Le, Q.V. & Dean, J. Efficient neural architecture search via parameter sharing. Preprint at https://doi.org/arxiv.org/abs/1802.03268 (2018).
MacKay, D.J.C. A practical Bayesian framework for backpropagation networks. Neural Comput. 4, 448–472 (1992).
Article Google Scholar
Neal, R.M. Bayesian Learning for Neural Networks (Springer, Berlin and Heidelberg, Germany, 1996).
Gal, Y. & Ghahramani, Z. Dropout as a Bayesian approximation: representing model uncertainty in deep learning. Preprint at https://doi.org/arxiv.org/abs/1506.02142 (2015).
Efron, B. Bootstrap methods: another look at the jackknife. Ann. Stat. 7, 1–26 (1979).
Article Google Scholar
Xiong, H.Y. et al. RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease. Science 347, 1254806 (2015).
Article CAS PubMed Google Scholar
Lipton, Z.C. The mythos of model interpretability. Preprint at https://doi.org/arxiv.org/abs/1606.03490 (2016).
Pearl, J. Causal inference in statistics: an overview. Stat. Surv. 3, 96–146 (2009).
Article Google Scholar
Shrikumar, A., Greenside, P., Shcherbina, A. & Kundaje, A. Not just a black box: learning important features through propagating activation differences. Preprint at https://doi.org/arxiv.org/abs/1605.01713 (2016).
Hoskins, R.A. et al. Reports from CAGI: the critical assessment of genome interpretation. Hum. Mutat. 38, 1039–1041 (2017).
Article PubMed PubMed Central Google Scholar
Visscher, P.M. et al. 10 years of GWAS discovery: biology, function, and translation. Am. J. Hum. Genet. 101, 5–22 (2017).
Article CAS PubMed PubMed Central Google Scholar
Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 17, 405–424 (2015).
Article PubMed PubMed Central Google Scholar
Timpson, N.J., Greenwood, C.M.T., Soranzo, N., Lawson, D.J. & Richards, J.B. Genetic architecture: the shape of the genetic contribution to human traits and disease. Nat. Rev. Genet. 19, 110–124 (2018).
Article CAS PubMed Google Scholar
Boyle, E.A., Li, Y.I. & Pritchard, J.K. An expanded view of complex traits: from polygenic to omnigenic. Cell 169, 1177–1186 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wagih, O., Merico, D., Delong, A. & Frey, B.J. Allele-specific transcription factor binding as a benchmark for assessing variant impact predictors. Preprint at bioRxiv https://doi.org/10.1101/253427 (2018).
Alipanahi, B., Delong, A., Weirauch, M.T. & Frey, B.J. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat. Biotechnol. 33, 831–838 (2015).
CAS PubMed Google Scholar
Kelley, D.R., Snoek, J. & Rinn, J.L. Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks. Genome Res. 26, 990–999 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zhou, J. & Troyanskaya, O.G. Predicting effects of noncoding variants with deep learning-based sequence model. Nat. Methods 12, 931–934 (2015).
Article CAS PubMed PubMed Central Google Scholar
Quang, D. & Xie, X. DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences. Nucleic Acids Res. 44, e107 (2016).
Article PubMed PubMed Central CAS Google Scholar
Angermueller, C., Lee, H.J., Reik, W. & Stegle, O. DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning. Genome Biol. 18, 67 (2017).
Article PubMed PubMed Central CAS Google Scholar
Zhang, S., Hu, H., Jiang, T., Zhang, L. & Zeng, J. TITER: predicting translation initiation sites by deep learning. Bioinformatics 33, i234–i242 (2017).
Article CAS PubMed PubMed Central Google Scholar
Shendure, J. & Fields, S. Massively parallel genetics. Genetics 203, 617–619 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ma, J. et al. Using deep learning to model the hierarchical structure and function of a cell. Nat. Methods 15, 290–298 (2018).
Article CAS PubMed PubMed Central Google Scholar
Baldi, P., Brunak, S., Frasconi, P., Soda, G. & Pollastri, G. Exploiting the past and the future in protein secondary structure prediction. Bioinformatics 15, 937–946 (1999).
Article CAS PubMed Google Scholar
Pollastri, G., Przybylski, D., Rost, B. & Baldi, P. Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles. Proteins 47, 228–235 (2002).
Article CAS PubMed Google Scholar
Duvenaud, D. et al. Convolutional networks on graphs for learning molecular fingerprints. Preprint at https://doi.org/arxiv.org/abs/1509.09292 (2015).
Kearnes, S., McCloskey, K., Berndl, M., Pande, V. & Riley, P. Molecular graph convolutions: moving beyond fingerprints. J. Comput. Aided Mol. Des. 30, 595–608 (2016).
Article CAS PubMed PubMed Central Google Scholar
Dahl, G.E., Jaitly, N. & Salakhutdinov, R. Multi-task neural networks for QSAR predictions. Preprint at https://doi.org/arxiv.org/abs/1406.1231 (2014).
Ma, J., Sheridan, R.P., Liaw, A., Dahl, G.E. & Svetnik, V. Deep neural nets as a method for quantitative structure-activity relationships. J. Chem. Inf. Model. 55, 263–274 (2015).
Article CAS PubMed Google Scholar
Ramsundar, B. et al. Massively multitask networks for drug discovery. Preprint at https://doi.org/arxiv.org/abs/1502.02072 (2015).
Wallach, I., Dzamba, M. & Heifets, A. AtomNet: a deep convolutional neural network for bioactivity prediction in structure-based drug discovery. Preprint at https://doi.org/arxiv.org/abs/1510.02855 (2015).
Liu, Y. et al. Detecting cancer metastases on gigapixel pathology images. Preprint at https://doi.org/arxiv.org/abs/1703.02442 (2017).
Wang, D., Khosla, A., Gargeya, R., Irshad, H. & Beck, A.H. Deep learning for identifying metastatic breast cancer. Preprint at https://doi.org/arxiv.org/abs/1606.05718 (2016).
Lakhani, P. & Sundaram, B. Deep learning at chest radiography: automated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology 284, 574–582 (2017).
Article PubMed Google Scholar
Kraus, O.Z. et al. Automated analysis of high-content microscopy data with deep learning. Mol. Syst. Biol. 13, 924 (2017).
Article PubMed PubMed Central CAS Google Scholar
Carpenter, A.E. et al. CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 7, R100 (2006).
Article PubMed PubMed Central CAS Google Scholar
Bruno, M.A., Walker, E.A. & Abujudeh, H.H. Understanding and confronting our mistakes: the epidemiology of error in radiology and strategies for error reduction. Radiographics 35, 1668–1676 (2015).
Article PubMed Google Scholar
Leinonen, R., Sugawara, H. & Shumway, M. The Sequence Read Archive. Nucleic Acids Res. 39, D19–D21 (2011).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

Our perspectives were influenced by conversations with many people, including members of Deep Genomics, B. Andrews, Y. Bengio, B. Blencowe, C. Boone, D. Botstein, C. Francis, A. Heifets, G. Hinton, T. Hughes, P. Hutt, R. Klausner, E. Lander, Y. LeCun, A. Levin, Q. Morris, B. Neale, S. Scherer and J.C. Venter.

Author information

Authors and Affiliations

Deep Genomics Inc., MaRS Discovery District, Toronto, Ontario, Canada
Michael Wainberg, Daniele Merico, Andrew Delong & Brendan J Frey
Department of Computer Science, Stanford University, Stanford, California, USA
Michael Wainberg

Authors

Michael Wainberg
View author publications
You can also search for this author in PubMed Google Scholar
Daniele Merico
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Delong
View author publications
You can also search for this author in PubMed Google Scholar
Brendan J Frey
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Brendan J Frey.

Ethics declarations

Competing interests

All authors are, or recently were, employees of Deep Genomics, an AI therapeutics company, which is using deep learning to identify the genetic determinants of disease and to develop therapies.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wainberg, M., Merico, D., Delong, A. et al. Deep learning in biomedicine. Nat Biotechnol 36, 829–838 (2018). https://doi.org/10.1038/nbt.4233

Download citation

Received: 12 October 2017
Accepted: 01 August 2018
Published: 01 October 2018
Issue Date: October 2018
DOI: https://doi.org/10.1038/nbt.4233

This article is cited by

Prediction of anticancer drug sensitivity using an interpretable model guided by deep learning
- Weixiong Pang
- Ming Chen
- Yufang Qin
BMC Bioinformatics (2024)
ARGNet: using deep neural networks for robust identification and classification of antibiotic resistance genes from sequences
- Yao Pei
- Marcus Ho-Hin Shum
- Tommy Tsan-Yuk Lam
Microbiome (2024)
Enhancing urinalysis with smartphone and AI: a comprehensive review of point-of-care urinalysis and nutritional advice
- Haluk Çelik
- Balım Bengisu Caf
- Miray Tayfun
Chemical Papers (2024)
Deep doubly robust outcome weighted learning
- Xiaotong Jiang
- Xin Zhou
- Michael R. Kosorok
Machine Learning (2024)
Feature selection techniques for machine learning: a survey of more than two decades of research
- Dipti Theng
- Kishor K. Bhoyar
Knowledge and Information Systems (2024)