In this Comment, we provide guidelines for reinforcement learning for decisions about patient treatment that we hope will accelerate the rate at which observational cohorts can inform healthcare practice in a safe, risk-conscious manner.
This is a preview of subscription content, access via your institution
Relevant articles
Open Access articles citing this article.
-
An interpretable RL framework for pre-deployment modeling in ICU hypotension management
npj Digital Medicine Open Access 18 November 2022
-
The Health Gym: synthetic health-related datasets for the development of reinforcement learning algorithms
Scientific Data Open Access 11 November 2022
-
The role of machine learning in clinical research: transforming the future of evidence generation
Trials Open Access 16 August 2021
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 per month
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
$189.00 per year
only $15.75 per issue
Rent or buy this article
Get just this article for as long as you need it
$39.95
Prices may be subject to local taxes which are calculated during checkout

Debbie Maizels/Springer Nature

Debbie Maizels/Springer Nature
References
Obermeyer, Z. & Emanuel, E. J. N. Engl. J. Med. 375, 1216 (2016).
Parbhoo, S., Bogojeska, J., Zazzi, M., Roth, V. & Doshi-Velez, F. AMIA Summits on Translational Science Proceedings 2017, 239 (2017).
Guez, A., Vincent, R. D., Avoli, M. & Pineau, J. Treatment of epilepsy via batch-mode reinforcement learning. In Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence 1671–1678 (AAAI, 2008).
Komorowski, M., Celi, L. A., Badawi, O., Gordon, A. & Faisal, A. Nat. Med. 24, 1716–1720 (2018).
Chakraborty, B., Moodie, E. & Erica, E. M. Statistical Methods for Dynamic Treatment Regimes (Springer, New York, 2013).
Simpson, N., Lamontagne, F. & Shankar-Hari, M. Curr Opin Crit Care. 23, 561–566 (2017).
Johansson, F., Shalit, U. & Sontag, D. Learning representations for counterfactual inference. In Proceedings of the 33th International Conference on Machine Learning (ICML, 2016).
Precup, D., Sutton, R. S. & Singh, S. P. Eligibility traces for off-policy policy evaluation. In Proceedings of the Seventeenth International Conference on Machine Learning 759–766 (ICML, 2000).
Gottesman, O. et al. Evaluating Reinforcement Learning Algorithms in Observational Health Settings. Preprint at https://arxiv.org/abs/1805.12298 (2018).
Doshi-Velez, F. & Kim, B. Towards a rigorous science of interpretable machine learning. Preprint at https://arxiv.org/abs/1702.08608 (2017).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
A.A.F. has received funding from Fresenius-KABI in the past.
Rights and permissions
About this article
Cite this article
Gottesman, O., Johansson, F., Komorowski, M. et al. Guidelines for reinforcement learning in healthcare. Nat Med 25, 16–18 (2019). https://doi.org/10.1038/s41591-018-0310-5
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41591-018-0310-5
This article is cited by
-
Dynamic stock-decision ensemble strategy based on deep reinforcement learning
Applied Intelligence (2023)
-
An interpretable RL framework for pre-deployment modeling in ICU hypotension management
npj Digital Medicine (2022)
-
The Health Gym: synthetic health-related datasets for the development of reinforcement learning algorithms
Scientific Data (2022)
-
Artificial intelligence-enabled decision support in nephrology
Nature Reviews Nephrology (2022)
-
The role of machine learning in clinical research: transforming the future of evidence generation
Trials (2021)