The past few years have seen rapid growth of AI in medicine; however, few algorithms have been deployed in clinical practice1. We view this disconnect between hype and reality as stemming from two main barriers: first, the lack of a common language between AI and medicine, and second, the rapid progress in AI outpacing the comparatively slow adaptation of regulation, forcing regulatory bodies to apply measures that do not always consider the paradigm-shifting capabilities of contemporary AI. We propose regulatory science, with its terms and concepts, as a solution to both problems because it represents a high-level language that can serve as a unifying force for the realization of medical AI (Fig. 1).

Fig. 1: Regulatory science and AI in medicine.

The application of AI in medicine aims to benefit patients. The disciplines of artificial intelligence (AI; a branch of computer science) and medicine coexist without a shared interdisciplinary language that enables expedient risk and benefit assessments. Regulatory science is characterized by specific and intentionally interdisciplinary language that considers multiple vantage points. It is one proven approach that uses scientific data to evaluate and challenge current regulatory paradigms and inform future regulation.

Regulatory science is the scientific discipline that evaluates and challenges current regulation, benefit vs. risk assessments, and submission/approval strategies2. It is the application of the scientific method to enable evidence-based improvements of regulation, and just as new scientific evidence can be powerful enough to change the paradigm of a field of study, so too can it change regulatory paradigms.

Fundamentally, regulatory science is about creating a dialogue for launching new ideas and determining how best to allow those ideas to interact with society: not only from within regulatory authorities but also through collaborations between academics, clinicians, industry, payors, policy experts, and patients. Like any scientific discipline, regulatory science comes with a specific language, but given its core translational nature, its language is intentionally interdisciplinary to enable deep collaborations. Its terms and concepts traverse specific use cases and provide a contextual vocabulary that enables clear communication beyond any single use case or medical subspecialty (Supplementary Table 1). In other words, regulatory language is unifying.

For example, one challenge we have personally encountered (and have witnessed frequently among others) is clearly communicating the specific task of medical AI in a way that is mutually intelligible to medical and AI experts. Medical education opens one’s eyes to the enormously complex systems that have evolved for treating patients despite our incomplete understanding of biology. The inherent subjectivity and guesswork in medicine can be appalling to AI experts more used to dealing with systems that are, at least in theory, rationally designed and better understood. Given the interconnectedness and subjectivity inherent in essentially all interactions a patient has with the healthcare system, defining the boundaries of a problem where AI could provide a solution becomes an issue in and of itself. For example, subtle changes in diagnosis can lead to large changes in management. These subtleties are accounted for in the evolving and continuously updated definitions that make up the language of regulatory science. Terminology from regulatory science such as intended use (“what”), indications for use (“who and why”), or instructions for use (“how”) can help both sides communicate precisely about the scope of the problem at hand and how to center the patient in this discussion (Fig. 2).

Fig. 2: Selected regulatory science concepts.

The infographic depicts five regulatory concepts, each alongside a brief explanation. Detailing these aspects provides a reasonable starting point for describing the function of a medical AI algorithm and the value of regulatory concepts for streamlining interdisciplinary communication.
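To illustrate how this vocabulary can anchor an interdisciplinary conversation, the sketch below encodes the three concepts as machine-readable metadata attached to a model, in the spirit of a model card. This is a minimal sketch in Python; the RegulatoryProfile class and the example values are our own hypothetical illustration, not an established schema or an actual device description.

```python
from dataclasses import dataclass

@dataclass
class RegulatoryProfile:
    """Regulatory-science vocabulary attached to a medical AI model."""
    intended_use: str          # "what": the task the device performs
    indications_for_use: str   # "who and why": target population and purpose
    instructions_for_use: str  # "how": conditions for safe, effective operation

# Hypothetical, illustrative values; not drawn from any regulatory submission.
profile = RegulatoryProfile(
    intended_use="Detect referable diabetic retinopathy in retinal fundus images.",
    indications_for_use=(
        "Adults with diabetes undergoing routine screening in primary care, "
        "as a triage aid rather than a definitive diagnosis."
    ),
    instructions_for_use=(
        "Apply only to images meeting the stated quality criteria, acquired "
        "under adequate lighting; route ungradable images to an ophthalmologist."
    ),
)
print(profile.intended_use)
```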

Centering benefit to the patient is the goal of effective regulation, but the prevailing regulatory paradigms have not been optimized for AI in medicine. By and large, they have been adapted through continuous iteration to best review and approve drugs, medical devices, or software as a medical device, all of which differ fundamentally from AI, especially when algorithms continuously evolve. A burgeoning body of research has shown that AI algorithms can fail in non-trivial ways, from poor generalization due to dataset shift, to overfitting to confounders, to unexpected failure modes3.

These challenges must be addressed before AI can be used safely in clinical practice. Thankfully, similar barriers have been overcome in other domains of medicine, and their solutions have been codified into regulation. For example, there is growing recognition that ongoing performance assessment of a deployed AI model is key to combating dataset shift, a concept that mirrors the continued monitoring and post-market surveillance required by the FDA. Numerous regulatory resources (Supplementary Table 1)4 address software, medical AI, and algorithm modifications5,6,7,8. Much additional work is needed, however: the prevailing FDA regulations (Supplementary Table 1) and ISO governance approaches (Supplementary Table 2) are dispersed across more than 25 guidance documents2 and standards, respectively.
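As a concrete sketch of what such post-market monitoring can look like, the snippet below compares the distribution of a model’s output scores in deployment against a locked validation sample and flags a potential dataset shift. It is a minimal illustration under simplifying assumptions (batched score arrays, an alert threshold alpha fixed during validation); a real surveillance program would track many more signals, including inputs, subgroups, and clinical outcomes.

```python
import numpy as np
from scipy.stats import ks_2samp

def detect_score_drift(reference_scores, deployed_scores, alpha=0.01):
    """Flag potential dataset shift with a two-sample Kolmogorov-Smirnov test.

    reference_scores: model outputs on the locked pre-market validation set.
    deployed_scores:  model outputs collected during post-market use.
    alpha:            alert threshold; assumed to be fixed during validation.
    """
    statistic, p_value = ks_2samp(reference_scores, deployed_scores)
    return {"ks_statistic": statistic, "p_value": p_value, "drift": p_value < alpha}

# Simulated example: deployment scores drift away from the validation distribution.
rng = np.random.default_rng(0)
reference = rng.beta(2.0, 5.0, size=5000)  # score distribution at validation time
deployed = rng.beta(2.0, 4.0, size=5000)   # subtly shifted distribution in the field
print(detect_score_drift(reference, deployed))
```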

One key question is whether applying regulatory paradigms can supplement the more traditional strengths/weaknesses approach pursued in research. We have reconstructed examples where the addition of regulatory principles resulted in documented improvements (Supplementary Table 3). Briefly, IBM Watson Content Analytics had a poorly described intended use; however, subsequent publications clearly communicate value propositions in regulatory terms (Supplementary Table 3). Google’s AI screening for diabetic retinopathy is an example where the lack of instructions for use was responsible for key performance issues (e.g., operating the device in a dark room). The missing regulatory aspects stood in direct contradiction to simultaneously published regulatory comments from the FDA and, notably, Google itself, underscoring the importance of regulatory consistency (Supplementary Table 3). In other words, we can reconstruct that two of the most prominent AI fiascoes entailed inconsistencies in communication between AI and healthcare experts. Other examples include documented improvements in objectivity and reproducibility when tailoring performance measures to the specific target population. Notably, adapting the algorithm to the matched target population (as a mitigation strategy) enabled overcoming a biomarker challenge in ovarian cancer screening previously flagged as a public health concern (Supplementary Table 3). These examples illustrate that regulatory concepts are consequential and hold clinical value beyond a vantage point in a research publication.

The unique strengths and weaknesses of AI require new regulation to be developed and old regulation to be altered. For example, US-based regulatory guidances and the European Artificial Intelligence Act9 already account for regulatory-compliant reporting of change protocols (Supplementary Table 1), a change that accounts for potential problems identified during and after deployment of continuously learning AI models. These guidance documents and legislative developments argue strongly for a role of regulatory terminology as one of the key factors impacting the integration of AI approaches into medicine. Learning the language of regulatory science also confronts us with the fact that regulation, rather than being handed down from on high, is a human endeavor; that regulations are made by people who review the data and input that AI and medical experts generate; and that regulation can (and should) be challenged and updated. In the US, the FDA has established several strategies to address regulatory challenges by obtaining external, interdisciplinary input (Supplementary Table 4). These programs offer concrete and practical approaches to incorporating input from the technical communities. For example, the FDA engages with outside experts via collaborative communities, a network of experts, and specific medical device development tool programs to keep up with changes in the fields under its purview. Concretely, these initiatives have already influenced recent legislative proposals that now clearly spell out the need for “recommendations and other advice” from domain experts to facilitate meaningful regulatory guidance10. Learning the language of regulatory science can help those who know the most about medical AI to effectively influence the nascent regulatory landscape.
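To make the idea of a pre-specified change protocol tangible, the sketch below gates a hypothetical model update against bounds that would have been locked in before deployment, loosely in the spirit of a predetermined change control plan. The schema, thresholds, and field names are illustrative placeholders, not the content of any actual plan.

```python
def within_change_protocol(update, plan):
    """Check a proposed model update against a pre-specified change plan.

    update: metrics of the candidate model on a locked test set, plus its
            declared intended use.
    plan:   bounds agreed on before deployment (hypothetical schema).
    """
    checks = {
        "sensitivity_ok": update["sensitivity"] >= plan["min_sensitivity"],
        "specificity_ok": update["specificity"] >= plan["min_specificity"],
        "intended_use_unchanged": update["intended_use"] == plan["intended_use"],
    }
    return all(checks.values()), checks

plan = {
    "min_sensitivity": 0.90,
    "min_specificity": 0.85,
    "intended_use": "Detect referable diabetic retinopathy.",
}
update = {
    "sensitivity": 0.92,
    "specificity": 0.84,  # below the pre-specified bound, so the update fails
    "intended_use": "Detect referable diabetic retinopathy.",
}
approved, details = within_change_protocol(update, plan)
print(approved, details)  # False, with the failing check identified
```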

We view regulatory science as a fundamental building block of healthcare, one that now also focuses on using AI to improve patients’ lives. Regulatory science, with its language and concepts, has the potential to facilitate communication and collaboration between the fields of AI and medicine, as well as between the broader medical AI community and regulatory bodies. Knowledge of regulatory language, concepts, and science should be regarded as a core competency for communicating medical innovation. Regulatory-grade communication will be key to bringing medical AI from hype to standard of care.