A quality assessment tool for artificial intelligence-centered diagnostic test accuracy studies: QUADAS-AI

Sounderajah, Viknesh; Ashrafian, Hutan; Rose, Sherri; Shah, Nigam H.; Ghassemi, Marzyeh; Golub, Robert; Kahn, Charles E.; Esteva, Andre; Karthikesalingam, Alan; Mateen, Bilal; Webster, Dale; Milea, Dan; Ting, Daniel; Treanor, Darren; Cushnan, Dominic; King, Dominic; McPherson, Duncan; Glocker, Ben; Greaves, Felix; Harling, Leanne; Ordish, Johan; Cohen, Jérémie F.; Deeks, Jon; Leeflang, Mariska; Diamond, Matthew; McInnes, Matthew D. F.; McCradden, Melissa; Abràmoff, Michael D.; Normahani, Pasha; Markar, Sheraz R.; Chang, Stephanie; Liu, Xiaoxuan; Mallett, Susan; Shetty, Shravya; Denniston, Alastair; Collins, Gary S.; Moher, David; Whiting, Penny; Bossuyt, Patrick M.; Darzi, Ara

doi:10.1038/s41591-021-01517-0

Correspondence
Published: 11 October 2021

Nature Medicine volume 27, pages 1663–1665 (2021)

To the Editor — Over the next decade, systems that are centered on artificial intelligence (AI), particularly machine learning, are predicted to become key components of several workflows within the health sector. Medical diagnosis is seen as one of the first areas that may be revolutionized by AI innovations. Indeed, more than 90% of health-related AI systems that have reached regulatory approval by the US Food and Drug Administration belong to the field of diagnostics¹.

In the current paradigm, most diagnostic investigations require interpretation from a clinician to identify the presence of a target condition, a crucial step in determining subsequent treatment strategies. Although this interpretation is essential to the provision of patient care, many health systems find it increasingly difficult to meet the demand for it. To address this issue, diagnostic AI systems have been characterized as medical devices that may alleviate the burden placed on diagnosticians by serving as case triage tools, enhancing diagnostic accuracy and stepping in as a second reader when necessary. As AI-centered diagnostic test accuracy (AI DTA) studies emerge, there has been a concurrent rise in systematic reviews that amalgamate the findings of comparable studies.

Notably, of these published AI DTA systematic reviews, 94% have been conducted in the absence of an AI-specific quality assessment tool². The most commonly used instrument is the quality assessment of diagnostic accuracy studies (QUADAS-2) tool³. QUADAS-2 assesses risk of bias and applicability, and its use is encouraged by current PRISMA 2020 guidance⁴. However, QUADAS-2 does not accommodate the niche terminology encountered in AI DTA studies, nor does it alert researchers to the sources of bias found within this class of study. Examples of such biases, framed against the established domains of QUADAS-2 (patient selection; index test; reference standard; and flow and timing), are listed in Table 1.

Table 1 Examples of bias within AI DTA studies

To tackle these sources of bias, as well as AI-specific examples such as algorithmic bias, we propose an AI-specific extension to QUADAS-2 and QUADAS-C⁵, a risk of bias tool that has been developed for comparative accuracy studies. This new tool, termed QUADAS-AI, will provide researchers and policy-makers with a specific framework to evaluate the risk of bias and applicability when conducting reviews of AI DTA studies and reviews of comparative accuracy studies that evaluate at least one AI-centered index test.

QUADAS-AI will be complementary to ongoing reporting guideline initiatives, such as STARD-AI⁶ and TRIPOD-AI⁷. QUADAS-AI is being coordinated by a global project team and steering committee that consists of clinician scientists, computer scientists, epidemiologists, statisticians, journal editors, representatives of the EQUATOR Network¹¹, regulatory leaders, industry leaders, funders, health policy-makers and bioethicists. Given the reach of AI technologies, we believe that connecting global stakeholders is of the utmost importance for this initiative. We therefore welcome contact from any potential new collaborators.

References

1. Benjamens, S., Dhunnoo, P. & Meskó, B. npj Digit. Med. 3, 118 (2020).
2. Jayakumar, S. et al. npj Digit. Med. (in the press).
3. Whiting, P. F. Ann. Intern. Med. 155, 529 (2011).
4. Page, M. J. et al. BMJ 372, n71 (2021).
5. Yang, B. et al. Open Science Framework https://doi.org/10.17605/OSF.IO/HQ8MF (2018).
6. Sounderajah, V. et al. Nat. Med. 26, 807–808 (2020).
7. Collins, G. & Moons, K. Lancet 393, 1577–1579 (2019).
8. Liu, X. & Rivera, S. C. Nat. Med. 26, 1364–1374 (2020).
9. Harris, M. et al. PLoS One 14, e0226134 (2019).
10. Roberts, M. et al. Nat. Mach. Intell. 3, 199–217 (2021).
11. The EQUATOR Network. Enhancing the QUAlity and Transparency Of Health Research; https://www.equator-network.org/ (accessed 27 September 2021).

Acknowledgements

Infrastructure support for this research was provided by the National Institute for Health Research (NIHR) Imperial Biomedical Research Centre (BRC). G.S.C. is supported by the NIHR Biomedical Research Centre and Cancer Research UK (programme grant C49297/A27294). D.T. is funded by National Pathology Imaging Co-operative, NPIC (project no. 104687), supported by a £50 million investment from the Data to Early Diagnosis and Precision Medicine strand of the government’s Industrial Strategy Challenge Fund, and managed and delivered by UK Research and Innovation (UKRI). F.G. is supported by the NIHR Applied Research Collaboration Northwest London. The views and opinions expressed herein are those of the authors and do not necessarily reflect the views of their employers or funders.

Author information

These authors jointly supervised this work: Patrick M. Bossuyt, Ara Darzi.

Authors and Affiliations

Institute of Global Health Innovation, Imperial College London, London, UK
Viknesh Sounderajah, Hutan Ashrafian, Dominic King, Leanne Harling & Ara Darzi
Department of Surgery and Cancer, Imperial College London, London, UK
Viknesh Sounderajah, Hutan Ashrafian, Leanne Harling, Pasha Normahani, Sheraz R. Markar & Ara Darzi
Center for Health Policy and Center for Primary Care and Outcomes Research, Stanford University, Stanford, CA, USA
Sherri Rose
Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA
Nigam H. Shah
Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA, USA
Marzyeh Ghassemi
Journal of the American Medical Association (JAMA), Chicago, IL, USA
Robert Golub
University of Pennsylvania, Philadelphia, PA, USA
Charles E. Kahn Jr
Salesforce Research, San Francisco, CA, USA
Andre Esteva
Google Health, Palo Alto, CA, USA
Alan Karthikesalingam, Dale Webster & Shravya Shetty
Wellcome Trust, London, UK
Bilal Mateen
Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore
Dan Milea & Daniel Ting
Leeds Teaching Hospitals NHS Trust, Leeds, UK
Darren Treanor
University of Leeds, Leeds, UK
Darren Treanor
Department of Clinical Pathology, and Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
Darren Treanor
Center for Medical Image Science and Visualization (CMIV), Linköping University, Linköping, Sweden
Darren Treanor
NHSX, London, UK
Dominic Cushnan
Optum, London, UK
Dominic King
Medicines and Healthcare Products Regulatory Agency, London, UK
Duncan McPherson & Johan Ordish
Faculty of Engineering, Department of Computing, Imperial College London, London, UK
Ben Glocker
National Institute for Health and Care Excellence, London, UK
Felix Greaves
Department of Pediatrics, Centre of Research in Epidemiology and Statistics, Inserm UMR 1153, Necker-Enfants Malades Hospital, Assistance Publique-Hôpitaux de Paris, Université de Paris, Paris, France
Jérémie F. Cohen
Institute of Applied Health Research, University of Birmingham, Birmingham, UK
Jon Deeks
Department of Epidemiology and Data Science, Amsterdam University Medical Centres, University of Amsterdam, Amsterdam, The Netherlands
Mariska Leeflang & Patrick M. Bossuyt
Food and Drug Administration, Silver Spring, MD, USA
Matthew Diamond
Departments of Radiology and Epidemiology, University of Ottawa, The Ottawa Hospital Research Institute, Ottawa, Ontario, Canada
Matthew D. F. McInnes
Department of Bioethics, The Hospital for Sick Kids, Toronto, Ontario, Canada
Melissa McCradden
Department of Ophthalmology and Visual Sciences, University of Iowa, Iowa City, IA, USA
Michael D. Abràmoff
Annals of Internal Medicine, American College of Physicians, Philadelphia, PA, USA
Stephanie Chang
Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
Xiaoxuan Liu & Alastair Denniston
University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
Xiaoxuan Liu & Alastair Denniston
Health Data Research UK, London, UK
Xiaoxuan Liu & Alastair Denniston
Centre for Medical Imaging, University College London, London, UK
Susan Mallett
Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
Gary S. Collins
NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
Gary S. Collins
Ottawa Hospital Research Institute, Ottawa, Ontario, Canada
David Moher
Bristol Medical School, University of Bristol, Bristol, UK
Penny Whiting

Contributions

V.S., S.R., N.H.S., M.G., R.G., C.E.K., X.L., G.S.C., D.W., A.E., H.A., D. Milea, D. McPherson, J.O., D. Treanor, J.F.C., M.L., M.M., M.D.F.M., M.D.A., S.M., P.W. and P.M.B. prepared the first draft of the manuscript. Critical edits, further direction and feedback were obtained from all co-authors (including A.K., B.M., D. Ting, D.C., D.K., F.G., L.H., J.D., M.D., P.N., S.M., S.C., S.S., A.D., D.M. and A.D.). The study described in the manuscript was conceptualized, discussed and agreed upon by all co-authors.

Corresponding authors

Correspondence to Viknesh Sounderajah, Patrick M. Bossuyt or Ara Darzi.

Ethics declarations

Competing interests

A.K., S.S. and D.W. are employees of Google. A.D. and H.A. are employees of Flagship Pioneering UK Ltd. A.E. is an employee of Salesforce. D.K. is an employee of Optum. None of the other authors has any competing interests.

About this article

Cite this article

Sounderajah, V., Ashrafian, H., Rose, S. et al. A quality assessment tool for artificial intelligence-centered diagnostic test accuracy studies: QUADAS-AI. Nat Med 27, 1663–1665 (2021). https://doi.org/10.1038/s41591-021-01517-0

Issue Date: October 2021

