Highly automated systems are becoming omnipresent. They range in function from self-driving vehicles to advanced medical diagnostics and afford many benefits. However, there are assurance challenges that have become increasingly visible in high-profile crashes and incidents. Governance of such systems is critical to garner widespread public trust. Governance principles have been previously proposed offering aspirational guidance to automated system developers; however, their implementation is often impractical given the excessive costs and processes required to enact and then enforce the principles. This Perspective, authored by an international and multidisciplinary team across government organizations, industry and academia, proposes a mechanism to drive widespread assurance of highly automated systems: independent audit. As proposed, independent audit of AI systems would embody three ‘AAA’ governance principles of prospective risk Assessments, operation Audit trails and system Adherence to jurisdictional requirements. Independent audit of AI systems serves as a pragmatic approach to an otherwise burdensome and unenforceable assurance challenge.
We owe special thanks to the participants of the CCC Workshop on Assured Autonomy for their ideas, inspiration and discussion that contributed to this paper. C.H. is a former Chairman of the National Transportation Safety Board.
G.F. is a consultant for the World Bank Group on autonomous vehicle regulation and is a ForHumanity fellow; he thanks the US National Institute for Standards and Technology (NIST), the Icelandic Fulbright Commission and the National Science Foundation for research funding. B.S. is a ForHumanity fellow. J.B. is a US government civil servant working for and funded by NASA. R.C. is the Executive Director of ForHumanity, a registered 501(c)(3) not-for-profit organization. A.D. is a member of the Maryland Cybersecurity Council, established by the Maryland State Legislature to work with the National Institute of Standards and Technology and other federal agencies, private sector businesses and private cybersecurity experts to improve cybersecurity in Maryland. D.D. is an external member of the Salesforce Advisory Council on Ethical & Humane Use of Technology. A.G. is a US government civil servant working for and funded by NASA; he is a voting member of SAE 34 working group on AI in Aviation that is writing guidelines for AI in aviation. C.H. is the Chairman of the Washington Metrorail Safety Commission where the opinions expressed in this article are his and not those of the Commission; he is bound by confidentiality agreements that prevent him from disclosing other competing interests in this work. M.J. holds an EPSRC Fellowship investigating ethical data recorders in robots, leads a project on legality and ethics of data recorders in autonomous vehicles and is a member of the All-Party Parliamentary Group on Data Analytics; she is a Director of ORBIT-RRI Ltd. H.J. is the Swedish Science and Innovation Counsellor to the United States. A.J.L. is Vice President and Global Outreach Director at Microsoft Research and currently serves as the Science Representative on the steering committee of the Global Partnership on Artificial Intelligence. A.K.M. 
is a Director of Minerva Intelligence Inc; he is a member of the Centre for AI Decision-making and Action (CAIDA) Steering Committee, the AI network of British Columbia (AInBC) Board and The Confederation of Laboratories for Artificial Intelligence Research in Europe (CLAIRE) international advisory board. C.M. is a Fellow of the Alan Turing Institute, a member of the Strategic Advisory Board of Zenzic and of the ENISA CarSec Expert Group. S.E.P. is the Chief National Cyber Security Adviser of the Icelandic Government, chairs the Icelandic Cyber Security Council and is a member of the Board of the European Union Agency for Cybersecurity (ENISA); he leads the development and implementation of Iceland’s cybersecurity strategy including the cybersecurity aspects of AI. A.W. sits on British Standards Institute committee AMT/010/01 Ethics for Robots and Autonomous Systems, the executive committee of the IEEE Standards Association Global Initiative on Ethics of Autonomous and Intelligent Systems and the WEF Global AI Council; he is a member of the Advisory Committee of robotics company Karakuri Ltd. Z.K.Y. leads the development and implementation of Singapore’s AI Governance Framework and is a member of OECD’s Network of Experts in AI and the Global Partnership on Artificial Intelligence’s expert working group on Data Governance. Authors not mentioned declare no competing interests.
Peer review information Nature Machine Intelligence thanks Ryan Calo and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Falco, G., Shneiderman, B., Badger, J. et al. Governing AI safety through independent audits. Nat Mach Intell 3, 566–571 (2021). https://doi.org/10.1038/s42256-021-00370-7