Large language models associate Muslims with violence

Abid, Abubakar; Farooqi, Maheen; Zou, James

doi:10.1038/s42256-021-00359-2

Comment
Published: 17 June 2021

Large language models associate Muslims with violence

Nature Machine Intelligence volume 3, pages 461–463 (2021)Cite this article

3144 Accesses
41 Citations
250 Altmetric
Metrics details

Subjects

Large language models, which are increasingly used in AI applications, display undesirable stereotypes such as persistent associations between Muslims and violence. New approaches are needed to systematically reduce the harmful bias of language models in deployment.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Relevant articles

Open Access articles citing this article.

Manifestations of xenophobia in AI systems
- Nenad Tomasev
- , Jonathan Leader Maynard
- & Iason Gabriel
AI & SOCIETY Open Access 21 March 2024
Mitigating the impact of biased artificial intelligence in emergency decision-making
- Hammaad Adam
- , Aparna Balagopalan
- … Marzyeh Ghassemi
Communications Medicine Open Access 21 November 2022
Identity of AI
- Vladan Devedzic
Discover Artificial Intelligence Open Access 14 November 2022

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: GPT-3 exhibits Muslim–violence bias.**

**Fig. 2: Debiasing GPT-3 completions.**

References

Mikolov, T., Chen, K., Corrado, G. & Dean, J. in Proc. International Conference on Learning Representations (ICLR, 2013).
Dai, A. M. & Le, Q. V. in Advances in Neural Information Processing Systems Vol. 28, 3079–3087 (NeurIPS, 2015).
Brown, T. et al. in Advances in Neural Information Processing Systems Vol. 33, 1877–1901 (NeurIPS, 2020).
Kitaev, N., Kaiser, L. & Levskaya, A. in Proc. International Conference on Learning Representations (ICLR, 2020).
Bolukbasi, T., Chang, K.-W., Zou, J. Y., Saligrama, V. & Kalai, A. T. in Advances in Neural Information Processing Systems Vol. 29, 4349–4357 (NeurIPS, 2016).
Nadeem, M., Bethke, A. & Reddy, S. Preprint at https://arxiv.org/abs/2004.09456 (2020).
Sheng, E., Chang, K.-W., Natarajan, P. & Peng, N. in Proc. Conference on Empirical Methods in Natural Language Processing 3407–3412 (ACL, 2019).
Bordia, S. & Bowman, S. R. in Proc. Conference of the North American Chapter of the Association for Computational Linguistics (ACL, 2019).
Lu, K., Mardziel, P., Wu, F., Amancharla, P. & Datta, A. in Logic, Language, and Security (eds Nigam, V. et al.) 189–202 (Springer, 2020).
Lewis, M. et al. in Proc. 58th Annual Meeting of the Association for Computational Linguistics 7871–7880 (ACL, 2020).
Wallace, E., Feng, S., Kandpal, N., Gardner, M. & Singh, S. in Proc. Conference on Empirical Methods in Natural Language Processing (EMNLP) 2153–2162 (ACL, 2019).
Qian, Y., Muaz, U., Zhang, B. & Hyun, J. W. Preprint at https://arxiv.org/abs/1905.12801 (2019).
Bender, E. M., Gebru, T., McMillan-Major, A. & Mitchell, S. in ACM Conference on Fairness, Accountability, and Transparency 610–623 (ACM, 2021).
Li, X. L. & Liang, P. Preprint at https://arxiv.org/abs/2101.00190 (2021).

Download references

Acknowledgements

We thank A. Abid, A. Abdalla, D. Khan, and M. Ghassemi for the helpful feedback on the manuscript and experiments. J.Z. is supported by NSF CAREER 1942926.

Author information

Authors and Affiliations

Department of Electrical Engineering, Stanford University, Stanford, CA, USA
Abubakar Abid
Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Ontario, Canada
Maheen Farooqi
Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
James Zou

Authors

Abubakar Abid
View author publications
You can also search for this author in PubMed Google Scholar
Maheen Farooqi
View author publications
You can also search for this author in PubMed Google Scholar
James Zou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to James Zou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Machine Intelligence thanks Arvind Narayaran for their contribution to the peer review of this work.

Supplementary information

Supplementary Information

Supplementary discussions A–C

Rights and permissions

Reprints and permissions

About this article

Cite this article

Abid, A., Farooqi, M. & Zou, J. Large language models associate Muslims with violence. Nat Mach Intell 3, 461–463 (2021). https://doi.org/10.1038/s42256-021-00359-2

Download citation

Published: 17 June 2021
Issue Date: June 2021
DOI: https://doi.org/10.1038/s42256-021-00359-2

This article is cited by

Manifestations of xenophobia in AI systems
- Nenad Tomasev
- Jonathan Leader Maynard
- Iason Gabriel
AI & SOCIETY (2024)
Large language models in medicine
- Arun James Thirunavukarasu
- Darren Shu Jeng Ting
- Daniel Shu Wei Ting
Nature Medicine (2023)
Mitigating the impact of biased artificial intelligence in emergency decision-making
- Hammaad Adam
- Aparna Balagopalan
- Marzyeh Ghassemi
Communications Medicine (2022)
Shifting machine learning for healthcare from development to deployment and from models to data
- Angela Zhang
- Lei Xing
- Joseph C. Wu
Nature Biomedical Engineering (2022)
Identity of AI
- Vladan Devedzic
Discover Artificial Intelligence (2022)