Teachers were spooked when ChatGPT was launched a year ago. The artificial-intelligence (AI) chatbot can write lucid, apparently well-researched essays in response to assignment questions, forcing educators around the world to rethink their evaluation methods. A few countries brought back pen-and-paper exams. And some schools are ‘flipping’ the classroom model: students do their assignments at school, after learning about a subject at home.

But after that initial shock, educators have started studying the chatbots’ potential benefits. As we report in a News feature, experiments to harness ChatGPT in education are under way in many schools and universities. There are risks, but some educators think that ChatGPT and other large language models (LLMs) can be powerful learning tools. They could help students by providing personalized tutoring that is available at any time and could reach more students than human tutors can. Or they could help teachers and students by making information and concepts normally restricted to textbooks much easier to find and digest.

There are still problems to be ironed out. Questions remain about whether LLMs can be made accurate and reliable enough to be trusted as learning assistants. It’s too soon to know what their ultimate effect on education will be, but more institutions need to explore ChatGPT’s advantages and pitfalls, and share what they are learning, or their students might miss out on a valuable tool.

Many students are already using ChatGPT. Within months of its launch, reports surfaced of students using the chatbot to do their homework and write their essays for them. Teachers were often unimpressed by the quality of the output. Crucially, the chatbot was inventing fictitious references or citations. And although it excelled in some mathematical tests (ref. 1), it didn’t do as well in others. That’s because ChatGPT has not been trained specifically to solve mathematical problems. Rather, it finds plausible words to finish a sentence or respond to a query on the basis of the billions of pieces of text it has seen.
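
To see why next-word prediction can produce fluent but unreliable answers, consider a deliberately tiny sketch. The bigram model below is a toy stand-in, not how any real LLM is built (real models use neural networks trained on vastly more text, and none of these names comes from the editorial); it simply continues a prompt with whatever word most often followed the previous one in its training text.

```python
# Toy illustration of next-word prediction. A bigram counter over three
# sentences stands in for a neural network trained on billions of documents;
# the point is only that picking the statistically plausible next word can
# sound fluent without the model ever "solving" anything.

from collections import Counter, defaultdict

corpus = (
    "the prime number two is even . "
    "the prime number three is odd . "
    "the square number four is even ."
).split()

# Count which word follows which in the training text.
follows: defaultdict[str, Counter] = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    follows[prev][nxt] += 1

def continue_text(prompt: str, n_words: int = 4) -> str:
    """Repeatedly append the most common continuation of the last word."""
    words = prompt.split()
    for _ in range(n_words):
        candidates = follows[words[-1]].most_common(1)
        if not candidates:
            break
        words.append(candidates[0][0])
    return " ".join(words)

# Fluent-looking output, driven purely by co-occurrence statistics:
print(continue_text("the prime"))  # -> "the prime number two is even"
```

Ask such a model something its training text never answered and it will still produce something plausible-sounding, which is exactly the failure mode the benchmark studies below probe.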

In a February preprint, researchers described how, in a benchmark set of relatively simple mathematical problems usually answered by students aged 12–17, ChatGPT answered about half of the questions correctly (ref. 2). If the problems were more complex, requiring ChatGPT to do four or more additions or subtractions in the same calculation, it was particularly likely to fail.

And the authors of a preprint posted in July found that the mathematical skills of the LLM that underlies ChatGPT might have worsened (ref. 3). In March 2023, the GPT-4 version of the chatbot correctly differentiated between prime and composite numbers 84% of the time. By June, it did so in only 51% of cases. The study’s authors note that “improving the model’s performance on some tasks, for example with fine-tuning on additional data, can have unexpected side effects on its behavior in other tasks”.

Despite these risks, educators should not avoid using LLMs. Rather, they need to teach students the chatbots’ strengths and weaknesses and support institutions’ efforts to improve the models for education-specific purposes. This could mean building task-specific versions of LLMs that harness their strengths in dialogue and summarization and minimize the risks of a chatbot providing students with inaccurate information or enabling them to cheat.

Arizona State University (ASU), for example, is rolling out a platform that enables faculty members to use generative AI models, including GPT-4 and Google’s Bard, another LLM-powered chatbot. In ASU courses, the platform uses a technique called retrieval-augmented generation: the chatbot is instructed to seek answers to users’ questions in specific data sets, such as scientific papers or lecture notes. This approach not only harnesses the chatbots’ conversational power, but also reduces the chance of errors.
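
The mechanics of retrieval-augmented generation are straightforward to sketch. The short program below is a minimal, illustrative version of the technique, not ASU’s actual platform: every name in it (DOCUMENTS, answer_with_sources, the word-overlap scorer, the stubbed llm_generate) is an assumption made for this example, and a production system would use vector embeddings for retrieval and a real model API for generation.

```python
# Minimal sketch of retrieval-augmented generation (RAG): fetch the passages
# most relevant to a question, then ask the model to answer from those
# passages only, citing them by name.

from collections import Counter

# A tiny "corpus" standing in for course-specific data sets such as
# lecture notes or scientific papers.
DOCUMENTS = {
    "lecture_03.txt": "A prime number has exactly two divisors: 1 and itself.",
    "lecture_04.txt": "Composite numbers have more than two divisors.",
    "syllabus.txt": "Office hours are Tuesdays at 2 p.m. in Room 101.",
}

def score(query: str, text: str) -> int:
    """Crude relevance score: how many query words the document shares.
    A real system would use vector embeddings and cosine similarity."""
    q, t = Counter(query.lower().split()), Counter(text.lower().split())
    return sum((q & t).values())

def retrieve(query: str, k: int = 2) -> list[tuple[str, str]]:
    """Return the k most relevant documents, with their names, so that
    the model can cite its sources."""
    ranked = sorted(DOCUMENTS.items(),
                    key=lambda item: score(query, item[1]), reverse=True)
    return ranked[:k]

def llm_generate(prompt: str) -> str:
    """Stub: in practice this would be a call to a hosted model's API."""
    return f"(model response to a {len(prompt)}-character prompt)"

def answer_with_sources(query: str) -> str:
    """Restrict the model to the retrieved passages and ask it to cite them."""
    context = "\n".join(f"[{name}] {text}" for name, text in retrieve(query))
    prompt = ("Answer using only the sources below, citing them by name.\n"
              f"Sources:\n{context}\n\nQuestion: {query}\nAnswer:")
    return llm_generate(prompt)

print(answer_with_sources("What is a prime number?"))
```

Restricting the prompt to retrieved passages, and asking the model to cite them by name, is also what lets a platform surface its sources, the feature ASU points to below.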

One of the greatest risks is that LLMs might perpetuate or worsen long-standing societal problems, such as biases and discrimination. For example, when summarizing existing literature, LLMs probably take cues from their training data and give less weight to the viewpoints of people from under-represented groups. ASU says that its platform helps to address such concerns by ensuring that the LLMs provide the sources that they used to generate answers, allowing students to think critically about whose ideas the chatbots present.

Vanderbilt University in Nashville, Tennessee, has an initiative called the Future of Learning and Generative AI. Students who need ChatGPT for courses such as computer science get access to a paid version. This variant of the chatbot can use other programs to execute computer code, augmenting the bot’s mathematical capabilities.
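
Why does executing code help with mathematics? A model that predicts the next word has to guess at an answer, whereas a model that writes a small program and hands it to an interpreter gets the answer computed exactly. The sketch below illustrates that division of labour; the safe_eval helper and the example expression are assumptions made for illustration, not how ChatGPT’s code-execution tooling actually works internally.

```python
# Sketch of the delegation step: instead of "predicting" a number word by
# word, the chatbot emits an arithmetic expression and the host evaluates it.

import ast
import operator

# Map AST operator nodes to exact arithmetic.
SAFE_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}

def safe_eval(expr: str) -> float:
    """Exactly evaluate a pure arithmetic expression, rejecting anything
    else. This is the job the chatbot delegates to an interpreter."""
    def walk(node: ast.AST) -> float:
        if isinstance(node, ast.Expression):
            return walk(node.body)
        if isinstance(node, ast.BinOp) and type(node.op) in SAFE_OPS:
            return SAFE_OPS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        raise ValueError("unsupported expression")
    return walk(ast.parse(expr, mode="eval"))

# A chain of four subtractions in one calculation, the kind of multi-step
# arithmetic the benchmark found ChatGPT likely to get wrong, is trivial
# once executed:
print(safe_eval("100 - 17 - 23 - 41 - 9"))  # -> 10
```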

As understanding of the LLMs’ power and limitations increases, more university-wide initiatives will no doubt emerge. Using LLMs without considering their downsides is counterproductive. For many educational purposes, error-prone tools are unhelpful at best and, at worst, damage students’ ability to learn. But some institutions, such as ASU, are trying to reduce the LLMs’ weaknesses, and even aiming to turn those weaknesses into strengths by, for example, using them to improve students’ critical-thinking skills. Educators must be bold, to avoid missing a huge opportunity, and vigilant, to ensure that institutions everywhere use LLMs in a way that makes the world better, not worse.