Use of automated conversational agents in improving young population mental health: a scoping review

Balan, Raluca; Dobrean, Anca; Poetar, Costina R.

doi:10.1038/s41746-024-01072-1

Download PDF

Review Article
Open access
Published: 19 March 2024

Use of automated conversational agents in improving young population mental health: a scoping review

npj Digital Medicine volume 7, Article number: 75 (2024) Cite this article

1102 Accesses
15 Altmetric
Metrics details

Subjects

Abstract

Automated conversational agents (CAs) emerged as a promising solution in mental health interventions among young people. Therefore, the objective of this scoping review is to examine the current state of research into fully automated CAs mediated interventions for the emotional component of mental health among young people. Selected databases were searched in March 2023. Included studies were primary research, reporting on development, feasibility/usability, or evaluation of fully automated CAs as a tool to improve the emotional component of mental health among young population. Twenty-five studies were included (N = 1707). Most automated CAs applications were standalone preventions targeting anxiety and depression. Automated CAs were predominantly AI-based chatbots, using text as the main communication channel. Overall, the results of the current scoping review showed that automated CAs mediated interventions for emotional problems are acceptable, engaging and with high usability. However, the results for clinical efficacy are far less conclusive, since almost half of evaluation studies reported no significant effect on emotional mental health outcomes. Based on these findings, it can be concluded that there is a pressing need to improve the existing automated CAs applications to increase their efficacy as well as conducting more rigorous methodological research in this area.

Systematic review and meta-analysis of AI-based conversational agents for promoting mental health and well-being

Article Open access 19 December 2023

Health-focused conversational agents in person-centered care: a review of apps

Article Open access 17 February 2022

Patient perceptions of disease burden and treatment of myasthenia gravis based on sentiment analysis of digital conversations

Article Open access 27 March 2024

Introduction

Mental health problems are an area of particular concern among young people. According to WHO, 20% of youths have a mental health disorder, a rate that is two times higher than in the general population¹. A history of mental health problems in young age forecasts a range of psychosocial difficulties in adult life². Despite high prevalence and long-term negative consequences of mental health problems, most children and youths do not participate in preventive or intervention actions because of attitudinal or logistic barriers³.

Use of technology has emerged as an important alternative to face-to-face approach in deploying assistive, preventive, and therapeutic solutions for those in need, increasing the availability and providing a stigma free environment for exploring their vulnerabilities related to mental health problems⁴. One such cutting edge digital solution is conversational agents (CAs), defined as systems simulating human interaction using text, speech, gestures, facial, or sensorial expressions as input and/or output⁵. The category of CAs covers a broad spectrum of embodiment types, from disembodied agents with no dynamic physical representation (chatbots) to agents with virtual representation or robots with a physical representation⁶. The autonomy level ranges from non-autonomous CAs, whose functionality totally depends on the decisions and actions of a human being, to semi-autonomous CAs (that have a certain degree of independence but require the real time control by humans for some specific scenarios and functionalities) to fully automated CAs, that can be used totally independently without any form of human support⁷. In this paper, the focus will be on fully automated CAs, irrespective of embodiment type.

With a rapid technological expansion, fully automated CAs seem to hold a great potential in mental health care for young people. In recent years, a growing body of research has been interested in developing and testing the efficacy of fully automated CAs for addressing mental health problems in a variety of settings with youths. In the healthcare setting, automated CAs are used to tackle distress related to medical procedures among youths, such as vaccination or cancer treatments^8,9. In an educational context, they have been employed as a tool to reduce problems such as general distress or performance anxiety^10,11. Automated CAs have also been used to prevent or to treat depression and anxiety in the general or psychiatric population¹².

While several reviews have been conducted to characterize various types of CAs as tools for treatment of mental health problems, several limitations have been identified. First, the previous reviews rely mostly on the adult population or do not distinguish between young and older population, with no comprehensive synthesis of existing automated CAs specifically designed to tackle mental health problems among young populations^13,14,15,16. Justification for focusing on the young population is rooted in prior research demonstrating distinctive preferences, attitudes, and utilization patterns compared to adults^17,18. As first adopters of the latest technological developments, including mental healthcare services, youths exhibit greater familiarity and comfort with these innovations¹⁹.

Second, most of the previous reviews did not distinguish fully automated CAs from non- or semi-autonomous CAs^13,20. Fully automated CA are a scalable, cost effective and alternative to human therapist support, moving the field towards a new paradigm. However, full automatization can pose significant challenges when used in mental health care with youths, such as limited capacity to respond to safety-critical situations, less personalization of the content or confidentiality issues^21,22.

Third, the previous reviews limited their focus to a subset of CAs based on the embodiment level, such as disembodied CAs^13,20, CAs with virtual representation¹⁵, or with a physical representation^23,24,25. Moreover, use of CAs was predominantly investigated in relation to a broad range of mental health problems^15,16, or specifically related to cognitive and social abilities, without considering the emotional component of mental health^24,25. This scoping review was formulated to focus specifically on the emotional component of mental health as defined through the lens of the medical model (e.g., changes in anxiety, depression, psychological distress) rather than social (e.g., repertoire of verbal/non-verbal abilities to communicate and interact with others) and cognitive skills (e.g., executive functioning skills) to specifically capture this innovative and growing application area for automated CAs.

In response to these gaps, this scoping review aims to provide a comprehensive overview of fully automated CAs and their role in enhancing the emotional component of mental health in the young population. The scoping review was guided by the following research questions:

(1)
What are the technological characteristics of automated CAs used to deliver interventions for youth’s mental health?
(2)
What are the characteristics of the interventions provided by automated CAs in children, adolescents, and young adults aiming to improve mental health outcomes?

Results

Study selection

The systematic search in databases and external sources returned 9905 articles. After duplicates removal, 6874 articles were screened for title and abstract and further 6719 studies were excluded. Out of the remaining 155 studies, we retrieved full-text copies for 152 articles that were screened in full. This resulted in a total of 25 studies included in the current scoping review. The study selection is detailed in Fig. 1 PRISMA flowchart.

A detailed overview of characteristics of included studies is provided in Supplementary Table 1 and 2.

Of the 25 studies, 19 were recently published (between 2020 and 2023)^{8,9,10,12,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40}. Studies were conducted predominantly in the US (n = 12)^{11,12,27,28,33,34,38,41,42,43,44,45}, followed by Europe (n = 5)^{10,26,29,32,35}, Asia (n = 4)^9,31,36,39, New Zealand (n = 2)^37,40, and Australia (n = 1)³⁰.

Technological characteristics

The summative results for technological characteristics of automated CAs are presented in Table 1. In total, there were 21 different agents described in the included studies. Only 3 of the CA were the focus of more than one study – Paro^11,41,45, Nao^8,38, and Woebot^12,42. These automated CAs were predominantly disembodied chatbots (n = 15)^{10,26,28,29,30,31,32,35,36,37,40,42,43,44}, followed by robots (n = 7)^{8,9,33,34,38,41,45}. Automated CAs with a virtual representation were the focus in 2 studies^11,27. In addition, one application consisted of a chatbot with features of avatar³⁹.

Table 1 Summative results per technological characteristics

Full size table

Regarding the dialog system underlying the process of conversation, almost half of automated CAs (n = 12) employed natural language processing and machine learning to carry on an interaction^{8,9,12,27,30,31,34,38,39,41,43,45}. Predefined dialog or interactions assembled, and matched to the user input in a dynamic manner was used in 10 studies^{10,11,26,28,29,32,33,35,40,44}, while 3 used a mixed dialog system^36,37,42. These agents communicated through text (n = 13)^{10,12,26,28,29,30,32,35,37,40,42,43,44}, speech (n = 2)^8,38, and non-verbal cues (n = 4)^9,34,41,45, while multiple modalities communication was employed by 5 studies^{11,31,33,36,39}. For one study no information was provided on modality of communication²⁷. Among automated CAs investigated in the included studies, 17 are available to purchase or for free use^{8,9,10,12,27,31,33,34,36,37,38,40,41,42,43,44,45}.

Characteristics of interventions

Characteristics of mental health interventions using automated CAs are detailed in Table 2.

Table 2 Summative results per characteristics of interventions

Full size table

Anxiety was the most frequent targeted emotional component of mental health by automated CAs (n = 12)^{8,10,12,27,33,35,37,38,41,42,43,45}. Depression was the second most targeted emotional component (n = 8)^{12,28,31,35,36,38,42,43}, followed by psychological well-being (n = 5)^{26,29,30,33,44}, general distress (n = 5)^{9,10,34,39,40}, and mood (n = 2)^33,41. One intervention had as target mental health problems as a broad construct³².

With respect to the scope of interventions, most of the studies labeled the CAs applications as interventions. In fact, those were designed and tested as having mainly a preventive scope, since the research was conducted with general or at-risk population^{8,9,10,11,26,28,29,30,32,33,34,35,37,38,43,44,45}. Only 8 studies were conducted on samples of youths screened as having detectable mental health problems, mainly based on youth or parent report^{12,27,31,36,40,41,42}.

Duration of interventions was reported by 19 studies. Most of the interventions last between 2- and 4-weeks (n = 8)^{8,10,29,38,39,40,42,43,44}, followed by interventions with a duration of 1 day or less (n = 5)^{9,28,34,41,45}, and interventions of 2 up to 7 days (n = 3)^26,31,33. Only 3 studies investigated interventions longer than 4 weeks^12,35,36. In terms of sessions’ frequency only 8 studies provide information and include daily sessions^26,33,43, bi-weekly^10,29,43, once a week³⁹ or 3 times per week³⁵.

Out of 25 included studies, only 5 focused on automated CAs as components embedded in other types of technologies or mental health services for mental health problems^{11,12,27,35,39}. The remaining 20 studies designed or evaluated automated CAs agents as standalone psychological interventions. Automated CAs that were not independent interventions were integrated components of web-based interventions, with additional technological features enabling the intervention such as videoconference or serious games^11,27,35,39 or as an additive component to primary care management¹².

Theoretical framework for automated CAs interventions was reported by 17 studies. Cognitive behavioral theory (CBT) principles were applied to most of the interventions to derive their content. More specifically, CBT was mentioned as a theoretical framework for 14 automated CAs applications^{8,10,11,12,26,28,31,35,36,37,42,43,44}. Among CBT based interventions, 2 applications mentioned relying exclusively on the third wave of CBT principles—acceptance and commitment therapy (ACT)^26,35. The second most reported theoretical framework was positive psychology, with 5 of automated CAs applications mentioning it as guiding theory for the content of the intervention^{29,33,37,40,44}. Other theoretical frameworks were Interpersonal Theory¹², Person Centered Theory³⁹, Metacognitive Intervention of Narrative Imagery³⁸, Motivational Interview⁴³, Transtheoretical Approach⁴³, Emotion Focused Theory⁴³, and Dialectical Behavioral Theory¹². The number of theoretical approaches guiding one intervention ranged from 1 to 4 (median 2.5).

Characteristics of peer-reviewed research

Summative results for characteristics of peer reviewed research are presented in Table 3.

Table 3 Summative results per characteristics of peer reviewed research

Full size table

Participants were predominantly recruited from an educational setting (n = 10)^{10,11,31,33,35,36,39,40,42,43}, followed by community setting (n = 6)^{26,28,34,37,41,44}, and hospital/healthcare settings (n = 6)^{8,9,12,27,38,45}. Sample sizes ranged between 8 and 234 participants, with 9 studies conducted on samples of less than 50 participants^{12,26,28,29,33,38,39,44,45}, 8 studies on samples between 50 and 100 participants^{9,10,27,34,36,41,42,43}, and 6 studies on samples above 100 participants^{8,11,31,35,37,40}. The presence of emotional problems on a certain level was required by 7 studies^{12,31,36,39,40,41,42}, whereas 4 studies focused on physical health condition as selection criteria^8,38,44,45. Additionally, undergoing a medical procedure, irrespective of health condition, was a selection criterion for 2 studies^9,27. The mean age of participants was 16.64. Females represented 58.14% of the total sample size.

With respect to the stage of research, most studies fall under combinations of research stages: 12 studies on feasibility/usability and evaluation^{10,12,26,27,31,33,36,38,40,42,43,44}, 1 on development and feasibility/usability²⁹, 1 on design and evaluation³⁹, and 1 on design, feasibility/usability, and evaluation³⁷.

Among the 23 feasibility/usability and/or evaluation studies, more than half were controlled studies (n = 14)^{8,9,11,12,27,31,34,35,36,41,42,43,44,45}. Controlled studies predominantly employed an active control group (n = 11)^{8,9,27,30,31,35,36,41,42,43,45}. Among the studies reporting on design and development of automated CAs, 3 used co-participatory and iterative designs, involving the young end users in different stages of development^30,32,37. One study reporting on development relied only on mental health specialists and researchers input in design³⁹. The methodological approaches most frequently employed were mixed (n = 15)^{10,12,26,27,28,29,31,33,36,37,39,40,42,43,44} and quantitative methods (n = 8)^{8,9,11,34,35,38,41,45}.

The feasibility/usability outcomes were reported in 15 studies and include parameters such as engagement, retention/adherence rate, acceptability, user satisfaction, usability of the system, safety, and functionality^{10,12,26,27,31,33,36,38,40,42,43,44}. Overall, the feasibility and usability parameters were reported to be relatively high across studies. However, a few exceptions are worth mentioning. Safety issues were reported in 2 studies^12,26. More than half of the participants reported at least one negative effect of the intervention delivered through SISU chatbot²⁶. A serious adverse event occurred, 1 participant reporting suicidal tendency for the first time after intervention²⁶. One study reported that during study participation, 4 (24%) participants had one alert for suicidal ideation 4 participants had 3, and 2 participants had 6. One parent from the intervention group reported in week 12 that his child was seen in an emergency department and discharged to go home¹². With respect to engagement and adherence, 2 studies point out a decrease of these parameters over time^29,31. The drop-out rates ranged between 0 and 70.9%.

All studies reporting evaluation outcomes included efficacy parameters (n = 21), with no study on cost-effectiveness. In terms of efficacy outcomes, almost half of the studies reported more than one mental health outcome. Summative results for efficacy outcomes per outcome and research design are presented in Table 4.

Table 4 Summative results for efficacy per outcome and study design

Full size table

Anxiety outcomes were reported in 15 studies. When comparing the effect of automated CAs with a control group on anxiety measures, 5 studies reported a positive significant difference compared to control, favoring the automated CA condition^{12,33,36,43,45}, whereas 4 studies found no significant difference^11,35,41,42. One RCT found an improvement in medical procedure related anxiety only for a subgroup of participants, namely those undergoing more invasive procedures and with more frequent exposure to medical procedures²⁷. Among uncontrolled studies, a significant decrease in anxiety from baseline to post-intervention was reported in 2 studies^36,38, no effect in one study⁴⁰, while one study reported a negative effect of the automated CA mediated intervention expressed as an increase in anxiety symptoms²⁶. One uncontrolled study reported a significant decrease in anxiety only for youths with initial high levels of anxiety¹⁰.

Depression was reported in 9 studies. Among controlled trials focusing on reducing depression, 5 studies reported a significant difference between control and automated CA group, favoring the experimental condition^{12,31,36,42,43}, whereas 2 controlled studies found no significant difference on depression scores^35,44. Among uncontrolled trials, a minimal change in depression score was reported in one study using a robot³⁸, whereas another study showed no improvement from pre to post test²⁶.

Positive and negative affect were separately assessed in 6 studies^{34,36,41,42,43,44}, whereas one study used a composite measure of overall affect, combining both facets in one score³³. All but one study⁴³ reported no significant difference between control group and automated CA condition in reducing negative affect. However, an improvement in positive affect was found in 3 studies^34,41,43, while the other 3 remaining studies reported no difference between groups on this outcome^36,42,44. In one study, a robot coach delivering a positive psychology intervention improved the overall affect among young adults³³.

The effect of automated CAs mediated intervention on distress was explored in 5 studies. Out of the 5 studies, 2 used a controlled design and found a significant effect on distress after 5- and 20-min post-intervention, but not immediately following the intervention^8,9. Among uncontrolled studies, 2 studies report a significant decrease in distress outcomes from pre to post intervention^38,39, while other study found a significant effect on distress only for participants with initial high distress scores¹⁰ Moreover, a negative effect was reported for those with initial low levels of distress, for whom distress increased from pre to post intervention¹⁰.

Two uncontrolled studies were conducted to test the effectiveness of automated CAs mediated intervention on psychological well-being, showing a significant improvement^33,40. One study reported as outcome a measure of psychological sensitivity, which also showed a significant decrease from pre- to post-intervention³⁹. No significant effect of a chatbot based intervention on subjective happiness was reported in the uncontrolled study³⁹. An indicator of anxiety—physiological arousal—was reported in one study, with no change from pre- to post-intervention⁴¹. Similarly, post-traumatic stress disorder symptoms showed no significant improvement after an agent-based software intervention²⁶.

Discussion

The field is marked by a notable surge in the deployment of fully automated CAs specifically designed to address the emotional facets of mental health in the youth, with our review scrutinizing 21 distinct automated CAs across 25 included papers. Considering that most of these studies were published between 2020 and 2023, it is evident that the literature in this realm is still in its early stages. Despite the potential to extend support to a larger demographic of the young population, our findings underscore a significant lag in the adoption of automated CA-mediated interventions in less developed countries. The deployment of such entities typically incurs substantial financial outlays, a factor that inherently influences their accessibility and widespread adoption. This economic consideration is a critical aspect in understanding the differential integration of these technologies, particularly in contexts where resource allocation plays a pivotal role. However, there was an expansion of digital application in mental health and of shipped phones – that can be used to access at least text based and speech automated CAs available in the commercial market, therefore more research in other geographic areas is expected to be conducted⁴⁶.

The technological capabilities of automated CAs interventions for youths are evolving from simple oriented tasks and predefined decision trees to more complex and interactive solutions, as shown by the predominance of AI-based technologies. However, the state-of-the-art lags in terms of other technological capabilities such as embodiment and communication channels. This aspect holds particular significance, as previous research indicated that youths exhibit improved responsiveness and greater openness to CAs that possess virtual or physical representation, in contrast to disembodied CAs⁴⁷. Furthermore, although young people are used to typing and text messaging, there is evidence pointing to youth preference towards an interaction with CAs using speech and auditory channels beyond text⁴⁸. Similar conclusions were drawn by reviews conducted with the adult population in clinical psychology and healthcare with respect to the status of CAs technological capabilities, showing a rapid development in terms of dialog systems employed but a slower progress related to other technological capabilities such as type of representation and communication^14,20. However, while adults’ acceptability of CAs might revolve around less sophisticated and thus more familiar technologies, youths hold higher expectations since they learn and adopt new cutting-edge technologies from their infancy. Therefore, these aspects might weigh more for youths than adults when it comes to the acceptability and uptake of current automated CAs as mental health solutions.

The prevailing focus of current automated CAs mediated interventions centers on mitigating emotional problems, leaving limited attention to fostering positive aspects of emotional mental health, such as happiness or psychological well-being. A recent study showed that youth’s preference regarding psychological interventions for emotional problems revolves around a balance between the medical model of mental health, oriented to solving problems and the growth positive models, based on the assumption that all human beings have the capacity to flourish, and build upon existing strengths⁴⁹. This might be more relevant when it comes to appealing technologies such as automated CAs, since it is possible that youths make an indirect association between the appealing, interactive tool and positive aspects in its content.

Our review emphasizes an advanced stage of research development, with a predominance of a combination of feasibility/usability and evaluation studies, conducted as controlled trials using an active control condition. This contrasts with research conducted on subsets of CAs or with adults, that identified mainly pilot uncontrolled studies investigating their feasibility and usability²⁰. However, as shown by the other reviews, the stage of system design and development of automated CAs mediated intervention as well as the input from end users from initial stages is often neglected^14,15. Relatively little attention has been given to the investigation of a priori preference of end users in terms of scope, features, personality, and content and to the use of the results to inform the development of the automated CAs from early stages^26,28. This is in contradiction with the advocated human centered approaches, that have the potential to enhance the uptake of CAs as mental health digital solutions^50,51.

The existing automated CAs appear to hold possibilities to support youths’ mental health mainly in community settings and less in clinical context. While previous reviews on adults show a growing use of CAs in treatment of mental health problems, the evidence supporting applicability of automated CAs in improving emotional health among youths is limited to non-clinical populations⁸. However, the broad spectrum of the care sector, ranging from healthcare applications to providing emotional support during medical procedures, as well as educational contexts addressing anxiety and distress, reflects the versatile potential of automated CAs. Nevertheless, our review highlights a scarcity of applications targeted at younger children, potentially attributed to the fully autonomous nature of the CAs reviewed, requiring human facilitation. Furthermore, our investigation revealed a discernible pattern associating distinct types of embodiments with specific emotional challenges and age groups. Notably, automated CAs with physical embodiments demonstrated enhanced relevance in addressing transient, momentary emotional states among children. In contrast, disembodied CAs emerged as the predominant choice for ameliorating more stable emotional problems among adolescents and young adults. This nuanced understanding prompts a crucial consideration in the strategic deployment of CAs within the young population. Decisions regarding the selection of automated CA types should not only factor in age group distinctions but also align with the specific type of representation and emotional outcomes targeted by the intervention.

Feasibility and usability outcomes present an optimistic outlook, portraying automated CAs mediated interventions for youths’ emotional problems as generally acceptable and feasible, with high usability. Nevertheless, the implementation of automated CA interventions with youths encounters specific challenges. Firstly, automated CAs introduce potential safety risks, underscoring the imperative to address concerns related to suicidal ideation^12,26. Second, engagement and adherence appear to decrease over time^29,31. Third, the drop-out rate is overall higher than those reported in previous studies for other therapy formats⁵¹. These findings can be due to the fully automated nature of the CAs which acts as a self-help intervention. A review on the acceptability of online mental health programs for adolescents and young people found that drop-out rates were higher than the average when there was no concurrent therapist contact alongside digital components⁵². Although there is virtual guidance provided by the automated CA itself, it seems this might not be enough, and human assistance is needed besides the virtual assistance⁵². It is also possible that introducing youths to cutting-edge technology such as automated CAs may have a novelty effect, and that effect wears off in time, resulting in reduced engagement and adherence after prolonged interaction¹⁴.

Effectiveness remains inconclusive, challenging the assumption that technological advancements translate into improved efficacy. This finding is in accordance with some of the previous reviews conducted on evaluation of CAs in adult healthcare^14,53. There are several potential explanations for these results. First, most of the automated CAs interventions reviewed here were in fact universal prevention, directed at youths from the general population, with initial low levels of mental health problems and consequently with limited room for improvement⁵⁴. Indeed, when conducted on adults with clinical levels of anxiety or depression, a previous review showed medium to large effects of automated CAs interventions¹³. Second, according to a meta-analysis, for self-guided digital interventions to be efficient for youths, at least minimal support from a human therapist is needed⁵⁵. The CAs mediated interventions included in the current paper were automated and, with only a few exceptions, standalone interventions, where the therapeutic agent was only the CA itself. Moreover, despite the limited evidence supporting their efficacy, a majority of automated CAs are commercially accessible, potentially emphasizing market accessibility over clinical efficacy. This incongruity underscores the imperative for a more robust evaluation of CAs’ effectiveness in addressing the mental health needs of the youth population.

The results of the current review must be interpreted in the light of several limitations. First, only studies published in peer reviewed journals were considered and it is possible that other automated CAs applications in gray literature, conference proceedings or other sources were not considered. Given the emerging status of the research in this area, it is plausible that a handful of ongoing studies are only published in conference proceedings. Second, the review focus was limited to the emotional component of mental health. Future reviews should consider the potential of automated CAs to address a wider range of clinical problems and symptoms, beyond those examined in our investigation. Small sample sizes, predominantly recruited from non-clinical populations are largely responsible for reduced generalizability of findings across many included articles. Therefore, a critical consideration for future research in the area is to enroll larger samples from the clinical population into trials to increase the power and generalizability of the findings. Fourth, there was a substantial heterogeneity in how the reported feasibility/usability and efficacy parameters were measured and conceptualized across studies, which makes findings hard to generalize. For example, engagement was defined in terms of subjective impressions on s attractiveness and enjoyment²⁷, time spent per day or session in interacting with the automated CA¹⁰, percentage of target users returning for repeated sessions³⁷, and number of exchanged messages with the application⁴³. Similarly, efficacy outcomes such as anxiety and general distress were measured as salivary cortisol levels^8,41, subjective feelings^27,35, or in terms of behavioral cues⁹. Therefore, future research into automated CAs application would benefit from adhering to a standardized framework of measurement and conceptualization both in terms of feasibility/usability and evaluation outcomes to ensure comparability across studies.

Although most of the studies included measures of efficacy, usability or acceptability, there was no measurement of costs. Cost-effectiveness studies are needed to inform upon the affordability of such interventions in low and middle-income countries. Therefore, in our scoping review it was not possible to ascertain that automated CAs mediated psychological interventions are also cost effective when compared to the alternative approaches. Furthermore, more research on safety is warranted when speaking of fully automated CAs.

Another important direction would be to test whether integrating automated CAs as supporting the human therapist produces better results rather than just substituting it. Maybe a blended approach (face-to-face psychotherapy/ counseling) is the optimal solution for promoting mental health among youths while keeping the psychotherapeutic process engaging, attractive and safe at the same time. In addition, no comparison on feasibility, usability, or efficacy between different types of automated CAs was identified, despite preliminary results showing potential for differential responses to disembodied CAs, agents with virtual representation, and physical representation. It would be interesting to examine whether embodiment type predicts better engagement and clinical efficacy or is more preferred in a certain age group or context. Based on the existing research conducted on automated CAs we can’t generalize our findings to young people from low-income countries. Nonetheless, it is important to address this disparity through further investigations on the clinical efficacy of automated CAs with participants from different contexts, especially young people from low-income countries that face significant barriers in mental health treatment such as stigma, lack of financial resources, or lack of specialists.

Not lastly, we recommend involvement of end-users from early stages of development of automated CAs and changing the approach from developing automated CAs for youths to designing and devising them with youths, to enhance the uptake and acceptability of the application^50,51. Additionally, the current state-of-the-art lacks information about the sustainability of effects; therefore, a more thorough investigation of usability and efficacy outcomes on long term is strongly recommended.

In conclusion, the field is characterized by a rapid expansion of use of fully automated CAs, with more and more evolved technical capabilities and especially in high income countries. Despite being highly acceptable, feasible and engaging as well as highly available for use, automated CAs do not appear to be yet prepared to be implemented in clinical practice with the young population. Although it is a promising approach for young population mental health promotion, efforts should be made to improve the efficacy and the safety of automated CAs. Future research with a standardized assessment, larger and diverse samples (e.g., different clinical conditions) and rigorous designs (e.g., efficacy and effectiveness studies, longer follow-ups) needs to be conducted.

Methods

The scoping review was conducted in line with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines for conducting systematic scoping reviews⁵⁶. The protocol for this scoping review was prospectively registered in OSF under the ID 10.17605/OSF.IO/8KU6P.

Eligibility criteria

Inclusion criteria were: (1) primary studies based on either qualitative, quantitative, or mixed methods aiming to develop/design or test the usability, feasibility, efficacy, or economic cost effectiveness of a CA as a tool to improve a mental health outcome; (2) the CA is fully autonomous, meaning that it doesn’t rely on humans to generate responses or operate; (3) targeted samples of young population as end users, with a mean age ≤25 years; (4) published in a peer-reviewed journal and written in English. There were no inclusion restrictions on study design or on the mental or health status of participants.

Exclusion criteria were: (1) secondary research, conference proceedings, dissertations, and commentary articles aiming to describe or report on general aspects of human-CA interactions or interventions studies aimed to exclusively test general aspects of human-technology interactions using CAs; we also excluded studies describing or reporting on use of CA as a tool to improve social, cognitive, educational, or physical health outcomes as well as those focusing on CA applications for the purpose of assessment or monitoring only; (2) report on non-autonomous CAs, relying on a human user to generate responses (e.g., ‘Wizard of Oz’ methods) or semi-autonomous CAs, requiring a minimal human support to operate; (3) intervention targeted samples with a mean age >25; (4) written in languages other than English and published in gray literature.

Search strategy

Systematic searches were conducted by RB in March 2023 in multidisciplinary as well as specific domain databases (Web of Science, PubMed, Scopus, PsychInfo, ACM -Association for Computing Machinery Digital Library and IEEE Xplore) and studies references using keywords related to conversational agents, the age of the population of interest, and the role/scope of intervention (see Supplementary Note 1 for a detailed sample of the search strategy).

Study selection

The results of the search query were uploaded in EndNote (version 20; Clarivate Analytics). Following Cochrane recommendations, the screening process was piloted with a random sample of studies for both abstract and full text⁵⁷. The articles were screened by the RB and CRP. Any disagreements between the 2 independent reviewers were resolved through consulting with AD.

Data items and charting

A data form for exaction of information was designed prior to data charting and is detailed in the protocol for the current scoping review, published on Open Science Framework. The data extraction form was piloted and calibrated with the screening team. Like the study selection process, two reviewers (RB and CRP) independently conducted the process of data extraction, and any disagreements were resolved by the third reviewer (AD).

The following data items were charted: general information regarding the article (year, authors, country); technological characteristics (name, type of dialog system, availability, modality of communication, embodiment type); characteristics of the intervention (scope, mental health outcome targeted, duration, frequency, whether is standalone intervention and theoretical framework); characteristics of peer reviewed research (participants information, stage of research, study design and methodology and, if applicable, main results). A detailed overview of the definitions of each item together with corresponding categories is provided in Supplementary Table 3.

Synthesis of the results

First, information about study meta-characteristics of articles as well as about landscape of the automated CAs’ based interventions, characteristics of research conducted in the area and technological characteristics of CA from data-charting were summarized using descriptive statistics and descriptive narration. Key findings from usability/feasibility and evaluation studies were tabulated and narratively summarized.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The authors declare that the data supporting the findings of this study are available within the paper.

References

WHO. World Health Statistics 2018:: Monitoring Health for the SDGs. (WHO, 2018).
Schlack, R., Peerenboom, N., Neuperdt, L., Junker, S. & Beyer, A.-K. The effects of mental health problems in childhood and adolescence in young adults: results of the KiGGS cohort. J. Health Monit. 6, 3–19 (2021).
PubMed PubMed Central Google Scholar
Aguirre Velasco, A., Cruz, I. S. S., Billings, J., Jimenez, M. & Rowe, S. What are the barriers, facilitators and interventions targeting help-seeking behaviours for common mental health problems in adolescents? A systematic review. BMC Psychiatry 20, 293 (2020).
Article PubMed PubMed Central Google Scholar
Wolters, L. H., Op de Beek, V., Weidle, B. & Skokauskas, N. How can technology enhance cognitive behavioral therapy: the case of pediatric obsessive compulsive disorder. BMC Psychiatry 17, 226 (2017).
Article PubMed PubMed Central Google Scholar
Laranjo, L. et al. Conversational agents in healthcare: a systematic review. J. Am. Med. Inform. Assoc. 25, 1248–1258 (2018).
Article PubMed PubMed Central Google Scholar
Araujo, T. Living up to the chatbot hype: the influence of anthropomorphic design cues and communicative agency framing on conversational agent and company perceptions. Comput. Hum. Behav. 85, 183–189 (2018).
Article Google Scholar
Catania, F., Spitale, M. & Garzotto, F. Conversational agents in therapeutic interventions for neurodevelopmental disorders: a survey. ACM Comput. Surv. 55, 1–209 (2023).
Article Google Scholar
Rossi, S. et al. Using the Social Robot NAO for emotional support to children at a pediatric emergency department: randomized clinical trial. J. Med. Internet Res. 24, e29656 (2022).
Article PubMed PubMed Central Google Scholar
Tanaka, K., Hayakawa, M., Noda, C., Nakamura, A. & Akiyama, C. Effects of artificial intelligence aibo intervention on alleviating distress and fear in children. Child Adolesc. Psychiatry Ment. Health 16, 87 (2022).
Article PubMed PubMed Central Google Scholar
Gabrielli, S. et al. Engagement and effectiveness of a healthy-coping intervention via Chatbot for university students during the COVID-19 pandemic: mixed methods proof-of-concept study. JMIR MHealth UHealth 9, e27965 (2021).
Article PubMed PubMed Central Google Scholar
Kim, Y., Thayne, J. & Wei, Q. An embodied agent helps anxious students in mathematics learning. Educ. Technol. Res. Dev. 65, 219–235 (2017).
Article Google Scholar
Nicol, G., Wang, R., Graham, S., Dodd, S. & Garbutt, J. Chatbot-delivered cognitive behavioral therapy in adolescents with depression and anxiety during the COVID-19 pandemic: feasibility and acceptability study. JMIR Form. Res. 6, e40242 (2022).
Article PubMed PubMed Central Google Scholar
Lim, S. M., Shiau, C. W. C., Cheng, L. J. & Lau, Y. Chatbot-delivered psychotherapy for adults with depressive and anxiety symptoms: a systematic review and meta-regression. Behav. Ther. 53, 334–347 (2022).
Article PubMed Google Scholar
Kramer, L. L., Ter Stal, S., Mulder, B. C., de Vet, E. & van Velsen, L. Developing embodied conversational agents for coaching people in a healthy lifestyle: scoping review. J. Med. Internet Res. 22, e14058 (2020).
Article PubMed PubMed Central Google Scholar
Provoost, S., Lau, H. M., Ruwaard, J. & Riper, H. Embodied conversational agents in clinical psychology: a scoping review. J. Med. Internet Res. 19, e151 (2017).
Article PubMed PubMed Central Google Scholar
Gaffney, H., Mansell, W. & Tai, S. Conversational agents in the treatment of mental health problems: mixed-method systematic review. JMIR Ment. Health 6, e14166 (2019).
Article PubMed PubMed Central Google Scholar
Koulouri, T., Macredie, R. D. & Olakitan, D. Chatbots to support young adults’ mental health: an exploratory study of acceptability. ACM Trans. Interact. Intell. Syst. 12, 1–11 (2022).
Article Google Scholar
Bae Brandtzæg, P. B., Skjuve, M., Kristoffer Dysthe, K. K. & Følstad, A. When the social becomes non-human: young people’s perception of social support in chatbots. In Proc. of the 2021 CHI Conference on Human Factors in Computing Systems 1–13 (Association for Computing Machinery, New York, NY, USA,). https://doi.org/10.1145/3411764.3445318 (2021).
Apolinário-Hagen, J., Kemper, J. & Stürmer, C. Public acceptability of E-Mental health treatment services for psychological problems: a scoping review. JMIR Ment. Health 4, e6186 (2017).
Article Google Scholar
Car, L. T. et al. Conversational agents in health care: scoping review and conceptual analysis. J. Med. Internet Res. 22, e17158 (2020).
Article Google Scholar
Kretzschmar, K. et al. Can your phone be your therapist? Young People’s Ethical Perspectives on the Use of Fully Automated Conversational Agents (Chatbots) in Mental Health Support. Biomed. Inform. Insights 11, 1178222619829083 (2019).
Article PubMed PubMed Central Google Scholar
Ly, K. H., Ly, A.-M. & Andersson, G. A fully automated conversational agent for promoting mental well-being: a pilot RCT using mixed methods. Internet Inter. 10, 39–46 (2017).
Article Google Scholar
Bartl-Pokorny, K. D. et al. Robot-based intervention for children with autism spectrum disorder: a systematic literature review. IEEE Access 9, 165433–165450 (2021).
Article Google Scholar
Berrezueta-Guzman, J., Robles-Bykbaev, V. E., Pau, I., Pesántez-Avilés, F. & Martín-Ruiz, M.-L. Robotic technologies in ADHD care: literature review. IEEE Access 10, 608–625 (2022).
Article Google Scholar
Kabacińska, K., Prescott, T. J. & Robillard, J. M. Socially assistive robots as mental health interventions for children: a scoping review. Int. J. Soc. Robot. 13, 919–935 (2021).
Article Google Scholar
Bendig, E., Erb, B., Meißner, D., Bauereiß, N. & Baumeister, H. Feasibility of a Software agent providing a brief Intervention for Self-help to Uplift psychological wellbeing (“SISU”). A single-group pretest-posttest trial investigating the potential of SISU to act as therapeutic agent. Internet Inter. 24, 100377 (2021).
Article Google Scholar
Bray, L. et al. The acceptability and impact of the Xploro digital therapeutic platform to inform and prepare children for planned procedures in a hospital: before and after evaluation study. J. Med. Internet Res. 22, e17367 (2020).
Article PubMed PubMed Central Google Scholar
Dosovitsky, G. & Bunge, E. Development of a chatbot for depression: adolescent perceptions and recommendations. Child Adolesc. Ment. Health 28, 124–127 (2023).
Article PubMed Google Scholar
Gabrielli, S., Rizzi, S., Carbone, S. & Donisi, V. A chatbot-based coaching intervention for adolescents to promote life skills: pilot study. JMIR Hum. Factors 7, e16762 (2020).
Article PubMed PubMed Central Google Scholar
Grové, C. Co-developing a Mental Health and Wellbeing Chatbot With and for Young People. Front. Psychiatry 11, 606041 (2020).
Article PubMed Google Scholar
He, Y. et al. Mental health chatbot for young adults with depressive symptoms during the COVID-19 pandemic: single-blind, three-arm randomized controlled trial. J. Med. Internet Res. 24, e40719 (2022).
Article PubMed PubMed Central Google Scholar
Høiland, C. G., Følstad, A. & Karahasanovic, A. Hi, can I help? Exploring how to design a mental health chatbot for youths. Hum. Technol. 16, 139–169 (2020).
Article Google Scholar
Jeong, S. et al. Deploying a robotic positive psychology coach to improve college students’ psychological well-being. Use. Model. Use.Adapt. Interact. 33, 571–615 (2023).
Article Google Scholar
Kitt, E. R., Crossman, M. K., Matijczak, A., Burns, G. B. & Kazdin, A. E. Evaluating the role of a socially assistive robot in children’s mental health care. J. Child Fam. Stud. 30, 1722–1735 (2021).
Article PubMed PubMed Central Google Scholar
Lappalainen, P. et al. In the shadow of COVID-19: a randomized controlled online ACT trial promoting adolescent psychological flexibility and self-compassion. J. Context. Behav. Sci. 27, 34–44 (2023).
Article Google Scholar
Liu, H. et al. chatbots to provide self-help depression interventions for university students: a randomized trial of effectiveness. Internet Inter. 27, 100495 (2022).
Article MathSciNet Google Scholar
Ludin, N. et al. A chatbot to support young people during the COVID-19 pandemic in New Zealand: evaluation of the real-world rollout of an open trial. J. Med. Internet Res. 24, e38743 (2022).
Article PubMed PubMed Central Google Scholar
Russell, J. K., Strodl, E. & Kavanagh, D. Use of a social robot in the implementation of a narrative intervention for young people with cystic fibrosis: a feasibility study. Int. J. Soc. Robot. 13, 1787–1801 (2021).
Article Google Scholar
Trappey, A. J. C., Lin, A. P. C., Hsu, K. Y. K., Trappey, C. V. & Tu, K. L. K. Development of an empathy-centric counseling chatbot system capable of sentimental dialogue analysis. Processes 10, 930 (2022).
Article Google Scholar
Williams, R. et al. 21-day stress detox: open trial of a universal well-being chatbot for young adults. Soc. Sci. 10, 416 (2021).
Article Google Scholar
Crossman, M. K., Kazdin, A. E. & Kitt, E. R. The influence of a socially assistive robot on mood, anxiety, and arousal in children. Prof. Psychol. Res. Pract. 49, 48–56 (2018).
Article Google Scholar
Fitzpatrick, K. K., Darcy, A. & Vierhile, M. Delivering cognitive behavior therapy to young adults with symptoms of depression and anxiety using a fully automated conversational agent (Woebot): a randomized controlled trial. JMIR Ment. Health 4, e19 (2017).
Article PubMed PubMed Central Google Scholar
Fulmer, R., Joerin, A., Gentile, B., Lakerink, L. & Rauws, M. Using psychological artificial intelligence (Tess) to relieve symptoms of depression and anxiety: randomized controlled trial. JMIR Ment. Health 5, e9782 (2018).
Article Google Scholar
Greer, S. et al. Use of the Chatbot ‘Vivibot’ to deliver positive psychology skills and promote well-being among young people after cancer treatment: randomized controlled feasibility trial. JMIR MHealth UHealth 7, e15018 (2019).
Article PubMed PubMed Central Google Scholar
Okita, S. Y. Self-other’s perspective taking: the use of therapeutic robot companions as social agents for reducing pain and anxiety in pediatric patients. Cyberpsycho. Behav. Soc. Netw. 16, 436–441 (2013).
Article Google Scholar
Global Smartphone Market Analysis and Outlook: Disruption in a Changing Market - PDF Free Download. https://docplayer.net/1973753-Global-smartphone-market-analysis-and-outlook-disruption-in-a-changing-market.html.
Schuetzler, R. M., Giboney, J. S., Grimes, G. M. & Nunamaker, J. F. The influence of conversational agent embodiment and conversational relevance on socially desirable responding. Decis. Support Syst. 114, 94–102 (2018).
Article Google Scholar
Nguyen, H. Examining teenagers’ perceptions of conversational agents in learning settings. In Interaction Design and Children 374–381 (ACM, Braga Portugal,). https://doi.org/10.1145/3501712.3529740 (2022).
Michel, T., Tachtler, F., Slovak, P. & Fitzpatrick, G. Young People’s attitude toward positive psychology interventions: thematic analysis. JMIR Hum. Factors 7, e21145 (2020).
Article PubMed PubMed Central Google Scholar
Harte, R. et al. A human-centered design methodology to enhance the usability, human factors, and user experience of connected health systems: a three-phase methodology. JMIR Hum. Factors 4, e8 (2017).
Article PubMed PubMed Central Google Scholar
Wight, D., Wimbush, E., Jepson, R. & Doi, L. Six steps in quality intervention development (6SQuID). J. Epidemiol. Community Health 70, 520–525 (2016).
Article PubMed Google Scholar
Struthers, A. et al. The acceptability of E-mental health services for children, adolescents, and young adults: a systematic search and review. Can. J. Commun. Ment. Health 34, 1–21 (2015).
Article Google Scholar
Bendig, E., Erb, B., Schulze-Thuesing, L. & Baumeister, H. The next generation: chatbots in clinical psychology and psychotherapy to foster mental health – a scoping review. Verhaltenstherapie 32, 64–76 (2019).
Article Google Scholar
Werner-Seidler, A., Perry, Y., Calear, A. L., Newby, J. M. & Christensen, H. School-based depression and anxiety prevention programs for young people: a systematic review and meta-analysis. Clin. Psychol. Rev. 51, 30–47 (2017).
Article PubMed Google Scholar
Bennett, S. D. et al. Practitioner review: unguided and guided self-help interventions for common mental health disorders in children and adolescents: a systematic review and meta-analysis. J. Child Psychol. Psychiatry 60, 828–847 (2019).
Article PubMed Google Scholar
Tricco, A. C. et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): checklist and explanation. Ann. Intern. Med. 169, 467–473 (2018).
Article PubMed Google Scholar
Garritty, C. et al. Cochrane rapid reviews methods group offers evidence-informed guidance to conduct rapid reviews. J. Clin. Epidemiol. 130, 13–22 (2021).
Article PubMed Google Scholar

Download references

Acknowledgements

This study was funded by the Ministry of Research and Innovation, Grant no. PN-III-P1-1.1-PD-2021-0538 and by Babeş-Bolyai University, Grant no PDI-PFE ID 550, Contract no 31377/13.03.2023. The funders played no role in study design, data collection, analysis and interpretation of data, or the writing of this manuscript.

Author information

These authors contributed equally: Raluca Balan, Anca Dobrean, Costina R. Poetar.

Authors and Affiliations

The International Institute for the Advanced Studies of Psychotherapy and Applied Mental Health, Babeș-Bolyai University, Cluj-Napoca, Romania
Raluca Balan, Anca Dobrean & Costina R. Poetar
Department of Clinical Psychology and Psychotherapy, Babeş-Bolyai University, Cluj-Napoca, Cluj, Romania
Raluca Balan, Anca Dobrean & Costina R. Poetar

Authors

Raluca Balan
View author publications
You can also search for this author in PubMed Google Scholar
Anca Dobrean
View author publications
You can also search for this author in PubMed Google Scholar
Costina R. Poetar
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.B., A.D. and C.R.P. contributed to the conception and design of the study. R.B., A.D. and C.R.P. contributed to the literature search and data extraction. R.B., A.D. and C.R.P. contributed to data analysis and interpretation. R.B drafted the initial manuscript. A.D. and C.R.P. revised the manuscript. R. R.B., A.D. and C.R.P. were responsible for the decision to submit the manuscript. All authors contributed to the critical revision of the manuscript. All authors approved the manuscript.

Corresponding author

Correspondence to Anca Dobrean.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Balan, R., Dobrean, A. & Poetar, C.R. Use of automated conversational agents in improving young population mental health: a scoping review. npj Digit. Med. 7, 75 (2024). https://doi.org/10.1038/s41746-024-01072-1

Download citation

Received: 19 August 2023
Accepted: 07 March 2024
Published: 19 March 2024
DOI: https://doi.org/10.1038/s41746-024-01072-1