Abstract
Automated conversational agents (CAs) emerged as a promising solution in mental health interventions among young people. Therefore, the objective of this scoping review is to examine the current state of research into fully automated CAs mediated interventions for the emotional component of mental health among young people. Selected databases were searched in March 2023. Included studies were primary research, reporting on development, feasibility/usability, or evaluation of fully automated CAs as a tool to improve the emotional component of mental health among young population. Twenty-five studies were included (N = 1707). Most automated CAs applications were standalone preventions targeting anxiety and depression. Automated CAs were predominantly AI-based chatbots, using text as the main communication channel. Overall, the results of the current scoping review showed that automated CAs mediated interventions for emotional problems are acceptable, engaging and with high usability. However, the results for clinical efficacy are far less conclusive, since almost half of evaluation studies reported no significant effect on emotional mental health outcomes. Based on these findings, it can be concluded that there is a pressing need to improve the existing automated CAs applications to increase their efficacy as well as conducting more rigorous methodological research in this area.
Similar content being viewed by others
Introduction
Mental health problems are an area of particular concern among young people. According to WHO, 20% of youths have a mental health disorder, a rate that is two times higher than in the general population1. A history of mental health problems in young age forecasts a range of psychosocial difficulties in adult life2. Despite high prevalence and long-term negative consequences of mental health problems, most children and youths do not participate in preventive or intervention actions because of attitudinal or logistic barriers3.
Use of technology has emerged as an important alternative to face-to-face approach in deploying assistive, preventive, and therapeutic solutions for those in need, increasing the availability and providing a stigma free environment for exploring their vulnerabilities related to mental health problems4. One such cutting edge digital solution is conversational agents (CAs), defined as systems simulating human interaction using text, speech, gestures, facial, or sensorial expressions as input and/or output5. The category of CAs covers a broad spectrum of embodiment types, from disembodied agents with no dynamic physical representation (chatbots) to agents with virtual representation or robots with a physical representation6. The autonomy level ranges from non-autonomous CAs, whose functionality totally depends on the decisions and actions of a human being, to semi-autonomous CAs (that have a certain degree of independence but require the real time control by humans for some specific scenarios and functionalities) to fully automated CAs, that can be used totally independently without any form of human support7. In this paper, the focus will be on fully automated CAs, irrespective of embodiment type.
With a rapid technological expansion, fully automated CAs seem to hold a great potential in mental health care for young people. In recent years, a growing body of research has been interested in developing and testing the efficacy of fully automated CAs for addressing mental health problems in a variety of settings with youths. In the healthcare setting, automated CAs are used to tackle distress related to medical procedures among youths, such as vaccination or cancer treatments8,9. In an educational context, they have been employed as a tool to reduce problems such as general distress or performance anxiety10,11. Automated CAs have also been used to prevent or to treat depression and anxiety in the general or psychiatric population12.
While several reviews have been conducted to characterize various types of CAs as tools for treatment of mental health problems, several limitations have been identified. First, the previous reviews rely mostly on the adult population or do not distinguish between young and older population, with no comprehensive synthesis of existing automated CAs specifically designed to tackle mental health problems among young populations13,14,15,16. Justification for focusing on the young population is rooted in prior research demonstrating distinctive preferences, attitudes, and utilization patterns compared to adults17,18. As first adopters of the latest technological developments, including mental healthcare services, youths exhibit greater familiarity and comfort with these innovations19.
Second, most of the previous reviews did not distinguish fully automated CAs from non- or semi-autonomous CAs13,20. Fully automated CA are a scalable, cost effective and alternative to human therapist support, moving the field towards a new paradigm. However, full automatization can pose significant challenges when used in mental health care with youths, such as limited capacity to respond to safety-critical situations, less personalization of the content or confidentiality issues21,22.
Third, the previous reviews limited their focus to a subset of CAs based on the embodiment level, such as disembodied CAs13,20, CAs with virtual representation15, or with a physical representation23,24,25. Moreover, use of CAs was predominantly investigated in relation to a broad range of mental health problems15,16, or specifically related to cognitive and social abilities, without considering the emotional component of mental health24,25. This scoping review was formulated to focus specifically on the emotional component of mental health as defined through the lens of the medical model (e.g., changes in anxiety, depression, psychological distress) rather than social (e.g., repertoire of verbal/non-verbal abilities to communicate and interact with others) and cognitive skills (e.g., executive functioning skills) to specifically capture this innovative and growing application area for automated CAs.
In response to these gaps, this scoping review aims to provide a comprehensive overview of fully automated CAs and their role in enhancing the emotional component of mental health in the young population. The scoping review was guided by the following research questions:
-
(1)
What are the technological characteristics of automated CAs used to deliver interventions for youth’s mental health?
-
(2)
What are the characteristics of the interventions provided by automated CAs in children, adolescents, and young adults aiming to improve mental health outcomes?
Results
Study selection
The systematic search in databases and external sources returned 9905 articles. After duplicates removal, 6874 articles were screened for title and abstract and further 6719 studies were excluded. Out of the remaining 155 studies, we retrieved full-text copies for 152 articles that were screened in full. This resulted in a total of 25 studies included in the current scoping review. The study selection is detailed in Fig. 1 PRISMA flowchart.
A detailed overview of characteristics of included studies is provided in Supplementary Table 1 and 2.
Of the 25 studies, 19 were recently published (between 2020 and 2023)8,9,10,12,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40. Studies were conducted predominantly in the US (n = 12)11,12,27,28,33,34,38,41,42,43,44,45, followed by Europe (n = 5)10,26,29,32,35, Asia (n = 4)9,31,36,39, New Zealand (n = 2)37,40, and Australia (n = 1)30.
Technological characteristics
The summative results for technological characteristics of automated CAs are presented in Table 1. In total, there were 21 different agents described in the included studies. Only 3 of the CA were the focus of more than one study – Paro11,41,45, Nao8,38, and Woebot12,42. These automated CAs were predominantly disembodied chatbots (n = 15)10,26,28,29,30,31,32,35,36,37,40,42,43,44, followed by robots (n = 7)8,9,33,34,38,41,45. Automated CAs with a virtual representation were the focus in 2 studies11,27. In addition, one application consisted of a chatbot with features of avatar39.
Regarding the dialog system underlying the process of conversation, almost half of automated CAs (n = 12) employed natural language processing and machine learning to carry on an interaction8,9,12,27,30,31,34,38,39,41,43,45. Predefined dialog or interactions assembled, and matched to the user input in a dynamic manner was used in 10 studies10,11,26,28,29,32,33,35,40,44, while 3 used a mixed dialog system36,37,42. These agents communicated through text (n = 13)10,12,26,28,29,30,32,35,37,40,42,43,44, speech (n = 2)8,38, and non-verbal cues (n = 4)9,34,41,45, while multiple modalities communication was employed by 5 studies11,31,33,36,39. For one study no information was provided on modality of communication27. Among automated CAs investigated in the included studies, 17 are available to purchase or for free use8,9,10,12,27,31,33,34,36,37,38,40,41,42,43,44,45.
Characteristics of interventions
Characteristics of mental health interventions using automated CAs are detailed in Table 2.
Anxiety was the most frequent targeted emotional component of mental health by automated CAs (n = 12)8,10,12,27,33,35,37,38,41,42,43,45. Depression was the second most targeted emotional component (n = 8)12,28,31,35,36,38,42,43, followed by psychological well-being (n = 5)26,29,30,33,44, general distress (n = 5)9,10,34,39,40, and mood (n = 2)33,41. One intervention had as target mental health problems as a broad construct32.
With respect to the scope of interventions, most of the studies labeled the CAs applications as interventions. In fact, those were designed and tested as having mainly a preventive scope, since the research was conducted with general or at-risk population8,9,10,11,26,28,29,30,32,33,34,35,37,38,43,44,45. Only 8 studies were conducted on samples of youths screened as having detectable mental health problems, mainly based on youth or parent report12,27,31,36,40,41,42.
Duration of interventions was reported by 19 studies. Most of the interventions last between 2- and 4-weeks (n = 8)8,10,29,38,39,40,42,43,44, followed by interventions with a duration of 1 day or less (n = 5)9,28,34,41,45, and interventions of 2 up to 7 days (n = 3)26,31,33. Only 3 studies investigated interventions longer than 4 weeks12,35,36. In terms of sessions’ frequency only 8 studies provide information and include daily sessions26,33,43, bi-weekly10,29,43, once a week39 or 3 times per week35.
Out of 25 included studies, only 5 focused on automated CAs as components embedded in other types of technologies or mental health services for mental health problems11,12,27,35,39. The remaining 20 studies designed or evaluated automated CAs agents as standalone psychological interventions. Automated CAs that were not independent interventions were integrated components of web-based interventions, with additional technological features enabling the intervention such as videoconference or serious games11,27,35,39 or as an additive component to primary care management12.
Theoretical framework for automated CAs interventions was reported by 17 studies. Cognitive behavioral theory (CBT) principles were applied to most of the interventions to derive their content. More specifically, CBT was mentioned as a theoretical framework for 14 automated CAs applications8,10,11,12,26,28,31,35,36,37,42,43,44. Among CBT based interventions, 2 applications mentioned relying exclusively on the third wave of CBT principles—acceptance and commitment therapy (ACT)26,35. The second most reported theoretical framework was positive psychology, with 5 of automated CAs applications mentioning it as guiding theory for the content of the intervention29,33,37,40,44. Other theoretical frameworks were Interpersonal Theory12, Person Centered Theory39, Metacognitive Intervention of Narrative Imagery38, Motivational Interview43, Transtheoretical Approach43, Emotion Focused Theory43, and Dialectical Behavioral Theory12. The number of theoretical approaches guiding one intervention ranged from 1 to 4 (median 2.5).
Characteristics of peer-reviewed research
Summative results for characteristics of peer reviewed research are presented in Table 3.
Participants were predominantly recruited from an educational setting (n = 10)10,11,31,33,35,36,39,40,42,43, followed by community setting (n = 6)26,28,34,37,41,44, and hospital/healthcare settings (n = 6)8,9,12,27,38,45. Sample sizes ranged between 8 and 234 participants, with 9 studies conducted on samples of less than 50 participants12,26,28,29,33,38,39,44,45, 8 studies on samples between 50 and 100 participants9,10,27,34,36,41,42,43, and 6 studies on samples above 100 participants8,11,31,35,37,40. The presence of emotional problems on a certain level was required by 7 studies12,31,36,39,40,41,42, whereas 4 studies focused on physical health condition as selection criteria8,38,44,45. Additionally, undergoing a medical procedure, irrespective of health condition, was a selection criterion for 2 studies9,27. The mean age of participants was 16.64. Females represented 58.14% of the total sample size.
With respect to the stage of research, most studies fall under combinations of research stages: 12 studies on feasibility/usability and evaluation10,12,26,27,31,33,36,38,40,42,43,44, 1 on development and feasibility/usability29, 1 on design and evaluation39, and 1 on design, feasibility/usability, and evaluation37.
Among the 23 feasibility/usability and/or evaluation studies, more than half were controlled studies (n = 14)8,9,11,12,27,31,34,35,36,41,42,43,44,45. Controlled studies predominantly employed an active control group (n = 11)8,9,27,30,31,35,36,41,42,43,45. Among the studies reporting on design and development of automated CAs, 3 used co-participatory and iterative designs, involving the young end users in different stages of development30,32,37. One study reporting on development relied only on mental health specialists and researchers input in design39. The methodological approaches most frequently employed were mixed (n = 15)10,12,26,27,28,29,31,33,36,37,39,40,42,43,44 and quantitative methods (n = 8)8,9,11,34,35,38,41,45.
The feasibility/usability outcomes were reported in 15 studies and include parameters such as engagement, retention/adherence rate, acceptability, user satisfaction, usability of the system, safety, and functionality10,12,26,27,31,33,36,38,40,42,43,44. Overall, the feasibility and usability parameters were reported to be relatively high across studies. However, a few exceptions are worth mentioning. Safety issues were reported in 2 studies12,26. More than half of the participants reported at least one negative effect of the intervention delivered through SISU chatbot26. A serious adverse event occurred, 1 participant reporting suicidal tendency for the first time after intervention26. One study reported that during study participation, 4 (24%) participants had one alert for suicidal ideation 4 participants had 3, and 2 participants had 6. One parent from the intervention group reported in week 12 that his child was seen in an emergency department and discharged to go home12. With respect to engagement and adherence, 2 studies point out a decrease of these parameters over time29,31. The drop-out rates ranged between 0 and 70.9%.
All studies reporting evaluation outcomes included efficacy parameters (n = 21), with no study on cost-effectiveness. In terms of efficacy outcomes, almost half of the studies reported more than one mental health outcome. Summative results for efficacy outcomes per outcome and research design are presented in Table 4.
Anxiety outcomes were reported in 15 studies. When comparing the effect of automated CAs with a control group on anxiety measures, 5 studies reported a positive significant difference compared to control, favoring the automated CA condition12,33,36,43,45, whereas 4 studies found no significant difference11,35,41,42. One RCT found an improvement in medical procedure related anxiety only for a subgroup of participants, namely those undergoing more invasive procedures and with more frequent exposure to medical procedures27. Among uncontrolled studies, a significant decrease in anxiety from baseline to post-intervention was reported in 2 studies36,38, no effect in one study40, while one study reported a negative effect of the automated CA mediated intervention expressed as an increase in anxiety symptoms26. One uncontrolled study reported a significant decrease in anxiety only for youths with initial high levels of anxiety10.
Depression was reported in 9 studies. Among controlled trials focusing on reducing depression, 5 studies reported a significant difference between control and automated CA group, favoring the experimental condition12,31,36,42,43, whereas 2 controlled studies found no significant difference on depression scores35,44. Among uncontrolled trials, a minimal change in depression score was reported in one study using a robot38, whereas another study showed no improvement from pre to post test26.
Positive and negative affect were separately assessed in 6 studies34,36,41,42,43,44, whereas one study used a composite measure of overall affect, combining both facets in one score33. All but one study43 reported no significant difference between control group and automated CA condition in reducing negative affect. However, an improvement in positive affect was found in 3 studies34,41,43, while the other 3 remaining studies reported no difference between groups on this outcome36,42,44. In one study, a robot coach delivering a positive psychology intervention improved the overall affect among young adults33.
The effect of automated CAs mediated intervention on distress was explored in 5 studies. Out of the 5 studies, 2 used a controlled design and found a significant effect on distress after 5- and 20-min post-intervention, but not immediately following the intervention8,9. Among uncontrolled studies, 2 studies report a significant decrease in distress outcomes from pre to post intervention38,39, while other study found a significant effect on distress only for participants with initial high distress scores10 Moreover, a negative effect was reported for those with initial low levels of distress, for whom distress increased from pre to post intervention10.
Two uncontrolled studies were conducted to test the effectiveness of automated CAs mediated intervention on psychological well-being, showing a significant improvement33,40. One study reported as outcome a measure of psychological sensitivity, which also showed a significant decrease from pre- to post-intervention39. No significant effect of a chatbot based intervention on subjective happiness was reported in the uncontrolled study39. An indicator of anxiety—physiological arousal—was reported in one study, with no change from pre- to post-intervention41. Similarly, post-traumatic stress disorder symptoms showed no significant improvement after an agent-based software intervention26.
Discussion
The field is marked by a notable surge in the deployment of fully automated CAs specifically designed to address the emotional facets of mental health in the youth, with our review scrutinizing 21 distinct automated CAs across 25 included papers. Considering that most of these studies were published between 2020 and 2023, it is evident that the literature in this realm is still in its early stages. Despite the potential to extend support to a larger demographic of the young population, our findings underscore a significant lag in the adoption of automated CA-mediated interventions in less developed countries. The deployment of such entities typically incurs substantial financial outlays, a factor that inherently influences their accessibility and widespread adoption. This economic consideration is a critical aspect in understanding the differential integration of these technologies, particularly in contexts where resource allocation plays a pivotal role. However, there was an expansion of digital application in mental health and of shipped phones – that can be used to access at least text based and speech automated CAs available in the commercial market, therefore more research in other geographic areas is expected to be conducted46.
The technological capabilities of automated CAs interventions for youths are evolving from simple oriented tasks and predefined decision trees to more complex and interactive solutions, as shown by the predominance of AI-based technologies. However, the state-of-the-art lags in terms of other technological capabilities such as embodiment and communication channels. This aspect holds particular significance, as previous research indicated that youths exhibit improved responsiveness and greater openness to CAs that possess virtual or physical representation, in contrast to disembodied CAs47. Furthermore, although young people are used to typing and text messaging, there is evidence pointing to youth preference towards an interaction with CAs using speech and auditory channels beyond text48. Similar conclusions were drawn by reviews conducted with the adult population in clinical psychology and healthcare with respect to the status of CAs technological capabilities, showing a rapid development in terms of dialog systems employed but a slower progress related to other technological capabilities such as type of representation and communication14,20. However, while adults’ acceptability of CAs might revolve around less sophisticated and thus more familiar technologies, youths hold higher expectations since they learn and adopt new cutting-edge technologies from their infancy. Therefore, these aspects might weigh more for youths than adults when it comes to the acceptability and uptake of current automated CAs as mental health solutions.
The prevailing focus of current automated CAs mediated interventions centers on mitigating emotional problems, leaving limited attention to fostering positive aspects of emotional mental health, such as happiness or psychological well-being. A recent study showed that youth’s preference regarding psychological interventions for emotional problems revolves around a balance between the medical model of mental health, oriented to solving problems and the growth positive models, based on the assumption that all human beings have the capacity to flourish, and build upon existing strengths49. This might be more relevant when it comes to appealing technologies such as automated CAs, since it is possible that youths make an indirect association between the appealing, interactive tool and positive aspects in its content.
Our review emphasizes an advanced stage of research development, with a predominance of a combination of feasibility/usability and evaluation studies, conducted as controlled trials using an active control condition. This contrasts with research conducted on subsets of CAs or with adults, that identified mainly pilot uncontrolled studies investigating their feasibility and usability20. However, as shown by the other reviews, the stage of system design and development of automated CAs mediated intervention as well as the input from end users from initial stages is often neglected14,15. Relatively little attention has been given to the investigation of a priori preference of end users in terms of scope, features, personality, and content and to the use of the results to inform the development of the automated CAs from early stages26,28. This is in contradiction with the advocated human centered approaches, that have the potential to enhance the uptake of CAs as mental health digital solutions50,51.
The existing automated CAs appear to hold possibilities to support youths’ mental health mainly in community settings and less in clinical context. While previous reviews on adults show a growing use of CAs in treatment of mental health problems, the evidence supporting applicability of automated CAs in improving emotional health among youths is limited to non-clinical populations8. However, the broad spectrum of the care sector, ranging from healthcare applications to providing emotional support during medical procedures, as well as educational contexts addressing anxiety and distress, reflects the versatile potential of automated CAs. Nevertheless, our review highlights a scarcity of applications targeted at younger children, potentially attributed to the fully autonomous nature of the CAs reviewed, requiring human facilitation. Furthermore, our investigation revealed a discernible pattern associating distinct types of embodiments with specific emotional challenges and age groups. Notably, automated CAs with physical embodiments demonstrated enhanced relevance in addressing transient, momentary emotional states among children. In contrast, disembodied CAs emerged as the predominant choice for ameliorating more stable emotional problems among adolescents and young adults. This nuanced understanding prompts a crucial consideration in the strategic deployment of CAs within the young population. Decisions regarding the selection of automated CA types should not only factor in age group distinctions but also align with the specific type of representation and emotional outcomes targeted by the intervention.
Feasibility and usability outcomes present an optimistic outlook, portraying automated CAs mediated interventions for youths’ emotional problems as generally acceptable and feasible, with high usability. Nevertheless, the implementation of automated CA interventions with youths encounters specific challenges. Firstly, automated CAs introduce potential safety risks, underscoring the imperative to address concerns related to suicidal ideation12,26. Second, engagement and adherence appear to decrease over time29,31. Third, the drop-out rate is overall higher than those reported in previous studies for other therapy formats51. These findings can be due to the fully automated nature of the CAs which acts as a self-help intervention. A review on the acceptability of online mental health programs for adolescents and young people found that drop-out rates were higher than the average when there was no concurrent therapist contact alongside digital components52. Although there is virtual guidance provided by the automated CA itself, it seems this might not be enough, and human assistance is needed besides the virtual assistance52. It is also possible that introducing youths to cutting-edge technology such as automated CAs may have a novelty effect, and that effect wears off in time, resulting in reduced engagement and adherence after prolonged interaction14.
Effectiveness remains inconclusive, challenging the assumption that technological advancements translate into improved efficacy. This finding is in accordance with some of the previous reviews conducted on evaluation of CAs in adult healthcare14,53. There are several potential explanations for these results. First, most of the automated CAs interventions reviewed here were in fact universal prevention, directed at youths from the general population, with initial low levels of mental health problems and consequently with limited room for improvement54. Indeed, when conducted on adults with clinical levels of anxiety or depression, a previous review showed medium to large effects of automated CAs interventions13. Second, according to a meta-analysis, for self-guided digital interventions to be efficient for youths, at least minimal support from a human therapist is needed55. The CAs mediated interventions included in the current paper were automated and, with only a few exceptions, standalone interventions, where the therapeutic agent was only the CA itself. Moreover, despite the limited evidence supporting their efficacy, a majority of automated CAs are commercially accessible, potentially emphasizing market accessibility over clinical efficacy. This incongruity underscores the imperative for a more robust evaluation of CAs’ effectiveness in addressing the mental health needs of the youth population.
The results of the current review must be interpreted in the light of several limitations. First, only studies published in peer reviewed journals were considered and it is possible that other automated CAs applications in gray literature, conference proceedings or other sources were not considered. Given the emerging status of the research in this area, it is plausible that a handful of ongoing studies are only published in conference proceedings. Second, the review focus was limited to the emotional component of mental health. Future reviews should consider the potential of automated CAs to address a wider range of clinical problems and symptoms, beyond those examined in our investigation. Small sample sizes, predominantly recruited from non-clinical populations are largely responsible for reduced generalizability of findings across many included articles. Therefore, a critical consideration for future research in the area is to enroll larger samples from the clinical population into trials to increase the power and generalizability of the findings. Fourth, there was a substantial heterogeneity in how the reported feasibility/usability and efficacy parameters were measured and conceptualized across studies, which makes findings hard to generalize. For example, engagement was defined in terms of subjective impressions on s attractiveness and enjoyment27, time spent per day or session in interacting with the automated CA10, percentage of target users returning for repeated sessions37, and number of exchanged messages with the application43. Similarly, efficacy outcomes such as anxiety and general distress were measured as salivary cortisol levels8,41, subjective feelings27,35, or in terms of behavioral cues9. Therefore, future research into automated CAs application would benefit from adhering to a standardized framework of measurement and conceptualization both in terms of feasibility/usability and evaluation outcomes to ensure comparability across studies.
Although most of the studies included measures of efficacy, usability or acceptability, there was no measurement of costs. Cost-effectiveness studies are needed to inform upon the affordability of such interventions in low and middle-income countries. Therefore, in our scoping review it was not possible to ascertain that automated CAs mediated psychological interventions are also cost effective when compared to the alternative approaches. Furthermore, more research on safety is warranted when speaking of fully automated CAs.
Another important direction would be to test whether integrating automated CAs as supporting the human therapist produces better results rather than just substituting it. Maybe a blended approach (face-to-face psychotherapy/ counseling) is the optimal solution for promoting mental health among youths while keeping the psychotherapeutic process engaging, attractive and safe at the same time. In addition, no comparison on feasibility, usability, or efficacy between different types of automated CAs was identified, despite preliminary results showing potential for differential responses to disembodied CAs, agents with virtual representation, and physical representation. It would be interesting to examine whether embodiment type predicts better engagement and clinical efficacy or is more preferred in a certain age group or context. Based on the existing research conducted on automated CAs we can’t generalize our findings to young people from low-income countries. Nonetheless, it is important to address this disparity through further investigations on the clinical efficacy of automated CAs with participants from different contexts, especially young people from low-income countries that face significant barriers in mental health treatment such as stigma, lack of financial resources, or lack of specialists.
Not lastly, we recommend involvement of end-users from early stages of development of automated CAs and changing the approach from developing automated CAs for youths to designing and devising them with youths, to enhance the uptake and acceptability of the application50,51. Additionally, the current state-of-the-art lacks information about the sustainability of effects; therefore, a more thorough investigation of usability and efficacy outcomes on long term is strongly recommended.
In conclusion, the field is characterized by a rapid expansion of use of fully automated CAs, with more and more evolved technical capabilities and especially in high income countries. Despite being highly acceptable, feasible and engaging as well as highly available for use, automated CAs do not appear to be yet prepared to be implemented in clinical practice with the young population. Although it is a promising approach for young population mental health promotion, efforts should be made to improve the efficacy and the safety of automated CAs. Future research with a standardized assessment, larger and diverse samples (e.g., different clinical conditions) and rigorous designs (e.g., efficacy and effectiveness studies, longer follow-ups) needs to be conducted.
Methods
The scoping review was conducted in line with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines for conducting systematic scoping reviews56. The protocol for this scoping review was prospectively registered in OSF under the ID 10.17605/OSF.IO/8KU6P.
Eligibility criteria
Inclusion criteria were: (1) primary studies based on either qualitative, quantitative, or mixed methods aiming to develop/design or test the usability, feasibility, efficacy, or economic cost effectiveness of a CA as a tool to improve a mental health outcome; (2) the CA is fully autonomous, meaning that it doesn’t rely on humans to generate responses or operate; (3) targeted samples of young population as end users, with a mean age ≤25 years; (4) published in a peer-reviewed journal and written in English. There were no inclusion restrictions on study design or on the mental or health status of participants.
Exclusion criteria were: (1) secondary research, conference proceedings, dissertations, and commentary articles aiming to describe or report on general aspects of human-CA interactions or interventions studies aimed to exclusively test general aspects of human-technology interactions using CAs; we also excluded studies describing or reporting on use of CA as a tool to improve social, cognitive, educational, or physical health outcomes as well as those focusing on CA applications for the purpose of assessment or monitoring only; (2) report on non-autonomous CAs, relying on a human user to generate responses (e.g., ‘Wizard of Oz’ methods) or semi-autonomous CAs, requiring a minimal human support to operate; (3) intervention targeted samples with a mean age >25; (4) written in languages other than English and published in gray literature.
Search strategy
Systematic searches were conducted by RB in March 2023 in multidisciplinary as well as specific domain databases (Web of Science, PubMed, Scopus, PsychInfo, ACM -Association for Computing Machinery Digital Library and IEEE Xplore) and studies references using keywords related to conversational agents, the age of the population of interest, and the role/scope of intervention (see Supplementary Note 1 for a detailed sample of the search strategy).
Study selection
The results of the search query were uploaded in EndNote (version 20; Clarivate Analytics). Following Cochrane recommendations, the screening process was piloted with a random sample of studies for both abstract and full text57. The articles were screened by the RB and CRP. Any disagreements between the 2 independent reviewers were resolved through consulting with AD.
Data items and charting
A data form for exaction of information was designed prior to data charting and is detailed in the protocol for the current scoping review, published on Open Science Framework. The data extraction form was piloted and calibrated with the screening team. Like the study selection process, two reviewers (RB and CRP) independently conducted the process of data extraction, and any disagreements were resolved by the third reviewer (AD).
The following data items were charted: general information regarding the article (year, authors, country); technological characteristics (name, type of dialog system, availability, modality of communication, embodiment type); characteristics of the intervention (scope, mental health outcome targeted, duration, frequency, whether is standalone intervention and theoretical framework); characteristics of peer reviewed research (participants information, stage of research, study design and methodology and, if applicable, main results). A detailed overview of the definitions of each item together with corresponding categories is provided in Supplementary Table 3.
Synthesis of the results
First, information about study meta-characteristics of articles as well as about landscape of the automated CAs’ based interventions, characteristics of research conducted in the area and technological characteristics of CA from data-charting were summarized using descriptive statistics and descriptive narration. Key findings from usability/feasibility and evaluation studies were tabulated and narratively summarized.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The authors declare that the data supporting the findings of this study are available within the paper.
References
WHO. World Health Statistics 2018:: Monitoring Health for the SDGs. (WHO, 2018).
Schlack, R., Peerenboom, N., Neuperdt, L., Junker, S. & Beyer, A.-K. The effects of mental health problems in childhood and adolescence in young adults: results of the KiGGS cohort. J. Health Monit. 6, 3–19 (2021).
Aguirre Velasco, A., Cruz, I. S. S., Billings, J., Jimenez, M. & Rowe, S. What are the barriers, facilitators and interventions targeting help-seeking behaviours for common mental health problems in adolescents? A systematic review. BMC Psychiatry 20, 293 (2020).
Wolters, L. H., Op de Beek, V., Weidle, B. & Skokauskas, N. How can technology enhance cognitive behavioral therapy: the case of pediatric obsessive compulsive disorder. BMC Psychiatry 17, 226 (2017).
Laranjo, L. et al. Conversational agents in healthcare: a systematic review. J. Am. Med. Inform. Assoc. 25, 1248–1258 (2018).
Araujo, T. Living up to the chatbot hype: the influence of anthropomorphic design cues and communicative agency framing on conversational agent and company perceptions. Comput. Hum. Behav. 85, 183–189 (2018).
Catania, F., Spitale, M. & Garzotto, F. Conversational agents in therapeutic interventions for neurodevelopmental disorders: a survey. ACM Comput. Surv. 55, 1–209 (2023).
Rossi, S. et al. Using the Social Robot NAO for emotional support to children at a pediatric emergency department: randomized clinical trial. J. Med. Internet Res. 24, e29656 (2022).
Tanaka, K., Hayakawa, M., Noda, C., Nakamura, A. & Akiyama, C. Effects of artificial intelligence aibo intervention on alleviating distress and fear in children. Child Adolesc. Psychiatry Ment. Health 16, 87 (2022).
Gabrielli, S. et al. Engagement and effectiveness of a healthy-coping intervention via Chatbot for university students during the COVID-19 pandemic: mixed methods proof-of-concept study. JMIR MHealth UHealth 9, e27965 (2021).
Kim, Y., Thayne, J. & Wei, Q. An embodied agent helps anxious students in mathematics learning. Educ. Technol. Res. Dev. 65, 219–235 (2017).
Nicol, G., Wang, R., Graham, S., Dodd, S. & Garbutt, J. Chatbot-delivered cognitive behavioral therapy in adolescents with depression and anxiety during the COVID-19 pandemic: feasibility and acceptability study. JMIR Form. Res. 6, e40242 (2022).
Lim, S. M., Shiau, C. W. C., Cheng, L. J. & Lau, Y. Chatbot-delivered psychotherapy for adults with depressive and anxiety symptoms: a systematic review and meta-regression. Behav. Ther. 53, 334–347 (2022).
Kramer, L. L., Ter Stal, S., Mulder, B. C., de Vet, E. & van Velsen, L. Developing embodied conversational agents for coaching people in a healthy lifestyle: scoping review. J. Med. Internet Res. 22, e14058 (2020).
Provoost, S., Lau, H. M., Ruwaard, J. & Riper, H. Embodied conversational agents in clinical psychology: a scoping review. J. Med. Internet Res. 19, e151 (2017).
Gaffney, H., Mansell, W. & Tai, S. Conversational agents in the treatment of mental health problems: mixed-method systematic review. JMIR Ment. Health 6, e14166 (2019).
Koulouri, T., Macredie, R. D. & Olakitan, D. Chatbots to support young adults’ mental health: an exploratory study of acceptability. ACM Trans. Interact. Intell. Syst. 12, 1–11 (2022).
Bae Brandtzæg, P. B., Skjuve, M., Kristoffer Dysthe, K. K. & Følstad, A. When the social becomes non-human: young people’s perception of social support in chatbots. In Proc. of the 2021 CHI Conference on Human Factors in Computing Systems 1–13 (Association for Computing Machinery, New York, NY, USA,). https://doi.org/10.1145/3411764.3445318 (2021).
Apolinário-Hagen, J., Kemper, J. & Stürmer, C. Public acceptability of E-Mental health treatment services for psychological problems: a scoping review. JMIR Ment. Health 4, e6186 (2017).
Car, L. T. et al. Conversational agents in health care: scoping review and conceptual analysis. J. Med. Internet Res. 22, e17158 (2020).
Kretzschmar, K. et al. Can your phone be your therapist? Young People’s Ethical Perspectives on the Use of Fully Automated Conversational Agents (Chatbots) in Mental Health Support. Biomed. Inform. Insights 11, 1178222619829083 (2019).
Ly, K. H., Ly, A.-M. & Andersson, G. A fully automated conversational agent for promoting mental well-being: a pilot RCT using mixed methods. Internet Inter. 10, 39–46 (2017).
Bartl-Pokorny, K. D. et al. Robot-based intervention for children with autism spectrum disorder: a systematic literature review. IEEE Access 9, 165433–165450 (2021).
Berrezueta-Guzman, J., Robles-Bykbaev, V. E., Pau, I., Pesántez-Avilés, F. & Martín-Ruiz, M.-L. Robotic technologies in ADHD care: literature review. IEEE Access 10, 608–625 (2022).
Kabacińska, K., Prescott, T. J. & Robillard, J. M. Socially assistive robots as mental health interventions for children: a scoping review. Int. J. Soc. Robot. 13, 919–935 (2021).
Bendig, E., Erb, B., Meißner, D., Bauereiß, N. & Baumeister, H. Feasibility of a Software agent providing a brief Intervention for Self-help to Uplift psychological wellbeing (“SISU”). A single-group pretest-posttest trial investigating the potential of SISU to act as therapeutic agent. Internet Inter. 24, 100377 (2021).
Bray, L. et al. The acceptability and impact of the Xploro digital therapeutic platform to inform and prepare children for planned procedures in a hospital: before and after evaluation study. J. Med. Internet Res. 22, e17367 (2020).
Dosovitsky, G. & Bunge, E. Development of a chatbot for depression: adolescent perceptions and recommendations. Child Adolesc. Ment. Health 28, 124–127 (2023).
Gabrielli, S., Rizzi, S., Carbone, S. & Donisi, V. A chatbot-based coaching intervention for adolescents to promote life skills: pilot study. JMIR Hum. Factors 7, e16762 (2020).
Grové, C. Co-developing a Mental Health and Wellbeing Chatbot With and for Young People. Front. Psychiatry 11, 606041 (2020).
He, Y. et al. Mental health chatbot for young adults with depressive symptoms during the COVID-19 pandemic: single-blind, three-arm randomized controlled trial. J. Med. Internet Res. 24, e40719 (2022).
Høiland, C. G., Følstad, A. & Karahasanovic, A. Hi, can I help? Exploring how to design a mental health chatbot for youths. Hum. Technol. 16, 139–169 (2020).
Jeong, S. et al. Deploying a robotic positive psychology coach to improve college students’ psychological well-being. Use. Model. Use.Adapt. Interact. 33, 571–615 (2023).
Kitt, E. R., Crossman, M. K., Matijczak, A., Burns, G. B. & Kazdin, A. E. Evaluating the role of a socially assistive robot in children’s mental health care. J. Child Fam. Stud. 30, 1722–1735 (2021).
Lappalainen, P. et al. In the shadow of COVID-19: a randomized controlled online ACT trial promoting adolescent psychological flexibility and self-compassion. J. Context. Behav. Sci. 27, 34–44 (2023).
Liu, H. et al. chatbots to provide self-help depression interventions for university students: a randomized trial of effectiveness. Internet Inter. 27, 100495 (2022).
Ludin, N. et al. A chatbot to support young people during the COVID-19 pandemic in New Zealand: evaluation of the real-world rollout of an open trial. J. Med. Internet Res. 24, e38743 (2022).
Russell, J. K., Strodl, E. & Kavanagh, D. Use of a social robot in the implementation of a narrative intervention for young people with cystic fibrosis: a feasibility study. Int. J. Soc. Robot. 13, 1787–1801 (2021).
Trappey, A. J. C., Lin, A. P. C., Hsu, K. Y. K., Trappey, C. V. & Tu, K. L. K. Development of an empathy-centric counseling chatbot system capable of sentimental dialogue analysis. Processes 10, 930 (2022).
Williams, R. et al. 21-day stress detox: open trial of a universal well-being chatbot for young adults. Soc. Sci. 10, 416 (2021).
Crossman, M. K., Kazdin, A. E. & Kitt, E. R. The influence of a socially assistive robot on mood, anxiety, and arousal in children. Prof. Psychol. Res. Pract. 49, 48–56 (2018).
Fitzpatrick, K. K., Darcy, A. & Vierhile, M. Delivering cognitive behavior therapy to young adults with symptoms of depression and anxiety using a fully automated conversational agent (Woebot): a randomized controlled trial. JMIR Ment. Health 4, e19 (2017).
Fulmer, R., Joerin, A., Gentile, B., Lakerink, L. & Rauws, M. Using psychological artificial intelligence (Tess) to relieve symptoms of depression and anxiety: randomized controlled trial. JMIR Ment. Health 5, e9782 (2018).
Greer, S. et al. Use of the Chatbot ‘Vivibot’ to deliver positive psychology skills and promote well-being among young people after cancer treatment: randomized controlled feasibility trial. JMIR MHealth UHealth 7, e15018 (2019).
Okita, S. Y. Self-other’s perspective taking: the use of therapeutic robot companions as social agents for reducing pain and anxiety in pediatric patients. Cyberpsycho. Behav. Soc. Netw. 16, 436–441 (2013).
Global Smartphone Market Analysis and Outlook: Disruption in a Changing Market - PDF Free Download. https://docplayer.net/1973753-Global-smartphone-market-analysis-and-outlook-disruption-in-a-changing-market.html.
Schuetzler, R. M., Giboney, J. S., Grimes, G. M. & Nunamaker, J. F. The influence of conversational agent embodiment and conversational relevance on socially desirable responding. Decis. Support Syst. 114, 94–102 (2018).
Nguyen, H. Examining teenagers’ perceptions of conversational agents in learning settings. In Interaction Design and Children 374–381 (ACM, Braga Portugal,). https://doi.org/10.1145/3501712.3529740 (2022).
Michel, T., Tachtler, F., Slovak, P. & Fitzpatrick, G. Young People’s attitude toward positive psychology interventions: thematic analysis. JMIR Hum. Factors 7, e21145 (2020).
Harte, R. et al. A human-centered design methodology to enhance the usability, human factors, and user experience of connected health systems: a three-phase methodology. JMIR Hum. Factors 4, e8 (2017).
Wight, D., Wimbush, E., Jepson, R. & Doi, L. Six steps in quality intervention development (6SQuID). J. Epidemiol. Community Health 70, 520–525 (2016).
Struthers, A. et al. The acceptability of E-mental health services for children, adolescents, and young adults: a systematic search and review. Can. J. Commun. Ment. Health 34, 1–21 (2015).
Bendig, E., Erb, B., Schulze-Thuesing, L. & Baumeister, H. The next generation: chatbots in clinical psychology and psychotherapy to foster mental health – a scoping review. Verhaltenstherapie 32, 64–76 (2019).
Werner-Seidler, A., Perry, Y., Calear, A. L., Newby, J. M. & Christensen, H. School-based depression and anxiety prevention programs for young people: a systematic review and meta-analysis. Clin. Psychol. Rev. 51, 30–47 (2017).
Bennett, S. D. et al. Practitioner review: unguided and guided self-help interventions for common mental health disorders in children and adolescents: a systematic review and meta-analysis. J. Child Psychol. Psychiatry 60, 828–847 (2019).
Tricco, A. C. et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): checklist and explanation. Ann. Intern. Med. 169, 467–473 (2018).
Garritty, C. et al. Cochrane rapid reviews methods group offers evidence-informed guidance to conduct rapid reviews. J. Clin. Epidemiol. 130, 13–22 (2021).
Acknowledgements
This study was funded by the Ministry of Research and Innovation, Grant no. PN-III-P1-1.1-PD-2021-0538 and by Babeş-Bolyai University, Grant no PDI-PFE ID 550, Contract no 31377/13.03.2023. The funders played no role in study design, data collection, analysis and interpretation of data, or the writing of this manuscript.
Author information
Authors and Affiliations
Contributions
R.B., A.D. and C.R.P. contributed to the conception and design of the study. R.B., A.D. and C.R.P. contributed to the literature search and data extraction. R.B., A.D. and C.R.P. contributed to data analysis and interpretation. R.B drafted the initial manuscript. A.D. and C.R.P. revised the manuscript. R. R.B., A.D. and C.R.P. were responsible for the decision to submit the manuscript. All authors contributed to the critical revision of the manuscript. All authors approved the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Balan, R., Dobrean, A. & Poetar, C.R. Use of automated conversational agents in improving young population mental health: a scoping review. npj Digit. Med. 7, 75 (2024). https://doi.org/10.1038/s41746-024-01072-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41746-024-01072-1