Meta-analyses have shown that digital mental health apps can be efficacious in reducing symptoms of depression and anxiety. However, real-world usage of apps is typically not sustained over time, and no studies systematically examine which features increase sustained engagement with apps or the relationship between engagement features and clinical efficacy. We conducted a systematic search of the literature to identify empirical studies that (1) investigate standalone apps for depression and/or anxiety in symptomatic participants and (2) report at least one measure of engagement. Features intended to increase engagement were categorized using the persuasive system design (PSD) framework and principles of behavioral economics. Twenty-five studies with 4159 participants were included in the analysis. PSD features were commonly used, whereas behavioral economics techniques were not. Smartphone apps were efficacious in treating symptoms of anxiety and depression in randomized controlled trials, with overall small-to-medium effects (g = 0.2888, SE = 0.0999, z(15) = 2.89, p = 0.0119, Q(df = 14) = 41.93, p < 0.0001, I2 = 66.6%), and apps that employed a greater number of engagement features as compared to the control condition had larger effect sizes (β = 0.0450, SE = 0.0164, t(15) = 2.7344, p = 0.0161). We observed an unexpected negative association between PSD features and engagement, as measured by completion rate (β = −0.0293, SE = 0.0121, t(17) = 02.4142, p = 0.0281). Overall, PSD features show promise for augmenting app efficacy, though engagement, as reflected in study completion, may not be the primary factor driving this association. The results suggest that expanding the use of PSD features in mental health apps may increase clinical benefits and that other techniques, such as those informed by behavioral economics, are employed infrequently.
Digital mental health interventions are proliferating rapidly, garnering heightened interest from the public as well as the scientific community. Well over 10,000 mental health apps are now available for download1, and multiple meta-analyses demonstrate that apps are modestly efficacious in treating symptoms of depression and anxiety2,3,4,5. However, an analysis of real-world usage of mental health apps show that user engagement over time is low, with a median 15-day retention of only 3.9%6. Apps with features that increase engagement have generally been associated with greater reductions in depression and anxiety symptoms than apps that lack engagement features5. No study systematically examines which features increase engagement with apps or the relationship between the number of engagement features and clinical efficacy.
Persuasive system design (PSD) is a framework that identifies four mechanisms through which app features increase user engagement: facilitating the primary purpose of the app, promoting user-app interactions, leveraging social relationships, and increasing app credibility7 (Supplementary Table 1). PSD is rooted in research on human-computer interaction and computer-mediated communication and is in part adapted from concepts of persuasive technology7,8. These categories allow for comprehensive and objective consideration of the characteristics of technologies. Specific PSD features predict adherence among internet-based health and lifestyle interventions, which suggests that incorporating PSD features may improve adherence in health apps as well 7.
Behavioral economics is focused on understanding ways that human decision-making differs from what would be expected in traditional economic models of “rational” actors9. A comprehensive list of behavioral economics techniques has not been delineated, but common findings include that people are motivated by avoiding losses10, “starting fresh,”11 committing to action12, and by lotteries13 (Supplementary Table 2). These techniques from behavioral economics can be applied to “gamification,” the use of game-like experiences in non-game services, to further improve user engagement14,15,16,17. Gamification features, including points, badges, levels, and avatars, are used to increase engagement and may enhance apps’ intended effects16. In one head-to-head comparison of a gamified vs non-gamified versions of a smartphone app for anxiety, the gamified app was found to elicit significantly greater engagement, demonstrating the potential of behavioral economics for improving engagement with mental health apps18. PSD, behavioral economics, and gamification often overlap and identify a wide variety of techniques that increase engagement with smartphone apps.
Though smartphone apps show potential for treating depression and anxiety, a barrier to their real-world efficacy may be a lack of sustained user engagement19. Understanding the extent to which the engagement features of PSD techniques, behavioral economics, and/or gamification translate into improvements in engagement and mood outcomes may inform the development and deployment of more efficacious apps for anxiety and depression. We therefore conducted a systematic review of the literature and meta-analysis on smartphone apps for depression and anxiety with the aim of evaluating the impact of persuasive design and behavioral economics techniques on both engagement and clinical outcomes.
The full systematic search retrieved a total of 6287 results. Following the removal of duplicate articles and the later addition of articles cited by included studies, 4143 articles were screened by the title and abstract for relevance. After title and abstract screening, 302 articles were identified as potentially eligible and were screened in full. Full-text screening excluded 277 articles for reasons specified in Fig. 1, which details the full PRISMA search and screening process.
After full text screening, 25 independent studies were found to be eligible for inclusion in the systematic review (Table 1)20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44. Ten studies were non-randomized trials, while 15 were randomized controlled trials. In addition, 3 of the trials were unpublished, including one pre-print and two dissertations. Across all studies, there were 4159 participants total, with 2905 participants using 29 unique smartphone apps. The participants ranged from 13 to 76 years old, with an average age of 32.9. The majority of participants were female (66.6%). The majority of studies required participants to report significant symptoms of major depressive disorder (19) or generalized anxiety disorder (5), and symptoms were most commonly identified by scoring above a predetermined threshold on a validated self-assessment questionnaire.
Control conditions included waitlist, treatment as usual, non-app control intervention, or placebo app control conditions (Supplementary Table 3). Average intervention duration was 5.8 weeks (standard deviation 2.8 weeks), with a range of 2–12 weeks. Most studies accounted by drop out by conducting an intention to treat analysis or by removing missing cases (Supplementary Table 4).
According to the Joanna Briggs Institute critical appraisal checklist for quasi-experimental studies45, among the 10 quasi-experimental studies, 2 were rated to be high quality, 6 were medium quality, and 2 were low quality. According to the Cochrane Collaboration Revised Risk of Bias tool46, among the 15 randomized controlled trials, 0 were judged to have an overall low risk of bias, 8 were judged to raise some concerns for bias, and 7 were judged to be at high risk of bias.
Across all studies, 29 unique smartphone apps were used, including experimental and control apps. The apps offered a variety of content (Supplementary Table 5), the most common being behavioral techniques (76%), cognitive techniques (72%), and psychoeducation (69%). Some apps provided means to track behavior (45%), mood (38%), or thoughts (21%). Relaxation techniques and mindfulness techniques were each used in 31% of apps. Fewer apps tracked physiological parameters such as sleep or exercise (10%) or tracked clinical symptoms of anxiety or depression (7%).
Safety and privacy
Of the 29 apps assessed, 41% were available to the public through the Apple App Store, Google Play, or both (Supplementary Table 7). The remaining apps were accessible to research participants only, or their accessibility was not specified. The majority of apps were free or had free versions available.
Persuasive system design
Overall, apps used an average of 6.5 out of the 28 total features of PSD and 2.5 of the 4 total categories of PSD (Tables 2 and 3). Primary task support was most commonly used, with an average of 3.30 (SD 1.26) out of 7 total features. Social support was least commonly used, with an average of 0.37 (SD 0.93) out of 7 total features.
Table 4 describes the number and percentage of apps that used each PSD feature or category. Of note, features of credibility support were most difficult to evaluate as most articles did not describe these features in detail, and 81.4% of ratings were marked as “unable to assess.” The most frequently used features (reduction − 86.7%, self-monitoring − 70.0%, personalization − 53.3%) all belonged to the primary task support category. The least frequently used features were not used in any of the apps (cooperation, competition, recognition, third-party endorsements, and verifiability) and belonged to the dialogue support category or credibility support category.
Overall, apps used an average of 0.17 out of the 4 total behavioral economics features (Supplementary Table 8). The most commonly used was the pre-commitment pledge, which was used in 4 apps. One app used loss aversion, whereas none of the apps used the fresh start effect or lottery.
Meta-analysis of effect of smartphone apps on depressive or anxiety symptoms
A random-effects meta-analysis of the effect of smartphone apps on depression and anxiety symptoms demonstrated a moderate positive effect size of smartphone apps for reducing pre versus post scores on depression and anxiety questionnaires (g = 0.6217, SE = 0.0669, t(25) = 9.2982, p < 0.001) (Fig. 2). Heterogeneity across studies was substantial (Q(df = 24) = 87.8894, p < 0.0001, I2 = 82.70%). Egger’s regression test indicated significant publication bias (p = 0.0135). A trim-and-fill analysis identified six missing studies (g = 0.5184, SE = 0.0803, z(31) = 6.4589, p < 0.001, Q(df = 30) = 125.7114, p < 0.0001, I2 = 87.57%).
Figure 3 displays the pooled and individual effect sizes of smartphone apps on depression or anxiety when compared to control conditions in randomized controlled trials. A random-effects meta-analysis demonstrated a similar pattern as the pre/post meta-analysis of the full sample albeit with a smaller effect size (g = 0.2888, SE = 0.0999, z(15) = 2.89, p = 0.0119, Q(df = 14) = 41.93, p < 0.0001, I2 = 66.6%). Egger’s regression test indicated no publication bias (p = 0.47091). A trim-and-fill analysis identified one missing study (g = 0.2473, SE = 0.1254, t(16) = 1.97, p = 0.0672, Q(df = 15) = 49.72, p < 0.0001, I2 = 69.8%).
Meta-regression of PSD features and efficacy
Meta-regression analysis demonstrated that the magnitude of the effect size of improvement in mood in active relative to control conditions had a significant positive relationship with the difference in number of PSD features used in the app (β = 0.0450, SE = 0.0164, t(15) = 2.7344, p = 0.0161; Fig. 4), as well as with the difference in number of PSD categories (β = 0.1249, SE = 0.0362, t(15) = 3.4522, p = 0.0039).
Meta-analysis of study completion rate
A random-effects meta-analysis of the study completion rate of app vs control conditions in randomized controlled trials revealed a significant effect, and heterogeneity was moderate to substantial (g = 0.8730, SE = 0.0761, z(17) = −1.79, p = 0.0931, Q(df = 16) = 41.33, p = 0.0005, I2 = 61.3%). Egger’s regression test indicated no publication bias (p = 0.2738). A trim-and-fill analysis identified no missing studies.
Meta-regression of PSD features and study completion rate
The study completion rate in the intervention compared to control condition had a significant negative relationship with the difference in number of PSD features (β = −0.0293, SE = 0.0121, t(17) = 02.4142, p = 0.0281; Fig. 5), as well as with the difference in number of PSD categories (β = −0.0830, SE = 0.0292, t(17) = −2.8472, p = 0.0116).
Removal of outliers did not significantly impact any of the results (Supplementary Note 1).
No direct association was found between efficacy and completion rate (g = −0.2134, SE = 0.1442, t(15) = −1.4806, p = 0.1609). Efficacy also did not meaningfully change the association between completion rate and PSD features (β = −0.0649, SE = 0.1434, p = 0.6588, F(df1 = 2, df2 = 12) = 0.8320, p = 0.4588). Including a regressor for the type of control condition (waitlist vs. active control) also did not meaningfully change the association between PSD features and study completion rate (β = −0.1900, SE = 0.1254, t(17) = −1.5150, p = 0.1520). Including a regressor for initial symptom severity did not meaningfully change the association between PSD features and study completion rate (β = −0.0949, SE = 0.0820, t(16) = −1.1566, p = 0.2682).
The principal findings of this meta-analysis indicate that standalone smartphone apps for depression and anxiety are efficacious, with modest overall effects in randomized controlled trials (g = 0.2888). The efficacy findings are consistent with those shown in previous meta-analyses2,3,4,5. This meta-analysis is the first to demonstrate that apps that use a greater number of engagement features have larger clinical effects. Thus, the findings extend the extant literature by demonstrating the importance of engagement techniques to improve efficacy of app-based interventions for depression and anxiety. These findings can inform development and implementation of future apps.
A lack of engagement with mental health apps has been reported as a key limitation for realizing the potential of apps to broadly disseminate mental health treatments47. Apps that include guidance (i.e., from a therapist or coach) have generally been found to have larger effects than standalone interventions5,19,48. Given that standalone mental health apps require fewer resources than guided apps, identifying factors that might increase engagement and efficacy without therapist involvement could greatly improve dissemination efforts. “Engagement features” such as those elements of PSD have been found to increase engagement with mental health apps as measured by duration of use and completion of interventions7,49. Our review indicates that engagement features also are associated with increased clinical efficacy and therefore should be included in standalone mental health apps to increase their impact for broader dissemination.
The number of engagement features used in a given app had a significant negative association with study completion rate. The negative association between PSD features and engagement as measured by completion rate was contrary to our a priori hypothesis that PSD features would be associated with increased app completion and thereby increase efficacy. One possible interpretation of the negative association between PSD features and completion rates is that participants using apps with more PSD features may use the app more frequently, benefit more quickly, and lose motivation to continue using the app once their symptoms have improved compared to participants using less engaging apps; however, our exploratory analyses suggest that this is not the case. Additionally, we found that these results were not driven by the type of control condition or initial symptom severity. In subsequent studies, analyses of interim symptom change could help to better understand the association between PSD features and completion rates. Additionally, while completion rate was the most readily available measure of engagement, other engagement outcomes such as minutes spent using the app or self-reported user experience of the usefulness, usability, and satisfaction with the app50 might yield more meaningful information about PSD features and engagement.
We found that some PSD features were incorporated into all apps (e.g., primary task support), while others were less commonly used (e.g., social support), and behavioral economics features such as pre-commitment pledges, loss aversion, and lotteries, were rarely incorporated. The reason may be, in part, because of the historical roots of these literatures. PSD is well-established as an approach to development of new technologies and therefore may be more naturally incorporated into app development7,8. Behavioral economics has traditionally focused on economic decision-making51, then was applied to behavioral health52 but has only more recently been used in the context of mental health interventions53. The results of our meta-analysis suggest that increasing the number of engagement techniques may be beneficial, and so it is likely that designers may need to draw from a range of engagement approaches to maximize the impact of app-based interventions. Furthermore, elements such as social support, which span both the PSD and behavioral economics literatures, may be particularly promising6. Incorporating social support may also be the mostly likely way to engage patients without the use of therapists or coaches and, in itself, is a predictor of mental health outcomes 54.
There are several important limitations of the current review. The literature on mental health apps is heterogenous, and a number of studies included samples without psychiatric diagnoses and/or were used in combination with other interventions to varying degrees. Clearer standards for reporting on app content in research studies could help subsequent syntheses of the literature. In our review, all available information about app content, design elements, privacy and security features, and accessibility were extracted from descriptions and screenshots of the apps in the papers, appendices and supplementary materials. However, additional features may have been included in apps that were not described in these materials. Another limitation is the use of completion rate as the primary metric of engagement. While this approach has been used previously in mental health app engagement literature19 and is an accessible metric because it is reported in most studies, it may be a crude measure of engagement. The number of minutes spent using the app, number of logins, or number of modules completed (if applicable) may measure engagement more meaningfully55. Furthermore, the quality of engagement may be more likely to affect clinical outcomes as much as or even more than the duration of engagement. The issue of defining meaningful engagement has been gaining attention and will likely influence the next steps in measuring app engagement 56.
Overall, smartphone apps appear to be efficacious in decreasing anxiety and depression symptoms. The use of persuasive design features is associated with larger effect sizes and may be useful in increasing clinical efficacy. Relatively few mental health apps used non-PSD behavioral economics techniques, which have shown promise for increasing outcomes in app-based interventions for smoking cessation57, increasing physical activity for obese patients58, and adherence to diabetes medication59. Incorporating additional PSD and behavioral economics techniques that have been rarely used, but have shown promise (e.g., social support7) may further improve outcomes in mental health treatments.
This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement for transparent, comprehensive reporting of methodology and results60. The review adhered to a strict protocol submitted for registration with PROSPERO on April 1, 2020 (ID 171194) before the start of analyses. The protocol review was ongoing as of December 4, 2020 (CRD42020171194): https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42020171194.
A systematic search was performed of the following databases: Ovid Medline (ALL − 1946 to present); Ovid Embase (1974 to present); The Cochrane Library (Wiley); the Cumulative Index to Nursing and Allied Health Literature (CINAHL; EBSCO interface); PsycINFO (EBSCO); PsycArticle (EBSCO); and Ovid Allied and Complementary Medicine (AMED). Since the focus of the study was on published research trials conducted on the apps, searches were not carried out directly in the Google Play or the Apple App Store. Our search terms included combinations, truncations, and synonyms of “depression,” “social anxiety,” or “anxiety,” in combination with “mobile application” or “smartphone.” The search terms were adapted for use with all chosen bibliographic databases. Additional records were retrieved by checking the bibliographies of studies that met inclusion criteria. No language, article type, or publication date restrictions were used. Searches were completed in March and April 2020.
Included articles were English-language empirical studies published in peer-reviewed publications, conference presentations, dissertations, and gray literature/unpublished work. The intervention was required to be delivered by smartphone app alone and to be delivered over time with the purpose of treating depression and/or anxiety (major depressive disorder, dysthymia, other depressive disorder, generalized anxiety disorder, and/or social anxiety disorder). Study participants could be any age but needed to identify as having depression and/or anxiety, either through clinical diagnosis or self-endorsement. To be included, studies had to report at least one measure of participant engagement (i.e., attrition, adherence, engagement, dropout).
Studies were excluded if: a) the intervention was delivered in part or entirely by a non-smartphone method; b) the study population was required to have a specific medical condition, other DSM-5 diagnosis, no diagnosis (healthy), or unspecified diagnosis, unless the study included a subgroup analysis of participants with depression and/or anxiety only; and c) adequate information regarding intervention characteristics and/or engagement outcomes could not be obtained from the paper or from contacting the study authors.
Data collection and analysis
Two of three raters (E.B., J.B., or A.W.) independently completed each of the following steps, and conflicts were resolved by another rater (M.S.).
Selection of articles
Titles and abstracts were evaluated against the inclusion and exclusion criteria. Studies that were deemed potentially relevant were next evaluated by their full text. Reasons for exclusion were recorded. When more than one publication of a study was found, the study with the most information was used for data analysis.
Data extracted included study identifiers, study characteristics, intervention characteristics, techniques to increase engagement, engagement measures, efficacy measures, app content, app modality, accessory interventions, and usability features. Techniques to increase engagement are described further in Supplementary Information. Only data that were explicitly reported in the publications were extracted. App profiles in the Google Play or Apple App Store were not accessed as we could not verify that features identified in the app profile were available in the version of the app referenced in the paper.
Study Identifiers: the first author’s last name and the publication year.
Study Characteristics: the population, comparator (if applicable), outcome, and study design. Participant demographics such as age and gender.
Intervention Characteristics: the intervention or cohort name, number of trial arms, primary condition, duration of intervention, duration of follow-up, modules to be completed, automated or guided delivery, format of delivery, and outcome measures.
Techniques to Increase Engagement: components of PSD as outlined in Kelders et al.7 include: primary task support (reduction, tunneling, tailoring, personalization, self-monitoring, simulation, and rehearsal), dialogue support (praise, rewards, reminders, suggestion, similarity, liking, and social role), and credibility support (trustworthiness, expertise, surface credibility, real-world feel, authority, third-party endorsements, and verifiability). Non-PSD behavioral economics techniques included loss aversion, fresh start effect, pre-commitment pledges, and financial incentives including lotteries. Each component or technique had a predefined definition and guideline for being coded as present. Additional information can be found in Supplementary Tables 1 and 2.
Measures of Engagement: adherence to study protocol, completion rate, amount of time spent in app, modules completed, features or exercises accessed, duration of use of app, or other. All available measures of engagement were recorded for each study.
Measures of Efficacy: change in score on a validated scale for depression or anxiety.
App Content: app content was categorized based on categories identified in previously published systematic evaluations of publicly available mental health apps6,61. App content included psychoeducation, cognitive techniques, behavioral techniques, mindfulness, relaxation, mood expression, tracking of mood patterns, tracking of thought patterns, tracking of behavior, tracking of symptoms, and tracking of physiological parameters.
App Modality: information on app modality was recorded when available. App modalities included text, graphs, photos, illustrations, videos/films, and others.
Safety, Privacy, and Accessibility Features: integration and accessibility features such as cost, availability to the public, safety features, and privacy protection features. These features were adapted from a framework for evaluating mobile mental health apps62.
At the title and abstract screening stage, inter-rater agreement was 93.8%. At the full-text screening stage, inter-rater agreement was 90.1%. At the engagement feature rating stage, inter-rater agreement was 82.6%. Disagreements were resolved by a third rater (M.S.) at the screening stages and by consensus at the feature rating stage.
Assessment of risk of bias
Non-randomized experimental studies were evaluated based on the Joanna Briggs Institute critical appraisal checklist for quasi-experimental studies to assess quality45. Because the Joanna Briggs Institute checklist does not assign specific ratings (“high,” “medium,” or “low”) to studies based on the number of criteria met, we adapted a rating system used in previously published systematic evaluations of mental health apps6,61. The methodological quality of each study was rated as: high (7–9 criteria have been fulfilled), medium (4–6 criteria have been fulfilled), or low (0–3 criteria have been fulfilled).
The Cochrane Collaboration Revised Risk of Bias tool was applied to randomized controlled trials46. Each RCT was assessed against five bias domains in order to produce a summary risk of bias assessment score for each domain and overall (low risk, some concerns, or high risk of bias) 46.
A meta-analysis was conducted to calculate a pooled effect size of treatment efficacy. To test pre- and post-test change scores on anxiety and depression questionnaires, we applied a random-effects model to account for the expected heterogeneity among the studies, given the variety of types of apps, duration of intervention, and other factors. If a study used multiple measures of anxiety and/or depression symptoms, then the instrument most frequently used among all studies was used to calculate efficacy so as to reduce heterogeneity. If an app was used in multiple independent studies, it was included multiple times. In supplementary analyses, we removed outliers, defined as studies with a 95% confidence interval of the effect size that did not overlap with the 95% confidence interval of the pooled effect size63. We compared analyses with and without outliers to determine whether the presence of outliers changed the observed results. Risk of publication bias was examined using Eggers’ regression, and Duval and Tweedie’s trim-and-fill analysis was used to re-calculate pooled dropout rates after statistically accounting for any studies that may have introduced publication bias64. The degree of statistical heterogeneity was quantified using I² values, with <25% representing low heterogeneity, 25–50% for low-to-moderate heterogeneity, 50–75% for moderate-to-substantial heterogeneity, and >75% for substantial heterogeneity63. τ2 was calculated using the Hartung–Knapp–Sidik–Jonkman (HKSJ) method, which is appropriate when there is a large degree of heterogeneity between studies.
A second meta-analysis was run to examine efficacy of apps compared to control conditions in randomized controlled trials. A random-effects model was again used given the expected heterogeneity among the studies. Efficacy was calculated as the pre/post change on a depression or anxiety scale. When a study had multiple control conditions, the active control condition was selected, as opposed to a waitlist or treatment-as-usual control. If an app was used in multiple studies, it was included multiple times. If a study used multiple apps, both apps were included if there was a non-app control condition. Otherwise, the intervention app was compared to the control app. To examine whether risk of bias score affected the efficacy, a subgroup analysis stratifying studies by risk of bias score was performed (Supplementary Note 1). Independent mixed-effects meta-regression models were run to examine how app efficacy was affected by the number of PSD features used in the app compared to the control condition, the number of PSD categories used in the app compared to the control condition, and the study completion rate of app users compared to the control condition. As the number of PSD features or categories denoted levels as opposed to continuous variables, the intercept was set to “false.”
A third meta-analysis examined the study completion rate of apps compared to control conditions in randomized controlled trials. Study completion rate was calculated as the proportion of enrolled and/or randomized study participants who remained in the study at the time the intervention ended. A random-effects model was again used given the expected heterogeneity among the studies. Procedures were otherwise the same as for the second meta-analysis and the meta-regression. In addition, in order to further explore the association between PSD features and completion rates, exploratory control analyses were conducted. An independent mixed-effects meta-regression model was run to examine how app efficacy affected study completion rate. Multiple meta-regressions were conducted to examine the relationship of various covariates to study completion rate: PSD features and efficacy, PSD features and type of control condition (waitlist vs. active control), and PSD features and initial symptom severity. Initial symptom severity was rated numerically based on the interpretation of the instrument used, with minimal or none rated as 0, mild as 1, moderate as 2, moderately severe as 3, and severe as 4. All analyses were conducted in R.
The data used in this study are publicly available. The extracted data are available in the following github repository: https://github.com/ashlwu/SmartphoneAppsDepressionAnxiety. Of note, the data were analyzed in.csv format but have been converted to.xlsx format to allow the data to be password protected. The password is “Engage.”
Meta-analyses and meta-regressions were conducted using the Meta, Metafor, and Dmetar packages in R65,66,67. The code is publicly available in the following github repository: https://github.com/ashlwu/SmartphoneAppsDepressionAnxiety.
Torous, J. & Roberts, L. W. Needed innovation in digital health and smartphone applications for mental health: transparency and trust. JAMA Psychiatry 74, 437–438 (2017).
Weisel, K. K. et al. Standalone smartphone apps for mental health-a systematic review and meta-analysis. npj Digital Med. 2, 118 (2019).
Firth, J. et al. Can smartphone mental health interventions reduce symptoms of anxiety? A meta-analysis of randomized controlled trials. J. Affect. Disord. 218, 15–22 (2017).
Firth, J. et al. The efficacy of smartphone-based mental health interventions for depressive symptoms: a meta-analysis of randomized controlled trials. World Psychiatry 16, 287–298 (2017).
Linardon, J., Cuijpers, P., Carlbring, P., Messer, M. & Fuller-Tyszkiewicz, M. The efficacy of app-supported smartphone interventions for mental health problems: a meta-analysis of randomized controlled trials. World Psychiatry 18, 325–336 (2019).
Baumel, A., Muench, F., Edan, S. & Kane, J. M. Objective user engagement with mental health apps: systematic search and panel-based usage analysis. J. Med. Internet Res. 21, e14567 (2019).
Kelders, S. M., Kok, R. N., Ossebaard, H. C. & Van Gemert-Pijnen, J. E. W. C. Persuasive system design does matter: a systematic review of adherence to web-based interventions. J. Med. Internet Res. 14, e152 (2012).
Oinas-Kukkonen, H. & Harjumaa, M. Persuasive systems design: key issues, process model, and system features. CAIS 24, 485–500 (2009).
Kahneman, D. Maps of bounded rationality: psychology for behavioral economics. Am. Econ. Rev. 93, 1449–1475 (2003).
Patel, M. S., Asch, D. A. & Volpp, K. G. Framing financial incentives to increase physical activity among overweight and obese adults. Ann. Intern. Med. 165, 600 (2016).
Dai, H., Milkman, K. L. & Riis, J. The fresh start effect: temporal landmarks motivate aspirational behavior. Manag. Sci. 60, 2563–2582 (2014).
Rogers, T., Milkman, K. L. & Volpp, K. G. Commitment devices: using initiatives to change behavior. JAMA 311, 2065–2066 (2014).
Patel, M. S. et al. A randomized, controlled trial of lottery-based financial incentives to increase physical activity among overweight and obese adults. Am. J. Health Promot 32, 1568–1575 (2018).
Huotari, K. & Hamari, J. Defining gamification: a service marketing perspective. in Proceeding of the 16th International Academic MindTrek Conference on - MindTrek ’12 17 (ACM Press, 2012). https://doi.org/10.1145/2393132.2393137.
Kelders, S. M., Sommers-Spijkerman, M. & Goldberg, J. Investigating the direct impact of a gamified versus nongamified well-being intervention: an exploratory experiment. J. Med. Internet Res. 20, e247 (2018).
Cheng, V. W. S., Davenport, T., Johnson, D., Vella, K. & Hickie, I. B. Gamification in apps and technologies for improving mental health and well-being: systematic review. JMIR Ment. Health 6, e13717 (2019).
Cotton, V. & Patel, M. S. Gamification use and design in popular health and fitness mobile applications. Am. J. Health Promot 33, 448–451 (2019).
Pramana, G. et al. Using mobile health gamification to facilitate cognitive behavioral therapy skills practice in child anxiety treatment: open clinical trial. JMIR Serious Games 6, e9 (2018).
Torous, J., Lipschitz, J., Ng, M. & Firth, J. Dropout rates in clinical trials of smartphone apps for depressive symptoms: a systematic review and meta-analysis. J. Affect. Disord. 263, 413–419 (2020).
Anguera, J. A., Jordan, J. T., Castaneda, D., Gazzaley, A. & Areán, P. A. Conducting a fully mobile and randomised clinical trial for depression: access, engagement and expense. BMJ Innov. 2, 14–21 (2016).
Bakker, D. & Rickard, N. Engagement with a cognitive behavioural therapy mobile phone app predicts changes in mental health and wellbeing: MoodMission. Aust. Psychol. 54, 245–260 (2019).
Ben-Zeev, D. et al. mHealth for schizophrenia: patient engagement with a mobile phone intervention following hospital discharge. JMIR Ment. Health 3, e34 (2016).
Bustillos, D. The effects of using pacifica on depressed patients. https://search.proquest.com/docview/2172401714?pq-origsite=primo (2018).
Caplan, S., Sosa Lovera, A. & Reyna Liberato, P. A feasibility study of a mental health mobile app in the Dominican Republic: the untold story. Int. J. Ment. Health 47, 311–345 (2018).
Chen, A. T., Wu, S., Tomasino, K. N., Lattie, E. G. & Mohr, D. C. A multi-faceted approach to characterizing user behavior and experience in a digital mental health intervention. J. Biomed. Inform. 94, 103187 (2019).
Dahne, J. et al. Pilot randomized trial of a self-help behavioral activation mobile app for utilization in primary care. Behav. Ther. 50, 817–827 (2019).
Dahne, J. et al. Pilot randomized controlled trial of a Spanish-language Behavioral Activation mobile app (¡Aptívate!) for the treatment of depressive symptoms among united states Latinx adults with limited English proficiency. J. Affect. Disord. 250, 210–217 (2019).
Enock, P. M., Hofmann, S. G. & McNally, R. J. Attention bias modification training via smartphone to reduce social anxiety: a randomized, controlled multi-session experiment. Cogn. Ther. Res. 38, 200–216 (2014).
Hur, J.-W., Kim, B., Park, D. & Choi, S.-W. A scenario-based cognitive behavioral therapy mobile app to reduce dysfunctional beliefs in individuals with depression: a randomized controlled trial. Telemed. J. E Health 24, 710–716 (2018).
Inkster, B., Sarda, S. & Subramanian, V. An empathy-driven, conversational artificial intelligence agent (Wysa) for digital mental well-being: real-world data evaluation mixed-methods study. JMIR Mhealth Uhealth 6, e12106 (2018).
Lüdtke, T., Pult, L. K., Schröder, J., Moritz, S. & Bücker, L. A randomized controlled trial on a smartphone self-help application (Be Good to Yourself) to reduce depressive symptoms. Psychiatry Res 269, 753–762 (2018).
Lukas, C. A., Eskofier, B. & Berking, M. A gamified smartphone-based intervention for depression: results from a randomized controlled pilot trial. Preprint https://doi.org/10.2196/preprints.16643 (2019).
Ly, K. H. et al. Behavioural activation versus mindfulness-based guided self-help treatment administered through a smartphone application: a randomised controlled trial. BMJ Open 4, e003440 (2014).
Mehrotra, S., Sudhir, P., Rao, G., Thirthalli, J. & Srikanth, T. K. Development and pilot testing of an internet-based self-help intervention for depression for indian users. Behav. Sci. 8, 36 (2018).
Moberg, C., Niles, A. & Beermann, D. Guided self-help works: randomized waitlist controlled trial of pacifica, a mobile app integrating cognitive behavioral therapy and mindfulness for stress, anxiety, and depression. J. Med. Internet Res. 21, e12556 (2019).
Mohr, D. C. et al. Comparison of the effects of coaching and receipt of app recommendations on depression, anxiety, and engagement in the intellicare platform: factorial randomized controlled trial. J. Med. Internet Res. 21, e13609 (2019).
Norton, V. P. The CALM project: teaching mindfulness meditation in primary care using computer-based application (ProQuest, 2017).
Pratap, A. et al. Using mobile apps to assess and treat depression in hispanic and latino populations: fully remote randomized clinical trial. J. Med. Internet Res. 20, e10130 (2018).
Roepke, A. M. et al. Randomized controlled trial of superbetter, a smartphone-based/internet-based self-help tool to reduce depressive symptoms. Games Health J. 4, 235–246 (2015).
Stiles-Shields, C., Montague, E., Kwasny, M. J. & Mohr, D. C. Behavioral and cognitive intervention strategies delivered via coached apps for depression: pilot trial. Psychol. Serv. 16, 233–238 (2019).
Stolz, T. et al. A mobile app for social anxiety disorder: a three-arm randomized controlled trial comparing mobile and PC-based guided self-help interventions. J. Consult. Clin. Psychol. 86, 493–504 (2018).
Wahle, F., Kowatsch, T., Fleisch, E., Rufer, M. & Weidt, S. Mobile sensing and support for people with depression: a pilot trial in the wild. JMIR Mhealth Uhealth 4, e111 (2016).
Watts, S. et al. CBT for depression: a pilot RCT comparing mobile phone vs. computer. BMC Psychiatry 13, 49 (2013).
Lim, M. H. et al. A pilot digital intervention targeting loneliness in youth mental health. Front. Psychiatry 10, 604 (2019).
Aromataris, E. & Munn, Z. Joanna Briggs Institute reviewer’s manual. The Joanna Briggs Institute https://reviewersmanual.joannabriggs.org (2017).
Sterne, J. A. C. et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ 366, l4898 (2019).
Torous, J., Nicholas, J., Larsen, M. E., Firth, J. & Christensen, H. Clinical review of user engagement with mental health smartphone apps: evidence, theory and improvements. Evid. Based Ment. Health 21, 116–119 (2018).
Baumeister, H., Reichler, L., Munzinger, M. & Lin, J. The impact of guidance on Internet-based mental health interventions — a systematic review. Internet Interv. 1, 205–215 (2014).
Baumel, A. & Yom-Tov, E. Predicting user adherence to behavioral eHealth interventions in the real world: examining which aspects of intervention design matter most. Transl. Behav. Med. 8, 793–798 (2018).
Graham, A. K., Lattie, E. G. & Mohr, D. C. Experimental therapeutics for digital mental health. JAMA Psychiatry https://doi.org/10.1001/jamapsychiatry.2019.2075 (2019).
Kahneman, D. A psychological perspective on economics. Am. Econ. Rev. 93, 162–168 (2003).
Vlaev, I., King, D., Darzi, A. & Dolan, P. Changing health behaviors using financial incentives: a review from behavioral economics. BMC Public Health 19, 1059 (2019).
Beidas, R. S. et al. Transforming mental health delivery through behavioral economics and implementation science: protocol for three exploratory projects. JMIR Res Protoc. 8, e12121 (2019).
Wang, J., Mann, F., Lloyd-Evans, B., Ma, R. & Johnson, S. Associations between loneliness and perceived social support and outcomes of mental health problems: a systematic review. BMC Psychiatry 18, 156 (2018).
Donkin, L. et al. A systematic review of the impact of adherence on the effectiveness of e-therapies. J. Med. Internet Res. 13, e52 (2011).
Torous, J., Michalak, E. E. & O’Brien, H. L. Digital health and engagement-looking behind the measures and methods. JAMA Netw. Open 3, e2010918 (2020).
Volpp, K. G. et al. A randomized, controlled trial of financial incentives for smoking cessation. N. Engl. J. Med. 360, 699–709 (2009).
Patel, M. S. et al. Effectiveness of behaviorally designed gamification interventions with social incentives for increasing physical activity among overweight and obese adults across the united states: the STEP UP randomized clinical trial. JAMA Intern. Med. 1–9, https://doi.org/10.1001/jamainternmed.2019.3505 (2019).
Wong, C. A. et al. Effect of financial incentives on glucose monitoring adherence and glycemic control among adolescents and young adults with type 1 diabetes: a randomized clinical trial. JAMA Pediatr. 171, 1176–1183 (2017).
Moher, D., Liberati, A., Tetzlaff, J. & Altman, D. G. PRISMA Group Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. BMJ 339, b2535 (2009).
Qu, C., Sas, C., Daudén Roquet, C. & Doherty, G. Functionality of top-rated mobile apps for depression: systematic search and evaluation. JMIR Ment. Health 7, e15321 (2020).
Chan, S., Torous, J., Hinton, L. & Yellowlees, P. Towards a framework for evaluating mobile mental health apps. Telemed. J. E Health 21, 1038–1041 (2015).
Higgins, J. P. T., Thompson, S. G., Deeks, J. J. & Altman, D. G. Measuring inconsistency in meta-analyses. BMJ 327, 557–560 (2003).
Duval, S. & Tweedie, R. Trim and fill: a simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis. Biometrics 56, 455–463 (2000).
Viechtbauer, W. Conducting meta-analyses in R with the metafor package. J Stat Softw 36, 1–48 (2010).
Harrer, M., Cuijpers, P., Furukawa, T. & Ebert, D. Doing meta-analysis in R: a hands-on guide. Zenodo https://doi.org/10.5281/zenodo.2551803 (2019).
Balduzzi, S., Rücker, G. & Schwarzer, G. How to perform a meta-analysis with R: a practical tutorial. Evid. Based Ment. Health 22, 153–160 (2019).
This study was supported by the Weill Cornell Medicine Children’s Health Council Investigator Fund, the Pritzker Consortium, the Khoury Foundation, the Paul and Jenna Segal Family Foundation, and the Saks Fifth Avenue Foundation. A.W. conducted this research as part of the Areas of Concentration (AOC) Program of the Weill Cornell MD curriculum. The authors would like to thank Michelle Demetres for suggestions on crafting the initial systematic search criteria.
The authors declare no competing interests. The Weill Cornell Department of Psychiatry has developed an app for the treatment of anxiety in youth. The app is not commercially available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Wu, A., Scult, M.A., Barnes, E.D. et al. Smartphone apps for depression and anxiety: a systematic review and meta-analysis of techniques to increase engagement. npj Digit. Med. 4, 20 (2021). https://doi.org/10.1038/s41746-021-00386-8