Introduction

Digital mental health interventions are proliferating rapidly, garnering heightened interest from the public as well as the scientific community. Well over 10,000 mental health apps are now available for download1, and multiple meta-analyses demonstrate that apps are modestly efficacious in treating symptoms of depression and anxiety2,3,4,5. However, an analysis of real-world usage of mental health apps shows that user engagement over time is low, with a median 15-day retention of only 3.9%6. Apps with features that increase engagement have generally been associated with greater reductions in depression and anxiety symptoms than apps that lack engagement features5. Yet no study has systematically examined which features increase engagement with apps or the relationship between the number of engagement features and clinical efficacy.

Persuasive system design (PSD) is a framework that identifies four mechanisms through which app features increase user engagement: facilitating the primary purpose of the app, promoting user-app interactions, leveraging social relationships, and increasing app credibility7 (Supplementary Table 1). PSD is rooted in research on human-computer interaction and computer-mediated communication and is in part adapted from concepts of persuasive technology7,8. These categories allow for comprehensive and objective consideration of the characteristics of technologies. Specific PSD features predict adherence among internet-based health and lifestyle interventions, suggesting that incorporating PSD features may improve adherence in health apps as well7.

Behavioral economics focuses on understanding ways that human decision-making differs from what would be expected in traditional economic models of “rational” actors9. A comprehensive list of behavioral economics techniques has not been delineated, but common findings include that people are motivated by avoiding losses10, “starting fresh”11, committing to action12, and by lotteries13 (Supplementary Table 2). These techniques from behavioral economics can be applied to “gamification,” the use of game-like experiences in non-game services, to further improve user engagement14,15,16,17. Gamification features, including points, badges, levels, and avatars, are used to increase engagement and may enhance apps’ intended effects16. In one head-to-head comparison of gamified and non-gamified versions of a smartphone app for anxiety, the gamified app elicited significantly greater engagement, demonstrating the potential of behavioral economics for improving engagement with mental health apps18. PSD, behavioral economics, and gamification often overlap and identify a wide variety of techniques that increase engagement with smartphone apps.

Though smartphone apps show potential for treating depression and anxiety, a barrier to their real-world efficacy may be a lack of sustained user engagement19. Understanding the extent to which the engagement features of PSD techniques, behavioral economics, and/or gamification translate into improvements in engagement and mood outcomes may inform the development and deployment of more efficacious apps for anxiety and depression. We therefore conducted a systematic review of the literature and meta-analysis on smartphone apps for depression and anxiety with the aim of evaluating the impact of persuasive design and behavioral economics techniques on both engagement and clinical outcomes.

Results

Included studies

The full systematic search retrieved a total of 6287 results. Following the removal of duplicate articles and the later addition of articles cited by included studies, 4143 articles were screened by title and abstract for relevance. After title and abstract screening, 302 articles were identified as potentially eligible and were screened in full. Full-text screening excluded 277 articles for reasons specified in Fig. 1, which details the full PRISMA search and screening process.

Fig. 1: PRISMA flowchart.

Flow diagram of article selection process.

After full-text screening, 25 independent studies were found to be eligible for inclusion in the systematic review (Table 1)20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44. Ten studies were non-randomized trials, and 15 were randomized controlled trials. Three of the trials were unpublished, comprising one preprint and two dissertations. Across all studies, there were 4159 participants in total, with 2905 participants using 29 unique smartphone apps. Participants ranged from 13 to 76 years old, with an average age of 32.9 years. The majority of participants were female (66.6%). Most studies required participants to report significant symptoms of major depressive disorder (19 studies) or generalized anxiety disorder (5 studies), and symptoms were most commonly identified by scoring above a predetermined threshold on a validated self-assessment questionnaire.

Table 1 Study characteristics.

Control conditions included waitlist, treatment as usual, non-app control interventions, or placebo app controls (Supplementary Table 3). The average intervention duration was 5.8 weeks (standard deviation 2.8 weeks), with a range of 2–12 weeks. Most studies accounted for dropout by conducting an intention-to-treat analysis or by removing missing cases (Supplementary Table 4).

Quality assessment

According to the Joanna Briggs Institute critical appraisal checklist for quasi-experimental studies45, among the 10 quasi-experimental studies, 2 were rated to be high quality, 6 were medium quality, and 2 were low quality. According to the Cochrane Collaboration Revised Risk of Bias tool46, among the 15 randomized controlled trials, 0 were judged to have an overall low risk of bias, 8 were judged to raise some concerns for bias, and 7 were judged to be at high risk of bias.

App content

Across all studies, 29 unique smartphone apps were used, including experimental and control apps. The apps offered a variety of content (Supplementary Table 5), the most common being behavioral techniques (76%), cognitive techniques (72%), and psychoeducation (69%). Some apps provided means to track behavior (45%), mood (38%), or thoughts (21%). Relaxation techniques and mindfulness techniques were each used in 31% of apps. Fewer apps tracked physiological parameters such as sleep or exercise (10%) or tracked clinical symptoms of anxiety or depression (7%).

Safety and privacy

Safety features such as assessing for suicidality and providing resources for suicidality were reported in a minority of apps (14% and 17%, respectively). A privacy policy was reported to be available for 20% of apps. Additional privacy and security features such as encryption and password protection were reported for 27% of apps (Supplementary Table 6).

Accessibility

Of the 29 apps assessed, 41% were available to the public through the Apple App Store, Google Play, or both (Supplementary Table 7). The remaining apps were accessible to research participants only, or their accessibility was not specified. The majority of apps were free or had free versions available.

Persuasive system design

Overall, apps used an average of 6.5 out of the 28 total features of PSD and 2.5 of the 4 total categories of PSD (Tables 2 and 3). Primary task support was most commonly used, with an average of 3.30 (SD 1.26) out of 7 total features. Social support was least commonly used, with an average of 0.37 (SD 0.93) out of 7 total features.

Table 2 Components of persuasive design.
Table 3 Average persuasive system design features and categories.

Table 4 describes the number and percentage of apps that used each PSD feature or category. Of note, features of credibility support were most difficult to evaluate, as most articles did not describe these features in detail, and 81.4% of ratings were marked as “unable to assess.” The most frequently used features (reduction, 86.7%; self-monitoring, 70.0%; personalization, 53.3%) all belonged to the primary task support category. The least frequently used features were not used in any of the apps (cooperation, competition, recognition, third-party endorsements, and verifiability) and belonged to the social support or credibility support categories.

Table 4 Frequency of PSD features.

Behavioral economics

Overall, apps used an average of 0.17 out of the 4 total behavioral economics features (Supplementary Table 8). The most commonly used was the pre-commitment pledge, which was used in 4 apps. One app used loss aversion, whereas none of the apps used the fresh start effect or lottery.

Meta-analysis of effect of smartphone apps on depressive or anxiety symptoms

A random-effects meta-analysis of the effect of smartphone apps on depression and anxiety symptoms demonstrated a moderate positive effect of smartphone apps in reducing scores on depression and anxiety questionnaires from pre- to post-intervention (g = 0.6217, SE = 0.0669, t(25) = 9.2982, p < 0.001) (Fig. 2). Heterogeneity across studies was substantial (Q(df = 24) = 87.8894, p < 0.0001, I2 = 82.70%). Egger’s regression test indicated significant publication bias (p = 0.0135). A trim-and-fill analysis identified six missing studies (g = 0.5184, SE = 0.0803, z(31) = 6.4589, p < 0.001, Q(df = 30) = 125.7114, p < 0.0001, I2 = 87.57%).

Fig. 2: Random effects meta-analysis of app efficacy comparing pre-intervention to post-intervention change in anxiety or depression symptoms.

Forest plot of random effects meta-analysis of app efficacy comparing pre-intervention to post-intervention change in anxiety or depression symptoms.

Figure 3 displays the pooled and individual effect sizes of smartphone apps on depression or anxiety when compared to control conditions in randomized controlled trials. A random-effects meta-analysis demonstrated a similar pattern to the pre/post meta-analysis of the full sample, albeit with a smaller effect size (g = 0.2888, SE = 0.0999, z(15) = 2.89, p = 0.0119, Q(df = 14) = 41.93, p < 0.0001, I2 = 66.6%). Egger’s regression test indicated no significant publication bias (p = 0.47091). A trim-and-fill analysis identified one missing study (g = 0.2473, SE = 0.1254, t(16) = 1.97, p = 0.0672, Q(df = 15) = 49.72, p < 0.0001, I2 = 69.8%).

Fig. 3: Random effects meta-analysis in RCTs of app efficacy comparing change in anxiety or depression symptoms in apps vs. control.

Forest plot of random effects meta-analysis in RCTs of app efficacy comparing change in anxiety or depression symptoms in apps vs. control.

Meta-regression of PSD features and efficacy

Meta-regression analysis demonstrated that the magnitude of the effect size of mood improvement in active relative to control conditions had a significant positive relationship with the difference in the number of PSD features between the app and the control condition (β = 0.0450, SE = 0.0164, t(15) = 2.7344, p = 0.0161; Fig. 4), as well as with the difference in the number of PSD categories (β = 0.1249, SE = 0.0362, t(15) = 3.4522, p = 0.0039).

Fig. 4: Mixed effects meta-regression analysis of difference in PSD features and efficacy for app vs control.

Bubble plot of mixed effects meta-regression analysis of difference in PSD features and efficacy for app vs control.

Meta-analysis of study completion rate

A random-effects meta-analysis of the study completion rate of app vs control conditions in randomized controlled trials revealed no significant difference, and heterogeneity was moderate to substantial (g = 0.8730, SE = 0.0761, z(17) = −1.79, p = 0.0931, Q(df = 16) = 41.33, p = 0.0005, I2 = 61.3%). Egger’s regression test indicated no significant publication bias (p = 0.2738). A trim-and-fill analysis identified no missing studies.

Meta-regression of PSD features and study completion rate

The study completion rate in the intervention compared to the control condition had a significant negative relationship with the difference in the number of PSD features (β = −0.0293, SE = 0.0121, t(17) = −2.4142, p = 0.0281; Fig. 5), as well as with the difference in the number of PSD categories (β = −0.0830, SE = 0.0292, t(17) = −2.8472, p = 0.0116).

Fig. 5: Mixed effects meta-regression analysis of difference in PSD features and study completion rate for app vs control.

Bubble plot of mixed effects meta-regression analysis of difference in PSD features and study completion rate for app vs control.

Removal of outliers did not significantly impact any of the results (Supplementary Note 1).

Exploratory analyses

No direct association was found between efficacy and completion rate (g = −0.2134, SE = 0.1442, t(15) = −1.4806, p = 0.1609). Efficacy also did not meaningfully change the association between completion rate and PSD features (β = −0.0649, SE = 0.1434, p = 0.6588, F(df1 = 2, df2 = 12) = 0.8320, p = 0.4588). Including a regressor for the type of control condition (waitlist vs. active control) also did not meaningfully change the association between PSD features and study completion rate (β = −0.1900, SE = 0.1254, t(17) = −1.5150, p = 0.1520). Including a regressor for initial symptom severity did not meaningfully change the association between PSD features and study completion rate (β = −0.0949, SE = 0.0820, t(16) = −1.1566, p = 0.2682).

Discussion

The principal findings of this meta-analysis indicate that standalone smartphone apps for depression and anxiety are efficacious, with modest overall effects in randomized controlled trials (g = 0.2888). The efficacy findings are consistent with those shown in previous meta-analyses2,3,4,5. This meta-analysis is the first to demonstrate that apps that use a greater number of engagement features have larger clinical effects. Thus, the findings extend the extant literature by demonstrating the importance of engagement techniques to improve efficacy of app-based interventions for depression and anxiety. These findings can inform development and implementation of future apps.

A lack of engagement with mental health apps has been reported as a key limitation for realizing the potential of apps to broadly disseminate mental health treatments47. Apps that include guidance (i.e., from a therapist or coach) have generally been found to have larger effects than standalone interventions5,19,48. Given that standalone mental health apps require fewer resources than guided apps, identifying factors that might increase engagement and efficacy without therapist involvement could greatly improve dissemination efforts. “Engagement features,” such as the elements of PSD, have been found to increase engagement with mental health apps as measured by duration of use and completion of interventions7,49. Our review indicates that engagement features are also associated with increased clinical efficacy and therefore should be included in standalone mental health apps to increase their impact for broader dissemination.

The number of engagement features used in a given app had a significant negative association with study completion rate. This negative association was contrary to our a priori hypothesis that PSD features would be associated with increased app completion and thereby increased efficacy. One possible interpretation is that participants using apps with more PSD features use the app more frequently, benefit more quickly, and lose motivation to continue once their symptoms have improved, compared to participants using less engaging apps; however, our exploratory analyses suggest that this is not the case. We also found that these results were not driven by the type of control condition or by initial symptom severity. In subsequent studies, analyses of interim symptom change could help to better understand the association between PSD features and completion rates. Additionally, while completion rate was the most readily available measure of engagement, other engagement outcomes, such as minutes spent using the app or self-reported usefulness, usability, and satisfaction50, might yield more meaningful information about PSD features and engagement.

We found that some PSD features were incorporated into all apps (e.g., primary task support), while others were less commonly used (e.g., social support), and behavioral economics features such as pre-commitment pledges, loss aversion, and lotteries were rarely incorporated. This may be due, in part, to the historical roots of these literatures. PSD is well established as an approach to the development of new technologies and therefore may be more naturally incorporated into app development7,8. Behavioral economics traditionally focused on economic decision-making51, was later applied to behavioral health52, and has only more recently been used in the context of mental health interventions53. The results of our meta-analysis suggest that increasing the number of engagement techniques may be beneficial, so designers may need to draw from a range of engagement approaches to maximize the impact of app-based interventions. Furthermore, elements such as social support, which span both the PSD and behavioral economics literatures, may be particularly promising6. Incorporating social support may also be the most likely way to engage patients without the use of therapists or coaches and, in itself, is a predictor of mental health outcomes54.

There are several important limitations of the current review. The literature on mental health apps is heterogeneous, and a number of studies included samples without psychiatric diagnoses and/or tested apps in combination with other interventions to varying degrees. Clearer standards for reporting on app content in research studies could help subsequent syntheses of the literature. In our review, all available information about app content, design elements, privacy and security features, and accessibility was extracted from descriptions and screenshots of the apps in the papers, appendices, and supplementary materials. However, apps may have included additional features that were not described in these materials. Another limitation is the use of completion rate as the primary metric of engagement. While this approach has been used previously in the mental health app engagement literature19 and is an accessible metric because it is reported in most studies, it may be a crude measure of engagement. The number of minutes spent using the app, number of logins, or number of modules completed (if applicable) may measure engagement more meaningfully55. Furthermore, the quality of engagement may affect clinical outcomes as much as, or even more than, the duration of engagement. The issue of defining meaningful engagement has been gaining attention and will likely influence the next steps in measuring app engagement56.

Overall, smartphone apps appear to be efficacious in decreasing anxiety and depression symptoms. The use of persuasive design features is associated with larger effect sizes and may be useful in increasing clinical efficacy. Relatively few mental health apps used non-PSD behavioral economics techniques, which have shown promise for improving outcomes in app-based interventions for smoking cessation57, physical activity in patients with obesity58, and adherence to diabetes medication59. Incorporating additional PSD and behavioral economics techniques that have been rarely used but have shown promise (e.g., social support7) may further improve outcomes in mental health treatments.

Methods

Literature search

This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement for transparent, comprehensive reporting of methodology and results60. The review adhered to a strict protocol submitted for registration with PROSPERO on April 1, 2020 (ID 171194) before the start of analyses. The protocol review was ongoing as of December 4, 2020 (CRD42020171194): https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42020171194.

A systematic search was performed of the following databases: Ovid Medline (ALL − 1946 to present); Ovid Embase (1974 to present); The Cochrane Library (Wiley); the Cumulative Index to Nursing and Allied Health Literature (CINAHL; EBSCO interface); PsycINFO (EBSCO); PsycArticle (EBSCO); and Ovid Allied and Complementary Medicine (AMED). Since the focus of the study was on published research trials conducted on the apps, searches were not carried out directly in Google Play or the Apple App Store. Our search terms included combinations, truncations, and synonyms of “depression,” “social anxiety,” or “anxiety,” in combination with “mobile application” or “smartphone.” The search terms were adapted for use with all chosen bibliographic databases. Additional records were retrieved by checking the bibliographies of studies that met inclusion criteria. No language, article type, or publication date restrictions were used. Searches were completed in March and April 2020.

Eligibility criteria

Included articles were English-language empirical studies from peer-reviewed publications, conference presentations, dissertations, and gray literature/unpublished work. The intervention was required to be delivered by smartphone app alone and to be delivered over time with the purpose of treating depression and/or anxiety (major depressive disorder, dysthymia, other depressive disorder, generalized anxiety disorder, and/or social anxiety disorder). Study participants could be any age but needed to identify as having depression and/or anxiety, either through clinical diagnosis or self-endorsement. To be included, studies had to report at least one measure of participant engagement (i.e., attrition, adherence, engagement, dropout).

Studies were excluded if: (a) the intervention was delivered partly or entirely by a non-smartphone method; (b) the study population was required to have a specific medical condition, other DSM-5 diagnosis, no diagnosis (healthy), or unspecified diagnosis, unless the study included a subgroup analysis of participants with depression and/or anxiety only; or (c) adequate information regarding intervention characteristics and/or engagement outcomes could not be obtained from the paper or from contacting the study authors.

Data collection and analysis

Two of three raters (E.B., J.B., or A.W.) independently completed each of the following steps, and conflicts were resolved by another rater (M.S.).

Selection of articles

Titles and abstracts were evaluated against the inclusion and exclusion criteria. Studies that were deemed potentially relevant were next evaluated by their full text. Reasons for exclusion were recorded. When more than one publication of a study was found, the study with the most information was used for data analysis.

Data extraction

Data extracted included study identifiers, study characteristics, intervention characteristics, techniques to increase engagement, engagement measures, efficacy measures, app content, app modality, accessory interventions, and usability features. Techniques to increase engagement are described further in Supplementary Information. Only data that were explicitly reported in the publications were extracted. App profiles in the Google Play or Apple App Store were not accessed as we could not verify that features identified in the app profile were available in the version of the app referenced in the paper.

  1. Study Identifiers: the first author’s last name and the publication year.

  2. Study Characteristics: the population, comparator (if applicable), outcome, and study design, as well as participant demographics such as age and gender.

  3. Intervention Characteristics: the intervention or cohort name, number of trial arms, primary condition, duration of intervention, duration of follow-up, modules to be completed, automated or guided delivery, format of delivery, and outcome measures.

  4. Techniques to Increase Engagement: components of PSD as outlined in Kelders et al.7 include primary task support (reduction, tunneling, tailoring, personalization, self-monitoring, simulation, and rehearsal), dialogue support (praise, rewards, reminders, suggestion, similarity, liking, and social role), social support (social learning, social comparison, normative influence, social facilitation, cooperation, competition, and recognition), and credibility support (trustworthiness, expertise, surface credibility, real-world feel, authority, third-party endorsements, and verifiability). Non-PSD behavioral economics techniques included loss aversion, fresh start effect, pre-commitment pledges, and financial incentives including lotteries. Each component or technique had a predefined definition and guideline for being coded as present (see the coding sketch following this list). Additional information can be found in Supplementary Tables 1 and 2.

  5. Measures of Engagement: adherence to study protocol, completion rate, amount of time spent in app, modules completed, features or exercises accessed, duration of use of app, or other. All available measures of engagement were recorded for each study.

  6. Measures of Efficacy: change in score on a validated scale for depression or anxiety.

  7. App Content: app content was categorized based on categories identified in previously published systematic evaluations of publicly available mental health apps6,61. App content included psychoeducation, cognitive techniques, behavioral techniques, mindfulness, relaxation, mood expression, tracking of mood patterns, tracking of thought patterns, tracking of behavior, tracking of symptoms, and tracking of physiological parameters.

  8. App Modality: information on app modality was recorded when available. App modalities included text, graphs, photos, illustrations, videos/films, and others.

  9. Safety, Privacy, and Accessibility Features: integration and accessibility features such as cost, availability to the public, safety features, and privacy protection features. These features were adapted from a framework for evaluating mobile mental health apps62.
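
To make the coding scheme concrete, the sketch below (in R, the language in which all analyses were conducted) shows one way presence/absence ratings could be tallied into per-app feature and category counts, as summarized in Tables 2 and 3. All names here (the apps data frame, the app and feature columns) are hypothetical illustrations, not the authors’ actual coding sheet.

```r
# Hypothetical illustration of the feature-coding scheme: each app is coded
# 1/0 for the presence of each PSD feature, then summed into feature and
# category counts.
apps <- data.frame(
  app             = c("AppA", "AppB"),
  reduction       = c(1, 1),  # primary task support features
  self_monitoring = c(1, 0),
  reminders       = c(0, 1),  # a dialogue support feature
  cooperation     = c(0, 0)   # a social support feature
)
feature_cols <- setdiff(names(apps), "app")
apps$psd_features <- rowSums(apps[, feature_cols])  # PSD features per app

# Count how many PSD categories have at least one feature present
categories <- list(
  primary_task = c("reduction", "self_monitoring"),
  dialogue     = "reminders",
  social       = "cooperation"
)
apps$psd_categories <- rowSums(sapply(categories, function(f)
  rowSums(apps[, f, drop = FALSE]) > 0))
```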

Inter-rater agreement

At the title and abstract screening stage, inter-rater agreement was 93.8%. At the full-text screening stage, inter-rater agreement was 90.1%. At the engagement feature rating stage, inter-rater agreement was 82.6%. Disagreements were resolved by a third rater (M.S.) at the screening stages and by consensus at the feature rating stage.

Assessment of risk of bias

Non-randomized experimental studies were evaluated with the Joanna Briggs Institute critical appraisal checklist for quasi-experimental studies to assess quality45. Because the Joanna Briggs Institute checklist does not assign specific ratings (“high,” “medium,” or “low”) to studies based on the number of criteria met, we adapted a rating system used in previously published systematic evaluations of mental health apps6,61. The methodological quality of each study was rated as high (7–9 criteria fulfilled), medium (4–6 criteria fulfilled), or low (0–3 criteria fulfilled).
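
As a simple illustration (a hypothetical sketch, not the authors’ code), the mapping from criteria counts to the adapted quality rating can be expressed as:

```r
# Map the number of Joanna Briggs Institute checklist criteria met (0-9)
# to the adapted quality rating described above.
rate_quality <- function(n_criteria) {
  cut(n_criteria, breaks = c(-1, 3, 6, 9),
      labels = c("low", "medium", "high"))
}
rate_quality(c(2, 5, 8))  # -> low, medium, high
```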

The Cochrane Collaboration Revised Risk of Bias tool was applied to randomized controlled trials46. Each RCT was assessed against five bias domains to produce a summary risk of bias score for each domain and overall (low risk, some concerns, or high risk of bias)46.

Data analysis

A meta-analysis was conducted to calculate a pooled effect size of treatment efficacy. To test pre- and post-test change scores on anxiety and depression questionnaires, we applied a random-effects model to account for the expected heterogeneity among the studies, given the variety of types of apps, durations of intervention, and other factors. If a study used multiple measures of anxiety and/or depression symptoms, the instrument most frequently used among all studies was used to calculate efficacy so as to reduce heterogeneity. If an app was used in multiple independent studies, it was included multiple times. In supplementary analyses, we removed outliers, defined as studies whose 95% confidence interval of the effect size did not overlap with the 95% confidence interval of the pooled effect size63. We compared analyses with and without outliers to determine whether the presence of outliers changed the observed results. Risk of publication bias was examined using Egger’s regression, and Duval and Tweedie’s trim-and-fill analysis was used to re-calculate the pooled effect size after statistically accounting for any studies that may have introduced publication bias64. The degree of statistical heterogeneity was quantified using I² values, with <25% representing low heterogeneity, 25–50% low-to-moderate heterogeneity, 50–75% moderate-to-substantial heterogeneity, and >75% substantial heterogeneity63. τ² was calculated using the Hartung–Knapp–Sidik–Jonkman (HKSJ) method, which is appropriate when there is a large degree of heterogeneity between studies.
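
For concreteness, this analysis could be implemented along the following lines with the metafor package in R; this is a minimal sketch, and the data frame dat and its columns yi (Hedges’ g per study) and vi (sampling variance) are hypothetical placeholders rather than the authors’ actual code.

```r
library(metafor)

# Random-effects model: Sidik-Jonkman tau^2 estimator with the
# Hartung-Knapp adjustment (together, the HKSJ method)
res <- rma(yi, vi, data = dat, method = "SJ", test = "knha")
summary(res)   # pooled g, SE, Q statistic, I^2, tau^2

# Egger's regression test for funnel-plot asymmetry (publication bias)
regtest(res)

# Duval and Tweedie's trim-and-fill analysis to re-estimate the pooled
# effect after imputing potentially missing studies
summary(trimfill(res))
```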

A second meta-analysis was run to examine the efficacy of apps compared to control conditions in randomized controlled trials. A random-effects model was again used given the expected heterogeneity among the studies. Efficacy was calculated as the pre/post change on a depression or anxiety scale. When a study had multiple control conditions, the active control condition was selected, as opposed to a waitlist or treatment-as-usual control. If an app was used in multiple studies, it was included multiple times. If a study used multiple apps, both apps were included if there was a non-app control condition; otherwise, the intervention app was compared to the control app. To examine whether risk of bias score affected efficacy, a subgroup analysis stratifying studies by risk of bias score was performed (Supplementary Note 1). Independent mixed-effects meta-regression models were run to examine how app efficacy was affected by the number of PSD features used in the app compared to the control condition, the number of PSD categories used in the app compared to the control condition, and the study completion rate of app users compared to the control condition. Because the number of PSD features or categories denoted levels rather than continuous variables, the models were fit without an intercept.
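
A sketch of one such meta-regression, again under hypothetical naming assumptions (dat_rct for the RCT-level data, psd_diff for the app-minus-control difference in PSD features):

```r
# Mixed-effects meta-regression of efficacy (app vs. control) on the
# difference in the number of PSD features; the `- 1` term removes the
# intercept, as described above.
res_psd <- rma(yi, vi, mods = ~ psd_diff - 1, data = dat_rct,
               method = "SJ", test = "knha")
summary(res_psd)  # beta for psd_diff with Knapp-Hartung t-test
```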

A third meta-analysis examined the study completion rate of apps compared to control conditions in randomized controlled trials. Study completion rate was calculated as the proportion of enrolled and/or randomized study participants who remained in the study at the time the intervention ended. A random-effects model was again used given the expected heterogeneity among the studies. Procedures were otherwise the same as for the second meta-analysis and the meta-regression. To further explore the association between PSD features and completion rates, exploratory control analyses were conducted. An independent mixed-effects meta-regression model was run to examine how app efficacy affected study completion rate. Multiple meta-regressions were conducted to examine the relationship of various covariates to study completion rate: PSD features and efficacy, PSD features and type of control condition (waitlist vs. active control), and PSD features and initial symptom severity. Initial symptom severity was rated numerically based on the interpretation of the instrument used, with minimal or none rated as 0, mild as 1, moderate as 2, moderately severe as 3, and severe as 4. All analyses were conducted in R.
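
One way such a completion-rate analysis could be set up is sketched below. This is a hedged illustration: the Results report g for this analysis, whereas for simplicity this sketch pools completion vs. dropout as odds ratios, and all column names (comp_app, n_app, comp_ctrl, n_ctrl, psd_diff, severity) are hypothetical.

```r
# Completion vs. dropout in each arm treated as a 2x2 table, pooled as
# (log) odds ratios; column names are hypothetical placeholders.
dat_comp <- escalc(measure = "OR",
                   ai = comp_app,  n1i = n_app,   # app arm: completers / randomized
                   ci = comp_ctrl, n2i = n_ctrl,  # control arm
                   data = dat_rct)
res_comp <- rma(yi, vi, data = dat_comp, method = "SJ", test = "knha")

# Exploratory meta-regression: completion rate vs. PSD feature difference,
# controlling for initial symptom severity coded 0 (none) to 4 (severe)
res_sev <- rma(yi, vi, mods = ~ psd_diff + severity - 1, data = dat_comp,
               method = "SJ", test = "knha")
```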