Introduction

Ageing populations, the rising prevalence of treatment-intensive and costly non-communicable diseases, and an increasing shortage of specialized personnel pose serious threats to the financial sustainability of healthcare systems1. Without timely transformations of healthcare systems, rising socioeconomic and geographical inequalities in disease burden and unmet patient needs may be further exacerbated by inequalities in access to adequate healthcare services2.

Over the last decade, advancements in mobile technology have created new opportunities to meet this challenge. Most notably, mobile- or tablet-based health applications (apps) have gained attention for their potentially beneficial effect on patients’ lives. For example, mobile health (mhealth) apps can activate patients with chronic conditions to engage in online education, peer support, lifestyle monitoring and coaching consultations, and help them track their health status, fostering self-engagement and self-compliance in the disease management process to improve health outcomes3. Additionally, mhealth apps can bridge geographical barriers to healthcare access, offering real-time responses to patient needs in remote locations4. Lastly, health apps can relieve the burden on medical personnel by supporting medication prescription management and intake, as well as symptom monitoring5.

The importance of new technologies has also been highlighted in the World Health Organization (WHO) global strategy on digital health 2020–20256. Member countries are encouraged to develop digital healthcare strategies that consider national contexts such as culture, public needs and available resources. However, large-scale integration of new technologies into standard care processes requires sufficient confidence in their effectiveness and cost-efficiency. Effectiveness can, for example, be hampered by technological challenges faced by users and delivery agents, data protection issues or privacy concerns, use of ineffective components, and suboptimal sustainability of user engagement or long-term effects7. Also, population-wide implementation may, in some instances, add to existing health inequalities in society by introducing a digital divide8,9 with regard to access to (first level), usage of (second level), and benefits from usage of (third level) digital health technology10. Large-scale implementation would, in such cases, entail the waste or misallocation of (usually scarce) healthcare system resources and, in the worst case, pose the risk of detrimental individual health effects and loss of trust in technology or the healthcare system. A robust evidence base is therefore a prerequisite for health systems to initiate adequate policy reforms.

Mirroring the increase in available health apps, the number of scientific evaluations of their efficacy and effectiveness has also increased over the last decade. To make these research results actionable, an up-to-date, comprehensive yet concise mapping of the available high-quality evidence on the efficacy and effectiveness of mhealth apps is required. A previous umbrella review attempted to summarize systematic reviews on a broad spectrum of telemedicine interventions beyond mhealth apps, with heterogeneous study designs including non-randomized controlled trials and various disease indications11. However, this broad summation of different intervention technologies and evidence levels makes it hard to draw conclusions specifically for mhealth apps. A second existing mapping of effectiveness reviews on mhealth interventions focused specifically on diabetes indications, but also included systematic reviews of heterogeneous study designs and various types of mhealth interventions beyond mhealth apps12. Another umbrella review focused on randomized controlled trials (RCTs) and restricted its scope to diabetes, dyslipidemia and hypertension, while trying to summarize evidence not only on mhealth apps but for a broader range of telemedicine interventions13. Thus, there remains an unmet need to identify both well-researched and potentially under-researched indications specifically with regard to mhealth app effectiveness.

The objective of this umbrella review is to systematically map and summarize existing systematic reviews of RCTs investigating the effectiveness of mobile phone or tablet app-based mhealth interventions in patients. We provide a summary of the investigated patient populations, specific intervention configurations and features, reported comparators, and outcomes used to assess efficacy and effectiveness, and we assess overall review quality; synthesizing or re-analysing outcome data is beyond the scope of this study.

Results

Study selection

The study selection process according to PRISMA requirements14 is summarized in Fig. 1. The database search yielded a total of 1895 records, with an additional 2513 records identified through forward and backward citation searching of records from the initial search deemed eligible after full-text screening by the first author. After de-duplication, 4253 articles were screened by title and abstract. Of these, 3892 records were excluded, and 361 records were included for full-text screening. The final number of included articles was 48. Inter-rater reliability (IRR) for title/abstract screening and full-text screening was κ = 0.3469 and κ = 0.9326, respectively. A list of the 313 studies excluded after full-text screening, with exclusion reasons for each study, can be found in Supplementary Table 1.

Fig. 1: PRISMA flow chart of retrieved, screened and included articles.

Flow chart illustrating the process of study identification for the present umbrella review with database searches (last updated on August 28, 2023), deduplication, title and abstract screening as well as full-text screening, leading to a final inclusion decision for n = 48 systematic reviews.

Review characteristics

Included reviews were published between 2013 and 2023, with the highest number of reviews published in 2020 (n = 10) and the first three quarters of 2023 (n = 9) (see Fig. 2).

Fig. 2: Number of included reviews by publication year.

Vertical bar chart illustrating the number of included systematic reviews (n = 48 in total) on the y-axis stratified by year of publication on the x-axis.

All included reviews considered articles without geographic restrictions, except one focusing on China15. The number of RCT studies included in a review ranged from two to 36. Out of the 48 included reviews, 3515,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49 conducted data pooling and meta-analyses whereas 13 reviews50,51,52,53,54,55,56,57,58,59,60,61,62 provided a narrative synthesis without meta-analysis. Median follow-up periods ranged from 1 to 10 months, with no respective information reported in six reviews15,31,32,33,50,53. A summary of review characteristics is shown in Supplementary Table 2.

Methodological quality

Figure 3 summarizes the frequency of each AMSTAR2 rating for each domain across reviews. Supplementary Fig. 1 additionally presents the domain-specific methodological quality ratings for each review.

Fig. 3: Frequency of risk of bias for each domain.

Horizontal stacked bar chart illustrating on the x-axis the share of the n = 48 (100%) included systematic reviews rated as either low risk of bias (green), showing some concerns with regard to bias (yellow), or high risk of bias (red) for each of the 16 AMSTAR2 items (listed on the y-axis). White bar stacks represent the share of systematic reviews without meta-analysis, to which AMSTAR2 items 11, 12, and 15 were not applicable (“NA”). The acronym PICO in the first AMSTAR2 item stands for Population, Intervention, Comparator, Outcome.

Sixteen reviews stated that they had registered or otherwise published a review protocol17,19,25,34,35,38,40,41,42,47,48,51,54,56,60,62. After checking these protocols, thirteen were rated as incomplete because they lacked information on the search terms defining the search strategy (item 2)17,19,34,38,40,41,47,48,51,54,56,60,62. All reviews searched at least two databases and provided their full search strategy in the final report, but 25 reviews16,17,19,21,22,27,28,29,33,38,39,41,43,45,47,48,49,50,52,53,54,55,58,59,61 failed to justify publication restrictions, for example regarding language, entailing a “no” on item 4. Six reviews provided a list of studies excluded at the full-text screening stage (item 7)26,37,42,48,56,57. Overall, satisfactory assessment tools for risk of bias were used (item 9). Three reviews reported conflicts of interest (item 16)16,42,49. We rated one review as moderate quality56. IRR for quality assessment across all items and reviews was κ = 0.6671. Item-specific IRRs can be found in Supplementary Table 3.

Extraction results

Included RCTs covered populations from all continents, with a majority of studies conducted in high- or middle-income countries such as the United States, China, Australia, United Kingdom, Spain, Norway and Japan. Seven reviews33,38,45,46,48,49,50 did not report countries of included studies.

An overview of covered health indications is displayed in Supplementary Fig. 2 and, in more aggregated disease groups, Fig. 4. Most reviews targeted specific indications, including type 2 diabetes (T2DM) (n = 5)19,20,22,23,26, hypertension (n = 4)15,27,31,38, depression (n = 3)33,53,61, overweight/obesity (n = 3)40,41,52, chronic obstructive pulmonary disease (COPD) (n = 2)35,39, urinary incontinence (n = 2)56,62, asthma (n = 1)57, autism spectrum disorders (n = 1)32, post-traumatic stress disorder (PTSD) (n = 1)59, type 1 diabetes (n = 1)47, Parkinson’s disease (n = 1)45, knee arthroplasty (n = 1)46 and lower back pain (n = 1)51. Twenty-two reviews covered multiple conditions within their scope, such as diabetes of various types (n = 7)18,21,24,25,36,37,50, chronic non-communicable diseases (n = 2)55,58, anxiety and depression (n = 2)43,49, conditions requiring rehabilitation (n = 2)42,44, pediatric diseases (n = 1)54, diseases requiring medication (n = 2)17,34, cardiovascular diseases (n = 2)16,30, pain conditions (n = 2)48,60, mental illnesses (n = 1)28, or a combination of diabetes and hypertension (n = 1)29.

Fig. 4: Frequency of aggregated disease indications addressed in the included systematic reviews.

Vertical bar chart illustrating the number of included systematic reviews (on the y-axis) covering each of the 11 aggregated groups of health conditions (on the x-axis) which we identified across the n = 48 included systematic reviews. The total number of systematic reviews included in the graph exceeds the number of included systematic reviews as some systematic reviews cover more than one group of health conditions. Cardiovascular conditions include hypertension, stroke, obesity, atrial fibrillation, heart failure, myocardial infarction, coronary heart disease, hypercholesterolemia, prediabetes and cardiovascular disease. Diabetes mellitus includes type 2 diabetes, type 1 diabetes, diabetes, and gestational diabetes. Musculoskeletal conditions include fibromyalgic syndrome, musculoskeletal disorders, chronic pelvic pain, chronic musculoskeletal pain, multiple sclerosis, chronic low back pain, chronic neck pain, non-specific lower back pain, unspecified chronic pain, chronic pain or fibromyalgia, Parkinson’s disease, and neurological disorders. Mental health conditions include depression, anxiety, bipolar disorder, autism, post-traumatic stress disorder, attention deficit hyperactivity disorder, and schizophrenia. Respiratory conditions include asthma, chronic obstructive pulmonary disease, lung transplant, allergic rhinitis, and chronic lung disease. Autoimmune conditions include autoimmune deficiency syndrome and psoriasis. Orthopedic conditions include osteoarthritis, spina bifida, and post-operative knee arthroplasty. Urinary tract disorders include urinary incontinence and interstitial cystitis. Heterogeneous diseases include unspecified chronic diseases and multimorbidity. Cancer includes chemotherapy-related cancer toxicity. Gastrointestinal conditions include irritable bowel syndrome. For a more detailed illustration of frequencies for all 49 ungrouped individual health conditions, see Supplementary Fig. 2.

Information on pooled sample size was provided by all except three reviews31,45,46 and ranged from 282 to 7669 patients. Further information on extracted population characteristics can be found in Supplementary Tables 4 and 5.

The health apps performed a wide array of functions including symptom monitoring and assessment, medication reminders, real-time biofeedback, personalized programs and education, tailor-made motivational messages or cues and feedback, social support, communication with healthcare professionals, goal setting, data storage, and visualization.

A summary of reported app characteristics and functionalities is documented in Supplementary Table 4.

Comparator conditions were described in 43 out of the 48 reviews. Some reviews included only usual care comparators, while others used a range of comparators, including other control apps, lighter technological features, text messages, paper-based monitoring diaries, in-person and standard education, and no treatment. A summary of reported comparators is shown in Supplementary Table 4.

Eleven reviews reported results on T2DM patients. Five focused on T2DM alone19,20,22,23,26, while six included broader populations but conducted (subgroup) analyses specifically on T2DM21,24,25,36,37,58. All eleven reviews except one19 assessed glycemic control, operationalized as reduction in glycated hemoglobin (HbA1c), as main or secondary outcome. Further outcomes comprised changes in body weight, waist circumference or body mass index19,20,22,23, fat mass or percentage of body fat19, lipids, blood pressure, lifestyle changes, medication use20,22,23, psychological symptoms and quality of life (QoL)23. All reviews addressing other types of diabetes (e.g., type 1 diabetes, mixed types, prediabetes, gestational diabetes)18,36,37,47,50 focused on HbA1c changes as main outcome, while only a few included adverse events37,54 and QoL54. Another outcome reported for diabetic populations was medication adherence, but it was reported in samples that did not exclusively include diabetes patients (patients with prescription drugs16, chronic disease patients34,54).

Reviews including patients with hypertension focused on evaluating the impact of health app interventions on medication adherence27,31,38, systolic and diastolic blood pressure15,27,38, and health behaviors27,38. Three reviews16,17,34 reported on medication adherence, and two reviews16,29 on systolic and diastolic blood pressure, lipids and anthropometric outcomes in samples that did not exclusively include hypertensive patients.

Reviews focusing on patients with depression measured improvements in depressive symptoms33,53,61, self-esteem and QoL53,61. Two reviews additionally reported results for medication adherence17,61, one of which61 also reported on psychiatric admissions, medication side effects, resilience, attitudes, sleep disturbances and further psychological and behavioral outcomes. Further reviews reported on depressive28,43,49, mania and psychotic symptoms as well as adverse events28 and anxiety symptoms43,49 in samples that included depression patients, though not exclusively. Outcomes evaluated in other mental health indications included symptoms related to PTSD59, positive and negative psychotic symptoms including hallucinations or delusions and absence of experience (in schizophrenia), mania and depression symptoms (bipolar disorder)28, and autism-related outcomes based on the Mullen Scales of Early Learning, the MacArthur-Bates Communicative Development Inventory and the Communication and Symbolic Behavior Scales32.

Reviews focusing on overweight and obesity used the following outcomes: weight loss40,41,52, waist circumference, blood pressure, lipids, HbA1c, energy intake40,41, physical activity, body fat, BMI40, motivation and adherence52.

Outcomes reported in other indications can be found in Supplementary Tables 4 and 6.

Figure 5 illustrates the types of outcomes reported in the systematic reviews by aggregated groups of investigated health conditions. More details on the uncategorized outcomes can be found in Supplementary Tables 4 and 6.

Fig. 5: Distribution of outcome types reported by categorized disease indications.

Vertical stacked bar chart illustrating the percentage of behavioral (red stacks), healthcare resource utilization (rose stacks), laboratory/anthropometric (green stacks), and patient reported (blue stacks) outcomes on the y-axis by aggregated groups of health conditions (on the x-axis) covered in the total of n = 48 included systematic reviews. Behavioral outcomes comprised behaviors such as medication adherence and physical activity. Healthcare resource utilization comprised outcomes such as hospitalizations, and doctor visits. Laboratory/anthropometric outcomes included clinical or body measurements. Patient-reported outcomes comprised subjectively reported outcomes such as quality of life or symptom improvement.

Twenty-three out of 35 meta-analyses conducted subgroup analyses18,19,20,21,23,24,25,26,27,28,29,33,34,36,37,40,41,43,47,48,49,53,57. Investigated subgroups were defined by number, types and intensities of app features, differentiation between standalone or integrated interventions, baseline demographic or disease-related participant characteristics, follow-up duration, intervention duration, study quality, type of comparator, sample size, attrition, analytic approaches, and outcome assessment methods. Summaries of the subgroups investigated are in Supplementary Table 7.

Overall, 41 out of the 48 reviews concluded that app-based health interventions were effective in improving health outcomes. The seven systematic reviews which did not conclude that app-based health interventions were effective either reported inconclusive results, as some studies showed effectiveness and others did not35,51,53,54,57,61, or reported clinically irrelevant improvements41. Reported synthesized outcomes, types of effect estimates, and numbers of underlying individual studies were heterogeneous. A complete overview of extracted results and summaries of the authors’ conclusions is shown in Supplementary Table 6. For example, for medication adherence, meta-analysed effect estimates reported in six systematic reviews ranged between 0.38 and 0.8 standardized mean difference, with 2–14 studies summarized; 6 out of 6 meta-analysed point estimates suggested an increase in medication adherence, and 6 out of 6 meta-analytic results suggested statistically significant effects. Three reviews additionally expressed effect estimates for medication adherence in terms of odds ratios or mean differences. For HbA1c, meta-analysed effect estimates from 13 systematic reviews ranged between 0.06% and −0.6% (weighted) mean difference, with 2–24 studies summarized; 27 out of 28 meta-analysed point estimates suggested a reduction in % HbA1c, and 18 out of 28 meta-analytic results suggested statistically significant effects. For systolic blood pressure (SBP), meta-analysed effect estimates from nine systematic reviews ranged between 0.1 and −8.12 mmHg (weighted) mean difference, with 2–13 studies summarized; 8 out of 10 meta-analysed point estimates suggested a reduction in SBP, and 4 out of 10 meta-analytic results suggested statistically significant effects. Two reviews additionally expressed effect estimates for SBP in terms of odds ratios or mean differences. In two reviews with meta-analysed results on SBP, the outcome unit was unclear.

Discussion

Despite a rapid increase in evidence syntheses from RCTs on health app effectiveness, the availability of systematic reviews varies widely between indications. By far the most frequently covered indication is T2DM (confirming a trend that was already emerging in 2018 according to a previous umbrella review with a similar scope63), followed by hypertension, obesity and depression, potentially leaving evidence gaps for other diseases. A substantial proportion of systematic reviews attempted to cover multiple indications at the same time or had no disease focus or outcome restrictions, entailing less focused interpretation of results and, due to the resulting large heterogeneity across included studies, more frequent narrative syntheses without meta-analysis instead of quantitative syntheses.

Generally, as also criticized in the abovementioned 2018 umbrella review63, average follow-up times remained short. Also, outcome measures varied between and within indications, ranging from more objective and proximal laboratory parameters such as HbA1c and blood pressure to more subjective patient-reported measures such as symptom scores and QoL. In contrast, measures of healthcare resource utilization, such as frequency of physician visits or hospitalizations, were rarely reported. This leaves knowledge gaps as to how health apps affect health outcomes, how long their effects are sustained, and the resulting burden for healthcare systems in terms of healthcare utilization effects.

Interestingly, effectiveness was evaluated with different degrees of standardization across indications (e.g., involving standardized cut-offs and objectively measurable outcomes). For the most frequently summarized cardiometabolic disease indications (i.e., diabetes, obesity and hypertension), we noted a more consistent and standardized use of outcome measures compared to other indications. Measurements of HbA1c levels in T2DM, blood pressure levels in hypertension, weight in obesity and medication adherence independent of indication17,25,38 were very frequently used. In contrast, symptom severity was overall measured in a much more heterogeneous fashion, as illustrated by a review which reported results on seven different measures for depressive symptoms alone33. This heterogeneity is partially expected given the nature of different conditions but makes comparability and pooling of evidence across studies more difficult.

In addition, a high proportion of the reviews mainly summarized evidence from high- and upper-middle-income countries, a phenomenon that seemed to be driven by a skewed availability of primary RCTs rather than by selection bias introduced at the level of the systematic reviews (for example, through narrowly defined inclusion criteria). This lack of research in low- and lower-middle-income countries is surprising and not proportional to the large potential for improving healthcare access in underserved communities usually credited to app-based interventions64. In these settings, healthcare staff is especially scarce, while smartphones are already widely available64. Consequently, more attention should be paid to these settings, especially to investigate how mhealth apps can help overcome barriers to accessing healthcare.

We extracted both effect estimates and authors’ conclusions from all systematic reviews, with 41 out of the 48 reviews concluding that app-based health interventions may be effective and the other seven stating inconclusive results. However, we would like to point out not only that a large share of systematic reviews was rated to be of low quality, but also that interpretation should ideally be indication- and outcome-specific and take into account not just statistical significance but also clinical relevance and the timing of outcome measurement. The broad range of effect sizes and the effect heterogeneity within and across reviews included in this umbrella review underline that, like other complex interventions, mhealth apps can have multidimensional effects: they can simultaneously target multiple patient-relevant outcome parameters and succeed in positively affecting some of these, but not others. We believe that this umbrella review can provide helpful and easily accessible orientation for policy leaders, clinicians and patients to find relevant synthesized evidence (and an assessment of its quality). However, weighing the potential benefits of mhealth app use against the resource investments required for their implementation will necessarily remain a context-specific task.

Even though we did not extract the effect estimates from pooled stratified and subgroup analyses conducted by the included systematic reviews, we extracted the types of subgroup analyses conducted, which we believe already provides interesting insights into which factors researchers perceive as impacting mhealth apps’ effectiveness in improving patients’ health outcomes. Our results suggest that specific app characteristics and features, such as data tracking and patient feedback modalities24,28, were seen as decisive for effectiveness. This is in line with previous findings: health tracking features and enhanced proximal engagement with the health app through push notifications with tailored health messages are essential in motivating users for self-management of their chronic diseases, and thereby enhance the effectiveness of mhealth apps65,66.

Additionally, specific design aspects of the intervention trials were often investigated as potential contributors to heterogeneity in measured effects. Specifically, the choice of comparators33,49, the duration of the app-based intervention and follow-up23,36,47, and the integration of mhealth apps with other supplementary intervention strategies were often treated as stratification factors for additional analyses. It is known that comparator choice can affect effect sizes and the success of blinding to study group assignment67. Subgroup analyses by intervention/follow-up duration may help identify waning efficacy over time, for example if participants lose motivation after having used a specific app for a period of time to track their health status, or in case of dissatisfaction with app features68. Frequently used cut-offs for differentiating follow-up were three to six months compared to longer-term durations (e.g., up to 12 months), indicating a relatively narrow focus on short- and mid-term effects. Subgroup meta-analyses by socio-economic characteristics of study participants, or by availability of alternative healthcare access routes (e.g., rural/urban setting), were rarely conducted, indicating that equity aspects may currently not be adequately considered in effectiveness and efficacy trials of health apps, despite ongoing discussions about the potential of digital health interventions to entail positive and negative equity effects (digital divide69).

To further increase the systematic and evidence-based evaluation of mhealth app interventions, more comprehensive and consistent reporting of app functionalities and selection of outcome measures is needed. For example, application developers and researchers could follow international guidelines such as the WHO’s mHealth evidence reporting and assessment checklist70. This checklist could be a useful tool in further standardizing reporting as it captures different essential domains from intervention delivery to replicability. Furthermore, future research could follow our comprehensive search strategy to map other parts (e.g., with regard to population base or study design) of the existing evidence base for the effectiveness of mobile phone applications. Additionally, it might be an interesting next step to pool effect estimates across systematic reviews after deduplicating the pool of underlying individual studies. Additional extractions and syntheses of results from pooled stratified/subgroup analyses may further elucidate factors which drive app effectiveness.

To maximize the efficacy and minimize the harms of mhealth apps, the social determinants of successful app-based health interventions should be analyzed. In fact, previous effectiveness reviews did not investigate whether different population subgroups (e.g., different age groups, genders, socio-economic status) benefit equally and equitably from app-based health interventions. Individuals who lack technological or digital literacy might engage less with, or benefit less from, such health interventions71, potentially aggravating existing inequities.

In spite of the remaining open questions outlined above, and given their potential effectiveness in improving health behaviors and health outcomes, fostering the diffusion of mhealth apps into healthcare systems might be worthwhile to support patients in becoming actively involved in their own disease management process, bridge geographical barriers in healthcare, and relieve the detrimental consequences of medical personnel shortages.

The strength of this umbrella review lies in its scope: mapping and characterizing the growing volume of systematic reviews on randomized effectiveness trials of mhealth apps in patients, highlighting the skewed distribution of scientific evidence across indications, and providing a concise overview of reported outcomes and the most frequently conducted subgroup analyses, which stress the importance of specific app features for effectiveness.

Further strengths of this umbrella review are the publicly available pre-specified protocol and the systematic process of summarizing the available evidence. This provides a transparent and replicable approach for future research and potentially regular updates in this fast-moving field. Our systematic search strategy with well-defined terms and criteria according to the PRISMA guidelines in two large and widely used databases enhanced the replicability of the results.

Nevertheless, several limitations should be considered when interpreting our findings. First, only published articles written in English were included, because we considered it improbable that authors would decide to publish the results of an extensive endeavor such as a systematic review in a language other than English, as this would considerably restrict the potential for impact and uptake of the results. Nevertheless, systematic reviews written in other languages may have been missed. Even though we deemed this risk relatively low, we cannot fully exclude that this decision may have contributed to the lop-sided distribution of countries represented in the included systematic reviews. Similarly, although we searched the most important database of systematic reviews (the Cochrane Database of Systematic Reviews, CDSR), the most comprehensive international biomedical citation database (MEDLINE)72, and the high-recency database PubMedCentral, we cannot fully exclude that searches of additional (for example, more regionally focused) databases might have identified some additional relevant reviews. However, since our objective was not to summarize or re-analyze outcome data (which requires strict completeness of the evidence base), we deemed the residual risk of missing a minority of available systematic reviews reconcilable with the primary purpose of this umbrella review: mapping and characterizing the growing volume of systematic reviews on randomized effectiveness trials of mhealth apps in patients. Also, some of the primary RCTs may have been included in more than one of the systematic reviews which constituted the evidence base for this umbrella review. An interesting next step might be to conduct meta-analyses for specific indications and outcomes after deduplication of the underlying original empirical studies, taking into account additional heterogeneities at the individual study level, such as those in follow-up time and RCT quality. While beyond the mapping scope of this umbrella review, such synthesis attempts at the deduplicated individual RCT level can lead to interpretable, unbiased effect estimates and avoid attributing undue weight to the results of RCTs that were of low quality or included in multiple systematic reviews73, thereby providing additional insights.

We acknowledge that by focusing on effectiveness outcomes from systematic reviews of RCTs in patient groups only, this umbrella review reflects only a specific part of the available evidence on smartphone applications, since evidence dimensions other than effectiveness/efficacy (e.g., equity, cost effectiveness) were not considered. Furthermore, we did not include systematic reviews that comprised studies on the general population, observational study designs that may allow for causal inference, or other intervention study types, such as non-randomized trials. Quasi-experimental studies could also have been an interesting source of evidence which we did not include. The rationale for our rather strict inclusion criteria with regard to study design was twofold: First, we wanted to specifically include systematic reviews with the highest internal validity when it comes to causal inference, which could be undermined by a lack of randomization74. Second, we attempted to keep the evidence base for this umbrella review manageable and at the same time as homogeneous as possible. We acknowledge that the downside of these decisions is that some relevant studies may have been excluded from this umbrella review.

Lastly, there were several systematic reviews that we excluded because they marginally failed to meet our inclusion criteria. Examples include a large systematic review by Cucciniello et al.75, which included 69 studies from chronic disease indications but was excluded because at least one study used WhatsApp as the intervention instead of a full-blown health app. Another example is a systematic review by Widdison et al.76, which summarized three RCTs on health app effectiveness in urinary incontinence but additionally included one observational follow-up study of a previous RCT intervention arm. Although these reviews potentially summarized evidence that could have been relevant to our research question, they were excluded from our umbrella review.

In conclusion, we found 48 systematic reviews published since 2013 that narratively or quantitatively synthesized effectiveness results for app-based health interventions in patients. These reviews targeted a range of different health conditions, with diabetes and hypertension being the most intensely covered and evaluated. There was substantial heterogeneity in what was defined as primary outcomes, but the majority of reviews concluded that app-based health interventions are likely to be effective. In reviews focusing on diabetes, obesity and hypertension, variability in reported outcome measures was lowest. Future research in other indications might follow these examples and attempt higher standardization of measurements, easing quantitative inference and allowing for more actionable conclusions. Additionally, studies with longer follow-up periods are required. Furthermore, the heterogeneous methodological quality of the evidence included in this umbrella review highlights the need to take quality assessments into account for policy decisions. Lastly, future evidence synthesis attempts should also map the additional evidence provided by systematic reviews summarizing other study designs and general rather than diseased populations. This would provide a more complete picture of the effectiveness of health app-based interventions and would support evidence-based public health and healthcare policy decisions alleviating economic pressures on healthcare systems.

Methods

Study design

We conducted an umbrella review of existing systematic reviews following (where applicable) the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) checklist14,77 (Supplementary Table 8), as recommended elsewhere78. We uploaded a pre-specified review protocol to the Open Science Framework database prior to conducting the initial literature search79. For protocol development, we consulted guidance documents78 including those published by Cochrane71 and JBI80. The scope of the review, as well as the pre-defined search strategy, eligibility criteria and extraction targets outlined below, remained essentially unchanged throughout the conduct of the review.

Eligibility criteria

We defined eligibility criteria around types of studies, population, interventions, outcomes, and study language (Table 1).

Table 1 Eligibility criteria.

This umbrella review included systematic reviews with and without meta-analyses. As RCTs are the gold standard for efficacy evaluations, we included only systematic reviews of RCTs. Systematic reviews that did not include RCTs or that included RCTs together with other study types were excluded.

We considered only reviews that included efficacy/effectiveness trials of app-based health interventions in patients. The study participants had to have a specific disease or health issue (as defined by the International Classification of Diseases, 10th Revision [ICD-10])81, that was targeted by the intervention in question. Health issues could be diagnosed by a health professional or self-reported. In contrast, reviews that considered studies on general populations without any specific health problems were excluded, even if they reported sub-analyses on separate patient groups. Furthermore, we excluded reviews targeting individuals with potentially addictive behaviors such as tobacco use, drinking, gambling or other substance use, as these behaviors are classified in Chapter XXI as “factors influencing health status and contact with health services”, and not within the disease-related ICD-10 chapters, and we deemed a potential separate umbrella review for such behavioral factors more appropriate than a combination with clear-cut diseases. Similarly, reviews on pregnant women without any additional specific medical condition were also excluded, as ICD-10 does not classify normal pregnancy within the disease-related chapters but in Chapter XXI.

We included reviews focusing on interventions which aimed at improving specific health-related outcomes via smartphone or tablet apps. The app could be a standalone or complementary intervention tool (i.e., coupled with personal interactions, text messaging or social media). In contrast, reviews comprising studies that evaluated solely non-app technologies such as text messaging, social media, wearable devices or websites were excluded. Reviews comprising studies that solely involved online communication applications (e.g., Instagram, WhatsApp, WeChat, Telegram, Skype) were excluded unless the app was specifically designed for health or medical purposes. Reviews comprising studies that evaluated health apps aiming to support users in primary prevention or the process of (self-)diagnosis and/or (self-)screening for yet undetected conditions were excluded.

No restrictions were set with regard to the types of comparators.

Reviews reporting on health or care process outcomes pertaining to the efficacy or effectiveness dimension of evidence were included. These outcomes included, but were not limited to, health outcomes, medication adherence, chronic disease management, or symptom relief. Reviews reporting exclusively on other dimensions of evidence such as diagnostic accuracy, concordance, feasibility, cost-effectiveness, resource consumption, costs, equity, or measurement accuracy were excluded.

Only articles with available full text in English were considered, as we assumed the efficiency gains of implementing a language restriction to outweigh the risk of missing out on important evidence.

Databases and search strategy

We searched MEDLINE and PubMedCentral via PubMed and the CDSR. Articles were included from database inception until March 15, 2022 and the search was updated on August 28, 2023.

The search strategy combined keywords and Medical Subject Headings (MeSH) structured around three components: (i) intervention; (ii) study design; (iii) outcome dimension (see Supplementary Tables 9 and 10 for the complete search strategy and number of associated hits in each electronic database).
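As a purely illustrative sketch (not the actual search strategy, which is reported in full in Supplementary Tables 9 and 10), the snippet below shows how such a three-component Boolean query can be assembled; all search terms and field tags are hypothetical placeholders.

```python
# Illustrative only: hypothetical placeholder terms, NOT the study's actual
# search strategy (see Supplementary Tables 9 and 10 for the real one).
intervention = ('("mobile applications"[MeSH Terms] OR "mhealth app*"[Title/Abstract] '
                'OR "smartphone app*"[Title/Abstract])')
study_design = '("systematic review"[Title/Abstract] OR "meta-analysis"[Publication Type])'
outcome_dim = '("effectiveness"[Title/Abstract] OR "efficacy"[Title/Abstract])'

# The three components are combined with AND, mirroring the structure described above.
query = " AND ".join([intervention, study_design, outcome_dim])
print(query)
```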

Forward and backward citation searches were additionally conducted for articles deemed eligible after initial full text screening. Forward citation searches were conducted until August 8, 2022 using PubMed and Scopus.

Selection process

After search completion and deduplication, two authors (SOKC and NA) carried out independent title and abstract screening according to the predefined eligibility criteria using the online software Rayyan82. Diverging decisions were resolved unanimously after discussion with up to two additional authors (SP and AJS).

After title/abstract screening, we assessed full texts of all potentially eligible articles against the eligibility criteria57. Whenever an inclusion criterion was not met, we stopped the screening of the respective full text and excluded the systematic review. One author (SOKC) conducted the full text screening of all systematic reviews. A second author (NA) independently double-checked all exclusion decisions. Diverging decisions were resolved unanimously with up to two additional authors (AJS and SP) included in the discussion.

Data collection/extraction process and data items

One author (SOKC or NA) extracted data from all eligible articles after full-text screening using a predefined and pretested extraction form79. A second reviewer (NA or AJS) double-checked the extracted data. Conflicts were resolved unanimously, where necessary after discussion with a third reviewer (AJS). We extracted the following information from the included reviews: general information about the review (e.g., publication date, number of included studies), pooled population characteristics, app characteristics, comparators, outcomes, subgroup analyses, authors’ narrative conclusions on overall efficacy/effectiveness of app-based interventions.

Methodological quality assessment

We used the AMSTAR2 tool (A MeaSurement Tool to Assess systematic Reviews 2; see Supplementary Note 1) to evaluate the methodological quality of all included systematic reviews83. AMSTAR2 covers 16 domains, of which seven are considered critical. Critical domains are deemed especially influential for review validity and include protocol pre-registration (item 2), literature search strategy (item 4), list of and justification for excluded studies (item 7), risk of bias assessment (item 9), meta-analytical methods (item 11), consideration of risk of bias in results interpretation (item 13) and assessment of the presence and likely impact of publication bias (item 15).

Each included review was rated for adequacy on each domain as either “Yes”, “No”, or “Partial Yes” (available only for domains 2, 4, 7, 8, and 9). For articles that did not conduct meta-analyses, items 11, 12 and 15 were rated “Not Applicable”. Fulfillment of each domain across the different reviews was illustrated using a table and heat map. Based on these domains, we also assigned a summary quality rating to each review: “critically low” (more than one “no” rating on critical domains), “low” (exactly one “no” rating on a critical domain), “moderate” (no “no” ratings on critical domains but more than one on non-critical domains) or “high” (no “no” ratings on critical domains and at most one on a non-critical domain). Quality appraisals were conducted in duplicate by two review authors (SOKC and AJS) and diverging ratings were resolved through discussion.
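For illustration, a minimal sketch of this summary rating logic is given below, assuming the set of critical domains listed above; the function and variable names are our own and are not part of the AMSTAR2 tool itself.

```python
# Minimal sketch of the overall quality rating logic described above.
# Critical domains follow the items named in the text (2, 4, 7, 9, 11, 13, 15).
CRITICAL_DOMAINS = {2, 4, 7, 9, 11, 13, 15}

def overall_rating(ratings: dict[int, str]) -> str:
    """ratings maps AMSTAR2 item number (1-16) to 'Yes', 'Partial Yes', 'No'
    or 'Not Applicable'; a 'No' counts as a weakness/flaw."""
    critical_flaws = sum(1 for item, r in ratings.items()
                         if item in CRITICAL_DOMAINS and r == "No")
    non_critical_weaknesses = sum(1 for item, r in ratings.items()
                                  if item not in CRITICAL_DOMAINS and r == "No")
    if critical_flaws > 1:
        return "critically low"
    if critical_flaws == 1:
        return "low"
    if non_critical_weaknesses > 1:
        return "moderate"
    return "high"

# Hypothetical example: a single critical flaw (item 7) yields a "low" rating.
example = {i: "Yes" for i in range(1, 17)}
example[7] = "No"
print(overall_rating(example))  # -> low
```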

Inter-rater reliability

IRR was calculated using Cohen’s Kappa (κ) for title and abstract screening, full-text screening, and the methodological quality assessment (overall and item-specific).
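For illustration, the sketch below computes Cohen’s κ from two raters’ screening decisions; the decision data shown are hypothetical and not the study’s actual screening records.

```python
# Illustrative Cohen's kappa for two raters' include/exclude decisions
# (hypothetical data, not the study's screening records).
from collections import Counter

def cohens_kappa(rater_a: list[str], rater_b: list[str]) -> float:
    assert len(rater_a) == len(rater_b)
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n      # observed agreement
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    categories = set(counts_a) | set(counts_b)
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / n**2  # chance agreement
    return (observed - expected) / (1 - expected)

a = ["include", "exclude", "exclude", "include", "exclude", "exclude"]
b = ["include", "exclude", "include", "include", "exclude", "exclude"]
print(round(cohens_kappa(a, b), 4))  # -> 0.6667 for this hypothetical example
```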

Data synthesis

We provide narrative summaries and graphical representations of publication years, population characteristics, types of underlying conditions, types of interventions and types of outcomes assessed. As data from the same RCT may have contributed to the pooled effect estimates of more than one included systematic review, and due to the high expected heterogeneity of diseases and outcomes covered in the systematic reviews, a meta-analysis pooling systematic review results was neither planned nor performed.