Replying to S. van der Linden & M. H. Goldberg, Nature Communications https://doi.org/10.1038/s41467-020-17613-7 (2020)
The point made by van der Linden and Goldberg1 about the impact of variance estimators is valid. Random-effects meta-analysis requires an estimate of heterogeneity, and the choice of estimator matters because it influences the calculation of the pooled effect size2,3. However, there is no universally superior estimator. Whether an estimator appears more or less biased in simulation studies depends on a variety of parameters, such as the number of studies included, the number of participants per study (n), and how much this n varies from study to study3,4,5.
The DerSimonian–Laird (DL) estimator is the most popular estimator in the literature2,3,4,5, implemented by default in multiple meta-analysis software packages4. Its use therefore facilitates replication by a wide range of audiences with varying levels of technical expertise. Nonetheless, the DL estimator has been challenged when heterogeneity is moderate to high, when a meta-analysis includes only a few studies, and/or when it examines binary outcomes3,5,6. Yet DL has been shown to perform adequately when the number of studies in the meta-analysis is moderate to large (>~30) and when estimating continuous outcomes4,6,7, consistent with our data.
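For readers unfamiliar with the estimator, the DL procedure can be stated in a few lines: a method-of-moments estimate of the between-study variance tau² derived from Cochran's Q, followed by inverse-variance pooling with the tau²-inflated weights. The sketch below is purely illustrative, in Python with hypothetical effect sizes and variances (our actual analyses were run in Stata and R, as described in the code availability note).

```python
import numpy as np

def dersimonian_laird(y, v):
    """Random-effects pooled estimate with the DerSimonian-Laird tau^2.

    y : study effect sizes (e.g., Cohen's d)
    v : their within-study sampling variances
    """
    y, v = np.asarray(y, float), np.asarray(v, float)
    w = 1.0 / v                                   # fixed-effect weights
    y_fe = np.sum(w * y) / np.sum(w)              # fixed-effect pooled mean
    q = np.sum(w * (y - y_fe) ** 2)               # Cochran's Q
    k = len(y)
    c = np.sum(w) - np.sum(w ** 2) / np.sum(w)
    tau2 = max(0.0, (q - (k - 1)) / c)            # DL between-study variance
    w_re = 1.0 / (v + tau2)                       # random-effects weights
    pooled = np.sum(w_re * y) / np.sum(w_re)
    se = np.sqrt(1.0 / np.sum(w_re))
    return pooled, se, tau2
```

Note that when Q falls below its degrees of freedom (k − 1), tau² is truncated at zero and the estimate coincides with the fixed-effect result.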
Van der Linden and Goldberg note that the high heterogeneity associated with the overall estimate and some subgroup analyses requires additional attention. We follow their valid suggestion and present overall effect sizes comparing multiple estimators (Table 1). These authors also present some alternative estimates, but fail to discuss important nuances in their interpretation.
Results show that the overall estimate ranges from d = −0.058 to d = −0.219, depending on the estimator used. The Hunter–Schmidt (HS) and Sidik–Jonkman (SJ) estimators tend to produce, respectively, negatively and positively biased estimates compared with other estimators4. Maximum likelihood (ML) and restricted maximum likelihood (REML) have good properties when heterogeneity is high and the size of studies varies considerably5, which applies well to our case. The empirical Bayes estimator (EB, equivalent to Paule–Mandel) has been recommended as an alternative to DL when the number of studies is small and heterogeneity is moderate to high3,4,5. Nevertheless, when a meta-analysis combines small and large studies (as is our case), EB tends to produce a positive bias compared with DL and REML7.
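To make the EB/Paule–Mandel comparison concrete: that estimator chooses tau² so that the generalised Q statistic, computed with random-effects weights, equals its degrees of freedom (k − 1). Since Q decreases monotonically in tau², the solution can be found by bisection. A minimal Python sketch on hypothetical data (our analyses used Stata and R):

```python
import numpy as np

def paule_mandel(y, v, tol=1e-8, max_iter=200):
    """Paule-Mandel (empirical Bayes) tau^2: solve Q_gen(tau^2) = k - 1
    by bisection, where Q_gen uses weights 1 / (v + tau^2)."""
    y, v = np.asarray(y, float), np.asarray(v, float)
    k = len(y)

    def q_gen(tau2):
        w = 1.0 / (v + tau2)
        mu = np.sum(w * y) / np.sum(w)
        return np.sum(w * (y - mu) ** 2)

    if q_gen(0.0) <= k - 1:          # no excess heterogeneity: truncate at 0
        return 0.0
    lo, hi = 0.0, 1.0
    while q_gen(hi) > k - 1:         # expand upper bracket until Q < k - 1
        hi *= 2.0
    for _ in range(max_iter):        # bisection: Q_gen is decreasing in tau^2
        mid = 0.5 * (lo + hi)
        if q_gen(mid) > k - 1:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)
```

The truncation at zero mirrors the DL estimator; the two differ in which Q equation is solved, which is why they can diverge when small and large studies are mixed.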
Table 1 shows that several estimators produce an overall effect size higher than what we reported. Does this mean our conclusions are wrong? No—our conclusions hold and here is why.
Firstly, our primary conclusion is that behavioural interventions have a very small average effect size. This conclusion is grounded in Cohen's guidelines8, which are arbitrary to some extent but are followed in the absence of more objective standards. All effect sizes below d = 0.2 are interpreted as very small9: if two groups' means differ by less than 0.2 standard deviations, the difference between them is trivial, even if statistically significant9,10. The highest estimates across the variety of estimators reach, at best, the threshold of the small-effect classification (which ranges from d = 0.20 to d = 0.49)9. Moreover, the estimators that reach this threshold are associated with positive bias when there are large differences in study sizes (which is our case). We note that Cohen's d does not range from 0 to 1 but from 0 to infinity, which puts the small differences between estimators in perspective. These results can also be compared with more intuitive measures of effect size. A Cohen's d = 0.2 is equivalent to r = 0.1 (ref. 10), and it is difficult to argue that correlations of r ≤ 0.1 are meaningful. Under several estimators, the probability of benefit11 doubles from 7% to 14%, equivalent to a probability of superiority10 changing from 53% to 56%, not much better than the flip of a coin.
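The conversions above can be checked directly with the standard formulas: for equal group sizes, r = d/√(d² + 4), and the probability of superiority is Φ(d/√2), where Φ is the standard normal CDF. A quick Python verification (standard textbook formulas, not code from our analyses):

```python
from math import sqrt, erf

def d_to_r(d):
    """Cohen's d to point-biserial r, assuming equal group sizes."""
    return d / sqrt(d ** 2 + 4)

def prob_superiority(d):
    """Common-language effect size: P(random treated > random control)
    = Phi(d / sqrt(2)) for two unit-variance normals shifted by d."""
    z = d / sqrt(2)
    return 0.5 * (1 + erf(z / sqrt(2)))

print(round(d_to_r(0.2), 2))             # 0.1
print(round(prob_superiority(0.2), 2))   # 0.56
```

As stated in the text, d = 0.2 corresponds to r ≈ 0.1 and to a roughly 56% chance that a randomly chosen treated individual outperforms a randomly chosen control.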
Secondly, the idea that such small effects become meaningful when scaled to the population level is misleading. Interventions scaled up to the general population will target a more heterogeneous set of individuals with respect to their motivation to behave pro-environmentally, compared with the small-scale interventions in which people self-select to participate (ref. 12, pages 4 and 10), and which represent more than half our sample. This suggests that effect sizes would likely approach our estimates for naïve subjects (i.e., no self-selection) (d = −0.040, 95% CI −0.103 to −0.016).
Thirdly, van der Linden and Goldberg fail to note that random-effects meta-analysis gives relatively more weight to small studies when pooling the overall effect9. Yet small studies are prone to bias: they tend to be published only when they reach significance, and significance in small studies requires large effect sizes. This is a noteworthy concern because small studies (n ≤ 100) correspond to 57% of our total estimates (k = 82). We made this point in the original paper12, reporting a substantial small-studies bias13 and showing that an analysis restricted to the more robust samples (moderate and large studies) produces overall effect sizes well below d = 0.2. Table 2 shows that this conclusion holds across estimators.
Lastly, our secondary conclusions also hold. We discouraged the isolated use of information-based interventions, and its very small effect size (below d = 0.1) holds across estimators. We encouraged the use of nudges (firstly) and social comparison (secondly). The estimates for nudges hold across estimators because between-studies heterogeneity is low (I2 = 0). In the case of social norms, there is high between-studies heterogeneity (I2 = 72.2%), generating variation between estimators. However, high heterogeneity often originates from substantial differences in sample sizes between the pooled studies14. Social norms are another good example of a strong, statistically significant difference between small studies (d = −0.387, n ≤ 100, k = 13) and medium/large studies (d = −0.036, n > 100, k = 19) (meta-regression t-test = 5.21, p < 0.001).
Fundamentally, in the case of high heterogeneity, DL provided more conservative estimates than other good alternatives, and DL estimates leaned closer to those obtained by restricting the analysis to the more robust samples (larger studies and naïve subjects), which are less likely to be biased.
In summary, our conclusions hold because most overall effect sizes produced by different estimators are still interpretable as very small or, at best, borderline small. Furthermore, there is a substantial small-studies bias that pushes many estimates up. Nevertheless, discussions about interventions and their (accurate) effect sizes are imperative and timely. The U.N. declared the next 10 years the Decade of Action for climate change, to deliver on the 2030 promises15, motivated by the awareness that action is not advancing at the speed or scale required, and calling for interventions to step up their impact.
Stating that effect sizes in psychology are known to be small should not be used as a justification to inflate the meaningfulness of (very) small effects. A thoughtful debate beyond statistical significance is long overdue to make psychological and behavioural science more relevant to intervention and policy-making.
We provide a Source Data File that is publicly available at Figshare [https://doi.org/10.6084/m9.figshare.9641999].
All commands used in Stata are publicly available. We used the metaan package (https://www.stata-journal.com/article.html?article=st0201): metaan smd se, dl (replacing dl with pe or ml for the alternative estimators). These analyses can be replicated in R using metagen or rma, and the R code is publicly available at Figshare [https://doi.org/10.6084/m9.figshare.9641999]. The code is the same as presented at Figshare; the only change needed between estimators is in method.tau = "DL", where "DL" is replaced by the abbreviation of the alternative estimator, e.g., "ML", "REML", or "EB".
1. Van der Linden, S. & Goldberg, M. Alternative meta-analysis of behavioural interventions to promote action on climate change yields different conclusions. Nat. Commun. https://doi.org/10.1038/s41467-020-17613-7 (2020).
2. Bakbergenuly, I., Hoaglin, D. C. & Kulinskaya, E. Estimation in meta-analyses of mean difference and standardized mean difference. Stat. Med. 39, 171–191 (2020).
3. Langan, D., Higgins, J. P. & Simmonds, M. Comparative performance of heterogeneity variance estimators in meta-analysis: a review of simulation studies. Res. Synth. Methods 8, 181–198 (2017).
4. Veroniki, A. A. et al. Methods to estimate the between-study variance and its uncertainty in meta-analysis. Res. Synth. Methods 7, 55–79 (2016).
5. Veroniki, A. A. et al. Methods to calculate uncertainty in the estimated overall effect size from a random-effects meta-analysis. Res. Synth. Methods 10, 23–43 (2019).
6. Jackson, D., Bowden, J. & Baker, R. How does the DerSimonian and Laird procedure for random effects meta-analysis compare with its more efficient but harder to compute counterparts? J. Stat. Plan. Inference 140, 961–970 (2010).
7. Langan, D. et al. A comparison of heterogeneity variance estimators in simulated random-effects meta-analyses. Res. Synth. Methods 10, 83–98 (2019).
8. Cohen, J. Statistical Power Analysis for the Behavioral Sciences (Erlbaum, Hillsdale, NJ, 1988).
9. Lipsey, M. W. & Wilson, D. B. Practical Meta-Analysis (Sage Publications, Thousand Oaks, CA, 2001).
10. Fritz, C. O., Morris, P. E. & Richler, J. J. Effect size estimates: current use, calculations, and interpretation. J. Exp. Psychol. Gen. 141, 2 (2012).
11. McGraw, K. O. & Wong, S. P. A common language effect size statistic. Psychol. Bull. 111, 361 (1992).
12. Nisa, C. F. et al. Meta-analysis of randomised controlled trials testing behavioural interventions to promote household action on climate change. Nat. Commun. 10, 4545 (2019).
13. Lin, L., Shi, L., Chu, H. & Murad, M. H. The magnitude of small-study effects in the Cochrane Database of Systematic Reviews: an empirical study of nearly 30 000 meta-analyses. BMJ Evid. Based Med. 25, 27–32 (2020).
14. Rücker, G., Schwarzer, G., Carpenter, J. R. & Schumacher, M. Undue reliance on I2 in assessing heterogeneity may mislead. BMC Med. Res. Methodol. 8, 79 (2008).
15. United Nations (UN). Decade of Action. https://www.un.org/sustainabledevelopment/decade-of-action/ (2020).
The authors declare no competing interests.
Peer review information: Nature Communications thanks Julian Higgins for the contribution to the peer review of this work.