To the Editor — I appreciate the opportunity to comment on the Social Science Replication Project (SSRP)1. I thank the team for their attention to the important issue of reproducibility in social science. The work being done on this issue, including the current project, is vital for improving the health of scientific fields. The project team replicated the first study from our paper2, producing mixed evidence for the central effect: the stage 1 attempt replicated, but the stage 2 attempt did not; the meta-analytic estimate was significant, though again, certain complementary analytic procedures did not support the effect. I believe the team endeavoured to reproduce the original study procedure as exactly as possible, despite some hiccups along the way. Most notably, a second replication study was run in error (featuring changes in sample characteristics and a larger number of participants excluded). This error spoke to a more general concern I had about the SSRP report.

Specifically, there are many ways to evaluate the reproducibility of findings. The current method isolated individual studies and replicated each of them once if the first attempt succeeded, and twice otherwise. My concern is that conclusions drawn from this approach may sometimes extend beyond its inherent limitations. To be sure, the report provides an illuminating look at research published in two top scientific journals, and the prediction market and complementary replicability indicators are extremely useful for understanding (and learning about) how we can evaluate effects. But does focusing on one or two replications of a variety of studies provide that much value?

Consider the unique case of how our study was reproduced. During the first attempt to replicate our study, the analyses contained an error, which led the replication team to incorrectly conclude that the replication had failed. They therefore conducted a stage 2 study, at which point the original error became clear. If the stage 1 replication analyses had originally been conducted correctly, the team would have concluded that the effect replicates and no second replication study would have been run. Yet this second study, run in error, did not find sufficient evidence of replication. Presumably, a similar replicated/failed-to-replicate pattern could characterize any of the other investigations that terminated at stage 1 (11 of the studies). We cannot know this using a procedure like the one employed here. If we want to gain meaningful information about the reproducibility of specific effects, my guess is that the current procedure will not help much. A better approach may be for many groups to reproduce a single study many times3. In the meantime, given the variation in reproducibility across stages for other studies in the SSRP, it may be reasonable to give more weight to the meta-analytic effects than to single replication failures (though the team's point about publication bias is certainly relevant).
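To make the statistical point concrete, the sketch below (my own illustration, not part of the SSRP analyses or our original study) simulates a scenario in which a true effect exists and asks how often a single replication attempt comes up non-significant, compared with a fixed-effect meta-analysis pooling several attempts. The assumed effect size (d = 0.4), per-group sample size (n = 100, roughly 80% power for that effect), number of attempts (5) and 0.05 threshold are arbitrary choices made purely for illustration.

```python
# Illustrative simulation (assumptions are mine, chosen for exposition only):
# how often does a single replication of a TRUE effect look like a "failure",
# versus a fixed-effect meta-analysis pooling several independent attempts?
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
true_d = 0.4        # assumed true standardized effect
n_per_group = 100   # assumed per-group n (~80% power for d = 0.4)
n_attempts = 5      # assumed number of independent replication attempts
n_sims = 5_000

single_fail = 0     # first (and only) attempt is non-significant
meta_fail = 0       # pooled estimate across all attempts is non-significant
for _ in range(n_sims):
    d_hats, variances, p_values = [], [], []
    for _ in range(n_attempts):
        a = rng.normal(true_d, 1.0, n_per_group)
        b = rng.normal(0.0, 1.0, n_per_group)
        _, p = stats.ttest_ind(a, b)
        pooled_sd = np.sqrt((a.var(ddof=1) + b.var(ddof=1)) / 2)
        d_hat = (a.mean() - b.mean()) / pooled_sd
        # approximate sampling variance of Cohen's d for equal group sizes
        var_d = 2 / n_per_group + d_hat**2 / (4 * n_per_group)
        d_hats.append(d_hat)
        variances.append(var_d)
        p_values.append(p)
    if p_values[0] >= 0.05:
        single_fail += 1
    w = 1 / np.array(variances)                  # inverse-variance weights
    d_meta = np.sum(w * np.array(d_hats)) / w.sum()
    se_meta = np.sqrt(1 / w.sum())
    if 2 * stats.norm.sf(abs(d_meta / se_meta)) >= 0.05:
        meta_fail += 1

print(f"single replication 'fails' in {single_fail / n_sims:.1%} of simulations")
print(f"meta-analysis of {n_attempts} attempts 'fails' in {meta_fail / n_sims:.1%}")
```

Under these assumptions, standard power arithmetic implies that roughly one in five single attempts should be non-significant even though the effect is real, whereas the pooled estimate across five attempts should essentially always be significant; the actual SSRP designs and effect sizes differ, but the qualitative contrast is the point.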

I raise this concern primarily because I increasingly hear problematic conclusions drawn by colleagues, students and the public following projects like the SSRP. People sometimes presume that one failure to replicate is the final word on an idea rather than one point within an ongoing conversation. My suspicion is that highlighting a conclusion such as ‘X% of studies failed to replicate’ after only one or two attempts unfortunately contributes to this problem.

Of course, the SSRP team is surely aware of such issues. Disagreements about approach are bound to emerge as the field sorts through the best practices needed to improve our science, and the current project is an important step along this path.