Main

After more than a decade of meta-research and debate, the life and social sciences are well in the midst of a credibility revolution1,2,3. Faced with evidence of publication bias4,5,6,7, hindsight bias and selective reporting8,9,10,11,12,13,14,15, insufficient sample sizes16, inadequate data sharing17 and suboptimal rates of both attempted18,19 and successful replication20,21,22, researchers from across a broad range of fields are unifying around a core mission to improve reproducibility and transparency. In doing so, they serve the deeper aim of the openness agenda: to stimulate cultural reform, aligning what is beneficial for individual scientists with what is beneficial for science23.

As scientists and policymakers grapple with the causes of irreproducibility, it has become clear that one of its main drivers is the so-called ‘results paradox’. On the one hand, scientists are taught from their earliest years that the one part of the research process that they must keep at arm’s length is the results of their research. The objective investigator—the detective—follows the data with discipline and restraint, never pressuring it to bend to their will, lest they fall prey to Richard Feynman’s famous warning that “the first principle is that you must not fool yourself and you are the easiest person to fool” (p. 12)24. On the other hand, the very same researcher is sent a message from prestigious journals, funding agencies and evaluation committees that if you want to succeed in science, be sure to publish a lot of clear, novel, positive findings. Researchers are therefore presented with conflicting goals: be a good detective who never indulges in data massaging or cherry picking, but also be a good lawyer who wins arguments and produces a continual supply of beautiful results25.

Many observed problems with reproducibility stem from researchers attempting to resolve this paradox while protecting their careers. When publishing in prestigious journals requires confirming one’s hypotheses but the results defy expectations, researchers can resolve this conflict by following the advice of Bem26,27 and rewriting their hypotheses to ‘predict’ those results—a form of hindsight bias known as “hypothesizing after results are known” (HARKing)8. When academic leaders tell authors to “go with strongest studies” because “weak data dilute strong data” (p. 79)28 and that “what you don’t have to do is tell the whole truth ... you can select the results you present”29, the responsive researcher answers by reporting the analyses that tell the best story, diverting negative or inconvenient results to the file drawer or converting them into publishable (probably false) positives. And when journal editors tell researchers that, all else being equal, some results are simply more deserving of publication than others, the strategic researcher responds by conducting a large number of small studies and reporting only the most persuasive findings (even if unreliable) rather than gambling on the outcome of larger, more definitive projects that may yield inconclusive data30.

Registered Reports (RRs) were proposed in 2012 as a way to free researchers from the pressure to engage in these counterproductive practices, thereby breaking the cycle that perpetuates bias and irreproducibility (Fig. 1). The RR model originates from the simple philosophy that to defeat the distorting effects of outcome bias on science, we must focus on the process and blind the evaluation of science to research outcomes31. This blinding is achieved by splitting peer review into two stages. In the first stage, authors submit their research question(s), theory, hypotheses, detailed methods, analysis plans and any preliminary data as needed. Following detailed review and revision—usually according to specific criteria—proposals that are favourably assessed receive in principle acceptance (IPA), which commits the journal to publishing the final paper regardless of whether the hypotheses are supported, provided that the authors adhere to their approved protocol and interpret the results in line with the evidence. Following IPA, authors then typically register their approved protocol in a repository, either publicly or under a temporary embargo. Then, after completing the research, they submit a stage 2 manuscript that includes the approved protocol plus the results and discussion, which may include clearly labelled post hoc analyses in addition to the preregistered outcomes (that is, findings from both confirmatory and exploratory analyses). The reviewers from stage 1 and/or newly invited reviewers then assess the completed stage 2 manuscript, focusing on compliance with the protocol and whether the conclusions are justified by the evidence. Crucially, reviewers do not relitigate the theory, hypotheses or methods, thereby preventing knowledge of the results from influencing recommendations. RR guidelines specify that editors similarly cannot reject a manuscript on the basis of any new concerns about the methodology or rationale or on the basis of the results themselves.

Fig. 1: The basics of RRs.

a, The typical RR workflow involves pre-study review of the study rationale, design and proposed analyses, including preliminary data as needed to provide proof of concept, effect size estimation for a sampling plan or hypothesis generation. Following IPA, authors conduct the research before submitting a complete manuscript with the results. The original reviewers then return at stage 2 to assess compliance with the protocol and to ensure that the conclusions are appropriately evidence-based. b, The distinction between RRs, Registered Replication Reports and study preregistration is a common source of confusion. Aside from the minority of RR formats that do not require public preregistration (see the section “Limitations and drawbacks”), RRs are mostly a subset of the wider family of preregistration methods but with the additional features of pre-study review and IPA regardless of results. Registered Replication Reports, offered by one psychology journal, are a subset of RRs. Panel a reproduced with permission from https://cos.io/rr/.

The aim of this modified review process is to reduce as much as possible the potential for biased research practices such as HARKing and selective reporting, while also eliminating the incentive for researchers to employ such practices in the first place. RRs are also designed to mitigate publication bias by journals and outcome bias by reviewers, since the decision to accept or reject is made before results are known. Finally, the format is designed to clearly distinguish the outcomes of pre-planned confirmatory research from exploratory data analysis.

In this Review, we take stock of the RR initiative, consider its recent history and historical underpinnings, emerging variants, impacts and limitations, and the probable future of the format into the 2020s and beyond. We also offer guidance to authors, reviewers and editors who are becoming familiar with RRs.

Past

RRs as they exist today were first proposed in 2012 independently and simultaneously at two journals: Cortex and Perspectives on Psychological Science32,33. The format was then formally offered at these journals and at Social Psychology in 2013 (refs. 34,35,36). These first steps precipitated a gradual rise in adoptions, with over 300 journals across a range of disciplines now offering RRs as a new article type (Fig. 2). Early launches triggered the rise of RRs into mainstream publishing, but the origins of the format, and of preregistration in general, are much older. As early as 1878, chemist and logician Charles Peirce laid the foundations for the preregistration of protocols, writing that “[t]he hypothesis should be distinctly put as a question, before making the observations which are to test its truth” (p. 476)37. In the mid-twentieth century, psychologist Adriaan de Groot further argued that distinguishing exploratory from confirmatory research was vital for scientific progress, and that “it is a serious offense against the social ethics of science to pass off an exploration as a genuine testing procedure” (see Wagenmakers et al.38 for a detailed historical overview). Embedded in the arguments of Peirce, de Groot and many others is the maxim that prespecifying predictions and analyses is an important tool for preventing confirmation bias in hypothesis testing.

Fig. 2: Key milestones in the evolution of RRs.

RRs in their current form were first introduced in 2012 and Cortex was the first journal to officially offer RRs as an article type, in 2013. The first RRs were published in the journals Social Psychology (Soc. Psychol.) and Perspectives on Psychological Science (Perspect. Psychol. Sci.). The first journal to be exclusively dedicated to RRs, Comprehensive Results in Social Psychology (CRSP), was also launched in 2014. In 2015, Cortex published its first RR110, and Royal Society Open Science (RSOS) became the first multidisciplinary journal covering all STEM fields to offer RRs111. Also in 2015, nine political science journals launched a joint RR project for the 2016 American National Election Studies survey (https://electionstudies.org/data-center/2016-time-series-study/), marking the first application of RRs in political science. The number of RR-adopting journals further increased in 2017, which was a key year on several fronts. As part of the Reproducibility Project: Cancer Biology, eLife published the first of many RRs112, and the first RR format for clinical trials was launched by BMC Medicine. The first RR in the field of computer science was also published in RSOS113, and the format was introduced for the first time in a specialist ecology journal (BMC Ecology). In the same year, Nature Human Behaviour launched RRs114, and F1000Research and Meta-Psychology paved the way for the post-publication peer-review model for RRs. The first RR funder/journal partnership was also announced in 2017 (ref. 90). By the end of 2018, the number of adopting journals had risen to 150, and the 100th stage 2 RR was published across all journals. This increase in the number of adopters paralleled a major disciplinary expansion, with the format being applied to preclinical science (BMJ Open Science), economics (Journal of Development Economics), empirical accounting (Journal of Accounting Research), animal neurophysiology (European Journal of Neuroscience), cancer research (Cancer Medicine), immunology, endocrinology, gastroenterology, herpetology and agricultural/soil sciences. In 2018, the British Psychological Society became the first society to launch RRs concurrently across all of its journals105. In 2019, PLoS Biology became the 200th adopter of RRs, Nature Human Behaviour published its first two RRs and the format was launched for the first time in the field of veterinary science (Equine Veterinary Journal). In 2020, RSOS and 11 journals launched the COVID-19 RR rapid review network82. As part of this ongoing initiative, participating journals strive to review stage 1 RRs related to COVID-19 within 7 days and to commit to open access publication with no article processing charges. As a result of this initiative, the past year also marked the first published RR in viral bioinformatics85.

Perhaps the earliest proposal for an RR-type review process, in which journal editors reach editorial decisions based on pre-study or results-blind review, was advanced by psychologist Robert Rosenthal who wrote in 1966: “What we may need is a system for evaluating research based only on the procedures employed. If the procedures are judged appropriate, sensible, and sufficiently rigorous to permit conclusions from the results, the research cannot then be judged inconclusive on the basis of the results and rejected by the referees or editors” (p. 36)39. Similar ideas were proposed throughout the 1970s and 1980s40,41,42,43,44 but were not widely implemented. Yet, remarkably and unknown to mainstream science, by 1976, the first RR format had already been launched, albeit in the fringe discipline of parapsychology. For 17 years, the European Journal of Parapsychology quietly published RRs alongside regular articles before discontinuing them in 1992 (ref. 45).

While non-clinical researchers were debating the potential merits of results-blind review, medical researchers were busy weighing up the costs and benefits of public preregistration to address publication bias, particularly in the context of clinical trials. With the US Food and Drug Administration Modernization Act of 1997 came the first law requiring trial (pre)registration, which in turn led to the launch of the ClinicalTrials.gov registry in 2000. By 2005, the International Committee of Medical Journal Editors was requiring trial registration as a condition of journal publication. An increasing number of journals began offering protocol article types (and in some cases, entire journals were dedicated to protocols), with some performing pre-study review of the protocols (for example, Trials, and The Lancet’s since-abandoned protocol review format46).

Crucially, none of these initiatives, article types or journals provided IPA regardless of the results. Thus, despite decades of debate about preregistration, pre-study review and results-blind acceptance within isolated channels, it would take until the launch of RRs at Cortex and Perspectives on Psychological Science in 2013 for the combined model to take hold. From there, the next 2 years witnessed a gradual increase in the number of psychology journals adopting RRs, followed by the first general science, technology, engineering and mathematics (STEM) journal in 2016 (Royal Society Open Science; Fig. 2). After a series of major launches (for example, Nature Human Behaviour, BMC Medicine and PLoS Biology) and broader disciplinary expansion throughout 2017–2018, the RR format permanently entered the mainstream.

Present

At the time of writing, RRs are offered by over 300 journals, with 591 stage 2 articles so far published by 94 adopting outlets. As the format becomes more widely available and the published corpus grows, the first signs of impact are emerging. In this section, we review some of the early signs of its effectiveness. We also introduce recent variants of the model, summarize the key ingredients that make a high-quality RR and address some major misconceptions (Table 1) and genuine limitations that have emerged, before offering specific recommendations to authors (Box 1), reviewers (Box 2) and editors (Box 3).

Table 1 Misconceptions and realities of RRs

Field spread and author demographics

Since their initial launch within psychology and neuroscience, RRs have spread to specialist journals covering a range of disciplines, primarily in the life and social sciences (Supplementary Fig. 1). As this reach has grown, we can begin to explore the demographics of submitting authors to gauge the accessibility of the format. The prospect of being pre-accepted at a respectable journal is likely to be appealing to many researchers—and perhaps early-career researchers (ECRs) in particular—seeking to eliminate the risk that the results of their research will determine publication and, consequently, their career prospects. However, the often substantial sample sizes needed for RRs to achieve minimum levels of statistical power required by many journals, combined with the time taken for stage 1 review (see the section “Limitations and drawbacks“), could act as a deterrent, especially for researchers with major resource constraints. To provide a preliminary insight into accessibility, we analysed the author demographics of 141 stage 1 RRs submitted to Cortex, the European Journal of Neuroscience, NeuroImage and Royal Society Open Science. We found that 77% of submitted stage 1 manuscripts were first-authored by PhD students or postdoctoral researchers (Supplementary Fig. 2a). At the journal Cortex, where a direct comparison between different article types was possible, we found that 78% of submitted RRs were led by ECRs compared with 67% in a comparison sample of regular articles (Supplementary Fig. 2b,c). It would be premature to conclude that RRs present no barriers for researchers47, but these results at least provide no reason to fear that RRs are beyond the reach of ECRs.

Early impacts

Are RRs working as intended to reduce bias and improve reliability? Although the initiative is too young to answer this question with confidence, metascientific investigations are beginning to reveal early evidence concerning bias control, study quality, computational reproducibility and citation influence.

Bias control

Since reporting and publication biases typically favour positive results, RRs, if successful, should yield a greater proportion of negative results compared with the conventional literature. So far, this prediction appears to hold: a recent analysis of 296 hypotheses published across 127 RRs in different fields found that 60% of RRs report null results, which is approximately five times greater than the rate in regular articles48. In psychology, this difference is even more striking, with a new study49 finding that just 4% of regular articles failed to confirm the first hypothesis compared with 56% for RRs4 (see also Wiseman et al.45). It would be tempting to conclude that this increase is caused by the elimination of selective reporting, HARKing and publication bias resulting from pre-study review and IPA, which are the key ingredients of the RR process. However, it is possible that authors, knowing that their study will be published regardless of whether their hypotheses are supported, might employ the RR format to test riskier hypotheses. Moreover, RRs themselves might select for authors who are diligent in controlling their own reporting bias, regardless of the article type. To address such confounds, future observational studies could compare the plausibility of hypotheses in RRs with that in non-RRs, as well as indicators of biased reporting in RRs and non-RRs within the same sets of authors (see the section “Future”).

Computational reproducibility

There are several reasons why RRs might be more computationally reproducible than conventional articles. At many journals, the RR review policy has more stringent expectations concerning open data and code, which is associated with greater accuracy in statistical reporting50. In addition, IPA eliminates the incentive for authors to conceal messy or inconvenient elements of their data, and early adopters of the format may also be predisposed to performing research to a higher level of transparency. A recent study51 indeed suggests that the results of RRs can be more readily reproduced from the acquired data compared with regular articles. Of the 35 RRs published in psychology that made data and code openly available, 57% were computationally reproducible compared with 31% in a previous analysis of regular articles52. Although RRs appear to perform better than the status quo, these results clearly show room for improvement and will require substantially more data to be confirmed.

Citation profile

Clinical trials reporting negative results receive between two and ten times fewer citations than trials reporting positive results53,54. Given the increased rate of negative results in RRs, authors may therefore be concerned that submitting an RR could be disadvantageous to their careers. Similarly, one of the immediate reactions to RRs from many journal editors is that the format could risk reducing their outlet’s impact factor, a powerful albeit spurious measure of research influence55,56,57,58. In fact, such concerns may be unwarranted; a recent analysis59 of 70 RRs, reported in a non-peer-reviewed preprint, found that RRs are cited at the same rate as, or slightly more often than, comparable regular articles.

Study quality

How do expert assessments of RRs compare with non-RRs? In a recent study60, Soderberg et al. reported an experiment in which 353 scientists rated a sample of published, partially blinded RRs and non-RRs on 19 study characteristics, including importance, novelty, creativity, innovation and rigour. RRs numerically outperformed non-RRs on every criterion, showing statistically robust and large improvements in attributes such as methodological rigour and overall article quality, while being statistically indistinguishable from comparison papers in terms of features such as novelty and creativity. These results held even among reviewers who admitted being sceptical or neutral about RRs.

Emerging variants

As the reach of RRs has grown, several modified versions of the format have arisen to accommodate specific needs. Five major strands have emerged: results-blind review, accountable replication policies, RRs involving post-publication peer review, publisher-level RRs and publisher-independent RRs. These variants are briefly summarized below and discussed in more detail in the Supplementary Note.

Results-blind review

In this modified workflow, stage 1 peer review is undertaken after results are known to the authors but before they are known to the reviewers and editors61. Following IPA, the authors then submit the full manuscript containing the data and conclusions. Because authors need not wait until IPA to conduct their research, this format prevents the stage 1 review time from delaying data collection and analysis. However, reviewers are unable to improve the study design, and the format does not prevent reporting bias (for example, p-hacking or HARKing) by authors. To date, at least 13 journals, primarily in psychology and management, have adopted results-blind review as an optional article track. Two journals, Cortex and Infant and Child Development, have also launched Verification Reports, a results-blind format dedicated to assessing the computational reproducibility and robustness of previous findings based on a re-analysis of the original study data.

Accountable replications

Conceived by psychologist Sanjay Srivastava62, this variant emerged from the principle that when a journal publishes a research finding, it should commit to publishing all methodologically sound replications of that finding regardless of how the results turn out and regardless of subjective importance or methodological flaws in the original study. Using a modified set of the RR assessment criteria, the journal reaches a stage 1 IPA decision on the basis of technical validity and the methodological proximity between the replication and target study (Supplementary Fig. 3). To date, Royal Society Open Science is the only journal that implements a complete and fully specified version of this concept, following partial implementations at Clinical Psychological Science, the Journal of Research in Personality and Psychological Science63,64,65,66,67.

Post-publication peer review RRs

RRs usually rely on conventional pre-publication review in which reviewers serve as gatekeepers to IPA and stage 2 acceptance. In contrast, formats that combine post-publication peer review with RRs publish the stage 1 manuscript almost immediately following initial receipt and then review it openly68,69. If the reviews are positive (with authors having the usual opportunity to revise the protocol), then the article is awarded IPA and, once it passes stage 2 review, the final manuscript with results is badged as an RR. To date, this model has been adopted across ten journals, including F1000Research and Wellcome Open Research.

Publisher-level RRs

The review process for RRs is typically managed by a single journal, but recently, some publishers have begun implementing a distributed model in which stage 1 and stage 2 manuscripts can be reviewed and published in different journals under the same publisher. In one working model, the completed stage 2 RR is then cross-linked to the accepted stage 1 protocol using an international RR identifier49.

Publisher-independent RRs

Can RRs exist beyond journals? As part of the recently created Peer Community in Registered Reports (PCI RR), stage 1 and stage 2 preprints are reviewed independently of journals (https://rr.peercommunityin.org). Where the reviews are positive, PCI RR issues a positive recommendation and authors can then choose to publish their recommended preprint in any ‘PCI RR-friendly’ journal without further peer review.

Seven virtues of high-quality RRs

What makes a good RR? In this section, we describe seven desirable characteristics that authors should aim to capture in their stage 1 and stage 2 manuscripts. Further guidance may be found in Box 1, in the RR policies for specific journals (see the list at https://cos.io/rr/) and in a recent practical primer by Kiyonaga and Scimeca70.

The first and foremost ingredient is that the proposal tackles a scientifically valid question, ideally one that other scientists agree is important to answer. The introduction section of the stage 1 manuscript should make clear the underlying theory or application from which the question arises, leaving the reader in no doubt as to why the study is being proposed. Second, where the study proposes hypotheses, they should be stated as precisely as possible in terms of specific variables to ensure falsifiability. In quantitative hypothesis-driven sciences, we recommend that researchers consider the open-theory pathway proposed by Guest and Martin71 to help ensure that hypotheses are formulated as a natural specification of computational theory rather than emerging loosely—and often with questionable rationale—from a vague conceptual framework. Some of the most effective RRs achieve this by identifying pressure points in competing theories and then devising hypotheses to adjudicate between them.

With the theory, rationale and hypotheses in place, the third key ingredient is a study procedure and analysis plan that is as rigorous, transparent and comprehensive as possible. Data acquisition protocols and analysis plans should be prespecified with sufficient detail to be reproduced by experts in the field, ideally with accompanying code, and with rigorous experimental controls where appropriate, including both negative and positive controls. Where the conclusions will depend on inferential statistics, the procedure should include a detailed sampling plan, such as a statistical power analysis, Bayes factor design analysis72 or an appropriate alternative, which, crucially, should also make clear the specific hypothesis it interrogates and the rationale for deciding the sensitivity of each statistical test (such as justification of the target effect size or Bayesian prior). When planning the analyses, it is vital to choose the right tools for the job, including assumption checks, detailed consideration of data preprocessing and filtering, and planned contingencies for any data-driven analysis decisions. Where these contingencies would be too numerous (or even impossible) to specify in advance, the inclusion of pilot data at stage 1 can be used to verify assumptions and narrow the range of possibilities. Alternatively, authors can embrace uncertainty and use blinded analysis methods to control risk of bias.
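
To make the sampling plan element concrete, the following is a minimal Python sketch (using the statsmodels library) of the kind of a priori power analysis that might accompany a stage 1 protocol for a hypothetical two-group design. All numerical values are illustrative assumptions rather than universal journal requirements.

```python
# Minimal sketch of an a priori power analysis for a hypothetical stage 1
# sampling plan. All numbers are illustrative; a real RR would justify the
# smallest effect size of interest in the protocol itself.
from math import ceil
from statsmodels.stats.power import TTestIndPower

analysis = TTestIndPower()
n_per_group = analysis.solve_power(
    effect_size=0.4,          # smallest effect size of interest (Cohen's d)
    alpha=0.05,               # prespecified significance threshold
    power=0.9,                # many RR policies expect at least 90% power
    alternative='two-sided',  # matches the prespecified two-tailed test
)
print(f"Required sample size per group: {ceil(n_per_group)}")
```

A Bayes factor design analysis would follow the same logic, replacing the fixed power target with a desired probability of obtaining compelling evidence for or against the hypothesis.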

These three features are the essential building blocks for the fourth key ingredient: a seamless link between the research question, theory and its specification, the hypotheses, sampling plan and contingent interpretation given different outcomes (Box 1). A stage 1 RR can be thought of as a preparatory chain of inference, leading from the why to the what and how, which, as with any chain, is only as strong as its weakest link. Many RRs now include study design tables to elucidate these links as precisely as possible (for example, see https://osf.io/sbmx9/).

Fifth, once the study is completed and results are known, it is essential that the outcomes of the prospective (confirmatory) analyses are clearly distinguished from the outcomes of any post hoc (exploratory) analyses that deviated from the preregistered plans. While RRs are not intended to restrict valid deviation or post hoc exploration, it is vital that at this final hurdle the outcomes that were decided after observing the data, and are therefore potentially prone to bias, are not conflated with those that were protected from bias by prespecification. Clear differentiation of exploratory and confirmatory outcomes in turn furnishes the sixth key ingredient: ensuring that the conclusions of the RR are based firmly on the evidence presented and appropriately weighted in favour of the confirmatory outcomes. Finally, in line with level 2 of the Transparency and Openness Promotion Guidelines73, the seventh key ingredient of a high-quality RR is that study data, code and digital materials are made publicly available to the maximum extent permitted by relevant ethical or legal constraints.

Limitations and drawbacks

Despite their advantages, RRs are neither a panacea nor a one-size-fits-all solution for irreproducibility. As a tool for improving the quality of confirmatory research, they are particularly well suited to hypothesis-driven studies and are not designed to improve the robustness or transparency of purely exploratory science (for which better-suited article types are available74,75,76). As the format has evolved, various shortcomings have also been revealed in the workflow and implementation.

Lack of protocol transparency

In 2018, Hardwicke et al.77 reported that of the 70 journals that had adopted RRs permanently, only 50% required that the accepted stage 1 protocols were publicly registered and available alongside the completed stage 2 articles. Protocol transparency is an important element of RRs because it enables readers to check whether authors followed the approved protocol rather than relying on the (typically) closed review process to ensure compliance78. One reason for the lack of protocol transparency is that, in 2013, the key progenitor of RRs, Cortex, did not require accepted stage 1 protocols to be made public, and this policy omission was then duplicated among subsequent adopters. However, since the analysis by Hardwicke et al., the recommended author guidelines for RRs at Cortex and at the Center for Open Science have been updated to include protocol transparency79. To date, of the 213 permanent adopters with published RR policies, 87% now either publish the stage 1 protocol as a separate article or require stage 1 protocols to be registered and made public no later than the point of stage 2 acceptance (see Supplementary Note for details, and ref. 80 for a dedicated RR registry supported by the Center for Open Science). It remains an ongoing task to persuade all RR-adopting journals to require protocol transparency, and a key aim of future metascience will be to confirm that journals are enforcing their policies appropriately (Box 4).

Lack of standardization

Previous analyses by Hardwicke et al.77 and Scheel et al.49 show that RRs are registered and reported inconsistently and, in many cases, even lack sufficient information to determine the specific hypotheses. This lack of specificity probably arises from the incompatibility between the traditional seventeenth-century manuscript format—involving discursive and often vague prose—and the demand for precision within RRs. An RR should articulate falsifiable predictions that are linked to specific sampling plans, inferential analyses and contingent interpretations given different outcomes.

Stage 1 delay and bureaucratic tennis

Although RRs might, in aggregate, lead to more efficient knowledge generation than regular articles (Table 1), the stage 1 review time typically adds a period of several months between submitting a stage 1 manuscript and the commencement of the research. This downtime can present a substantial barrier for researchers on short-term contracts or those holding grants that demand immediate data acquisition. Furthermore, in fields that require very specific ethical approval, such as the clinical sciences, authors can find themselves locked into a time-consuming tennis match between the journal and their ethics committee, both of which can insist on approving a precisely specified protocol (Fig. 3).

Fig. 3: RRs and ethics approval.

One frequently asked question is when authors should obtain ethics committee (EC) or institutional review board (IRB) approval for their stage 1 RR. The answer depends on the tolerance of the EC/IRB for methodological flexibility and the requirements of the journal’s policy. Where the EC/IRB permits flexibility (left track), it is usually most efficient to obtain a generic EC/IRB approval before manuscript submission. Where the EC/IRB will instead only approve a precise protocol, and any deviations to the protocol must be submitted for reapproval, the most efficient course of action depends on the specific journal requirements (right track). Most RRs proceed via the left track, but most RR policies also leave the door open for authors to discuss barriers arising from EC/IRB rigidity. For example, Cortex requires that “all necessary ... approvals (e.g. ethics) are in place for the proposed research. Note that manuscripts will be generally considered only for studies that are able to commence immediately; however, authors with alternative plans are encouraged to contact the journal office for advice.” (emphasis added).

Future

What does the next decade and beyond hold for RRs? The gradual rise of the format has unlocked a range of possibilities for expansion and innovation while also posing challenges for implementation and quality control. Here, we consider some of the major possible developments as RRs scale up. We also reflect on the key outstanding questions for metascience (Box 4) and consider how RRs may influence systems for evaluating research and researchers.

Improving efficiency

Perhaps the greatest limitation of the RR format is the time taken for submissions to be reviewed at stage 1 and receive IPA, thus delaying the commencement of research (see the section “Limitations and drawbacks”). While it can be argued that elimination of publication bias offsets this cost at a community level, and that the stage 1 delay could improve quality by increasing start-up costs81, this downtime nevertheless reduces the accessibility of the format for individual researchers and can make it prohibitive for short-term projects. Here, we consider four innovations that could substantially accelerate stage 1 review without reducing quality.

Rapid review

One way to accelerate RR review is to create a network in which reviewers agree to evaluate submissions within a short time frame. In 2020, Royal Society Open Science became the first journal to launch such a network for RRs related to the COVID-19 pandemic82. As part of this special initiative, the journal calls for submissions that are relevant to any aspect of COVID-19 in any field, including biological, medical, economic and psychological research, while also seeking specialist reviewers who are willing and able to evaluate stage 1 RRs within 24–48 h of accepting a review request. To date, nearly 900 scientists across a range of disciplines have joined the reviewer network, which is also accessible to other journals. To gain access, a journal must commit to rapid peer review of COVID-19 RRs—striving for 7 days for the initial stage 1 review round—and waive all article processing charges. Since then, 11 additional journals have joined the network, including Nature Human Behaviour, Nature Communications and PLoS Biology. To date, Royal Society Open Science has published six stage 2 RRs arising from the initiative, and additional submission statistics are available in the Supplementary Note83,84,85,86,87,88.

Scheduled review

To date, RRs at all journals are subjected to the same serial review process as regular articles. At stage 1, the manuscript is received and undergoes editorial triage; if it meets minimal requirements, editors then seek and obtain specialist reviews, ideally leading to IPA following revision and, in many cases, re-review (Fig. 4a). Despite prompt engagement by editors and reviewers, this process can take several months to achieve resolution. An alternative approach is to perform key elements of the initial stage 1 assessment in parallel (Fig. 4b). Under this model, authors initially submit a short, structured protocol for consideration before writing the stage 1 manuscript. If this passes editorial triage, then reviewers are invited to assess a complete stage 1 manuscript at a fixed date in the future (for example, 6 weeks ahead). During this time, the authors write and submit their complete stage 1 manuscript, which is then reviewed on the scheduled date or during a short range of dates. With sufficient contingencies in place, this modified review process could reduce the initial stage 1 review time (but not re-review time) from weeks/months down to a matter of days.

Fig. 4: Proposed scheduled review workflow for RRs.

Restructuring the RR submission workflow could considerably reduce the duration of stage 1 peer review (dashed red arrows). a, In the typical RR chronology, the total time taken for editors to triage submissions, acquire reviewers, obtain reviews and reach an editorial decision accumulates serially and only after authors have prepared and submitted a full manuscript. b, Scheduled review could accelerate this process by performing key tasks in parallel. Rather than submitting a full manuscript, authors would initially submit a one-page, template-based RR ‘snapshot’ that undergoes editorial triage. If deemed suitable, the editor would then organize the review process for a fixed future date (or a short range of dates) while the authors prepare the full manuscript. Although this process could only feasibly expedite the first round of stage 1 review (and not the re-review of a revised stage 1 submission), the overall time saving could be substantial since the first round of assessment is usually the most onerous. Although no journals currently offer scheduled review, the workflow has recently been introduced as part of the PCI RR initiative (https://rr.peercommunityin.org/).

Observer–evaluator review and ‘rolling IPA’

A more radical alternative to accelerated review and scheduled review would be to abolish the current peer review system altogether, replacing the serial assessment of documents—arguably a throwback to the seventeenth-century exchange of letters—with a more dynamic observer–evaluator mechanism. Authors could use an existing infrastructure such as the Open Science Framework (and associated add-ons) to create a virtual laboratory space containing the project rationale, study protocol(s), code and data as applicable. Reviewers could then be parachuted in as virtual observers, monitoring, commenting on and approving specific components as the research unfolds in real time, with the editor providing guidance, monitoring and oversight. This system could make RRs more compatible with the rapid sequential workflow that is common in fields such as chemical biology, virology and psychophysics, where the results of one experiment often lead within days to the design and implementation of the next experiment. As each update to the protocol is approved, IPA could be rolled over and extended.

RR funding models

One of the major barriers to RRs in clinical research is the additional bureaucracy imposed by stage 1 review. Many researchers already face multiple pre-study hurdles, including grant review, ethics review and, in some cases, regulatory review, and all before even contemplating a stage 1 RR submission. One promising solution to this problem is for journals and funders to perform concurrent or near concurrent reviews of RR proposals. Under this partnership model, which was first trialled in 2017 (refs. 89,90), authors submit a stage 1 proposal to a journal and funder, either simultaneously or in succession. Following assessment by both parties (either separately or as part of a joint process), if expert reviews are favourable, then IPA and funding are awarded in synchrony, collapsing two pre-study review phases into one. This mechanism could be further enhanced by incorporating ethics and regulatory review, which would be particularly useful for clinical trials for which these phases of review often require the assessment of a detailed and precise protocol.

Improving quality, accountability and rewards

Alongside improvements in efficiency, the next decade is likely to see a range of measures to optimize the openness and reliability of both RRs and the assessment of RRs. In this final section, we consider six key innovations, including computationally generated RRs, mandatory RRs for clinical trials, the development of an RR training and accreditation process for journal editors, tools for monitoring the speed and quality of journal assessment, ways to improve the recognition of reviewer contributions and emerging steps to incorporate RRs into formal research evaluation.

Computationally generated RRs

Quality control of accepted protocols is limited by the subjectivity of the review process and variable implementation of RR criteria across journals and fields. As previously noted (see the section “Lack of standardization”), the specificity of hypotheses, statistical tests and interpretations may not always be achievable with the current format. It may therefore be unrealistic to expect authors to achieve the required level of precision without tools that can guide them through this process, such as RR study design templates or software (for example, the ScienceVerse project developed by L. DeBruine and D. Lakens, available at https://scienceverse.github.io/scienceverse/). For RRs that test specific confirmatory hypotheses, all preregistered hypotheses and statistical predictions could also become machine-readable91. A machine-readable output would allow key metadata to be extracted from the RR, indicating whether the results confirm or disconfirm the prespecified hypotheses. Evaluating a stage 2 manuscript on the basis of adherence to preregistered statistical predictions would then become a more efficient and standardized process for the reviewers and editors.
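
As an illustration only, the sketch below shows one way a machine-readable prediction might be encoded; the schema, field names and the check_prediction helper are hypothetical inventions for this example, not drawn from ScienceVerse or any journal’s actual tooling.

```python
# A minimal sketch, assuming a hypothetical JSON-style schema, of how a
# preregistered statistical prediction might be encoded so that a stage 2
# result can be checked against it automatically. Nothing here reflects an
# existing standard; it simply illustrates the idea of machine readability.
import json

prediction = {
    "hypothesis_id": "H1",
    "statement": "Group A scores higher than group B on the primary outcome",
    "test": "independent_t_test",
    "tail": "greater",
    "alpha": 0.02,           # prespecified significance threshold
    "min_effect_size": 0.4,  # smallest effect size of interest (Cohen's d)
}

def check_prediction(pred, observed_p, observed_d):
    """Return True if the observed result satisfies the preregistered criteria."""
    return observed_p < pred["alpha"] and observed_d >= pred["min_effect_size"]

# Hypothetical stage 2 outcome checked against the stage 1 prediction.
print(json.dumps(prediction, indent=2))
print("Prediction confirmed:", check_prediction(prediction, observed_p=0.008, observed_d=0.55))
```

In principle, a registry or journal platform could run such checks across all preregistered hypotheses in a stage 2 submission, giving reviewers a standardized summary of protocol adherence.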

Several preregistration templates are available for assisting researchers in communicating their study plans in a concise and structured format92, and some journals also implement their own protocol-coding checklists with strict criteria for IPA93 (for example, https://osf.io/6bv27/). These templates and checklists can be time-consuming to complete on top of manuscript preparation; therefore, an essential innovation for RRs will be the creation of a user-friendly web-based RR generation tool that guides the authors through the implementation of stringent criteria, including precise specification of hypotheses and their linkage with sampling plans, analysis plans, inference criteria and contingent interpretations given different outcomes. The tool would ideally accommodate a wide range of disciplines, similar to the experimental design assistant offered by NC3Rs (https://www.nc3rs.org.uk/experimental-design-assistant-eda), thereby producing a standardized, submission-ready stage 1 protocol.

Mandatory RRs for clinical trials

In basic (non-clinical) research, RRs are usually proposed as an additional option for authors rather than a requirement, which makes sense given the vital importance of exploratory science. However, we believe a strong case can be made for all clinical trials to be conducted and reported exclusively as RRs. Even though trial registration is now the norm, registration does not guarantee that trials are preregistered rather than ‘post-registered’94,95, that trial results will be reported free from bias96,97 or that the results will be published at all98,99. With trials being vulnerable to all the same publication and reporting biases that afflict basic research, and with the first RR model for clinical trials now available at BMC Medicine100, the next decade will hopefully see mounting pressure on clinical trial funders and major medical journals to embrace the format, ideally via RR funding models to maximize efficiency.

Training and accreditation for editors

As a general rule, the standard with which a journal reviews and administers RRs can never exceed its standard of RR editing. It is therefore crucial that editors have the required skills and training. Guidelines to assist editors in evaluating stage 1 RR submissions are available from most RR-adopting journals or at the Center for Open Science (https://cos.io/rr/), and the PCI RR initiative requires new editors (called “recommenders”) to pass a 2 h entrance test. The expansion of RRs into new disciplines and less familiar terrain is bound to introduce variability in the standard of editing. An important future step for increasing and standardizing the quality of RR editing will therefore be to provide editors with training materials that are tailored to the busy schedules of academics, possibly delivered as a massive open online course. In this way, editors, and perhaps entire journals, could receive accreditation for their knowledge and understanding of the RR process, including criteria for manuscript acceptance/rejection at stage 1 (IPA) and stage 2. This system would equip authors with data to inform their choice of journal and give readers confidence in the standard of RR editing across journals.

Community monitoring and feedback

Related to the issue of editorial training is the issue of journal accountability. At present, there is little information available for authors to judge the quality of editing and review at an RR-adopting journal, apart from word of mouth and conventional (probably uninformative) indicators such as journal prestige55. We believe journals should regularly publish all data on the number of RR submissions received, rejection rates at the different stages and time spent under review. In addition, open review policies and a Yelp-style website in which authors and reviewers could leave anonymous feedback ratings on the quality of the editorial process would provide an incentive for journals to maintain high standards.

Reviewer recognition

Reviewers often make major contributions to RRs that are not transparently recognized. During stage 1, reviewers can recommend major changes in the study design, hypotheses, methods and analyses, contributions that would readily justify authorship if made outside the review process. One way to ensure that reviewers are properly credited for their contribution to RRs is for a journal to adopt an open, signed review policy. Although most journals employ closed, anonymous peer review, a minority of RR-adopting journals, such as Royal Society Open Science and Meta-Psychology, publish the accepted article alongside the reviews, which reviewers can sign in order to increase transparency, credit and accountability. Reviewers can further list their contributions on online platforms such as Publons (https://publons.com/). We believe it is important for reviewers, and especially ECR reviewers, to have publicly available evidence demonstrating the quality of their reviewing. To further recognize this contribution, one possibility would be to create a ‘reviewer contributor’ role that formally acknowledges the intellectual input of reviewers to the final RR, without being named as an author. This role could also be recognized through use of the CRediT taxonomy (https://casrai.org/credit/).

Research evaluation

To become normative in the long term, RRs will need to be recognized within formal systems for evaluating research quality. In the United Kingdom, there are already promising moves in this direction. In the 2021 Research Excellence Framework—a regular national exercise for assessing research quality and apportioning public funds—RRs are specifically noted as an indicator of research rigour101, which in turn means that authors who publish RRs could attract increased funding for their institutions. Following the recent formation of the UK Reproducibility Network (http://www.ukrn.org), institutions are also signalling their support for RRs102. University College London “strongly encourages” researchers to use RRs where appropriate103, an approach echoed by learned societies including the British Neuroscience Association104 and the British Psychological Society105. The Norwegian funder Stiftelsen Dam also recommends that grantees consider publishing their research in the form of RRs106, while the Templeton World Charity Foundation goes so far as to mandate RRs for certain funding schemes107. The next 5 years will hopefully see international expansion in the recognition of RRs at all stages of evaluation, from research outputs and grant applications to the criteria for employment and promotions. It is crucial that such judgments are applied cautiously with continual reference to ongoing metascience that will establish evidence of the costs and benefits of the format (Box 4).

Conclusion

In this Review, we reflected on the history, preliminary impacts and future potential of the RR initiative. Over the past 8 years, the life and social sciences have embarked on a journey into the unknown—one that had been mooted for decades but has only now reached open waters. Early suggestions of impact are promising, with RRs more likely to disconfirm a priori hypotheses and to be computationally reproducible, while also receiving higher quality ratings and equal or greater citation attention. The prospects of the initiative now hinge on more detailed metascience and on addressing limitations and maintaining quality control as the format scales up and expands into new disciplines. As we look into the next decade, we believe RRs are showing all the signs of becoming a powerful antidote to reporting and publication bias, realigning incentives to ensure that the practices that are best for science—transparent, reproducible, accurate reporting—also serve the interests of individual scientists.