Finding a sensible approach to sensitive data

    Today the journal released a checklist for researchers submitting descriptions of data derived from human subjects. The checklist is designed to make it easier for authors to understand and comply with our policies, while also making any restrictions on the data more transparent to the journal’s editors and staff.

    Datasets derived from human patients or participants must often be distributed under special conditions in order to enforce ethical and legal use of the data, and to protect privacy. Peer review of these datasets can be inherently challenging.

    When authors submit descriptions of sensitive human datasets, we initiate a conversation with them and their relevant ethics officers to identify a safe and secure way for peer review to occur. The checklist we have released today is designed to help facilitate this conversation. Authors who are considering submitting a description of a human-derived dataset should review this checklist and contact us early to begin this conversation. The checklist is now available online (

    Key parts of this checklist have been developed based on an editorial policy checklist already in use at the Nature-titled journals1. Our checklist additionally asks authors to provide detailed information on how data will be made accessible during peer-review and after publication.

    Since launch, the journal has been working to develop sensible policies that allow us to evaluate and publish sensitive human-derived datasets in a thorough manner2,3. First, we ask that the data be hosted in a repository that has experience handling and sharing sensitive data. Our repository list now includes a series of recommendations ( Second, there must be a viable and well-described mechanism by which other qualified researchers can apply for access to the data. Conditions should permit competitive reuse and should not limit reuse only to collaborators of the authors. Lastly, our referees should be able to access the data in a timely manner and without revealing their identities. This last point is often the most difficult to arrange.

    If there is no way for our referees to safely access a dataset, we will generally decline the submission. Data review, after all, is a central pillar of the journal’s editorial process.


    1. 1

      Announcement: Towards greater reproducibility for life-sciences research in Nature. Nature 546, 8 10.1038/546008a (2017).

    2. 2

      Hrynaszkiewicz, I., Khodiyar, V., Hufton, A. L. & Sansone, S. A. Publishing descriptions of non-public clinical datasets: proposed guidance for researchers, repositories, editors and funding organisations. Res Integr Peer Rev 1, 6 10.1186/s41073-016-0015-6 (2016).

    3. 3

      Let’s be pragmatic about clinical data. Sci. Data 2, 150034 10.1038/sdata.2015.34 (2015).

    Download references

    Rights and permissions

    Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit

    Reprints and Permissions

    About this article

    Verify currency and authenticity via CrossMark

    Cite this article

    Finding a sensible approach to sensitive data. Sci Data 5, 180253 (2018).

    Download citation