Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain
the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in
Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles
To understand our world's changing climate, it is imperative that we understand how climates of the past varied. This collection brings together a series of papers published at Scientific Data that describe and share valuable data on past climates, i.e. 'paleoclimate data'. The data were obtained from a variety of sources, including historical records that can inform us about the recent past, and biological or fossil proxies that allow us to learn about climates that existed long before man walked the Earth.
Improving the energy efficiency of the buildings in which we live and work requires not just changes in construction techniques, or the use of more energy efficient technologies. Human behaviour also shapes and influences energy usage in complex and sometimes counter-intuitive ways. The papers in this collection present a series of datasets designed to help researchers explore this important topic. It is hoped that these works will help guide policy development and spark wider data sharing among researchers studying human behaviour in relation to our built environment.
'Multi-omics' refers to a family of complex experimental designs where researchers apply more than one molecular profiling technology – capturing, for example, the genome, proteome and metabolome – across a common set of biological samples. These experiments offer a wealth of opportunities for subsequent analyses, but the size of the resulting datasets and the diversity of the study designs makes data sharing inherently challenging. In this collection, we present a series of multi-omics studies where the authors have used innovative means to maximize the accessibility and reusability of their datasets.
In this collection, we highlight commentary and reviews from across the Nature Research journals that address some of the questions and considerations surrounding access of human genomic data. We have also selected research articles from across our journals that demonstrate the power of large-scale genomic data and data access
Metadata, data about data, are an essential component of any data sharing system. They power discovery and bind together related datasets. They provide essential context, describing, for example, who generated a dataset and how. The papers in this collection explore and critically analyze issues related to the quality and FAIRness of metadata in public resources. Collectively, these works highlight obstacles, both technical and behavioural, that affect metadata quality, and propose possible solutions. They also shine a light on the important, but often underappreciated, role of data curators and research data managers.
The robustness of a scientific finding must be judged not just by the merits of the original experiments, but also by the ability of these findings to be independently reproduced. Concerns that published findings, however, are commonly failing to reproduce have shaken trust in science, and led to calls for reforms in how scientific findings are evaluated and transmitted. As part of this movement, groups have called for replication studies – studies that repeat experiments in previous work to test the reproducibility of previous findings – to be better rewarded and more widely shared. This collection presents datasets collected from a series of replication studies, each presented in a transparent manner that would allow others to analyse the data themselves or compare it with past works. Several of the studies host their data in the Open Science Framework, a service of the Center for Open Science, which has been a strong advocate for wider research replication.
Understanding a gene’s functions is no less important than knowing its sequence, but data derived from functional screens have often been shared less systematically and thoroughly than sequencing data. This collection of data descriptors, organized in partnership with groups from leading functional genomics screening facilities around the globe, demonstrates the feasibility and value of sharing these inherently complex data types.
High-quality data on human population distributions are required for a wide range of applications, from assessing the impacts of population growth to planning elections. This collection brings together a series of publications at Scientific Data describing datasets from WorldPop, an initiative that aims to provide detailed, open-access spatial demographic datasets using transparent approaches – focusing, in particular, on low and middle-income countries where high-quality data has historically been lacking.
X-ray lasers scheduled to come online in the next few years promise to create a flood of new structural biology data that could overwhelm current processing and analysis methods. This collection describes a series of X-ray free-electron laser datasets generated at the Linac Coherent Light Source, which the organizers hope will help researchers develop new tools and methods to meet these challenges. All data are deposited at the Coherent X-ray Imaging Data Bank (CXIDB) and openly available to the scientific community.