Comment

Filter By:

Article Type
  • As big data, open data, and open science advance to increase access to complex and large datasets for innovation, discovery, and decision-making, Indigenous Peoples’ rights to control and access their data within these data environments remain limited. Operationalizing the FAIR Principles for scientific data with the CARE Principles for Indigenous Data Governance enhances machine actionability and brings people and purpose to the fore to resolve Indigenous Peoples’ rights to and interests in their data across the data lifecycle.

    • Stephanie Russo Carroll
    • Edit Herczog
    • Shelley Stall
    CommentOpen Access
  • Open access to global forest data, especially ground-measured (in situ) records, is critical for saving the world’s forest systems. Integrated approaches to achieve sustainable data openness will involve legal assurances, shared ethics, innovative funding schemes and capacity development.

    • Jingjing Liang
    • Javier G. P. Gamarra
    CommentOpen Access
  • Development of world-class artificial intelligence (AI) for medical imaging requires access to massive amounts of training data from clinical sources, but effective data sharing is often hindered by uncertainty regarding data protection. We describe an initiative to reduce this uncertainty through a policy describing a national community consensus on sound data sharing practices.

    • Joel Hedlund
    • Anders Eklund
    • Claes Lundström
    CommentOpen Access
  • “Speak to the past and it shall teach thee”. I first read those words on a dedication tablet within the John Carter Brown library at Brown University where I was a graduate student. Little did I know the phrase would accurately describe the next three and a half decades of my career. Paleoclimate data are the language we use to look into the past to understand ourselves and ultimately our future.

    • Harry Dowsett
    CommentOpen Access
  • Efficient response to the pandemic through the mobilization of the larger scientific community is challenged by the limited reusability of the available primary genomic data. Here, the Genomic Standards Consortium board highlights the essential need for contextual genomic data FAIRness, for empowering key data-driven biological questions.

    • Lynn M. Schriml
    • Maria Chuvochina
    • Ramona Walls
    CommentOpen Access
  • As information and communication technology has become pervasive in our society, we are increasingly dependent on both digital data and repositories that provide access to and enable the use of such resources. Repositories must earn the trust of the communities they intend to serve and demonstrate that they are reliable and capable of appropriately managing the data they hold.

    • Dawei Lin
    • Jonathan Crabtree
    • John Westbrook
    CommentOpen Access
  • A significant challenge facing rare disease communities is finding a sufficient quantity and variety of data to develop and test disease-specific hypotheses. Here we describe an approach to data sharing in which stakeholders from the neurofibromatosis (NF) research community collaborated to develop a disease-focused data portal with the goal of supporting scientists within and outside the community as well as clinicians and patient advocates.

    • Robert J. Allaway
    • Salvatore La Rosa
    • Sara J. C. Gosline
    CommentOpen Access
  • We outline a principled approach to data FAIRification rooted in the notions of experimental design, and whose main intent is to clarify the semantics of data matrices. Using two related metabolomics datasets associated to journal articles, we perform retrospective data and metadata curation and re-annotation, using community, open, interoperability standards. The results are semantically-anchored data matrices, deposited in public archives, which are readable by software agents for data-level queries, and which can support the reproducibility and reuse of the data underpinning the publications.

    • Philippe Rocca-Serra
    • Susanna-Assunta Sansone
    CommentOpen Access
  • In the past decade, there has been a surge in the number of sensitive human genomic and health datasets available to researchers via Data Access Agreements (DAAs) and managed by Data Access Committees (DACs). As this form of sharing increases, so do the challenges of achieving a reasonable level of data protection, particularly in the context of international data sharing. Here, we consider how excessive variation across DAAs can hinder these goals, and suggest a core set of clauses that could prove useful in future attempts to harmonize data governance.

    • Katie M. Saulnier
    • David Bujold
    • Yann Joly
    CommentOpen Access
  • Climate change cannot be addressed without improving the energy efficiency of the buildings in which we live and work. The papers in this collection describe and release a series of datasets that help us understand how occupants influence and experience building energy use, both to aid future research and policy-development, and to spark wider data sharing in this important area.

    • Gesche Margarethe Huebner
    • Ardeshir Mahdavi
    CommentOpen Access
  • A special collection on multi-omics data sharing, launched today at Scientific Data, offers to the scientific community a compendium of multi-omics datasets ready for reuse, which showcase the diversity of multi-omics projects and highlights innovative approaches for preprocessing, quality control, hosting and access.

    • Ana Conesa
    • Stephan Beck
    CommentOpen Access
  • UniProt continues to support the ongoing process of making scientific data FAIR. Here we contribute to this process with a FAIRness assessment of our UniProtKB dataset followed by a critical reflection on the challenges and future directions of the adoption and validation of the FAIR principles and metrics.

    • Leyla Garcia
    • Jerven Bolleman
    • Jian Zhang
    CommentOpen Access
  • The Brain Imaging Data Structure (BIDS) project is a rapidly evolving effort in the human brain imaging research community to create standards allowing researchers to readily organize and share study data within and between laboratories. Here we present an extension to BIDS for electroencephalography (EEG) data, EEG-BIDS, along with tools and references to a series of public EEG datasets organized using this new standard.

    • Cyril R. Pernet
    • Stefan Appelhoff
    • Robert Oostenveld
    CommentOpen Access
  • The Brain Imaging Data Structure (BIDS) is a community-driven specification for organizing neuroscience data and metadata with the aim to make datasets more transparent, reusable, and reproducible. Intracranial electroencephalography (iEEG) data offer a unique combination of high spatial and temporal resolution measurements of the living human brain. To improve internal (re)use and external sharing of these unique data, we present a specification for storing and sharing iEEG data: iEEG-BIDS.

    • Christopher Holdgraf
    • Stefan Appelhoff
    • Dora Hermes
    CommentOpen Access
  • Although increasingly recognized as critical to genomic research, genomic data sharing is hindered by an absence of standards regarding timing, patient privacy, use agreement standards, and data characterization and quality. Only after months of identifying, permissioning for use, committing to terms restricting use and sharing, downloading, and assessing quality, is it possible to know whether or not a dataset can be used. In this paper, we evaluate the barriers to data sharing based on the Treehouse experience and offer recommendations for use agreement standards, data characterization and metadata standardization to enhance data sharing and outcomes for all pediatric cancer patients.

    • Katrina Learned
    • Ann Durbin
    • Isabel M. Bjork
    CommentOpen Access
  • There is an urgent need to improve integrity of large industrial infrastructure. Sharing data can support better understanding of accidents such as recent mining dam collapses, making them less likely to occur, and contributing to sustainability.

    • Paulo A. de Souza Jr.
    CommentOpen Access
  • The number of chemical compounds and associated experimental data in public databases is growing, but presently there is no simple way to access these data in a quick and synoptic manner. Instead, data are fragmented across different resources and interested parties need to invest invaluable time and effort to navigate these systems.

    • Sten Ilmjärv
    • Fiona Augsburger
    • Karl-Heinz Krause
    CommentOpen Access
  • The past two decades have seen a revolution in digital imaging techniques for capturing gross morphology, offering an unprecedented volume of data for biological research. Despite the rapid increase in scientific publications incorporating those images, the underlying datasets remain largely inaccessible. As the technical barriers to data sharing continue to fall, we face a more intimate, and perhaps more complicated, obstacle to open data – the one in our minds.

    • Christy A. Hipsley
    • Emma Sherratt
    CommentOpen Access