The biomedical research community has recognized the value of broadening data searches to span multiple repositories. As part of the US National Institutes of Health (NIH) Big Data to Knowledge (BD2K) initiative, we work with an international community of researchers, service providers and knowledge experts to develop and test DataMed, a data index and search engine based on metadata extracted from data sets in a range of repositories. DataMed is designed to be, for data, what PubMed has been for the scientific literature. It supports the findability and accessibility of data sets. Together with interoperability and reusability, these characteristics make up the four FAIR principles, which facilitate knowledge discovery in today's big data–intensive science landscape.





This project is funded by grant U24AI117966 from NIAID, NIH, as part of the BD2K program. The co-authors, who are the lead investigators and chairs/co-chairs of the core activities, thank all contributors to the bioCADDIE consortium and list them in the Supplementary Note in alphabetical order within each activity group (each name appears only once even though many people participated in different activities).

Author information

Author notes

    • Lucila Ohno-Machado
    • Susanna-Assunta Sansone
    • George Alter
    • Ian Fore
    • Jeffrey Grethe
    • Hua Xu
    • Hyeon-eui Kim

    These authors contributed equally to this work.


  1. Health System Department of Biomedical Informatics, University of California, San Diego, La Jolla, California, USA.

    • Lucila Ohno-Machado
    • Elizabeth Bell
    • Nansu Zong
    • Hyeon-eui Kim

  2. Veterans Administration San Diego Healthcare System, San Diego, California, USA.

    • Lucila Ohno-Machado

  3. e-Research Centre, University of Oxford, Oxford, UK.

    • Susanna-Assunta Sansone
    • Alejandra Gonzalez-Beltran
    • Philippe Rocca-Serra

  4. Department of History and Inter-University Consortium for Political and Social Research (ICPSR), Institute for Social Research, University of Michigan, Ann Arbor, Michigan, USA.

    • George Alter

  5. US National Institutes of Health, Bethesda, Maryland, USA.

    • Ian Fore

  6. Department of Neurosciences, University of California, San Diego, La Jolla, California, USA.

    • Jeffrey Grethe

  7. School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA.

    • Hua Xu
    • Anupama E Gururaj
    • Ergin Soysal



Competing interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to Lucila Ohno-Machado.

Supplementary information

Word documents

  1. Supplementary Text and Figures

    Supplementary Table 1 and Supplementary Note
