A knowledge base for tracking the impact of genomics on population health

Yu, Wei; Gwinn, Marta; Dotson, W. David; Green, Ridgely Fisk; Clyne, Mindy; Wulf, Anja; Bowen, Scott; Kolor, Katherine; Khoury, Muin J.

doi:10.1038/gim.2016.63

Download PDF

Brief Report
Published: 09 June 2016

A knowledge base for tracking the impact of genomics on population health

Genetics in Medicine volume 18, pages 1312–1314 (2016)Cite this article

928 Accesses
13 Citations
23 Altmetric
Metrics details

Subjects

Abstract

Purpose:

We created an online knowledge base (the Public Health Genomics Knowledge Base (PHGKB)) to provide systematically curated and updated information that bridges population-based research on genomics with clinical and public health applications.

Methods:

Weekly horizon scanning of a wide variety of online resources is used to retrieve relevant scientific publications, guidelines, and commentaries. After curation by domain experts, links are deposited into Web-based databases.

Results:

PHGKB currently consists of nine component databases. Users can search the entire knowledge base or search one or more component databases directly and choose options for customizing the display of their search results.

Conclusion:

PHGKB offers researchers, policy makers, practitioners, and the general public a way to find information they need to understand the complicated landscape of genomics and population health.

Genet Med 18 12, 1312–1314.

Strategic vision for improving human health at The Forefront of Genomics

Article 28 October 2020

The Singapore National Precision Medicine Strategy

Article 19 January 2023

A machine-compiled database of genome-wide association studies

Article Open access 26 July 2019

Introduction

Genomic information is increasingly finding its way into clinical and public health practice, from diagnosis of rare genetic diseases to diagnosis and treatment of common chronic diseases and infections. The new Precision Medicine Initiative¹ and related developments will increase the number of people whose genomes are sequenced in the next decade. Meaningful clinical interpretation of emerging information requires the integration of data from basic, clinical, and population studies. Although such data are abundant, they are widely dispersed across the peer-reviewed literature and other online resources. Clinicians and public health professionals require credible information to use genomic information in practice.

In 2001, the Centers for Disease Control and Prevention (CDC) Office of Public Health Genomics began to systematically compile and curate an online catalog of published population-based studies of human gene–disease associations. In 2008, Office of Public Health Genomics launched the Human Genome Epidemiology Navigator (HuGE Navigator)² as an application for mining the rapidly growing database. By 2015, it contained citations for more than 100,000 scientific publications and had acquired more than 150,000 users (i.e., unique IP addresses). At that time, the scientific agenda and public interest were shifting increasingly from gene discovery to translation—that is, the use of genomic information to develop genome-based tests, drugs, and other applications.³ The Genomic Applications in Practice and Prevention Network was one of several government-sponsored, interdisciplinary efforts to address a perceived “lack of readily accessible information about the utility of most genomic applications and the lack of necessary knowledge by consumers and providers to implement what is known.”⁴

To help address this need, we have taken what we view as the next step in showing how epidemiologic and other information can be used to improve population health: launching the Public Health Genomics Knowledge Base (PHGKB) (http://phgkb.cdc.gov). Our goal is to organize information from a wide variety of sources and in varying formats that are needed to describe the translational trajectories of genomic discoveries. Thus, although PHGKB’s component databases have different formats and data structures and can be searched individually, searching PHGKB as a whole also produces seamless results. Here, we briefly describe PHGKB and present an initial cross-sectional analysis of its contents.

Materials and Methods

PHGKB includes publications and other relevant Web-based resources captured by our weekly horizon scan. Some of our methods and early results have been described in previous publications.^5,6 The content is indexed and grouped into categories that include practice guidelines, systematic reviews, implementation studies, and applications of genomic tests and family health history classified according to the level of available evidence.⁷ PHGKB was built using J2EE technology⁸ and other Java open-source frameworks, including Hibernate⁹ and Strut.¹⁰ As the largest constituent of PHGKB content, the scientific literature is represented by PubMed abstracts indexed with Medical Subject Headings (MeSH) terminology. Use of the MeSH tree hierarchies and the Unified Medical Language System metathesaurus enhances the system’s search capacity.

Results

PHGKB is an open access, Web-based, searchable database that provides access to a spectrum of information on genomics and population health, from basic research to implementation. PHGKB currently consists of nine component databases, including HuGE Navigator ( Table 1 ). Users interested in specific topics can perform a global search of the entire knowledge base or search one or more component databases directly and choose options for customizing the display of their search results. The results of searching all databases in PHGKB are displayed according to steps in the translational pathway from discovery to implementation.¹¹ For example, results of a search for breast cancer ( Figure 1 ) are arrayed from discovery to implementation, with special emphasis on evidence synthesis and guidelines, as well as on CDC products. Each category also includes links to specialized external resources.

Table 1 PHGKB component databases

Full size table

PHGKB also offers users several ways to keep abreast of new information. First, the PHGKB main page features two sections that are updated almost daily: Hot Topics of the Day, curated by domain experts, and What’s New, which displays recent additions to the database and summary statistics. In addition, two weekly e-mail newsletters are available by subscription—Genomics & Health Impact Weekly Scan and Advanced Molecular Detection Clips (focused on human and pathogen genomics, respectively)—which direct users to new content posted on the PHGKB website.

Discussion

Genomics has given rise to many specialized online databases that were designed primarily for use by researchers and other expert users. PHGKB is unique in providing systematically curated and updated information that bridges population-based research with clinical and public health applications. We acknowledge that PHGKB is not comprehensive, especially given the fluid state of translation and implementation research. Most genomic research is still focused on new discoveries; however, the focus of PHGKB is the small fraction—perhaps 1%—of genomics-related publications that address epidemiology, evaluation and evidence synthesis, implementation, and outcomes, as we have described elsewhere.⁵ Finding these needles in a haystack is important because they are most relevant to population health. As the knowledge base grows, it will become useful for tracing the translational trajectories of specific discoveries into clinical application and population health outcomes. So far, PHGKB has undergone limited pilot testing by selected users at the CDC and in state health departments. With this report, we invite potential users to explore the resource. We intend to conduct additional evaluation studies in the near future.

Genomic literacy is becoming a fundamental requirement for clinical and public health decision makers who have the power to improve patient and population health. We hope that PHGKB offers a useful resource to researchers, policy makers, practitioners, and members of the public who are interested in understanding how genomic research can contribute to better health.

Disclosure

The authors declare no conflict of interest.

References

National Institutes of Health. Precision Medicine Initiative Cohort Program. https://www.nih.gov/precision-medicine-initiative-cohort-program. Accessed 20 January 2016.
Yu W, Gwinn M, Clyne M, Yesupriya A, Khoury MJ. A navigator for human genome epidemiology. Nat Genet 2008;40:124–125.
Article CAS Google Scholar
Green ED, Guyer MS ; National Human Genome Research Institute. Charting a course for genomic medicine from base pairs to bedside. Nature 2011;470:204–213.
Article CAS Google Scholar
Khoury MJ, Feero WG, Reyes M, et al.; GAPPNet Planning Group. The genomic applications in practice and prevention network. Genet Med 2009;11:488–494.
Article Google Scholar
Clyne M, Schully SD, Dotson WD, et al. Horizon scanning for translational genomic research beyond bench to bedside. Genet Med 2014;16:535–538.
Article Google Scholar
Yu W, Clyne M, Dolan SM, et al. GAPscreener: an automatic tool for screening human genetic association literature in PubMed using the support vector machine technique. BMC Bioinformatics 2008;9:205.
Article CAS Google Scholar
Dotson WD, Douglas MP, Kolor K, et al. Prioritizing genomic applications for action by level of evidence: a horizon-scanning method. Clin Pharmacol Ther 2014;95:394–402.
Article CAS Google Scholar
ORACLE. Jave EE 7 SDK downloads. http://www.oracle.com/technetwork/java/javaee/downloads/index.html. Accessed 20 December 2014.
Hibernate. http://hibernate.org/. Accessed 20 December 2014.
Apache Software Foundation. Apache Struts. http://struts.apache.org/. Accessed 20 December 2014.
Khoury MJ, Gwinn M, Yoon PW, Dowling N, Moore CA, Bradley L. The continuum of translation research in genomic medicine: how can we accelerate the appropriate integration of human genome discoveries into health care and disease prevention? Genet Med 2007;9:665–674.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Office of Public Health Genomics, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
Wei Yu MS, PhD, Marta Gwinn MD, MPH, W. David Dotson PhD, Ridgely Fisk Green MS, PhD, Anja Wulf BA, Scott Bowen MPH, Katherine Kolor MS, PhD & Muin J. Khoury MD, PhD
McKing Consulting Corporation, Atlanta, Georgia, USA
Marta Gwinn MD, MPH
Carter Consulting, Inc., Atlanta, Georgia, USA
Ridgely Fisk Green MS, PhD
Epidemiology and Genomics Research Program, National Cancer Institute, Bethesda, Maryland, USA
Mindy Clyne MHS
Kelly Services, Troy, Michigan, USA
Mindy Clyne MHS
Cadence Group, Atlanta, Georgia, USA
Anja Wulf BA

Authors

Wei Yu MS, PhD
View author publications
You can also search for this author in PubMed Google Scholar
Marta Gwinn MD, MPH
View author publications
You can also search for this author in PubMed Google Scholar
W. David Dotson PhD
View author publications
You can also search for this author in PubMed Google Scholar
Ridgely Fisk Green MS, PhD
View author publications
You can also search for this author in PubMed Google Scholar
Mindy Clyne MHS
View author publications
You can also search for this author in PubMed Google Scholar
Anja Wulf BA
View author publications
You can also search for this author in PubMed Google Scholar
Scott Bowen MPH
View author publications
You can also search for this author in PubMed Google Scholar
Katherine Kolor MS, PhD
View author publications
You can also search for this author in PubMed Google Scholar
Muin J. Khoury MD, PhD
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wei Yu MS, PhD.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yu, W., Gwinn, M., Dotson, W. et al. A knowledge base for tracking the impact of genomics on population health. Genet Med 18, 1312–1314 (2016). https://doi.org/10.1038/gim.2016.63

Download citation

Received: 12 February 2016
Accepted: 06 April 2016
Published: 09 June 2016
Issue Date: December 2016
DOI: https://doi.org/10.1038/gim.2016.63

This article is cited by

COVID-19 GPH: tracking the contribution of genomics and precision health to the COVID-19 pandemic response
- Wei Yu
- Emily Drzymalla
- Muin J. Khoury
BMC Infectious Diseases (2022)
Impact of BMI and waist circumference on epigenome-wide DNA methylation and identification of epigenetic biomarkers in blood: an EWAS in multi-ethnic Asian individuals
- Yuqing Chen
- Irfahan Kassam
- Xueling Sim
Clinical Epigenetics (2021)
Tracking human genes along the translational continuum
- Kyubum Lee
- Mindy Clyne
- Muin J. Khoury
npj Genomic Medicine (2019)