INTRODUCTION

Genomic medicine seeks to improve clinical outcomes by identifying risk for adverse drug events, providing molecular diagnoses, and identifying patients with increased lifetime risk of genetic disease, but implementation is limited by many factors. These include (1) insufficient infrastructure for high-throughput clinical reporting [1,2,3], (2) challenges handling protected health information (PHI) [4, 5], (3) labor-intensive genomic variant interpretation [6], (4) clinical site-specific data integration requirements [7, 8], (5) few actionable findings in some disease areas [9], (6) additional burden on providers to integrate genetic data [10], and (7) a reluctance from insurance providers to pay for precision medicine testing [11]. Addressing these challenges demands research that pairs large genomic data sets with clinical outcomes. Many national and international clinical sequencing projects have been established to fill this need, including the eMERGE Network [12], All of Us [13], the IGNITE network [14], and the Clinical Sequencing Evidence-Generating Research [15] (CSER) consortium as well as a large number of private and regional initiatives [16, 17].

Integrating genomic data in electronic health records (EHRs) will allow researchers to improve the clinical impact of genomic data, demonstrate its utility, and make it accessible to clinical decision support tools. Unfortunately, genomic data are often heterogeneous, mix or lack standards, are updated regularly, and require domain expertise to handle correctly. Data standards are in development [18] but there is a lack of flexible, comprehensive, and open source solutions for structuring genomic data and cleanly bridging the gap to EHR systems. There are commercial clinical reporting offerings in this space, but they are closed source [19,20,21]. PharmCAT combines a similar set of features by capturing domain knowledge, providing sample analysis, and generating clinical reports, but it focuses on pharmacogenomic reporting [22]. Genomics informatics resources like dbGaP [23] offer longer-term data storage solutions or focus on reanalysis [24]. Finally, some tools provide general support for building HL7 or FHIR messages, but do not provide domain-specific support for clinical genetics reporting [25]. In contrast, Neptune offers a robust, open source package of tools for integrating genomic data into the EHR (Supplementary Table 1).

While many laboratories have solutions to aid variant interpretation, incorporating structured genetic testing results into the EHR is widely considered so difficult that few laboratories attempt it, instead preferring to load PDFs as media files [26]. To support delivering genomic data to the EHR, we have developed Neptune, an environment that manages the clinical reporting process. The key features of Neptune are (1) to take as input genomic data (genotypes and coverage information) and compare against a VIP database of known genetic variation, marking known variants with previously curated data, selecting novel genomic variants for review, and identifying samples where all variants have been curated, which is essential for automated reporting; (2) to combine data from diverse sources including sample metadata from a Laboratory Information Management System (LIMS) and variant information from the VIP database and output data in a structured report file ready to be accepted by EHR systems; (3) to convert that structured data into a customizable human-readable report; (4) to enable corrected and updated reports; and (5) to enable the reanalysis and reinterpretation of data over time. In this report we describe Neptune’s workflow and its application to two gene panel based clinical tests that required data integration into EHRs: eMERGE III and HeartCare.

MATERIALS AND METHODS

Following the detection of genomic variants using standard bioinformatics pipelines [27], Neptune communicates via API with an external variant interpretation interface to obtain the most up-to-date variant interpretation data. Annotated variants and associated metadata are used to populate a structured JSON format that represents the clinical report for that sample. This functionality is encapsulated in an API (Table 1). Automated reporting is possible when all variants in a sample have been previously curated.
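The core decision in this workflow can be sketched as follows. This is a minimal illustration, not Neptune's actual API: the function and field names are hypothetical, and real variant matching involves normalization steps omitted here.

```python
# Hypothetical sketch of the reporting decision: annotate a sample's
# variants against a VIP snapshot and decide on automated reporting.

def build_report(sample_variants, vip_db):
    """vip_db maps (chrom, pos, ref, alt) to prior curation data."""
    curated, novel = [], []
    for v in sample_variants:
        key = (v["chrom"], v["pos"], v["ref"], v["alt"])
        if key in vip_db:
            curated.append({**v, **vip_db[key]})  # attach prior curation
        else:
            novel.append(v)                       # route to manual review
    reportable = [v for v in curated
                  if v.get("classification") in ("P", "LP")]
    # Automated reporting is only possible once every variant is curated.
    return {"reportable": reportable, "novel": novel,
            "automated": not novel}
```

A sample with any novel variant falls out of the automated path until those variants are curated and added back to the database.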

Table 1 Modes for running Neptune.

VIP database

The VIP database of genomic variation is maintained externally from Neptune. This database contains variant information (position, allele), frequency, transcript data, gene annotations (disease association, inheritance), and internal curation data (PubMed IDs of related publications; comments and categories from clinical sites). It currently contains data from 381,564 variants (Fig. 1b). This database was initially seeded by the two clinical reporting laboratories for the eMERGE III network [28], and has been subsequently updated to incorporate novel variants that are detected in samples in the Baylor College of Medicine Human Genome Sequencing Center (HGSC) Clinical Lab and other public variant resources. This resource draws on both public resources (ClinVar, OMIM, literature review) and internal data sets. The VIP database is available for download at https://gitlab.com/bcm-hgsc/neptune. Neptune interacts with a snapshot of the VIP database in vcf format. If a clinical laboratory maintains its own variant database, Neptune can be modified to retrieve it instead using Neptune’s module system, or the ClinVar data format could be used directly.
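Because Neptune consumes the VIP database as a VCF snapshot, the lookup structure can be sketched as below. The INFO keys (CLASS, GENE) are illustrative stand-ins, not the actual snapshot schema.

```python
# Hypothetical sketch of indexing a VCF-format VIP snapshot by
# (chrom, pos, ref, alt); INFO keys here are invented for illustration.

def load_vip_snapshot(vcf_lines):
    """Parse VCF body lines into a dict of per-variant annotations."""
    db = {}
    for line in vcf_lines:
        if line.startswith("#"):          # skip header lines
            continue
        chrom, pos, _id, ref, alt, _qual, _filt, info = \
            line.rstrip("\n").split("\t")[:8]
        annotations = dict(kv.split("=", 1)
                           for kv in info.split(";") if "=" in kv)
        db[(chrom, int(pos), ref, alt)] = annotations
    return db
```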

Fig. 1: Overview of Neptune functionality.
figure 1

(a) Neptune manages the variant review process and brings together disparate data from multiple external systems to create a final report file, in either JSON, HTML, or FHIR format. Central to this process is the VIP database of genetic variation. For each sample, novel genomic variants are added to this database and curated as needed according to project-specific rules. (b) The contents of the VIP database include curated variants. VIP database variants are predominantly variants of uncertain significance (VUS) or likely benign. CNV copy-number variant, EMR electronic medical record.

Variant filtering and interpretation in eMERGE and HeartCare

Clinical genomic variant filtering and interpretation were implemented separately from Neptune in an annotation pipeline and external curation interface, following American College of Medical Genetics and Genomics/Association for Molecular Pathology (ACMG/AMP) guidelines. As ClinGen recommendations become available (e.g., MYH7 [29] or copy-number variant (CNV) guidelines [30]) we have adopted them. eMERGE and HeartCare used a similar set of project-specific filters to reduce the review burden of benign variation. These filters were implemented separately and are not part of Neptune.

To calculate precision, recall, F1 measure, and specificity, we define reportable, pathogenic variants as positives and all other variants as negatives. A true positive is then a reportable variant that was either in the VIP or novel (i.e., was selected for review), a false positive is a variant selected for review that was not reportable, and a false negative is a reportable variant that was not selected for review. Metrics were evaluated for a recent batch (IR277) containing 138 samples.
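Written out from these definitions (the counts in the usage example below are invented for illustration and are not the IR277 batch values):

```python
# Standard metrics from confusion counts, per the definitions above.

def review_metrics(tp, fp, fn, tn):
    """Return (precision, recall, F1, specificity)."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    specificity = tn / (tn + fp)
    return precision, recall, f1, specificity
```

Note that with perfect recall, F1 is driven entirely by precision: for example, recall of 100% and precision of 26.4% gives an F1 of about 42%.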

Variant annotation with locally curated variant data

Novel variants are detected by comparing their genomic coordinates and alternate allele. Variants that are not present in the VIP database can be forwarded to a variant review system for manual curation. Following manual curation, novel variants are added to the VIP database by an external tool. Once all variants in a sample have been categorized, Neptune extracts reportable, pathogenic variants using curations stored in the VIP database, and outputs an automated clinical report populated with prioritized variants (or a negative report if no relevant variants are found).

The assessment of variants reviewed per sample in this study (Fig. 2) was done by “replaying” our review process, starting from an empty VIP database. Variants were limited to the 68 eMERGE consensus reportable genes (Supplementary Table 1). Each sample was analyzed in the order in which it was received. For each variant selected for review during our initial review process, we checked for it in the database. The database was empty or nearly empty early in this process, so many variants were assessed. We then added all reviewed variants to the database. As we progressed through the 7,258 data freeze samples we recorded how many reviewable variants were not present in the database for each new sample.
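The replay procedure above amounts to a simple simulation, sketched here with invented variant keys; the real replay operates on curated VIP records in received-sample order.

```python
# Sketch of the "replay" assessment: starting from an empty database,
# count the variants requiring review per sample, then add them.

def replay_review_burden(samples):
    """samples: per-sample sets of variant keys, in receipt order.
    Returns the number of not-yet-curated variants for each sample."""
    seen, burden = set(), []
    for variants in samples:
        new = variants - seen
        burden.append(len(new))
        seen |= new
    return burden
```

As in Fig. 2, the per-sample counts fall quickly once the database accumulates the recurrently observed variants.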

Fig. 2: Variant review burden over time.
figure 2

The plot shows the number of variants per sample requiring review in 68 eMERGE III consensus reportable genes, starting with an empty database. As additional samples are reviewed from a data freeze of 7,258, the number of variants per sample that are selected quickly decreases. In eMERGE III, the number of variants that require review plateaus at around one variant per sample.

Copy-number variation

Neptune can integrate CNVs by incorporating Atlas-CNV [31] output into the report. If activated, reports contain a CNV section. CNVs and single-nucleotide variants (SNVs) are reported alongside one another to highlight cases of compound heterozygosity, in which one gene contains both a CNV and another deleterious variant. Many of the CNVs reported in these studies were reviewed prior to the release of guidelines by ClinGen [30, 32], though reviews conducted after their release followed them. Prior to their release we applied ClinGen haploinsufficiency/triplosensitivity data, assessed whether the CNV was in or out of frame where possible, and considered known pathogenic CNVs or indels that overlapped the CNV in question. In eMERGE we initially required a CNV to span three exons until the release of our updated CNV caller, Atlas-CNV [31], which allowed us to begin reporting single-exon CNVs. In HeartCare, we reported single-exon events throughout the duration of the project.
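The co-reporting logic for highlighting possible compound heterozygosity can be illustrated as below; the record schema (a simple 'gene' key) is invented for this sketch.

```python
# Illustrative check: flag genes carrying both a reported CNV and
# another deleterious variant, as candidates for compound heterozygosity.

def compound_het_candidates(snvs, cnvs):
    """snvs/cnvs are lists of variant records with a 'gene' key."""
    cnv_genes = {c["gene"] for c in cnvs}
    return sorted({v["gene"] for v in snvs} & cnv_genes)
```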

Pharmacogenomics

Pharmacogenomic analysis is available for a subset of commonly reported genotypes and star alleles [33]. The module is configurable: the set of pharmacogenomic findings that is reported is defined using a mapping file that links reportable genotypes to their associated star alleles, phenotypes, and interpretation notes. Pharmacogenomic analysis requires either a gVCF input or an external QC file with coverage values for all pharmacogenomic variant sites. Variants are assumed to be unphased, leading to ambiguous star allele assignments in some cases (e.g., TPMT *1/*3A vs. *3B/*3C). If pharmacogenomic analysis is active, an additional table describing the patient's pharmacogenomic variants is added to the report, and the corresponding data are added to the structured JSON file.
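The TPMT example above can be made concrete with a simplified diplotype table. The variant sets below are abbreviated for illustration (*3A carries both defining variants on one haplotype, while *3B and *3C each carry one); real allele definitions live in resources such as PharmVar.

```python
# Simplified TPMT diplotype table: diplotype -> unphased variant set
# it implies. Entries are illustrative, not complete allele definitions.
TPMT_DIPLOTYPES = {
    "*1/*1": frozenset(),
    "*1/*3B": frozenset({"c.460G>A"}),
    "*1/*3C": frozenset({"c.719A>G"}),
    "*1/*3A": frozenset({"c.460G>A", "c.719A>G"}),   # both on one haplotype
    "*3B/*3C": frozenset({"c.460G>A", "c.719A>G"}),  # one on each haplotype
}

def candidate_diplotypes(observed, table=TPMT_DIPLOTYPES):
    """All diplotypes consistent with an unphased set of variant calls;
    more than one hit means the star allele assignment is ambiguous."""
    obs = frozenset(observed)
    return sorted(d for d, variants in table.items() if variants == obs)
```

An unphased heterozygous call at both sites matches two diplotypes, which is exactly the *1/*3A vs. *3B/*3C ambiguity noted above.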

Polygenic risk scores

Neptune includes a module that enables clinical reporting of polygenic risk scores (PRS). This module reads a file in variant call format (VCF), restricted to sites of interest for a given PRS. It then calculates the risk score using weights provided in a configuration file and the zygosity of each allele. Finally, the score for each sample is compared against a reference distribution (also provided in the configuration) to determine the risk category for that sample. The PRS, risk category, and weighted genotypes can be added to structured outputs. Although the clinical utility of PRS is not yet settled [32], gathering additional clinical data sets will facilitate the assessment of their utility.
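The computation is a weighted sum over allele dosages followed by a percentile lookup, sketched here with invented weights and a toy reference distribution (the real weights and distribution come from the module's configuration).

```python
# Minimal sketch of the PRS module's two steps: score, then categorize.

def polygenic_risk_score(dosages, weights):
    """Weighted sum of risk-allele dosages (0, 1, or 2 per site)."""
    return sum(w * dosages.get(site, 0) for site, w in weights.items())

def risk_category(score, reference, high_fraction=0.05):
    """Label a score 'high' if it falls in the top `high_fraction` of
    the reference distribution (a configurable cutoff)."""
    cutoff = sorted(reference)[int(len(reference) * (1 - high_fraction))]
    return "high" if score >= cutoff else "not high"
```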

Report templates

Reports are designed to meet all CAP/CLIA requirements and are highly customizable using an HTML-based templating system. Sections of the report can be activated or deactivated based on sample metadata such as project or sequencing methodology. Neptune supports both corrections and amendments to existing reports, with changes tracked and timestamped. By integrating with our variant review system, our internal deployment of Neptune streamlines the generation of batches of negative reports, which is critical in projects with a large number of negative reports.
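The metadata-driven activation of report sections can be sketched as a rule match; the rule schema below is a hypothetical stand-in for Neptune's HTML template configuration.

```python
# Illustrative section toggling: a section is active if every key/value
# in its rule matches the sample metadata; an empty rule is always active.

def active_sections(sample_meta, section_rules):
    return [name for name, rule in section_rules.items()
            if all(sample_meta.get(k) == v for k, v in rule.items())]
```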

Conversion to structured data formats

Neptune allows structured outputs to be in one of a variety of formats, including FHIR, HTML, and JSON. Regardless of the format, the output captures all elements of the report including variant information, descriptive text, and coverage statistics produced by the ExCiD software. In the next step, this prereport is merged with PHI within a fully HIPAA-compliant environment and the final report is made available to a laboratory director for approval. For ease of viewing, an HTML version of the report is also made available.

For the eMERGE III project, the JSON file was converted into a proprietary XML format selected for use by the eMERGE network. This format was standardized across the two clinical reporting laboratories which allowed clinical sites to accept reports in a unified format [34]. In our HeartCare project, work is ongoing to develop a FHIR-compatible data specification and a conversion tool that can take this specification and JSON data to produce FHIR-compatible outputs (https://emerge-fhir-spec.readthedocs.io/en/latest/).

The BCM HeartCare study

In the Baylor College of Medicine (BCM) HeartCare study, patients who presented at BCM clinical sites were invited to participate in a clinical genomics study that included return of genomic results and integration into the EHR. This project increased the complexity of the clinical report by adding a section for reporting a PRS alongside integrated small variant and CNV genomic findings from 168 genes related to cardiac disease, pharmacogenomic findings for a set of drugs related to cardiovascular disease, and the reporting of two risk alleles [35] for lipoprotein(a) (Lp(a)) [36].

RESULTS

We developed Neptune to facilitate delivering genetic test data to EHRs. Neptune follows object oriented design principles, with separate classes used to contain logic for samples, metadata, variants, VIP snapshots, report builders, and database connections among others (Supplementary figure 1). A key challenge with developing a system like Neptune is separating logic that is specific to the clinical laboratory in which it was developed from generalizable logic. To address this, we created a module system that allows development of separable components. These modules are loaded dynamically, based on a configuration file. For example, the report for a particular project may include CNVs, so the CNV “report_feature” can be activated in that project’s configuration file, which will instruct Neptune on the module to use for loading and displaying CNVs on the report. Neptune depends on the pyyaml, qrcode, and sqlite3 python packages. The FHIR client is also developed at the HGSC and available at https://gitlab.com/HGSC-NGSI/heartcare/heartcare-hl7.
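Configuration-driven module loading of this kind typically rests on dynamic imports. The sketch below uses Python's standard importlib mechanism; Neptune reads its configuration from YAML, but the idea is the same, and the feature names in the usage example reuse standard-library modules purely for demonstration.

```python
import importlib

# Sketch of dynamic feature loading from a configuration mapping.

def load_report_features(config):
    """Import the module behind each activated report feature."""
    return {name: importlib.import_module(name)
            for name in config.get("report_features", [])}
```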

Case study: Electronic Medical Records and Genomics Network

The eMERGE Network brings together researchers and clinical laboratories to study the implementation of genomic medicine [28]. Previously, as part of the eMERGE III Network, we performed clinical interpretation and issued over 14,500 clinical reports to 7 clinical sites for a targeted gene panel of 68 consensus genes with additional clinical site-specific genes. Clinical reports needed to be customized to each clinical site, which presented a challenge. Customizations included modifying the gene list depending on the clinical site, allowing specific single-nucleotide polymorphisms (SNPs) to be reported depending on the clinical site, adding a PRS for one clinical site and hiding it from others, displaying a pharmacogenomic section for some sites and modifying its content depending on site preferences, and modifying which set of metadata was displayed depending on the clinical site. Neptune implemented these customizations through a templating system that keys off sample-specific metadata extracted from the LIMS.

Genomic variants were interpreted according to ACMG/AMP guidelines [37] externally from Neptune and stored in the VIP database, in a high-throughput manner that relied on a set of automated filters defined prior to the project start. Manual review of variants is thus the exception. In eMERGE, over 99.99% (682,343/682,398 from a representative sample) of variants were handled automatically, and in a recent batch we observed recall of 100%, precision of 26.4%, F1 measure of 41%, and specificity of 99.99% (Supplementary Tables 3, 4). We employed a defined process for handling variant harmonization that has been previously described [28]. We started with a single reviewer who handled all variant interpretation and report sign-out activities. Later, we added a small team of 2–4 second reviewers and a dedicated first reviewer. Taking advantage of recurrent variant interpretations using the VIP database, we observed a rapid decline in novel variants per sample, followed by a stabilization around one reviewable variant per sample (Fig. 2). A key lesson learned was the benefit of gene-centric reviews; we adopted a review approach that “batched” together a large number of samples (typically 1,200), and then reviewers curated all variants in a particular gene from this batch in a single session. For example, a typical batch might contain 10 rare BRCA2 variants; these would all be interpreted in the same session by one reviewer. This approach reduced context switching for reviewers, streamlined literature review, and simplified adding members to the review team. The change proved popular with the review team and will be applied to future projects.

We engaged in multiple reanalysis activities as part of eMERGE III, supported by Neptune. First, we compared two snapshots of the ClinVar download (available from ftp://ftp.ncbi.nlm.nih.gov/pub/clinvar/), from August 2018 and August 2019. Variants with a new pathogenic or likely pathogenic (P/LP) interpretation where there were none previously were considered candidate upgrades. Variants where a previous P/LP assertion had been removed, leaving only variants of uncertain significance (VUS), benign, or likely benign, were candidates for a classification downgrade. In the genomic regions covered by our test, we identified 614 unique variants with changed assertions. For potential downgrades, we only considered variants that we had previously reported as P/LP, as many of the new ClinVar entries supported our earlier decision not to report a variant that had previously been classified as P/LP in ClinVar. This filtering left 109 unique variants to review (99 upgrades, 10 downgrades), of which 34 (28 upgrades, 6 downgrades) had 2, 3, or 4 stars in the August 2019 ClinVar snapshot (indicating multiple submitters with no conflicts, expert panel review, or practice guideline, respectively). For each of these variants, we performed a full, manual variant interpretation, considering all ACMG/AMP evidence categories. Ultimately, we found five variants with sufficient evidence to change the variant interpretation and issued corrected reports. The total time required for manual review varied greatly, from a few minutes to more than 5 hours, based primarily on the additional information available about the variant and the number of discussions required by the review team to finalize their interpretation. For first review, reanalysis took 32 minutes on average (SD 9.4). The majority of variants could be reclassified by a first reviewer, but a small fraction (<9%) required attention from a laboratory director.
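The snapshot comparison reduces to a set difference over per-variant assertion sets, sketched below with invented data; the real comparison operates on parsed ClinVar records restricted to the regions covered by the test.

```python
# Sketch of the ClinVar snapshot diff: each snapshot maps a variant key
# to its set of assertions (data invented for illustration).

P_LP = {"Pathogenic", "Likely_pathogenic"}

def classify_changes(old, new, reported):
    """Candidate upgrades: variants newly P/LP where no P/LP assertion
    existed before. Candidate downgrades: previously reported P/LP
    variants whose P/LP assertions have all been removed."""
    upgrades = [v for v in new
                if new[v] & P_LP and not (old.get(v, set()) & P_LP)]
    downgrades = [v for v in reported
                  if old.get(v, set()) & P_LP
                  and not (new.get(v, set()) & P_LP)]
    return upgrades, downgrades
```

Restricting downgrades to previously reported variants mirrors the filtering described above.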

In a separate reanalysis activity, we identified genomic VUS that, with the addition of one ACMG/AMP subcategory, could reach P/LP status. As the phenotypic and family history information gathered during eMERGE was quite limited, we requested a manual chart review from clinical sites for these variants (Fig. 3b). There were 83 variants identified initially, of which we reclassified 4, either using ACMG/AMP subcategory PS4 (prevalence in affecteds significantly increased over controls) or PP4 (patient’s phenotype or family history highly specific for gene). An example was the NM_000551.3:c.551T>C variant in the VHL gene, which was borderline VUS based on the evidence we had (PP3: computationally predicted to be deleterious, PM2: absent from population databases). Two papers reported the variant associated with affected individuals, but this was not enough evidence to apply PS4. However, upon contacting the clinical site, we learned that the patient was diagnosed with von Hippel–Lindau disease, which allowed us to apply the PP4 subcategory, moving this variant to LP status.

Fig. 3: eMERGE III reanalysis activities.
figure 3

Neptune supported two parallel reanalysis activities during the eMERGE III project. The first aimed to provide updated reports when variant classifications change over time (a). To accomplish this, we used Neptune’s reanalysis module to compare a ClinVar snapshot to local variant categorizations. We identified upgrades and downgrades by detecting either unreported variants with a new pathogenic/likely pathogenic (P/LP) classification in ClinVar or a reported variant with a new variant of uncertain significance (VUS), benign, or likely benign classification. There were 26 upgrades for review, resulting in 3 updated reports (all initially VUS), and 86 downgrades for review, resulting in 2 updated reports. In the second activity (b), we collected a set of VUS that each lacked one American College of Medical Genetics and Genomics/Association for Molecular Pathology (ACMG/AMP) subcategory to reach an overall classification of LP. We then contacted clinical sites requesting more detailed patient phenotype information, to be able to apply the PP4 ACMG/AMP subcategory (patient phenotype or family history highly specific for gene). In four cases we were able to issue updated reports, all due to the new clinical information, highlighting the importance of detailed clinical information during review by clinical geneticists.

In total, we reissued nine reports based on variant classification updates. Using the size of the eMERGE panel (68 consensus genes) and the number of reports in circulation when we started this effort (approximately 15,000), we estimate that reanalysis will require clinical laboratories to assess roughly 0.0001 variants per gene per issued report (109/1,020,000). The rate of reissued reports remains low, at 0.03% (5/15,000). As the number of interpreted variants increases, this burden will continue to grow.

Case study: BCM HeartCare

In a second application, we performed variant interpretation and reporting for 709 patients who presented at BCM cardiovascular clinics. Of these cases, 8.5% were positive for a P or LP SNV or CNV, and 49% were positive for a pharmacogenomic finding. Management changes as a result of these findings included recommending additional specific testing (including imaging), referral for a genetic consultation, or a change in medication.

For HeartCare, our review team of 2–4 analysts handled the initial variant reviews, while a dedicated clinical geneticist with expertise in cardiovascular genetics handled the final review and report sign-out. Discordances with groups outside the project are handled by the reanalysis process. A new addition was patient and family management recommendations, written by a clinical geneticist. This section provides feedback to the ordering physician on managing a genetic finding and, when appropriate, contains advice on additional testing, drug regimens to start or avoid, additional genetic counseling, and recommendations on cascade testing. Composing the physician guidance section added significant time to report preparation. These changes were implemented by creating a new report template to support the additional fields. Supplementary Figure 2 shows an example HeartCare report.

Neptune enabled the reporting of structured PRS data for HeartCare. We implemented a previously developed PRS for coronary artery disease [38], based on 50 SNPs. High-risk individuals have a 91% higher relative risk of hospitalization after 10 years than low-risk individuals. In HeartCare, after clinician feedback, we reported the top 5% of individuals in this distribution as the “high-risk” group (top 5% ≥4.5824), which is somewhat more stringent than the original study [38]. The assessment of the clinical utility of these scores is ongoing, and the creation of clinical data sets in which PRS data are integrated with EMR data, enabled by tools like Neptune, will aid these assessments.

We also implemented a HIPAA-compliant reporting portal, hosted on Amazon Web Services (AWS), for final report rendering and storage. We piloted an integration of this reporting platform with Epic. This required generating HL7v2 messages that contain the encoded clinical report and key report results using the HAPI API (https://hapifhir.github.io/hapi-hl7v2/). The Epic team developed a new interface for displaying this information, and a new data model for storing it. HL7 messages were transferred by SFTP, and automatically loaded by Epic and attached to the test order. To keep the HL7 message simple we included fields for the order number, Medical Record Number (MRN), test name, environment, last name, first name, middle initial, date of birth (DOB), gender, visit number, HGSC accession, observation date, specimen received date, ordering provider, results report date, result status, Lp(a) finding, genetic finding, and address. Supplementary Figure 3 shows an example of how these data appeared in Epic for ordering providers. In coordination with the Epic team, we tested the functionality, performance, and security of this approach using HL7 messages from 32 samples. These samples were loaded by the Epic team, who then shared screenshots of the Epic interface and PDF reports for review. At the conclusion of the HeartCare project, we had successfully connected Neptune to Epic and ensured the resulting interface was secure and performant and that data were received correctly by Epic. A full description and lessons learned from the HeartCare study are described in Murdock et al. 2021 (under review).
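The shape of such a message can be illustrated with a heavily simplified, pipe-delimited HL7 v2-style construction. The segment layout below is abbreviated and illustrative only; the production messages were built with the HAPI library against Epic's interface specification and carried the full field list described above.

```python
# Illustrative, abbreviated HL7 v2-style message builder (MSH/PID/OBX
# only); field positions are simplified for demonstration purposes.

def build_hl7_message(f):
    """Assemble MSH, PID, and OBX segments from a dict of report fields."""
    msh = ("MSH|^~\\&|NEPTUNE|HGSC|EPIC|BCM|{observation_date}||ORU^R01|"
           "{order_number}|P|2.3").format(**f)
    pid = "PID|1||{mrn}||{last}^{first}||{dob}|{gender}".format(**f)
    obx = "OBX|1|TX|{test_name}||{genetic_finding}||||||F".format(**f)
    return "\r".join([msh, pid, obx])   # HL7 segments are CR-separated
```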

DISCUSSION

Neptune provides a customizable platform that enables the delivery of genomic results to support genomic medicine. It facilitates complex reporting workflows including reanalysis, and connects genomic data to clinical geneticists and the EHR. It is backed by a VIP database of genetic variation that stores variant curations. We have utilized this environment to enable two exemplar projects in which clinical genetic data were reviewed, reported out, and transferred back to a clinical site. Neptune is a validated approach to clinical genetic reporting that can alleviate some of the problems related to delivering scalable clinical genetic data.

Reanalysis places a substantial workload on clinical genetics activities and the overall effort will increase with the volume of reports issued. Based on the number of genes present on the gene panel designs used in the tests reviewed here, we observed a rate of 0.0001 variants per gene on an issued report per year. Thus, when reporting clinical genetic data at a large scale, complete reanalysis may not be feasible and clear guidelines will be crucial to define the extent to which reanalysis activities are necessary. Future work will examine the extent to which accelerating submissions to ClinVar might change this estimate and whether potential increasing concordance between laboratories will reduce the amount of work required.

The approach to variant review presented here relies on manual interpretation of variants, and thus has limits to scalability as the number of reported genes increases to, e.g., an exome. This limit is evident in the plateau in per-sample review burden (Fig. 2) that we and others [39] have observed as additional samples are added to a study. Based on harmonization activities that we have conducted with other laboratories [28, 40], the approach presented here is consistent with best practices in the field, and scaling variant interpretation is likely to be a general challenge in the coming years. Active efforts by ClinGen toward rule-based interpretation will help automated genomic variant interpretation become standard.

The challenge of integrating genomic data into an EHR was made clear during HeartCare, where developers were unable to access the Epic test environment directly. Instead, our testing methodology relied on sharing screenshots for review, resulting in many slow iterations. Simplifying the HL7 message itself also proved to be key. A more complex message would have required still more rounds of testing and would have been challenging to review in multiple views in Epic. A surprising challenge was the difficulty of receiving confirmation from Epic of correct receipt of a message. This feature required additional configuration in Epic but was essential for the smooth operation of clinical reporting. Finally, we have only begun exploring the patient experience, but this aspect of the project is critical and should be a focus from the outset. True interoperability with the EMR will require the ability to extract de-identified data, which can be useful during variant interpretation and discovery. This level of interaction has not been achieved yet by our systems, but will be a future goal.

The successful implementation of genomic medicine relies on structured integration of genomic data into EHR systems. These data cannot remain in silos; rather, they should be shared as widely as possible given the constraints of research consent and PHI data protection. When stored in a structured format, these data can be acted on by Clinical Decision Support (CDS) tools to provide context-dependent decision support to clinicians. Optimally, data would flow smoothly both into and out of the EHR. Health information can be used to support variant interpretation, and genomic data are already proving actionable in the clinic, with their utility increasing rapidly. Data interchange formats like FHIR (https://emerge-fhir-spec.readthedocs.io/en/latest/) are crucial for enabling this interchange and will empower the next generation of clinical genomic integration.