Main

Many sectors—scientific, medical, commercial, and regulatory—have an interest in the development and evaluation of new, health-related genomic tests. As a result of public and private investment, many new genomic tests are being offered to health care providers and, in some cases, directly to consumers. This activity is decentralized and dynamic, involving a complex network of relationships among government, academic, and commercial entities. Consequently, it is not easy to determine which genomic tests are in development, currently available, or discontinued. Information about the tests is mostly scattered and often incomplete.

Efforts are underway to consolidate relevant information and make it accessible to stakeholders and the public. Since 1993, information about genetic tests for mostly rare, single-gene and chromosomal disorders has been available online from GeneTests.1,2 Several endeavors—such as the Evaluation of Genomic Applications in Practice and Prevention (EGAPP) project3 and the Secretary's Advisory Committee on Heritable Disorders in Newborns and Children4—concentrate on systematic reviews of evidence for the validity and utility of existing genetic tests.

Horizon scans complement these retrospective review processes by identifying new, health care-related genetic tests before they become widely used. For example, the Agency for Healthcare Research and Quality has published annual horizon scans since 2006, with a focus alternating between cancer and other conditions. The 2009 report reviewed selected pharmacogenetic tests and the 2010 report updates information on genetic tests for noncancer conditions.5,6 An updated list of companies offering genetic tests directly to consumers is available from the Genetics and Public Policy Center.7 Most recently, the National Institutes of Health has announced plans to develop a new, voluntary Genetic Testing Registry at National Center for Biotechnology.8,9

Since 2001, the Office of Public Health Genomics (OPHG) at the Centers for Disease Control and Prevention has actively monitored online news and scientific resources for information on genomic research and its potential implications for public health. OPHG posts an online weekly update, which contains links to news stories and scientific articles (http://www.cdc.gov/genomics/update/current.htm). In addition, OPHG staff curates an online database of scientific publications on genetics and disease in human populations, which is populated by a weekly sweep of PubMed.10 Since 2005, the OPHG-sponsored EGAPP project staff has also conducted periodic online searches for genomic tests that are in transition from research to clinical practice.

In mid-2009, we initiated a review of these online search procedures to develop a more systematic and comprehensive horizon scanning process to identify new, health-related genetic and genomic tests. We describe the development of this process and evaluate the initial results, collected from mid-October 2009 through June 2010. Finally, we consider how horizon scanning can contribute to future evaluation of genomic testing.

METHODS

We focused our review on tests that analyzed (1) genetic variants (e.g., single nucleotide polymorphisms or copy number variants), singly or in combination; (2) gene expression; or (3) protein biomarkers (excluding single antibodies). We also sought to identify relevant tests based on other methods (such as fluorescent in situ hybridization, immunohistochemistry, or mass spectrometry) and assays of other molecular analytes (such as epigenetic markers and micro-RNAs), either alone or in combination. Although we were primarily interested in tests related to screening, diagnosis, management, and prediction of adult-onset and chronic diseases, we did not exclude newly described tests for inherited disorders. We did not attempt a retrospective survey of previously developed tests, including some that are already in widespread use.

We used Google Alerts (http://www.google.com/alerts) to search for Web pages, newspaper articles, and blogs that might identify a new, health-related genomic test. We first composed a set of Google Alert queries by combining the words test or testing with a word from the following list: gene, genetic, DNA, molecular, expression, or proteomic. We did not use the term genomic because, although it describes these tests as a group, it is rarely used to describe specific tests. We then created a set of selected three-word combinations by adding a term from a second list: cancer, cardiovascular disease, and diabetes. Results from the two- and three-word queries were returned daily by e-mail and compiled weekly. Results were reviewed in two stages: first, for general relevance and to remove duplicates, and next, to identify relevant tests that had not been found previously.

From July 27 through October 5, 2009, we monitored the relative yields of our queries and eliminated those that returned no useful or duplicate results. Although disease-specific queries for cancer, cardiovascular disease, and diabetes yielded few unique items, we retained them because of their potential public health importance. The final set of queries included the following search terms: cancer gene test, cancer molecular test, cancer test, cancer test human, cardiovascular disease genetic test, diabetes genetic test, gene expression, genetic test, genetic testing, and proteomic test.

During a pilot phase (October 12, 2009, through March 31, 2010), the final set of queries was run daily and compiled weekly to create a list of online mentions of health-related genomic tests in Web pages, newspaper articles, and blogs. After eliminating duplicates and any tests that were not health-related (e.g., tests used for genealogy research or social networking), we manually extracted relevant information about each test. We adapted EGAPP definitions to describe four characteristics that, when combined, could be used to define a unique test: the disease or condition (e.g., irritable bowel syndrome), the type of test (e.g., expression of 32 genes in tumor tissue), the target population (e.g., women with a family history of ovarian cancer), and the intended use (e.g., to determine appropriate warfarin dosing).3 In addition, we noted the source query, the initial Web link, and trade and company names.

For future searchability, we further classified tests according to their proposed applications (screening, diagnosis, prognosis, pharmacogenomics, risk prediction, or other) and status (research and development, commercially available, marketed directly to consumers, Food and Drug Administration license claimed, and unknown). By the end of March 2010, we had finalized search, review, and curation procedures and developed an online database. Since then, daily horizon scanning, weekly review, and curation have continued. We analyzed data collected through June 2010 to describe the numbers and characteristics of genetic tests identified by this process.

RESULTS

The final set of Google Alert queries returned approximately 400 items each week (average 433, range 401–493 in June 2010). These often included large numbers of duplicates, which typically occurred when a scientific publication or news release describing a new test was covered by many different news outlets and blogs. We attempted to identify and document the primary source (often a test developer's or licensee's Web site) and excluded the others. The time required to screen Google Alert results varied from week to week, ranging from one to several hours; finding and extracting relevant information and entering it into the database required approximately 30 minutes per test.

We identified 151 tests during the pilot phase (October 12, 2009, to March 31, 2010) and 37 more during the first 3 months of routine data collection, from April through June, 2010 (Fig. 1). The large numbers of tests identified during the early weeks of the pilot phase reflected a backlog of recently introduced or existing tests with ongoing mentions in blogs and news items. After capturing information about these tests, subsequent mentions were treated as duplicate information. After Week 12, the number of new tests detected averaged approximately two to three per week.

Fig. 1
figure 1

Number of newly identified genomic tests by week, GAPP Finder, October 12, 2009, to June 30, 2010.

The distribution of tests by disease or condition is summarized in Table 1. Overall, nearly two thirds of tests (122/188, 65%) were related to cancer. Tests for colorectal, lung, and breast cancers accounted for about half of the cancer-related tests (63/122, 52%) and about one third of tests overall (63/188, 33%). The next largest group, tests related to cardiovascular diseases, comprised 19 tests (10%). Hereditary disorders accounted for 6% of all tests; the “Other” category (7%) included tests for conditions such as aging, early menopause, and weight management.

Table 1 Distribution of new genomic tests by disease or condition, October 12, 2009, to June 30, 2010

More than one fourth (51/188) of the tests we found were proposed for multiple uses, e.g., for detecting and classifying cancer (diagnosis) and for predicting survival (prognosis). After excluding 2 tests with 3 proposed uses, we summarized 235 proposed uses for the remaining 186 tests in Table 2. Overall, diagnostic tests constituted the largest category (66/235, 28%), followed by prognostic tests (54/235, 23%) and pharmacogenetic tests (41/235, 18%). Tests for risk prediction (i.e., for estimating the likelihood of future disease in currently asymptomatic persons) accounted for only 11% of the total. Although nearly half of all tests (88/188, including 2 tests marketed directly to consumers) were commercially available, Food and Drug Administration licensure was claimed for only 12 (6%) tests. There were 82 (44%) tests still in development and 6 for which no information about availability was provided.

Table 2 Numbers of genomic tests by combinations of proposed uses, GAPP Finder, October 12, 2009, to June 30, 2010

The information we extracted from the Internet sources for the 188 tests included in this analysis, in addition to those identified in subsequent weeks, is available online in a searchable database, the GAPP Finder (http://www.gappnet.org, Fig. 2).

Fig. 2
figure 2

Example of search results for colorectal cancer, GAPP Finder. Detail page for each test includes additional information, including trade names, Web links, and indexing terms. Trade names displayed for demonstration only.

DISCUSSION

We used Google Alerts to detect newly developed or launched genomic tests. The Internet's widespread use, and frequently updated entries from test developers eager for public notice, helped us capture new announcements and extract relevant scientific information. We acknowledge that many challenges to identifying and validating new genomic tests remain. For most new tests, the information available online is incomplete. We encountered missing data for test analyses, indications, and target populations. In some cases, test developers may not have shared what they consider to be proprietary information. Tests described as “genomic profiles,” which may include hundreds of different markers to predict risks for multiple conditions, present particular challenges. Information on the numbers, types, and combinations of markers tested in such profiles is often not presented; changes may not be reported, and we did not make special efforts to find them.11 In addition, we learned that it could be difficult to accurately identify tests when their names were changed or they were licensed to multiple companies.

After the pilot phase, when a large backlog of tests was reviewed, the amount of time required to identify new tests and enter them into the database averaged less than one-half day per week. Recently, we changed the Google queries to run weekly; during a 4-week period, this approach returned approximately one third as many items as the same queries run daily and compiled weekly (125 vs. 400 per week, on average), without any loss in sensitivity. Although additional refinement of the Google queries could further reduce the amount of time required to identify new tests, growth in the number of new tests will increase the time required to extract and enter the data into the GAPP Finder.

Our approach to horizon scanning aimed to maximize timeliness and transparency by finding and sharing information online; we saw this as the best way to complement other efforts to monitor and evaluate genomic tests. We are currently sharing data with National Center for Biotechnology to support their development of the proposed Genetic Test Registry; at their request, we have standardized some data elements to enhance exchangeability. In the future, active online surveillance for genomic test information could complement registry data, which will be reported by test developers on a voluntary basis. Our online database, the GAPP Finder, is currently accessible and embedded in the GAPP Knowledge Base, a suite of Web-based products that promote objective synthesis and timely dissemination of information on genome-based tests and technologies. We plan to compare GAPP Finder data prospectively with the results of other horizon scans for genetic tests, including both general scans (such as those conducted by Agency for Health care Research and Quality) and scans targeted to specific diseases or test types.

It seems unlikely that any single database will be able to capture all of the information essential for evaluating genomic tests. The field is growing and evolving rapidly and not always predictably. Different types of information—gathered by different processes, from different sources—are needed to evaluate the analytic and clinical validity and utility of genomic tests. Although test developers may be able to report analytic sensitivity and specificity to a voluntary registry, they will rarely have access to the epidemiologic data needed to evaluate clinical validity.12 Evidence for clinical utility is even more diverse and will continue to accrue after a test is in widespread use. We used information that is freely available online to establish our surveillance system and will share the results freely via an online resource, the GAPP Finder. By allowing easy navigation to other online resources on genomic testing, including the newly launched knowledge collection PLoS currents: evidence on genomic tests,13 we hope to enhance the value of our horizon scanning to a wide group of stakeholders.14