Introduction

Three million women in the United States live with breast cancer. One-third are Medicare beneficiaries.1 The median age of diagnosis is 61 years, and 40% of newly diagnosed cases are women over age 64 years.2,3 Approximately 96,000 Medicare beneficiaries are diagnosed yearly. The incidence rate has increased slowly, from 4.4% in 2003 to 5.4% in 2012.1 For women over age 69, the incidence rate is expected to increase from 24 to 35%.4 This trend has contributed to delays in care and increased costs.5,6 These dynamics illustrate the need for tools that stratify patients by risk so that clinicians can identify and prioritize patients who require aggressive treatment versus patients who may avoid or delay treatment.

One tool that stratifies patients by risk is the 21-gene recurrence score test (trade name, Oncotype DX Breast Cancer Assay) developed and conducted by Genomic Health (Redwood City, CA). The test measures tumor gene expression (16 cancer-related, including HER2, ER and PR, and 5 normative) to quantify the likelihood of distant recurrence at 10 years.7 Tumors are categorized as low-, moderate-, and high-risk, with an average recurrence risk of 7, 14, and 31%, respectively. These risk scores are used to predict the benefit from adjuvant chemotherapy after surgery for women with early-stage, lymph node–negative (LN-), estrogen receptor–positive (ER+), human epidermal growth factor receptor 2–negative (HER2-) breast cancer patients.8,9 The 21-gene test has been included in American Society of Clinical Oncology and National Comprehensive Cancer Network (NCCN) guidelines for treatment of stage I to IIA ER+/LN- tumors since 2007 and 2008, respectively.10,11 More recently, it has been included in NCCN guidelines for certain stage IIA–IIIB breast cancers with LN micrometastases.11 Preliminary results of the large TAILORx trial of more than 11,000 HR+/HER2-/LN- patients showed that patients with a low-risk score had remarkably low rates of recurrence with hormonal therapy alone.12 In 2016, the American Joint Committee on Cancer (AJCC) incorporated the Oncotype DX test in the Eighth Edition of the AJCC Cancer Staging Manual.13

Medicare coverage of the 21-gene test began in 2008.14 A retrospective analysis of Surveillance, Epidemiology, and End Results (SEER) data from 2005 to 2009 demonstrated that the 21-gene recurrence score influenced receipt of chemotherapy among Medicare beneficiaries.15

Several studies have analyzed the clinical utility and cost-effectiveness of the 21-gene test.16,17 However, few studies have analyzed national, patient-level utilization. The aims of this study were to identify and characterize Medicare beneficiaries who underwent the test in 2011–2012, analyze factors that predict the likelihood to undergo testing, and evaluate concordance with guidelines.

Materials and Methods

Data sources

We conducted a retrospective study using secondary data analysis methods. The primary data source was Medicare claims, including 2010–2013 100% inpatient MedPAR, Part B, and outpatient files. Additional data sources included the denominator file (100%), the hierarchical condition categories (HCC) risk score file (developed by RTI International), the Provider of Service file, the Health Resources and Services Administration Area Health Resource file,18 and the Dartmouth Atlas Hospital Referral Region (HRR) database.19

We used validated algorithms to identify incident breast cancer cases using Medicare claims data.20,21 We created a variable to indicate the year when a claim for breast cancer treatment was first submitted. Beneficiaries identified in 2010 claims included prevalent cases from previous years; we eliminated those patients from subsequent analysis. Beneficiaries who had initial claims in 2011 through 2013 represented newly identified cases. We analyzed and reported data for beneficiaries identified in 2011 to 2013 claims. However, our previous research revealed that approximately half the claims for the 21-gene tests were ordered more than 30 days after the surgical pathology procedure.22 Consequently, patients identified in 2013 claims may have had claims for testing in 2014 that were not available in our dataset. We therefore restricted multivariate and regional analysis to years 2011 and 2012.

Our analytic sample met the following criteria:

  • 1. Were beneficiaries of Medicare fee-for-service who had at least one inpatient claim with a breast cancer diagnosis code (International Classification of Diseases, ninth revision (ICD-9), diagnosis codes 174.0–175.9) within MedPAR or two or more outpatient or Part B claims on separate dates with a breast cancer diagnosis code.

  • 2. Had a short-term or specialty hospital stay in a physician office, inpatient or outpatient hospital, or ambulatory surgical center.

  • 3. Sought breast cancer treatment in 2011–2013, defined as having a breast biopsy and/or a breast surgery (ICD-9 procedure codes 85.11–85.48) claim in MedPAR, or breast biopsy (CPT/HCPCS codes 10021–10022; 19100–19103; 76942), surgical pathology analysis (88305, 88307, 88309), or a complex diagnostic test (83890–98; 83900–14; 83950–51; 86215; 86225; 86294; 86300–05; 86316; 87149; 88371–72) in outpatient or Part B files.

We restricted our analysis to claims that had breast cancer listed as the line item or principal diagnosis code. MedPAR, outpatient, and Part B claims were placed in a patient-level analytic file that became the basis for the study.

Variables

The unit of observation was the patient. The outcome variable was whether the patient had a claim for the 21-gene test. This was coded as 1 if the claim was identified as Genomic Health’s Clinical Laboratory Improvement Amendment (CLIA) 05D1018272. Independent variables expected to predict testing included patient demographics (age, gender, race), Medicaid status, clinical characteristics (ER status, LN status, secondary cancer, end-stage renal disease (ESRD), HCC score, inpatient stay for breast surgery, other molecular tests), and regional characteristics. We also analyzed the correlation between undergoing the 21-gene test and undergoing a similar genomic test for breast cancer. This was identified by a claim with the CLIA number for Agendia (Irvine, CA), the supplier of the Symphony suite of microarray-based assays for clinical management of breast cancer. These include the 70-gene signature test (MammaPrint) and TargetPrint and BluePrint. MammaPrint is used for selection of breast cancer patients unlikely to benefit from adjuvant chemotherapy.23 TargetPrint provides a quantitative assessment of tumor receptor status by measuring messenger RNA expression levels of three markers (ER, PR, and HER2). The BluePrint test identifies the molecular subtype for breast cancer–based expression patterns of 80 genes and refines the prognostic ability of MammaPrint. In this analysis, the existence of a claim with Agendia was treated as an independent variable.

Patient demographic variables and postal codes of residence were obtained from the Medicare Denominator enrollment file. Postal code was used to characterize the region in which the patient lived (county, HRR, Urban Influence Code, and distance from a National Cancer Institute (NCI) cancer center).

Clinical characteristics obtained from claims data included ER status (V86.0-V86.1), LN status (196.0–196.9), secondary cancers (197.0–198.9), and ESRD (585.6). Secondary cancers and ESRD diagnoses were included to identify patients who were inappropriate candidates for the test. Each patient’s health insurance claim number was used to obtain the weighted average HCC score, which served as a proxy to identify the patient’s health status. This number is derived from a patient’s prior year of claims to predict future costs. It represents the relative expenditures that are likely to be incurred for a patient, based on factors such as age, sex, Medicaid status, and individual disease groups.24 Patients with lower HCC scores are considered healthier.

Most independent variables, including the HRR, were dichotomous; others such as age and HCC score were continuous. Age was measured in years. Distance was measured in miles. HCC score was measured in increments of 0.001.

Statistical analysis

Statistical analysis was performed using Stata software (version 12.0; StataCorp, College Station, TX). We conducted univariate and bivariate analyses and obtained descriptive statistics for all variables. As appropriate, group differences were tested using the chi-square test of independence for categorical variables and t-tests or ANOVA for continuous variables. We performed multivariate logistic regression to identify characteristics associated with a claim for the 21-gene test. We performed sensitivity analysis to ensure that the observed effects were not attributable to an artifact of modeling.

Results

Claims and payments

Between 2010 and 2013, we identified 50,873 claims for the 21-gene test ( Table 1 ). The number of claims per year increased by 23%, from 10,988 in 2010 to 13,517 in 2013. The mean Medicare payment for the test remained stable at approximately $3,311, and the total expenditures amounted to $168,418,052 over the 4-year period. There were fewer claims for the 70-gene signature and other tests from Agendia. However, the number of claims for Agendia tests increased more than fourfold, from 308 in 2010 to 1,356 in 2013.

Table 1 Claims for 21-gene and 70-gene breast cancer tests among Medicare beneficiaries in 2010 and 2013

We identified 383,070 patients who had a new claim for breast cancer biopsy, surgery, complex laboratory diagnostic tests, or surgical pathology procedure from 2011 to 2013 for breast cancer diagnosis or treatment. Bivariate analysis identified several demographic, clinical, and regional characteristics associated with a claim for the 21-gene test. Table 2 summarizes patient characteristics by testing status from 2011 through 2013.

Table 2 Characteristics of patients who underwent the 21-gene RS test in 2011–2013

Demographic and clinical characteristics of patients tested

Of the 256,818 patients identified in 2011 through 2012, 25,352 (9.9%) had a claim for the 21-gene test. Patients tested were younger than those not tested (mean = 69.2 years, SD = 6.9 for tested; mean = 73.0, SD = 10.0 for not tested). The highest percentage of patients tested (16.0%) were 65–69 years of age. Women were more likely to be tested than men (9.9 vs. 7.5%). Analysis of testing across race/ethnicities revealed that among non-Hispanic whites who underwent breast cancer treatment, 10.4% were tested compared to 6.8% of Hispanics and 7.0% of non-Hispanic blacks. There was a negative association between Medicaid status and testing. Only 7.4% of Medicaid recipients were tested, compared with 10.4% of those who did not receive Medicaid.

Analysis of clinical characteristics illustrated substantial undercoding of ER status, LN status, and potentially secondary cancers. These variables are needed to identify the eligible patient population. ER status was missing from claims for 68.5% of patients. LN+ disease was coded for only 12.1% of patients, and secondary cancer was coded for 6.1% of patients; these rates were substantially lower than the prevalence of LN+ and metastatic breast cancers in cancer registries.25

The 21-gene test is specifically designed for ER+ patients. Accordingly, ER+ patients were much more likely to be tested than ER- patients (21.4 vs. 2.0%). LN+ patients were more likely to be tested than those who did not have a code for LN+ disease (13.6 vs. 9.4%). Only 2.6% of patients with a secondary cancer diagnosis were tested, which was expected because the test is recommended for early-stage breast cancer patients. Analysis of the HCC score revealed that patients with fewer comorbidities were more likely to be tested. The mean HCC score of patients tested was 0.72 (SD = 0.56) compared to 0.97 (SD = 0.82) for patients not tested. Yet, some patients with serious conditions such as ESRD were still tested. Having an inpatient stay was negatively associated with testing. Undergoing other molecular tests was positively associated with undergoing the 21-gene test. Patients who underwent another breast cancer genomic test supplied by Agendia were almost twice as likely to undergo the 21-gene test.

Regional characteristics

We conducted several analyses to measure the association between residence and testing status. The number and percentage of patients tested varied widely by state (as shown in Supplementary Table S1 online). New Mexico had the highest percentage (14%) of patients tested, and Rhode Island and the Commonwealth of Puerto Rico had the lowest percentage (each at 4–5%). We also analyzed the association between patient proximity to an NCI cancer center and testing status. In contrast to our previous research analyzing lung cancer molecular testing,26 distance to an NCI cancer center was not significant. There was also no association between testing status and county-level income or education level. There was little variability in testing between various metropolitan and nonmetropolitan areas, which were classified according to Urban Influence Codes27 (Supplementary Table S2 online).

We then considered the association between HRR and testing status. Some HRRs were positively associated with testing, whereas others were negatively associated. The distribution of 21-gene RS testing among HRRs is presented in Figure 1 . Data used to create the figure are presented in Supplementary Table S3 online. There was an eightfold difference between HRRs with the highest and the lowest levels of testing, ranging from 2.9% in Ogden, Utah, to 23.6% in Terre Haute, Indiana.

Figure 1
figure 1

Distribution of 21-gene RS testing by Hospital Referral Region in 2011–2012. The denominator is the number of patients who had breast tissue analyzed.

Prediction model

The relationship between several independent variables and the likelihood of undergoing the 21-gene test persisted in multivariate logistic regression analysis. Table 3 summarizes the results of the logistic regression model.

Table 3 Characteristics that predict use of the 21-gene RS test for breast cancer patients

Several demographic characteristics predicted testing. A yearly increase in age decreased the likelihood of testing by 4.0% (odds ratio (OR) 0.96, confidence interval (CI) 0.95–0.96). Non-Hispanic white patients were more likely to be tested (OR 1.46, CI 1.39–1.52). Medicaid recipients were less likely to be tested (OR 0.74, CI 0.71–0.78).

Several clinical characteristics positively predicted testing. Patients coded with ER+ breast cancer were almost four times as likely to undergo the 21-gene test (OR 3.90, CI 3.79–4.02). Patients who underwent another breast cancer molecular test were also more likely to undergo the 21-gene test (OR 2.80, CI 2.72–2.89). Clinical characteristics that were negative predictors of testing included secondary cancer diagnosis (OR 0.16, CI 0.14–0.17), ESRD diagnosis (OR 0.82, CI 0.67–0.99), and inpatient stay (OR 0.89, CI 0.86–0.92). Poor health status, as measured by HCC score, was also a negative predictor of testing. To interpret the OR (0.68, CI 0.66–0.69) of HCC score (which is measured in increments of 0.001), we obtained the logit coefficient. For every 0.10-point increase in HCC score, there was a 4.8% decrease in likelihood of being tested.

HRRs were both strong positive and negative predictors of testing. After adjusting for other demographic and clinical characteristics included in the model, patients most likely to be tested lived in McAllen, Texas (OR 2.84, CI 1.95–4.12). Patients who lived in Ogden, Utah, were least likely to be tested (OR 0.20, CI 0.09 to 0.44). Overall, there were 54 HRRs that were positive predictors of testing and 57 HRRs that were negative predictors.

Discussion

This is the first study to analyze utilization of the 21-gene test among all Medicare beneficiaries with breast cancer. Several studies have analyzed test utilization among smaller groups of patients.28,29,30 One study of 7,375 patients from 2006 to 2008 found disparities in access to testing among black patients and those with lower education levels.31 Two larger studies of 70,802 and 44,044 patients in the SEER registry data linked to Medicare claims found a fourfold increase in testing from 2005 through 2009 and reported little variability in testing across SEER regions or by patient demographics.15,32 Our population-level analysis of 256,818 Medicare patients found variations in access according to age and health status, which is consistent with clinical guidelines. However, in contrast to the study by Dinan et al.,32 our study identified income, racial, and regional variations in access to the 21-gene test. In bivariate analyses, Hispanics and blacks were tested less often than non-Hispanic whites. In multivariate analyses, non-Hispanic whites were 1.5 times more likely to be tested than non-whites. Higher rates of ER- and triple-negative breast cancer among black and Hispanic women may explain some differences. If ER status was reliably coded in the claims, then we could have analyzed this, but ER status was missing in most cases. Also, we cannot exclude that testing may have positively affected reporting of ER+ status because the test is covered only for ER+ patients. By contrast, Medicaid status was a reliable indicator of poverty, and Medicaid recipients were much less likely to undergo testing, illustrating income disparities in access to genomic medicine.

Analysis of regional factors that predict testing demonstrate that, although there has been widespread implementation of the 21-gene test, there is also a modest level of both underutilization and inappropriate utilization. A comparison of the 57 HRRs where patients were less likely to be tested and other benchmarking measures demonstrated that many of these regions also have lower acute-care hospital capacity than the national average. Constrained capacity to care for patients probably impacts delivery of guideline-concordant cancer genomics care. Interestingly, the second lowest rate of testing occurred in Rochester, Minnesota, the location of an important medical research facility (Mayo Clinic) that also has a large commercial reference laboratory. One explanation for the low utilization of the 21-gene test in the Rochester, Minnesota, HRR may be that the Mayo Clinic was testing patients as part of its Breast Cancer Genome Guided Therapy protocol (BEAUTY). Rochester, Minnesota, also has fewer acute-care hospital beds per 1,000 residents than the national average.19

In HRRs where patients had significantly higher ORs than the national average, there may be overutilization of testing. We identified patients who were tested despite having comorbid clinical conditions that put into question the clinical utility of the test. However, our prediction model revealed that patients with secondary cancers, ESRD, and higher HCC scores were less likely to be tested. Still, utilization among these patients was not zero. Clinical guidelines and Medicare reimbursement policies specify that testing should be applied only for ER+ patients with early-stage, predominantly LN- disease for whom test results impact clinical decisions. If chemotherapy is precluded as a treatment option by a patient’s comorbid conditions, age, or stated declination of chemotherapy, then the test has no clinical utility. Furthermore, the recurrence-risk algorithm may not be applicable to patients with serious comorbid medical conditions such as ESRD or those at an advanced age. The life expectancy for ESRD patients is 7.9 years.33 These patients are at greater risk for cancer and there are significant challenges in treating cancer patients with chemotherapy when they also have ESRD.34 There is no evidence that the algorithm for the 21-gene test would be valid in a patient population with extensive comorbidities.

Consistent with other studies of regional variation in health-care delivery, our results illustrate that use of cancer genomics is subject to local practice patterns. We conclude that clinicians who order other molecular tests are also likely to order the 21-gene test. Remarkably, patients who underwent the 70-gene genomic test, which has similar utility for selecting patients for adjuvant chemotherapy, were more likely to undergo the 21-gene test.

Reliance exclusively on claims data limited this analysis in several ways. It is not feasible to use claims data to conclusively identify newly diagnosed breast cancer cases. By eliminating the 2010 claims and restricting analysis to patients who had new claims for breast surgical pathology procedures in 2011–2012, we identified a cohort of mostly newly diagnosed patients who had tissue available for testing. However, we know from SEER data that there are approximately 96,000 newly diagnosed breast cancer cases per year among Medicare beneficiaries. Clearly, therefore, our denominator included some prevalent patients who may have been previously diagnosed and tested prior to their eligibility for Medicare. Undercoding of ER status, LN status, and HER2 status substantially impeded our ability to identify the appropriate patient population for testing and made it difficult to evaluate the level of concordance with clinical practice guidelines. We can develop estimates based on analysis of SEER data suggesting that 44% of newly diagnosed patients are ER+, LN-, and HER2- and may be eligible for the 21-gene test.2 This approach suggests that approximately 126,720 patients (44% of (96,000 × 3)) may be eligible. However, if we restricted the denominator to Medicare beneficiaries based on age and comorbid conditions (selecting the patients for whom the test is most likely useful), then the eligible patient population would be much smaller. In 2011–2012 claims, we identified 148,227 patients younger than age 75. Based on SEER estimates, 65,220 (44%) were ER+, LN-, and HER2- patients. There were 20,304 patients younger than age 75 who had claims for the 21-gene test. If all these patients were ER+, LN-, and HER2-, then approximately 31% of guideline-recommended cases were tested. There are several reasons for believing that the market penetration of the test is even higher than 31%. The estimate of the ER+, LN-, and HER2- population does not consider the number of patients who have comorbid conditions that limit the clinical utility of the test. It also does not exclude patients whose stated preference is to forgo chemotherapy regardless of their risk score.

The 21-gene test is clinically most useful for young, healthy patients with newly diagnosed breast cancer. A definitive study analyzing population-level implementation of the 21-gene test in the United States would include private-payer claims data. However, data use agreements for aggregated private-payer claims prohibit researchers from identifying providers. This restriction impedes identification of the 21-gene test in private-payer claims. Ideally, third-party payers and researchers could use claims data to identify the population for whom guidelines recommend genomic testing.

Several studies have illustrated that the 21-gene test generates cost savings for third-party payers.35 Other studies have demonstrated that the prognostic strength of a score derived from standard pathology findings can be comparable to that derived from the 21-gene test,36,37,38 especially for patients with high- and low-risk scores as opposed to intermediate-risk scores. The intermediate-risk category remains the most challenging for both genomics-based and conventional decision making. When the TAILORx trial is completed for all patient groups (projected for December 2017), the results may shed more light on the utility of the 21-gene test for this group of patients.39

In comparison to other genomics tests, the 21-gene test has achieved remarkable market penetration. Several factors may explain the successful translation of the test. There has been a long history of understanding the importance of tumor molecular biology for breast cancer diagnosis and treatment. Therefore, it is not surprising that breast cancer genomic tests were adopted more rapidly than lung cancer genomic tests. Early coverage and consistent inclusion of the 21-gene test across several clinical practice guidelines facilitated implementation. There are also several strong patient-advocacy groups for breast cancer that have an important role in patient education. The fact that some beneficiaries are undergoing multiple tumor gene expression tests may demonstrate consumer-based demand for testing.

Despite widespread diffusion of the 21-gene test, estimates of the eligible patient population suggest that there may also be underutilization of the test. With only 25,352 patients being tested in 2011–2012 and less access for minorities, Medicaid patients, and patients seeking care in 54 HRRs, there is clearly room for improvement. Our analysis has illustrated opportunities to expand appropriate use of genomic testing to improve risk stratification among Medicare beneficiaries diagnosed with breast cancer.

Disclosure

Genomic Health, Inc., is funding a clinical trial within the Veterans Healthcare Administration. J.A.L. is the principal investigator of the study, and B.B. worked on the study. The other authors declare no conflict of interest.