Introduction

Face-to-face disclosure of genetic test results has been a longstanding practice,1 but the demand for services is outpacing the capacity of most clinics.2, 3 Some individuals, particularly those in rural areas, find it challenging to meet with genetic service providers in person.4 To expedite timely disclosure of genetic test results to as many patients as possible, genetic specialists are increasingly providing results via telephone.5, 6, 7

Analyses of telephone disclosure to date are encouraging. Many patients prefer it to in-person or mailed disclosure,8 and studies of its use during testing for hereditary cancer syndromes have shown comparable levels of patient understanding and satisfaction after telephone and in-person disclosure.9, 10, 11, 12, 13 Two randomized trials demonstrated noninferiority of telephone disclosure on a variety of outcomes, showing that differences from in-person disclosure on scales of test-related distress, knowledge, and overall satisfaction were not clinically meaningful.13, 14, 15 To date, however, studies of telephone disclosure have almost exclusively focused on hereditary cancer syndromes.

Genetic testing to determine risk of Alzheimer disease (AD) provides a rich context for examining disclosure of genetic risk information for common, complex conditions. The ɛ4 allele of apolipoprotein E (APOE) is a prevalent and robust genetic risk factor for AD,16 but is neither necessary nor sufficient for disease. The Risk Evaluation and Education for Alzheimer’s Disease (REVEAL) Study is a series of multicenter randomized clinical trials that have examined the impact of AD genetic susceptibility testing using APOE genotyping. Previous analyses have shown that testing does not increase risks for psychological harm, even among individuals who are ɛ4-positive;17, 18, 19 and that the majority of participants accurately recall test results.20, 21, 22 Here, we report on the third REVEAL Study trial, hypothesizing that mean scores on a variety of patient outcomes would be no worse following telephone disclosure than following in-person disclosure.

Materials and methods

Study population

We recruited cognitively normal adults using mailings to research registries, postings on institutional research websites, referrals from collaborating physicians, and advertisements in local newspapers at study sites in Boston, MA; Cleveland, OH; Ann Arbor, MI; and Washington, DC. We established targets to enroll equal numbers of adults over and under the age of 60, equal numbers of men and women, and a 1:3 ratio of subjects with zero and one AD-affected first-degree relative, respectively. Screening per prespecified criteria was conducted initially by phone, and again more extensively by study clinicians during the first in-person appointment. Individuals were excluded if they had histories suggestive of hereditary AD (two or more AD-affected first-degree relatives, family members with average AD onset under age 60), or scored below an education-adjusted 87 on the Modified Mini-Mental State Examination.23 We also screened out individuals with severe anxiety or depression per validated scales (defined below).

Study design

A multidisciplinary team designed the study protocol, approved by institutional review boards at each study site and an independent external Ethics and Safety Board, and the study was registered at ClinicalTrials.gov (NCT00462917). The study protocol, statistical code, and data set are available from the authors upon request. Briefly, participants who completed a phone interview and written questionnaire received educational materials developed in a prior trial.17 Participants then met with a genetic counselor and provided a blood sample for APOE genotyping at a CLIA-certified laboratory. Participants provided informed consent by telephone prior to the initial phone interview, then again in person prior to the blood draw for genotyping. Disclosure sessions were scheduled at the discretion of individual sites. Genetic counselors disclosed APOE genotypes and AD risk information using a script that allowed participants to raise any questions or concerns they had. Numeric estimates of lifetime (cumulative incidence from birth to the age of 85 years. Range: 6% to 73%) and remaining risk for AD (cumulative incidence from current age to the age of 85 years) were accompanied by graphs of AD risk curves. The methods for calculating risk estimates and generating risk curves were published previously.24, 25 A written summary of the risk assessments were given to participants at the end of in-person disclosure sessions or mailed to participants following telephone disclosure.

A 2 × 2 factorial design determined whether participants received risk assessments in person or via telephone, and whether or not participants additionally learned about an association between the ɛ4 allele of APOE and an increased risk for coronary artery disease (CAD), reported separately.19 Participants were randomized equally within strata, in blocks of size four, into “AD-only, in-person disclosure,” “AD-only, telephone disclosure,” “AD + CAD, in-person disclosure,” and “AD + CAD, telephone disclosure” arms. Randomization strata were defined by site, age (<60 vs. ≥60), family history of AD, and gender. Serially numbered envelopes concealed randomization statuses until the initial in-person visit. Randomization occurred prior to this appointment, which included confirmation of study eligibility, to allow consent forms customized to AD + CAD or AD-only randomization status to be mailed before in-person review.

Outcomes were assessed through questionnaires administered 6 weeks, 6 months, and 12 months after genotype and genetic risk disclosure. At the end of 6-week and 6-month follow-up appointments and after surveys were completed, genetic counselors verbally reminded participants about their genotypes, risk estimates, and about APOE-CAD associations (if appropriate). For safety purposes, subjects whose anxiety or depression scores exceeded standard cutoffs for severe mood disorders or increased by more than 15 points from baseline were immediately interviewed by a genetic counselor.

Measures

Outcome variables

Outcomes of interest are bolded and included anxiety, depression, test-related distress (assessed via two scales), positive impact, recall of results, and subjective impact at 6 weeks, 6 months, and 1 year after results disclosure, all assessed via self-administered questionnaires. Outcomes were selected based on concerns about how telephone disclosure may impact the communication of information.26

Psychological outcomes included measures of general anxiety, general depression, two scales assessing test-related distress, and positive impact. Anxiety was assessed using the Beck Anxiety Inventory (BAI),27 with scores ranging from 0 to 63 (>8: mild, >15: moderate, >25: severe). Depression was assessed using the 20-item Center for Epidemiological Studies–Depression Scale (CES-D),28 with scores ranging from 0 to 60 (>10: mild, >16: moderate, >26: severe).29 Test-related distress was assessed with the 15-item Impact of Event Scale (IES),30 with scores ranging from 0 to 75 (≥20: significant distress) and the distress subscale of Impact of Genetic Testing for Alzheimer’s disease instrument (IGT-AD distress), with higher scores on a 0–60 scale indicating more negative feelings.31 Positive impact was assessed with the positive subscale of the IGT-AD, where items were reverse-scored such that lower scores on a 0–20 scale indicated more positive feelings.31

Recall of results was measured as the sum of correct responses when participants report back their results: (i) number of AD risk-increasing alleles, (ii) genotype, (iii) lifetime AD risk estimate, and (iv) remaining AD risk estimate. The risk allele and genotype items were assessed via multiple choice and included a “don’t remember” option. Risk estimate items asked participants to provide their lifetime and remaining risk percentages and were considered correct if responses were within 5% of communicated results, as used in prior analyses.20, 21, 22 Item prompts encouraged participants to provide their best guess if they did not remember their estimate. An additional open-ended item asked, “What other disease did we tell you is associated with the APOE gene?” but is analyzed separately from other recall items because it was administered only to participants randomized to AD + CAD disclosure.

Subjective impact was assessed by asking participants to rate the “overall impact” that the risk information had on a 5-point scale (“very negative” to “very positive”).

Other variables

Participants reported demographic information during the phone interview and on the baseline written questionnaire. Personal and family history of AD, cardiovascular disease, and other medical conditions were assessed during the blood draw appointment, as was numeracy using a validated scale with scores ranging from 0 to 8 based on the number of items a participant answers correctly.32 Participants also completed a validated 4-item version of a scale assessing self-reported comfort with numbers, with scores ranging from 1 to 6 based on mean ratings across items.33 After disclosure sessions, genetic counselors completed a chart note and indicated if topics discussed addressed any 12 issues, including the accuracy of results or preventive measures. Disclosure session length was calculated in minutes by comparing the start and finish times recorded on chart notes.

Statistical analysis

A priori goals to enroll 280 participants and achieve 256 disclosures were set to power original hypotheses to compare disclosure of AD and CAD risk information against disclosure of only AD risk information, the main focus of the trial.19 Here we focused on the comparison between telephone and in-person disclosure. We used t-tests and chi-squared tests to compare demographics of the randomization arms and to analyze who dropped out after randomization. We used chi-squared tests to compare dropout rates of study arms after randomization, but before results disclosure. We used t-tests of log-transformed times to compare how long after the blood draw that disclosure sessions occurred and to compare the length of disclosure sessions, and used chi-squared tests to determine whether topics discussed during disclosure sessions varied by randomization status.

Telephone disclosure was intended to streamline service delivery rather than improve patient responses. Therefore, we used a noninferiority framework to test whether outcomes were no worse after telephone disclosure than after in-person disclosure. We asserted noninferiority of telephone disclosure on measures of anxiety, depression, test-related distress, and positive impact if the upper limit for the confidence interval (CI) of mean differences between telephone and in-person randomization arms was below a predefined margin.34

We used longitudinal analyses for all outcomes, including all observed data and imputing data for the few missing observations. We used generalized linear models fit with generalized estimating equations. For analyses of BAI, CES-D, IES, IGT-AD distress, and IGT-AD positive scores, we used a log link and gamma distribution to compare outcomes by phone versus in-person randomization status, as these measures were very skewed and had a high proportion with zeros. For analyses of ordinal variables, recall was dichotomized to compare full recall (all items recalled correctly) against less than full recall, and subjective impact was dichotomized to compare responses of “very” and “somewhat positive” against responses of “neutral” and “somewhat” or “very negative.” Analyses of recall and subjective impact used a logit link and binomial distribution. We used an autoregressive working correlation structure with robust standard errors to account for the repeated measures within participant. A value of one was added to BAI, CES-D, IES, and IGT measures to shift their distributions away from zero. Models included terms for phone or in-person disclosure randomization status, time as a categorical variable, interaction between time and randomization arm, and the corresponding baseline psychological measure where applicable. Analyses on the IGT-AD distress subscale included a term to account for an interaction between telephone/in-person randomization status and AD-only/AD + CAD randomization status (P < 0.05). Additional analyses were conducted to compare arms in APOE ɛ4–positive and APOE ɛ4–negative participants, as well as to further adjust for age, gender, education, race, family history of AD, AD + CAD or AD-only information disclosure, numeracy, and self-reported comfort in analyses of information recall.20 We also conducted analyses where missing data were not imputed to minimize potential biases in noninferiority analyses.34 These adjusted analyses and available case analyses are omitted from this report, because unadjusted models using imputed data were considered conservative in comparison (i.e., we report noninferiority only when it was demonstrated in all analyses). We used contrasts to compare randomization arms at specific time points and overall for a time-averaged comparison.

Consistent with analyses of prior REVEAL Study trials,17, 19 outcomes at 12 months were considered primary, while outcomes at 6 months and 6 weeks were considered secondary. Data for AD + CAD and AD-only disclosure arms were pooled in analyses because interactions between AD-only/AD + CAD randomization status and in-person/telephone disclosure randomization status were not observed (P values for two-way tests of interaction between the two treatment arms and three-way interactions between the treatment arms and time were all greater than 0.05 except for analyses of IGT-AD distress scores, as noted earlier). The margin of noninferiority for BAI, CES-D, and IES scores was 5 points, as used in prior REVEAL Study trials, indicative of medium to large effect sizes (Cohen’s d = 1.03, 0.64, and 0.66, respectively).17, 18 This criterion was a more conservative margin than 5- to 10-point definitions of “clinically meaningful” differences used in other studies that are based on the intervals between cutoffs for minimal, mild, moderate, and severe anxiety and depression.35, 36, 37 For IGT-AD scores where prior studies have not identified a meaningful difference, we set margins following the strategy used in a related noninferiority analysis:13 three points, representing a shift from “sometimes” to “never” on a single item and indicative of medium effect sizes (Cohen’s d = 0.43 and 0.48 for distress and positive subscales, respectively). Noninferiority margins were not established for recall or subjective impact measures.

We used 99% CIs for primary analyses (12-month outcomes) to account for testing of seven outcomes (anxiety, depression, test-related distress per the IES and IGT-AD, positive impact, and recall of results). These CIs are more conservative than a Bonferroni correction, given guidelines to use one-sided tests.34 To be consistent across outcomes, we used 99% CIs on all secondary analyses. Additional secondary analyses used 99% CIs of two-group tests of proportions to compare recall rates on individual recall items by randomization status and McNemar tests to compare whether participants were more likely to recall specific items more often than others.

Analyses included only participants receiving genetic risk information (genotype data for participants who provided blood but dropped out of the study before the disclosure session was destroyed per the institutional review board–approved protocol). We assumed data were missing at random and imputed missing values using multiple imputation (Markov chain Monte Carlo procedures with 40 imputed data sets). All analyses were conducted using SAS, version 9.3 (SAS Institute, Cary, NC).

Results

Of 290 randomized participants, 257 (88.6%) received genetic risk disclosure (Figure 1) with no observed differences in dropout rates between telephone and in-person disclosure arms (Δ = 0.1%, P = 0.99). Twenty-four randomized participants withdrew before disclosure for the following nonexclusive reasons: concerns about potential emotional responses (8); study demands (4); personal or family health problems (4); concerns about privacy, confidentiality, or discrimination (4); lack of AD prevention options (3); moved (2); no longer interested (1); concerns about test limitations (1); and desire for higher remuneration (1). One participant passed away, and four were lost to follow-up. Four others did not meet eligibility criteria due to low Modified Mini-Mental State scores, high CES-D scores, questionable family history of AD, and failure to attend study appointments.

Figure 1
figure 1

Enrollment flow chart. *A second randomization occurred at this point and is described in “Materials and Methods.” AD, Alzheimer disease; CAD, coronary artery disease.

Demographic characteristics of participants who received genetic risk information did not vary by disclosure method (Table 1) and were similar to those of the second REVEAL Study trial except for the inclusion of individuals without an AD-affected first-degree relative.17 On average, telephone disclosure occurred 7.4 days sooner than in-person disclosure (27.8 vs. 35.2 days after the blood draw, respectively; P = 0.002). Disclosure sessions ranged in duration from 6 to 40 minutes for telephone disclosure and 5 to 50 minutes for in-person disclosure, and, on average, telephone disclosure sessions were 6.6 minutes shorter than in-person disclosure sessions (P < 0.001). Disclosure session chart notes showed that participants were more likely to discuss preventive options during in-person disclosure than telephone disclosure (26% vs. 14%, respectively, P = 0.015). Disclosure sessions did not vary in length by APOE status (P = 0.15 in bivariate analyses, P = 0.16 in analyses controlling for telephone/in-person randomization status).

Table 1 Characteristics of subjects who received genetic risk disclosure

One year after disclosure and across randomization arms, participants remembered 3.0 of 4 items correctly, on average, and scored well below cutoffs for clinical concern on anxiety and depression scales (see Table 2). Overall, 24% of participants receiving in-person disclosure and 23% of participants receiving telephone disclosure reported moderate anxiety, depression, or test-related distress at one or more follow-up time points, with no differences observed by disclosure method (P = 0.61). The majority of participants (65.6%) rated the subjective impact of their risk assessment as positive, while 7.9% rated it negative and 26.5% rated it neutral. Mean scores on measures of general anxiety and depression were well below cutoffs for clinical concern at all time points, and positive impact scores increased over time (P < 0.001). With the exception of IGT-AD positive scores, CIs for mean differences between telephone and in-person disclosure arms were below prespecified margins of noninferiority for all scales at all time points in analyses that were not stratified by APOE status (Table 2) and in analyses of ɛ4-negative participants (Table 3). However, analyses in ɛ4-positive participants supported noninferiority of telephone disclosure at 12 months only on anxiety. Furthermore, upper limits of 99% CIs exceed margins for noninferiority on IES and IGT-AD distress scores at the 6-week and 6-month time points, and upper limits of 99% CIs exceeded margins for noninferiority on the IGT-AD positive scale at 6 weeks.

Table 2 Mean outcome scores by randomization group and time after APOE genotype disclosure
Table 3 Mean outcome scores, stratified by APOE status

Additional analyses suggested that telephone disclosure outperformed in-person disclosure on some components of recalling results (Table 4). Participants were more likely to correctly recall lifetime AD risk estimates at 6 months (Δ = 17.6%, 99% CI: 4.6% to 30.6%) and 12 months (Δ = 15.8%, 99% CI: 2.1% to 29.6%) after telephone disclosure compared to in-person disclosure. Independent of disclosure method or genotype, recall scores were higher at 6 weeks than at 6 months (Δ = 0.3, P < 0.001) or 12 months (Δ = 0.4, P < 0.001). At 12 months, ɛ4-positive participants were more likely than ɛ4-negative participants to correctly recall their genotype (74.0% vs. 56.3%, P = 0.008). Across genotypes and randomization arms, data for individual recall items were highly correlated (r > 0.57 in all pairwise comparisons), although participants were less likely to recall their specific genotypes compared to the number of risk alleles they had or their lifetime or remaining AD risk estimates (all pairwise comparisons P < 0.001). Participants who did not accurately recall numeric AD risk estimates tended to provide estimates lower than what were reported during disclosure sessions: 64.2% of participants with inaccurate lifetime risk recall provided estimates lower than those reported, and 61.5% of participants with inaccurate remaining risk recall provided estimates lower than those reported.

Table 4 Percentages recalling specific genetic test results correctly by disclosure method and time point, adjusted for APOE genotype

Discussion

On average, telephone disclosure of genetic risk information about AD was safe and did not increase psychological risks. Although we were unable to demonstrate statistical noninferiority for some outcomes within the subset of individuals who learned that they were APOE ɛ4–positive, their mean anxiety and depression scores were still well below cutoffs for clinical concern. These findings are notable because risk disclosure for AD has been used as an example of potentially distressing information, given the lack of proven preventive strategies. Anticipation of telephone disclosure did not appear to affect participants’ willingness to receive a genetic risk assessment, and wait times for disclosure sessions were shorter when results were disclosed by telephone, with the disclosure sessions themselves briefer than in-person disclosure sessions.

These findings are encouraging given that many genetic service providers are already disclosing test results via phone. Telephone disclosure is a longstanding practice in prenatal settings given time-sensitive implications for pregnancies, and it is now in wider use in other settings when results suggest no carrier or disease risks.6 Our data are consistent with findings from research on hereditary breast and ovarian cancer syndromes, showing that telephone disclosure does not increase risks for misremembering results or psychological harms.11, 12, 13 In fact, telephone disclosure seemed to improve participants’ abilities to retain numeric risk estimates. This improvement may be because participants received information at two distinct time points, first during phone disclosure sessions, then again when participants received the information via mail. The time savings noted in our study were also comparable to the time savings observed in the BRCA1/2 telephone disclosure studies, although our overall session lengths in our trial were often shorter. The comparably greater time needed for disclosing BRCA1/2 test results may be attributable to time needed to discuss medical management decisions and implications for other family members.

Genotype-specific analyses of our data suggest that genetic service providers should be mindful of the potential for additional distress when disclosing results indicating increased risk via telephone. Noninferiority was supported on all psychological outcomes except positive impact at 12 months among ɛ4-negative participants. However, noninferiority of telephone disclosure was not demonstrated on some 12-month outcomes among ɛ4-positive participants. Possible explanations for this include the reduced ability of providers to read nonverbal cues since genetic counselors are trained to use and respond to body language to encourage patients to engage with information and to reduce discomfort.1 Telephone conversations are often more succinct than those that are face to face,38 and participants or genetic counselors in our study may have felt pressure to be parsimonious about what they addressed on the phone. Our telephone disclosure protocol did not allow participants to view reports while genetic counselors were discussing results, and made it difficult for participants to have a support person present during disclosure. Whatever the reasons, genetic service providers may need to be selective about when to disclose potentially distressing genetic risk information via telephone. Predisclosure anxiety and depression scores are the strongest predictors of postdisclosure outcomes. Therefore, telephone disclosure of ɛ4-positive results to individuals with significant anxiety or depression may not be optimal.18 One strategy that many clinics have already implemented is to have an in-person consultation following telephone disclosure when patients have an increased genetic risk.10, 11

Our study also evaluated recall of genetic test results one year after disclosure. Typically, studies have examined recall immediately after or within 1–2 months of disclosure. Even after reiterating information 6 weeks and 6 months after disclosure, we observed a steady decrease in recall over time. Of note, this trial had higher percentages of correct recall, even at 6 weeks, than the first REVEAL Study trial,21 perhaps due to the study population (e.g., more self-referred participants, individuals without AD-affected first-degree relatives) or due to modifications in disclosure protocols. Data from this trial also reinforced findings from prior trials, showing that individuals who did not recall their lifetime and remaining risks correctly tended to underestimate their risk estimates, and that ɛ4-positive individuals were more likely to correctly recall their genotype than ɛ4-negative individuals.21 It is likely that individuals who were APOE ɛ4-positive saw a greater need to remember this information given their increased risk status.

Limitations

Study participants had higher education and numeracy than the general population. Measures were self-administered, and we could not ascertain whether participants reviewed written summaries during recall assessments. For questions about lifetime and remaining AD risk estimates, participants were not given a “don’t remember” option, and participants may have been categorized as correctly recalling information after guessing. Results may not generalize to situations in which results might be disclosed by health-care providers without genetics expertise. Post hoc calculations based on the differences that we observed suggested that we only had approximately 17% power to confirm noninferiority among ɛ4-positive participants on the IES, where differences were closest to noninferiority margins, and that we would need approximately 106 ɛ4-positive participants in each arm to achieve 80% power. Our sample size also limited our ability to examine interactions between telephone disclosure and factors such as education and race.9 Our study did not assess potential benefits such as increasing access to testing, decreasing costs to patients, or opening up clinic slots.5, 9 We also did not assess the impact of telephone disclosure on outcomes such as risk perceptions, health behaviors, and advance planning, although we have reported on the impact of genetic risk disclosure for AD on these outcomes previously.19, 39, 40, 41 Such analyses may be important given differences we observed between randomization arms in the likelihood of discussing prevention during disclosure sessions.

Conclusion

There remain important challenges to telephone disclosure of genetic test results, including reimbursement policies that encourage in-person visits, but payers are increasingly willing to pay for telephone consultations,42 and companies are now offering telephone genetic counseling. In addition, efforts are already under way to improve the timely disclosure of test results using patient portals associated with electronic medical record systems, mailing negative results, Web-based disclosure, and video conferencing. AD risk disclosure is likely to grow in importance as prediction algorithms improve by incorporating polygenic, clinical, and environmental risk factors.43 Our results regarding telephone disclosure are encouraging given the need for efficient and effective approaches for conveying risk information and test results for common, complex diseases.