Introduction

Every year in the United Kingdom, ~52,200 women develop breast cancer and as a result, 11,400 will die.1 The National Health Service Breast Screening Programme (NHSBSP) aims to provide early detection and thereby reduces mortality from breast cancer.2 However, it has been argued that the NHSBSP causes harm due to false-positive test results and overdiagnosis, although these appear not to outweigh the benefits in terms of lives saved.2

One possible means of improving the balance of harms and benefits of screening is to develop risk-stratified screening, which is increasingly being considered internationally.3 The National Institute for Health and Care Excellence (NICE) recommended in 2013 that women at high risk of breast cancer should have more frequent screening by mammography (annual between 40 and 60 years) and be offered chemoprevention therapy (tamoxifen or raloxifene).4 However, these NICE guidelines cannot currently be implemented with women undergoing NHSBSP screening as women who attend are not currently assessed for their breast cancer risk.

To make risk stratification possible and for consequent benefits to women to be realised, a formalised approach to identifying and communicating women’s breast cancer risk is required. In line with this, the Predicting Risk of Cancer at Screening (PROCAS) study5 found that it was possible to accurately estimate the individual breast cancer risk of a population sample of women aged 46–73 years in Greater Manchester, England using the Tyrer–Cuzick (T-C) model.6 The T-C model was developed to estimate breast cancer risk using up to three sources of information: (a) self-reports, e.g., of family history, parity, BMI, height, age at menarche/menopause/at first live birth, HRT use, (b) breast density, obtained from mammography and (c) genetic information, i.e., single-nucleotide polymorphisms (SNPs) derived from saliva. Importantly, the additional cancers identified in higher-risk women were more likely to be stage 2 and above and women in these groups were therefore most likely to benefit from earlier detection.6, 7

Before a service that provides breast cancer risk estimates could be implemented, or tested in an effectiveness study, it is important to establish the likely harms and benefits of providing women with their personalised breast cancer risk. Potential harms of breast screening include overdiagnosis, invasive testing and incidental findings.2 A key set of harms and benefits to consider are the emotional, cognitive and behavioural impacts of receiving personalised breast cancer risk estimates8 that have been considered in depth for breast cancer screening.9 These include undue anxiety and worry about cancer in women informed of high risk. The main potential benefits of breast cancer risk feedback to women would consist of increased knowledge allowing more informed choices regarding prevention options (e.g., chemoprevention), as well as potential changes in risk reducing behaviours such as increased physical activity, and reduced alcohol consumption.

The main aim of the present research is to assess the psychological impact of receiving personalised estimates of breast cancer on women who participated in the PROCAS study. Specifically, this manuscript examines the psychological impact of receiving risk estimates based on: (a) the T-C model incorporating breast density information versus, (b) the T-C model including breast density information, and incorporating genetic information from SNPs extracted from saliva, with (c) comparison women also from the larger PROCAS study awaiting personalised risk information. For the T-C and T-C(+SNPs) groups, the impact of levels of risk estimates was also compared for three groups of women at 'moderate-risk' (10-year risk 5–7.99%), 'average risk' (10-year risk 2–4.99%) and 'below average risk' (10-year risk <2%).

Materials and methods

Design

The present research was approved by Liverpool East NHS Research Ethics Committee. It was a natural experiment of women from the larger PROCAS study, and employed a 3×3 between-subjects design, with each woman being allocated to one testing group and receiving one of three categorical risk results (shown in Table 1). Allocation to experimental condition did not employ randomisation. SNP testing was introduced after the PROCAS study had already begun, and SNP testing was carried out at sites where it was logistically easier to collect the data, e.g., proximity to study team and facilities for saliva testing. There were three testing groups:

  1. (a)

    Women who received risk estimates based on the T-C algorithm, based on self-completed questionnaires and incorporating information on mammographic density.

  2. (b)

    Women who received risk estimates based on the T-C algorithm, based on not only self-completed questionnaires and mammographic density, but also genetic information derived from SNPs.

  3. (c)

    A comparison group of women who would receive their risk estimates in the near future, but were asked to complete questionnaires for the present study before receiving their risk estimates. Study team members were aware of estimated 10-year breast cancer risk of women in this group.

Table 1 Experimental design showing 3×3 experimental groups, with number of women recruited per experimental group

Further, within each of these three groups, the risk of participating women was estimated to be within one of three categories: (a) 'moderate-risk' (between 5 and 7.99% risk over next 10 years), (b) 'average risk' (between 2 and 4.99% risk) and (c) 'below average risk' (less than 2% risk). Women identified as being at high risk, i.e., 8% or higher were excluded, as they had already received offers of individualised risk consultations by telephone or face-to-face with consultants.

Participants

A total of 131,373 women in Greater Manchester who were invited for routine breast screening during 2009 and 2014 were also asked to participate in the overall PROCAS project, via invitation letter alongside their NHSBSP. In line with usual practice in England, women aged 50–70 years were invited, as well as women invited as part of the AgeX trial, who were 47–59 and 71–73 years (NCT01081288). Of women who attended for screening, a total of 53,596 women (40.8%) consented to the research when they attended their screening appointment, at designated sites within hospitals or in large vans in locations to cover populations that are not within easy reach of the hospital screening sites. Those women who consented then completed a two-page questionnaire to assess their personalised risk of breast cancer. Of these 53,596 women, 51,011 (95.2%) had requested receiving this breast cancer risk estimate in due course. The age of the PROCAS sample was between 47 and 73 years and the majority (90.9%) reported their ethnicity as 'white'.

In the overall PROCAS study, there were 3.2% of women identified as being at high risk, 10.3% of women identified as being at moderate risk, 59.3% of women were identified as being at average risk, and 27.2% of women were identified as being at below average risk.

Procedure

Women in the present study were selected from the overall PROCAS study on the basis of being scheduled to receive their estimated breast cancer risk during the study period of November 2015 to August 2016. As the main PROCAS study had the central objective of evaluating the risk model, women who were invited to participate in the present study were expecting a considerable delay between being tested and receiving their results.

Women in the T-C, or T-C(+SNPs) groups who were included in the present study were recruited to the study and had risk assessment between May 2012 and July 2013; women in the comparison group were recruited and had risk assessment between September 2013 and March 2015. Women in the (a) T-C and (b) T-C(+SNPs) groups were posted letters containing risk estimates and explanatory leaflets between November 2015 and December 2015. Women in these groups were also posted questionnaires for the present study between January 2016 and April 2016, and women in the (c) comparison group were posted questionnaires for the present study between February 2016 and August 2016. Non-responders were posted reminders after 4 weeks.

The risk estimates were accompanied by explanatory leaflets that provided information on what their result meant, what they could do to reduce their risk and advice on detecting symptoms of breast cancer. The leaflets and letters were co-designed in line with guidance,10 and involved 'think aloud' interviews followed by semi-structured interviews with 37 women. Risk estimates were fed back in categories, following service user input into how best to communicate this information and what labels should be used to best convey personal risk (Evans et al.)5. No women in the present study received face-to-face consultations: these were only received by women at 'high risk' who were not included in the present study.

All participants were sent a participant information sheet for the present study, consent form, questionnaire and pre-paid envelope. Those women who did not respond within 4 weeks were sent a reminder and a further reminder if they still had not replied within a further 4 weeks.

Measures

All participants were asked to complete the following measures:

State anxiety was assessed using the six-item short form11 of the Spielberger State-trait Anxiety Inventory,12 with participants indicated for six emotion adjectives (e.g., 'upset') their present feelings by selecting one of the following response options 'not at all', 'somewhat', 'moderately' and 'very much'; α = 0.86.

Breast cancer worry was assessed using the Lerman Cancer Worry Scale,13 consisting of six statements such as: 'how often have you thought about your chances of getting cancer'. Participants endorsed one of the following response options 'never', 'rarely', 'sometimes' and 'almost all the time'; α = 0.87.

Perceived relative risk of developing breast cancer was assessed using a single item that asked women to rate their risk of developing breast cancer in the next 10 years, compared with other women of their age, from 'much lower', 'a bit lower', 'about the same', 'a bit higher' and 'much higher'.14

Intentions to change five health-related behaviours related to breast cancer prevention were assessed using five-point scales adapted from previously published items:15 follow a calorie-controlled diet, be physically activity, attend next mammogram, drink less alcohol and discuss prevention options with their GP (response options 'strongly disagree', 'disagree', 'neither agree nor disagree', 'agree', 'strongly agree'). Intention to take up chemoprevention was assessed only for participants in the T-C, or T-C(+SNPs) groups.

Participants in the T-C, or T-C(+SNPs) groups but not the comparison group were also asked to complete the following measures.

Satisfaction with the information was assessed using four items (α = 0.86), adapted from a previously published scale,16 which asked women how clear they found the information, how confusing they found it, how well informed they feel about their breast cancer risk information and how satisfied they with the amount of information given. Response options were 'strongly disagree', 'disagree', 'disagree somewhat', 'undecided', 'somewhat agree', 'agree' and 'strongly agree'.

Understanding of test result was assessed by asking women to select the single option among the following 10 options provided in Table 2, which best described what their test result meant, adapted from a previously published item.16

Table 2 Demographic and clinical characteristics (mean [SD], % [n]) of testing groups, with some characteristics also broken down by test results

Demographic and clinical information was obtained from the PROCAS questionnaire that was used to estimate breast cancer risk, and information that was routinely collected by the NHSBSP. The index of multiple deprivation was obtained from participant postcode, and rates the area deprivation in deciles from 1 (most deprived) to 10 (least deprived).17

Data analysis

To assess demographic and clinical differences in continuous variables between testing groups, ANOVAs were conducted with testing group as an independent fixed effect with three levels. Chi-squared analyses were conducted to assess differences between categorical demographic and clinical variables.

Effects on state anxiety, worry, risk perceptions and intentions to change each of five behaviours were examined independently. In each case, ANCOVAs were conducted with testing group and risk result group as independent fixed effects, and BMI, deprivation and age as covariates. For each variable, two analyses were conducted: (1) a comparison of participants in the T-C and T-C(+SNPs) groups, and (2) a comparison of participants in the combined T-C and T-C(+SNPs) groups vs participants in the comparison group. Two sets of sensitivity analyses were conducted, with these analyses repeated with additional covariates: (a) whether the initial mammogram was a first screen or repeat screen and (b) whether women had a subsequent mammogram scheduled before they completed the study questionnaire. These covariates did not make a significant contribution to the outcomes reported in Table 3 in either set of analyses, and had little effect on the results reported.

Table 3 Measures of anxiety, worry, perceived risk and overall satisfaction (mean [SD]), with statistical tests for differences by test groups, test results and their interaction

Intentions to take chemoprevention and satisfaction with information were also examined using ANCOVAs as above, but only comparing participants in the T-C and T-C(+SNPs) groups.

Results

Overall, 2138 women who were scheduled to receive feedback in 2015/2016 were sent questionnaires. Responses were received from 765 women (36% overall response rate). At least 200 questionnaires were sent to women in each of the nine experimental conditions.

Matching of testing groups

The testing groups were not generally well matched (Table 4), with differences in a number of demographic and clinical variables. There was also variation in the number of days from being tested for the PROCAS study to being sent a risk estimate. It is important to note that, for most variables, the T-C(+SNPs) and comparison groups were most dissimilar, with the T-C group being intermediate. For some variables, such as age, women in the T-C group (mean age = 54.3 years) and comparison group (mean age = 52.4 years) were similar compared to the women in the T-C(+SNPs) group (mean age = 61.9 years). Women in the comparison group were, on average, younger, with lower BMI and least deprivation.

Table 4 Intentions to change behaviours (mean [SD]), with statistical tests for differences by test groups, test results and their interaction

Psychological impact of receiving personalised risk estimates

There were no significant differences in state anxiety between women in the T-C and T-C(+SNPs) groups (Table 3). By contrast, women in the comparison group had higher anxiety levels than women in the T-C and T-C(+SNPs) groups. In both analyses, women with higher personal risk were more anxious (P < 0.001), although there was a significant moderator effect, where women in the comparison group had higher anxiety levels irrespective of personal risk levels.

When these analyses were repeated using the cutoff for anxiety disorder previously suggested,12 the pattern of results was highly similar. Of 763 women who completed the STAI, 111 (14.5%) exceeded this cutoff. This proportion was higher (χ2 = 11.28, df = 1, N = 763, P = 0.001) in 59/296 women in the comparison group (19.9%) relative to 52/412 women in the T-C and T-C(+SNPs) groups (12.5%). The proportions were not significantly different (higher (χ2 = 1.46, df = 1, N = 467, P = 0.144) between 26/197 women in the T-C only group (13.2%) and 26/270 women in the T-C(+SNPs) group (9.6%). In the T-C and T-C(+SNPs) groups, proportions of women exceeding the cutoff was related to risk estimate received (χ2 = 10.22, df = 2, N = 467, P = 0.006), with differences between women who received moderate (28/159 or 17.6%), average (11/140 or 7.9%) and below average (13/168 or 7.7%) risk estimates

There were no significant differences in cancer worry between women in the T-C and T-C(+SNPs) groups, although women with higher personal risk were more worried (see Table 3). Women in the comparison group reported less cancer worry than women in the T-C and T-C(+SNPs) groups.

There were no significant differences in perceived risk between women in the T-C and T-C(+SNPs) groups (Table 3). In this analysis, women who were told their personal risk was higher accordingly rated their personal risk as higher. In comparisons of women in the T-C and T-C(+SNPs) groups with women in the comparison group, women in the comparison group generally rated their personal risk as higher, and women who received higher personal risk estimates accordingly rated their personal risk as higher. In these later analyses, the results were subject to a significant interaction effect, where the association between personal risk group and perceived risk was much lower for women in the comparison group (who had not been informed of their personal risk level before they completed the questionnaire).

There was little evidence of differences between the T-C and T-C(+SNPs) groups in terms of satisfaction with information received (Table 3). Satisfaction was lower in response to higher-risk results than in response to lower risk results. This effect was modified by an interaction with average risk results associated with lower satisfaction for the T-C group and higher satisfaction with the T-C(+SNPs) group.

There were generally few effects of either testing group or test result group on intentions to change a variety of behaviours (Table 5).

Table 5 Percentage (n) of respondents in the T-C (+SNPs) and T-C only test result groups endorsing statements indicating understandings of the meaning of their latest test result, and their associated anxiety and concern (mean, SD)

There was considerable variation in understanding of test results (see Table 2). Participants in the T-C and T-C(+SNPs) groups were asked to select one option from the 10 possible options that they felt best described what their result meant. Five response options were selected by at least 10% of participants, of which three were in line with the intended aim of the risk communications. These three options were 'I am unlikely to develop breast cancer' (26.5% of respondents), 'I am unlikely to have breast cancer' (15.8%) and 'I am very unlikely to develop breast cancer' (10.3%). By contrast, the other two options were at odds with the intended aim of communications: 'I definitely do not have breast cancer' (23.1%) and 'I am likely to develop breast cancer' (10.7%). Participants who selected this last option also had high state anxiety and cancer worry scores. This option was selected more often by women in the T-C group (13.7% or 27/197) than by women in the T-C(+SNPs) group (8.5% or 23/271). It is also notable that 10% (27/271) of women in the T-C(+SNPs) groups selected 'This result does not tell me anything about my future likelihood of breast cancer', in contrast to only 4.6% (9/197) of women in the T-C group.

Discussion

There were very few differences on any measure between the T-C only and T-C(+SNPs) groups. Women in both the T-C and T-C(+SNPs) groups had lower-state anxiety levels than women in the comparison group who did not know their risk. Women with higher personal risk were more anxious in all groups, although there was a significant moderator effect, where women in the comparison group had higher anxiety levels irrespective of personal risk levels. Women in the T-C and T-C(+SNPs) groups had slightly higher cancer worry than women in the comparison group. Overall, state anxiety and cancer worry levels were low. There was little evidence that risk communication had effects on intentions to change any behaviour. Women were generally satisfied with information provided, and risk perceptions were in line with the information that had been communicated. When women in the T-C and T-C(+SNPs) groups were asked to select options that they felt best described what their result meant, there was considerable variation in understanding of test results with some misunderstandings apparent.

Strengths and weaknesses

The present research has two features that make it highly innovative. First, the present research reports on the effects of communicating personalised breast cancer risk to women from the general breast screening population for the first time. Previous studies of the effects of communicating personalised risk across a range of conditions have focussed on highly selected samples in research settings.18 Further, studies of the effects of communicating genetic risk information have had an even narrower focus on clinical samples that were selected on the basis of existing high-risk previously detected by phenotypic markers.19 By contrast, in the present research, risk estimates were offered to a population sample of women who attended routine breast cancer screening. This makes the present findings much more informative about the likely population impact of risk communication in the context of risk-stratified screening.8

The present research compares risk estimates derived from phenotypic sources with risk estimates derived from both phenotypic and genotypic sources, to allow a comparison of how people respond to these two kinds of risk information. An innovative feature of this comparison is that in the present research, risk estimates using genotypic information were based on a large number of genetic markers that allowed more fine-grained estimation of risk.5 The use of a well-validated model incorporating genetic risk information stands in contrast to much research on the effects of communicating of genetic risk that is based on risk conferred by single allele mutations.19

A further strength of the present approach is that the approach taken to risk communication involved considerable user input. First, the decision to communicate risk estimates using categorical descriptors of risk was taken following considerable input from a patient-public involvement (PPI) user panel. Second, the letters and leaflets that were used to communicate risk estimates and the implications of this risk information were developed using several rounds of co-design with women who were drawn from the same population as women in the present research.

A major limitation of the present research is that the three testing groups of women were identified on the basis of convenience in communicating risk estimation and were not well matched on a variety of demographic, clinical and service-level variables. Although this lack of matching is far from ideal, there are good reasons to believe that it has not had a major biasing effect on the results reported. Importantly, for most demographic, clinical and service variables the T-C and comparison groups were similar, with the T-C(+SNPs) group being most different. For instance, women in the T-C group (mean age = 54.3 years) and comparison group (mean age = 52.4 years) were of similar age, but women in the T-C(+SNPs) group were considerably older (mean age = 61.9 years). By contrast, the T-C and T-C(+SNPs) groups showed a similar pattern of responding on the psychological impact variables, which were different from women’s responses in the comparison groups. Thus, it is implausible for the differences in demographic, clinical and service-level variables to be able to account for the results reported. There was no major seasonal variation or major trends in breast cancer screening attendance across England and the North-West of England where the study took place using data routinely reported by Cancer Research UK/ National Screening Committee.5

A second limitation of the present research lies in the long duration between consenting to receive risk estimates and receiving risk results, typically around three years (Table 4). This was an inevitable feature of the PROCAS study, which had a primary aim of validating the T-C algorithm and hence required some delay between risk estimation and breast cancers to be clinically detected.5 Nevertheless, the impact of receiving risk estimates three years after consenting to receive these may well have reduced the psychological impact of those estimates. It should be noted however, that questionnaires were completed around three-months following the receipt of risk estimates, so the impact on women receiving risk estimates was comparatively recent.

A thjrd limitation of the present research relates to the representativeness of the PROCAS sample, with 40% of women attending screening taking up the offer of risk estimation. As previously reported,20 deprivation scores were substantially lower in women who took up the offer of risk estimation compared to women who declined this offer. This is somewhat inevitable, as around 72% of eligible women currently attend breast screening,21 and of those, women had to consent to participate in the PROCAS study to be eligible for the present research. Based on this, it could be argued that participants were a group that were particularly keen to know their breast cancer risk estimate and this may have influenced the psychological impact of the information. This is possible, although given uptake rates in the NHS BSP and the requirement that people consent to research, there is little that could be done to ameliorate this problem. Further, if risk estimation were to be offered as part of routine NHS BSP, it would be expected that uptake would be socially patterned, so our results may mirror this. Finally, the present analyses controlled for deprivation, to ensure that deprivation did not confound comparisons between groups.

A final limitation is that there was only a 36% response rate to the questionnaires in the present study. Thus, it is possible that non-responders could have more or less anxiety and cancer worry than responders, with the results obtained concealing a substantial number of women who had higher levels of distress. Although the non-response rate was relatively similar between the groups, underlying differences and bias could exist in non-responders between the groups.

What this study adds to existing literature

The present results show that women receiving personalised risk information have lower-state anxiety levels and only slightly higher cancer worry than women who were awaiting their risk estimates. It is important to note that overall, the levels of state anxiety and cancer-specific worry were low. The higher levels of state anxiety observed in the comparison group in the present study are consistent with the observation that uncertainty may provoke more anxiety than definitive bad news.16 That is, the levels of state anxiety in women in the comparison group are similar to those of women at moderate-risk in the other two testing groups, whereas women at average or below average risk in these two testing groups have lower anxiety levels. It should be noted however, that the present study found evidence that cancer-related worry was slightly higher in women who received risk estimates than in the comparison group. These findings are consistent with the observation that false-positive breast screening test results can result in slight elevations in cancer-related distress in the long term, even when general state anxiety shows no effects of such test results.9 Overall, mindful of the limitations of the evidence generated by the present study, these results are consistent with the hypothesis that providing women with breast cancer risk estimates does not produce large effects on anxiety or cancer worry.

The present study found no consistent effects of communicating personalised breast cancer risk on intentions to change lifestyle behaviour. Such a finding is consistent with a recent systematic review of systematic reviews of the effects of communicating personalised disease risk on lifestyle behaviour.18 This review found that there was little effect of personalised risk on behaviour, especially when risk information is communicated in numerical form, as opposed to using visual or other vivid means of feedback. These findings are also in line with a systematic review of the effects of communicating genetic risk information on behaviour, which found even clearer evidence that genetic risk information has little effects on behaviour.19 It is also notable that women who were told that they were at lower risk than other women did not show intentions to engage in unhealthier behaviour. This finding is in line with systematic reviews which show a general absence of effects on behaviour due to false reassurance from negative screening results.22

Implications for practice

The major implication of the present research is that there is no compelling evidence that communicating personalised breast cancer risk estimates causes any major harms. There may be effects on cancer-related worry, but these effects were small. Further, there may be room to ameliorate such effects by improved communication of risk information. The ratings of perceived risk in the present study were consistent with the idea that women in the T-C and T-C(+SNPs) groups understood the risk information they had received, and satisfaction with information was generally high. However, there was evidence that satisfaction was lower for women who were told they were at higher risk. Further, understanding of risk results was variable, and examination of cancer worry ratings according to understanding of results suggests that cancer worry may be particularly high for women who (erroneously) thought their result indicated that they were likely to develop breast cancer.

The general absence of effects on intentions to change any behaviour suggest that providing risk information alone is unlikely to result in changes in behaviour without additional support being provided.18 Nevertheless, the provision of risk information is a necessary part of identifying women at higher risk who can then be offered additional screening or chemotherapy.

Given these results, progressing to further consider the feasibility of risk-stratified screening appears warranted (see also further evidence23). Increasing frequency of screening of women at high-risk and decreasing frequency of screening of women at low-risk is likely to lead to a better balance of benefits and harms of screening, and may be cost-neutral.

Implications for research

The present research is currently the best available evidence on the likely harms and benefits of risk-stratified breast screening as part of the NHS Breast Screening Programme. Future studies of the feasibility of risk-stratified screening appear warranted, but could benefit from several improvements on the present research. First, future research should compare women who are offered personalised testing with women offered routine breast cancer screening, which would be a better comparator than women awaiting personalised test results. Allocation of women to these conditions should be done via randomisation or other procedures to ensure well-matched samples.

Future research should consider effects in both the short-term and longer-term. Such research should make greater use of theories of risk communication to better understand reactions to the information provided. It would be useful to examine effects on behaviour using validated tools, rather than intentions to change behaviour as in the present research. This could include objectively assessed behaviours such as attendance at future screening rounds and uptake of chemoprevention as well as changes in behaviour such as physical activity. Such research could examine how effects of risk communication on positive health behaviour could be facilitated. Future research could consider aspects of informed decision making other than just knowledge/ understanding of test results.

Should such work show there to be no major harms of stratified screening in terms of emotional and behavioural outcomes, and if uptake of the offer of stratified screening is sufficiently high, a full evaluation of the effectiveness and cost-effectiveness of such a programme would be warranted.

Conclusions

The present research indicates that there would be no major emotional or behavioural harms of providing women with 10-year breast cancer risk estimates. Given this, further work of better quality to establish the feasibility of risk-stratified screening is warranted. As part of such work, there is a need to improve the risk communication materials to increase understanding and satisfaction with information, particularly for women at higher risk.