A systematic review and quality assessment of individualised breast cancer risk prediction models

Louro, Javier; Posso, Margarita; Hilton Boon, Michele; Román, Marta; Domingo, Laia; Castells, Xavier; Sala, María

doi:10.1038/s41416-019-0476-8

Download PDF

Article
Open access
Published: 22 May 2019

Epidemiology

A systematic review and quality assessment of individualised breast cancer risk prediction models

Javier Louro^1,2,3,
Margarita Posso^1,2,
Michele Hilton Boon⁴,
Marta Román^1,2,
Laia Domingo^1,2,
Xavier Castells ORCID: orcid.org/0000-0002-2528-0382^1,2 &
…
María Sala^1,2

British Journal of Cancer volume 121, pages 76–85 (2019)Cite this article

12k Accesses
84 Citations
23 Altmetric
Metrics details

Subjects

Abstract

Background

Individualised breast cancer risk prediction models may be key for planning risk-based screening approaches. Our aim was to conduct a systematic review and quality assessment of these models addressed to women in the general population.

Methods

We followed the Cochrane Collaboration methods searching in Medline, EMBASE and The Cochrane Library databases up to February 2018. We included studies reporting a model to estimate the individualised risk of breast cancer in women in the general population. Study quality was assessed by two independent reviewers. Results are narratively summarised.

Results

We included 24 studies out of the 2976 citations initially retrieved. Twenty studies were based on four models, the Breast Cancer Risk Assessment Tool (BCRAT), the Breast Cancer Surveillance Consortium (BCSC), the Rosner & Colditz model, and the International Breast Cancer Intervention Study (IBIS), whereas four studies addressed other original models. Four of the studies included genetic information. The quality of the studies was moderate with some limitations in the discriminative power and data inputs. A maximum AUROC value of 0.71 was reported in the study conducted in a screening context.

Conclusion

Individualised risk prediction models are promising tools for implementing risk-based screening policies. However, it is a challenge to recommend any of them since they need further improvement in their quality and discriminatory capacity.

The current status of risk-stratified breast screening

Article Open access 26 October 2021

Personalized early detection and prevention of breast cancer: ENVISION consensus statement

Article Open access 18 June 2020

Proactive breast cancer risk assessment in primary care: a review based on the principles of screening

Article Open access 03 February 2023

Background

Mammography screening has been associated with a reduction in breast cancer mortality and therefore organised breast cancer screening programmes using mammography have been well established worldwide.^1,2,3,4 Although there is not a single consensus, current screening programmes generally recommend biennial or triennial screening in Europe and annual or biennial screening in the US with variations in the recommended targeted age.^2,3,4,5 These recommendations usually consider age as the sole risk factor leading women to be invited for screening from age 40–50 until age 70–74, depending on the programmes.

The likelihood that a woman will benefit from screening mammography depends on her risk for developing clinically significant breast cancer in her lifetime. Taking individual risk factors beyond age into account should enable the classification of women into groups at varying risk of breast cancer. Personalised risk-based screening going beyond the current ‘one-size fits all' recommendation may increase the effectiveness and benefit-harm balance of breast cancer screening. Individualised risk prediction models for breast cancer are a key element to develop risk-based screening approaches since they are designed to quantify the risk that can predict whether an individual woman would develop breast cancer in a defined period.⁶

A number of risk prediction models that include classical risk factors are commonly used in clinical contexts.⁷ However, organised screening programmes do not use these models routinely. One reason for not including these models in screening context is the high uncertainty with regards to its applicability in screening settings. Also, the emergence of new risk prediction factors such as the expression of single nucleotide polymorphisms (SNPs) needs to be appropriately summarised before recommending one of the models into screening practice.

Like any other source of information, risk prediction models have limitations that should be evaluated before using them. A rigorous risk of bias assessment of the existing individualised risk models is needed to clarify the overall quality and applicability of each model. Therefore, the aim of this systematic review is to update the existing evidence, conduct a critical appraisal and risk of bias assessment and summarise the results of the individualised risk models which are used to estimate the risk of breast cancer in women in the general population.

Methods

Data sources and searches

We performed a systematic review of the literature following the standard Cochrane Collaboration methods⁸ and adhering to the PRISMA statement reporting recommendations.⁹ A predetermined review protocol was registered (CRD42018089842) in the PROSPERO database (date of registration 1 March 2018). The Patient, Intervention, Comparison, Outcomes (PICO) question of this systematic review is the following: Should individualised breast cancer risk prediction models vs. no risk prediction models be used to develop risk-based screening approaches for women in the general population?

We retrieved relevant literature by using a combination of controlled vocabulary and keyword search terms in the following databases: (i) Medline (accessed through PubMed); (ii) The Cochrane Library; and (iii) EMBASE (accessed through Ovid). Terms related to breast cancer recurrence were excluded in order to avoid retrieving citations out of the scope of this systematic review. We adapted the search algorithms to the requirements of each database and used validated filters to retrieve systematic reviews and primary studies as needed. We reviewed references of included studies that could potentially fulfil our eligibility criteria. The detailed search strategy is reported in Supplementary table 1.

We searched primary studies of individualised breast cancer risk models searching each database from its inception up to February 2018.

Study selection

Eligible studies were those published in English that reported a model to estimate the individualised risk of breast cancer in women in the general population. We included models that assessed more than one risk factor and reported the quantitative characteristics of the risk prediction model. If multiple publications were based on the same individualised risk model, the most extensive report of the model in terms of risk factors reported was chosen. We excluded external validation studies that replicated previous models without adding any additional information such as a new design for collecting the inputs data, modifications on the risk factors or the risk model method.

Articles identified from the search were loaded into EndNote X7.7.1 for Windows (2008, Version 12.0.4) and duplicates were removed.

Data extraction and quality assessment

One reviewer screened the search results based on title and abstract, and a second reviewer performed a quality check of the study screening by reviewing 20% of the references. Two reviewers independently confirmed eligibility based on the full text of the relevant articles. In case of disagreement between researchers, the inclusion of studies was determined by consensus. We reported the result of this process with a PRISMA flowchart (Fig. 1).

We used a predefined form to extract the following information from included studies: author, publication date, country, study design, the name of the model if available, sample characteristics, sample size, type of breast cancer, the method of analysis, and validation of the model. Data abstraction was conducted by one reviewer and checked by another.

Two reviewers carried out the assessment of the risk of bias independently and final quality assessment was based on consensus. We used the ISPOR-AMCP-NPC Questionnaire¹⁰ to assess the relevance and credibility of each risk prediction study and the following sources of limitations: (i) internal and external validation; (ii) bias due to the study design for risk estimates; (iii) limitations in data inputs; (iv) appropriateness of the model analysis; (v) reporting bias; (vi) interpretation bias; and (vii) conflict of interest. The risk of bias for each domain was rated as low, high or unclear. For systematic reviews we used the AMSTAR 2 critical appraisal tool.¹¹

Data synthesis and analysis

We evaluated the model validation by assessing both the discriminative power and the calibration accuracy estimated for the women in the general population. When available in the included publication, we extracted the area under the receiver operating characteristic curve (AUROC), the net reclassification index (NRI) and the expected observed (E/O) ratio. The NRI was not included in the tables because it was only reported in 2 out of 24 articles. The characteristics of the included models and the risk prediction outcomes reported preclude the possibility to pool data across studies. Therefore, a narrative synthesis has been conducted. Key study characteristics, validation and accuracy of individual risk models, and methodological quality are described in tables and summarised in a narrative manner. Results are presented according to the original model that they reported.

Results

Study inclusion

The database searches for primary studies retrieved 2974 citations, of which 79 were considered potentially relevant. These 79 studies were screened in full text. We found a systematic review of Anothaisintawee et al.,⁷ which we used as a source of primary studies. In addition, two studies were included after a manual inspection of papers’ references.^12,13 After the full text was checked, 24 studies^{12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35} met the inclusion criteria and were considered in the evidence synthesis. Details about study inclusion with reasons for exclusion are described in the flow-chart (Fig. 1), and a list of references to excluded studies is provided in Supplementary table 2.

Characteristics of the included studies

The included studies can be grouped according to the risk model that they reported, the Breast Cancer Risk Assessment Tool (BCRAT), the Breast Cancer Surveillance Consortium (BCSC), the Rosner & Colditz model, the International Breast Cancer Intervention Study (IBIS), and other original models. The study by Zhang et al.¹³ is included in two of the groups (BCRAT and Rosner & Colditz models) because it provides information of both models and presents its results separately. A brief summary of the 24 included studies is presented in Table 1 and the extended characteristics in Supplementary table 3.

a.
Breast Cancer Risk Assessment Tool ‘BCRAT’ model. This model was first published in the United States in 1989 assessing age, family history of breast cancer, age at first birth, menarche, and previous biopsies as risk factors for predicting individualised breast cancer risk.²² After this first publication, eight studies were identified that were based on BRCAT model but modified the data collection design, assessed additional risk factors or changed the statistical method. In addition to the five risk factors proposed in 1989, other variables such as body mass index (BMI), weight, hormone replacement therapy (HRT), alcohol consumption, physical activity, diet, breast density, atypical hyperplasia, breast inflammatory disease, parity, a polygenic risk score or hormones information have been included in updated versions (Table 1).^{13,14,16,17,20,23,25,26,30}
b.
Breast Cancer Surveillance Consortium ‘BCSC’ model. One relevant variation of the BCRAT model opens the path to the emergence of the BCSC model first published by Tice et al. in 2008 in the United States.³¹ In this study, Tice et al. used data from a cohort to create an individualised risk prediction model that combines age, family history, previous biopsies, breast density, and ethnicity. The BCSC model has been further evaluated by other authors^12,24,29,32 and it currently includes previous benign breast diseases and polygenetic risk score using SNPs as risk factors (Table 1).
c.
Rosner & Colditz model. Parallel to the BCSC model, another model based on the ‘Nurses' Health Study’ cohort developed by Rosner & Colditz in 1996 was also developed in the United States. This model currently includes 11 risk factors: age, menarche, menopause, age at first birth, age at subsequent births, previous benign breast disease, HRT, family history, weight, BMI, alcohol consumption, and oestradiol levels.^18,19,27,28 In the same way as in the BCRAT, Zhang et al.¹³ analysed this model adding breast density, a polygenic risk score and endogenous hormones as risk factors.
d.
International Breast Cancer Intervention Study ‘IBIS’ model. The IBIS model³³ includes genetic information adding the BRCA genes and a hypothetical susceptibility gene.
e.
Other models. Four studies reporting different models were also identified.^15,21,34,35 Apart from the above-mentioned risk factors, the models also assessed other variables such as abortion, breastfeeding, height, and previous mammography results. Particularly relevant is the Eriksson model²¹ since it was the only one targeted to the screening population. In this study, the authors included risk factors that were available at mammography screening examination: age, BMI, HRT, family history, menopause, breast density, and presence of microcalcifications and/or masses in the screen-mammogram.

Table 1 Summary of included studies

Full size table

Discriminatory accuracy

Fifteen out of the 24 studies reported the discriminatory accuracy as the AUROC (Table 1 and Fig. 2).

a.
BCRAT model. The first BCRAT model publication did not report the AUROC, however, later publications of this model reported a range that varied from 0.56 to 0.68. The three publications that included the original risk factors, age, family history of breast cancer, age at first birth, menarche, and previous biopsies, reported low AUROC values, 0.56 to 0.62.^14,20,23 Similarly, the AUROC reported by Boyle et al.¹⁶ and Matsuno et al.²⁵ were 0.60 and 0.61, although these authors added BMI, HRT, alcohol, physical activity and diet, and ethnicity into the model. Zhang et al.¹³ with the new variables reach an AUROC of 0.65 and Tice et al.³⁰ reported in 2005 a higher AUROC value of 0.68 which was obtained just adding breast density to the original five risk factors (Table 1). Zhang et al.¹³ also reported the NRI to validate that his model improved the previous ones with a result of 8%.
b.
BCSC model. The published value of the AUROC for the BCSC model was moderate, ranging from 0.64 to 0.69. Tice et al. included age, family history, previous biopsies, breast density reported by the Breast Imaging Reporting and Data System (BI-RADS), and ethnicity into the model in 2008 and obtained a value of 0.66 for the AUROC.³¹ Instead of BI-RADS, Kerlikowske et al. assessed changes in breast density obtaining a similar result, 0.64.²⁴ Using previous benign breast disease, Tice et al. obtained a slightly higher AUROC value of 0.67 in 2015.³² More recently, in 2015 and 2016, Vachon et al.¹² added to the model a polygenic risk score and Shieh et al.²⁹ a combination between a polygenic risk score and BMI reporting a value of 0.69 and 0.65 for the AUROC respectively (Table 1). Vachon et al.¹² also demonstrated the improvement of discriminatory accuracy estimating the NRI with a positive result of 11%.
c.
Rosner & Colditz model. The discriminatory accuracy of this model varied from 0.61 to 0.68. The authors assessed age, family history, age at first birth, menarche, BMI, benign breast disease, menopause, HRT, age at subsequent births, alcohol, and weight. They obtained an AUROC of 0.64 and 0.61 for ER + /PR + and ER-/PR- tumours, respectively.¹⁹ The addition of oestradiol levels to the model was tested by Rosner et al. who obtained a 0.65 AUROC value in 2008.²⁸ Finally the addition of a polygenic risk score, mammographic density and endogenous hormones by Zhang et al.¹³ reached a 0.68 AUROC value (Table 1) and obtained an improvement of the discriminative accuracy also reflected in a NRI of a 9.5%.
d.
IBIS model. The IBIS model original paper³³ does not include any validation and does not present the AUROC. Nevertheless, it has been externally validated showing an AUROC of 0.57 which increases to 0.61 when adding mammographic density.³⁶
e.
Other models. Overall, the AUROC values of these models were not higher than those shown by the above-mentioned models, varying from 0.62 to 0.64, although they included a large number of risk factors. However, the model reported by Eriksson et al.²¹ did show an AUROC of 0.71 that was the highest AUROC value identified in this systematic review (Table 1). This model, in addition, is the only one that estimates a 2-year risk, while the rest of models estimate the risk at a longer time horizon. This could explain the difference in AUROC values since it becomes more difficult to predict risk as the time horizon increases.

Calibration accuracy

Nine out of the 24 studies reported the calibration accuracy as the E/O ratio (Table 1).

a.
BCRAT model. Of the 10 studies derived of the BCRAT model, five reported the calibration accuracy. Banegas et al.¹⁴ presented heterogeneous results depending on the provenance of the population, reporting an E/O ratio of 0.93 for US-born and 1.52 for foreign-born women. Although Matsuno et al.²⁵ added new variables to the original BCRAT model, the E/O ratio was 0.85, which was the lowest of the group, whereas the other studies published E/O ratios that varied from 0.93 to 1.03^16,20,23 (Table 1).
b.
BCSC model. Tice et al. published in 2008 a value of 1.03 for the E/O ratio when looking at 5-year risk.³¹ Using previous benign breast disease, they obtained a similar result in 2015, with an E/O ratio of 1.04 for 5-year risk and 1.05 for 10-year risk.³² When Kerlikowske et al. assessed changes in breast density the ratio decreased obtaining a 0.98 for 5-year risk and 0.95 for 10-year risk.²⁴ The studies of Vachon et al. and Shieh et al. did not present validation regarding the calibration accuracy of the model (Table 1).
c.
Rosner & Colditz model. Of the five studies based on the Rosner & Colditz model,^{13,18,19,27,28} none of them reported calibration accuracy statistics of their models for the women in the general population.
d.
IBIS model. The IBIS model original paper³³ does not report any calibration statistic. Nevertheless, other articles have validated it showing an E/O ratio of 1.67.³⁶
e.
Other models. The study Barlow et al.¹⁵ was the only one that reported calibration accuracy and presented the closest E/O ratio to one of all the studies included in this review taking values of 1.00 and 1.01 for pre and post-menopausal status respectively (Table 1).

Quality assessment

The quality of the included studies was moderate due to some limitations in the discriminative power, study design, and data inputs. The studies did not show important limitations with regards to the validation, appropriateness of the model analysis, reporting or interpretation of the results (Fig. 3). A summary of the risk of bias assessment per each source of limitation is presented here and the detailed appraisal and judgements in Supplementary table 4.

Internal and external validation

Ten studies^{14,15,16,17,20,23,25,26,30,31} validated their models by comparing the results with those published by Gail et al.,²² three studies^24,29,32 compared with Tice et al.,³¹ one²¹ compared with both Gail et al.²² and Tyrer et al.,³³ one¹³ compared with both Gail et al.²² and the results of a Rosner & Colditz model external validation³⁷ and three studies did not report the model validation in the primary articles.^19,22,34 Six studies assessed internal validation with a sample of the population that generated data for the model,^{15,16,24,29,31,32} and four with an external population.^14,20,23,25 Despite not having reported the external validation in the primary articles, the Rosner & Colditz model^18,19,27,28 reported external validation in a subsequent article mentioned before.³⁷ Nine studies used the expected/observed event ratio to measure the calibration accuracy of the model.^{14,15,16,20,23,24,25,29,31}

Bias due to the study design

Thirteen studies used a case-control design to obtain breast cancer risk estimates,^{12,13,14,16,17,20,21,22,23,25,26,29,34} five studies used prospective cohorts,^{15,18,19,27,28} and four models used retrospective cohorts.^24,30,31,32 The study of Wang et al.³⁵ and the study of Tyrer et al.³³ used risk estimates obtained from a systematic review of the literature.

Limitations of data inputs

Sixteen studies obtained most of the input parameters from self-reported questionnaires.^{13,14,15,16,17,18,19,20,22,23,25,26,27,28,30,34} The study of Matsuno et al.²⁵ also imputed ethnicity for women with missing data.

Appropriateness of the model analysis

Thirteen studies^{12,13,14,15,16,17,20,22,23,25,26,29,34} used logistic regression to estimate the risk of having breast cancer according to the assessed risk factors, five used proportional hazard Cox models,^{21,24,30,31,32} four used Poisson regression models,^18,19,27,28 and the other two studies used risk estimates obtained from a systematic review of the literature.^33,35

Reporting bias

Twenty one studies reported all relevant and necessary information for the model creation.^{12,13,14,15,16,17,18,19,20,21,22,23,25,26,27,28,29,31,33,34,35} Conversely, a critical lack of information was found in the other three studies.^24,30,32

Discussion

Summary of main results

This systematic review included 24 studies that aimed to estimate the individual risk of developing breast cancer in women in the general population. Twenty studies were based on four specific risk models (the BCRAT, the BCSC, the Rosner & Colditz and the IBIS model),^{16,17,18,19,20,22,23,24,25,26,27,28,29,30,31,32,33} whereas four studies used other original models.^15,21,34,35 The most extensively used were the BCRAT, IBIS and the BCSC models. The number of risk factors included in the models ranged from five to 18. Other than age, which was the only risk factor present in all models, the BCRAT model also included family history, age at first birth, menarche, and previous biopsies. Breast density, benign breast disease, and polygenetic score were predominant in the BCSC model. Although during the last decade the models have shown improvements in their discriminatory accuracy, it remains at best moderate with a maximum AUROC value of 0.71 reported by Eriksson et al.²¹ The calibration accuracy was very heterogeneous ranging from 0.85 to 1.52. Furthermore, the quality of the studies was not high due to limitations in the discriminative accuracy, study design, and data inputs.

Agreements and disagreements with other reviews

In this systematic review, we found that the number of individualised breast cancer risk prediction models has increased steadily over the past three decades. This finding is in agreement with the narrative overview published by Cintolo-Gonzalez et al. in 2017,³⁸ and it updates the results of a previous systematic review published by Anothaisintawee et al. in 2012.⁷ In contrast to these reviews, however, our aim was to provide innovative information regarding the quality of the identified prediction models. Thus, we have identified and rigorously analysed the strengths and limitations of 24 individualised models in order to adjust our conclusions to the quality of the evidence.

We have identified two new trends with regards to the use and development of the models, which are the increased use of the BCSC model and the inclusion of common genetic variation in the prediction models. As compared to the information published in the review of Anothaisintawee et al.,⁷ we found that in contrast to the BCRAT and Rosner & Colditz models that were the most frequently cited models up to 2010⁷ the BCSC model has concentrated the attention of several authors during the last five years, although its discriminatory accuracy has not dramatically improved. Second, none of the models in the review of Anothaisintawee et al.⁷ included genetic information as a risk factor. By contrast, we have identified four models including genetic information: the IBIS model³³ that includes genetic phenotype in their updated version, the BCSC model that includes a polygenetic score in both 2015¹² and 2016²⁹ publications, as well as the article by Zhang et al. that added a polygenic risk score to both the BCRAT and the Rosner & Colditz models.¹³

Most of the included studies reported the AUROC to determine the probability that a randomly chosen woman with disease would be correctly categorised as higher risk compared to a randomly chosen woman without disease. The discriminatory accuracy estimate does not express whether the model is more or less accurate in predicting the risk of specific individuals but measures the capacity of the model to determine which women are at higher/lower risk for developing breast cancer. Thus, both calibration accuracy and discriminatory accuracy should be assessed. Contrary to what is expected, we found that authors reported the E/O ratio only in less than half of the included studies. In addition to the AUROC value, the studies of Zhang et al. and Vachon et al.^12,13 also reported an improvement in the net reclassification index (NRI) of the BCRAT, and Rosner & Colditz models, as well as in the BCSC model, respectively.

Overall, the information provided by the AUROC and the E/O ratio was consistent suggesting that the included models have moderate discriminatory accuracy and calibration accuracy when applied to the women in the general population. Nevertheless, it must be taken into account that despite the great importance of validation in terms of AUROC and E/O ratio, the presence of low values of AUROC or clearly different from 1 values of the E/O ratio does not mean that these models are useless. On the contrary, models are clinically useful even with moderate AUROC since they can reclassify individuals at the extremes of risk.³⁹ Thus, the verdict on risk models should not be based solely on these estimators. Instead, they need to be prospectively evaluated in clinical trials. In fact, there are currently two very large randomised trials assessing risk-based screening strategies. Both of them are using individualised models. Both the IBIS and the BCSC models are being tested in the European trial MyPeBS (My Personalised Breast Screening).⁴⁰ Also, the BCSC model is being tested in the US WISDOM trial (Women Informed to Screen Depending On Measures of risk).⁴¹

Applicability and completeness of evidence

The distribution of risk factors in such different populations may affect the applicability of the models to different contexts. The fact that different subtypes of breast cancer may have different genetic markers is widely accepted.⁴² These differences, the nature of breast cancer itself and its low incidence may condition a low discriminatory accuracy of a model. In other words, in the general population, there is a low probability of having breast cancer (even in the highest risk group). This low probability may mean that the discriminatory power of a breast cancer risk model won’t be as high as a risk model targeted to other common diseases such as cardiovascular events, for instance. Another potential limitation in the applicability in the screening context is the completeness and the number of included risk factors, which ranged from five to 18. Nevertheless, some potentially relevant risk factors such as genetic markers have been only included in few models. Recent studies^43,44 have shown that adding genetic information as a risk factor can increase the discriminative accuracy of the different models which opens the line for further evaluation. An evaluation that should first assess the calibration of these models in prospective cohort studies.

Overall, women are usually screened using mammography. Particularly in Europe, most programmes invite women for screening every 2 years.² The presence of some mammographic features in these screening mammograms may be related to the risk of developing breast cancer, as has been recently pointed out by some authors.^21,45 Only one of the 24 models identified in this systematic review included microcalcifications and masses found at mammography as risk factors in the model.²¹ Time-changing variables such as radiological variables may not be as stable as personal history. However, in a screening context, this information is especially relevant because it is easily available from previous screening examinations.

Quality of the evidence

We found variability in the design of the studies that were used to obtain the cancer risk estimates. Notably, the study design used in the BCSC model was a cohort, which is a robust epidemiology design that allows developing and validating prediction models. Another frequently used design was the case-control study, nested or not. Contrary to the cohort study, time-changing variables may not be well obtained in case-control studies.

Regarding the external validation, the models showed some limitations given that few of them were further evaluated in different contexts. As far as we know, there are numerous scientific publications reporting external model validation in different settings and countries. These studies may help to understand the performance of a model in a specific context, but this issue was out of the scope of our review and, therefore, we have not included external validation studies. As an example of the relevance of these studies, we can inform that the BCRAT model has more than 50 articles informing the external validation of these models in different countries.⁴⁶ The Rosner-Colditz model has also been validated in several studies, one of the most complete validations being the one performed in 2013 by the authors themselves.³⁷ On the other hand, we found that although the Eriksson et al.¹⁹ model reports the highest AUC (0.71), this model has not been externally validated, which increases the uncertainty about its applicability.

Also, there were limitations in data inputs, mostly due to the fact that in several models the information was provided by self-reported questionnaires that may affect the accuracy of the results. Finally, there is a limitation when comparing the AUROC or E/O ratio across the models given that there is great heterogeneity amongst them. The models were targeted to different populations, included different sets of risk factors, and often used different methodologies. We have taken into account all these variations and presented the results by model categories.

Potential biases in the review process

This systematic review was limited to studies published in English and did not involve an active search for grey literature, which is literature that is not formally published in sources such as books or journal articles. Therefore, some models may not have been identified. However, since we have conducted a comprehensive literature search in Medline, EMBASE and The Cochrane Library, we estimate that the loss of information due to the study selection criteria is low. Some key genetically oriented models, such as BOADICEA⁴⁷ and BRACAPRO⁴⁸ were not included in this review because they are aimed at high risk women and not useful for women in the general population in the screening context. Full-text screening and data abstraction process were performed by two researchers, which increase the quality of the review process. Moreover, as far as we know, this is the first review assessing the risk of bias of the identified risk prediction models.

Conclusions

The development of individualised breast cancer risk prediction models has increased over the last three decades, but the improvements in both the discriminatory power and calibration accuracy are still limited. Despite the time that has passed since the first model was published and a large number of available publications, only one model addressed to women attending a population-based screening programme²¹ was identified. Currently, it is still a challenge to recommend any of the models as the standard for predicting individual risk in screening context. However, the models have been updated by adding new variables, such as common genetic variation or radiologic variables and have shown improvements in their quality as well as in their discriminative accuracy. These new variables need further evaluation to confirm its promising impact in the prediction capacity to propose personalised strategies for breast cancer screening.

References

The Independent UK Panel on Breast Cancer Screening. The benefits and harms of breast cancer screening: an independent review. Lancet. 380, 1778–1786 (2012).
The European Commission Initiative on Breast Cancer (ECIBC). Recommendations from European Breast Guidelines. 2016. https://ecibc.jrc.ec.europa.eu/recommendations/.
U.S. Preventive Services Task Force. Screening for breast cancer: U.S. Preventive Services Task Force recommendation statement. Ann. Intern. Med. 151, 716–726 (2009). W-236.
Article Google Scholar
Oeffinger, K. C., Fontham, E. T., Etzioni, R., Herzig, A., Michaelson, J. S., Shih, Y. C. et al. Breast cancer screening for women at average risk: 2015 guideline update from the American Cancer Society. JAMA 314, 1599–1614 (2015).
Article CAS Google Scholar
Mandelblatt, J. S., Stout, N. K., Schechter, C. B., van den Broek, J. J., Miglioretti, D. L., Krapcho, M. et al. Collaborative modeling of the benefits and harms associated with different U.S. Breast Cancer Screening Strategies. Ann. Intern Med. 164, 215–225 (2016).
Article Google Scholar
Steyerberg, E. W. Clinical Prediction Models. A Practical Approach to Development, Validation, and Updating. (Springer Science, New York, 2009).
Google Scholar
Anothaisintawee, T., Teerawattananon, Y., Wiratkapun, C., Kasamesup, V. & Thakkinstian, A. Risk prediction models of breast cancer: a systematic review of model performances. Breast Cancer Res Treat. 133, 1–10 (2012).
Article Google Scholar
Higgins, J. P. T., Green, S. (eds). Cochrane Handbook for Systematic Reviews of Interventions Version 5.1.0. 2011. http://handbook-5-1.cochrane.org/.
Moher, D., Shamseer, L., Clarke, M., Ghersi, D., Liberati, A., Petticrew, M. et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst. Rev. 4, 1 (2015).
Article Google Scholar
Jaime Caro, J., Eddy, D. M., Kan, H., Kaltz, C., Patel, B., Eldessouki, R. et al. Questionnaire to assess relevance and credibility of modeling studies for informing health care decision making: an ISPOR-AMCP-NPC Good Practice Task Force report. Value Health 17, 174–182 (2014).
Article CAS Google Scholar
Shea, B. J., Reeves, B. C., Wells, G., Thuku, M., Hamel, C., Moran, J. et al. AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. BMJ 358, j4008 (2017).
Article Google Scholar
Vachon, C. M., Pankratz, V. S., Scott, C. G., Haeberle, L., Ziv, E., Jensen, M. R., et al. The contributions of breast density and common genetic variation to breast cancer risk. J. Natl. Cancer Inst. 107, dju397 (2015).
Zhang, X., Rice, M., Tworoger, S. S., Rosner, B. A., Eliassen, A. H., Tamimi, R. M. et al. Addition of a polygenic risk score, mammographic density, and endogenous hormones to existing breast cancer risk prediction models: a nested case-control study. PLoS Med. 15, e1002644 (2018).
Article Google Scholar
Banegas, M. P., John, E. M., Slattery, M. L., Gomez, S. L., Yu, M., LaCroix, A. Z., et al. Projecting Individualized Absolute Invasive Breast Cancer Risk in US Hispanic Women. J. Natl. Cancer Inst. 109, djw215 (2017).
Article Google Scholar
Barlow, W. E., White, E., Ballard-Barbash, R., Vacek, P. M., Titus-Ernstoff, L., Carney, P. A. et al. Prospective breast cancer risk prediction model for women undergoing screening mammography. J. Natl. Cancer Inst. 98, 1204–1214 (2006).
Article Google Scholar
Boyle, P., Mezzetti, M., La Vecchia, C., Franceschi, S., Decarli, A. & Robertson, C. Contribution of three components to individual cancer risk predicting breast cancer risk in Italy. Eur. J. Cancer Prev. 13, 183–191 (2004).
Article CAS Google Scholar
Chen, J., Pee, D., Ayyagari, R., Graubard, B., Schairer, C., Byrne, C. et al. Projecting absolute invasive breast cancer risk in white women with a model that includes mammographic density. J. Natl. Cancer Inst. 98, 1215–1226 (2006).
Article Google Scholar
Colditz, G. A. & Rosner, B. Cumulative risk of breast cancer to age 70 years according to risk factor status: data from the Nurses’ Health Study. Am. J. Epidemiol. 152, 950–964 (2000).
Article CAS Google Scholar
Colditz, G. A., Rosner, B. A., Chen, W. Y., Holmes, M. D. & Hankinson, S. E. Risk factors for breast cancer according to estrogen and progesterone receptor status. J. Natl. Cancer Inst. 96, 218–228 (2004).
Article CAS Google Scholar
Decarli, A., Calza, S., Masala, G., Specchia, C., Palli, D. & Gail, M. H. Gail model for prediction of absolute risk of invasive breast cancer: independent evaluation in the Florence-European Prospective Investigation Into Cancer and Nutrition cohort. J. Natl. Cancer Inst. 98, 1686–1693 (2006).
Article Google Scholar
Eriksson, M., Czene, K., Pawitan, Y., Leifland, K., Darabi, H. & Hall, P. A clinical model for identifying the short-term risk of breast cancer. Breast Cancer Res. 19, 29 (2017).
Article Google Scholar
Gail, M. H., Brinton, L. A., Byar, D. P., Corle, D. K., Green, S. B., Schairer, C. et al. Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. J. Natl. Cancer Inst. 81, 1879–1886 (1989).
Article CAS Google Scholar
Gail, M. H., Costantino, J. P., Pee, D., Bondy, M., Newman, L., Selvan, M. et al. Projecting individualized absolute invasive breast cancer risk in African American women. J. Natl. Cancer Inst. 99, 1782–1792 (2007).
Article Google Scholar
Kerlikowske, K., Gard, C. C., Sprague, B. L., Tice, J. A. & Miglioretti, D. L. One versus two breast density measures to predict 5 and 10-year breast cancer risk. Cancer Epidemiol. Biomark. Prev. 24, 889–897 (2015).
Article Google Scholar
Matsuno, R. K., Costantino, J. P., Ziegler, R. G., Anderson, G. L., Li, H., Pee, D. et al. Projecting individualized absolute invasive breast cancer risk in Asian and Pacific Islander American women. J. Natl. Cancer Inst. 103, 951–961 (2011).
Article Google Scholar
Novotny, J., Pecen, L., Petruzelka, L., Svobodnik, A., Dusek, L., Danes, J. et al. Breast cancer risk assessment in the Czech female population—an adjustment of the original Gail model. Breast Cancer Res Treat. 95, 29–35 (2006).
Article Google Scholar
Rosner, B. & Colditz, G. A. Nurses’ health study: log-incidence mathematical model of breast cancer incidence. J. Natl. Cancer Inst. 88, 359–364 (1996).
Article CAS Google Scholar
Rosner, B., Colditz, G. A., Iglehart, J. D. & Hankinson, S. E. Risk prediction models with incomplete data with application to prediction of estrogen receptor-positive breast cancer: prospective data from the Nurses’ Health Study. Breast Cancer Res. 10, R55 (2008).
Article Google Scholar
Shieh, Y., Hu, D., Ma, L., Huntsman, S., Gard, C. C., Leung, J. W. et al. Breast cancer risk prediction using a clinical risk model and polygenic risk score. Breast Cancer Res Treat. 159, 513–525 (2016).
Article Google Scholar
Tice, J. A., Cummings, S. R., Ziv, E. & Kerlikowske, K. Mammographic breast density and the Gail model for breast cancer risk prediction in a screening population. Breast Cancer Res Treat. 94, 115–122 (2005).
Article Google Scholar
Tice, J. A., Cummings, S. R., Smith-Bindman, R., Ichikawa, L., Barlow, W. E. & Kerlikowske, K. Using clinical factors and mammographic breast density to estimate breast cancer risk: development and validation of a new predictive model. Ann. Intern Med. 148, 337–347 (2008).
Article Google Scholar
Tice, J. A., Miglioretti, D. L., Li, C. S., Vachon, C. M., Gard, C. C. & Kerlikowske, K. Breast density and benign breast disease: risk assessment to identify women at high risk of breast cancer. J. Clin. Oncol. 33, 3137–3143 (2015).
Article Google Scholar
Tyrer, J., Duffy, S. W. & Cuzick, J. A breast cancer prediction model incorporating familial and personal risk factors. Stat. Med. 23, 1111–1130 (2004).
Article Google Scholar
Ueda, K., Tsukuma, H., Tanaka, H., Ajiki, W. & Oshima, A. Estimation of individualized probabilities of developing breast cancer for Japanese women. Breast Cancer 10, 54–62 (2003).
Article Google Scholar
Wang, Y., Gao, Y., Battsend, M., Chen, K., Lu, W. & Wang, Y. Development of a risk assessment tool for projecting individualized probabilities of developing breast cancer for Chinese women. Tumour Biol. 35, 10861–10869 (2014).
Article Google Scholar
Brentnall, A. R., Harkness, E. F., Astley, S. M., Donnelly, L. S., Stavrinos, P., Sampson, S. et al. Mammographic density adds accuracy to both the Tyrer-Cuzick and Gail breast cancer risk models in a prospective UK screening cohort. Breast Cancer Res. 17, 147 (2015).
Article Google Scholar
Rosner, B. A., Colditz, G. A., Hankinson, S. E., Sullivan-Halley, J., Lacey, J. V. Jr. & Bernstein, L. Validation of Rosner-Colditz breast cancer incidence model using an independent data set, the California Teachers Study. Breast Cancer Res Treat. 142, 187–202 (2013).
Article CAS Google Scholar
Cintolo-Gonzalez, J. A., Braun, D., Blackford, A. L., Mazzola, E., Acar, A., Plichta, J. K. et al. Breast cancer risk models: a comprehensive overview of existing models, validation, and clinical applications. Breast Cancer Res Treat. 164, 263–284 (2017).
Article Google Scholar
Khera, A. V., Chaffin, M., Aragam, K. G., Haas, M. E., Roselli, C., Choi, S. H. et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219–1224 (2018).
Article CAS Google Scholar
MyPeBS. Randomized comparison of risk-stratified versus standard breast cancer screening in European women aged 40–70 (MyPeBS). 2017. http://www.brumammo.be/documents/docs/bmm-my-pebs-clinical-trial-protocol.pdf.
Esserman, L. J., Study, W. & Athena, I. The WISDOM Study: breaking the deadlock in the breast cancer screening debate. NPJ Breast Cancer 3, 34 (2017).
Article Google Scholar
Sorlie, T., Perou, C. M., Tibshirani, R., Aas, T., Geisler, S., Johnsen, H. et al. Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc. Natl. Acad. Sci. USA. 98, 10869–10874 (2001).
Article CAS Google Scholar
Garcia-Closas, M., Gunsoy, N. B., Chatterjee, N. Combined associations of genetic and environmental risk factors: implications for prevention of breast cancer. J. Natl. Cancer Inst. 2014;106.
Article Google Scholar
Maas, P., Barrdahl, M., Joshi, A. D., Auer, P. L., Gaudet, M. M., Milne, R. L. et al. Breast cancer risk from modifiable and nonmodifiable risk factors among white women in the United States. JAMA Oncol. 2, 1295–1302 (2016).
Article Google Scholar
Castells, X., Tora-Rocamora, I., Posso, M., Roman, M., Vernet-Tomas, M., Rodriguez-Arana, A. et al. Risk of breast cancer in women with false-positive results according to mammographic features. Radiology 280, 379–386 (2016).
Article Google Scholar
Wang, X., Huang, Y., Li, L., Dai, H., Song, F. & Chen, K. Assessment of performance of the Gail model for predicting breast cancer risk: a systematic review and meta-analysis with trial sequential analysis. Breast Cancer Res. 20, 18 (2018).
Article Google Scholar
Antoniou, A. C., Pharoah, P. P., Smith, P. & Easton, D. F. The BOADICEA model of genetic susceptibility to breast and ovarian cancer. Br. J. Cancer 91, 1580–1590 (2004).
Article CAS Google Scholar
Berry, D. A., Parmigiani, G., Sanchez, J., Schildkraut, J. & Winer, E. Probability of carrying a mutation of breast-ovarian cancer gene BRCA1 based on family history. J. Natl. Cancer Inst. 89, 227–238 (1997).
Article CAS Google Scholar

Download references

Acknowledgements

The authors thank Ms. Lorea Galnares-Cordero for her contributions to the design of the search strategy and the initial retrieval of the citations. The authors also thank Ms. Julieta Politi and Mr. José María Montero-Moraga for their contribution in the screening process. Javier Louro is a Ph.D. candidate at the Methodology of Biomedical Research and Public Health program, Universitat Autònoma de Barcelona (UAB), Barcelona, Spain.

Author contributions

J.L., M.P., M.S. and X.C. designed the study, and J.L. and M.P. wrote the manuscript. J.L., M.P. and M.H.B. performed the screening, data abstraction and quality assessment of included studies. M.S., L.D. and M.R. contributed to the analyses and interpreted the data. M.R., M.H.B. and X.C. collaborated in drafting the manuscript and revising it critically for important intellectual content. All authors read and approved the final manuscript.

Author information

Authors and Affiliations

Department of Epidemiology and Evaluation, IMIM (Hospital del Mar Medical Research Institute), Barcelona, Spain
Javier Louro, Margarita Posso, Marta Román, Laia Domingo, Xavier Castells & María Sala
Research Network on Health Services in Chronic Diseases (REDISSEC), Barcelona, Spain
Javier Louro, Margarita Posso, Marta Román, Laia Domingo, Xavier Castells & María Sala
European Higher Education Area (EHEA) Doctoral Programme in Methodology of Biomedical Research and Public Health in Department of Pediatrics, Obstetrics and Gynecology, Preventive Medicine and Public Health, Universitat Autónoma de Barcelona (UAB), Bellaterra, Barcelona, Spain
Javier Louro
MRC/CSO Social and Public Health Sciences Unit, University of Glasgow, Glasgow, UK
Michele Hilton Boon

Authors

Javier Louro
View author publications
You can also search for this author in PubMed Google Scholar
Margarita Posso
View author publications
You can also search for this author in PubMed Google Scholar
Michele Hilton Boon
View author publications
You can also search for this author in PubMed Google Scholar
Marta Román
View author publications
You can also search for this author in PubMed Google Scholar
Laia Domingo
View author publications
You can also search for this author in PubMed Google Scholar
Xavier Castells
View author publications
You can also search for this author in PubMed Google Scholar
María Sala
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Margarita Posso.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics approval and consent to participate

All procedures performed in this study were in accordance with the ethical standards of the ethics committee of Parc de Salut Mar (CEIC Parc de Salut Mar) and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. Neither specific patient consent nor ethics committee’s approval were required because we used published articles that were obtained from open access databases.

Data availability

The datasets analysed during the current study are publicly available from the corresponding author.

Consent for publication

Not applicable.

Funding

This work was partially supported by Agència de Qualitat i Avaluació Sanitàries de Catalunya (AQuAS) and by grants from Instituto de Salud Carlos III FEDER (grant numbers: PI15/00098 and PI17/00047). JL is core funded by the Research Network on Health Services in Chronic Diseases (RD12/0001/0015). MHB is core funded by the UK Medical Research Council (funding code: MC_UU_12017/15) and the Scottish Government Chief Scientist Office (funding code: SPHSU15). None of the funders participated in the design of the study, collection, analysis, or interpretation of data, or in writing the manuscript.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Louro, J., Posso, M., Hilton Boon, M. et al. A systematic review and quality assessment of individualised breast cancer risk prediction models. Br J Cancer 121, 76–85 (2019). https://doi.org/10.1038/s41416-019-0476-8

Download citation

Received: 10 January 2019
Accepted: 25 April 2019
Published: 22 May 2019
Issue Date: 02 July 2019
DOI: https://doi.org/10.1038/s41416-019-0476-8

This article is cited by

Temporal changes in mammographic breast density and breast cancer risk among women with benign breast disease
- Maeve Mullooly
- Shaoqi Fan
- Gretchen L. Gierach
Breast Cancer Research (2024)
Prognosis prediction and risk stratification of breast cancer patients based on a mitochondria-related gene signature
- Yang Wang
- Ding-yuan Wang
- Bai-lin Zhang
Scientific Reports (2024)
“I Thought Cancer was a Tobacco Issue”: Perspectives of Veterans with and without HIV on Cancer and Other Health Risks Associated with Alcohol and Tobacco/Nicotine Use
- Elsa S. Briggs
- Rachel M. Thomas
- Emily C. Williams
AIDS and Behavior (2024)
Understanding the contribution of lifestyle in breast cancer risk prediction: a systematic review of models applicable to Europe
- Elly Mertens
- Antonio Barrenechea-Pulache
- José L. Peñalvo
BMC Cancer (2023)
Breast density analysis of digital breast tomosynthesis
- John Heine
- Erin E. E. Fowler
- Shelley S. Tworoger
Scientific Reports (2023)

Subjects

Abstract

Background

Methods

Results

Conclusion

Similar content being viewed by others

Background

Methods

Data sources and searches

Study selection

Data extraction and quality assessment

Data synthesis and analysis

Results

Study inclusion

Characteristics of the included studies

Discriminatory accuracy

Calibration accuracy

Quality assessment

Internal and external validation

Bias due to the study design

Limitations of data inputs

Appropriateness of the model analysis

Reporting bias

Discussion

Summary of main results

Agreements and disagreements with other reviews

Applicability and completeness of evidence

Quality of the evidence

Potential biases in the review process

Conclusions

References

Acknowledgements

Author contributions

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Ethics approval and consent to participate

Data availability

Consent for publication

Funding

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links