Machine learning-based lifetime breast cancer risk reclassification compared with the BOADICEA model: impact on screening recommendations

Ming, Chang; Viassolo, Valeria; Probst-Hensch, Nicole; Dinov, Ivo D.; Chappuis, Pierre O.; Katapodi, Maria C.

doi:10.1038/s41416-020-0937-0

Download PDF

Article
Open access
Published: 22 June 2020

Epidemiology

Machine learning-based lifetime breast cancer risk reclassification compared with the BOADICEA model: impact on screening recommendations

Chang Ming ORCID: orcid.org/0000-0002-2817-3865¹,
Valeria Viassolo²,
Nicole Probst-Hensch³,
Ivo D. Dinov^4,5,6,7,
Pierre O. Chappuis^2,8^na1 &
…
Maria C. Katapodi^1,7^na1

British Journal of Cancer volume 123, pages 860–867 (2020)Cite this article

4733 Accesses
26 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Background

The clinical utility of machine-learning (ML) algorithms for breast cancer risk prediction and screening practices is unknown. We compared classification of lifetime breast cancer risk based on ML and the BOADICEA model. We explored the differences in risk classification and their clinical impact on screening practices.

Methods

We used three different ML algorithms and the BOADICEA model to estimate lifetime breast cancer risk in a sample of 112,587 individuals from 2481 families from the Oncogenetic Unit, Geneva University Hospitals. Performance of algorithms was evaluated using the area under the receiver operating characteristic (AU-ROC) curve. Risk reclassification was compared for 36,146 breast cancer-free women of ages 20–80. The impact on recommendations for mammography surveillance was based on the Swiss Surveillance Protocol.

Results

The predictive accuracy of ML-based algorithms (0.843 ≤ AU-ROC ≤ 0.889) was superior to BOADICEA (AU-ROC = 0.639) and reclassified 35.3% of women in different risk categories. The largest reclassification (20.8%) was observed in women characterised as ‘near population’ risk by BOADICEA. Reclassification had the largest impact on screening practices of women younger than 50.

Conclusion

ML-based reclassification of lifetime breast cancer risk occurred in approximately one in three women. Reclassification is important for younger women because it impacts clinical decision- making for the initiation of screening.

A systematic review and quality assessment of individualised breast cancer risk prediction models

Article Open access 22 May 2019

The current status of risk-stratified breast screening

Article Open access 26 October 2021

Personalized early detection and prevention of breast cancer: ENVISION consensus statement

Article Open access 18 June 2020

Background

Breast cancer is the most common malignancy affecting women worldwide and the fifth leading cause of cancer death.¹ In Switzerland, about 6000 women are diagnosed with breast cancer each year, and more than 1350 die from the disease.² Most established risk factors, i.e., age, family history, genetic predisposition, hormone and reproductive factors and history of benign breast disease, are not applicable for primary prevention to reduce breast cancer incidence and mortality.³ Survival of breast cancer patients in the past few decades has mostly improved through screening, especially if tumours are diagnosed at early stages, and through advances in therapeutic approaches.^3,4,5 Breast cancer remains a public health problem, and early detection is currently the best option to reduce its impact.

Breast cancer screening with biennial mammograms for women 50–74-years old has been recommended by the U.S. Preventive Services Task Force since 2009.^6,7 In Europe, nationally organised screening programmes began around 1985 in the Nordic countries and the United Kingdom, followed by other European countries.^8,9 Most of these programmes target women from 50 to 69 years old for screening.¹⁰ In 1995, the Swiss Federal Office of Public Health and the Swiss Cancer League adopted a national programme recommending biennial mammography screening for women over 50 years old.^2,11 In 2013, the Swiss Cancer League adopted the UK NICE Clinical Guidelines, which recommend screening with mammography and MRI based on women’s risk classification. The guidelines classify women into moderate (17% ≤ lifetime risk <30%) or high (lifetime risk ≥30%) breast cancer risk calculated with the BOADICEA model based on different scenarios of family history.^12,13

Age over 50 years is the sole risk factor considered for entering a population-based screening programme.¹⁴ However, about 25% of all breast cancers are diagnosed in younger women.^15,16 Moreover, mammography is less effective as a screening tool for younger women, who are more likely to have dense breast tissue, compromising the efficiency of routine mammograms in this age group. This contributes to diagnostic delays and increased morbidity and mortality.^16,17 In the era of personalised medicine, a screening strategy based on individual breast cancer risk may improve the benefit–harm ratio of mammography, and increase the efficiency of screening programmes.^18,19 Many medical societies and professional groups proposed that risk-based screening would be more effective, less morbid and more cost-effective.^{3,19,20,21,22,23,24}

Although many models are used to predict breast cancer risk, such as the Breast Cancer Risk Assessment Tool (BCRAT, also referred as the Gail model), the International Breast Intervention Study (IBIS) model, the Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm (BOADICEA model),^25,26,27 no consistent model has been incorporated into routine clinical practice and/or screening programmes due to limited discriminatory accuracy and applicability. The discriminatory ability, area under the receiver operating characteristic (AU-ROC) curve, of these models is between 0.53 and 0.64.^{26,28,29,30,31} A comprehensive risk prediction model with an improved discriminatory power to classify women into clinically meaningful risk groups will enable targeting high-risk women, while reducing interventions in those at low risk.

Machine-learning (ML) algorithms offer an alternative approach to standard prediction modelling that may address current limitations and improve the accuracy of breast cancer prediction models.^32,33 A series of ML techniques, including our own work, have been developed and used in breast cancer prediction and prognosis, demonstrating that the application of ML methods could improve the prediction accuracy of cancer susceptibility, recurrence and survival models.^{34,35,36,37,38,39} Previous studies presented the discriminatory accuracy, sensitivity, specificity and calibration performance of different ML algorithms. However, clinical utility, in terms of potential clinical consequences of using new ML prediction models, is rarely examined. The objective of the current study is to assess the impact of using ML-based breast cancer risk prediction on screening practices. Using data from a large clinical population, we quantified performance measure and reclassification of lifetime breast cancer risk generated from ML algorithms and from the BOADICEA model. We also examined the clinical impact of reclassification of breast cancer risk on screening practices based on the Swiss Surveillance Protocol.¹³

Methods

Swiss clinic-based retrospective data

The Oncogenetic Unit at the Geneva University Hospitals has been offering genetic counselling and testing for hereditary cancer syndromes since 1994 to patients and asymptomatic individuals concerned with their family history. The common reasons for genetic consultation are familial aggregation of breast or colorectal cancer or suspicion of hereditary cancer syndromes, mainly due to breast, ovarian or colon cancer. For each individual seen in consultation, demographic, personal and family history, previous genetic test results and a detailed family pedigree are collected and recorded with the ‘Progeny' software.⁴⁰ Data used in this study were collected as part of routine medical records. The Regional Research Ethics Committee at the University Hospitals of Geneva has approved the data collection and management processes. Informed consent was obtained from all participants included in the study before genetic testing.

For the purposes of this study, information regarding pathology reports, archived tumour tissue and cancer treatment, was extracted from medical records for cancer patients and affected relatives, whenever possible. Data from genetic consultation records and Progeny files were extracted with R packages ‘tm’ and ‘gdata’.⁴¹ Extracted data were suitable for risk calculations with the BOADICEA model for multiple female members from each family. There were about 13% missing values. BRCA1/BRCA2 status, oestrogen receptor and progesterone receptor status contributed 11%. In addition to ‘positive' and ‘negative', missing values for BRCA pathogenic variants and hormone receptor testing were characterised as ‘unknown' in subsequent analyses. This approach is also consistent with the flexibility of the BOADICEA model in handling missing information.

BOADICEA model classification

Lifetime risk predictions were generated with the web-based batch processing from the BOADICEA web application (version 3.0) using data from 2481 families with 112,587 family members. The lifetime breast cancer risk for each woman in every family was calculated using data from all other family members, and by assigning every female family member once as the targeted woman for risk calculation.

ML risk classifications

We generated breast cancer lifetime risk predictions for all female members within each family.

Based on previous reports of method reliability, effectiveness and performance in identifying, tracking and exploiting salient features in similar samples with the same data structures, we selected three ML algorithms, i.e., Markov Chain Monte Carlo generalised linear mixed model (MCMC GLMM),⁴² adaptive boosting (ADA) and random forest (RF).^{32,34,42,43,44,45} The input for the ML algorithms used identical risk factors as the ones included in the BOADICEA model in order to have fair comparisons among the different risk prediction models. The variables included in each comparison are presented in Supplementary Table 1.

In our supervised classification, we rebalanced the breast cancer patients and cancer-free controls to reduce potential bias with the R packages ‘unbalanced' (Racing for Unbalanced Methods Selection) and ‘SMOTE' (Synthetic Minority Over-sampling TEchnique).^46,47 SMOTE implements known ML techniques to adaptively select the most appropriate strategy for a given unbalanced task. To ensure the reliability of ML predictions and the consistency of the forecasts, we used 10-fold cross-validation with 20 repetitions. This strategy provides a powerful preventive measure against model overfitting.^48,49,50

Comparisons of performance measure and classification

BOADICEA cannot be applied for females older than 80 years, for males and for deceased individuals. Thus, we excluded all predictions generated for those individuals when we compared the performance of ML algorithms with the BOADICEA model. The performance of BOADICEA was evaluated from n = 45,110 women using the AU-ROC, while the performance of ML techniques is presented with the mean AU-ROC from 10-fold cross-validations.

According to the Swiss Surveillance Protocol, we applied the following cut-offs for lifetime breast cancer risk: <17% as near-population risk, ≥17% and < 30% as moderate risk and ≥30% as high-risk group. We excluded women who were under 20-years old or had been diagnosed with breast cancer to be consistent with the clinical utility of the protocol. We estimated differences in breast cancer risk classification using the BOADICEA model and the best-performing ML algorithm, based on the data from n = 36,146 breast cancer-free women.

Statistical analyses

Frequencies, percentages, means and standard deviations were used to describe the demographics and clinical characteristics of 36,146 breast cancer-free women. We present classifications by age and risk categories using the BOADICEA model as the reference standard. Differences in classification for mammography surveillance according to the Swiss Surveillance Protocol were calculated for the moderate- and high-risk groups.

Results

A consort flow diagram (Fig. 1) presents sample acquisition, prediction, classification and surveillance status, and the overall process of methodology and materials.

Model performance

The mean age of the 45,110 women was 49.82 (±11.02) years old. There were 4911 breast cancer patients with average age onset at 51.76 (±9.79) years old. Among them, 554 had a second breast cancer diagnosis. There were 119 cases with first-ductal carcinoma in situ (DCIS). Table 1 presents the performance comparison of the three ML algorithms compared with the BOADICEA model. Using the same risk factors, the accuracy of ML techniques was superior to the BOADICEA model for the Swiss clinic-based samples. Predictive accuracy reached 88.9% using ADA, 85.1% using MCMC GLMM and 84.3% using RF versus 63.9% using the BOADICEA model, showing an approximately 20–25% increase in accuracy. Figure 2 presents the ROC curves that visualise the accuracy improvement between the BOADICEA model and ADA, which was the best-performing ML approach.

Table 1 Performance by area under the receiver operating characteristic curve (AU-ROC) of the machine-learning (ML) algorithms predicting breast cancer lifetime risk derived from 10-fold cross-validations compared with the BOADICEA model.

Full size table

Breast cancer-free women

Table 2 presents demographic and clinical characteristics of the Swiss clinic-based sample. Among n = 36,146 breast cancer-free women, 2617 (7.24%) had a diagnosis of another type of cancer. In the total sample, only few breast cancer-free women (462; 1.3%) were tested for BRCA1 and/or BRCA2 germline pathogenic variants, including both complete and targeted testing. Most of these women had a targeted genetic testing, i.e., the search for a pathogenic variant previously identified in the family, since consultations were limited to situations that are highly suggestive for a hereditary syndrome and, whenever possible, genetic testing was offered first to breast cancer patients belonging to the family.

Table 2 Characteristics of the breast cancer-free female cohort.

Full size table

Classification comparison

When using the BOADICEA model as the reference standard, and based on the lifetime breast cancer risk cut-offs from the Swiss clinical guidelines, 58.8% of all samples were categorised as near-population risk, 32.3% as moderate risk and 8.8% as high risk (Table 3). Compared with the BOADICEA model, ML-ADA classified 7968 women into the high-risk group, which is an increase of 4790 samples. ML-ADA also classified 16,465 women into near-population risk group, which is a decrease of 4818 samples compared with the BOADICEA model. Concordance between the BOADICEA model and ML-ADA was ~60% in the near-population and the moderate-risk groups, while it was 87.95% in the high-risk group. ML-ADA classified 9595 women (26.55%) to a higher- risk group and 3174 (8.78%) women to a lower-risk group. When we combined Table 3 with the Swiss Surveillance Protocol, we identified an additional 2469 (14.83%) women younger than 50 who needed early-onset screening.

Table 3 Comparisons of lifetime risk classification between ML-Adapt Boosting (ML-ADA) algorithm and the BOADICEA model (reference standard) for the breast cancer-free cohort.

Full size table

Clinical impact on mammography surveillance

Table 4 presents the overall number differences in mammography surveillance when applying the BOADICEA and ML-ADA models, and based on the Swiss Surveillance Protocol. For women 40–59 years old, ML-ADA grouped an additional 184 women in the moderate-risk group, suggesting annual mammography surveillance. ML-ADA grouped an additional 4,790 women in the high-risk group, among which 2535 women were between 30 and 59 years old, suggesting annual mammography, and 1865 women older than 60 years, suggesting biennial mammography.

Table 4 Clinical impact on mammography screening based on Swiss Surveillance Protocol.

Full size table

Discussion

We used a novel approach to identify individuals at increased risk of breast cancer by using ML algorithms. We analysed family history, cancer pathology and clinic–demographic data from a large retrospective dataset of n = 112,587 individuals from 2481 families. We examined whether ML algorithms could improve predictive accuracy for breast cancer compared with the BOADICEA model. We also quantified the differences in risk classification and the impact on screening between these two techniques based on the Swiss Surveillance Protocol. Compared with the BOADICEA model, all three ML techniques were superior at distinguishing cancer cases from cancer-free women, and improved the predictive accuracy by 20–25% using exactly the same risk factors as the BOADICEA model. These findings clearly demonstrate the inherently better predictive ability of ML algorithms.

About one in four women were classified into a higher-risk group compared with the BOADICEA model. Given that ML approaches achieved much higher discriminatory accuracy, some women’s breast cancer risk would have been underestimated when using the BOADICEA model, while one in eleven women’s risk would have been overestimated. When taking into account the Swiss Surveillance Protocol, the major discordance for mammography surveillance was observed for the high-risk group. About 10–15% women 30–80 years old were underscreened when using the BOADICEA model compared with ML-ADA.

Consistent with other national screening programmes, the Swiss national breast cancer screening programme is based on age alone, starting at 50 years old. This approach will miss some breast cancers in moderate- and high-risk women 40–49 years old and in high-risk women 30–49 years old. The development and implementation of risk-based breast cancer control and prevention strategies have important public health implications. Common risk estimation models, like the BOADICEA model, are currently used in clinical practice to provide evidence for adjustment of screening, i.e., more frequent mammographic screening and initiation at a younger age. However, low discriminatory accuracy has greatly limited the clinical utility of these models. At the population level, ML algorithms have reached high sensitivity and can be implemented to identify high-risk women who should initiate earlier breast cancer screening. At the individual level, the decision for preventive interventions, such as prophylactic mastectomy or use of tamoxifen as a risk-reducing agent, is influenced by a woman’s individualised breast cancer risk estimate. When using ML, one in three women were classified into different risk categories compared with the BOADICEA model, which may lead to different preventive interventions.

Given that breast cancer screening guidelines were established after the release of several commonly used risk prediction models, including BOADICEA, the guideline cut-offs (risk categorisation) have been greatly influenced by these models. According to several validation studies of the BOADICEA model, about 80–90% of women were classified as having a lifetime breast cancer risk between 5 and 25% (near population or moderate risk).²⁵ This risk distribution was also observed in our study. However, using a 17% cut-off within a non-disperse risk distribution may have resulted in low discriminatory accuracy for women around that cut-off (17% or ‘near population risk'). When we reclassified women with ML algorithms, applying cut-offs of 17% and 30% resulted in shifting relatively large proportions of women between different risk groups. This indicates that for ML algorithms, categorisation of different risk groups (i.e., near population, moderate or high risk) should be probably based on different cut-offs, based on a clinically meaningful decision of their sensitivity and specificity.⁵¹

There are several barriers for using risk prediction models in a wide variety of settings. First, each risk prediction model uses different risk factors. The panel of risk factors used in the development of each model limits its applicability and validity in broader segments of the population. ML models can be applied in medical consultation contexts where similar data inputs were collected. Currently, the most feasible way of following the Swiss Surveillance Protocol is through consultation with a medical specialist. In this context, clinical decisions about risk management options are likely influenced by risk calculations from such prediction models. Secondly, existing infrastructures for collection and assessment of clinical data limit the development of risk prediction models and their generalisability in broader segments of the population. ML approaches have the potential to achieve better accuracy, and can incorporate different types of information, including mammographic images, family history, germline genetic data and clinical factors. However, currently there are no comprehensive systems that incorporate data from such diverse sources, e.g., screening programmes, medical consultations and medical records. In order to develop a risk prediction model that can be used to enhance national screening programmes, the usefulness of accessible risk factors from screening practice, e.g., breast density and previous benign breast disease, should be assessed. Based on the predictive ability of each risk factor, and the feasibility of collecting relevant data in the screening setting, a parsimony panel of risk factors would be applied in ML modelling to develop a comprehensive model that supports effective clinical decision-making. However, limited resources have been invested into this promising new analytic approach.

Strengths and limitations

Our results are reliable because we used a limited number of well-established breast cancer risk factors without feature selection and relatively non-complex ML models, which helps mitigate the ‘black-box' nature of ML algorithms. They are also reliable due to the large sample size, completeness and high accuracies of the data. Our models have been evaluated for internal validity, since we have reproduced similar accuracy performance in this study compared with our previous study.³⁴ They have been partially evaluated for external validity using internal statistical cross-validation, a process where each fold iteration relies on separate and independent training and testing datasets. For fully assessing the external validity, we need to evaluate prospective samples from populations intrinsically different from the development sample, in respect to location, time or methods/criteria used for data collection, which is a gradual process commonly applied to prediction models.^26,30 Current screening guidelines already incorporate risk estimates from existing prediction tools based on inputs from medical consultation contexts. Thus, it is important to study the potential clinical utility of ML as a promising alternative analytic approach, even with limited information from screening practice. Finally, breast cancer surveillance guidelines define ‘population level risk' as having a lifetime risk <17% calculated from the BOADICEA model. This risk estimate does not necessarily mean ‘low' risk in the general population due to potential misclassifications. In our reclassification results, the BOADICEA model classified 21,293 (58.9%) samples into the population-level risk. Thus, our sample is ‘suitable' for the comparison and covers sufficiently women with a wide range of risk estimates based on current recommendations.

One limitation of the study is that the performance of our approaches was evaluated with k-Fold cross-validation process in the same dataset, which could result in an optimistic model performance. However, the k-Fold cross-validation process generally results in a less biased or less optimistic estimate of the model skill compared with other commonly used methods, e.g., simple train/test split.⁵² Moreover, we used retrospective cross-sectional data, which limit the ability of ML algorithms to generate 5- or 10-year risk estimates. Analysing prospective longitudinal data with ML algorithms may reveal additional implications for clinical decision support.

In summary, we calculated lifetime breast cancer risk with ML algorithms and compared their discriminatory accuracy, classification and impact on mammography screening with the BOADICEA model according to the Swiss Surveillance Protocol. Differences in classification and impact on breast cancer surveillance were considerable. The ability of our model to detect individuals with high suspicion of breast cancer, should be further evaluated with other datasets and prospective samples. Future studies can enhance the performance of ML algorithms through incorporation of additional clinical data, such as lifestyle, medications, breast images, exact histology of benign breast diseases and co-morbidities.^36,37,53 Future studies can also include resource rearrangement involving health policymakers and other stakeholders, in terms of cost-effectiveness and adaptability in different clinical settings. A prospective clinical trial would provide more functional and extended evaluation of the performance of ML algorithms, and findings can be compared with ongoing personalised breast cancer screening trials like ‘My PeBS' and ‘WISDOM'.^54,55

References

Ferlay, J., Soerjomataram, I., Dikshit, R., Eser, S., Mathers, C., Rebelo, M. et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int. J. cancer 136, E359–E386 (2015).
CAS PubMed Google Scholar
Bouchardy Magnin, C., Pury, P., Lorez, M., Clough-Gorr, K. & Bordoni, A. Trends in breast cancer survival in Switzerland. Bull. suisse du Cancer 4, 326–328 (2011).
Eccles, S. A., Aboagye, E. O., Ali, S., Anderson, A. S., Armes, J., Berditchevski, F. et al. Critical research gaps and translational priorities for the successful prevention and treatment of breast cancer. Breast Cancer Res. 15, R92 (2013).
PubMed PubMed Central Google Scholar
Jemal, A., Ward, E. M., Johnson, C. J., Cronin, K. A., Ma, J., Ryerson, B. et al. Annual report to the nation on the status of cancer, 1975-2014, featuring survival. J. Natl Cancer Inst. 109, djx030 (2017).
PubMed Central Google Scholar
Jones, T., Duquette, D., Underhill, M., Ming, C., Mendelsohn-Victor, K. E., Anderson, B. et al. Surveillance for cancer recurrence in long-term young breast cancer survivors randomly selected from a statewide cancer registry. Breast Cancer Res Treat. 169, 141–152 (2018).
PubMed Google Scholar
Nelson, H. D., Tyne, K., Naik, A., Bougatsos, C., Chan, B. K. & Humphrey, L. Screening for breast cancer: an update for the U.S. Preventive Services Task Force. Ann. Intern Med. 151, 727–737 (2009). w237–42.
PubMed PubMed Central Google Scholar
Qin, X., Tangka, F. K., Guy, G. P. Jr. & Howard, D. H. Mammography rates after the 2009 revision to the United States Preventive Services Task Force breast cancer screening recommendation. Cancer Causes Control 28, 41–48 (2017).
CAS PubMed Google Scholar
Shapiro, S., Coleman, E. A., Broeders, M., Codd, M., de Koning, H., Fracheboud, J. et al. Breast cancer screening programmes in 22 countries: current policies, administration and guidelines. International Breast Cancer Screening Network (IBSN) and the European Network of Pilot Projects for Breast Cancer Screening. Int. J. Epidemiol. 27, 735–742 (1998).
CAS PubMed Google Scholar
Sardanelli, F., Aase, H. S., Alvarez, M., Azavedo, E., Baarslag, H. J., Balleyguier, C. et al. Position paper on screening for breast cancer by the European Society of Breast Imaging (EUSOBI) and 30 national breast radiology bodies from Austria, Belgium, Bosnia and Herzegovina, Bulgaria, Croatia, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Italy, Israel, Lithuania, Moldova, The Netherlands, Norway, Poland, Portugal, Romania, Serbia, Slovakia, Spain, Sweden, Switzerland and Turkey. Eur. Radiol. 27, 2737–2743 (2017).
PubMed Google Scholar
Lauby-Secretan, B., Scoccianti, C., Loomis, D., Benbrahim-Tallaa, L., Bouvard, V., Bianchini, F. et al. Breast-cancer screening— viewpoint of the IARC Working Group. N. Engl. J. Med. 372, 2353–2358 (2015).
CAS PubMed Google Scholar
Arie, S. Switzerland debates dismantling its breast cancer screening programme. BMJ: B. Med. J. 348, g1625 (2014).
Google Scholar
Excellence NNIfHaC. Familial breast cancer: classification, care and managing breast cancer and related risks in people with a family history of breast cancer 2019. Available from: https://www.nice.org.uk/guidance/cg164 (2019).
Krebsliga Schweiz. Increased risk of breast cancer due to family history. Bundesamt für Gesundheit BAG. Available from: https://www.bag.admin.ch/bag/fr/home/gesetze-und-bewilligungen/gesetzgebung/gesetzgebung-versicherungen/gesetzgebung-krankenversicherung/kvg/referenzdokumente-zur-klv-und-deren-anhaenge.html (2015).
Mainiero, M. B., Moy, L., Baron, P., Didwania, A. D., diFlorio, R. M., Green, E. D. et al. ACR appropriateness criteria((R)) breast cancer screening. J. Am. Coll. Radiol. 14, S383–S390 (2017).
PubMed Google Scholar
King, M. C., Levy-Lahad, E. & Lahad, A. Population-based screening for BRCA1 and BRCA2: 2014 Lasker Award. J. Am. Med. Assoc. 312, 1091–1092 (2014).
CAS Google Scholar
Azim, H. A. Jr. & Partridge, A. H. Biology of breast cancer in young women. Breast Cancer Res. 16, 427 (2014).
PubMed PubMed Central Google Scholar
Rosenberg, S. M., Newman, L. A. & Partridge, A. H. Breast cancer in young women: rare disease or public health problem? JAMA Oncol. 1, 877–878 (2015).
PubMed Google Scholar
Autier, P. & Boniol, M. Mammography screening: a major issue in medicine. Eur. J. Cancer 90, 34–62 (2018).
PubMed Google Scholar
van Ravesteyn, N. T., Miglioretti, D. L., Stout, N. K., Lee, S. J., Schechter, C. B., Buist, D. S. et al. Tipping the balance of benefits and harms to favor screening mammography starting at age 40 years: a comparative modeling study of risk. Ann. Intern. Med. 156, 609–617 (2012).
PubMed PubMed Central Google Scholar
Maas, P., Barrdahl, M., Joshi, A. D., Auer, P. L., Gaudet, M. M., Milne, R. L. et al. Breast cancer risk from modifiable and nonmodifiable risk factors among white women in the United States. JAMA Oncol. 2, 1295–1302 (2016).
PubMed PubMed Central Google Scholar
Mandelblatt, J. S., Cronin, K. A., Bailey, S., Berry, D. A., de Koning, H. J., Draisma, G. et al. Effects of mammography screening under different screening schedules: model estimates of potential benefits and harms. Ann. Intern. Med. 151, 738–747 (2009).
PubMed PubMed Central Google Scholar
Pashayan, N., Duffy, S. W., Chowdhury, S., Dent, T., Burton, H., Neal, D. E. et al. Polygenic susceptibility to prostate and breast cancer: implications for personalised screening. Br. J. Cancer 104, 1656–1663 (2011).
CAS PubMed PubMed Central Google Scholar
Schousboe, J. T., Kerlikowske, K., Loh, A. & Cummings, S. R. Personalizing mammography by breast density and other risk factors for breast cancer: analysis of health benefits and cost-effectiveness. Ann. Intern. Med. 155, 10–20 (2011).
PubMed PubMed Central Google Scholar
Vilaprinyo, E., Forne, C., Carles, M., Sala, M., Pla, R., Castells, X. et al. Cost-effectiveness and harm-benefit analyses of risk-based screening strategies for breast cancer. PLoS ONE 9, e86858 (2014).
PubMed PubMed Central Google Scholar
Lee, A., Mavaddat, N., Wilcox, A. N., Cunningham, A. P., Carver, T., Hartley, S. et al. BOADICEA: a comprehensive breast cancer risk prediction modelincorporating genetic and nongenetic risk factors. Genet. Med. 21, 1708–1718 (2019).
PubMed PubMed Central Google Scholar
Wang, X., Huang, Y., Li, L., Dai, H., Song, F. & Chen, K. Assessment of performance of the Gail model for predicting breast cancer risk: a systematic review and meta-analysis with trial sequential analysis. Breast Cancer Res. 20, 18 (2018).
PubMed PubMed Central Google Scholar
Tyrer, J., Duffy, S. W. & Cuzick, J. A breast cancer prediction model incorporating familial and personal risk factors. Stat. Med. 23, 1111–1130 (2004).
PubMed Google Scholar
Amir, E., Evans, D. G., Shenton, A., Lalloo, F., Moran, A., Boggis, C. et al. Evaluation of breast cancer risk assessment packages in the family history evaluation and screening programme. J. Med. Genet. 40, 807–814 (2003).
CAS PubMed PubMed Central Google Scholar
Brentnall, A. R., Harkness, E. F., Astley, S. M., Donnelly, L. S., Stavrinos, P., Sampson, S. et al. Mammographic density adds accuracy to both the Tyrer-Cuzick and Gail breast cancer risk models in a prospective UK screening cohort. Breast Cancer Res. 17, 147 (2015).
PubMed PubMed Central Google Scholar
Meads, C., Ahmed, I. & Riley, R. D. A systematic review of breast cancer incidence risk prediction models with meta-analysis of their performance. Breast Cancer Res. Treat. 132, 365–377 (2012).
PubMed Google Scholar
Tice, J. A., Cummings, S. R., Smith-Bindman, R., Ichikawa, L., Barlow, W. E. & Kerlikowske, K. Using clinical factors and mammographic breast density to estimate breast cancer risk: development and validation of a new predictive model. Ann. Intern. Med. 148, 337–347 (2008).
PubMed PubMed Central Google Scholar
Obermeyer, Z. & Emanuel, E. J. Predicting the future—big data, machine learning, and clinical medicine. N. Engl. J. Med. 375, 1216–1219 (2016).
PubMed PubMed Central Google Scholar
Dreiseitl, S. & Ohno-Machado, L. Logistic regression and artificial neural network classification models: a methodology review. J. Biomed. Inf. 35, 352–359 (2002).
Google Scholar
Ming, C., Viassolo, V., Probst-Hensch, N., Chappuis, P. O., Dinov, I. D. & Katapodi, M. C. Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models. Breast Cancer Res. 21, 75 (2019).
PubMed PubMed Central Google Scholar
Chen, H. C., Kodell, R. L., Cheng, K. F. & Chen, J. J. Assessment of performance of survival prediction models for cancer prognosis. BMC Med. Res. Methodol. 12, 102 (2012).
PubMed PubMed Central Google Scholar
Kourou, K., Exarchos, T. P., Exarchos, K. P., Karamouzis, M. V. & Fotiadis, D. I. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 13, 8–17 (2015).
CAS PubMed Google Scholar
Reinbolt, R. E., Sonis, S., Timmers, C. D., Fernandez-Martinez, J. L., Cernea, A., de Andres-Galiana, E. J. et al. Genomic risk prediction of aromatase inhibitor-related arthralgia in patients with breast cancer using a novel machine-learning algorithm. Cancer Med. 7, 240–253 (2018).
CAS PubMed Google Scholar
Vanneschi, L., Farinaccio, A., Mauri, G., Antoniotti, M., Provero, P. & Giacobini, M. A comparison of machine learning techniques for survival prediction in breast cancer. BioData Min. 4, 12 (2011).
PubMed PubMed Central Google Scholar
Heidari, M., Khuzani, A. Z., Hollingsworth, A. B., Danala, G., Mirniaharikandehei, S., Qiu, Y. et al. Prediction of breast cancer risk using a machine learning approach embedded with a locality preserving projection algorithm. Phys. Med. Biol. 63, 035020 (2018).
PubMed PubMed Central Google Scholar
Progeny Software LLC, Delray Beach, FL, www.progenygenetics.com.
Team RC. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2017).
Dinov, I. D. Data Science and Predictive Analytics: Biomedical and Health Applications Using R (Springer, 2018).
Murdoch, T. B. & Detsky, A. S. The inevitable application of big data to health care. J. Am. Med. Assoc. 309, 1351–1352 (2013).
CAS Google Scholar
Toga, A. W. & Dinov, I. D. Sharing big biomedical data. J. Big Data 2, 7 (2015).
PubMed PubMed Central Google Scholar
Dinov, I. D., Heavner, B., Tang, M., Glusman, G., Chard, K., Darcy, M. et al. Predictive big data analytics: a study of Parkinson’s disease using large, complex, heterogeneous, incongruent, multi-source and incomplete observations. PLoS ONE 11, e0157077 (2016).
PubMed PubMed Central Google Scholar
Dal Pozzolo A, Caelen O, Waterschoot S, Bontempi G, editors. Racing for Unbalanced Methods Selection. Berlin, Heidelberg: Springer Berlin Heidelberg; 2013.
Kim, Z., Min, S. Y., Yoon, C. S., Jung, K. W., Ko, B. S., Kang, E. et al. The Basic Facts of Korean Breast Cancer in 2012: results from a Nationwide Survey and Breast Cancer Registry Database. J. Breast Cancer 18, 103–111 (2015).
PubMed PubMed Central Google Scholar
Korbinian Strimmer. (2015). crossval: Generic Functions for Cross Validation. R package version 1.0.3. https://CRAN.R-project.org/package=crossval.
Kohavi R. A study of cross-validation and bootstrap for accuracy estimation and model selection. Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2; Montreal, Quebec, Canada: Morgan Kaufmann Publishers Inc.; 1995. p. 1137–1143.
Ng AY. Preventing “Overfitting” of Cross-Validation Data. Proceedings of the Fourteenth International Conference on Machine Learning: Morgan Kaufmann Publishers Inc.; 1997. p. 245–253.
Brinton, J. T., Hendrick, R. E., Ringham, B. M., Kriege, M. & Glueck, D. H. Improving the diagnostic accuracy of a stratified screening strategy by identifying the optimal risk cutoff. Cancer Causes Control 30, 1145–1155 (2019).
PubMed PubMed Central Google Scholar
Tabe-Bordbar, S., Emad, A., Zhao, S. D. & Sinha, S. A closer look at cross-validation for assessing the accuracy of gene regulatory networks and models. Sci. Rep. 8, 6620 (2018).
PubMed PubMed Central Google Scholar
O’Neill, S. C., Leventhal, K. G., Scarles, M., Evans, C. N., Makariou, E., Pien, E. et al. Mammographic breast density as a risk factor for breast cancer: awareness in a recently screened clinical sample. Women’s Health Issues 24, e321–e326 (2014).
PubMed PubMed Central Google Scholar
Burrion, J. B. Breast cancer screening: present situation and prospects. Rev. Med. Brux. 39, 406–409 (2018).
CAS PubMed Google Scholar
Eklund, M., Broglio, K., Yau, C., Connor, J. T., Stover Fiscalini, A. & Esserman, L. J. The WISDOM personalized breast cancer screening trial: simulation study to assess potential bias and analytic approaches. JNCI Cancer Spectr. 2, pky067 (2018).
PubMed Google Scholar

Download references

Author information

These authors contributed equally: Pierre O. Chappuis, Maria C. Katapodi

Authors and Affiliations

Department of Clinical Research, Faculty of Medicine, University of Basel, Basel, Switzerland
Chang Ming & Maria C. Katapodi
Oncogenetics and Cancer Prevention, Geneva University Hospitals, Geneva, Switzerland
Valeria Viassolo & Pierre O. Chappuis
Swiss Tropical and Public Health Institute, University of Basel, Basel, Switzerland
Nicole Probst-Hensch
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
Ivo D. Dinov
Michigan Institute for Data Science, University of Michigan, Ann Arbor, MI, USA
Ivo D. Dinov
Statistics Online Computational resource, University of Michigan, Ann Arbor, MI, USA
Ivo D. Dinov
University of Michigan School of Nursing, Ann Arbor, MI, USA
Ivo D. Dinov & Maria C. Katapodi
Genetic Medicine, Geneva University Hospitals, Geneva, Switzerland
Pierre O. Chappuis

Authors

Chang Ming
View author publications
You can also search for this author in PubMed Google Scholar
Valeria Viassolo
View author publications
You can also search for this author in PubMed Google Scholar
Nicole Probst-Hensch
View author publications
You can also search for this author in PubMed Google Scholar
Ivo D. Dinov
View author publications
You can also search for this author in PubMed Google Scholar
Pierre O. Chappuis
View author publications
You can also search for this author in PubMed Google Scholar
Maria C. Katapodi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.M. and M.K. prepared the initial draft of the paper. C.M., V.V. and P.C. collected the data. C.M. and I.D. analysed the data. C.M., P.C. and M.K. did the conceptualisation. C.M., V.V., N.P., P.C., I.D. and M.K. edited the paper. All authors read and approved the final paper.

Corresponding author

Correspondence to Chang Ming.

Ethics declarations

Ethics approval and consent to participate

The study was carried out in accordance with the Declaration of Helsinki. Data used in this study were collected as part of routine medical records. The Regional Research Ethics Committee at the University Hospitals of Geneva has approved the data collection and management processes. Informed consent was obtained from all participants included in the study before genetic testing.

Consent to publish

Not applicable.

Data availability

The datasets used and analysed during the current study are available from the corresponding author upon reasonable request and gaining signed clinical data transfer agreement from Geneva University Hospital. We also shared the computational protocol via GitHub (https://github.com/SOCR/ML_BCP/).

Competing interests

The authors declare no competing interests.

Funding information

This work was supported by the University of Basel and the Freiwillige Akademische Gesellschaft (FAG) to Chang Ming. Research related to ML algorithms was supported in part by the U.S. National Science Foundation (Grant Numbers 1916425, 1734853, 1636840 and 1416953) and the National Institutes of Health (Grants P20 NR015331, P30 DK089503, UL1TR002240, R01CA233487 and R01MH12107) to Ivo D. Dinov, PhD. The funders had no role in study design, data collection and analyses, decision to publish or preparation of this paper.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Table 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ming, C., Viassolo, V., Probst-Hensch, N. et al. Machine learning-based lifetime breast cancer risk reclassification compared with the BOADICEA model: impact on screening recommendations. Br J Cancer 123, 860–867 (2020). https://doi.org/10.1038/s41416-020-0937-0

Download citation

Received: 02 December 2019
Revised: 13 May 2020
Accepted: 29 May 2020
Published: 22 June 2020
Issue Date: 01 September 2020
DOI: https://doi.org/10.1038/s41416-020-0937-0

This article is cited by

Application of machine learning approaches in predicting clinical outcomes in older adults – a systematic review and meta-analysis
- Robert T. Olender
- Sandipan Roy
- Prasad S. Nishtala
BMC Geriatrics (2023)
The current status of risk-stratified breast screening
- Ash Kieran Clift
- David Dodwell
- Julia Hippisley-Cox
British Journal of Cancer (2022)
An artificial intelligence approach for predicting cardiotoxicity in breast cancer patients receiving anthracycline
- Wei-Ting Chang
- Chung-Feng Liu
- Jhih-Yuan Shih
Archives of Toxicology (2022)
Modeling and prediction of pressure injury in hospitalized patients using artificial intelligence
- Christine Anderson
- Zerihun Bekele
- Ivo D. Dinov
BMC Medical Informatics and Decision Making (2021)
Adoption of artificial intelligence in breast imaging: evaluation, ethical constraints and limitations
- Sarah E. Hickman
- Gabrielle C. Baxter
- Fiona J. Gilbert
British Journal of Cancer (2021)

Subjects

Abstract

Background

Methods

Results

Conclusion

Similar content being viewed by others

Background

Methods

Swiss clinic-based retrospective data

BOADICEA model classification

ML risk classifications

Comparisons of performance measure and classification

Statistical analyses

Results

Model performance

Breast cancer-free women

Classification comparison

Clinical impact on mammography surveillance

Discussion

Strengths and limitations

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent to publish

Data availability

Competing interests

Funding information

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links