Germline BRCA 1-2 status prediction through ovarian ultrasound images radiogenomics: a hypothesis generating study (PROBE study)

Nero, Camilla; Ciccarone, Francesca; Boldrini, Luca; Lenkowicz, Jacopo; Paris, Ida; Capoluongo, Ettore Domenico; Testa, Antonia Carla; Fagotti, Anna; Valentini, Vincenzo; Scambia, Giovanni

doi:10.1038/s41598-020-73505-2

Download PDF

Article
Open access
Published: 05 October 2020

Germline BRCA 1-2 status prediction through ovarian ultrasound images radiogenomics: a hypothesis generating study (PROBE study)

Camilla Nero^1,4,
Francesca Ciccarone¹,
Luca Boldrini²,
Jacopo Lenkowicz²,
Ida Paris¹,
Ettore Domenico Capoluongo³,
Antonia Carla Testa¹,
Anna Fagotti¹,
Vincenzo Valentini² &
…
Giovanni Scambia¹

Scientific Reports volume 10, Article number: 16511 (2020) Cite this article

2843 Accesses
14 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Radiogenomics is a specific application of radiomics where imaging features are linked to genomic profiles. We aim to develop a radiogenomics model based on ovarian US images for predicting germline BRCA1/2 gene status in women with healthy ovaries. From January 2013 to December 2017 a total of 255 patients addressed to germline BRCA1/2 testing and pelvic US documenting normal ovaries, were retrospectively included. Feature selection for univariate analysis was carried out via correlation analysis. Multivariable analysis for classification of germline BRCA1/2 status was then carried out via logistic regression, support vector machine, ensemble of decision trees and automated machine learning pipelines. Data were split into a training (75%) and a testing (25%) set. The four strategies obtained a similar performance in terms of accuracy on the testing set (from 0.54 of logistic regression to 0.64 of the auto-machine learning pipeline). Data coming from one of the tested US machine showed generally higher performances, particularly with the auto-machine learning pipeline (testing set specificity 0.87, negative predictive value 0.73, accuracy value 0.72 and 0.79 on training set). The study shows that a radiogenomics model on machine learning techniques is feasible and potentially useful for predicting gBRCA1/2 status in women with healthy ovaries.

Validation of the diagnostic efficacy of O-RADS in adnexal masses

Article Open access 21 September 2023

Predicting Breast Cancer in Breast Imaging Reporting and Data System (BI-RADS) Ultrasound Category 4 or 5 Lesions: A Nomogram Combining Radiomics and BI-RADS

Article Open access 15 August 2019

Comparison of the ADNEX and ROMA risk prediction models for the diagnosis of ovarian cancer: a multicentre external validation in patients who underwent surgery

Article 19 January 2024

Introduction

Breast cancer susceptibility genes (BRCA1 and BRCA2) pathogenic variants (PVs) are correlated with a substantial elevated lifetime risk of developing breast (BC) and/or ovarian cancer (OC)^1,2,3,4,5,6. In particular, germline BRCA1 pathogenic variant carriers (g-BRCA 1 PV) have a lifetime risk of 65–80% of developing BC and 37–62% of developing OC, while BRCA 2 pathogenic variant carriers (g-BRCA 2 PV) have a lifetime risk of 45–85% for BC and 11–23% for OC respectively^1,2,3,4,5.

Healthy women resulting as g-BRCA 1/2 PV carriers are offered lifesaving prophylactic procedures and medications to prevent the onset of cancer^7,8.

Many studies documented an OC reduction risk by 85% to 95% after risk-reducing salpingo-oophorectomy and a BC risk reduction of 90–95% after bilateral risk-reducing mastectomy^{7,8,9,10,11,12,13}.

The prevalence in the general population of gBRCA1/2 PV varies between 0.2 and 0.3%^14,15, but the high penetrance of these PVs should be considered in relationship to patients familial history and age^16,17. Around 10–20% of ovarian cancer and 6% breast cancer overall are caused by inheritable BRCA1/BRCA2 PVs¹⁸.

A population-based BRCA mutation screening has already been considered as too expensive¹⁹, therefore different strategies are needed for the future.

Current guidelines suggest that the assessment for cancer predisposition should be warranted only to those patients who are likely to have an inherited mutation, mainly based on personal or family history of BC or OR^11,17.

Although this strategy is currently considered as the most cost-effective option, the available tools (such as BOADICEA, Myriad, BRCAPRO, etc.) combined with pre-test genetic counselling, showed a moderate to high diagnostic accuracy (around 2.6% diagnostic error rate) in predicting gBRCA1/2 mutations, however, a number of individuals, potentially at risk, might not be identified^{20,21,22,23,24,25}.

Demand for BRCA testing is increasing and in 2018, in the USA, the first direct-to-consumer test to report on gBRCA1/2 PVs was authorized²⁶.

The technological development of analytical tools, able to generate and deal with large volumes of data, has been dramatically changing the overall scenario in the last years.

In particular, the possibility to perform quantitative analysis of medical imaging and its association with biological and clinical markers, such as gene expression, has led to a novel approach called “radiogenomics”^27,28,29.

Pelvic ultrasound (US) scan is a widespread technique, regularly used in clinical practice to detect possible adnexal disorders. Normal fallopian tubes are not typically identified on US while exquisite images of the ovaries can be provided in both pre- and postmenopausal women³⁰. BRCA-associated OCs seem to originate in the fallopian tube but there is some evidence that up to 21% of occult cancers may involve the ovary alone³¹.

If gBRCA1/2 PVs affect ovarian imaging features, it could be possible to develop a model to assess gBRCA1/2 status through a radiogenomic analysis of US images of normal ovaries.

This model could help to improve patients’ selection for BRCA1/2 testing, overcoming the limits of the current criteria-based strategies.

We aimed at designing such a model and assessing its feasibility and performance.

Methods

Study design

This is an observational, single center study with patients retrospectively enrolled at the Fondazione Policlinico Universitario Agostino Gemelli IRCCS of Rome, Italy from January 2013 to December 2017.

A radiomics machine learning approach, based on features extracted from US images of the ovaries, was proposed to predict gBRCA1/2 carrier status.

The retrospective data on gBRCA1/2 testing performed on patients with NGS technique was considered as the reference standard of the radiomics analysis.

Transvaginal pelvic US was performed by dedicated gynecologic sonographers according to current international indications^32,33.

This study was conducted in accordance with the declaration of Helsinki and was approved by the the Fondazione Policlinico Universitario Agostino Gemelli IRCCS ethics committee (Prot. 50543/19; ID: 2907), with the requirement for informed consent.

Study population

The inclusion criteria were defined as follows: (i) patients recommended for BRCA testing according to international guidelines^7,17 and (ii) patients who underwent gBRCA1/2 testing in our institution (iii) and patients who underwent transvaginal pelvic US performed in our institution providing at least one picture of one healthy ovary and (iv) patient’s images had to be stored in .dicom format.

The exclusion criteria were (i) personal diagnosis of OC or (ii) ovarian abnormal findings (eg. ovarian endometriomas, dermoid cyst, ovarian cystoadenofibroma, ovarian borderline tumour…) but functional cysts at pelvic US or (iii) unavailable results of gBRCA1/2 testing or (iv) refusal to provide written informed consent.

gBRCA1/2 testing data acquisition

Recommendations for genetic counseling were provided according to international guidelines^7,17.

All patients included in the study received oncogenetic counseling before BRCA testing and signed written informed consent for BRCA analysis. Standardized procedures according to previously published workflows were observed^34,35.

DNA for genotyping was isolated from whole blood samples by a non-automated method based on a commercial kit (Roche Diagnostics, Basel, Switzerland), and spectral analysis was performed using a NanoPhotometer (Implen, München, Germany). In order to achieve the highest efficiency in multiplex polymerase chain reaction (PCR), DNA was extracted just before the amplification step.

Finally, DNA samples showing concentrations of 15 ng/mL, low quality (A260/A280 of 1.7), or a ‘smeared profile’ were re-extracted.

Molecular analysis by massive parallel sequencing using the Illumina MiSeq system (Illumina Inc., San Diego, CA, USA) was then performed^{36,37,38,39,40}.

In addition, horizon reference material was also used³⁶ and the bioinformatics pipeline was validated using reference materials (EMQN scheme)³⁷.

As previously reported, the bioinformatic “Amplicon Suite” CE- IVD tool (SmartSeq s.r.l., Novara, Italy) was used to determine depth of region covered, copy number status and variant calling³⁸.

Variants with disrupting effects, such as frameshift, nonsense, canonical-splice site and such missense, along with large rearrangements, which were overall unequivocally reported as “pathogenic (class 5)”, within the main databases based on multiple lines of evidence and expert reviewing board (like ENIGMA Consortium) and the International Agency for research on Cancer (IARC) annotation, were assigned to the gBRCA1/2 PV group.

All DNA samples resulting as negative for gBRCA1/2 alterations at the massive parallel sequencing, were further analyzed by multiplex ligation dependent probe amplification (MLPA) or multiplex amplicon quantification assays³⁹.

All amplicons with coverage under 38x, and all gBRCA1/2 variants that were classified as damaging (class 5), probably damaging (class 4) and variants of uncertain significance (class 3, VUS) were confirmed by targeted Sanger sequencing. Finally, all positive results detected by MLPA on two independent DNA samples were confirmed by long-range PCR (Expand Long Template PCR System, Roche Applied Science)⁴⁰.

Pelvic US imaging protocol

All pelvic US examinations were performed by dedicated gynecologic oncologists with more than 10 years of experience in gynecological ultrasound. The examinations were carried out using 4 different high quality ultrasound equipment (GE Healthcare Voluson E10, Canon Toshiba Aplio-i900, Samsung Elite, Esaote-My LabTM Twice). Only transvaginal US probes were used to acquire images (7–10 MHz) using a standardized examination technique⁴¹.

When performing ovarian examination with real‐time 2D‐US, the patient was positioned in lithotomy position, with empty bladder and the operator scanned the ovary in both axial and coronal planes, to identify the best image.

The ovary had to occupy at least 50% of the screen along its largest axis.

Three orthogonal maximum diameters of the ovary (total length; antero-posterior measurement on the same scan; transverse length in a perpendicular image) for a maximum of 2 images per ovary were required, for a maximum of 2 images per ovary if possible. Current international indications were followed by all sonographers^32,33. Other ultrasound features described were: morphology, echostructure and eventually abnormal vascularitation. The presence of functional cysts or the absence of one ovary, were was not considered an exclusion criterion. The presence of ovarian tumors, benign, uncertain or suspicious for malignant, according to the IOTA classification led to the exclusion from the current analysis⁴¹. In case of artifact, such as bowel gas shadowing, the patients were excluded if the images were not clear for analysis.

Image postprocessing

Code for the project is available at https://github.com/kbolab/PROBE. The repository contains the scripts for all the different part of the analysis, from ROI extraction, to feature computation, to statistical analysis. Everything regarding radiomics processing is a direct porting of the R library Moddicom.

The regions of interest (ROI) were manually delineated on the US image via the software Aliza version 1.48 by a trained gynecologist (Fig. 1)⁴².

Guidelines describing how to define the boundaries of the ovary were provided before starting ROI delineation and two senior dedicated gynecologic sonographers with at least 10 years’ experience in pelvic US imaging were responsible for the evaluation and independent check of ovarian segmentation.

Since the BRCA status outcome was associated to the patient, and each patient could be associated with a number of images varying from 1 to 4, the corresponding DICOM folder structure was organized in a hierarchical way so that a single ROI was labeled with the following set of tags: US machine, patient ID, left or right ovary, longitudinal or coronal plane. Each leaf folder structure contained therefore two files: the original DICOM and the ROI mask DICOM.

All the image files had been previously anonymized according to the existing privacy regulations.

US radiomics feature extraction and validation

The original DICOM file and the corresponding ROI segmentation masks were uploaded on the MODDICOM platform, an in-house developed software for radiomics analysis⁴³.

The software extracted the pixel values contained inside the ROI boundaries and computed 232 radiomics features. Prior to feature extraction, all the ROIs were rescaled to pixel value histograms between 0 and 255.

After that, four groups of imaging features were extracted from each normalized US image with manually segmented ROIs: 20 first order statistics, 14 morphological, 183 texture and 15 fractal features. First order statistics features describe the properties of the image grey level histogram considered as a whole. Morphological features describe the properties of the ROI in space. Texture features describe the properties of the local distribution of grey levels inside the ROI. Fractal features compute the fractal dimension through the box-counting algorithm. The software implementation, was entirely validated within the Image Biomarker Standardization Initiative⁴⁴ to ensure reproducibility and methodological robustness of the radiomics features extraction pipeline.

Statistical analysis

Feature-outcome association in univariate analysis

Radiomics features are often strongly correlated with each other, so that one can identify clusters of similarly distributed features through the pair-wise Pearson correlation test.

This approach is useful to reduce features dimensionality; the higher the number of features tested against an outcome, the higher the probability of over-fitting the data or getting a significant result just by chance will be⁴⁵.

The feature selection for univariate analysis was carried out via correlation analysis to remove all but one feature among groups of highly correlated features by setting a Pearson correlation threshold of 0.9. The remaining features were tested for association with the outcome with the Wilcoxon–Mann–Whitney test. A p-value of 0.05 was considered as statistically significant.

The number of significant test results was then compared to the expected number of type I errors to account for multiple testing⁴⁶.

Machine learning-based models

Multivariable analysis for classification of gBRCA1/2 status was carried out via four strategies involving machine learning techniques: logistic regression (A), support vector machine (B), ensemble of decision trees (C) and automated machine learning pipelines (D). The corresponding models were trained on the 75% of the data, while the remaining 25% of data was left out for testing purposes.

A first feature selection was carried out for each strategy with correlation analysis to remove all but one highly correlated feature by setting a Pearson correlation threshold of 0.9, as performed in the univariate analysis.

Strategies A and B involved logistic regression and radial kernel support vector machine, respectively: both strategies had a forward feature selection based Cohen Kappa values at the 0.75 splitted training set ROC Youden index.

We used Cohen’s Kappa as confusion matrix evaluation metrics to take into account the class imbalance and the related fact that expected accuracy is “skewed” towards values higher than 50%. Also, for the same reason, this metric allows for direct comparison between results obtained on datasets with different class distribution, so that for future work on external validation or prospective data the classification performances will be directly comparable with the ones reported in this paper.

Detailed steps of these iterative strategies, starting from n equals to 1, are:

i.
All the possible different models with n features are trained using all the different groups of n features;
ii.
The ROC curve on the training set is computed for every n feature model;
iii.
The optimal cut-off point according to Youden index will be found on every ROC curve of ii) and the Cohen Kappa value of the classification matrix at the optimal point will be computed;
iv.
The model with the highest value of the Cohen Kappa computed in iii) is then held for the next iteration, while all the others are discarded;
v.
All the possible different models with n + 1 features are trained adding to the n features selected in the previous step, all the features left available, one by one;
vi.
The process is then repeated until the Cohen’s Kappa value does not increase by adding a new feature to the model.

Strategy C involved Extreme Gradient Boosting (XGBoost)⁴⁶ with extensive grid search on the hyperparameters space and fivefold cross validation error on the 0.75 split training set as the metric to be minimized. The selected model was then tested on the remaining 25% of data.

Auto Machine Learning for classification with TPOT Python library⁴⁷ has been used in strategy D, testing the spectrum of classification models and their hyperparameters with a heuristic search conducted through a genetic algorithm with the fivefold cross validation error on the 0.75 splitted training set as the metric to be minimized.

The genetic algorithm was initialized with TPOT default parameters. The selected model was then tested on the remaining 25% of data.

All statistical analyses were conducted with R version 3.4.4⁴⁸ and Python version 3.7.4⁴⁹.

Results

Clinical characteristics of the study cohort

From January 2013 to December 2017, 1865 consecutive patients underwent gBRCA 1/2 testing in our institution. Among these women, 248 had a gBRCA1 PV or VUS (13.3%), 231 had a gBRCA2 PV or VUS (12.3%), 3 had gBRCA-1 and -2 PVs (compound heterozygotes, 0.16%) while 1383 were wild type (WT) (74%).

Patients with personal history of OC and/or endometrial cancer (109) were excluded.

Moreover, 684 patients out of 1756 underwent pelvic US at our institution. Pelvic US was part of the annual gynecologic assessment worldwide recommended for the optimal promotion of women’s health⁵⁰.

Patients who did not have US images stored as .dicom files and those with abnormal findings regarding the ovaries at US imaging were excluded from the study. Even in cases of unilateral lesion, we preferred not to use for analysis the contra lateral ovary in order to reduce possible bias.

A total of 890 images was finally analyzed in the current study.

Figure 2 illustrates the general flowchart of the study.

Table 1 reports the clinical characteristics of the 255 included women; no significant differences were found in most clinico-pathologic factors between the testing and training sets.

Table 1 The clinical characteristics of the testing and training dataset.

Full size table

Overall, the mean age was 45.5 years but the majority of women (62%) were in menopause (both iatrogenic or natural). This is consistent with the fact that the indication to the test was mainly related to personal history of BC (80%). The mean number of images per patient was 3.5, ensuring quality to US approach.

No significant differences were observed in the two image sets.

Correlation and univariate analysis

After the segmentation of the ROIs and the following feature extraction, the final feature set was composed by 232 radiomics features.

The outcome was binarized into gBRCA1/2 WT patients (558 images) and gBRCA1/2-ve patients (332 images). gBRCA 1/2 VUS were considered among gBRCA 1/2 PVs because although the pathogenetic potential of this group of mutations is unknown, these patients deserve to be tested and followed up.

A specific dataset for each US machine was also produced for the analysis with the following number of images: Voluson (350 images), Toshiba (190 images), Samsung (221 images), Esaote (134 images).

The proportion of g-BRCA 1-2 pathogenetic cases was comparable in all the datasets.

A first feature selection was carried out via correlation analysis on each dataset to reduce the feature space at a Pearson correlation threshold of 0.9, resulting in 61 features for the full dataset, 45 for Voluson, 42 for Toshiba, 44 for Samsung and 40 for Esaote.

The overlapping features identified from each machine after Pearson correlation analysis are shown in Supplementary Table 1.

The radiomics features in these sorted datasets were tested individually in univariate analysis using Wilcoxon–Mann–Whitney test with the binary outcome.

The features which resulted in a statistically significant test were 6 for the full dataset, 16 for the Voluson dataset, 3 for the Toshiba dataset, 3 for the Samsung dataset and 2 for the Esaote dataset. As a percentage of the tested features, these figures represent the 9.8%, 35.0%, 7.1%, 6.8% and 5.0% respectively (as shown in Table 2).

Table 2 Number of features selected excluding (also, in percentage of the number of selected features).

Full size table

The overlapping statistically significant features identified from each machine after Wilcoxon–Mann–Whitney test, are shown in Supplementary Table 2.

Since the significance threshold was set at 0.05, we observed that the number of significant features was above the expected value of type I errors in four out five datasets, with a particularly high rate of statistically significant associations for the features coming from Voluson images.

The p-values of the Wilcoxon–Mann–Whitney test for the significant features are reported in Table 3.

Table 3 Radiomics features whose association test with the binary outcome was statistically significative with corresponding p-value, for the different datasets.

Full size table

The vast majority of statistically significant features were texture features: 15 out of 16 for Voluson; 2 out of 2 for Esaote; 2 out of 3 for Samsung; 2 out of 3 for Toshiba. Among these, the most selected texture type of features were co-occurrence matrix and run-length matrix based features.

Performance of machine learning models

Multivariable analysis for gBRCA pathogenetic variant classification was carried out with the four described machine learning strategies (A), (B), (C), and (D). Classification metrics for the different models on the five datasets are reported in Supplementary Table 3 and summarized in Fig. 3. The train-test split was per patient so to prevent information leakage from training to testing set. After the split, the model was trained and tested per image, and its diagnostic performance is reported on an image basis.

Considering the full dataset, the four strategies obtained a similar performance in terms of classification accuracy on the testing set, varying from 0.54 of logistic regression of strategy A to 0.64 of strategy D with the auto-machine learning pipeline.

Strategy D showed also the highest value of specificity on the testing set (0.91) and a negative predictive value of 0.65, comparable to the one obtained with strategy C but lower than the corresponding values obtained with strategies A and B, which were 0.74 and 0.80, respectively.

Considering the Voluson data only, performances were generally higher: in particular, testing set specificity and negative predictive value reached 0.77 and 0.81, respectively, with the radial kernel support vector machine of strategy B; 0.82 and 0.70 with the extreme gradient boosting of strategy C, and 0.87 and 0.73 with the auto-machine learning pipeline of strategy D.

The latter combination, strategy D on Voluson data, showed also the best consistency between training and testing set accuracy (0.79 and 0.72 respectively) suggesting that the adopted strategy performance was particularly robust to avoid data overfitting, which might improve the classification performances on larger image numbers to obtain both a higher specificity and a higher negative predictive value.

Discussion

In this single center study, we developed an automated machine learning pipeline model with encouraging performances to identify gBRCA1/2 status based on US images of healthy ovaries, acquired on different US machines.

This result can be considered as an instance of “real world data” from the perspective of radiomics analysis and paves the way to possible developments of this model with direct consequences on patients diagnostic workflow and prognostic stratification, in the frame of the most modern personalized medicine paradigms.

More specifically, the described model, as a large-scale screening test, would assign only 13 out of 100 gBRCA1/2 WT patients (with the current value of specificity at 0.87) to unnecessary genetic screening.

In our series, 157 out of 255 patients (61.5%) were assigned to genetic screening based on current clinical selection criteria, but resulted as gBRCA1/2 WT.

Not many data have been published on the performance of the current clinical criteria selection in detecting BRCA1/BRCA2 mutations. Nevertheless, as reported by Manchanda et al.¹⁸, despite 25 years of BRCA testing and effective mechanisms for prevention, current guidelines and access to testing/treatment pathways remain complex and associated with a massive under-utilisation of genetic testing. Only 20% of eligible US women have accessed/undergone genetic testing. A UK study showed the large majority (> 97%) of BRCA carriers in the population remain unidentified⁵¹. In addition, current approaches use established clinical-criteria/family-history based a priori BRCA probability thresholds to identify high-risk individuals eligible for BRCA testing. The estimation of likelihood of being carrier of BRCA1/2 PV is based on different tools, with possible selection biases related to different algorithms performing the risk assessment: in such countries the threshold is 20% while in others 10%. Over 50% BRCA carriers do not fulfil clinical criteria and are therefore missed⁵².

Our model, when tested and validated on the general population, could be an extremely cost-effective strategy for a population-based BRCA mutation screening.

Nevertheless, the current described negative predictive value of 0.73 implies that 27 out of 100 gBRCA1/2 PV carriers would not be assigned to the needed genetic test, probably representing the most significant limit of this hypothesis generating radiogenomics approach.

Compared to CT or MRI, radiomics on US images is a relatively new and less explored type of analysis. Nonetheless, it is gaining gound^53,54,55,56.

Most significant features for the screening tool both in univariate analysis and in the machine learning models are texture features, which are often not quite easy to visualize. Nevertheless, among the most influential features (especially considering the Voluson dataset) those related to clusters of high-intensity grey levels are predominant. In particular, the “short run high grey level emphasis” feature is the leading feature in the xgboost model with roughly twice the “gain” value than the second most important feature. Also, the univariate statistical association of this feature with the binary outcome through Wilcoxon–Mann–Whitney test is significative with a p-value of order 10⁻⁵, the sign of the effect being that BRCA mutated group have an higher feature median value (i.e. BRCA mutated images have more short runs, or small clusters, of high grey level intensities).

As previously discussed, Voluson machine showed more significant features and higher performances compared with the other machines. The explanation may lie in the fact that Voluson dataset has 60% more records than the biggest among Samsung, Esaote, and Toshiba datasets (which is the Samsung dataset with 260 records). This can cause stronger statistical associations if the effect is there. However, more data with a balanced number of images among various US machines are needed to clarify this aspect.

Based on this analysis, authors believe that this model deserves further investigation to increase its performance and to better describe its translational application in daily clinical practice.

Several issues and limitations have been encountered during the analysis.

First, the retrospective nature of the study represents an unavoidable source of selection bias and imaging data inhomogeneity. In particular, patients were not divided in subgroups according to their hormonal status and for fertile age women (36%) the time on which the ultrasound was performed was not established. All these aspects may have impacted on textural radiomic features.

Second, the monocentric nature of the study and the qualification of our institution as a referral center for US in gynecological malignancies may not fully reflect real word data, especially in terms of US image quality.

Third, the absence of an external validation does not allow us to draw any definitive conclusion on the replicability of the model, even if the impact of this limit is reduced thanks to the heterogeneity of the tested population and the use of different US scanners.

Fourth, our patients were only screened for BRCA1/2: no other genes involved in the same homologous recombination repair pathways were included in the analysis. Therefore, we cannot exclude the presence of PVs in the other genes whose mutational status could correlate with US patterns, limiting the clinical impact of the proposed approach.

Fifth, the model was developed on women with high risk factors for gBRCA1/2 PVs therefore could be applied only to this clinical setting.

These aspects will be taken into account at best to improve the power of our model in following investigations.

Overall, the model could help patients who may have unnecessary genetic tests when identified by the clinical screening method. However, its actual performance, in terms of negative predictive values, does not allow applying it as a reliable screening tool to optimize patient selection for genetic screening of BRCA1/2 genes.

However, the novelty of this radiogenomics application, its potentiality as a screening tool and the encouraging preliminary results support the reliability and the value of our findings.

For these reasons, we are planning to expand the series including patients enrolled in 2018 and 2019, validate the model on external cohorts of patients and finally test it in the general population as a screening tool.

In conclusion, the authors believe that in the next future a radiogenomics approach based on ovarian US images can become a successful screening tool to identify women who are highly likely to have gBRCA1/2 PV and therefore should be assigned to genetic testing and counselling pursuing healthcare cost-containment policies.

This radiogenomic model could represent a cost-effective, highly reproducible, large-scale extendable and time saving screening tool although further investigations are necessary to improve sensitivity and specificity before its effective clinical release.

References

Antoniou, A. et al. Average risks of breast and ovarian cancer associated with BRCA1 or BRCA2 mutations detected in case series unselected for family history: A combined analysis of 22 studies. Am. J. Hum. Genet. 72, 1117–1130 (2003).
Article CAS Google Scholar
Chen, S. & Parmigiani, G. Meta-analysis of BRCA1 and BRCA2 penetrance. J. Clin. Oncol. 25, 1329–1333 (2007).
Article Google Scholar
Mavaddat, N. et al. Cancer risks for BRCA1 and BRCA2 mutation carriers: Results from prospective analysis of EMBRACE. J. Natl. Cancer Inst. 105, 812–822 (2013).
Article CAS Google Scholar
Hartmann, L. C. & Lindor, N. M. Risk-reducing surgery in hereditary breast and ovarian cancer. N. Engl. J. Med. 374(24), 2404. https://doi.org/10.1056/NEJMc1602861 (2016). (No abstract available).
Article PubMed Google Scholar
Balmaña, J., Díez, O., Castiglione, M., & ESMO Guidelines Working Group. BRCA in breast cancer: ESMO clinical recommendations. Ann. Oncol. 20(Suppl 4), 19–20 (2009).
Article Google Scholar
Torre, L. A. et al. Global cancer statistics, 2012. CA Cancer J Clin. 65(2), 87–108. https://doi.org/10.3322/caac.21262 (2015). (Epub 2015 Feb 4).
Article PubMed Google Scholar
National Comprehensive Cancer Network. Genetic/Familial High-Risk Assessment: Breast and Ovarian. NCCN Clinical Practice Guidelines in Oncology. Version 3.2019—January 18, 2019
Walker, J. L. et al. Society of Gynecologic Oncology recommendations for the prevention of ovarian cancer. Cancer 121(13), 2108–2120. https://doi.org/10.1002/cncr.29321 (2015). (Epub 2015 Mar 27, Review).
Article PubMed Google Scholar
Rebbeck, T. R., Kauff, N. D. & Domchek, S. M. Meta-analysis of risk reduction estimates associated with risk-reducing salpingo-oophorectomy in BRCA1 or BRCA2 mutation carriers. J. Natl. Cancer Inst. 101(2), 80–87. https://doi.org/10.1093/jnci/djn442 (2009). (Epub 2009 Jan 13).
Article CAS PubMed PubMed Central Google Scholar
Domchek, S. M. et al. Mortality after bilateral salpingo-oophorectomy in BRCA1 and BRCA2 mutation carriers: A prospective cohort study. Lancet Oncol. 7(3), 223–229 (2006).
Article CAS Google Scholar
Eleje, G. U. et al. Risk-reducing bilateral salpingo-oophorectomy in women with BRCA1 or BRCA2 mutations. Cochrane Database Syst. Rev. 8, CD012464. https://doi.org/10.1002/14651858.CD012464.pub2 (2018). (Review).
Article PubMed Google Scholar
Ludwig, K. K., Neuner, J., Butler, A., Geurts, J. L. & Kong, A. L. Risk reduction and survival benefit of prophylactic surgery in BRCA mutation carriers, a systematic review. Am. J. Surg. 212(4), 660–669. https://doi.org/10.1016/j.amjsurg.2016.06.010 (2016). (Epub 2016 Jul 18, Review).
Article PubMed Google Scholar
Mann, G. J. et al. Analysis of cancer risk and BRCA1 and BRCA2 mutation prevalence in the kConFab familial breast cancer resource. Breast Cancer Res. 8(1), R12 (2006).
Article Google Scholar
Ferla, R. et al. Founder mutations in BRCA1 and BRCA2 genes. Ann. Oncol. 18(Suppl 6), vi93–vi98 (2007).
Article Google Scholar
Nelson, H.D., Fu, R., Goddard, K. et al.Risk Assessment, Genetic Counseling, and Genetic Testing for BRCA-Related Cancer: Systematic Review to Update the U.S. Preventive Services Task Force Recommendation. Report No.: 12-05164-EF-1. (Agency for Healthcare Research and Quality (US), Rockville, 2013).
Kuchenbaecker, K. B. et al. Risks of breast, ovarian, and contralateral breast cancer for BRCA1 and BRCA2 mutation carriers. JAMA 317(23), 2402–2416. https://doi.org/10.1001/jama.2017.7112 (2017).
Article CAS PubMed Google Scholar
Paluch-Shimon, S. et al. Prevention and screening in BRCA mutation carriers and other breast/ovarian hereditary cancer syndromes: ESMO Clinical Practice Guidelines for cancer prevention and screening. Ann. Oncol. 27(suppl 5), v103–v110 (2016).
Article CAS Google Scholar
Gaba, F. et al. Population study of ovarian cancer risk prediction for targeted screening and prevention. Cancers. 12(5), 124. https://doi.org/10.3390/cancers12051241 (2020).
Article Google Scholar
D’Andrea, E. et al. Which BRCA genetic testing programs are ready for implementation in health care? A systematic review of economic evaluations. Genet. Med. 18, 1171–1180 (2016).
Article Google Scholar
Nelson, H. D., Pappas, M., Cantor, A., Haney, E. & Holmes, R. Risk assessment, genetic counseling, and genetic testing for BRCA-related cancer in women: Updated evidence report and systematic review for the US Preventive Services Task Force. JAMA 322(7), 666–685. https://doi.org/10.1001/jama.2019.8430 (2019).
Article PubMed Google Scholar
Tuffaha, H. W. et al. Cost-effectiveness analysis of germ-line BRCA testing in women with breast cancer and cascade testing in family members of mutation carriers. Genet. Med. 20(9), 985–994. https://doi.org/10.1038/gim.2017.231 (2018). (Epub 2018 Jan 4).
Article PubMed Google Scholar
Anglian Breast Cancer Study Group. Prevalence and penetrance of BRCA1 and BRCA2 mutations in a population-based series of breast cancer cases. Br. J. Cancer. 83, 1301–1308 (2000).
Article Google Scholar
Wood, M. E., Flynn, B. S. & Stockdale, A. Primary care physician management, referral, and relations with specialists concerning patients at risk for cancer due to family history. Public Health Genomics. 16, 75–82 (2013).
Article CAS Google Scholar
Manickam, K. et al. Exome sequencing-based screening for BRCA1/2 expected pathogenic variants among adult biobank participants. JAMA Netw. Open. 1(5), e182140. https://doi.org/10.1001/jamanetworkopen.2018.2140 (2018).
Article PubMed PubMed Central Google Scholar
Ellison, G., Wallace, A., Kohlmann, A. & Patton, S. A comparative study of germline BRCA1 and BRCA2 mutation screening methods in use in 20 European clinical diagnostic laboratories. Br. J. Cancer 117(5), 710–716 (2017).
Article CAS Google Scholar
U.S Food and Drug Administration. FDA authorizes, with special controls, direct-to-consumer test that reports three mutations in the BRCA breast cancer genes [news release]. Silver Spring (MD): FDA; (2018). https://www.fda.gov/NewsEvents/Newsroom/PressAnnouncements/ucm599560.htm. Accessed 7 May 2018.
Story, M. D. & Durante, M. Radiogenomics. Med. Phys. 45(11), e1111–e1122. https://doi.org/10.1002/mp.13064. (2018). (Review).
Article PubMed Google Scholar
Bodalal, Z., Trebeschi, S. & Beets-Tan, R. Radiomics: A critical step towards integrated healthcare. Insights Imaging. 9(6), 911–914. https://doi.org/10.1007/s13244-018-0669-3 (2018). (Epub 2018 Nov 12).
Article PubMed PubMed Central Google Scholar
Aerts, H. J. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach [Erratum in: Nat Commun. 2014;5:4644. Cavalho, Sara (corrected to Carvalho, Sara)]. Nat. Commun. 5, 4006. https://doi.org/10.1038/ncomms5006 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Panchal, S. & Nagori, C. Imaging techniques for assessment of tubal status. J. Hum. Reprod. Sci. 7(1), 2–12 (2014).
Article Google Scholar
Yates, M. S. et al. Microscopic and early-stage ovarian cancers in BRCA1/2 mutation carriers: Building a model for early BRCA-associated tumorigenesis. Cancer Prev. Res. 4, 463–470 (2011).
Article Google Scholar
Coelho Neto, M. A. et al. Counting ovarian antral follicles by ultrasound: A practical guide. Ultrasound Obstet Gynecol. 51(1), 10–20. https://doi.org/10.1002/uog.18945 (2018).
Article CAS PubMed Google Scholar
Abuhamad, A. et al. Obstetric and gynecologic ultrasound curriculum and competency assessment in residency training programs: Consensus report. J. Ultrasound Med. 37(1), 19–50. https://doi.org/10.1002/jum.14519 (2018).
Article PubMed Google Scholar
Capoluongo, E. et al. Guidance statement on BRCA1/2 tumor testing in ovarian cancer patients. Semin. Oncol. 44(3), 187–197 (2017).
Article CAS Google Scholar
Capoluongo, E. BRCA to the future: Towards best testing practice in the era of personalised healthcare. Eur. J. Hum. Genet. 24(Suppl 1), S1–S2 (2016).
Article CAS Google Scholar
BRCA Germline I Reference Standard gDNA. https://www.horizondiscovery.com/brca-germline-i-hd793
https://www.emqn.org/schemes/breast-ovarian-cancer-familial-full-version/
Concolino, P. & Capoluongo, E. Genetic test reports were collected as part of the medical record. For the purpose of the study the outcome was binarized: g- BRCA1-2 p and VUS vs g-BRCA 1-2 wild type (g-BRCA WT). Expert Rev. Mol. Diagn. 19(9), 795–802 (2019).
Article CAS Google Scholar
Minucci, A. et al. Clinical impact on ovarian cancer patients of massive parallel sequencing for BRCA mutation detection: The experience at Gemelli hospital and a literature review. Expert Rev. Mol. Diagn. 15(10), 1383–1403 (2015).
Article CAS Google Scholar
Concolino, P. et al. A preliminary quality control (QC) for next-generation sequencing (NGS) library evaluation turns out to be a very useful tool for a rapid detection of BRCA1/2 deleterious mutations. Clin. Chim. Acta. 437, 72–77 (2014). (Review).
Article CAS Google Scholar
Timmerman, D. et al. Terms, definitions and measurements to describe the sonographic features of adnexal tumors: A consensus opinion from the International Ovarian Tumor Analysis (IOTA) Group. Ultrasound Obstet. Gynecol. 16(5), 500–505 (2000).
Article CAS Google Scholar
https://www.aliza-dicom-viewer.com/
Dinapoli, N. et al. Moddicom: a complete and easily accessible library for prognostic evaluations relying on image features. In Conference Proceeding IEEE Engineering in Medicine and Biology Society 771–774. (2015)
Zwanenburg, A. et al. The image biomarker standardization initiative: Standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology 295(2), 328–338. https://doi.org/10.1148/radiol.2020191145 (2020).
Article PubMed Google Scholar
Park, J. E., Park, S. Y., Kim, H. J. & Kim, H. S. Reproducibility and generalizability in radiomics modeling: Possible strategies in radiologic and statistical perspectives. Korean J. Radiol. 20(7), 1124–1137 (2019).
Article Google Scholar
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In 22nd SIGKDD Conference on Knowledge Discovery and Data Mining (2016)
Olson, R.S., Urbanowicz, R.J., Andrews, P.C., Lavender, N.A. & Moore, J.H. Moore Automating biomedical data science through tree-based pipeline optimization. In: Applications of Evolutionary Computation 123–137 (2016).
R Core Team R: A language and environment for statistical computing. https://www.R-project.org/. (R Foundation for Statistical Computing, Vienna, 2018)
Python Software Foundation. Python Language Reference, version 2.7. https://www.python.org
ACOG Committee Opinion, number 755 Vol. 132, No. 4, October 2018 (Replaces Committee Opinion No. 534, August 2012)
Manchanda, R. et al. Current detection rates and time-to-detection of all identifiable BRCA carriers in the Greater London population. J. Med. Genet. 55, 538–545 (2018).
Article Google Scholar
Beitsch, P. D. et al. Underdiagnosis of hereditary breast cancer: Are genetic testing guidelines a tool or an obstacle?. J. Clin. Oncol. 37, 453 (2018).
Article Google Scholar
Zhou, H. et al. Differential diagnosis of benign and malignant thyroid nodules using deep learning radiomics of thyroid ultrasound images. Eur. J. Radiol. https://doi.org/10.1016/j.ejrad.2020.108992 (2020).
Article PubMed PubMed Central Google Scholar
Guo, Y. et al. Radiomics analysis on ultrasound for prediction of biologic behavior in breast invasive ductal carcinoma. Clin. Breast Cancer. https://doi.org/10.1016/j.clbc.2017.08.002 (2018).
Article PubMed Google Scholar
Du, Y. et al. Application of ultrasound-based radiomics technology in fetal lung texture analysis in pregnancies complicated by gestational diabetes or pre-eclampsia. Ultrasound Obstet. Gynecol. https://doi.org/10.1002/uog.22037 (2020).
Article PubMed Google Scholar
Liu, Z. et al. The applications of radiomics in precision diagnosis and treatment of oncology: Opportunities and challenges. Theranostics. 9(5), 1303–1322 (2019).
Article Google Scholar

Download references

Acknowledgements

We wish to thank Franziska Lohmeyer for her English language assistance.

Author information

Authors and Affiliations

Dipartimento per le Scienze della salute della donna, del bambino e di sanità pubblica, Fondazione Policlinico Universitario A. Gemelli IRCCS, Gynecologic Oncology, Rome, Italy
Camilla Nero, Francesca Ciccarone, Ida Paris, Antonia Carla Testa, Anna Fagotti & Giovanni Scambia
Dipartimento di Diagnostica per immagini, radioterapia oncologica ed ematologia, Fondazione Policlinico Universitario A. Gemelli IRCCS, Rome, Italy
Luca Boldrini, Jacopo Lenkowicz & Vincenzo Valentini
Department of Molecular Medicine and Medical Biotechnology, Federico II University-CEINGE, Advanced Biotechnology, Naples, Italy
Ettore Domenico Capoluongo
Department of Obstetrics and Gynecology, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University of the Sacred Heart, L.go A. Gemelli 8, 00168, Rome, Italy
Camilla Nero

Authors

Camilla Nero
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Ciccarone
View author publications
You can also search for this author in PubMed Google Scholar
Luca Boldrini
View author publications
You can also search for this author in PubMed Google Scholar
Jacopo Lenkowicz
View author publications
You can also search for this author in PubMed Google Scholar
Ida Paris
View author publications
You can also search for this author in PubMed Google Scholar
Ettore Domenico Capoluongo
View author publications
You can also search for this author in PubMed Google Scholar
Antonia Carla Testa
View author publications
You can also search for this author in PubMed Google Scholar
Anna Fagotti
View author publications
You can also search for this author in PubMed Google Scholar
Vincenzo Valentini
View author publications
You can also search for this author in PubMed Google Scholar
Giovanni Scambia
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization, C.N.; Methodology, C.N., F.C. and L.B.; Formal Analysis, J.L. and L.B.; Data Curation, F.C., I.P., E.D.C. and L.B.; Writing—Original Draft Preparation, C.N. and J.L.; Writing—Review and Editing E.D.C., G.S., V.V., A.C.T., A.F.; Supervision, G.S. and V.V.

Corresponding author

Correspondence to Camilla Nero.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Nero, C., Ciccarone, F., Boldrini, L. et al. Germline BRCA 1-2 status prediction through ovarian ultrasound images radiogenomics: a hypothesis generating study (PROBE study). Sci Rep 10, 16511 (2020). https://doi.org/10.1038/s41598-020-73505-2

Download citation

Received: 08 June 2020
Accepted: 14 September 2020
Published: 05 October 2020
DOI: https://doi.org/10.1038/s41598-020-73505-2

This article is cited by

A cost-effectiveness analysis of an integrated clinical-radiogenomic screening program for the identification of BRCA 1/2 carriers (e-PROBE study)
- A. Di Pilla
- C. Nero
- G. Scambia
Scientific Reports (2024)
DrABC: deep learning accurately predicts germline pathogenic mutation status in breast cancer patients based on phenotype data
- Jiaqi Liu
- Hengqiang Zhao
- Nan Wu
Genome Medicine (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Validation of the diagnostic efficacy of O-RADS in adnexal masses

Predicting Breast Cancer in Breast Imaging Reporting and Data System (BI-RADS) Ultrasound Category 4 or 5 Lesions: A Nomogram Combining Radiomics and BI-RADS

Comparison of the ADNEX and ROMA risk prediction models for the diagnosis of ovarian cancer: a multicentre external validation in patients who underwent surgery

Introduction

Methods

Study design

Study population

gBRCA1/2 testing data acquisition

Pelvic US imaging protocol

Image postprocessing

US radiomics feature extraction and validation

Statistical analysis

Feature-outcome association in univariate analysis

Machine learning-based models

Results

Clinical characteristics of the study cohort

Correlation and univariate analysis

Performance of machine learning models

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

A cost-effectiveness analysis of an integrated clinical-radiogenomic screening program for the identification of BRCA 1/2 carriers (e-PROBE study)

DrABC: deep learning accurately predicts germline pathogenic mutation status in breast cancer patients based on phenotype data

Comments

Search

Quick links