Abstract
Contrast-enhanced mammography (CEM) has shown to be superior to full-field digital mammography (FFDM), but current results are dominated by studies performed on systems by one vendor. Information on diagnostic accuracy of other CEM systems is limited. Therefore, we aimed to evaluate the diagnostic performance of CEM on an alternative vendor’s system. We included all patients who underwent CEM in one hospital in 2019, except those with missing data or in whom CEM was used as response monitoring tool. Three experienced breast radiologists scored the low-energy images using the BI-RADS classification. Next, the complete CEM exams were scored similarly. Histopathological results or a minimum of one year follow-up were used as reference standard. Diagnostic performance and AUC were calculated and compared between low-energy images and the complete CEM examination, for all readers independently as well as combined. Breast cancer was diagnosed in 23.0% of the patients (35/152). Compared to low-energy images, overall CEM sensitivity increased from 74.3 to 87.6% (p < 0.0001), specificity from 87.8 to 94.6% (p = 0.0146). AUC increased from 0.872 to 0.957 (p = 0.0001). Performing CEM on the system tested, showed that, similar to earlier studies mainly performed on another vendor’s systems, both sensitivity and specificity improved when compared to FFDM.
Similar content being viewed by others
Introduction
Contrast-enhanced mammography (CEM) is an emerging breast imaging modality in which the intravenous administration of an iodine based contrast agent, combined with dual-energy mammography, is used to increase the diagnostic accuracy of full-field digital mammography (FFDM). The first commercially available FDA approved CEM system was introduced by GE Healthcare in 20111. The use of CEM in the breast imaging community is steadily increasing, as is not only reflected by the increasing number of installations and scientific literature, but also by the fact that multiple vendors now have their own commercially available mammography systems capable to perform CEM.
The different CEM capable mammography units are released under different brand names by various vendors. For example, GE Healthcare marketed the name ‘Contrast-Enhanced Spectral Mammography’ (CESM), while both Hologic and Fujifilm christened their techniques ‘Contrast-Enhanced 2D imaging’ (CE2D). Siemens Healthineers released their product under the name ‘Titanium Contrast-Enhanced Mammography’ (TiCEM)2.
Studies have shown that CEM is consistently superior to FFDM in terms of diagnostic accuracy, especially in women with dense breasts3. Due to this increased diagnostic accuracy, it is even being considered for screening purposes4. In a review by Zhu et al., including 18 studies, the pooled sensitivity and specificity of CEM was 0.89 and 0.84, respectively5. More recently, Suter et al. published a meta-analysis of eight prospective studies, finding a pooled sensitivity and specificity of 0.85 and 0.77, respectively6.
Current knowledge of CEM, however, is mainly based on studies that have used a system from GE Healthcare. Zanardo et al. observed that only 14 of the 84 studies (1755/14012 cases) included in their review were performed on another system7. Even then, some of these studies focused on other topics than diagnostic accuracy (e.g., evaluation of disease extent, radiation dose studies, or CEM as response monitoring tool in women treated with neoadjuvant chemotherapy). To facilitate a broader acceptance of CEM in clinical practice, it is important to study the consistency and reproducibility of results, also on other vendors and their systems. However, it is unethical to have women undergo more than one CEM exam solely for this research purpose.
Therefore, the aim of this retrospective study was to evaluate the diagnostic performance of FFDM versus CEM on a Hologic system using three different readers.
Methods and materials
Patient population
The CEM system used was installed in January 2019 and the first clinical CEM was performed in February 2019. All consecutive patients who underwent CEM in the period of February to December 2019 were eligible for this retrospective study. Indications for CEM imaging included: additional imaging after screening recall, screening of women with hereditary breast cancer when breast MRI was contra indicated, patients with symptoms which raise suspicion of breast cancer (e.g., palpable masses, skin abnormalities, etc.), follow-up after breast cancer treatment, inconclusive findings after prior breast imaging, supplemental imaging in dense breasts, and coincidental findings in non-breast specific imaging (e.g., chest CT). Excluded were patients who were treated with neoadjuvant chemotherapy, where CEM was solely used as response monitoring tool.
The Medical Research Involving Human Subject Act (WMO) did not apply to this study protocol and an official approval of this study by the Medical Ethics Review Committee of University hospital Maastricht and Maastricht University (METC azM/UM) was not required (decision no. METC 2019-1427). Due to the retrospective nature of the study, the METC azM/UM and the Trial Advisory Committee of Laurentius Hospital (decision no. 20/10) waived the requirement for obtaining informed consent. All methods were carried out in accordance with relevant guidelines and regulations.
Imaging protocol
The CEM imaging principle and protocol was described earlier2,8,9. In short, an iodine based contrast agent is injected intravenously two minutes prior to image acquisition. After two minutes, a dual-energy mammography examination is performed in at least craniocaudal (CC) and mediolateral oblique (MLO) view of each breast. As final result, the radiologist can read a low-energy (LE) image (which is comparable to a standard full-field digital mammogram or FFDM10,11) and a recombined (iodine) image (in which areas of contrast uptake can be appreciated). In this study, a non-ionic, low-osmolar monomer, iobitridol (Xenetix 300® (300 mg I/ml), Guerbet) was administered (dose 1.5 mL/kg body weight, flow rate of 2.5–3 mL/s, followed by saline flush) using a power injector (MEDRAD Stellant, Bayer). All CEM exams were performed on a single dedicated mammography system (Selenia Dimensions, with I-View CE2D upgrade, Hologic Inc.).
Image analysis
Three readers independently assessed all CEM exams on a dedicated workstation (Coronis Fusion 6MP LED monitor, Barco; SecurView workstation, Hologic). All readers had extensive experience reading breast imaging exams, ranging from 11 to 25 years. First, the LE images were assessed and scored using the Breast Imaging Reporting And Data System (BI-RADS) classification12. Second, the complete CEM exams (i.e., LE plus recombined images) were assessed. When viewing the complete CEM exam, the radiologists were allowed to adjust their initial BI-RADS classification if deemed necessary. All readers were blinded for each other’s classifications and the final diagnosis. The breast density classification was retrieved from the imaging reports.
Reference standard to assess true disease status
Histopathology was the gold standard to ascertain true disease status for patients in whom biopsy was performed. The diagnosis of a cyst was confirmed by an eclipse sign on the CEM or by ultrasound-guided aspiration13. All histological analysis were performed according to our current national guidelines. The maximum diameter of the breast cancer diagnosed, its phenotype, grade, receptor status, and axillary lymph node status were retrieved from the pathology reports. A minimum of one year follow-up was used in the final analysis as additional reference standard for cases that were considered benign (including cysts) or negative after CEM evaluation, in order to identify any false negative imaging results. If no cancer was diagnosed after this period, the case was definitely considered benign or negative.
Statistical analysis
Diagnostic parameters including sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were compared between LE and CEM, separately for the three readers and for all readers combined. BI-RADS 1–3 lesions were considered to be negative or benign, and BI-RADS 4–5 were considered to be malignant. To account for the correlation between multiple observations within patients, the variance and 95% confidence intervals were obtained with a robust variance estimator14. A logistic random effects model was used to test the differences in sensitivity and specificity between LE and CEM for statistical significance. Receiver operating characteristics (ROC) curves and the area under the curve (AUC) with corresponding 95% confidence intervals were calculated for both LE images and the complete CEM exam. The difference between the AUC for LE images and CEM was tested for significance using an algorithm suggested by DeLong et al.15. Statistical significance was set at p values ≤ 0.05. Statistical analyses were performed using SPSS (IBM Corp. version 26) and STATA (StataCorp LLC., version 14).
Results
Patient characteristics
A detailed overview of image indications and diagnoses is presented in Table 1. Of the 162 patients who underwent CEM, 152 patients met the inclusion criteria. The ten excluded patients had received CEM to monitor tumor response during neoadjuvant chemotherapy treatment (n = 4) or had incomplete CEM data (n = 6). Most patients (95/152; 62.5%) were referred to the hospital for additional imaging after a recall from the national breast cancer screening program. Mean age was 57.9 years (± SD: 10, range 34–83 years).
The majority of the patients did not have breast cancer: 117 cases (77.0%) were considered benign, of which 74 were negative (48.7%), 31 cases were cysts (20.4%) and in twelve cases biopsy results showed a benign lesion (7.9%) (Table 1). Thirty-five patients were diagnosed with breast cancer (and/or ductal carcinoma in situ (DCIS)), and two patients with lymphoma. These two patients suffered from axillary lymph node enlargement and were considered as negative cases, since no intramammary lesion was detected with CEM. The diagnosis of lymphoma was made based on a palpable mass in the axilla. Figure 1 shows a flow chart of the patients’ diagnostic pathways.
Tumor characteristics
In the 35 breast cancer and DCIS patients, a total of 39 malignant lesions were detected. The mean tumor diameter of these 39 lesions on CEM was 14.4 mm ± 10.2 (range 4–60 mm). In four patients, two malignant lesions were detected within the same breast. Histopathology confirmed that in three of these four patients the multifocal lesions were identical to the primary index tumor concerning subtype of breast cancer and grade. The fourth patient, had a primary tumor consisting of invasive carcinoma NST with a second lesion being DCIS grade 2. The majority of malignant cases had invasive carcinoma NST (29/35; 82.9%), followed by invasive lobular carcinoma (4/35; 11.4%). Estrogen receptor (ER) status was positive in 31 of the 35 invasive malignancies (88.6%), Progesterone receptor (PR) was positive in 25 malignancies (71.4%), and Human Epidermal growth factor Receptor 2 (HER2/neu receptor) status was positive in seven malignancies (20.0%). In 19 of the 35 malignant cases (54.3%) tumor grade was I, 15 cases (42.9%) were grade II, and one case (2.9%) was grade III. An overview of tumor characteristics is provided in Table 2.
All patients underwent treatment for their (pre-) malignancy, the majority with primary surgery (n = 30) and surgery after neo-adjuvant chemotherapy (n = 2). In one case the patient did not undergo surgery and in two other cases the patients were operated elsewhere. The mean tumor diameter of the operated 34 lesions was 12.4 mm ± 6.9 (range 0–29 mm). Axillary lymph node status was pN + in seven cases, pN0 in 24 cases, and not reported or unknown in four cases.
Diagnostic accuracy
Median follow-up of CEM exams initially classified as benign/negative was 25 months (range 17–28 months). In this period, no additional breast cancer diagnoses were established in these cases, classifying them as definitely benign/negative.
CEM exams showed overall higher sensitivity and specificity than LE exams for all three readers. Mean sensitivity for all readers combined increased from 74.3% (LE) to 87.6% (CEM) (p < 0.001), while mean specificity increased from 87.8% (LE) to 94.6% (CEM) (p = 0.0146). Mean PPV and mean NPV increased from 64.5% to 82.9% and 91.9% to 96.2%, respectively. The AUC increased for all three readers, with statistically significant increases being observed in readers 1 and 2. The AUC of all readers combined increased from 0.872 for LE images to 0.957 for CEM exams (p = 0.0001). Table 3 shows the diagnostic accuracy parameters per reader and for all readers combined. Figure 2 and Table 4 provide an overview of the ROC curves and the corresponding AUC values of all readers independently and combined.
Discrepancies between readers and diagnosis
None of the 152 cases was scored false positive on CEM by all three readers. Eighteen cases were scored as false positive on CEM by one reader and one case by two readers. Two cancer diagnoses were overlooked on CEM by all readers: one case of invasive lobular carcinoma, and one case of mucinous carcinoma, with tumor diameters of 1.0 cm and 0.7 cm, respectively. This case of invasive lobular carcinoma was also missed by all readers on LE, while one reader did detect the mucinous carcinoma on LE. In clinical practice the missed mucinous carcinoma was detected on CEM by the radiologist on duty, while the missed invasive lobular carcinoma was detected by additional breast MRI. Figure 3 shows an example of a true positive case, true negative case, false positive case, and a false negative case.
Discussion
CEM is an emerging breast imaging modality, introduced in 2011 by GE Healthcare, but is currently being offered by four vendors as commercial systems2. Many studies have shown that CEM’s diagnostic accuracy is superior to FFDM, even matching the performance of breast MRI16,17,18,19. However, these results are mainly based on studies performed on GE Healthcare systems. In this study, we demonstrated that an increase in diagnostic performance can also be achieved with a Hologic system when compared to FFDM. When using Hologic CEM, sensitivity significantly increased from 74.3 to 87.6%, and specificity from 87.8 to 94.6%.
Although several reviews and meta-analyses have been published5,6,20, we believe that the one published by Zhu et al. is the most comprehensive one. In our opinion, their review provided the most extensive and up-to-date overview, including 18 studies published between 2014 and 2017. Seventeen of the 18 studies were performed on systems by GE Healthcare and only one study on a Hologic system. No studies using equipment of other vendors were included. In this review, the pooled sensitivity and specificity for CEM was 89% and 84%, respectively. The AUC value based on their summary ROC curve was 0.957. The combination of sensitivity and specificity observed in our study falls within the 95% confidence interval of the summary ROC curve given by Zhu et al.5. Based on these findings, we concluded that sensitivity and specificity for CEM on a Hologic system are in line with prior findings for sensitivity and specificity of CEM studies that were mainly performed on GE Healthcare equipment.
Reproducibility is one of the pillars in science. In clinical studies, this means that certain results obtained can be achieved again (‘reproduced’) when the study is repeated with a similar methodology but by other, independent researchers. Reproducibility is important, since no claim can be proven by a single published study. Despite the utmost efforts of investigators to perform a study to the best of their ability, reproducible results provide the scientific community the only true transparency in results, giving us confidence in the results as well. In the context of the current study, it is important to reproduce the claims of CEM superiority over FFDM mainly made on GE Healthcare equipment, as this creates confidence that the results stand even when other equipment is used.
The number of studies using Hologic CEM units is still limited, often focusing on equally important aspects of CEM, such as studies on radiation exposure or evaluation of disease extent. To the best of our knowledge, there are currently eleven studies published that assessed diagnostic accuracy18,19,21,22,23,24,25,26,27,28,29. However, these studies often stated only partial diagnostic performance, for example only sensitivity. These studies were also relatively small, with sample sizes varying from 34 to 100 patients, with two exceptions up to 208 and 326 patients. One study provided a case-by-case description of the performance of CEM26, but in the other ten studies, sensitivity of CEM ranged from 81.5 to 100%18,19,21,22,23,24,25,27,28,29, with specificity available in just three studies and ranging from 42.6 to 79.6%27,28,29. However, the differences in methodologies used in these studies hamper a robust comparison of our results with these eleven studies.
Our study has some limitations. Firstly, the dominating indication for the CEM examinations in this population was the evaluation of recalls from breast cancer screening. However, a sub analysis of these (recalled) women versus all other indications showed that there was no statistically significant difference in sensitivity and specificity (data not shown). Secondly, the follow-up period used to determine the true negative disease status of cases deemed to be benign or negative could be considered to be short. A follow-up period of at least two years is preferable, which is a commonly accepted time interval used in screening programs. However, the follow-up period used was considered the minimum time interval. Half of the cases had a follow-up period of more than two years, while the shortest follow-up period lasted 17 months. In addition, the prevalence of interval carcinomas is low (0.21%), rendering it unlikely that an extra year of follow-up would reveal a substantial number of missed cancer cases30. Consequently, prolonging follow-up will not result in significant alterations of the study conclusion.
In summary, this study showed that CEM performed on a Hologic system improved both sensitivity and specificity when compared to FFDM. Our observed diagnostic performance falls within the 95% CI of summary ROC curves of a previously published meta-analysis. Thus, it can be safely assumed that the type of CEM unit does not seem to be a decisive factor in studies that are using equipment from different vendors.
Data availability
The dataset generated and analysed during the current study are available from the corresponding author on reasonable request.
References
GE Healthcare. GE Healthcare announces FDA 510(k) clearance of SenoBright Contrast Enhanced Spectral Mammography (CESM) for breast cancer diagnosis. https://www.ge.com/news/press-releases/ge-healthcare-announces-fda-510k-clearance-senobright-contrast-enhanced-spectral (2011).
Jochelson, M. S. & Lobbes, M. B. I. Contrast-enhanced mammography: State of the art. Radiology 299, 36–48 (2021).
Sorin, V. et al. Contrast-enhanced spectral mammography in women with intermediate breast cancer risk and dense breasts. AJR. Am. J. Roentgenol. 211, W267–W274 (2018).
Sung, J. S. et al. Performance of dual-energy contrast-enhanced digital mammography for screening women at increased risk of breast cancer. Radiology 293, 81–88 (2019).
Zhu, X. et al. Diagnostic value of contrast-enhanced spectral mammography for screening breast cancer: Systematic review and meta-analysis. Clin. Breast Cancer 18, e985–e995 (2018).
Suter, M. B. et al. Diagnostic accuracy of contrast-enhanced spectral mammography for breast lesions: A systematic review and meta-analysis. Breast 53, 8–17 (2020).
Zanardo, M. et al. Technique, protocols and adverse reactions for contrast-enhanced spectral mammography (CESM): A systematic review. Insights Imaging 10, 76 (2019).
James, J. J. & Tennant, S. L. Contrast-enhanced spectral mammography (CESM). Clin. Radiol. 73, 715–723 (2018).
Patel, B. K., Lobbes, M. B. I. & Lewin, J. Contrast enhanced spectral mammography: A review. Semin. Ultrasound CT MRI 38, 70–79 (2018).
Francescone, M. A. et al. Low energy mammogram obtained in contrast-enhanced digital mammography (CEDM) is comparable to routine full-field digital mammography (FFDM). Eur. J. Radiol. 83, 1350–1355 (2014).
Lalji, U. C. et al. Evaluation of low-energy contrast-enhanced spectral mammography images by comparing them to full-field digital mammography using EUREF image quality criteria. Eur. Radiol. 25, 2813–2820 (2015).
D’Orsi, C. J., Sickles, E., Mendelson, E., Morris & E. ACR BI-RADS Atlas, Breast Imaging Reporting and Data System. (American College of Radiology, Reston, VA, 2013).
Lobbes, M. B. I. et al. Contrast-enhanced spectral mammography in patients referred from the breast cancer screening programme. Eur. Radiol. 24, 1668–1676 (2014).
Genders, T. S. S. et al. Methods for calculating sensitivity and specificity of clustered data_a tutorial. Radiology 265, 910–916 (2012).
Delong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 44, 837–845 (2016).
Fallenberg, E. M. et al. Contrast-enhanced spectral mammography vs. mammography and MRI—Clinical performance in a multi-reader evaluation. Eur. Radiol. 27, 2752–2764 (2017).
Li, L. et al. Contrast-enhanced spectral mammography (CESM) versus breast magnetic resonance imaging (MRI): A retrospective comparison in 66 breast lesions. Diagn. Interv. Imaging 98, 113–123 (2017).
Sumkin, J. H. et al. Diagnostic performance of MRI, molecular breast imaging, and contrast-enhanced mammography in women with newly diagnosed breast cancer. Radiology 293, 531–540 (2019).
Kim, E. Y. et al. Diagnostic value of contrast-enhanced digital mammography versus contrast-enhanced magnetic resonance imaging for the preoperative evaluation of breast cancer. J. Breast Cancer 21, 453–462 (2018).
Tagliafico, A. S. et al. Diagnostic performance of contrast-enhanced spectral mammography: Systematic review and meta-analysis. Breast 28, 13–19 (2016).
Zuley, M. L. et al. Contrast enhanced digital mammography (CEDM) helps to safely reduce benign breast biopsies for low to moderately suspicious soft tissue lesions. Acad. Radiol. 27, 969–976 (2020).
Youn, I. et al. Contrast enhanced digital mammography versus magnetic resonance imaging for accurate measurement of the size of breast cancer. Br. J. Radiol. 92, 20180929 (2019).
Bicchierai, G. et al. Evaluation of contrast-enhanced digital mammography (CEDM) in the preoperative staging of breast cancer: Large-scale single-center experience. Breast J. 26, 1276–1283 (2020).
Lewis, T. C. et al. Contrast-enhanced digital mammography: A single-institution experience of the first 208 cases. Breast J. 23, 67–76 (2017).
Ali-Mucheru, M. et al. Contrast-enhanced digital mammography in the surgical management of breast cancer. Ann. Surg. Oncol. 23, 649–655 (2016).
Depretto, C. et al. Contrast-enhanced mammography in the evaluation of breast calcifications: Preliminary experience. Tumori J. 106, 491–496 (2020).
Patel, B. K. et al. Clinical utility of contrast-enhanced spectral mammography as an adjunct for tomosynthesis-detected architectural distortion. Clin. Imaging 46, 44–52 (2017).
Petrillo, A. et al. Digital breast tomosynthesis and contrast-enhanced dual-energy digital mammography alone and in combination compared to 2D digital synthetized mammography and MR imaging in breast cancer detection and classification. Breast J. 26, 860–872 (2020).
Goh, Y. et al. Architecture distortion score (ADS) in malignancy risk stratification of architecture distortion on contrast-enhanced digital mammography. Eur. Radiol. 31, 2657–2666 (2021).
Netherlands comprehensive cancer organisation (IKNL). Monitor bevolkingsonderzoek borstkanker. 1–6 https://www.iknl.nl/borstkankermonitor (2020).
Funding
This study has received funding by the Netherlands Organization for Health Research and Development (ZonMw Efficiency Studies Grant Number 843001801), GE Healthcare, and the Department of Radiology and Nuclear Medicine of the Maastricht University Medical Center+.
Author information
Authors and Affiliations
Contributions
H.R., S.M., R.S. and M.L. performed the image analysis. L.N., S.T., and M.L. collected the data. Data analysis was performed by L.N. and P.N. L.N. wrote the main manuscript. All authors participated in the study design, reviewed the manuscript, and have approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
ML received several research grant and speaker’s fees from GE Healthcare, Hologic, Bayer and Guerbet. JW received institutional grants and speaker’s fees from AGFA, Bayer Healthcare, Bard Medical, GE Healthcare, Optimed, Philips Healthcare, Siemens Healthineers. The other authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Neeter, L.M.F.H., Raat, H.P.J., Meens-Koreman, S.D. et al. The diagnostic value of contrast-enhanced 2D mammography in everyday clinical use. Sci Rep 11, 22224 (2021). https://doi.org/10.1038/s41598-021-01622-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-01622-7
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.