Prediction of Treatment Response to Neoadjuvant Chemotherapy for Breast Cancer via Early Changes in Tumor Heterogeneity Captured by DCE-MRI Registration

Jahani, Nariman; Cohen, Eric; Hsieh, Meng-Kang; Weinstein, Susan P.; Pantalone, Lauren; Hylton, Nola; Newitt, David; Davatzikos, Christos; Kontos, Despina

doi:10.1038/s41598-019-48465-x

Download PDF

Article
Open access
Published: 20 August 2019

Prediction of Treatment Response to Neoadjuvant Chemotherapy for Breast Cancer via Early Changes in Tumor Heterogeneity Captured by DCE-MRI Registration

Nariman Jahani¹,
Eric Cohen¹,
Meng-Kang Hsieh¹,
Susan P. Weinstein¹,
Lauren Pantalone¹,
Nola Hylton²,
David Newitt²,
Christos Davatzikos¹ &
…
Despina Kontos ORCID: orcid.org/0000-0001-9031-5126¹

Scientific Reports volume 9, Article number: 12114 (2019) Cite this article

4959 Accesses
36 Citations
1 Altmetric
Metrics details

Subjects

Abstract

We analyzed DCE-MR images from 132 women with locally advanced breast cancer from the I-SPY1 trial to evaluate changes of intra-tumor heterogeneity for augmenting early prediction of pathologic complete response (pCR) and recurrence-free survival (RFS) after neoadjuvant chemotherapy (NAC). Utilizing image registration, voxel-wise changes including tumor deformations and changes in DCE-MRI kinetic features were computed to characterize heterogeneous changes within the tumor. Using five-fold cross-validation, logistic regression and Cox regression were performed to model pCR and RFS, respectively. The extracted imaging features were evaluated in augmenting established predictors, including functional tumor volume (FTV) and histopathologic and demographic factors, using the area under the curve (AUC) and the C-statistic as performance measures. The extracted voxel-wise features were also compared to analogous conventional aggregated features to evaluate the potential advantage of voxel-wise analysis. Voxel-wise features improved prediction of pCR (AUC = 0.78 (±0.03) vs 0.71 (±0.04), p < 0.05 and RFS (C-statistic = 0.76 ( ± 0.05), vs 0.63 ( ± 0.01)), p < 0.05, while models based on analogous aggregate imaging features did not show appreciable performance changes (p > 0.05). Furthermore, all selected voxel-wise features demonstrated significant association with outcome (p < 0.05). Thus, precise measures of voxel-wise changes in tumor heterogeneity extracted from registered DCE-MRI scans can improve early prediction of neoadjuvant treatment outcomes in locally advanced breast cancer.

Segment anything in medical images

Article Open access 22 January 2024

Jun Ma, Yuting He, … Bo Wang

Spatial transcriptomics reveals discrete tumour microenvironments and autocrine loops within ovarian cancer subclones

Article Open access 03 April 2024

Elena Denisenko, Leanne de Kock, … Alistair R. R. Forrest

Foundation model for cancer imaging biomarkers

Article Open access 15 March 2024

Suraj Pai, Dennis Bontempi, … Hugo J. W. L. Aerts

Introduction

For women with locally advanced breast cancer, longitudinal patterns of tumor response during neoadjuvant chemotherapy (NAC) can be an important marker in evaluating treatment response and likelihood for overall survival. When dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is part of the NAC protocol, in addition to assessing structural changes in tumor size and shape, it provides an opportunity to evaluate changes in enhancement patterns which reflect functional tumor properties as potential earlier indicators of treatment response^1,2,3. Towards this end, while much progress has been made, most approaches reported to date still have important limitations by either falling short of investigating the tumor longitudinally or by overlooking the finer details of the longitudinal imaging phenotype by primarily relying on aggregate measures of tumor structure and function^4,5,6,7. For example, although Hylton et al.⁶ have shown that measuring the aggregate change of functional tumor volume (FTV) during NAC can be an indicator of pathologic complete response (pCR) and long-term recurrence-free survival (RFS), FTV does not adequately capture intra-tumor heterogeneity which has increasingly been shown to be a major indicator of tumor aggressiveness and treatment resistance⁸.

As tumors are known to be temporally and spatially heterogeneous and tend to deform regionally during treatment^9,10, more precise and longitudinal quantification of phenotypic tumor heterogeneity could provide new insight for early prediction of treatment response and long-term survival. To calculate regional longitudinal changes, deformable image registration techniques can be used to match images from different imaging sessions voxel-by-voxel^11,12. However, the lack of robust image registration techniques has led to many breast cancer investigations to overlook voxel-wise approaches for capturing such heterogeneous tumor changes^12,13. Recently, a registration method based on attribute-matching¹⁴ has been developed shown to have improved accuracy compared to conventional intensity-based registration methods^15,16. Implementing an accurate image registration technique, a parametric response map (PRM)^17,18, as well as regional deformation measures^19,20,21, can provide quantitative voxel-based information regarding heterogeneous changes within the tumor during treatment.

We evaluated phenotypic changes in tumor heterogeneity, quantified with voxel-wise image registration, for augmenting early prediction of pCR and RFS after NAC for locally advanced breast cancer. The rationale is to benefit from early-treatment information, captured via a robust deformable image registration technique, in order to precisely quantify voxel-wise changes in morphologic, structural, and kinetic tumor features. We hypothesize that imaging markers capturing such early changes within the tumor can improve prediction of pCR and RFS for women diagnosed with locally advanced breast cancer, and thus providing additional information to help better guide their treatment.

Methods

Patient population and data acquisition

This study was approved by the institutional review board of University of Pennsylvania. No consent or waiver was required as data were obtained de-identified from the National Cancer Institute’s Cancer Imaging Archive²². The patient population analyzed for our study was a subset of the multicenter Investigation of Serial Studies to Predict Your Therapeutic Response with Imaging and moLecular Analysis and American College of Radiology Imaging Network 6657 trial (I-SPY 1 TRIAL/ACRIN 6657) which recruited women with T3 tumors who received anthracycline-cyclophosphamide NAC². Four MR imaging examinations were performed, including pre-treatment (first examination four weeks before the treatment), early-treatment (second examination performed at least two weeks after the first cycle of chemotherapy) and between treatments (third examination), and fourth examination performed before surgery and after completion of NAC.

The data acquisition was previously described according to the ACRIN 6657/ISPY-1 protocols². In summary, DCE-MR scans were collected from nine different centres using 1.5-T MR imaging systems with a dedicated breast radiofrequency coil was used to acquire pre- and post-contrast images at each examination. The imaging procedure included a localization scan and T2-weighted sequences followed by T1-weighted of DCE-MRI series. The T1-weighted sequence was acquired once before contrast injection and at least twice afterwards. The first two contrast-enhanced images were acquired 2.5 and 7.5 minutes after contrast injections.

Clinical, demographic, and histopathologic data including age, race and hormone receptor status (coded as a three-level categorical variable: 1. HR-positive and Her2-negative, 2. Her2-positive, 3. triple negative) were available for each patient (Table 1), as well as functional tumor volume measurements at pre-treatment (FTV₁) and early-treatment visits (FTV₂). Furthermore, the RFS outcomes were reported and measured according to the STEEP criteria²³, as the time from the first cycle of chemotherapy to disease recurrence or death. The pCR outcomes were also defined as no remaining invasive cancer in axillary lymph nodes or breast²⁴.

Table 1 Patient characteristics for the ISPY-1/ACRIN 6657 trial sample analyzed in our study.

Full size table

For our study, we focused only on the information extracted from the first two MRI examinations (i.e., at pre-treatment and early-treatment visits), as outcome prediction before the initiation or early in the course of treatment would be of particular clinical value. From the available 222 I-SPY 1/ACRIN 6657 trial participants, limiting ourselves to those with complete clinical and imaging data at pre-treatment and early-treatment visits reduced the analysis set to 142 patients; excluding an additional 10 for whom image registration could not complete (see the next section for details of registration). That resulted in a sample of 132 patients available for RFS analysis in our study. pCR information was missing for 5 participants, leaving 127 patients for pCR analysis (Fig. 1).

Image pre-processing

Three image pre-processing steps were implemented before quantitative analyses. First, a nonparametric non-uniform intensity normalization (N3) method was implemented for bias-field correction to reduce the negative impact of MR imaging artifacts. Then, histogram matching was done between images at pre- and early-treatment for more accurate registration implementation. Finally, all images from different patients were resampled to the same spatial resolution to have consistent intensity values for feature extraction.

Deformable image registration and tumor segmentation

After image pre-processing steps, we applied a deformable image registration algorithm to spatially and anatomically align the early-treatment MR images to the pre-treatment ones¹⁴ (Fig. 2). The registration algorithm is based on attribute matching and mutual-saliency weighting. This registration calculates a spatial transformation, T, mapping each voxel, x, to its image, T(x). T is computed by minimizing a cost function, E, that is a function of (1) mutual saliency, where ms(x₁, x₂) measures the dissimilarity between two voxels; and (2) attribute matching, where A(x) is a vector encoding the anatomic and geometric properties of a voxel:

$$E=\mathop{\int }\limits_{\begin{array}{c}x{\epsilon }\\ breast\\ volume\end{array}}ms(x,T(x))(\frac{1}{d})\Vert {A}_{1}(x)-{A}_{2}{(T(x))\Vert }^{2}dx$$

(1)

where d is the number of image dimensions.

The algorithm has been previously validated for longitudinal MR image registration for breast cancer and shown to be significantly more accurate compared to conventional intensity-based registration algorithms¹⁹. The rationale in our study is that this accurate matching allows for monitoring changes within the corresponding voxels between pre- and early-treatment images. Registration was, therefore, first applied to the entire breast, and then, subsequent voxel-wise image analyses were performed within the tumor region of the pre-treatment image (FTV₁). Thus, FTV₁ mask was applied to both pre-treatment and registered early-treatment images to track voxel-wise changes within the initial tumor region. A signal enhancement ratio method was used to analyze DCE-MR images and segment functional tumor volumes (FTV₁ and FTV₂)^6,25.

Voxel-wise longitudinal imaging features

Feature maps extraction

Comparing each pair of corresponding voxels extracted from the registration of pre- and early-treatment DCE-MRI scans, two groups of imaging features were computed to quantify tumor changes: (i) voxel-wise tumor deformation, and (ii) voxel-wise changes of kinetic features (PRM of kinetic features):

Voxel deformation measures: Evaluation of voxel deformation provides the opportunity to track how the tumor deforms in response to therapy by quantifying regional changes in tumor size, shape and orientation. Specifically, three independent voxel-wise measures of tumor deformation were calculated: (1) Jacobian, representing the volume expansion or contraction of each voxel computed as the ratio of the volume at early-treatment image to the corresponding volume at pre-treatment image in a given point, (2) the anisotropic deformation index (ADI) a measure of the magnitude of the anisotropic (non-shape-preserving) deformation at each voxel, and (3) the slab-rod index (SRI) a measure of the shape (orientation preference) of the anisotropic deformation (Fig. 2):

The transformation function T derived from image registration extracts information regarding voxel-wise changes in volume and shape between pre-treatment and early-treatment imaging. At a given voxel, the eigenvalues of ∇T(∇T)^T, λ₁, λ₂, and λ₃, denote the principal strains where λ₁ > λ₂ > λ₃.

Jacobian

The Jacobian, J, is the voxel-wise volume ratio between early-treatment and pre-treatment images, indicating local contraction (Jacobian < 1) or local expansion (Jacobian > 1).

$$Jacobian=\frac{{v}_{early-treatment}}{{v}_{pre-treatment}}={\lambda }_{1}{\lambda }_{2}{\lambda }_{3}$$

(2)

Anisotropy indices

However, the Jacobian is unable to capture information about the directionality and shape of local deformation. The anisotropic deformation index (ADI) and the slab-rod index (SRI)²⁶ capture two such measures.

The ADI defined at each point as

$$ADI=\sqrt{{(\frac{{\lambda }_{1}-{\lambda }_{2}}{{\lambda }_{2}})}^{2}+{(\frac{{\lambda }_{2}-{\lambda }_{3}}{{\lambda }_{3}})}^{2}}$$

(3)

It measures how much the local transformation is anisotropic (directional). The ADI ranges from 0 to ∞; when λ₁ = λ₂ = λ₃, the ADI is zero implying isotropic deformation (deformation that is equal in all directions, shape-preserving deformation), and larger ADI indicates more anisotropy (Fig. 2).

The SRI defined at each point as

$$SRI=\frac{ta{n}^{-1}({\lambda }_{3}({\lambda }_{1}-{\lambda }_{2})/{\lambda }_{2}({\lambda }_{2}-{\lambda }_{3}))}{\pi /2}$$

(4)

shows whether the voxel deforms mainly in one direction (rod-like deformation, SRI ≈ 1) or two directions (slab-like, SRI ≈ 0) (Fig. 2).

PRMs of kinetic features: Besides deformation, image registration allows for constructing voxel-wise maps of changes in enhancement patterns extracted from kinetic features in DCE-MRI, which can be a means of characterizing intra-tumor functional heterogeneity²⁷. Here, we hypothesize that such voxel-wise measures can also quantify de novo changes in tumor heterogeneity which can be early indicators of therapy resistance, and thus markers of treatment response.

During the acquisition of DCE-MRI scans (a pre-contrast image, at time point t₀, followed by two post-contrast images, taken at two different delay times after injection of the contrast agent, t₁ and t₂, respectively), signal intensity of each voxel can be recorded at each time point (I(t)). From that, four kinetic features were computed to quantify the enhancement pattern for each voxel: peak enhancement (PE), wash-in slope (WIS), wash-out slope (WOS), and signal enhancement ratio (SER).

$$PE=\mathop{{\rm{\max }}}\limits_{t={t}_{PE}}\frac{I(t)-I({t}_{0})}{I({t}_{0})}$$

(5)

$$WIS=\{\begin{array}{cc}\frac{PE}{{t}_{PE}-{t}_{0}} & if\,{t}_{PE}\ne 0\\ 0 & otherwise\end{array}$$

(6)

$$WOS=\{\begin{array}{cc}\frac{I({t}_{2})-I({t}_{1})}{{t}_{2}-{t}_{PE}} & if\,{t}_{2}\ne {t}_{PE}\\ 0 & otherwise\end{array}$$

(7)

$$SER=\frac{I({t}_{2})-I({t}_{0})}{I({t}_{1})-I({t}_{0})}$$

(8)

For a given kinetic feature F, to analyze the voxel-wise change in F between the pre-treatment and early-treatment visits, we constructed the parametric response map (PRM) for F. Given the transformation T between pre-treatment voxels and their corresponding voxels in the early-treatment image, the PRM (of F) at any voxel x is defined as

$$PRM(x)=J\times {F}_{early-treatment}(T(x))-{F}_{pre-treatment}(x)$$

(9)

J here is the Jacobian, the proportional volume change at x between visits, which scales the value in cases when a voxel in one image corresponds to a larger or smaller volume in the other image.

Heterogeneity indices of the imaging features

Based on prior research^17,18, within the FTV₁ of each tumor, we calculated feature values as the fraction of voxels for each corresponding Jacobian and PRM of kinetic features whose value increased between pre- and early treatment visit (i.e., number of voxels with positive value/total number of voxels).

For the ADI and SRI features, since anisotropic deformation indicates a single relative measure between each corresponding voxels, we calculated the entropy of the corresponding ADI and SRI feature maps to specifically quantify the heterogeneity of the tumor deformation²⁸.

$$Entropy=\mathop{\sum }\limits_{i=1}^{N}\,P(i)lo{g}_{2}P(i)$$

(10)

where N is the number of values the measure takes over all voxels in the tumor, and P(i) is the probability that the feature will be equal to level i at any given voxel. (When ADI or SRI is equally likely to take every value that it takes over the image, entropy is low; when it takes some values frequently and others infrequently, entropy is high).

These computations resulted in a total of seven measures for each tumor, namely the proportion of increasing voxels for each of the Jacobian, PE, SER, WIS, and WOS features, and the entropy of the ADI and SRI measures (Fig. 3).

Analogous aggregate longitudinal features

To compare the performance of the proposed voxel-wise imaging features with currently established DCE-MRI measures, we calculated analogous, longitudinal tumor-wide aggregate features (i.e., mean values within FTVs). As an aggregate analogue to the Jacobian, we calculated the tumor-wide change in the entire volume (FTV₂/FTV₁). For each kinetic feature, the corresponding aggregate feature was calculated as the change in its average value:

$${{\rm{\Delta }}}_{f}=({f}_{early-treatment}-{f}_{pre-treatment})/{f}_{early-treatment}$$

(11)

where f is the average value of the feature (PE, WIS, WOS, SER) over the whole tumor. Aggregate features for the pre-treatment and early-treatment images were calculated over FTV₁ and FTV₂, respectively. This resulted in five imaging features for aggregate analyses. No corresponding aggregate features were calculated for ADI and SRI, as these measure voxel-wise orientational changes captured specifically and only by image registration.

Statistical analysis

First, a baseline model including the established covariates of age, race, hormone receptor status, and tumor volume (in this case, FTV₂) was built and tested. Then, features extracted from both voxel-wise and aggregate measures were tested as additions to this baseline model. Logistic regression was performed to assess the strength of associations of features with pCR, where the area under the receiver-operating-characteristic curve (AUC) was used to assess model performance. Cox proportional hazard modeling was used for time-to-event analysis to assess the strength of association of features with RFS, where the C-statistic was used as a measure of predictive performance²⁹.

For both pCR and RFS, five-fold cross-validation was performed, where the best model for each cross-validation loop was selected in two steps: first, using only the training set, each of the seven voxel-wise imaging features (five imaging features for the aggregate analysis) was evaluated as a univariable addition to the baseline model, and features were ranked based on their performance (AUC for pCR and C-statistic for RFS). Then, best subset model selection was used where seven (again, five for aggregate models) models were built and evaluated: one with the single best feature, one with the two best features, and so on, where the Akaike information criterion (AIC) was used to choose the best multivariable model from these seven (or five) models. Finally, the selected model was applied to the unseen test set, and the AUC or C-statistic was calculated. Averaging over all five cross-validation loops, the mean AUC or C-statistic was used as the final, cross-validated, measure of model performance.

To estimate the odds ratios or hazard ratios for each model (voxel-wise or aggregate), the features selected in more than 80% of the cross-validation loops (4 or 5) were then used in multivariable models fitted to the full dataset. Using the likelihood ratio test, the proposed voxel-wise and aggregate models were compared with the baseline model to assess their added value as covariates. Furthermore, RFS analysis was evaluated via Kaplan-Meier plots and survival ratios derived from hazard as predicted from the model — a participant’s risk signature — dichotomized at the median into high- and low-risk groups. For a given model, the risk signature of each participant was defined as that participant’s values of the covariates in the model (age, race, hormone receptor status, FTV₂, and selected imaging features) weighted by the corresponding coefficients of those covariates in the model, to arrive at a predicted risk score^30,31. The p-value of 0.05 cutoff was used to determine statistical significance throughout. Statistical analysis was conducted using R (R version 3.3.2, R Foundation for Statistical Computing, Vienna, Austria).

Results

Patient population

Of 132 participants used in our study for RFS analysis, 39 had an event (recurrence or death), over a median follow-up time of 3.62 years. Of the 127 participants for whom the pCR outcome was recorded, 38 experienced pCR (Table 1).

Pathologic complete response

The baseline model for pCR had a mean cross-validated AUC = 0.71 (Supporting Information Table S1). Hormone receptor status had a statistically significant association with pCR in this multivariable model (odds ratio: 2.06, p < 0.05) whereas FTV₂ and other clinical variables had no statistically significant associations with treatment response (p > 0.05). Adding the voxel-wise features to the baseline features and using the best models derived as described above (Supplementary Table S2), the performance of the baseline model was improved significantly (p < 0.05), resulting in mean cross-validated AUC of 0.78 (Table E3). The voxel deformation features (Jacobian, ADI and SRI) were selected in all five folds, while the PRM features, PRM_WOS and PRM_PE were selected twice and once, respectively (Supplementary Table S3 for selected features in each fold). In the aggregate-measures models, although some features showed consistency in being selected among training sets (e.g., FTV₂/FTV₁ and ∆_WIS were selected in four out of five folds), no model demonstrated improvement in performance (AUC = 0.71, p > 0.05). A model based on only the voxel-wise features showed mean cross-validated AUC of 0.74, demonstrating better performance than the baseline and aggregate models, despite not incorporating standard baseline covariates such as FTV₂ and hormone receptor status. Fitting the multivariable model to the full dataset, all three selected voxel-wise imaging features showed statistically significant associations with pCR (p < 0.05), while FTV₂ and other aggregate features had no statistically significant associations with pCR (Table 2). Furthermore, the model augmented with voxel-wise features showed a statistically significant improvement over the baseline model, as determined by the likelihood ratio test (p < 0.001) while the proposed features in the aggregate model did not (p = 0.14).

Table 2 Statistical analysis of voxel-wise versus aggregate features in multivariable pCR models.

Full size table

Recurrence-free survival

The baseline model gave a mean cross-validated C-statistic of 0.63 (Supplementary Table S1). Fitting this model to the full dataset showed a statistically significant association of FTV₂ with RFS (hazard ratio: 1.81, p < 0.001) while age, race, and hormone receptor status did not show associations with RFS (Supplementary Table S1). When adding the voxel-wise features to the baseline model (Supplementary Table S4), PRM_PE and PRM_WIS were selected in all five folds, and Jacobian and SRI were selected in 4 folds (Supplementary Table S5). Voxel-wise models performed significantly better than the baseline model (p < 0.05), with a mean cross-validated C-statistic of 0.76 (Supplementary Table S5). Furthermore, a model based on only the voxel-wise features gave a mean cross-validated C-statistic of 0.73, showing better performance than the baseline model even without predictors such as FTV₂ and hormone receptor status. The aggregate-feature models had even lower performance with a mean cross-validated C-statistic of 0.61 (Supplementary Table S5).

Building a Cox model on the full dataset, including the baseline features, and the selected voxel-wise features PRM_PE, PRM_WIS, Jacobian, and SRI (these features were included in four or five of the cross-validation runs), all the voxel-wise features showed statistically significant associations with RFS (Table 3). In contrast, in an analogous model, none of the aggregate imaging features had a statistically significant association with RFS (Table 3). As was true for pCR modeling, the likelihood ratio test showed that augmenting the model with voxel-wise features resulted in a statistically significant improvement over the baseline model, (p < 0.001) while adding the aggregate features did not (p = 0.23).

Table 3 Statistical analysis of voxel-wise versus aggregate features in multivariable RFS models.

Full size table

Finally, splitting the patients into low and high-risk groups based on their median risk score signature (Fig. 4) gave a significantly higher ratio (i.e., greater separation) of survival probabilities between high-risk and low-risk patients, when modelling was performed via the final multivariable voxel-wise imaging signature (log-rank p < 0.001) rather than using the corresponding selected aggregate features (log-rank p = 0.51). Furthermore, when combining the voxel-wise features with the baseline predictors, the performance improved significantly (ratio at median survival time = 1.55, log-rank p < 0.001) compared to the performance of the baseline predictors alone (ratio at median survival time = 1.11, log-rank p = 0.032).

Voxel-wise versus aggregate representations

To better understand the differences between voxel-wise and aggregate representations, Fig. 5 demonstrates feature maps for a few representative patients, showing how patients with similar aggregate feature values may have different heterogeneity due to different voxel-wise distributions. Supplementary Figs S1 and S2 show the distributions of proposed imaging features according to the treatment outcomes. All feature values except the entropy of ADI for pCR analysis indicated distinct distribution (p < 0.05). Furthermore, Supplementary Table S6 summarizes how to interpret the proposed voxel-wise imaging features to improve the prediction of pCR and RFS.

Discussion

The importance of early-treatment response assessment in optimizing patient care and treatment adjustment have been proven^32,33. Our study suggests that voxel-wise longitudinal analyses of DCE-MR images can quantify heterogeneous changes within the tumor as an indicator of therapy response and improve prediction of RFS and pCR, compared to conventional tumor volume and aggregate kinetic measures, as early as the first treatment time point in NAC. Importantly, the proposed voxel-wise features provide information independent of conventional predictive covariates such as age, race, hormone receptor status, and tumor volume.

Using registration, we extracted two types of feature maps from the longitudinal data: voxel-wise deformation, and PRMs of kinetic features. The anisotropy indices (ADI and SRI) in combination with the Jacobian, provide a complete descriptor of local tumor deformation²⁶, which can capture heterogeneous changes within tumor transformations³⁴. Features based on the PRMs of kinetic features are also important in capturing functional tumor heterogeneity regarding changes in enchantment patterns to augment models of pCR and RFS. It should be noted that the consistent selection of the voxel-wise features in most training folds (80% of folds) of the cross-validation suggests that they were robust across training sets. The combination of techniques in our study —robust registration; use of voxel-wise measures; use of deformation measures and PRMs of kinetic features — provide statistically significant improvements over previous similar analyses with conventional tumor volume measures and aggregate kinetic features in predicting RFS and pCR^2,6.

Although recent investigations for pCR prediction attempted to characterize tumor heterogeneity during chemotherapy, quantification of heterogeneity was performed separately at different time points³⁵ without the incorporation of image registration, and relative changes were measured by averaging corresponding feature values over time¹³. Cho et al. evaluated the PRM of signal intensity during chemotherapy to predict pCR but in sub-volumes rather than voxels³⁶. Our results suggest that using longitudinal voxel-wise markers, even without tumor volume, can outperform conventional approaches for the prediction of both RFS and pCR.

There were some limitaions in our study, one was the relatively small sample size of the patients (132 participants for RFS, 127 for pCR) with a low number of events (39 for RFS and 38 for pCR). We, therefore, limited our evaluation to a single first-order feature value for each type of our measures (i.e., percent of voxels with relative increase between pre- and early- treatment scans and entropy of anisotropic deformations) to avoid overfitting and used five-fold cross-validation to get a preliminary estimate of the generalizability of our findings. In addition, although we showed significant improvement in predicting pCR and RFS by extracting voxel-wise temporal feature changes, when the I-SPY 1 TRIAL was conducted (i.e., from May 2002 to March 2006) temporal resolutions were set to 2.5 and 7.5 minutes for post-contrast images. As DCE-MRI was still relatively in its early stages, these temporal resolutions were considered standard of care, especially when considering the multi-institutional setting of I-SPY 1 and the need to standardize acquisitions across sites. However, recent advances in MRI techniques provide significantly higher temporal resolution, and according to recommendations from EUSOMA for breast imaging, the minimum temporal resolution should be less than 2 minutes³⁷. It has been shown that the most informative feature values for tumor characterization should be available at 2 minutes or less after the injection of contrast agent³⁸. Thus, the delayed phase of post-contrast images in this study may not fully utilize the most valuable feature information in predicting pCR and RFS. We hypothesize that using the proposed feature maps with more current, advanced MRI techniques would enhance the prediction of RFS and pCR even further.

Furthermore, since neoadjuvant trastuzumab was not used as standard therapy until 2005, most patients with HER2+ were only under neoadjuvant chemotherapy in this study; there were only a few (n = 16) that got trastuzumab but those were excluded from the original I-SPY 1 trial imaging analysis for consistency⁶, which we also did for the purposes of our study. However, currently, patients with HER2+ usually also use targeted therapy drugs including trastuzumab (Herceptin) and pertuzumab (Perjeta) which improve pathologic complete response and overall survival when added to chemotherapy³⁹. Therefore, it would be necessary to investigate the performance of the imaging signatures proposed here on the outcomes for HER2+ patients who have received the targeted therapies in addition to neoadjuvant chemotherapy.

To address above limitations, we plan to perform such an evaluation when the imaging data from a larger independent validation data set become available. The current work can also be extended by applying these analyses to longitudinal images at additional mid- and late- treatment time points to better characterize heterogeneous tumor responses and the effects of treatment over time. Also, combining these first-order voxel-wise deformations and PRMs of kinetic features with second- and third order voxel-wise imaging features, such as texture and shape-based features, may provide even more predictive signatures in treatment response assessment^30,40.

In conclusion, we demonstrated that evaluation of voxel-wise changes in longitudinal analyses of DCE-MR images can reveal valuable phenotypic tumor heterogeneity markers to significantly improve early therapy response prediction compared to conventional tumor volume and aggregate kinetic measures, as early as the first treatment time point. Such phenotypic markers can be derived from imaging that is the current standard of care in neoadjuvant chemotherapy response assessment, and thus potentially provide valuable information with no additional invasive procedures to better tailor treatment selection for individual patients.

References

Fangberget, A. et al. Neoadjuvant chemotherapy in breast cancer-response evaluation and prediction of response to treatment using dynamic contrast-enhanced and diffusion-weighted MR imaging. Eur. Radiol. 21, 1188–1199 (2011).
Article CAS Google Scholar
Hylton, N. M. et al. Locally Advanced Breast Cancer: MR Imaging for Prediction of Response to Neoadjuvant Chemotherapy—Results from ACRIN 6657/I-SPY TRIAL. Radiology 263, 663–672 (2012).
Article Google Scholar
Teruel, J. R. et al. Dynamic contrast-enhanced MRI texture analysis for pretreatment prediction of clinical and pathological response to neoadjuvant chemotherapy in patients with locally advanced breast cancer. NMR Biomed. 27, 887–896 (2014).
Article Google Scholar
Wu, J. et al. Intratumoral Spatial Heterogeneity at Perfusion MR Imaging Predicts Recurrence-free Survival in Locally Advanced Breast Cancer Treated with Neoadjuvant Chemotherapy. Radiology 172462, https://doi.org/10.1148/radiol.2018172462 (2018).
Article Google Scholar
Kim, J.-H. et al. Breast Cancer Heterogeneity: MR Imaging Texture Analysis and Survival Outcomes. Radiology 282, 665–675 (2016).
Article Google Scholar
Hylton, N. M. et al. Neoadjuvant Chemotherapy for Breast Cancer: Functional Tumor Volume by MR Imaging Predicts Recurrence-free Survival—Results from the ACRIN 6657/CALGB 150007 I-SPY 1 TRIAL. Radiology 279, 44–55 (2015).
Article Google Scholar
Braman, N. M. et al. Intratumoral and peritumoral radiomics for the pretreatment prediction of pathological complete response to neoadjuvant chemotherapy based on breast DCE-MRI. Breast Cancer Res. 19, 57 (2017).
Article Google Scholar
Beca, F. & Polyak, K. Intratumor Heterogeneity in Breast Cancer. In Novel Biomarkers in the Continuum of Breast Cancer (ed. Stearns, V.) 169–189, https://doi.org/10.1007/978-3-319-22909-6_7 (Springer International Publishing, 2016).
Google Scholar
Li, X. et al. DCE-MRI analysis methods for predicting the response of breast cancer to neoadjuvant chemotherapy: Pilot study findings. Magn. Reson. Med. 71, 1592–1602 (2014).
Article Google Scholar
O’Connor, J. P. B. et al. Imaging Intratumor Heterogeneity: Role in Therapy Response, Resistance, and Clinical Outcome. Clin. Cancer Res. 21, 249–257 (2015).
Article Google Scholar
Sotiras, A., Davatzikos, C. & Paragios, N. Deformable Medical Image Registration: A Survey. IEEE Trans. Med. Imaging 32, 1153–1190 (2013).
Article Google Scholar
Li, X. et al. A nonrigid registration algorithm for longitudinal breast MR images and the analysis of breast tumor response. Magn. Reson. Imaging 27, 1258–1270 (2009).
Article ADS Google Scholar
Parikh, J. et al. Changes in Primary Breast Cancer Heterogeneity May Augment Midtreatment MR Imaging Assessment of Response to Neoadjuvant Chemotherapy. Radiology 272, 100–112 (2014).
Article Google Scholar
Ou, Y., Sotiras, A., Paragios, N. & Davatzikos, C. DRAMMS: Deformable registration via attribute matching and mutual-saliency weighting. Med. Image Anal. 15, 622–639 (2011).
Article Google Scholar
Rueckert, D. et al. Nonrigid registration using free-form deformations: application to breast MR images. IEEE Trans. Med. Imaging 18, 712–721 (1999).
Article CAS Google Scholar
Vercauteren, T., Pennec, X., Perchant, A. & Ayache, N. Diffeomorphic demons: Efficient non-parametric image registration. NeuroImage 45, S61–S72 (2009).
Article Google Scholar
Galbán, C. J. et al. Computed tomography–based biomarker provides unique signature for diagnosis of COPD phenotypes and disease progression. Nat. Med. 18, nm.2971 (2012).
Article Google Scholar
Galbán, C. J. et al. The parametric response map is an imaging biomarker for early cancer treatment outcome. Nat. Med. 15, 572–576 (2009).
Article Google Scholar
Ou, Y. et al. Deformable registration for quantifying longitudinal tumor changes during neoadjuvant chemotherapy. Magn. Reson. Med. 73, 2343–2356 (2015).
Article Google Scholar
Li, X. et al. Early DCE-MRI Changes after Longitudinal Registration May Predict Breast Cancer Response to Neoadjuvant Chemotherapy. In Biomedical Image Registration 229–235, https://doi.org/10.1007/978-3-642-31340-0_24 (Springer, Berlin, Heidelberg, 2012).
Chapter Google Scholar
Hurtado, D. E. et al. Spatial patterns and frequency distributions of regional deformation in the healthy human lung. Biomech. Model. Mechanobiol. 16, 1413–1423 (2017).
Article Google Scholar
Clark, K. et al. The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. J. Digit. Imaging 26, 1045–1057 (2013).
Article Google Scholar
Hudis, C. A. et al. Proposal for Standardized Definitions for Efficacy End Points in Adjuvant Breast Cancer Trials: The STEEP System. J. Clin. Oncol. 25, 2127–2132 (2007).
Article Google Scholar
Esserman, L. J. et al. Pathologic Complete Response Predicts Recurrence-Free Survival More Effectively by Cancer Subset: Results From the I-SPY 1 TRIAL—CALGB 150007/150012, ACRIN 6657. J. Clin. Oncol. 30, 3242–3249 (2012).
Article Google Scholar
Hylton, N. M. Vascularity assessment of breast lesions with gadolinium-enhanced MR imaging. Magn. Reson. Imaging Clin. N. Am. 7(x), 411–20 (1999).
CAS PubMed Google Scholar
Amelon, R. et al. Three-dimensional characterization of regional lung deformation. J. Biomech. 44, 2489–2495 (2011).
Article Google Scholar
Ashraf, A. et al. Breast DCE-MRI Kinetic Heterogeneity Tumor Markers: Preliminary Associations With Neoadjuvant Chemotherapy Response. Transl. Oncol. 8, 154–162 (2015).
Article Google Scholar
Dercle, L. et al. Limits of radiomic-based entropy as a surrogate of tumor heterogeneity: ROI-area, acquisition protocol and tissue site exert substantial influence. Sci. Rep. 7, 7952 (2017).
Article ADS Google Scholar
Uno, H., Cai, T., Pencina, M. J., D’Agostino, R. B. & Wei, L. J. On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Stat. Med. 30, 1105–1117 (2011).
MathSciNet PubMed PubMed Central Google Scholar
Aerts, H. J. W. L. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 5 (2014).
Coroller, T. P. et al. CT-based radiomic signature predicts distant metastasis in lung adenocarcinoma. Radiother. Oncol. 114, 345–350 (2015).
Article Google Scholar
Rousseau, C. et al. Monitoring of Early Response to Neoadjuvant Chemotherapy in Stage II and III Breast Cancer by [18F]Fluorodeoxyglucose Positron Emission Tomography. J. Clin. Oncol. 24, 5366–5372 (2006).
Article Google Scholar
Sharma, U., Danishad, K. K. A., Seenu, V. & Jagannathan, N. R. Longitudinal study of the assessment by MRI and diffusion-weighted imaging of tumor response in patients with locally advanced breast cancer undergoing neoadjuvant chemotherapy. NMR Biomed. 22, 104–113 (2009).
Article Google Scholar
Jahani, N. et al. Assessment of regional ventilation and deformation using 4D-CT imaging for healthy human lungs during tidal breathing. J. Appl. Physiol. 119, 1064–1074 (2015).
Article ADS CAS Google Scholar
Ahmed, A., Gibbs, P., Pickles, M. & Turnbull, L. Texture analysis in assessment and prediction of chemotherapy response in breast cancer. J. Magn. Reson. Imaging 38, 89–101 (2013).
Article Google Scholar
Cho, N. et al. Breast Cancer: Early Prediction of Response to Neoadjuvant Chemotherapy Using Parametric Response Maps for MR Imaging. Radiology 272, 385–396 (2014).
Article Google Scholar
Sardanelli, F. et al. Magnetic resonance imaging of the breast: Recommendations from the EUSOMA working group. Eur. J. Cancer 46, 1296–1316 (2010).
Article Google Scholar
Macura, K. J., Ouwerkerk, R., Jacobs, M. A. & Bluemke, D. A. Patterns of Enhancement on Breast MR Images: Interpretation and Imaging Pitfalls. RadioGraphics 26, 1719–1734 (2006).
Article Google Scholar
Gianni, L. et al. Neoadjuvant chemotherapy with trastuzumab followed by adjuvant trastuzumab versus neoadjuvant chemotherapy alone, in patients with HER2-positive locally advanced breast cancer (the NOAH trial): a randomised controlled superiority trial with a parallel HER2-negative cohort. The Lancet 375, 377–384 (2010).
Article CAS Google Scholar
Kickingereder, P. et al. Radiomic Profiling of Glioblastoma: Identifying an Imaging Predictor of Patient Survival with Improved Performance over Established Clinical and Radiologic Risk Models. Radiology 280, 880–889 (2016).
Article Google Scholar

Download references

Acknowledgements

This work was supported by the NIH grant: 1R01CA197000-01A1.

Author information

Authors and Affiliations

Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
Nariman Jahani, Eric Cohen, Meng-Kang Hsieh, Susan P. Weinstein, Lauren Pantalone, Christos Davatzikos & Despina Kontos
Department of Radiology and Biomedical Imaging, University of California San Francisco, San Francisco, CA, 94115, USA
Nola Hylton & David Newitt

Authors

Nariman Jahani
View author publications
You can also search for this author in PubMed Google Scholar
Eric Cohen
View author publications
You can also search for this author in PubMed Google Scholar
Meng-Kang Hsieh
View author publications
You can also search for this author in PubMed Google Scholar
Susan P. Weinstein
View author publications
You can also search for this author in PubMed Google Scholar
Lauren Pantalone
View author publications
You can also search for this author in PubMed Google Scholar
Nola Hylton
View author publications
You can also search for this author in PubMed Google Scholar
David Newitt
View author publications
You can also search for this author in PubMed Google Scholar
Christos Davatzikos
View author publications
You can also search for this author in PubMed Google Scholar
Despina Kontos
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.J. participated in the literature review, conception and design of the study, analysis of the data, statistical analysis, interpretation of the results, writing the manuscript. E.C. participated in statistical analysis and revising the manuscript. M.-K.H. participated in the analysis of data and the acquisition of data. L.P. participated in the acquisition of data. S.W. participated in the design of the study and revising the manuscript. N.H. participated in the design of the study. D.N. participated in the design of the study and revising the manuscript. C.D. participated in the design of the study, interpretation of the data and revising the manuscript. D.K. coordinated the study and participated in the literature review, design of the study, writing the manuscript. All authors reviewed and approved the final version of the manuscript.

Corresponding author

Correspondence to Despina Kontos.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information_sr

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jahani, N., Cohen, E., Hsieh, MK. et al. Prediction of Treatment Response to Neoadjuvant Chemotherapy for Breast Cancer via Early Changes in Tumor Heterogeneity Captured by DCE-MRI Registration. Sci Rep 9, 12114 (2019). https://doi.org/10.1038/s41598-019-48465-x

Download citation

Received: 25 April 2019
Accepted: 05 August 2019
Published: 20 August 2019
DOI: https://doi.org/10.1038/s41598-019-48465-x

This article is cited by

Radiomics and artificial intelligence in breast imaging: a survey
- Tianyu Zhang
- Tao Tan
- Ritse M. Mann
Artificial Intelligence Review (2023)
Early prediction of neoadjuvant chemotherapy response by exploiting a transfer learning approach on breast DCE-MRIs
- Maria Colomba Comes
- Annarita Fanizzi
- Raffaella Massafra
Scientific Reports (2021)
Dynamic contrast-enhanced breast MRI features correlate with invasive breast cancer angiogenesis
- Jennifer Xiao
- Habib Rahbar
- Savannah C. Partridge
npj Breast Cancer (2021)
A comparative study of the value of amide proton transfer-weighted imaging and diffusion kurtosis imaging in the diagnosis and evaluation of breast cancer
- Nan Meng
- Xuejia Wang
- Meiyun Wang
European Radiology (2021)
Functional 4-D clustering for characterizing intratumor heterogeneity in dynamic imaging: evaluation in FDG PET as a prognostic biomarker for breast cancer
- Rhea Chitalia
- Varsha Viswanath
- Despina Kontos
European Journal of Nuclear Medicine and Molecular Imaging (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Methods

Patient population and data acquisition

Image pre-processing

Deformable image registration and tumor segmentation

Voxel-wise longitudinal imaging features

Feature maps extraction

Jacobian

Anisotropy indices

Heterogeneity indices of the imaging features

Analogous aggregate longitudinal features

Statistical analysis

Results

Patient population

Pathologic complete response

Recurrence-free survival

Voxel-wise versus aggregate representations

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links