Integrated radiomic framework for breast cancer and tumor biology using advanced machine learning and multiparametric MRI

Parekh, Vishwa S.; Jacobs, Michael A.

doi:10.1038/s41523-017-0045-3

Download PDF

Article
Open access
Published: 14 November 2017

Integrated radiomic framework for breast cancer and tumor biology using advanced machine learning and multiparametric MRI

npj Breast Cancer volume 3, Article number: 43 (2017) Cite this article

5582 Accesses
119 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Radiomics deals with the high throughput extraction of quantitative textural information from radiological images that not visually perceivable by radiologists. However, the biological correlation between radiomic features and different tissues of interest has not been established. To that end, we present the radiomic feature mapping framework to generate radiomic MRI texture image representations called the radiomic feature maps (RFM) and correlate the RFMs with quantitative texture values, breast tissue biology using quantitative MRI and classify benign from malignant tumors. We tested our radiomic feature mapping framework on a retrospective cohort of 124 patients (26 benign and 98 malignant) who underwent multiparametric breast MR imaging at 3 T. The MRI parameters used were T1-weighted imaging, T2-weighted imaging, dynamic contrast enhanced MRI (DCE-MRI) and diffusion weighted imaging (DWI). The RFMs were computed by convolving MRI images with statistical filters based on first order statistics and gray level co-occurrence matrix features. Malignant lesions demonstrated significantly higher entropy on both post contrast DCE-MRI (Benign-DCE entropy: 5.72 ± 0.12, Malignant-DCE entropy: 6.29 ± 0.06, p = 0.0002) and apparent diffusion coefficient (ADC) maps as compared to benign lesions (Benign-ADC entropy: 5.65 ± 0.15, Malignant ADC entropy: 6.20 ± 0.07, p = 0.002). There was no significant difference between glandular tissue entropy values in the two groups. Furthermore, the RFMs from DCE-MRI and DWI demonstrated significantly different RFM curves for benign and malignant lesions indicating their correlation to tumor vascular and cellular heterogeneity respectively. There were significant differences in the quantitative MRI metrics of ADC and perfusion. The multiview IsoSVM model classified benign and malignant breast tumors with sensitivity and specificity of 93 and 85%, respectively, with an AUC of 0.91.

Segment anything in medical images

Article Open access 22 January 2024

Jun Ma, Yuting He, … Bo Wang

Towards a general-purpose foundation model for computational pathology

Article 19 March 2024

Richard J. Chen, Tong Ding, … Faisal Mahmood

Microenvironmental reorganization in brain tumors following radiotherapy and recurrence revealed by hyperplexed immunofluorescence imaging

Article Open access 15 April 2024

Spencer S. Watson, Benoit Duc, … Johanna A. Joyce

Introduction

Radiomics is an emerging field which deals with the high throughput extraction of quantitative textural features from radiological images.^1,2,3,4 The central hypothesis of radiomics is that by examining the textural features in medical images, it is possible to decode tissue characteristics and pathology. The current radiomic methods extract information about the gray-scale patterns, inter-pixel relationships and shape based properties of the region of interest (ROI).^{5,6,7,8,9,10,11} Multiple studies have used radiomic analysis for differentiating between benign and malignant breast tumors,^{12,13,14,15,16,17,18,19,20} associating radiomic features with histological types of invasive breast cancer²¹ and predicting chemotherapy response in breast cancer patients.^22,23 In addition, radiomic analysis has been applied to different tissue pathologies such as lung, prostate cancer, and liver cancer, and recently reviewed.⁴ Most of the studies in radiomics are focused on extraction of a single quantitative texture value corresponding to all the voxels within the tumor. As a result, visualization or interpretation of tumor heterogeneity or the correlation between tissue biology of the tumor and the surrounding normal tissue has not been explored. The unique ability of multiparametric magnetic resonance imaging (mpMRI) to better characterize tissue parameters provides us with an opportunity to investigate the correlation between tissue biology and quantitative radiomic metrics. Multiparametric magnetic resonance imaging (MRI) of breast involves acquisition of advanced functional MRI parameters of dynamic contrast enhanced-MRI (DCE-MRI) and diffusion weighted imaging (DWI). In DCE-MRI, a time series acquisition of T1-weighted MRI scans results in time intensity curves corresponding to different tissue types. Moreover, tissue vascularity can be evaluated using pharmacokinetic modeling (PK) of the DCE. Similarly, radiomic analysis of the PK images would produce textural evolution curves which provide information about the underlying vascular “texture” heterogeneity corresponding to different tissue types. Similarly, radiomic analysis applied to the apparent diffusion coefficient (ADC) map obtained from DWI investigates the underlying cellular heterogeneity of the tissue of interest.

To that end, we propose a radiomic feature mapping framework which transforms MRI images into radiomic feature maps for visualization and analysis of textural information present in the images. The radiomic feature maps (RFMs) highlight unique textural information such as contrast, uniformity, heterogeneity, etc. about the radiological images. This information can be correlated with quantitative texture values and quantitative MRI metrics. The motivation behind the development of radiomic feature mapping is to empower the radiologists with the ability to “see” the hidden textural information present in the radiological images and correlate it with tissue biology. Furthermore, we evaluate the textural information of the normal tissue (glandular), benign, and malignant tumors and correlate this textural information with the corresponding vascular and cellular properties of these tissue types. Finally, we analyzed the diagnostic capabilities of RFM features for prediction of clinical diagnosis as benign or malignant using a new multi-view feature embedding and classification model.

Results

Experimental summary

The radiomic feature maps were computed and analyzed for one hundred and twenty-four women with breast lesions that underwent mpMRI scan. The mean age of the patients was 52 years (range: 24–80 years). Ninety-eight women (79%) had malignant lesions and twenty-six women (21%) had benign lesions. Figures 1 and 2 illustrates typical entropy feature maps corresponding to a benign and a malignant patient. The radiomic features were extracted using our radiomic method that creates whole breast texture images of each feature. The overview of the radiomic feature mapping procedure for classification of a multiparametric radiological dataset as benign or malignant is illustrated in Fig. 3. A total of 690 RFMs were generated for the twenty-three image multiparametric MRI dataset of each patient. Regions of interest were defined on each tissue type using the Eigen filter segmentation method and the MR radiomic feature maps were computed for different breast tissue. Finally, ROI size (area in cm²), ADC value and PK-DCE parameters for different regions of interest in each of these patients were obtained. There was no significant difference in the tumor size between benign and malignant patient groups (Benign size: 3.24 ± 2.67 cm², Malignant size: 2.44 ± 0.31 cm², p = 0.77).

Table 1 summarizes the entropy values corresponding to the different regions of interest from the DCE-MRI and ADC map. Malignant lesions demonstrated significantly higher entropy on both post contrast DCE-MRI and ADC maps as compared to benign lesions (Benign DCE entropy: 5.72 ± 0.12, Malignant DCE entropy: 6.29 ± 0.06, p = 0.0002; Benign ADC entropy: 5.65 ± 0.15, Malignant ADC entropy: 6.20 ± 0.07, p = 0.002). There was no significant difference in the glandular tissue entropy values between the two groups (Benign DCE entropy: 6.08 ± 0.10 Malignant DCE entropy: 5.91 ± 0.05, p = 0.16; Benign ADC entropy: 6.06 ± 0.32 Malignant ADC entropy: 6.06 ± 0.19, p = 1.00).

Table 1 Summary of radiomic feature values and quantitative MpMRI metrics

Full size table

Analysis of textural evolution curves

Figure 4 illustrates the textural evolution curves corresponding to the different radiomic features obtained from DCE MRI. Figure 4a and 4b exhibit the textural evolution curves of the normalized mean values obtained from the entropy feature maps (top) corresponding to tumor and glandular tissue respectively. The error bars in the textural evolution curves represent the standard error for the normalized mean of the entropy values. In Fig. 4b, there is no change in the texture for glandular tissue for both benign and malignant patients. However, the shapes of textural evolution curves were significantly different between benign and malignant lesions illustrating the difference in contrast uptake within benign and malignant lesions. The normalized entropy values during the wash-in phase were significantly (p < 0.05) higher for malignant than for benign lesions depicting a rapid textural enhancement for malignant lesions. Similarly, the normalized entropy values during the washout phase were significantly lower for malignant than for benign lesions depicting a rapid textural washout for malignant lesions. Moreover, similar trends were observed in the textural evolution curves obtained from range feature maps as illustrated in Fig. 4 (Bottom). Preliminary analysis of textural evolution curves from entropy feature maps based on the time to peak and the textural wash out slope is shown below.

a.
Time to peak: The average time to peak for benign lesions (2.21 ± 0.16 mins) was significantly longer (p = 0.0003) than for malignant lesions (1.24 ± 0.07 mins).
b.
Textural wash out slope: The slope of the textural washout curves was also significantly different (p = 0.001) between benign (0.001 ± 0.001) and malignant lesions (−0.002 ± 0.0003).

Analysis of textural evolution metric on DWI

We observed an increase in the first order energy of the lesion tissue from DWI-b0 to b600 with a texture evolution metric that was significantly higher (p < 0.001) for malignant (3.09 ± 0.23) than for benign patients (1.84 ± 0.25). Similarly, the contrast in the lesion tissue also increased significantly (p = 0.001) for malignant (1.73 ± 0.14) than benign lesions (1.07 ± 0.13). The texture evolution metrics for five different radiomic feature maps portraying different textural characteristics have been summarized in Table 2.

Table 2 Summary of the texture evolution metric extracted from different RFMs DWI

Full size table

Multi-view feature embedding and classification

The multi-view feature embedding and classification framework was set up as illustrated in Fig. 5. The optimal set of hyperparameters for the multi-view classification framework, obtained using leave one out cross validation based grid search, were f1 = 18, f2 = 0 f3 = 6, f4 = 8, f5 = 0, f6 = 0, neighborhood parameter k = 45, dimensionality d = 10 and misclassification penalty ratio = 2.5:1. The parameter space for each of the input parameters were set as follows:

a.
The subset of features, f_i selected from each MRI dataset were iteratively selected based on the area under the ROC curve computed using univariate logistic regression.
b.
The neighborhood parameter was varied from 5 through 120 in steps of 5.
c.
The dimensionality of the transformed feature space was varied between one and ten.
d.
The misclassification penalty ratio between benign and malignant classes was selected from the set {2:1, 2.5:1, 3:1, 3.5:1, 4:1}.

The multi-view feature embedding and classification model trained using leave-one-out cross validation resulted in sensitivity and specificity of 93 and 85%, respectively, with an AUC of 0.91 in classifying benign from malignant lesions. The ROC curves for the IsoSVM classification model and other kernels are shown in Fig. 6. The search space for the misclassification penalty parameter for the SVM kernels was increased to all the ratios in the set {2:1, 2.5:1, 3:1,3.51, 4:1, 4.5:1,5:1,5.5:1,6:1} The resultant sensitivity, specificity and AUC from all the SVM classifiers are shown in Table 3. The multi-view feature transformation and classification framework was further tested using ten-fold cross validation performed across 100 trials. The optimal set of hyperparameters obtained with ten-fold cross validation concurred with the previously obtained optimal set of hyperparameters using leave one out cross validation. The average sensitivity and specificity achieved from ten-fold cross validation experiment were 91 and 82%, respectively, with an AUC of 0.87. The result from ten-fold cross validation ascertains the stability of the unified RFM signature, as well as the IsoSVM classifier. For comparison, the classification of benign from malignant using tumor size alone produced an AUC of 0.77 which was significantly lower than the AUC of 0.91 obtained from the unified RFM signature using the IsoSVM classifier.

Table 3 Summary of sensitivity, specificity and AUC for the IsoSVM classifier and various SVM kernels

Full size table

Discussion

We have demonstrated that our radiomic feature maps for visualization and evaluation of radiological texture in radiological images produced excellent features that were correlated to breast tissue biology and compared with quantitative metrics of radiological parameters. Malignant lesions demonstrated increased entropy compared to benign lesions for both ADC maps and DCE MRI. In contrast, glandular tissue entropy was similar across all subjects. Furthermore, the radiomic feature maps (RFMs) demonstrated excellent sensitivity and specificity in classifying benign from malignant lesions. Moreover, this study relates the quantitative metrics of ADC maps and PK-DCE to radiomics values for characterization of breast lesions and normal tissue.

Radiomic features, such as, entropy have been shown to classify between benign and malignant tumors in addition to predicting patient survival and treatment response in previous studies as reviewed in ref. 4. However, our work explored the whole image visualization and interpretation of these quantitative radiomic values employing RFMs. Indeed, the entropy feature maps exhibit higher entropy and intra-tumor heterogeneity for malignant tumors compared to benign tumors. The RFMs would provide the radiologists with a tool for visual interpretation of the radiomic feature values. Furthermore, radiomic feature maps provide a visualization of intra-tumor heterogeneity as opposed to a single quantitative value provided by quantitative radiomic analysis. In addition, radiomic feature maps produce voxel-wise radiomic values improving the quantitative measure. In contrast, single quantitative value corresponding to the whole tumor region may not define the entire tumor. This study investigated the relationship between RFMs and underlying tissue biology derived from quantitative radiological images. Preliminary analysis of RFMs corresponding to DCE-MRI suggest that time evolution of RFMs is indicative of heterogeneity in the vasculature of the tissue microenvironment. We observed that the textural evolution curves obtained from mean value of the radiomic feature maps had significantly different curve characteristics for benign and malignant tumors. Furthermore, the glandular tissue corresponding to benign and malignant patients demonstrated no shape difference, indicating there is no textural evolution difference with contrast uptake within glandular tissue. The radiomic features provided new metrics for comparison of the different tissue types. Moreover, the vascular parameters of K ^trans and EVF have been shown to be different between benign and malignant tumors. In concordance, the radiomic values also demonstrated significant differences between tissue types. In the previous studies, the ADC value and PK-DCE for a given region of interest has been established as an excellent biomarker in classification between benign and malignant breast tumors.^{24,25,26,27,28,29,30,31} Here, we establish radiomic entropy (and others) of the ADC map and DCE-MRI within the tumor ROI as a biomarker for correlation with cellular and vascular heterogeneity. The ADC and DCE entropy was significantly different between benign and malignant tumors. Furthermore, the entropy ADC feature map provides more insight into the cellular distribution within the tumor, whereas, the DCE radiomic metric provides information about the vascularity texture of the tissue. Additionally, a metric for quantification of tissue heterogeneity evolution with increasing b value was developed and analyzed. A subset of the texture evolution metrics for DWI were significantly different between benign and malignant lesions indicating a potential biomarker in the texture evolution metric. In addition, the glandular ADC and radiomics values were similar across all subjects. These findings lay the groundwork in radiomic metrics to describe normal vs abnormal tissue, which is needed for increased use of radiomics in clinical applications.

The training efficacy of most machine learning algorithms depend on the balance between the number of instances corresponding to each class. Typically, benign breast tumors are more frequently observed in clinical setting as compared to malignant tumors. However, in research setting, MRI for malignant breast tumors are more frequently obtained than for benign breast tumors producing a class imbalance that may result in performance bias of the trained classifier towards one class. Class imbalance is a frequent occurrence in health care machine learning applications. Our work approached the problem of class imbalance by assigning different misclassification penalty to each class type. Our results indicate that setting an appropriate misclassification penalty significantly improves the classification accuracy.

Our work has certain limitations. First, the radiomic feature map creation and classification were performed on retrospective data and no separate validation data was used. Second, this study evaluated radiomic feature maps corresponding to only first and second order statistical features. Other statistical radiomic methods such gray level run length matrix features,⁸ Neighborhood gray tone difference matrix feature¹⁰ have not been evaluated in this study. Third, RFMs provide voxel wise heterogeneity information of the whole tissue of interest. However, the feature used in the texture evolution curves and classification model was the mean derived from the RFMs. In the future studies, the goal would be to evaluate each voxel of interest and produce voxel-wise classification.

In summary, RFMs present a new powerful tool for analysis of textural information present within anatomical and quantitative radiological images and may provide a new perspective into the biological information that radiomics is capable of providing and the potential it holds in future diagnostic applications.

Conclusion

This work presents radiomic feature maps (RFMs) for visualization of textural information present in anatomical and quantitative radiological images. The correlation between the quantitative radiological biomarkers (ADC and PK-DC) with radiomic values provided by RFMs was established in this paper. Our results suggest that the textural evolution curves obtained from DCE-MRI RFMs were significantly different for benign and malignant tumors establishing a correlation with vascular heterogeneity. Similarly, the cellular heterogeneity, evaluated using entropy of the ADC map along with textural evolution metric on DWI was significantly different between malignant and benign tumors. Finally, the IsoSVM feature selection and classification model achieved excellent sensitivity and specificity in comparison to state of the art classification methods despite highly imbalanced data.

Materials and methods

Clinical data

The radiomic feature mapping framework was tested on a multiparametric MRI dataset obtained from a retrospective cohort of 124 patients (mean age = 52, range = 24–80) to classify between malignant and benign lesions. Out of the 124 patients, 98 had malignant lesions while 26 patients had benign lesions. Patients were selected for the study based on the potential for malignant breast lesions with a BIRADS score of 3 or greater from 2008–2010. This retrospective study was approved by the IRB at our facility and conforms to HIPAA requirements.

Multiparametric MRI imaging protocol

All patients were scanned on a 3-T clinical MRI system using a bilateral dedicated 4 channel phase-array breast coil in the prone position. MRI sequences acquired included ultrafast spoiled gradient echo (T1-TFE) T1-weighted images (TR/TE: 5.37/2.3 ms; Slice thickness (ST) = 3 mm; Field of view (FOV): 35 × 35 cm; Flip angle (FA) = 12⁰) and fat suppressed (FS) T2-weighted spin echo images (TR/TE: 6122/70 ms; ST = 4 mm; FOV:35 × 35 cm; FA = 90⁰). The DCE-MRI was obtained using FS and non-FS, three-dimensional, FSPGR T₁-weighted (TR/TE = 4.2/2.1 ms; FOV:35 × 35 cm; ST = 5 mm) sequences. One non-FS pre-contrast and fourteen post-contrast images (15 secs per acquisition) for PK analysis were obtained after intravenous administration via a power injector of a GdDTPA contrast agent (0.2 mL/kg(0.1 mmol/kg)).^32,33 Two minutes of T1 fat-suppressed high temporal resolution (15 sec per acquisition) imaging was obtained to capture the wash-in phase of contrast enhancement, followed by a high spatial resolution scan for 2 min. Diffusion weighted imaging was obtained using an FS spin echo Echo Planar Imaging sequence (TR/TE = 5000/90 ms, SENSE = 2, ST = 3–4 mm, b = 0–600 s/mm²) on three planes.

Quantitative MR image analysis

ADC maps were constructed from DWI using a monoexponential equation. DCE Subtraction images were constructed by subtracting pre-contrast from the post-contrast high spatial resolution DCE images.

Dynamic MRI with Pharmacokinetic (PK) Contrast Enhancement

The vascularity of breast tissue was obtained using different semi-quantitative and quantitative metrics.^34,35 The semi-quantitative metrics use the temporal evolution of the time series curves from the DCE MR images and are scaled into three categories relating to the potential characterization of the tissue and other known metrics.^32,36,37

The PK-DCE quantitative metrics derived were volume transfer constant (K ^trans (min⁻¹)), the fractional volume of the extracellular extravascular space (EVF (V _e)), and the transfer rate constant (k _ep (min⁻¹)) using commercial software DynaCad (InVivo, Gainesville, Florida).^32,33 For both benign and malignant patients, glandular and lesion tissue, the mean values and standard deviations of the transfer constant (K ^trans) extra-vascular volume fraction-EVF (V _e) were recorded.

Registration

The mpMRI images were co-registered using the algorithm developed in ref. 38. The registration algorithm reduces information loss during rescaling and reslicing of the MRI volumes using a 3D wavelet transformation. The pre-contrast DCE image was used as the reference registration image for all other images.

Image segmentation

Normal glandular tissue, benign and malignant tumors were segmented from the mpMRI dataset using the Eigenfilter algorithm.³⁹ Eigenfilter is a well-established semi-automatic segmentation algorithm based the Gram-Schmidt orthonormalization algorithm using tissue signatures to define different tissue types.^40,41,42

Radiomic feature mapping

The radiomic feature mapping framework transforms each MRI image into a multidimensional radiomic feature map space (RFMS), defined as RFMS={RFM ₁, RFM ₂,…,RFM _N } ∈ R ^D,where RFM_i represents the ith radiomic feature map (RFM), D represents the number of voxels in the MRI image and N represents the number of RFMs generated. The RFM framework algorithm that transforms radiological data into the RFMS is defined below(Fig. 3). First, a set S of N radiomic filters from different radiomic features is generated. The size of the radiomic filters or the neighborhood scaling parameter, W is determined by the user depending on the spatial resolution of the input MRI image. Second, the quantization of the MRI image intensities to G levels for radiomic first and second order statistics require the MRI image intensities to be quantized. The value of G is determined by the user based on the range of intensities, as well as the number of bits required to represent voxel intensity in the input image. In the final step, the MRI image is convolved with each of the N radiomic filters in the set, S to produce N radiomic feature maps. As a result, every voxel in the original MRI image has a corresponding radiomic feature value in each RFM. The mean of the radiomic values were calculated from different regions of interest (ROI) in each RFM as features for classification and further analysis. Consequently, every RFM feature from every patient corresponds to the average value taken from sliding same sized image window (W x W) across the whole ROI ensuring there is no mathematical dependence between the computed RFM features and size of the ROI.

The input parameters for this study were G = 256, W = 9, and N = 30. Out of 30 RFMs, 7 were generated using first order statistics, 22 were generated using second order statistics and one was generated using Laplacian of Gaussian filter.

Comparison between radiomic feature maps, quantitative radiomic metrics, and MRI metrics

Single quantitative radiomic values corresponding to benign lesion, malignant lesion, and glandular tissue were computed. The quantitative radiomic values corresponding to each ROI were compared with the functional metrics from the DWI and PK-DCE MRI images for their diagnostic ability to classify between benign and malignant lesions using a paired t-test and univariate logistic regression.

Textural evolution curves

The PK-DCE MRI RFMs provided information on the vascular heterogeneity within lesion tissue. The RFM textural evolution curves capture the time evolution of tissue heterogeneity as a function of contrast uptake using the time series derived from DCE images. The mean radiomic feature value across all the voxels within a region of interest in the RFMs was used to construct textural evolution curves for each tissue type. In order to compare the textural evolution curves across different ROIs, normalization of the radiomic feature values were applied.

Analysis of textural evolution curves from entropy feature maps were done using the time to peak and the textural wash out slope from the DCE RFMs. The time to peak is defined as the time it took for the textural evolution curves to reach its maximum value. The textural wash out slope is as the textural wash out of the textural evolution curves within the lesion tissue. This was computed as the slope of the line connecting the peak texture enhancement in the first 2 min to the last time point including all the intermediate time points.

Textural evolution metric for DWI

The textural evolution metric (TEM) was developed to investigate the change in tissue heterogeneity of different tissue types. We defined a textural evolution metric (TEM) for DWI as follows:

$$TE{M_{DWI}} = \frac{{RF{M_{b >0}}}}{{RF{M_{b0}}}}$$

Similarly, a textural evolution metric (TEM) was defined for the high spatial resolution DCE-MRI dataset (HR-DCE-MRI) as shown here:

$$TE{M_{HR - DCE - MRI}} = \frac{{Pos{t_{HR - DCE - MRI}}}}{{Pr{e_{HR - DCE - MRI}}}}$$

Statistical analysis

We computed summary statistics (mean and standard error of the mean) for the radiomic metrics and functional metrics from mpMRI. An unpaired t-test was performed to compare the RFMs for the benign and malignant patient datasets. Statistical significance was set at p ≤ 0.05. Univariate logistic regression analysis was used to find associations between the RFMs and the final diagnosis. Receiver operating characteristic (ROC) curve analysis was performed to assess the diagnostic performance of each RFM in characterizing benign vs. malignant lesions.

Multi-view IsoSVM framework for feature embedding and classification

The radiomic feature maps were computed from a mpMRI dataset resulting in a high dimensional feature space. Furthermore, the radiomic feature maps corresponding to different imaging sequences highlight different functional textural properties of the tissue of interest. Consequently, we developed a multiview feature embedding and classification framework termed IsoSVM by modifying and combining the Isomap⁴³ and support vector machine (SVM) algorithms.⁴⁴ The overview of the IsoSVM framework is shown in Fig. 5.

Computation of radiomic signatures

The high dimensional mpMRI radiomic feature space was first analyzed to compute six different radiomics signatures as follows:

a.
PK-DCE MRI radiomic signature: The textural evolution curves corresponding to the radiomic feature maps were transformed into a radiomic signature using the Isomap algorithm.⁴³ For the PK-DCE RFM dataset, the 15 dimensional textural curves were transformed into a single dimensional representation of the textural evolution curve characteristic. The correlation coefficient between the textural evolution curves of different patients was used as the distance metric to compute the geodesic distances for the low dimensional embedding.
b.
DWI and DCE-MRI High spatial resolution radiomic signatures: The vector of the textural evolution metrics for the radiomic feature maps was used as the radiomic signature for both the datasets.
c.
ADC map, T1WI, and T2WI radiomic signatures: Each one was a single image, making the vector of the mean of the radiomic feature maps, their radiomic signature.

Feature selection

The set of radiomic signatures from the MRI datasets resulted in a 180-dimensional radiomic feature space. The 180-dimensional radiomic feature space was then transformed and modeled into an IsoSVM classification model as detailed below.

a.
The feature set from each of the radiomic signatures was sorted from largest to smallest based on the area under the receiver operating characteristic curve obtained using univariate logistic regression.
b.
A subset of top features (f1, f2, …, f6) from each of the radiomic signatures were selected to create a unified radiomic signature, g = U_fi.
c.
The unified radiomic signature, g was then transformed into a linearly separable, low dimensional feature space, h using the Isomap algorithm. The feature transformation was executed using Isomap because the Isomap algorithm is not prone to overfitting because of its unsupervised nature and at the same time, accounts for the dependencies between different RFMs.

Classification

a.
The support vector machine algorithm trains a classification model to classify between benign and malignant patients on the transformed feature space, h. Because SVM is a linear binary classification algorithm that attempts to create a hyperplane that best separates the different groups, the application of Isomap algorithm prior to SVM reduces the non-linearity in the data by transforming the feature space, g to h. The steps c and d combine to form the hybrid IsoSVM classification model. Mathematically, the hybrid IsoSVM classifier is represented using the following equation:
$$f\left( x \right) = \mathop {\sum }\limits_{i = 1}^N {\alpha _i}{y_i} < \phi \left( {{x_i}} \right),\phi \left( x \right) > + b$$
where φ() is the Isomap transformation function that maps the unified radiomic signature, g into a linearly separable space, h, N is the number of patients in the training set, α _i are the Lagrange multipliers, x _i represents the radiomic signatures of training set patients and x represents the radiomic signature of the test patient. y _i represents the classes where x _i resides.
b.
For comparison, we tested five different SVM kernels on the unified radiomic signature, g including the hybrid IsoSVM kernel to classify the benign and malignant patients to determine the optimal kernel.
c.
Finally, due to class imbalance between the number of benign and malignant patients, the ratio between penalty for misclassification of different patient data sets was varied to identify the optimal penalty ratio and shown in the supplementary data. ⁴⁵

The complete set of input parameters (input feature space: f1, f2, …,f6; Isomap neighborhood parameter, k; Isomap dimensionality, d; and misclassification penalty ratio) were estimated using leave-one-out and k fold cross validation (k = 10).

K-fold cross validation

The set of benign and malignant patients were first separately divided into ten randomly sampled subsets due to imbalance in the number of patients in each patient group. Next, the ten subsets from both categories were combined to form ten patient subsets. As a result, the ratio between the number of benign and malignant patients was maintained similar to the original patient cohort in the patient subsets. The ten-fold cross validation procedure was performed on these ten subsets. The complete procedure of generating ten subsets and performing ten-fold cross validation was repeated 100 times to avoid any bias that may occur due to specific partitioning of the data.

Code availability

Our software will be freely available to academic users after issue of pending patents and a materials research agreement is obtained from the university. Due to University regulations, any patent pending software is not available until a patent is issued.

Data availability

All relevant clinical data are available upon request with adherence to HIPPA laws and the institutions IRB policies.

References

Kumar, V. et al. Radiomics: the process and the challenges. Magn. Reson. Imaging 30, 1234–1248 (2012).
Article PubMed PubMed Central Google Scholar
Lambin, P. et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur. J. Cancer 48, 441–446 (2012).
Article PubMed PubMed Central Google Scholar
Gillies, R. J., Kinahan, P. E. & Hricak, H. Radiomics: images are more than pictures, they are data. Radiology 278, 563–577 (2016).
Article PubMed Google Scholar
Parekh, V. & Jacobs, M. A. Radiomics: a new application from established techniques. Exp. Rev. Precision Med. Drug Dev. 1, 207–226 (2016).
Article Google Scholar
Shannon, C. E. A mathematical theory of communication. Bell Syst. Tech. J. 27, 379–423 (1948).
Article Google Scholar
Mandelbrot, B. B. The fractal geometry of nature. Vol. 173 (Macmillan, 1983).
Haralick, R. M., Shanmugam, K. & Dinstein, I. H. Textural features for image classification. IEEE Trans. Syst. Man Cybernetics 3, 610–621 (1973).
Galloway, M. M. Texture analysis using gray level run lengths. Comput. Graphics Image Process. 4, 172–179 (1975).
Article Google Scholar
Laws, K. I. in 24th annual technical symposium. (International Society for Optics and Photonics) pp 376–381.
Amadasun, M. & King, R. Textural features corresponding to textural properties. IEEE Trans. Syst. Man Cybernetics 19, 1264–1274 (1989).
Article Google Scholar
Aerts, H. J. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 5, 1–9 (2014).
Sinha, S. et al. Multifeature analysis of Gd‐enhanced MR images of breast lesions. J. Magn. Reson. Imaging 7, 1016–1026 (1997).
Article CAS PubMed Google Scholar
Gibbs, P. & Turnbull, L. W. Textural analysis of contrast‐enhanced MR images of the breast. Magn. Reson. Med. 50, 92–98 (2003).
Article PubMed Google Scholar
Ertas, G., Gulcur, H. O. & Tunaci, M. Improved lesion detection in MR mammography: three-dimensional segmentation, moving voxel sampling, and normalized maximum intensity-time ratio entropy. Acad. Radiol. 14, 151–161 (2007).
Article PubMed Google Scholar
Nie, K. et al. Quantitative analysis of lesion morphology and texture features for diagnostic prediction in breast MRI. Acad. Radiol. 15, 1513–1525 (2008).
Article PubMed PubMed Central Google Scholar
McLaren, C. E., Chen, W. P., Nie, K. & Su, M. Y. Prediction of malignant breast lesions from MRI features: a comparison of artificial neural network and logistic regression techniques. Acad. Radiol. 16, 842–851 (2009).
Article PubMed PubMed Central Google Scholar
Agner, S. C. et al. Textural kinetics: a novel dynamic contrast-enhanced (DCE)-MRI feature for breast lesion classification. J. Digital Imaging 24, 446–463 (2011).
Article Google Scholar
Cai, H., Liu, L., Peng, Y., Wu, Y. & Li, L. Diagnostic assessment by dynamic contrast-enhanced and diffusion-weighted magnetic resonance in differentiation of breast lesions under different imaging protocols. BMC Cancer 14, 366 (2014).
Article PubMed PubMed Central Google Scholar
Cai, H., Peng, Y., Ou, C., Chen, M. & Li, L. Diagnosis of breast masses from dynamic contrast-enhanced and diffusion-weighted MR: a machine learning approach. PloS ONE 9, e87387 (2014).
Article PubMed PubMed Central Google Scholar
Wang, T. C. et al. Computer-aided diagnosis of breast DCE-MRI using pharmacokinetic model and 3-D morphology analysis. Magn. Reson. Imaging 32, 197–205 (2014).
Article CAS PubMed Google Scholar
Holli, K. et al. Characterization of breast cancer types by texture analysis of magnetic resonance images. Acad. Radiol. 17, 135–141 (2010).
Article PubMed Google Scholar
Ahmed, A., Gibbs, P., Pickles, M. & Turnbull, L. Texture analysis in assessment and prediction of chemotherapy response in breast cancer. J. Magn. Reson. Imaging 38, 89–101 (2013).
Article PubMed Google Scholar
Parikh, J. et al. Changes in primary breast cancer heterogeneity may augment midtreatment MR imaging assessment of response to neoadjuvant chemotherapy. Radiology 272, 100–112 (2014).
Article PubMed Google Scholar
Degani, H., Gusis, V., Weinstein, D., Fields, S. & Strano, S. Mapping pathophysiological features of breast tumors by MRI at high spatial resolution. Nat. Med. 3, 780–782 (1997).
Article CAS PubMed Google Scholar
Weinstein, D. et al. Breast fibroadenoma: mapping of pathophysiologic features with three-time-point, contrast-enhanced MR imaging--pilot study. Radiology 210, 233–240 (1999).
Article CAS PubMed Google Scholar
Guo, Y. et al. Differentiation of clinically benign and malignant breast lesions using diffusion-weighted imaging. J. Magn. Reson. Imaging 16, 172–178 (2002).
Article PubMed Google Scholar
Woodhams, R. et al. Diffusion-weighted imaging of malignant breast tumors: the usefulness of apparent diffusion coefficient (ADC) value and ADC map for the detection of malignant breast tumors and evaluation of cancer extension. J. Comput. Assist. Tomogr. 29, 644–649 (2005).
Article PubMed Google Scholar
Park, M. J., Cha, E. S., Kang, B. J., Ihn, Y. K. & Baik, J. H. The role of diffusion-weighted imaging and the apparent diffusion coefficient (ADC) values for breast tumors. Korean J. Radiol. 8, 390–396 (2007).
Article PubMed PubMed Central Google Scholar
deSouza, N. M. et al. Diffusion-weighted magnetic resonance imaging: a potential non-invasive marker of tumour aggressiveness in localized prostate cancer. Clin. Radiol. 63, 774–782 (2008).
Article CAS PubMed Google Scholar
Partridge, S. C. et al. Quantitative diffusion-weighted imaging as an adjunct to conventional breast MRI for improved positive predictive value. Am. J. Roentgenol. 193, 1716–1722 (2009).
Article Google Scholar
Ei Khouli, R. H. et al. Diffusion-weighted imaging improves the diagnostic accuracy of conventional 3.0-T breast MR imaging. Radiology 256, 64–73 (2010).
Article PubMed PubMed Central Google Scholar
El Khouli, R. H., Macura, K. J., Barker, P. B., Jacobs, M. A. & Bluemke, D. A. Dynamic Contrast Enhanced Magnetic Resonance Imaging of the Breast: Effect of Temporal Resolution. J. Magn. Reson. Imaging 30, 999–1004 (2009).
Article PubMed PubMed Central Google Scholar
El Khouli, R. H. et al. Dynamic contrast-enhanced MRI of the breast: quantitative method for kinetic curve type assessment. Am. J. Roentgenol. 193, W295–W300 (2009).
Article Google Scholar
Tofts, P. S. Modeling tracer kinetics in dynamic Gd-DTPA MR imaging. J. Magn. Reson. Imaging 7, 91–101 (1997).
Article CAS PubMed Google Scholar
Tofts, P. S. et al. Estimating kinetic parameters from dynamic contrast-enhanced T(1)-weighted MRI of a diffusable tracer: standardized quantities and symbols. J. Magn. Reson. Imaging 10, 223–232 (1999).
Article CAS PubMed Google Scholar
Kuhl, C. K. et al. Dynamic breast MR imaging: Are signal intensity time course data useful for differential diagnosis of enhancing lesions? Radiology 211, 101–110 (1999).
Article CAS PubMed Google Scholar
Schnall, M. D. et al. Diagnostic architectural and dynamic features at breast MR imaging: multicenter study. Radiology 238, 42–53 (2006).
Article PubMed Google Scholar
Akhbardeh, A. & Jacobs, M. A. Methods and systems for registration of radiological images. US patent US 9,008,462 (2015).
Windham, J. P., Abd-Allah, M. A., Reimann, D. A., Froelich, J. W. & Haggar, A. M. Eigenimage filtering in MR imaging. J. Comput. Assist. Tomogr. 12, 1–9 (1988).
Article CAS PubMed Google Scholar
Soltanian-Zadeh, H., Windham, J. & Jenkins, J. Error propagation in eigenimage filtering. IEEE Trans. Med. Imaging 9, 405–420 (1990).
Article CAS PubMed Google Scholar
Peck, D. et al. Cerebral tumor volume calculations using planimetric and eigenimage analysis. Med. Phys. 23, 2035–2042 (1996).
Article CAS PubMed Google Scholar
Jacobs, M. A. et al. Identification of cerebral ischemic lesions in rat using eigenimage filtered magnetic resonance imaging. Brain Res. 837, 83–94 (1999).
Article CAS PubMed Google Scholar
Tenenbaum, J. B., De Silva, V. & Langford, J. C. A global geometric framework for nonlinear dimensionality reduction. Science 290, 2319–2323 (2000).
Article CAS PubMed Google Scholar
Cortes, C. & Vapnik, V. Support-vector networks. Machine Learn. 20, 273–297 (1995).
Google Scholar
Elkan, C. The foundations of cost-sensitive learning. Int. Joint Conference Artif. Intell. 17, 973–978 (2001).
Google Scholar

Download references

Acknowledgements

Funding provided by the National Institutes of Health grant numbers: 5P30CA006973 (IRAT), U01CA140204, 1R01CA190299, Siemens Medical Grant:JHU-2012-MR-86-01-36819 and the equipment Tesla K40 used for this research was donated by the NVIDIA Corporation.

Author information

Authors and Affiliations

The Russell H. Morgan Department of Radiology and Radiological Science, Division of Cancer Imaging, The Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA
Vishwa S. Parekh & Michael A. Jacobs
Department of Computer Science, The Johns Hopkins University, Baltimore, MD, 21208, USA
Vishwa S. Parekh
Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA
Michael A. Jacobs

Authors

Vishwa S. Parekh
View author publications
You can also search for this author in PubMed Google Scholar
Michael A. Jacobs
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.A.J. and V.S.P. developed the concept, performed the testing, algorithm development, statistical methods, and manuscript writing.

Corresponding author

Correspondence to Michael A. Jacobs.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Materials

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Parekh, V.S., Jacobs, M.A. Integrated radiomic framework for breast cancer and tumor biology using advanced machine learning and multiparametric MRI. npj Breast Cancer 3, 43 (2017). https://doi.org/10.1038/s41523-017-0045-3

Download citation

Received: 08 March 2017
Revised: 04 October 2017
Accepted: 06 October 2017
Published: 14 November 2017
DOI: https://doi.org/10.1038/s41523-017-0045-3

This article is cited by

Interpretable Radiomic Signature for Breast Microcalcification Detection and Classification
- Francesco Prinzi
- Alessia Orlando
- Salvatore Vitabile
Journal of Imaging Informatics in Medicine (2024)
Attention-based deep learning for breast lesions classification on contrast enhanced spectral mammography: a multicentre study
- Ning Mao
- Haicheng Zhang
- Heng Ma
British Journal of Cancer (2023)
Fully automatic classification of breast lesions on multi-parameter MRI using a radiomics model with minimal number of stable, interpretable features
- Jing Zhang
- Chenao Zhan
- Guang Yang
La radiologia medica (2023)
Glioblastomas with and without peritumoral fluid-attenuated inversion recovery (FLAIR) hyperintensity present morphological and microstructural differences on conventional MR images
- Qiuyue Han
- Yiping Lu
- Bo Yin
European Radiology (2023)
Radiomics and artificial intelligence in breast imaging: a survey
- Tianyu Zhang
- Tao Tan
- Ritse M. Mann
Artificial Intelligence Review (2023)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Experimental summary

Analysis of textural evolution curves

Analysis of textural evolution metric on DWI

Multi-view feature embedding and classification

Discussion

Conclusion

Materials and methods

Clinical data

Multiparametric MRI imaging protocol

Quantitative MR image analysis

Dynamic MRI with Pharmacokinetic (PK) Contrast Enhancement

Registration

Image segmentation

Radiomic feature mapping

Comparison between radiomic feature maps, quantitative radiomic metrics, and MRI metrics

Textural evolution curves

Textural evolution metric for DWI

Statistical analysis

Multi-view IsoSVM framework for feature embedding and classification

Computation of radiomic signatures

Feature selection

Classification

K-fold cross validation

Code availability

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links